In File input mode, Amazon SageMaker downloads the training data from Amazon S3 to the storage volume that is attached to the training instance and mounts the directory to the Docker volume for the training container.
Environment provides access to aspects of the environment relevant to training jobs, including hyperparameters, system characteristics, filesystem locations, environment variables and configuration settings.
All models trained in SageMaker automatically emit key metrics that can be collected and viewed in SageMaker Studio.
Q: How does Managed Spot Training work? Q: What is Amazon SageMaker Serverless Inference? All these choices affect how long a hyperparameter tuning job can last.
We recommend periodic checkpoints as a general best practice for long running training jobs.
TensorFlow estimator handles locating the script mode container, uploading your script to a S3 location and creating a SageMaker training job.