[SageMaker] Preparing FSx Input for SageMaker
Preparing FSx Input for SageMaker
- Download and prepare your training dataset on S3.
- Follow the steps listed here to create a FSx linked with your S3 bucket with training data - https://docs.aws.amazon.com/fsx/latest/LustreGuide/create-fs-linked-data-repo.html. Make sure to add an endpoint to your VPC allowing S3 access.
- Follow the steps listed here to configure your SageMaker training job to use FSx https://aws.amazon.com/blogs/machine-learning/speed-up-training-on-amazon-sagemaker-using-amazon-efs-or-amazon-fsx-for-lustre-file-systems/
把分布式训练的这部分内容,独立出来学习。
FSx for Lustre 与 Amazon S3 无缝集成,使您能够轻松地将长期数据集与高性能文件系统链接起来以运行计算密集型工作负载。
您可以自动将数据从 S3 复制到 FSx for Lustre,运行工作负载,然后将结果写回到 S3。有了点s3的缓存的意思~
/* implement */