Amazon SageMaker 1O1

Piyush Jalan
FAUN — Developer Community 🐾
4 min readNov 15, 2019

--

Amazon SageMaker is a fully managed machine learning service. Amazon SageMaker provides every developer and data scientist with the ability to build, train & deploy machine learning models quickly. It provides an integrated Jupyter authoring notebook instance for easy access to data sources for exploration and analysis, so users don’t have to manage servers. It provides common machine learning algorithms that are optimized to run efficiently against extremely large data in a distributed environment.

Image retrieved from: https://www.aws.training

Amazon SageMaker includes three modules: Build, Train & Deploy. The Build module provides a hosted environment to work with user’s data, experiment with algorithms & visualize their output. The Train module allows for one-click model training and tuning at high-scale and low cost. The Deploy module provides a managed environment for users to easily host and test models for inference securely and with low latency.

Build:

Build highly accurate training data sets

Managed Notebooks for Authoring Models

Built-in, High Performance Algorithms

Broad Framework Support

Reinforcement Learning Support with Amazon SageMaker RL

Test and Prototype Locally

Train:

One-click Training

Managed Spot Training

Automatic Model Tuning

Train Once, Run Anywhere

Model tracking capability

Deploy:

One-click Deployment

Fully-managed Hosting with Auto Scaling

Batch Transform

Inference Pipelines

With native support for bring-your-own-algorithms and frameworks, Amazon SageMaker offers flexible distributed training options that adjust to users specific workflows. This is a HIPAA Eligible Service. To train a model in Amazon SageMaker, users need to create a training job. The training job includes the following information:

  • The URL of the S3 bucket where users have stored the training data.
  • The compute resources that users want SageMaker to use for model training.
  • The URL of the S3 bucket where users want to store the output of the job.
  • The Amazon Elastic Container Registry path where the training code is stored.
Image retrieved from: https://docs.aws.amazon.com/sagemaker/latest/dg/how-it-works-training.html

User can deploy model to get predictions in one of two ways:

  • To set up a persistent endpoint to get one prediction at a time, use Amazon SageMaker hosting services.
  • To get predictions for an entire dataset, use Amazon SageMaker batch transform.

Amazon SageMaker provides an Apache Spark library, in both Python and Scala, that users can use to easily train models in Amazon SageMaker using org.apache.spark.sql.DataFrame data frames in their Spark clusters. The Amazon SageMaker Python SDK TensorFlow estimators and models and the Amazon SageMaker open-source TensorFlow containers make writing a TensorFlow script and running it in Amazon SageMaker easier. Users can use SageMaker to train and deploy a model using a custom MXNet code.

Amazon SageMaker conforms to the AWS shared responsibility model, which includes regulations and guidelines for data protection. Users can use encrypted S3 buckets for model artifacts and data, as well as pass an AWS KMS key to SageMaker notebooks, training jobs, hyperparameter tuning jobs, batch transform jobs & endpoints, to encrypt the attached machine learning storage volume. Batch and training job containers and their storage are ephemeral in nature. When the job completes, the output is uploaded to S3 and the instance is torn down. The Amazon SageMaker folder in the ML Amazon EBS volume is the default storage location when the user opens a notebook instance. Amazon SageMaker saves any files within the SageMaker folder. SageMaker ensures that machine learning model artifacts and other system artifacts are encrypted in transit and at rest. Requests to the SageMaker API and console are made over a secure SSL connection. FIPS (Federal Information Processing Standard) validated endpoints are available for the SageMaker API and request a router for hosted models (runtime).

Next Steps: Read about

Amazon SageMaker RL https://docs.aws.amazon.com/sagemaker/latest/dg/reinforcement-learning.html

Amazon SageMaker Ground truth https://docs.aws.amazon.com/sagemaker/latest/dg/sms.html

Stay tuned for more info on ML and SageMaker. Reach out to me for any query at piyush.jalan93@gmail.com.

Thanks

Subscribe to FAUN topics and get your weekly curated email of the must-read tech stories, news, and tutorials 🗞️

Follow us on Twitter 🐦 and Facebook 👥 and Instagram 📷 and join our Facebook and Linkedin Groups 💬

If this post was helpful, please click the clap 👏 button below a few times to show your support for the author! ⬇

--

--

Cloud Architect | Cloud Enthusiast | Helping Customers in Adopting Cloud Technology