Build, Train, and Deploy ML Pipelines using BERT

4.58

Course overview

Provider
Coursera
Course type
Free online course
Level
Advanced
Deadline
Flexible
Duration
14 hours
Certificate
Paid Certificate Available
Course author
Antje Barth
  • Store and manage machine learning features using a feature store

  • Debug, profile, tune and evaluate models while tracking data lineage and model artifacts

Description

In the second course of the Practical Data Science Specialization, you will learn to automate a natural language processing task by building an end-to-end machine learning pipeline using Hugging Face’s highly optimized implementation of the state-of-the-art BERT algorithm with Amazon SageMaker Pipelines. Your pipeline will first transform the dataset into BERT-readable features and store the features in the Amazon SageMaker Feature Store. It will then fine-tune a text classification model on the dataset using a Hugging Face pre-trained model, which has learned to understand human language from millions of Wikipedia documents. Finally, your pipeline will evaluate the model’s accuracy and deploy the model only if the accuracy exceeds a given threshold.

Practical data science is geared towards handling massive datasets that do not fit on your local hardware and could originate from multiple sources. One of the biggest benefits of developing and running data science projects in the cloud is the agility and elasticity that the cloud offers to scale up and out at minimal cost. The Practical Data Science Specialization helps you develop the practical skills to effectively deploy your data science projects and overcome challenges at each step of the ML workflow using Amazon SageMaker.

This Specialization is designed for data-focused developers, scientists, and analysts who are familiar with the Python and SQL programming languages and want to learn how to build, train, and deploy scalable, end-to-end ML pipelines, both automated and human-in-the-loop, in the AWS cloud.

Similar courses

Machine Learning
  • Flexible deadline
  • 61 hours
  • Certificate
Neural Networks and Deep Learning
  • Flexible deadline
  • 27 hours
  • Certificate
Introduction to Machine Learning in Production
  • Flexible deadline
  • 10 hours
  • Certificate
Build, Train, and Deploy ML Pipelines using BERT
  • English language

  • Recommended provider

  • Certificate available