Machine Learning with PySpark

Updated on

Course overview

Provider
Datacamp
Course type
Free trial availiable
Deadline
Flexible
Duration
4 hours
Certificate
Available on completion
Course author
Andrew Collier

Description

Learn how to make predictions with Apache Spark.
Spark is a powerful, general purpose tool for working with Big Data. Spark transparently handles the distribution of compute tasks across a cluster. This means that operations are fast, but it also allows you to focus on the analysis rather than worry about technical details. In this course you'll learn how to get data into Spark and then delve into the three fundamental Spark Machine Learning algorithms: Linear Regression, Logistic Regression/Classifiers, and creating pipelines. Along the way you'll analyse a large dataset of flight delays and spam text messages. With this background you'll be ready to harness the power of Spark and apply it on your own Machine Learning projects!

Similar courses

Machine Learning
  • Flexible deadline
  • 61 hours
  • Certificate
Neural Networks and Deep Learning
  • Flexible deadline
  • 27 hours
  • Certificate
Introduction to Machine Learning in Production
  • Flexible deadline
  • 10 hours
  • Certificate
  • English language

  • Recommended provider

  • Certificate available