Course overview
- Provider: Udemy
- Course type: Paid course
- Level: All Levels
- Duration: 2 hours
- Lessons: 21 lessons
- Certificate: Available on completion
- Course author: Artech Learning, LLC.
Description
Welcome to this course: Data Science - Sparklyr Basics for Beginners. Apache Spark is increasingly adopted for the development of distributed applications. In recent years, transforming the world using data has typically been achieved by disrupting and changing real processes in real industries. To operate at this level, you need to build data science solutions of substance: solutions that solve real problems. The Spark SQL APIs provide an optimized interface that helps developers build such applications quickly and easily.
In this course, you'll learn how to:
- Distinguish between working with data frames in R and in Spark
- Perform exploratory data analysis in Spark using sparklyr
- Connect to Spark locally or to a remote Spark cluster
- Build data products in R that don't rely on storing big data locally
- Interact with data in Apache Spark through sparklyr and Spark SQL
By the end of this course, you will be able to use Spark as a big data operating system, understand how to implement advanced analytics on the new APIs, and see how easy it is to use Spark in day-to-day tasks.