Data Processing in Shell

Updated on

Course overview

Provider
Datacamp
Course type
Free trial availiable
Deadline
Flexible
Duration
4 hours
Certificate
Available on completion
Course author
Susan Sun

Description

Learn powerful command-line skills to download, process, and transform data, including machine learning pipeline.
We live in a busy world with tight deadlines. As a result, we fall back on what is familiar and easy, favoring GUI interfaces like Anaconda and RStudio. However, taking the time to learn data analysis on the command line is a great long-term investment because it makes us stronger and more productive data people. In this course, we will take a practical approach to learn simple, powerful, and data-specific command-line skills. Using publicly available Spotify datasets, we will learn how to download, process, clean, and transform data, all via the command line. We will also learn advanced techniques such as command-line based SQL database operations. Finally, we will combine the powers of command line and Python to build a data pipeline for automating a predictive model.

Similar courses

Foundations: Data, Data, Everywhere
  • Flexible deadline
  • 20 hours
  • Certificate
Ask Questions to Make Data-Driven Decisions
  • Flexible deadline
  • 18 hours
  • Certificate
Introduction to Statistics
  • Flexible deadline
  • 15 hours
  • Certificate
  • English language

  • Recommended provider

  • Certificate available