Introduction to Data Science in Python

4.51

Course overview

Provider: Coursera
Course type: Free online course
Level: Intermediate
Deadline: Flexible
Duration: 31 hours
Certificate: Paid certificate available
Course author: Christopher Brooks
  • Understand techniques such as lambdas and manipulating CSV files

  • Describe common Python functionality and features used for data science

  • Query DataFrame structures for cleaning and processing

  • Explain distributions, sampling, and t-tests

Description

This course introduces the learner to the basics of the Python programming environment, including fundamental Python programming techniques such as lambdas, reading and manipulating CSV files, and the NumPy library. It covers data manipulation and cleaning techniques using the popular pandas data science library and introduces the Series and DataFrame abstractions as the central data structures for data analysis, along with tutorials on how to use functions such as groupby, merge, and pivot tables effectively. By the end of the course, students will be able to take tabular data, clean it, manipulate it, and run basic inferential statistical analyses. This course should be taken before any of the other Applied Data Science with Python courses: Applied Plotting, Charting & Data Representation in Python; Applied Machine Learning in Python; Applied Text Mining in Python; and Applied Social Network Analysis in Python.
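
To give a rough sense of the kind of task the description mentions (reading a CSV file, cleaning it, using a lambda, and summarizing by group), here is a minimal sketch. It is not taken from the course materials. It uses only the Python standard library rather than pandas, so the grouping step is a hand-rolled stand-in for what a pandas groupby would do, and the data, column names, and function names are all made up for illustration.

```python
import csv
import io

# Hypothetical inline data standing in for a CSV file on disk.
# Note that the second data row has a missing score.
RAW_CSV = """name,city,score
Ana,Lisbon,82
Ben,Oslo,
Cara,Lisbon,91
Dev,Oslo,77
"""


def load_rows(text):
    """Parse CSV text into a list of dicts, one per data row."""
    return list(csv.DictReader(io.StringIO(text)))


def clean(rows):
    """Drop rows with a missing score and convert the rest to float."""
    return [
        {**row, "score": float(row["score"])}
        for row in rows
        if row["score"].strip()
    ]


def mean_score_by_city(rows):
    """Group rows by city and average the scores in each group."""
    groups = {}
    for row in rows:
        groups.setdefault(row["city"], []).append(row["score"])
    return {city: sum(scores) / len(scores) for city, scores in groups.items()}


rows = clean(load_rows(RAW_CSV))

# A lambda used as the sort key: highest score first.
ranked = sorted(rows, key=lambda row: row["score"], reverse=True)

city_means = mean_score_by_city(rows)

print([row["name"] for row in ranked])
print(city_means)
```

With pandas, the same cleaning and grouping would typically be a few method calls on a DataFrame; the standard-library version is shown only so the sketch runs without extra packages.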

Similar courses

Foundations: Data, Data, Everywhere
  • Flexible deadline
  • 20 hours
  • Certificate
Ask Questions to Make Data-Driven Decisions
  • Flexible deadline
  • 18 hours
  • Certificate
Introduction to Statistics
  • Flexible deadline
  • 15 hours
  • Certificate
Introduction to Data Science in Python
  • English language

  • Recommended provider

  • Certificate available