Introduction to Designing Data Lakes on AWS

4.71

Course overview

Provider: Coursera
Course type: Free online course
Level: Intermediate
Deadline: Flexible
Duration: 14 hours
Certificate: Paid Certificate Available
Course author: Rafael Lopes

    • Where to start with a Data Lake?
    • How to build a secure and scalable Data Lake?
    • What are the common components of a Data Lake?
    • Why do you need a Data Lake, and what is its value?

Description

In this class, Introduction to Designing Data Lakes on AWS, we will help you understand how to create and operate a data lake in a secure and scalable way, without previous knowledge of data science! Starting with why you may want a data lake, we will look at the Data Lake value proposition, characteristics, and components.

Designing a data lake is challenging because of the scale and growth of data. Developers need to understand best practices to avoid common mistakes that could be hard to rectify. In this course we will cover the foundations of what a Data Lake is and how to ingest and organize data into the Data Lake, and we will dive into the data processing that can be done to optimize performance and costs when consuming the data at scale. This course is for professionals (Architects, System Administrators, and DevOps) who need to design and build an architecture for secure and scalable Data Lake components. Students will learn about the use cases for a Data Lake and contrast them with a traditional infrastructure of servers and storage.

Similar courses

Foundations: Data, Data, Everywhere
  • Flexible deadline
  • 20 hours
  • Certificate
Ask Questions to Make Data-Driven Decisions
  • Flexible deadline
  • 18 hours
  • Certificate
Introduction to Statistics
  • Flexible deadline
  • 15 hours
  • Certificate
Introduction to Designing Data Lakes on AWS
  • English language
  • Recommended provider
  • Certificate available