Data Engineering using AWS Data Analytics Course
Build Data Engineering Pipelines on AWS using Data Analytics Services – Glue, EMR, Athena, Kinesis, Lambda, Redshift
Data Engineering is all about building Data Pipelines to get data from multiple sources into Data Lake or Data Warehouse and then from Data Lake or Data Warehouse to downstream systems. As part of this course, I will walk you through how to build Data Engineering Pipelines using AWS Analytics Stack. It includes services such as Glue, Elastic Map Reduce (EMR), Lambda Functions, Athena, QuickSight, and many more.
What you’ll learn
- Data Engineering leveraging AWS Analytics features
- Managing Tables using Glue Catalog
- Engineering Batch Data Pipelines using Glue Jobs
- Orchestrating Batch Data Pipelines using Glue Workflows
- Running Queries using Athena – Server less query engine service
- Using AWS Elastic Map Reduce (EMR) Clusters for building Data Pipelines
- Using AWS Elastic Map Reduce (EMR) Clusters for reports and dashboards
- Data Ingestion using Lambda Functions
- Scheduling using Events Bridge
- Engineering Streaming Pipelines using Kinesis
- Streaming Web Server logs using Kinesis Firehose
- Overview of data processing using Athena
- Running Athena queries or commands using CLI
- Running Athena queries using Python boto3
- Creating Redshift Cluster, Create tables and perform CRUD Operations
- Copy data from s3 to Redshift Tables
Recommended Data Engineering Course
Python Data Science with Pandas: Master 12 Advanced Projects
Udemy Coupons & Promo Codes - November 2023
Cyber Monday deal. Courses up to 80% off