Google Cloud End to End Data Engineering Projects - Building Data Lakehouse from Scratch - End to End Data Engineering Project

Master End-to-End Data Engineering: Real-Time Streaming, AI Integrations, and High-Performance Systems! Dive into hands-on projects, expert-guided tutorials, and cutting-edge technologies for a standout career in data engineering.

4.0 (2)

25 learners

What you'll learn

Understand and implement real-time streaming with Google Cloud for data engineering projects
Learn how to perform real-time socket streaming using Apache Spark
Master the use of Apache Airflow alongside Spark, PySpark, Java, and Scala for data engineering
Develop skills to build and optimize high-performance, real-time analytics databases

This course includes

47.5 hours of video
Certificate of completion
Access on mobile and TV

Keywords

big data, data engineering, big data engineering projects, data lakehouse, data lake, data warehouse, data integration, data transformation, data governance, data security, apache spark, apache kafka

Full Transcript

In this video you will learn to design, implement, and maintain secure, scalable, and cost-effective lakehouse architectures using Apache Spark, Apache Kafka, Apache Flink, Delta Lake, AWS, and open-source tools, and to unlock data's full potential through advanced analytics and machine learning.

Part 2: https://youtu.be/K84MEdiC1tM
Full course: https://sh.datamasterylab.com/costsaver
Support the channel: https://www.youtube.com/@CodeWithYu/join

Timestamps:
0:00 Introduction
1:24 The system architecture
4:59 The modern system architecture
9:15 Implementation of the Current Data Lakehouse on AWS Cloud
11:33 Creating Databases for Data Lakehouse
12:12 Using Glue crawler for Data Lakehouse
17:19 Using Lambda function to automate data orchestration on AWS Cloud
21:03 Coding the Lambda function
43:57 Optimising Lambda Function
48:46 Verification of Results
53:43 Outro

Resources:
YouTube source code: https://buymeacoffee.com/yusuf.ganiyu/youtube-source-code-building-cost-effective-data-lakehouse
LinkedIn: https://www.linkedin.com/in/yusuf-ganiyu-b90140107/
X (Twitter): https://x.com/YusufOGaniyu
Medium: https://medium.com/@yusuf.ganiyu

Hashtags: #dataengineering #bigdata #dataanalytics #realtimeanalytics #streaming #datalakehouse #datalake #datawarehouse #dataintegration #datatransformation #datagovernance #datasecurity #apachespark #apachekafka #apacheflink #deltalake #aws #opensource #dataingestion #structureddata #unstructureddata #semi-structureddata #dataanalysis #advancedanalytics #dataarchitecture #costoptimization #cloudcomputing #awscloud
