Course Hive
Search

Welcome

Sign in or create your account

Continue with Google
or
Building Data Lakehouse from Scratch  -  End to End Data Engineering Project
Play lesson

Google Cloud End to End Data Engineering Projects - Building Data Lakehouse from Scratch - End to End Data Engineering Project

Master End-to-End Data Engineering: Real-Time Streaming, AI Integrations, and High-Performance Systems! Dive into hands-on projects, expert-guided tutorials, and cutting-edge technologies for a standout career in data engineering.

4.0 (2)
25 learners

What you'll learn

Understand and implement real-time streaming with Google Cloud for data engineering projects
Learn how to perform real-time socket streaming using Apache Spark
Master the use of Apache Airflow alongside Spark, Pyspark, Java, and Scala for data engineering
Develop skills to build and optimize high-performance, real-time analytics databases

This course includes

  • 47.5 hours of video
  • Certificate of completion
  • Access on mobile and TV

Summary

Keywords

Full Transcript

In this video you will learn to design, implement and maintain secure, scalable and cost effective lakehouse architectures leveraging Apache Spark, Apache Kafka, Apache Flink, Delta Lake, AWS, and open-source tools. Unlock data's full potential through advanced analytics and machine learning. Part 2: https://youtu.be/K84MEdiC1tM FULL COURSE AVAILABLE: https://sh.datamasterylab.com/costsaver Like this video? Support us: https://www.youtube.com/@CodeWithYu/join Timestamps: 0:00 Introduction 1:24 The system architecture 4:59 The modern system architecture 9:15 Implementation of the Current Data Lakehouse on AWS Cloud 11:33 Creating Databases for Data Lakehouse 12:12 Using Glue crawler for Data Lakehouse 17:19 Using Lambda function to automate data orchestration on AWS Cloud 21:03 Coding the Lambda function 43:57 Optimising Lambda Function 48:46 Verification of Results 53:43 Outro Resources: Youtube Source Code: https://buymeacoffee.com/yusuf.ganiyu/youtube-source-code-building-cost-effective-data-lakehouse 🌟 Please LIKE ❤️ and SUBSCRIBE for more AMAZING content! 🌟 👦🏻 My Linkedin: https://www.linkedin.com/in/yusuf-ganiyu-b90140107/ 🚀 X(Twitter): https://x.com/YusufOGaniyu 📝 Medium: https://medium.com/@yusuf.ganiyu Hashtags: #dataengineering #bigdata #dataanalytics #realtimeanalytics #streaming, #datalakehouse, #datalake, #datawarehouse, #dataintegration, #datatransformation, #datagovernance, #datasecurity, #apachespark, #apachekafka, #apacheflink, #deltalake, #aws, #opensource, #dataingestion, #structureddata, #unstructureddata, #semi-structureddata, #dataanalysis, #advancedanalytics, #dataarchitecture, #costoptimization, #cloudcomputing, #awscloud

Course Hive

Continue this lesson in the app

Install CourseHive on Android or iOS to keep learning while you move.

Related Courses

FAQs

Course Hive
Download CourseHive
Keep learning anywhere