Google Cloud End to End Data Engineering Projects - Elasticsearch for High Throughout Systems - 1 Billion records!

Master End-to-End Data Engineering: Real-Time Streaming, AI Integrations, and High-Performance Systems! Dive into hands-on projects, expert-guided tutorials, and cutting-edge technologies for a standout career in data engineering.

4.0 (2)

25 learners

What you'll learn

Understand and implement real-time streaming with Google Cloud for data engineering projects
Learn how to perform real-time socket streaming using Apache Spark
Master the use of Apache Airflow alongside Spark, Pyspark, Java, and Scala for data engineering
Develop skills to build and optimize high-performance, real-time analytics databases

This course includes

47.5 hours of video
Certificate of completion
Access on mobile and TV

Summary

Keywords

Apache Kafka Apache Spark Kafka and Spark integration data engineering high-performance data pipelines big data processing ELK stack Grafana Prometheus real-time data processing Kafka Schema Registry Kafka Control Center

Full Transcript

In this hands-on project, learn how to set up real-time monitoring for a high-performance data pipeline processing 1.2 billion records per hour! We cover everything from configuring Grafana and Prometheus to creating a comprehensive Kafka and Spark monitoring dashboard. Watch to see how modern data engineering tools come together for effective system monitoring. PART 2: https://youtu.be/PL6Tl2sqh8k Timestamps: 0:00 Introduction 2:25 Reading and Parsing Logs from Different Source 4:17 Setting up Elasticsearch for High Throughput 7:45 Elasticsearch UI Navigation 9:02 Setting up Filebeat 20:46 Setting up Logstash 28:00 Querying Indexed Records on Elasticsearch 30:00 Outro Like this video? Support us: https://www.youtube.com/@CodeWithYu/join 👀 Don't just watch it, build it! 🚧 👍 Like, Comment, & Subscribe for more cutting-edge data engineering content! Resources: Full Source Code: https://buymeacoffee.com/yusuf.ganiyu/full-source-code-monitoring-high-performance-architecture-systems Kafka Documentation: https://kafka.apache.org/documentation/ Apache Spark Documentation: https://spark.apache.org/documentation.html JMX Exporter Agent - https://github.com/prometheus/jmx_exporter/releases #Elasticsearch, #Logstash, #Filebeat, #Kibana, #DataEngineering, #BigData, #RealTimeProcessing, #BigDataAnalytics, #DataPipeline, #StreamingData, #KafkaMonitoring, #SparkStreaming, #DataArchitecture, #HighPerformanceComputing