Course Hive
PySpark - Zero to Hero | PySpark Tutorial 2025 | Spark Tutorial 2025 | Learn from Basics to Advanced Performance Optimization - 14 Read, Parse or Flatten JSON data | JSON file with Schema | from_json | to_json | Multiline JSON

4.0 (1)
18 learners

This course includes

  • 9 hours of video
  • Certificate of completion
  • Access on mobile and TV

This video explains:

  • How to read JSON files?
  • How to parse JSON data?
  • How to flatten JSON data?
  • What is the explode function?
  • What is the from_json function?
  • What is the to_json function?
  • How to write a complex schema for JSON?

Chapters

  • 00:00 - Introduction
  • 02:01 - Read Single Line JSON file
  • 03:29 - Read Multiline JSON file
  • 04:42 - Read JSON data in Single column
  • 05:29 - Read JSON file with Schema
  • 07:00 - Write Schema DDL String
  • 09:20 - from_json function
  • 11:00 - to_json function
  • 12:39 - Flatten JSON data

Links

  • Spark JSON Documentation - https://spark.apache.org/docs/latest/sql-data-sources-json.html
  • Local PySpark Jupyter Lab setup - https://youtu.be/WhxljT3IfdM
  • Python Basics - https://www.learnpython.org/
  • GitHub URL for code - https://github.com/subhamkharwal/pyspark-zero-to-hero/blob/master/10_read_json_files.ipynb

The series is a step-by-step guide to learning PySpark, the Python API for Apache Spark, a popular open-source distributed computing framework used for big data processing. A new video is published every 3 days ❤️

#spark #pyspark #python #dataengineering
