Full Transcript
Hey data engineers! Do you want to know how to organize your data lake to avoid the dreaded data swamp? Find out what the first layer, Raw, should look like and why it is important to have one.

▬▬▬▬▬▬ IMPORTANT LINKS ▬▬▬▬▬▬
My LinkedIn profile: https://www.linkedin.com/in/piotr-tybulewicz-81a8793/
GitHub with my drawings: https://github.com/TybulOnAzure/DP-203
Storage account limits: https://learn.microsoft.com/en-us/azure/storage/common/scalability-targets-standard-account

▬▬▬▬▬▬ SUPPORTING MY CHANNEL ▬▬▬▬▬▬
Buy me a coffee (or beer): https://buymeacoffee.com/piotrv
Join this channel to get access to perks: https://www.youtube.com/channel/UCLnXq-Fr-6rAsCitq9nYiGg/join

▬▬▬▬▬▬ CHAPTERS ▬▬▬▬▬▬
00:00 Introduction
00:24 Basic BI flow revisited
02:40 Data swamp
04:39 Data lake zones
06:46 What is Raw layer?
11:12 Why to have Raw layer?
19:26 How to implement Raw layer?
46:23 Summary
