Have you found any cool resources about data engineering? Put them here
- Data Engineering Zoomcamp by DataTalks.Club (free)
- Big Data Platforms, Autumn 2022: Introduction to Big Data Processing Frameworks by the University of Helsinki (free)
- Awesome Data Engineering Learning Path
- Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems by Martin Kleppmann
- Big Data: Principles and Best Practices of Scalable Realtime Data Systems by Nathan Marz, James Warren
- Practical DataOps: Delivering Agile Data Science at Scale by Harvinder Atwal
- Data Pipelines Pocket Reference: Moving and Processing Data for Analytics by James Densmore
- Best books for data engineering
- Fundamentals of Data Engineering: Plan and Build Robust Data Systems by Joe Reis, Matt Housley
Conference talks from companies, blog posts, etc
- Uber Data Archives (Uber engineering blog)
- Data Engineering Weekly (DE-focused substack)
- Seattle Data Guy (DE-focused substack)
- CS50's Introduction to Computer Science | edX (course)
- Python for Everybody SpecializsationSpecialization (course)
- Practical Python programming
- Intro to SQL: Querying and managing data | Khan Academy
- Mode SQL Tutorial
- Use The Index, Luke (SQL Indexing a nd Tuning e-Book)nfreffx
- SQL Performance Explained (book) e
- What is DAG? (video)
- Airflow, Prefect, and Dagster: An Inside Look (blog post)
- Open-Source Spotlight - Prefect - Kevin Kho (video)
- Prefect as a Data Engineering Project Workflow Tool, with Mary Clair Thompson (Duke) - 11/6/2020 (video)
- ETL vs. ELT: What’s the Difference? (blog post) (print version)
- An Introduction to Modern Data Lake Storage Layers (Hodi, Iceberg, Delta Lake) (blog post)
- Lake House Architecture @ Halodoc: Data Platform 2.0 (blzog post)
- Guide to Data Warehousing. Short and comprehensive information… | by Tomas Peluritis (blog post)
- Snowflake, Redshift, BigQuery, and Others: Cloud Data Warehouse Tools Compared (blog post)
- Building Streaming Analytics: The Journey and Learnings - Maxim Lukichev
- Analytics Engineer: New Role in a Data Team with Victoria Perez Mola (podcast)
- Modern Data Stack for Analytics Engineering - Kyle Shannon (video)
- Analytics Engineering vs Data Engineering | RudderStack Blog (blog post)
- Learn the Fundamentals of Analytics Engineering with dbt (course)
- TODO: What is reverse ETL?
- https://datatalks.club/podcast/s05e02-data-engineering-acronyms.html
- Open-Source Spotlight - Grouparoo - Brian Leonard (video)
- Open-Source Spotlight - Castled.io (Reverse ETL) - Arun Thulasidharan (video)
- From Data Science to Data Engineering with Ellen König – DataTalks.Club (podcast)
- Big Data Engineer vs Data Scientist with Roksolana Diachuk – DataTalks.Club (podcast)
- What Skills Do You Need to Become a Data Engineer (blog post)
- The future history of Data Engineering (blog post)
- What Skills Do Data Engineers Need (blog post)
- How To Start A Data Engineering Project - With Data Engineering Project Ideas (video)
- Data Engineering Project for Beginners - Batch edition (blog post)
- Building a Data Engineering Project in 20 Minutes (blog post)
- Automating Nike Run Club Data Analysis with Python, Airflow and Google Data Studio | by Rich Martin | Medium (blog post)
- The Data Engineering Podcast
- DataTalks.Club Podcast (only some episodes are about data engineering)
- TODO
- Karolina Sowinska - YouTube x`
- Seattle Data Guy - YouTube
- Andreas Kretz - YouTube
- DataTalksClub - YouTube (only some videos are about data engineering)
- Reading List by Lars Albertsson
- GitHub - igorbarinov/awesome-data-engineering (focus is more on tools)
This work is licensed under a Creative Commons Attribution 4.0 International License.
CC BY 4.0