Awesome Delta Lake & Apache Iceberg Resources
A curated list of articles, blog posts, videos, and resources about Delta Lake and Apache Iceberg, automatically maintained by our community and AI-powered aggregator.
🌟 Featured Resources
Official Documentation
- Delta Lake Official Docs - Comprehensive Delta Lake documentation
- Apache Iceberg Official Docs - Complete Iceberg documentation
- Delta Lake GitHub - Delta Lake source code
- Apache Iceberg GitHub - Iceberg source code
Specifications
- Delta Transaction Log Protocol - Delta’s ACID transaction protocol
- Iceberg Table Spec - Apache Iceberg’s table format specification
Recent Articles
This section is automatically updated by our resource aggregator bot. New articles are added weekly and reviewed by the community.
Delta Lake 3.2: Liquid Clustering and Improved Performance
Discovered: 2025-02-01
Delta Lake 3.2 introduces Liquid Clustering as a replacement for static partitioning and Z-ordering, automatically reorganizing data based on actual query patterns for improved performance without manual tuning.
Apache Iceberg 1.5: Row-Level Deletes and Merge-on-Read Improvements
Discovered: 2025-01-15
Iceberg 1.5 ships significant performance improvements for Merge-on-Read tables, enhanced row-level delete efficiency, and expanded metadata statistics support for better query planning across all supported engines.
Choosing Between Delta Lake and Apache Iceberg in 2025
Discovered: 2025-03-10
A comprehensive comparison of both open table formats, covering ecosystem maturity, vendor support, hidden partitioning, streaming integration, and real-world migration experiences from Databricks and Netflix engineering teams.
Lakehouse Architecture with Apache Iceberg on AWS
Discovered: 2025-04-01
Step-by-step guide to building an AWS-native data lakehouse using Apache Iceberg with Amazon Athena, AWS Glue, and S3, covering catalog integration, partition management, and compaction automation.
📚 Learning Resources
Tutorials
- Delta Lake Quickstart - Get started with Delta Lake
- Iceberg Quickstart - Get started with Apache Iceberg
- Migration Guide: Parquet to Delta/Iceberg - Convert existing data lakes
Video Content
- Databricks YouTube Channel - Delta Lake videos and webinars
- Apache Iceberg Talks - Conference presentations
Books
- Delta Lake: The Definitive Guide — Denny Lee, Tristen Wentling, Prashanth Babu, Scott Haines (O’Reilly, 2023)
- Apache Iceberg: The Definitive Guide — Tomer Shiran, Jason Hughes, Alex Merced (O’Reilly, 2024)
- Building the Data Lakehouse — Bill Inmon, et al.
🛠️ Tools and Libraries
Delta Lake Ecosystem
- delta-rs - Native Rust implementation
- kafka-delta-ingest - Stream from Kafka to Delta
- delta-sharing - Open protocol for data sharing
Iceberg Ecosystem
- PyIceberg - Python library for Iceberg
- Iceberg Go - Go implementation
- Nessie - Git-like version control for data lakes
Query Engines
- Apache Spark - Both Delta and Iceberg
- Trino - Both Delta and Iceberg
- Apache Flink - Excellent Iceberg support
- Dremio - Iceberg-native query engine
- Athena - AWS-managed, supports both
🏢 Case Studies
Delta Lake
- Netflix: Processing petabytes of data with Delta Lake
- Comcast: Real-time streaming analytics
- Adobe: Marketing analytics at scale
- Riot Games: Gaming analytics and ML pipelines
Apache Iceberg
- Netflix: Original creator, uses Iceberg for data warehousing
- Apple: Large-scale data processing
- LinkedIn: Data platform modernization
- Expedia: Travel data analytics
📊 Comparisons and Benchmarks
- Feature Comparison Matrix — In-depth side-by-side comparison (60+ criteria)
- TPC-DS Benchmarks — Delta Lake 3.0 performance results
- Onehouse Lakehouse Format Comparison — Delta, Iceberg, and Hudi compared
- Dremio: Open Table Formats in 2024 — Practical comparison with real workloads
🎓 Courses and Training
Free Courses
- Databricks Academy - Free Delta Lake courses
- Apache Iceberg Tutorials - Official tutorials
Paid Courses
🔧 Integration Guides
Cloud Platforms
- Delta Lake on AWS
- Delta Lake on Azure
- Delta Lake on GCP
- Iceberg on AWS
- Iceberg on Azure
- Iceberg on GCP
BI Tools
🎤 Community
Slack Channels
Mailing Lists
Meetups and Conferences
- Data + AI Summit - Annual Databricks conference
- ApacheCon - Apache Software Foundation conference
- Local Data Engineering Meetups
🔬 Research Papers
- Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores
- Apache Iceberg: Unlocking the Power of Open Standards
🤝 Contributing
This awesome list is community-maintained. To add a resource:
- Check if it’s already listed
- Ensure it’s relevant and high-quality
- Submit a PR with your addition
- Include a brief description
Our AI-powered aggregator also discovers new content weekly and creates PRs for review.
See our Contributing Guide for details.
📜 License
This awesome list is part of the Delta Lake & Apache Iceberg Knowledge Hub, licensed under Apache 2.0.
Last Updated: 2026-04-27
Maintained By: Community + AI Aggregator 🤖