Awesome Delta Lake & Apache Iceberg Resources
A curated list of articles, blog posts, videos, and other resources about Delta Lake and Apache Iceberg, maintained by our community with help from an AI-powered aggregator.
🌟 Featured Resources
Official Documentation
- Delta Lake Official Docs - Comprehensive Delta Lake documentation
- Apache Iceberg Official Docs - Complete Iceberg documentation
- Delta Lake GitHub - Delta Lake source code
- Apache Iceberg GitHub - Iceberg source code
Specifications
- Delta Transaction Log Protocol - Delta’s ACID transaction protocol
- Iceberg Table Spec - Apache Iceberg’s table format specification
Recent Articles
This section is automatically updated by our resource aggregator bot. New articles are added weekly and reviewed by the community.
Introducing Delta Lake 3.0
Discovered: 2024-01-01
Delta Lake 3.0 brings significant improvements including better performance, enhanced schema evolution capabilities, and improved compatibility with Apache Spark 3.5.
Apache Iceberg: The Definitive Guide
Discovered: 2024-01-01
Comprehensive guide covering Iceberg architecture, design decisions, and best practices for production deployments.
📚 Learning Resources
Tutorials
- Delta Lake Quickstart - Get started with Delta Lake
- Iceberg Quickstart - Get started with Apache Iceberg
- Migration Guide: Parquet to Delta/Iceberg - Convert existing data lakes
Video Content
- Databricks YouTube Channel - Delta Lake videos and webinars
- Apache Iceberg Talks - Conference presentations
Books
- “Delta Lake: The Definitive Guide” by Denny Lee, Tristen Wentling, Scott Haines, and Prashanth Babu
- “Building the Data Lakehouse” by Bill Inmon, et al.
🛠️ Tools and Libraries
Delta Lake Ecosystem
- delta-rs - Native Rust implementation
- kafka-delta-ingest - Stream from Kafka to Delta
- delta-sharing - Open protocol for data sharing
Iceberg Ecosystem
- PyIceberg - Python library for Iceberg
- Iceberg Go - Go implementation
- Nessie - Git-like version control for data lakes
Query Engines
- Apache Spark - Both Delta and Iceberg
- Trino - Both Delta and Iceberg
- Apache Flink - Excellent Iceberg support
- Dremio - Iceberg-native query engine
- Amazon Athena - AWS-managed, supports both
🏢 Case Studies
Delta Lake
- Netflix: Processing petabytes of data with Delta Lake
- Comcast: Real-time streaming analytics
- Adobe: Marketing analytics at scale
- Riot Games: Gaming analytics and ML pipelines
Apache Iceberg
- Netflix: Original creator of Iceberg; uses it for data warehousing
- Apple: Large-scale data processing
- LinkedIn: Data platform modernization
- Expedia: Travel data analytics
📊 Comparisons and Benchmarks
- Feature Comparison Matrix - Side-by-side comparison
- TPC-DS Benchmarks - Performance benchmarks
- Onehouse Benchmark - Multi-format comparison
🎓 Courses and Training
Free Courses
- Databricks Academy - Free Delta Lake courses
- Apache Iceberg Tutorials - Official tutorials
Paid Courses
🔧 Integration Guides
Cloud Platforms
- Delta Lake on AWS
- Delta Lake on Azure
- Delta Lake on GCP
- Iceberg on AWS
- Iceberg on Azure
- Iceberg on GCP
BI Tools
🎤 Community
Slack Channels
Mailing Lists
Meetups and Conferences
- Data + AI Summit - Annual Databricks conference
- ApacheCon (now Community Over Code) - Apache Software Foundation conference
- Local Data Engineering Meetups
🔬 Research Papers
- Delta Lake: High-Performance ACID Table Storage over Cloud Object Stores
- Apache Iceberg: Unlocking the Power of Open Standards
🤝 Contributing
This awesome list is community-maintained. To add a resource:
- Check if it’s already listed
- Ensure it’s relevant and high-quality
- Submit a PR with your addition
- Include a brief description
Our AI-powered aggregator also discovers new content weekly and creates PRs for review.
See our Contributing Guide for details.
📜 License
This awesome list is part of the Delta Lake & Apache Iceberg Knowledge Hub, licensed under Apache 2.0.
Last Updated: 2025-11-14
Maintained By: Community + AI Aggregator 🤖