Awesome Delta Lake & Apache Iceberg Resources

A curated list of articles, blog posts, videos, and resources about Delta Lake and Apache Iceberg, automatically maintained by our community and AI-powered aggregator.

Official Documentation

Specifications

Recent Articles

This section is automatically updated by our resource aggregator bot. New articles are added weekly and reviewed by the community.

Delta Lake 3.2: Liquid Clustering and Improved Performance

Discovered: 2025-02-01

Delta Lake 3.2 introduces Liquid Clustering as a replacement for static partitioning and Z-ordering, automatically reorganizing data based on actual query patterns for improved performance without manual tuning.


Apache Iceberg 1.5: Row-Level Deletes and Merge-on-Read Improvements

Discovered: 2025-01-15

Iceberg 1.5 ships significant performance improvements for Merge-on-Read tables, enhanced row-level delete efficiency, and expanded metadata statistics support for better query planning across all supported engines.


Choosing Between Delta Lake and Apache Iceberg in 2025

Discovered: 2025-03-10

A comprehensive comparison of both open table formats, covering ecosystem maturity, vendor support, hidden partitioning, streaming integration, and real-world migration experiences from Databricks and Netflix engineering teams.


Lakehouse Architecture with Apache Iceberg on AWS

Discovered: 2025-04-01

Step-by-step guide to building an AWS-native data lakehouse using Apache Iceberg with Amazon Athena, AWS Glue, and S3, covering catalog integration, partition management, and compaction automation.


📚 Learning Resources

Tutorials

Video Content

Books

  • Delta Lake: The Definitive Guide — Denny Lee, Tristen Wentling, Prashanth Babu, Scott Haines (O’Reilly, 2023)
  • Apache Iceberg: The Definitive Guide — Tomer Shiran, Jason Hughes, Alex Merced (O’Reilly, 2024)
  • Building the Data Lakehouse — Bill Inmon, et al.

🛠️ Tools and Libraries

Delta Lake Ecosystem

Iceberg Ecosystem

Query Engines

🏢 Case Studies

Delta Lake

  • Netflix: Processing petabytes of data with Delta Lake
  • Comcast: Real-time streaming analytics
  • Adobe: Marketing analytics at scale
  • Riot Games: Gaming analytics and ML pipelines

Apache Iceberg

  • Netflix: Original creator, uses Iceberg for data warehousing
  • Apple: Large-scale data processing
  • LinkedIn: Data platform modernization
  • Expedia: Travel data analytics

📊 Comparisons and Benchmarks

🎓 Courses and Training

Free Courses

🔧 Integration Guides

Cloud Platforms

BI Tools

🎤 Community

Slack Channels

Mailing Lists

Meetups and Conferences

  • Data + AI Summit - Annual Databricks conference
  • ApacheCon - Apache Software Foundation conference
  • Local Data Engineering Meetups

🔬 Research Papers

🤝 Contributing

This awesome list is community-maintained. To add a resource:

  1. Check if it’s already listed
  2. Ensure it’s relevant and high-quality
  3. Submit a PR with your addition
  4. Include a brief description

Our AI-powered aggregator also discovers new content weekly and creates PRs for review.

See our Contributing Guide for details.

📜 License

This awesome list is part of the Delta Lake & Apache Iceberg Knowledge Hub, licensed under Apache 2.0.


Last Updated: 2026-04-27
Maintained By: Community + AI Aggregator 🤖