Interesting

July 2025

Apache PolarisApache Polaris

Andrew Lamb Accelerating Apache Parquet with metadata stores and specialized indexes using Apache DataFusionAndrew Lamb Accelerating Apache Parquet with metadata stores and specialized indexes using Apache DataFusion

June 2025

Vincent DANIEL Boost Iceberg Performance and Cut Compute Costs with Well-Scoped MERGE StatementsVincent DANIEL Boost Iceberg Performance and Cut Compute Costs with Well-Scoped MERGE Statements

Spark Release 4.0.0 | Apache SparkSpark Release 4.0.0 | Apache Spark

Embed GitHubEmbed GitHub

Data Council The Deconstructed Database and the Advent of the Open Data LakeData Council The Deconstructed Database and the Advent of the Open Data Lake

May 2025

amit Incremental Processing with Apache Iceberg & Spark: A Comprehensive Guideamit Incremental Processing with Apache Iceberg & Spark: A Comprehensive Guide

LakeSphere - Modern Data Lake Management PlatformLakeSphere - Modern Data Lake Management Platform

Apache Iceberg Transactions and Isolation in Apache Iceberg™Apache Iceberg Transactions and Isolation in Apache Iceberg™

Apache Iceberg™ Meetup Lakekeeper: Rust based Iceberg CatalogApache Iceberg™ Meetup Lakekeeper: Rust based Iceberg Catalog

April 2025

Data Council Ten Years of Building Open Source StandardsData Council Ten Years of Building Open Source Standards

@Scale Ray, a Unified Distributed Framework for the Modern AI Stack | Ion Stoica@Scale Ray, a Unified Distributed Framework for the Modern AI Stack | Ion Stoica

Sundog Education with Frank Kane What's New in Apache Spark 4Sundog Education with Frank Kane What's New in Apache Spark 4

March 2025

Home | OpenLineageHome | OpenLineage

Apache DataFusion — Apache DataFusion  documentationApache DataFusion — Apache DataFusion documentation