Delta Lake

Delta Lake · 2026-05-25T19:37:40.301Z

Headed to Data + AI Summit? Don't miss New Foundations of Delta Lake with Kernel and Spark's Data Source V2 with Rahul Potharaju and Tathagata Das. Since 2018, Delta has built the intelligence to deliver a high-performance, intuitive experience and command support on Spark DSv1 — work that set the standard for the last 8 years. Spark’s DSv2 APIs now take on more of that heavy lifting in the engine. This session explores how Delta is being updated to use DSv2 to build new foundations for the next decade of Delta. 👇 🗓️ June 15-18 📍 San Francisco 🔗 Session details: https://lnkd.in/e2dPZavQ #DeltaLake #ApacheSpark #DataAISummit #OpenSource #DataEngineering

Software Development

Delta Lake is an open-source storage framework that enables building a Lakehouse architecture.

Discover all 54 employees

About us

Delta Lake is an open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs for Scala, Java, Rust, Ruby, and Python. Delta Lake is an independent open-source project and not controlled by any single company. To emphasize this we joined the Delta Lake Project in 2019, which is a sub-project of the Linux Foundation Projects.

Website: https://delta.io
External link for Delta Lake
Industry: Software Development
Company size: 11-50 employees
Headquarters: San Francisco
Type: Partnership
Founded: 2019
Specialties: Delta Lake, Apache Spark, PrestoDB, Trino, Hive, Apache Flink, Apache Beam, Apache Pulsar, Rust, Scala, Java, Python, and Ruby

Locations

Primary

San Francisco, US

Get directions

Employees at Delta Lake

maurice muchiwa

See all employees

Updates

Delta Lake

66,692 followers
1w
Report this post
For open table formats, new features and integrations can create exponentially more implementation work across projects. At Data + AI Summit, Holly Smith and Robert Pack will walk through how Kernel rewrites the source of truth, so projects that adopt it no longer need additional development for new features. They “just appear.” What started as a contribution to Delta could change how open table formats fundamentally integrate everywhere. 🌍 📍 San Francisco | June 15-18 🔗 Session details: https://lnkd.in/eiuUuF4A #DeltaLake #OpenSource #DataEngineering #DataAISummit
Like Comment Share
Delta Lake

66,692 followers
1w Edited
Report this post
From 2017 to now, Delta Lake has grown significantly, with 40M+ downloads/month and powering the daily processing of hundreds of exabytes. Now the conversation shifts to what comes next. 👇 Join the Data + AI Summit (June 15-18) session “The Road to Delta 5.0” for a look at key shifts ahead: 🔹 Transitioning Delta into a catalog-first table format 🔹 Modernizing Delta on Spark with Data Source V2 APIs 🔹 Convergence of Delta and Iceberg formats 🔹 Harmonizing Delta Kernel across Java and Rust implementations 🔗 Session details: https://lnkd.in/e-rknKK2 #deltalake #opensource #dataaisummit
3 Comments

Like Comment Share
Delta Lake

66,692 followers
1w
Report this post
Back by popular demand! 📘 Delta Lake: The Definitive Guide book signing returns to Data + AI Summit 2026. If you are at Data + AI Summit, come meet the authors, say hi to the Delta Lake community, and pick up a signed copy while supplies last. 🗓️ Tuesday, June 16 🕝 2:00–2:30 PM 📍 Dev Lounge, Data + AI Summit Expo, Moscone Center Books tend to go quickly, so plan to stop by early. Hope to see you there! 👋 cc Denny Lee, Scott Haines, Tristen Wentling, Tyler Croy #DeltaLake #DataAISummit #OpenSource #DataEngineering

3 Comments

Like Comment Share
Delta Lake

66,692 followers
2w
Report this post
For years, Delta Lake and Apache Iceberg co-evolved as parallel standards. That era is ending and this session goes deep on the technical how of that convergence. At Data + AI Summit (June 15–18, San Francisco), Anoop Johnson and Ryan Blue (Databricks) will go deep on the technical how of this convergence. What they'll cover: 🔹 𝗜𝗰𝗲𝗯𝗲𝗿𝗴 𝘃𝟰'𝘀 𝗮𝗱𝗮𝗽𝘁𝗶𝘃𝗲 𝗺𝗲𝘁𝗮𝗱𝗮𝘁𝗮 𝘁𝗿𝗲𝗲: what's changing in the redesign, and how the new structure enables single-file commits to significantly accelerate performance 🔹 𝗗𝗲𝗹𝘁𝗮 𝗟𝗮𝗸𝗲 𝟱.𝟬: adopts the Iceberg v4 metadata tree as its native content metadata 🔹 𝗢𝗻𝗲 𝗼𝗻-𝗱𝗶𝘀𝗸 𝘀𝘁𝗿𝘂𝗰𝘁𝘂𝗿𝗲: both Delta and Iceberg clients read and write directly, with no translation layers and no conversion overhead 🔗 Add it to your DAIS schedule: https://lnkd.in/eZgZS256 #DeltaLake #ApacheIceberg #DataAISummit #OpenLakehouse #DataEngineering
4 Comments

Like Comment Share
Delta Lake

66,692 followers
2w
Report this post
Catalog-Managed Tables changed the contract for Delta Lake: unified governance, enforceable constraints, and a path toward multi-table updates. The catalog is now the authoritative broker of commits—not the filesystem—and Delta connectors have to adapt. At Data + AI Summit (June 15–18, San Francisco), Scott Sandre will show how Catalog-Managed Tables support was designed in Delta Kernel. 👇 🔹 A unified, catalog-agnostic API for connectors to build against 🔹 Deep engine integrations for DuckDB, Delta-Spark, and Delta-Flink 🔹 Catalog complexity kept out of Kernel—the one right Delta abstraction 🔗 Session details: https://lnkd.in/ej3Rt7ks #DeltaLake #DeltaKernel #DataAISummit #Lakehouse
Like Comment Share
Delta Lake

66,692 followers
2w
Report this post
Data + AI Summit session: Tyler Croy (Scribd, Inc.) will introduce Virtual Delta Tables and the associated open source code designed for multimodal inference.👇 🔹 Incorporate original data from different sources into a singular logical structure for data scientists and ML engineers 🔹 A Delta Lake interface so the ecosystem from Databricks to DuckDB can be used in multimodal work 🔹 How far you can push the lakehouse architecture in a rapidly changing AI data landscape 🗓️ June 15-18 📍 San Francisco 🔗 Session details: https://lnkd.in/e_JrCfHq #DeltaLake #MultimodalAI #DataAndAISummit #Lakehouse
Like Comment Share
Delta Lake

66,692 followers
3w
Report this post
Headed to Data + AI Summit? Don't miss Your Guide to Open Table Formats — Delta, Iceberg, Best Practices, and What's Next! with Benjamin Mathew and Scott Sandre. This session covers Delta 5.0, Iceberg v4 Adaptive Metadata Tree, Unified Delta Kernel (GA), Catalog-Managed Commits, Auto CDF, FILE data type and VARIANT shredding, plus best practices for working across formats today. 👇 🗓️ June 15–18 📍 San Francisco 🔗 Add it to your agenda: https://lnkd.in/emjjX_dQ #DeltaLake #ApacheIceberg #DataAISummit #OpenSource #DataEngineering
Like Comment Share
Delta Lake

66,692 followers
3w
Report this post
Delta Lake and Apache Iceberg™ have converged on similar ideas—columnar metadata, manifest trees, deletion vectors—but two separate metadata structures still duplicate work. At Data + AI Summit (June 15–18, San Francisco), this session introduces the next major evolution of Delta Lake metadata. 👇 🔹 Unified metadata architecture: Delta Lake commits store content metadata directly in Iceberg v4’s adaptive metadata tree 🔹 Delta Lake gains efficient tree-structured manifests and Iceberg interoperability 🔹 Preserving the transactional guarantees that Delta users depend on 🔗 Session details: https://lnkd.in/ehuzk53w #DeltaLake #ApacheIceberg #DataandAISummit #Lakehouse
2 Comments

Like Comment Share
Delta Lake

66,692 followers
3w
Report this post
DuckDB's Delta and Unity Catalog extensions are no longer experimental. DuckDB can now INSERT into Delta tables, time-travel across versions, and read/write through Unity Catalog. 🔷 INSERT writes — INSERT via ATTACH ... (TYPE delta). Multiple INSERTs in one BEGIN/COMMIT = one Delta version. 🔷 Time travel — AT (VERSION => n), attach with VERSION n, or PIN_SNAPSHOT for a stable snapshot. 🔷 Unity Catalog — Catalog Managed Tables use Catalog Commits to keep UC in sync during concurrent writes. 🔷 Incremental snapshot loading — Faster time travel across nearby versions (nightly builds now, DuckDB v1.5.3 stable next). Delta + Unity Catalog + DuckDB: open storage, governance, fast analytical queries. Docker playground in the post. ⬇️ 🔗 Read more: https://lnkd.in/eCFajASh #DeltaLake #DuckDB #UnityCatalog #OpenSource #OpenLakehouse #Lakehouse
7 Comments

Like Comment Share
Delta Lake

66,692 followers
4w
Report this post
Headed to Data + AI Summit? Don't miss New Foundations of Delta Lake with Kernel and Spark's Data Source V2 with Rahul Potharaju and Tathagata Das. Since 2018, Delta has built the intelligence to deliver a high-performance, intuitive experience and command support on Spark DSv1 — work that set the standard for the last 8 years. Spark’s DSv2 APIs now take on more of that heavy lifting in the engine. This session explores how Delta is being updated to use DSv2 to build new foundations for the next decade of Delta. 👇 🗓️ June 15-18 📍 San Francisco 🔗 Session details: https://lnkd.in/e2dPZavQ #DeltaLake #ApacheSpark #DataAISummit #OpenSource #DataEngineering
1 Comment

Like Comment Share

Delta Lake

Software Development

Delta Lake is an open-source storage framework that enables building a Lakehouse architecture.

About us

Locations

Employees at Delta Lake

maurice muchiwa

Updates

Join now to see what you are missing

Similar pages

Apache Iceberg

Apache Spark

Unity Catalog

Databricks

Apache Hudi

Tabular (now part of Databricks)

MLflow

DuckDB

Snowflake

Apache Airflow

Browse jobs

Engineer jobs

Vice President Staffing jobs

Director of Technology jobs

Channel Sales Manager jobs

Principal jobs

Chief Technology Officer jobs

Analyst jobs

Regional Manager jobs

Business Manager jobs

Area Manager jobs

Information Technology Manager jobs

Account Executive jobs

Sales Manager jobs

Affiliate Manager jobs

Director jobs

Manager jobs

Assistant Vice President jobs

Vice President jobs

Data Engineer jobs

Senior Vice President jobs