Databricks’ Post

View organization page for Databricks

1,062,920 followers

Variant, the native data type for semi-structured data, has been ratified by the Apache Parquet™ community, with support extending across Delta Lake, Apache Iceberg™, and Apache Spark™! Last year, we collaborated with the open source community to create Variant, and were thrilled to see interest from other major open source projects. Now that Variant has been approved, the entire lakehouse ecosystem has a standard, open data type for semi-structured data. https://lnkd.in/gESP97Nm

  • No alternative text description for this image

We are already using it in the physical implementation of JSONiq on top of Spark 😊 Very useful for quickly inferring schemas! And we cannot wait to try out Delta Lake's Variant support, too for implementing the JSONiq Update Facility.

Congrats on the ratification! Variant is a big step forward for the lakehouse ecosystem — standardizing semi-structured data is huge.

Kyle Brockley🥦

From felon to founder | Creator of BrockBox.com 🥦 — Plug-n-Play, Air-Gapped AI | Privacy • Security • Compliance

1d

Love this open data types make the ecosystem stronger for everyone. Now imagine the same standard applied at the human level. Where your personal data health, legal, financial lives in a format you control, locally. That’s what 🥦 BrockBox is all about: open data, but owned by individuals. Enterprise built the cloud. We’re building sovereignty. brockbox.com 🥦 Data ownership = passive income for individuals. 🥦

See more comments

To view or add a comment, sign in

Explore content categories