Variant, the native data type for semi-structured data, has been ratified by the Apache Parquet™ community, with support extending across Delta Lake, Apache Iceberg™, and Apache Spark™! Last year, we collaborated with the open source community to create Variant, and were thrilled to see interest from other major open source projects. Now that Variant has been approved, the entire lakehouse ecosystem has a standard, open data type for semi-structured data. https://lnkd.in/gESP97Nm
Congrats on the ratification! Variant is a big step forward for the lakehouse ecosystem — standardizing semi-structured data is huge.
Love this! Open data types make the ecosystem stronger for everyone. Now imagine the same standard applied at the human level, where your personal data (health, legal, financial) lives in a format you control, locally. That’s what 🥦 BrockBox is all about: open data, but owned by individuals. Enterprise built the cloud. We’re building sovereignty. brockbox.com 🥦 Data ownership = passive income for individuals. 🥦
Lecturer at ETH Zurich
We are already using it in the physical implementation of JSONiq on top of Spark 😊 Very useful for quickly inferring schemas! And we cannot wait to try out Delta Lake's Variant support, too, for implementing the JSONiq Update Facility.