Data Science

Soda Core – data quality and data contract verification engine

Soda Core is a data quality and data contract verification engine for modern data stacks.

It lets teams define data quality contracts in YAML and validate both schema and data across supported data platforms. The software provides a command-line interface and Python API, so checks can be run locally during development, embedded in data pipelines, or executed remotely when connected to Soda Cloud.

This is source-available software.

Key Features

  • Defines data contracts using human-readable YAML syntax.
  • Validates schemas and data against contract rules.
  • Provides a command-line interface for creating, testing, publishing, and verifying contracts.
  • Offers a Python API for embedding contract verification in workflows and pipelines.
  • Supports data sources including PostgreSQL, Snowflake, BigQuery, Databricks, DuckDB, and more.
  • Includes more than 50 built-in data quality checks for common and advanced validations.
  • Can integrate with Soda Cloud for centralized management and monitoring.
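
To make the idea of contract verification concrete, here is a minimal sketch in plain Python. It does not use Soda Core and does not reproduce its YAML syntax, CLI, or Python API. The `CONTRACT` dictionary layout, the check names `no_missing` and `unique`, and the `verify_contract` function are all hypothetical, chosen only for illustration. The sketch assumes an in-memory SQLite table and shows the two kinds of validation described above: a schema check (declared columns and types) and data checks (missing and duplicate values).

```python
import sqlite3

# Hypothetical contract, shaped like what a YAML file might parse into.
# Illustrative only: this is NOT Soda Core's contract syntax.
CONTRACT = {
    "dataset": "orders",
    "columns": [
        {"name": "id", "type": "INTEGER", "checks": ["no_missing", "unique"]},
        {"name": "amount", "type": "REAL", "checks": ["no_missing"]},
    ],
}


def verify_contract(conn, contract):
    """Return a list of failure messages; an empty list means the contract holds."""
    table = contract["dataset"]
    failures = []

    # Schema check: compare declared columns with what the database reports.
    # PRAGMA table_info rows are (cid, name, type, notnull, dflt_value, pk).
    actual = {row[1]: row[2] for row in conn.execute(f"PRAGMA table_info({table})")}

    for col in contract["columns"]:
        name = col["name"]
        if name not in actual:
            failures.append(f"{table}.{name}: column not found")
            continue
        if actual[name].upper() != col["type"].upper():
            failures.append(
                f"{table}.{name}: expected type {col['type']}, found {actual[name]}"
            )

        # Data checks: each hypothetical check name maps to one SQL query.
        for check in col.get("checks", []):
            if check == "no_missing":
                query = f"SELECT COUNT(*) FROM {table} WHERE {name} IS NULL"
                label = "missing value(s)"
            elif check == "unique":
                query = f"SELECT COUNT({name}) - COUNT(DISTINCT {name}) FROM {table}"
                label = "duplicate value(s)"
            else:
                failures.append(f"{table}.{name}: unknown check '{check}'")
                continue
            (count,) = conn.execute(query).fetchone()
            if count:
                failures.append(f"{table}.{name}: {count} {label}")

    return failures


# Sample data with one duplicate id and one missing amount.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)", [(1, 9.5), (2, None), (2, 3.0)]
)

failures = verify_contract(conn, CONTRACT)
for message in failures:
    print(message)
```

With the sample rows above, the sketch reports one duplicate `id` and one missing `amount`. A real engine such as Soda Core adds far more on top of this, including many more check types and connectors for the data platforms listed above, so treat this only as an outline of the concept.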

Website: github.com/sodadata/soda-core
Support:
Developer: Soda Data
License: Elastic License 2.0

Soda Core is written in Python. Learn Python with our recommended free books and free tutorials.


Related Software

Python Data Validation
Pydantic – Data validation using Python type hints
pandera – Framework for precision data testing
jsonschema – Implementation of JSON Schema for Python
Cerberus – Lightweight and extensible data validation library
schema – Library for validating Python data structures
GX – Validating, documenting, and profiling data
marshmallow – ORM/ODM/framework-agnostic library
Voluptuous – Python data validation library
Schematics – Combine types into structures, validate, and transform the shapes of data
Colander – Serialization / deserialization / validation library
Valideer – Lightweight data validation and adaptation Python library

Read our verdict in the software roundup.

