Soda Core is a data quality and data contract verification engine for modern data stacks.
It lets teams define data quality contracts in YAML and validate both schema and data across supported data platforms. The software provides a command-line interface and Python API, so checks can be run locally during development, embedded in data pipelines, or executed remotely when connected to Soda Cloud.
This is source-available software.
Key Features
- Defines data contracts using human-readable YAML syntax.
- Validates schemas and data against contract rules.
- Provides a command-line interface for creating, testing, publishing, and verifying contracts.
- Offers a Python API for embedding contract verification in workflows and pipelines.
- Supports data sources including PostgreSQL, Snowflake, BigQuery, Databricks, DuckDB, and more.
- Includes more than 50 built-in data quality checks for common and advanced validations.
- Can integrate with Soda Cloud for centralized management and monitoring.
Website: github.com/sodadata/soda-core
Support:
Developer: Soda Data
License: Elastic License 2.0
Soda Core is written in Python. Learn Python with our recommended free books and free tutorials.
Related Software
| Python Data Validation | |
|---|---|
| Pydantic | Data validation using Python type hints |
| pandera | Framework for precision data testing |
| jsonschema | Implementation of JSON Schema for Python |
| Cerberus | Lightweight and extensible data validation library |
| schema | Library for validating Python data structures |
| GX | Validating, documenting, and profiling data |
| marshmallow | ORM/ODM/framework-agnostic library |
| Voluptuous | Python data validation library |
| Schematics | Combine types into structures, validate , and transform the shapes of data |
| Colander | Serialization / deserialization / validation library |
| Valideer | Lightweight data validation and adaptation Python library |
Read our verdict in the software roundup.
Explore our comprehensive directory of recommended free and open source software. Our carefully curated collection spans every major software category.This directory is part of our ongoing series of informative articles for Linux enthusiasts. It features hundreds of detailed reviews, along with open source alternatives to proprietary solutions from major corporations such as Google, Microsoft, Apple, Adobe, IBM, Cisco, Oracle, and Autodesk. You’ll also find interesting projects to try, hardware coverage, free programming books and tutorials, and much more. Discovered a useful open source Linux program that we haven’t covered yet? Let us know by completing this form. |

