Browse free open source Data Integration tools and projects below. Use the toggles on the left to filter open source Data Integration tools by OS, license, language, programming language, and project status.

  • Auth0 for AI Agents now in GA Icon
    Auth0 for AI Agents now in GA

    Ready to implement AI with confidence (without sacrificing security)?

    Connect your AI agents to apps and data more securely, give users control over the actions AI agents can perform and the data they can access, and enable human confirmation for critical agent actions.
    Start building today
  • Zenflow- The AI Workflow Engine for Software Devs Icon
    Zenflow- The AI Workflow Engine for Software Devs

    Parallel agents. Multi-agent orchestration. Specs that turn into shipped code. Zenflow automates planning, coding, testing, and verification.

    Zenflow is the AI workflow engine built for real teams. Parallel agents plan, code, test, and verify in one workflow. With spec-driven development and deep context, Zenflow turns requirements into production-ready output so teams ship faster and stay in flow.
    Try free now
  • 1
    Open Source Data Quality and Profiling

    Open Source Data Quality and Profiling

    World's first open source data quality & data preparation project

    This project is dedicated to open source data quality and data preparation solutions. Data Quality includes profiling, filtering, governance, similarity check, data enrichment alteration, real time alerting, basket analysis, bubble chart Warehouse validation, single customer view etc. defined by Strategy. This tool is developing high performance integrated data management platform which will seamlessly do Data Integration, Data Profiling, Data Quality, Data Preparation, Dummy Data Creation, Meta Data Discovery, Anomaly Discovery, Data Cleansing, Reporting and Analytic. It also had Hadoop ( Big data ) support to move files to/from Hadoop Grid, Create, Load and Profile Hive Tables. This project is also known as "Aggregate Profiler" Resful API for this project is getting built as (Beta Version) https://sourceforge.net/projects/restful-api-for-osdq/ apache spark based data quality is getting built at https://sourceforge.net/projects/apache-spark-osdq/
    Downloads: 9 This Week
    Last Update:
    See Project
  • 2

    PDI Data Vault framework

    Data Vault loading automation using Pentaho Data Integration.

    A metadata driven 'tool' to automate loading a designed Data Vault. It consists of a set of Pentaho Data Integration and database objects. Thel Virtual Machine (VMware) is a 64 bit Ubuntu Server 14.04, with MySQL (Percona Server) and PostgreSQL 9.4 as the database flavours and PDI version 5.2 CE. NB: Directory version_2.4 contains the most recent Virtual Machine. The readme.txt contains info about that VM.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 3
    Jaspersoft ETL
    Jaspersoft ETL is a data integration platform providing high performance data extract-transform-load (ETL) capabilities. Jaspersoft ETL is appropriate for all analytic and operational data integration needs. Activity on this project is located at jas
    Downloads: 8 This Week
    Last Update:
    See Project
  • 4
    SnapLogic is an Open Source Data Integration framework that combines the power of state-of-the-art dynamic programming languages with standard Web interfaces to solve today's most pressing problems in data integration.
    Downloads: 8 This Week
    Last Update:
    See Project
  • Grafana: The open and composable observability platform Icon
    Grafana: The open and composable observability platform

    Faster answers, predictable costs, and no lock-in built by the team helping to make observability accessible to anyone.

    Grafana is the open source analytics & monitoring solution for every database.
    Learn More
  • 5
    Daffodil Replicator is a powerful Open Source Java tool for data integration, data migration and data protection in real time. It allows bi-directional data replication and synchronization between homogeneous / heterogeneous databases including Oracle, M
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    PaloKettlePlugin is for Pentaho Data Integration aka Kettle. It's a Cell Input und Output Step for Palo Molap. The first code was developed by mybiq/3A-Strategy, the PDI-3 version has been developed by Stratebi. Now by 3A-Strategy and Litebi for PDI
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    The BioDataServer is a database integration system. It implements a mediator-wrapper architecture and offers a SQL interface. The data integration is based on user defined intergrated schema and adapter that wrap any kind of data source.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Grinn

    Grinn

    graph database and R package for omic data integration

    http://kwanjeeraw.github.io/grinn/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    INDUS is a porject for knowledge acquisition and data integration from heterogeneous distributed data, particularly from bio-informatics databases
    Downloads: 0 This Week
    Last Update:
    See Project
  • Run applications fast and securely in a fully managed environment Icon
    Run applications fast and securely in a fully managed environment

    Cloud Run is a fully-managed compute platform that lets you run your code in a container directly on top of scalable infrastructure.

    Run frontend and backend services, batch jobs, deploy websites and applications, and queue processing workloads without the need to manage infrastructure.
    Try for free
  • 10
    The JasperSoft Business Intelligence Suite provides integrated reporting, analysis, and data integration to make faster, better decisions. * Integrated or stand-alone * Analytic & operational data integration * Embeddable with ERP or CRM
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Developing a "bridge" to facilitate transfer of data between various databases(ith dis-similar schemas). JDBC and XML would be used.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Open Information Integration
    Open Information Integration Tool Suite (Open II) is used by analysts and programmers to accelerate data integration and harmonization across organizations. OpenII has a neutral schema repository for browsing and comparing all sorts of data models. OpenII is built as a Rich Client Platform Application on top of Eclipse 3.x. Developers need to download Eclipse, install the RCP support, the Fatjar plugin and the Delta Pack in one of the 3.x flavors. Release Notes Release Date: Jan 2014 Build Version: 1.0.2666 Notes: 1. Now support for AVRO and HCatalog imports 2. Better support for OWL 3. New OWL and Containing Relationship viewers 4. Added case insensitive option in exact matcher for Harmony
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    A lightweight, browsing-based, 100% Python, federated data integration framework. Users may create custom schemas for disparate sources, query and expand results across sources to find related data; for use in fields such as bioinformatics and datamining
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Steps for Kettle ( data integration by pentaho) to manage ( import and export ) TinyErp information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    This project is to develop web applications and data integration functions to provide information on the collection records, ecology, geographic distribution, and taxonomic concepts of the vascular flora of the region.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    bio2rdf
    The Bio2RDF project aims to transforms silos of life science data into a globally distributed network of linked data for biological knowledge discovery. Bio2RDF creates and provides machine understandable descriptions of biological entities using the RDF/RDFS/OWL Semantic Web languages. Using both syntactic and semantic data integration techniques, Bio2RDF seamlessly integrates diverse biological data and enables powerful new SPARQL-based services across its globally distributed knowledge bases.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Apatar Data Integration/ETL
    Apatar is an open source Extract, Transform, and Load (ETL) project. Modular architecture delivers 1. Visual job designer/mapping 2. Connectivity to all major data sources 3. Flexible Deployment Options (GUI, or server engine with JVM, or embedded).
    Downloads: 2 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next