Open Source Java Data Management Systems


Browse free open source Java Data Management Systems and projects below. Use the toggles on the left to filter open source Java Data Management Systems by OS, license, language, programming language, and project status.

1

GeoTools, the Java GIS toolkit

Toolkit for working with and mapping geospatial data

GeoTools is an open source (LGPL) Java code library which provides standards compliant methods for the manipulation of geospatial data. GeoTools is an Open Source Geospatial Foundation project. The GeoTools library data structures are based on Open Geospatial Consortium (OGC) specifications.

38 Reviews

Downloads: 310 This Week

Last Update: 7 days ago
See Project
2

MyCAT

Active, high-performance open source database middleware

MyCAT is open source software: “a large database cluster” aimed at enterprises. It is an enhanced database that can replace MySQL and supports transactions and ACID. Regarded as an enterprise-grade MySQL cluster, MyCAT can take the place of an expensive Oracle cluster. It is also a new type of database, resembling a SQL Server integrated with in-memory caching, NoSQL technology, and HDFS big data. As a modern enterprise database product, MyCAT combines traditional databases with a new distributed data warehouse. In short, MyCAT is a new kind of database middleware. Its objective is to migrate existing stand-alone databases and applications to the cloud smoothly and at low cost, and to remove the bottlenecks caused by rapid growth in data storage and business scale.

Downloads: 7 This Week

Last Update: 2021-06-28
See Project
3

OrientDB

DBMS supporting graph, document, full-text and geospatial models

OrientDB is an Open Source Multi-Model NoSQL DBMS with support for Native Graphs, Documents, Full-Text search, Reactivity, Geo-Spatial and Object Oriented concepts. It's written in Java and it's amazingly fast. There are no expensive run-time JOINs: connections are managed as persistent pointers between records, so you can traverse thousands of records in no time. It supports schema-less, schema-full and schema-mixed modes. It has a strong security profiling system based on users, roles and predicate security, and supports SQL among its query languages. Thanks to the SQL layer, it's straightforward to use for people skilled in the relational world. OrientDB adheres to the NoSQL movement even though it supports ACID transactions and SQL as a query language, which makes it easy to start using without having to learn much that is new.

Downloads: 7 This Week

Last Update: 2025-12-09
See Project
4

Apache HBase

Get random, realtime read/write access to your Big Data

Use Apache HBase™ when you need random, realtime read/write access to your Big Data. This project's goal is the hosting of very large tables, billions of rows X millions of columns, atop clusters of commodity hardware. Apache HBase is an open-source, distributed, versioned, non-relational database modeled after Google's Bigtable: A Distributed Storage System for Structured Data by Chang et al. Just as Bigtable leverages the distributed data storage provided by the Google File System, Apache HBase provides Bigtable-like capabilities on top of Hadoop and HDFS. Features include a Thrift gateway and a REST-ful Web service that supports XML, Protobuf, and binary data encoding options; support for exporting metrics via the Hadoop metrics subsystem to files or Ganglia, or via JMX; and convenient base classes for backing Hadoop MapReduce jobs with Apache HBase tables.

Downloads: 5 This Week

Last Update: 2025-11-14
See Project
5

Logstash

Centralize, transform and stash your data

Logstash is a server-side data processing pipeline that dynamically ingests data from numerous sources, transforms it, and ships it to your favorite “stash” regardless of format or complexity. It supports and ingests data of all shapes, sizes and sources, dynamically transforms and prepares this data, and transports it to the output of your choice. Logstash is extensible, with over 200 plugins available to let you create and configure your pipeline how you choose.

Downloads: 4 This Week

Last Update: 5 days ago
See Project
6

IoTDB

Apache IoTDB

Apache IoTDB (Database for Internet of Things) is an IoT native database with high performance for data management and analysis, deployable on the edge and the cloud. Due to its light-weight architecture, high performance and rich feature set, together with its deep integration with Apache Hadoop, Spark and Flink, Apache IoTDB can meet the requirements of massive data storage, high-speed data ingestion and complex data analysis in industrial IoT fields. In a factory setting, there may be tens of devices on a LAN. IoTDB can be installed on a local controller server in the factory to receive data from those devices. The local controller server (a normal PC or workstation) running IoTDB can persist data and query it through a SQL-like interface. In addition, with the TsFile-Sync tool, TsFiles on the local controller can be transmitted to a data center equipped with an IoTDB instance in the cloud.

Downloads: 2 This Week

Last Update: 2025-09-08
See Project
7

Open Source Data Quality and Profiling

World's first open source data quality & data preparation project

This project is dedicated to open source data quality and data preparation solutions. Data Quality includes profiling, filtering, governance, similarity check, data enrichment alteration, real time alerting, basket analysis, bubble chart, warehouse validation, single customer view etc. defined by Strategy. This tool is developing a high-performance integrated data management platform which will seamlessly do Data Integration, Data Profiling, Data Quality, Data Preparation, Dummy Data Creation, Meta Data Discovery, Anomaly Discovery, Data Cleansing, Reporting and Analytics. It also has Hadoop (big data) support to move files to/from a Hadoop grid and to create, load and profile Hive tables. This project is also known as "Aggregate Profiler". A RESTful API for this project is being built (beta version) at https://sourceforge.net/projects/restful-api-for-osdq/ and Apache Spark based data quality is being built at https://sourceforge.net/projects/apache-spark-osdq/

8 Reviews

Downloads: 9 This Week

Last Update: 2021-01-20
See Project
8

DataCleaner

Data quality analysis, profiling, cleansing, duplicate detection +more

DataCleaner is a data quality analysis application and a solution platform for DQ solutions. Its core is a strong data profiling engine, which is extensible and thereby adds data cleansing, transformations, enrichment, deduplication, matching and merging. Website: http://datacleaner.github.io

3 Reviews

Downloads: 10 This Week

Last Update: 2019-02-12
See Project
9

HugeGraph

A graph database that supports more than 100+ billion data

HugeGraph is a convenient, efficient, and adaptable graph database compatible with the Apache TinkerPop3 framework and the Gremlin query language. HugeGraph supports fast import for graphs with more than 10 billion vertices and edges, offers millisecond-level OLTP query capability, and can be integrated into big data platforms like Hadoop or Spark for OLAP analysis. The main scenarios of HugeGraph include correlation search, fraud detection, and knowledge graphs. It supports not only the Gremlin graph query language and a RESTful API but also provides commonly used graph algorithm APIs. To help users easily implement various queries and analyses, HugeGraph has a full range of accessory tools, with support for distributed storage, data replication, horizontal scaling, and many built-in storage engine backends.

Downloads: 1 This Week

Last Update: 2025-11-28
See Project
10

ksqlDB

The database purpose-built for stream processing applications

Build applications that respond immediately to events. Craft materialized views over streams. Receive real-time push updates, or pull current state on demand. Seamlessly leverage your existing Apache Kafka® infrastructure to deploy stream-processing workloads and bring powerful new capabilities to your applications. Use a familiar, lightweight syntax to pack a powerful punch. Capture, process, and serve queries using only SQL. No other languages or services are required. ksqlDB enables you to build event streaming applications leveraging your familiarity with relational databases. Three categories are foundational to building an application: collections, stream processing, and queries. Streams are immutable, append-only sequences of events. They're useful for representing a series of historical facts. Tables are mutable collections of events. They let you represent the latest version of each value per key.

Downloads: 1 This Week

Last Update: 2021-12-21
See Project
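
The stream/table distinction in the ksqlDB entry above can be illustrated with a minimal sketch in plain Java. This is not ksqlDB's API or SQL syntax: the `Event` type, the `appendToStream` and `materialize` helpers, and the sample keys are all hypothetical names chosen for illustration. The sketch only shows the idea that a stream keeps every event, while a table keeps the latest value per key.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class StreamVsTable {
    // Hypothetical event type for illustration: a key and a value.
    static final class Event {
        final String key;
        final int value;

        Event(String key, int value) {
            this.key = key;
            this.value = value;
        }
    }

    // A "stream": appending returns a new read-only list; past events are never changed.
    static List<Event> appendToStream(List<Event> stream, Event e) {
        List<Event> next = new ArrayList<>(stream);
        next.add(e);
        return Collections.unmodifiableList(next);
    }

    // A "table": fold the stream so that only the latest value per key remains.
    static Map<String, Integer> materialize(List<Event> stream) {
        Map<String, Integer> table = new LinkedHashMap<>();
        for (Event e : stream) {
            table.put(e.key, e.value); // a later event overwrites an earlier one
        }
        return table;
    }

    public static void main(String[] args) {
        List<Event> stream = new ArrayList<>();
        stream = appendToStream(stream, new Event("sensor-a", 10));
        stream = appendToStream(stream, new Event("sensor-b", 7));
        stream = appendToStream(stream, new Event("sensor-a", 12));

        System.out.println(stream.size());       // 3: every event is kept
        System.out.println(materialize(stream)); // {sensor-a=12, sensor-b=7}
    }
}
```

In ksqlDB itself this folding is expressed in SQL over Kafka topics rather than in application code; the sketch is only meant to make the two collection types concrete.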
11

MailArchiva

MailArchiva is a powerful, full featured email archiving (email archiver) and compliance solution for mail systems such as Microsoft Exchange. It stores all incoming, outgoing and internal emails for long term storage. A web based user interface is avail

11 Reviews

Downloads: 4 This Week

Last Update: 2014-09-18
See Project
12

ADaMSoft

Open Source and data mining software

ADaMSoft is a free and Open Source Data Mining software developed in Java. It contains data management methods and it can create ready to use reports. It can read data from several sources and it can write the results in different formats.

1 Review

Downloads: 7 This Week

Last Update: 2021-11-30
See Project
13

DB Browser

DBBrowser is an open source (GPL license), cross-platform tool which can be used to view the contents of a database. It works with Oracle and MySQL. The user can view, modify, delete records without writing SQL.

Downloads: 9 This Week

Last Update: 2013-04-15
See Project
14

SnapLogic

SnapLogic is an Open Source Data Integration framework that combines the power of state-of-the-art dynamic programming languages with standard Web interfaces to solve today's most pressing problems in data integration.

Downloads: 8 This Week

Last Update: 2013-04-16
See Project
15

JRobin

JRobin is a 100% pure Java alternative to RRDTool, with almost exactly the same specifications. If you provide the same data to RRDTool and JRobin, you will get exactly the same results and graphs. All standard RRDTool operations are supported.

Downloads: 5 This Week

Last Update: 2021-03-10
See Project
16

Adele

Adhoc Data Exploration - Live & Easy

Adele was developed to simplify daily work with data. Use it as a Swiss Army knife to fill the gap between spreadsheet applications like MS Excel and enterprise servers like SAP ERP. It is not meant to replace specialized tools like Rapid Miner, KNIME or similar; Adele is designed for business people who analyse their data in spreadsheet applications. It packages many technical concepts in an easier form, for example realtime OLAP, transformations, charts and analysis tools. Connectors (e.g. JDBC, SAP ABAP, OData) can be used to pre-analyse the data and extract it without saving the data as text files. A plugin concept for enhancements is available. Enjoy! It's free for commercial use too. Adele runs without installation from a USB stick on Windows, Linux and MacOSX. Latest changes: data science tools (V1, IQR); export to remote and desktop databases (MySQL, SQLite, MS Access); internet features for emails and domains.

2 Reviews

Downloads: 1 This Week

Last Update: 2017-04-29
See Project
17

Datacleaning Open Source

A group of subprojects for Data Cleaning projects, mainly as a step of a Data Mining project. Visit www.datacleaningopensource.com to review our current applications or to add yours. NOTE: PROGRAMMING SKILLS ARE REQUIRED.

Downloads: 2 This Week

Last Update: 2015-04-28
See Project
18

Orbit

ORBIT : Operating Business Intelligence Tool

Making data accessible through centralized database access. ORBIT is a business intelligence tool designed to make data accessible to a broad audience within your company by centralizing access to databases. With this application, users can easily create reports, perform interactive analyses, and extract insights from raw data. The application simplifies data handling by providing easy-to-use features for non-technical users while maintaining flexibility for advanced queries.

Downloads: 2 This Week

Last Update: 2025-11-02
See Project
19

Daffodil Replicator

Daffodil Replicator is a powerful Open Source Java tool for data integration, data migration and data protection in real time. It allows bi-directional data replication and synchronization between homogeneous / heterogeneous databases including Oracle, M

1 Review

Downloads: 1 This Week

Last Update: 2019-06-12
See Project
20

CrNiCKL

CrNiCKL (chronicle) is a Java database for time series

CrNiCKL (pronounced "chronicle") is a data manager written in Java for handling large sets of heterogeneous time series. A simple schema system lets you configure value types and time domains. CrNiCKL runs on top of SQL or NoSQL databases. Drivers for JDBC and MongoDB are available.

Downloads: 1 This Week

Last Update: 2017-10-26
See Project
21

PaloKettlePentahoDataIntegrationPDI

PaloKettlePlugin is for Pentaho Data Integration, aka Kettle. It's a Cell Input and Output Step for Palo MOLAP. The first code was developed by mybiq/3A-Strategy; the PDI-3 version was developed by Stratebi. Now by 3A-Strategy and Litebi for PDI

Downloads: 1 This Week

Last Update: 2015-08-03
See Project
22

Aestel

Applications for data management

"Information is data in action", and, consequently, having good quality data is essential. The AESTEL package contains two highly configurable applications for data management: A data loader and a reporting application, i.e. DataLoader and AEREA, respectively. The data loader application applies user-defined instructions to validate, process and load data. The reporting application provides a query builder and spreadsheet template designer. Both applications work with any relational data model. (Postgres and Oracle have been tested). The two applications have been initially developed for small molecule drug discovery research. However, they can be extended for use in other data domains.

Downloads: 0 This Week

Last Update: 2023-06-04
See Project
23

AlphaMiner

AlphaMiner is an open source data mining platform that offers versatile data mining model building and data cleansing features with a user-friendly workflow interface.

Downloads: 0 This Week

Last Update: 2015-12-22
See Project
24

Big Sack

Big Sack: A lightweight Java Key/Value store with undo and disk cache.

Big Sack is a Java persistence mechanism that allows storage of key/value pairs following the popular Big Data paradigms. It's a very simple and straightforward way to bridge the gap between in-memory data structures and long-term storage. It has the convenience of the Java SDK TreeMap and TreeSet classes and is used in the same easy way, but it includes rollback through undo logging to checkpoint data, so data does not wind up in an unknown state regardless of failures. Data storage in the exabyte range is possible using filesystem and/or memory-mapped IO. Three levels of configurable write-through caching at different granularities ensure performance.

Downloads: 0 This Week

Last Update: 2013-12-21
See Project
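
To make the "TreeMap with rollback through undo logging" idea from the Big Sack entry above concrete, here is a minimal in-memory sketch. It is not Big Sack's actual API and does no disk persistence or caching: the `UndoableMap` class and its `commit` and `rollback` methods are hypothetical names used only to show how an undo log can restore a sorted map to its last checkpoint.

```java
import java.util.ArrayDeque;
import java.util.Deque;
import java.util.TreeMap;

public class UndoableMap {
    private final TreeMap<String, String> data = new TreeMap<>();

    // Each undo entry holds {key, previous value}; a null previous value means
    // the key was absent. This sketch assumes stored values are never null.
    private final Deque<String[]> undoLog = new ArrayDeque<>();

    public void put(String key, String value) {
        undoLog.push(new String[] { key, data.get(key) }); // record the old state first
        data.put(key, value);
    }

    public String get(String key) {
        return data.get(key);
    }

    public int size() {
        return data.size();
    }

    // Checkpoint: accept all writes so far, so they can no longer be rolled back.
    public void commit() {
        undoLog.clear();
    }

    // Rollback: undo writes newest-first until the last checkpoint is restored.
    public void rollback() {
        while (!undoLog.isEmpty()) {
            String[] entry = undoLog.pop();
            if (entry[1] == null) {
                data.remove(entry[0]);
            } else {
                data.put(entry[0], entry[1]);
            }
        }
    }

    public static void main(String[] args) {
        UndoableMap map = new UndoableMap();
        map.put("a", "1");
        map.commit();      // checkpoint: a=1
        map.put("a", "2"); // overwrite after the checkpoint
        map.put("b", "3"); // new key after the checkpoint
        map.rollback();    // back to the checkpoint state
        System.out.println(map.get("a")); // 1
        System.out.println(map.get("b")); // null
    }
}
```

A persistent store would also have to write the undo log durably before applying each change; that part is deliberately left out here.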
25

BioDataServer

The BioDataServer is a database integration system. It implements a mediator-wrapper architecture and offers a SQL interface. The data integration is based on a user-defined integrated schema and adapters that wrap any kind of data source.

Downloads: 0 This Week

Last Update: 2013-03-22
See Project
