Enterprise job scheduling middleware with distributed computing
Get random, realtime read/write access to your Big Data
Koog is the official Kotlin framework for building AI agents
DTail is a distributed DevOps tool for tailing, grepping, catting logs
Python module that helps you build complex pipelines of batch jobs
Read Cobol data files in Java
Parallel tool to remove duplicate DNA reads
Data parallel and stream parallel skeletons implemented in erlang.
The Spatial Framework for Hadoop allows developers
The Esri Geometry API for Java enables developers to write apps
A Microservice Toolkit from The New York Times
A tool for doing record analysis and transformation
MapReduce-based tool to remove duplicate DNA reads
Hadoop spliced read aligner for RNA-seq data
Algorithm on Spark for aligning multiple similar DNA/RNA sequences
A Scala API for Cascading
Distributed "massively parallel" SQL query engine
This is a data analytics project for RSS feeds using hadoop MapReduce
Streaming MapReduce with Scalding and Storm
A OWL reasoning framework for the analysis of big biomedical data
Data Mining and Machine Learning Algorithms based on MapReduce
Hadoop mapreduce maven plugin
Distributed RDF Processing over Hadoop