Open Source OCR Engine
Contexts Optical Compression
PDF to Markdown with vision models
OCRmyPDF adds an OCR text layer to scanned PDF files
Formula recognition based on LaTeX-OCR and ONNXRuntime
Awesome multilingual OCR toolkits based on PaddlePaddle
Ready-to-use OCR with 80+ supported languages
Library for OCR-related tasks powered by Deep Learning
A pure Javascript Multilingual OCR
Free OCR Software: No internet required, easy to use.
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
OCR expert VLM powered by Hunyuan's native multimodal architecture
PDF scientific paper translation with preserved formats
A high-quality tool for convert PDF to Markdown and JSON
A cross-platform software for text translation and recognition
Math OCR model that outputs LaTeX and markdown
Readest is a modern, feature-rich ebook reader
Web application that allows you to perform operations on PDF files
Convert AI papers to GUI
Open Source Document Management System for Digital Archives
A framework to enable multimodal models to operate a computer
Deep Learning API and Server in C++14 support for Caffe, PyTorch
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
A Repo For Document AI