Recognition and resolution of numbers, units, date/time, etc.
Open Source OCR Engine
Awesome multilingual OCR toolkits based on PaddlePaddle
Speech-to-text, text-to-speech, and speaker recognition
Robust Speech Recognition via Large-Scale Weak Supervision
Contexts Optical Compression
OCR software, free and offline
Speech recognition module for Python
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
A pure Javascript Multilingual OCR
A cross-platform software for text translation and recognition
Underthesea - Vietnamese NLP Toolkit
Open-Source Python3 tool for recognizing layouts, tables, and math
Library for OCR-related tasks powered by Deep Learning
A free, open source, and extensible speech-to-text application
Cross-platform AI language practice app
Open source annotation tool for machine learning practitioners
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Go efficient multilingual NLP and text segmentation
OCRmyPDF adds an OCR text layer to scanned PDF files
Audio foundation model excelling in audio understanding
A full spaCy pipeline and models for scientific/biomedical documents
Toolkit for conversational AI
Readest is a modern, feature-rich ebook reader
Han Language Processing