6 MB Tesseract (with English training data) to fit inside AWS Lambda
-
Updated
Jun 13, 2024 - Shell
6 MB Tesseract (with English training data) to fit inside AWS Lambda
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
Android document document scanning app
🌟 This repository houses a collection of image classification models for various purposes, including vehicle, object, animal, and flower classification. Each classifier is built using deep learning techniques and pre-trained models to accurately identify and categorize images based on their respective classes.
Web interface for recognizing text, proofreading OCR, and creating fully-digitized documents.
Docker Image with latest Tesseract OCR Version 5.x.x built from sources
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Tesseract Open Source OCR Engine (main repository)
Repository for the MA Digital Text Analysis thesis.
A small lightweight HTTP server that converts photos, images and scanned documents to text using optical character recognition by utilizing the power of Google Tesseract.
Textocry - Copy text from Images (chrome extension)
A tool to perform OCR on images and return them as voice and text outputs.
A python based helmet detection system
CCExtractor - Official version maintained by the core team
Python Project pillow, tesseract, and opencv (Coursera) (University of Michigan)
Python 3 Programming Coursera (University of Michigan)
Add a description, image, and links to the tesseract topic page so that developers can more easily learn about it.
To associate your repository with the tesseract topic, visit your repo's landing page and select "manage topics."