Yandex Vision
Yandex Vision OCR recognizes text in an image and outputs it along with automatic punctuation. The service supports and automatically identifies more than 50 languages. Extract standard fields and recognize text in templates and documents, e.g., passports, driver’s licenses, vehicle registration certificates, and license plates. With support for Russian and English, as well as combinations of handwritten and printed texts. The service scans the table structure and outputs text in row and column coordinates. Optical character recognition (OCR), document recognition, and license plate number recognition. Yandex Vision OCR allows you to work with JPEG, PNG, and PDF formats. File sizes should be no larger than 20 MB with no more than 300 pages per file. The service can scan images and find passports from 20 countries, driver’s licenses, vehicle registration documents, and license plates.
Learn more
Google Cloud Natural Language API
Get insightful text analysis with machine learning that extracts, analyzes, and stores text. Train high-quality machine learning custom models without a single line of code with AutoML. Apply natural language understanding (NLU) to apps with Natural Language API. Use entity analysis to find and label fields within a document, including emails, chat, and social media, and then sentiment analysis to understand customer opinions to find actionable product and UX insights. Natural Language with speech-to-text API extracts insights from audio. Vision API adds optical character recognition (OCR) for scanned docs. Translation API understands sentiments in multiple languages. Use custom entity extraction to identify domain-specific entities within documents, many of which don’t appear in standard language models, without having to spend time or money on manual analysis. Train your own high-quality machine learning custom models to classify, extract, and detect sentiment.
Learn more
Tencent Cloud OCR
Tencent Cloud Optical Character Recognition (OCR) can automatically locate and recognize text in images. It features robustness and an average accuracy rate of above 95% for printed text and 90% for handwritten text. Developed independently by the Tencent YouTu Lab, OCR covers all core algorithms for identity document analysis and recognition. It supports both landscape and portrait modes, and can be applied in scenarios with perspective distortion, irregular illumination, partial occlusion and more. OCR not only provides developers with a full range of APIs that can be called directly, but also SDKs that are highly compatible and easy to use.It can recognize Chinese text, English text, Chinese/English text, numbers, and special symbols with higher accuracy. It can recognize complex text at higher accuracy and recall rates, making it suitable for scenarios with a large amount of text, long numeric strings, small font, blurry or skewed text, etc.
Learn more
PrecisionOCR
PrecisionOCR is a ready-to-use, secure, HIPAA-compliant, cloud-based platform for extracting medical meaning from unstructured documents using Optical Character Recognition (OCR).
PrecisionOCR uses custom Optical Character Recognition and AI algorithms to convert PDFs/JPEGs/PNGs into structured, searchable documents. Organizations can work with our team to build OCR report extractors which look for specific types of information to extract or highlight to reduce the noise that comes from extracting all of the data within a document.
Natural language processing (NLP) and machine learning (ML) power the semi-automated and automated transformation of source material such as pdfs or images into structured data records that integrate seamlessly with EMR data using HL7s FHIR standards. Data can be automatically stored along side patient records.
Our OCR document classification is also available along with multiple ways to integrate including API and CLI support.
Learn more