Extract Data From Files
Use our OCR API to extract the text in any file. We accept images, PDFs, Microsoft Office documents, and more. You can send multi-page documents and files with multiple documents too.
The following file formats are supported:
- Images: JPEG, PNG, GIF, SVG, HEIC, WEBP, TIFF
- Microsoft Office: DOC, DOCX, XLS, XLS, PPT, PPTX
- Open Office: ODS, ODT, ODP
- PDF: Both digital and image-only files are supported. PDFs may be single or multi-page and may contain multiple document types (e.g., 3 ID pages plus 1 invoice).
- ZIP: May only contain the supported file formats
- MSG: Outlook message files and the contents within (e.g., email's PDFs attachments)
- Audio: MP3, OGG, FLAC, WAV
- Video: MOV, MP4, AVI, WMV, M4V
- Please contact us if you need another file format.
The benefits of automated
files document processing
Higher customer satisfaction
Quick and easy processing
Handle muilt-page processing
Learn how innovative companies use our AI
Our customers save thousands of employee hours per month using our AI to process even the most complex documents in seconds with 99.7% accuracy.
READ CASE STUDIESOne product for all your data extraction needs
Pick a category to learn how we can automate your
Our AI service can scale infinitely in the cloud. No hardware
- IDs & Driver licenses
- Vehicle registrations
- Vehicle insurances
- Worldwide IDs
- Passports
- Travel visas
- Receipts
- Questions & Answers AI
- Invoices & Purchase order documents
- Checks
- ACORD Forms
- Summary of benefits & coverage
- License plates
- Shipping containers
- Files
- Forms
- Handwriting
- Mobile SDK
- Signatures
- Faces
- Human-In-the-Loop
- Checkboxes & Radio Buttons
- Multimedia
- Document Generation
- Shape Verification
- Photo Captioning
- Custom Taxonomy
- Text-to-Speech AI
- Speech-to-Text AI
- IBAN Detection
- IRS Tax Forms
- Address Geocoding
- Classification
- CMS 1500
- SMART Health Card
- Safety datasheets
- OSHA Forms
- Bill of Lading
- Customs declaration forms
- Semantic
- Documents
- No-code
- Digital Signatures
- Tables
- Checkboxes
- Entities
- Barcodes
- Watermarks
- Nixie labels
- Bank Cards
- US Green Cards
- US Social Security Cards
- Remittance
- Cloud Deployment
- Segmentation AI
- Image Quality AI
- Redaction AI
- Real ID and Enhanced ID