OCR API powered by Tesseract. Extract text from receipts, invoices, scanned PDFs, business cards, and screenshots in 20 languages.
Request:

POST /extract
Content-Type: application/json

{
  "url": "https://example.com/receipt.jpg",
  "language": "eng"
}

Response:

{
  "status": "ok",
  "text": "RECEIPT\nDate: 2024-01-15\nItem 1 $12.50\nItem 2 $8.99\nTotal: $21.49",
  "confidence": 94.2,
  "word_count": 12,
  "language": "eng",
  "words": [...]
}
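The request and response above could be driven from Python roughly as follows. This is a minimal sketch using only the standard library, not an official client. The base URL `https://api.example.com` and the helper names are hypothetical, since the page gives only the `/extract` path, and authentication is left out because the page does not describe how a key is sent.

```python
import json
import urllib.request

# Hypothetical base URL: the page documents only the /extract path.
BASE_URL = "https://api.example.com"


def build_extract_request(image_url, language="eng", base_url=BASE_URL):
    """Build (but do not send) a POST /extract request with a JSON body."""
    body = json.dumps({"url": image_url, "language": language}).encode("utf-8")
    return urllib.request.Request(
        base_url + "/extract",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def parse_extract_response(raw):
    """Decode a response body and return its text, raising if status is not "ok"."""
    payload = json.loads(raw)
    if payload.get("status") != "ok":
        raise ValueError(f"extraction failed: {payload!r}")
    return payload["text"]


def extract_text(image_url, language="eng", timeout=10.0):
    """Send the request and return the extracted text.

    The timeout is an arbitrary choice that leaves headroom for larger images.
    """
    request = build_extract_request(image_url, language)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return parse_extract_response(response.read())
```

Calling `extract_text("https://example.com/receipt.jpg")` would perform the network round trip. The two helper functions can be exercised offline.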
Supported languages include English, Chinese (Simplified and Traditional), Japanese, Korean, Arabic, Hindi, Russian, German, French, Spanish, and more.
Every extraction includes a confidence score (0–100) so you know how reliable the result is.
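One way to act on the confidence score is to route low-scoring results to manual review. The sketch below assumes the response has already been parsed into a dictionary. The function name and the cutoff of 80 are illustrative assumptions, not values recommended by the API.

```python
def is_reliable(result, min_confidence=80.0):
    """Return True when the extraction succeeded and meets the threshold.

    min_confidence is an arbitrary illustrative cutoff on the 0-100 scale.
    """
    return (
        result.get("status") == "ok"
        and result.get("confidence", 0.0) >= min_confidence
    )


# Values taken from the sample response on this page.
sample = {"status": "ok", "confidence": 94.2, "word_count": 12}
needs_review = not is_reliable(sample)
```

A stricter cutoff may suit documents where a single wrong digit matters, such as invoice totals.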
Optional word-level bounding box coordinates for layout analysis and document understanding.
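As an illustration of layout analysis, word boxes can be grouped back into text lines. The page does not show the shape of an entry in `words`, so the schema used here (a `text` string plus a `bbox` with `x` and `y` pixel coordinates of the top-left corner) is purely an assumption, as are the function name and the tolerance value.

```python
def group_words_into_lines(words, y_tolerance=10):
    """Group word boxes into text lines, top to bottom and left to right.

    Assumes each word looks like {"text": str, "bbox": {"x": int, "y": int}};
    this schema is hypothetical and may differ from the real response.
    """
    lines = []  # each entry is a list of words sharing roughly the same top edge
    for word in sorted(words, key=lambda w: (w["bbox"]["y"], w["bbox"]["x"])):
        # Compare against the first word of the current line to decide
        # whether this word continues the line or starts a new one.
        if lines and abs(word["bbox"]["y"] - lines[-1][0]["bbox"]["y"]) <= y_tolerance:
            lines[-1].append(word)
        else:
            lines.append([word])
    return [
        " ".join(w["text"] for w in sorted(line, key=lambda w: w["bbox"]["x"]))
        for line in lines
    ]


example_words = [
    {"text": "$21.49", "bbox": {"x": 80, "y": 98}},
    {"text": "RECEIPT", "bbox": {"x": 10, "y": 5}},
    {"text": "Total:", "bbox": {"x": 10, "y": 100}},
]
lines = group_words_into_lines(example_words)
```

The tolerance absorbs small vertical jitter between words on the same line. A value tied to the median word height would likely be more robust than a fixed pixel count.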
Pass an image URL or a base64-encoded image. Supported formats are JPEG, PNG, GIF, and WebP.
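A sketch of preparing either kind of input in Python is shown below. The `url` field comes from the sample request, but the page does not name the field for base64 data, so `image` is a hypothetical placeholder. The format check uses the well-known file signatures of the four listed formats, and the function names are illustrative.

```python
import base64


def detect_format(data):
    """Identify JPEG, PNG, GIF, or WebP from the leading bytes, else None."""
    if data.startswith(b"\xff\xd8\xff"):
        return "jpeg"
    if data.startswith(b"\x89PNG\r\n\x1a\n"):
        return "png"
    if data[:6] in (b"GIF87a", b"GIF89a"):
        return "gif"
    if data[:4] == b"RIFF" and data[8:12] == b"WEBP":
        return "webp"
    return None


def build_payload(source, language="eng"):
    """Return a request body for an image URL (str) or raw image bytes."""
    if isinstance(source, bytes):
        if detect_format(source) is None:
            raise ValueError("unsupported image format")
        # "image" is a hypothetical field name; the page does not specify it.
        encoded = base64.b64encode(source).decode("ascii")
        return {"image": encoded, "language": language}
    return {"url": source, "language": language}
```

Checking the signature locally avoids spending a request on a file the service would reject.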
Typical inputs include receipts, invoices, IDs, business cards, scanned PDFs, screenshots, and handwritten notes.
Average response under 2 seconds for standard documents. Larger images may take longer.
Free plan available — 500 requests per month, no credit card required.