๐Ÿ” Advanced Multi-Language OCR System

Powered by Pix2Text, Tesseract, and FastAPI

Extract text from PDFs containing English, Bangla, and Mathematical expressions with high accuracy. Evaluate OCR performance with comprehensive metrics and detailed analysis.

Upload a PDF and extract text using advanced multi-language OCR

Features:

  • ๐ŸŒ Multi-language support: English, Bangla (Bengali), and Mathematical expressions
  • ๐Ÿงฎ Advanced Math Recognition: Pix2Text integration for LaTeX and mathematical formulas
  • ๐Ÿ“Š Detailed Analysis: Character-level classification and confidence scores
  • ๐Ÿ’พ Download Results: Get extracted text and detailed JSON analysis

๐Ÿ”— Links: GitHub Repository | Documentation

โšก Powered by: Pix2Text โ€ข Tesseract OCR โ€ข OpenCV โ€ข FastAPI โ€ข Gradio