IndicPhotoOCR / README.md
anikde's picture
updated paper deatails
45fac0f verified

A newer version of the Gradio SDK is available: 6.2.0

Upgrade
metadata
title: IndicPhotoOCR
colorFrom: purple
colorTo: pink
sdk: gradio
app_file: app.py
pinned: true
CPU: cpu-basic
suggested_storage: small
sdk_version: 4.44.1

IndicPhotoOCR Logo

A Comprehensive Toolkit for Scene Text Recognition in Indian Languages

Open Source Visitor Count GitHub Repo stars GitHub forks Hugging Face Open In Colab

Documentation


IndicPhotoOCR is a scene text recognition toolkit designed for detecting, identifying, and recognizing text across Indian languages, including Assamese, Bengali, Gujarati, Hindi, Kannada, Malayalam, Marathi, Odia, Punjabi, Tamil, Telugu, and English, with support for Urdu and Meitei in the pipeline. It is built to handle the unique scripts and complex structures of Indian languages, offering robust detection and recognition capabilities. The package can be installed with just few lines of code, and a straightforward wrapper function makes it easy to use. The details of the model/Space are described in the accompanying paper on arXiv: Bharat Scene Text: A Novel Comprehensive Dataset and Benchmark for Indian Language Scene Text Understanding