Hindi Character Recognition Dataset
- Datasets: Hindi Character Recognition
- File: Hindi Character Recognition
- Use Case: Hindi Character Recognition
Description
Explore our Hindi Character Recognition dataset, featuring 92,000 images of handwritten Devanagari script.
The dataset targets the classification of handwritten Devanagari, a writing system used primarily in India and Nepal. It covers 36 characters of the script and the 10 numerical digits, for 46 classes in total. Unlike many Western scripts, Devanagari has no capitalization. Its characters hang from a distinctive horizontal bar, known as the ‘shirorekha,’ which runs along the top of the script and joins the characters of each word.
Dataset Details
- Training Data: 78,200 images (1,700 per character)
- Test Data: 13,800 images (300 per character)
- Total Images: 92,000
- Image Specs: Each image is 32×32 pixels, with 3 color channels.
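The figures above can be cross-checked with a few lines of Python. This is a minimal sketch, not an official loader: the constant and function names are illustrative, and the 46-class count assumes that each of the 36 characters and 10 digits is its own class.

```python
# Figures taken from the dataset details; all names here are illustrative.
CHARACTER_CLASSES = 36
DIGIT_CLASSES = 10
TRAIN_PER_CLASS = 1_700
TEST_PER_CLASS = 300
IMAGE_SHAPE = (32, 32, 3)  # height, width, color channels


def dataset_summary():
    """Derive the class count, split sizes, and per-image value count."""
    num_classes = CHARACTER_CLASSES + DIGIT_CLASSES
    train_images = num_classes * TRAIN_PER_CLASS
    test_images = num_classes * TEST_PER_CLASS
    total_images = train_images + test_images
    height, width, channels = IMAGE_SHAPE
    return {
        "num_classes": num_classes,
        "train_images": train_images,
        "test_images": test_images,
        "total_images": total_images,
        # Length of one image when flattened into a feature vector.
        "values_per_image": height * width * channels,
        "test_fraction": test_images / total_images,
    }


if __name__ == "__main__":
    for key, value in dataset_summary().items():
        print(f"{key}: {value}")
```

The derived totals should match the listed 78,200 training images, 13,800 test images, and 92,000 images overall. A mismatch after downloading would suggest missing or duplicated files.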
Context
The main challenge is telling similar-looking characters apart, especially given how much handwritten samples vary. The dataset is intended to help recognition models cope with these difficulties, including sloppy or barely legible writing.