Hindi Character Recognition Dataset

Hindi Character Recognition Dataset

Datasets

Hindi Character Recognition

File

Hindi Character Recognition

Use Case

Hindi Character Recognition

Description

Explore our Hindi Character Recognition featuring 92,000 images of handwritten Devanagari script.

Hindi Character Recognition Dataset

Description:

The dataset is focused on the classification of handwritten Devanagari script, a significant writing system used primarily in India and Nepal. This script comprises 36 characters representing the sounds of the Sanskrit language and 10 numerical digits. Unlike many Western scripts, Devanagari does not employ capitalization, providing a unique challenge for recognition algorithms. Each character in the Devanagari script is connected by a distinctive horizontal bar, known as the ‘shirorekha,’ which runs along the top of the script, unifying the characters into cohesive words and sentences.

Download Dataset

Dataset Details

  • Training Data: 78,200 images (1,700 per character)
  • Test Data: 13,800 images (300 per character)
  • Total Images: 92,000
  • Image Specs: Each image is 32×32 pixels, with 3 color channels.

Context

The challenge lies in differentiating similar-looking characters, especially given the variability of handwritten samples. The dataset aims to address the complexities in recognizing Devanagari script, with a focus on overcoming issues related to sloppy or illegible writing.

Contact Us

Please enable JavaScript in your browser to complete this form.
Technology

Quality Data Creation

Technology

Guaranteed TAT

Technology

ISO 9001:2015, ISO/IEC 27001:2013 Certified

Technology

HIPAA Compliance

Technology

GDPR Compliance

Technology

Compliance and Security

Let's Discuss your Data collection Requirement With Us

To get a detailed estimation of requirements please reach us.

Scroll to Top