Standard OCR dataset


Use Case

Text Recognition


A standard OCR dataset is a curated collection of images and their corresponding text, designed specifically for training and evaluating OCR systems and algorithms. OCR is a technology that converts printed or handwritten text into machine-readable text, and standardized datasets are crucial for benchmarking the accuracy and robustness of OCR solutions.

About Dataset

We created this dataset to train our own character recognition system.


This dataset contains two folders:

  1. data (newer version, with greater pixel intensity)
  2. data2 (older version)
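As a quick sanity check after downloading, the two folders can be summarized with a short script. This is only a minimal sketch: the description does not say how images are arranged inside data and data2, so the code assumes (hypothetically) that each image sits in a folder named after its character label, and the list of image extensions is likewise an assumption. The helper name count_images_per_label is made up for illustration.

```python
from collections import Counter
from pathlib import Path

# Assumed image extensions; the description does not say which formats are used.
IMAGE_EXTENSIONS = {".png", ".jpg", ".jpeg", ".bmp"}


def count_images_per_label(root):
    """Count image files under `root`, keyed by the name of their parent folder.

    Assumption (not confirmed by the dataset description): each image is
    stored in a folder named after its character label.
    """
    counts = Counter()
    for path in Path(root).rglob("*"):
        # Skip directories and any non-image files such as notes or metadata.
        if path.is_file() and path.suffix.lower() in IMAGE_EXTENSIONS:
            counts[path.parent.name] += 1
    return dict(counts)


if __name__ == "__main__":
    # "data" and "data2" are the two folder names given in the description.
    for folder in ("data", "data2"):
        if Path(folder).is_dir():
            summary = count_images_per_label(folder)
            total = sum(summary.values())
            print(f"{folder}: {total} images in {len(summary)} label folders")
        else:
            print(f"{folder}: not found in the current directory")
```

Running the script from the directory that holds both folders prints one line per folder. If the real layout differs from the assumed one, the per-label counts will simply be grouped by whatever the immediate parent folder is called.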


If you have tried the MNIST handwritten dataset and could not achieve your objective, give this dataset a try.


Globose Technology Solutions Private Limited is at the forefront of improving optical character recognition (OCR) technology. We use insights from the Standard OCR dataset to develop advanced OCR solutions that are highly accurate and efficient. Our aim is to revolutionize document processing across various industries by improving data extraction and document management processes with state-of-the-art OCR technologies.


Quality Data Creation


Guaranteed TAT


ISO 9001:2015, ISO/IEC 27001:2013 Certified


HIPAA Compliance


GDPR Compliance


Compliance and Security

