Back

Aida

Calculus Math Handwriting Recognition Dataset

Aida

The Aida Calculus Math Handwriting Recognition Dataset consists of 100,000 images of handwritten math expressions. Each expression is within the topic of limits and written with a dark utensil on plain paper. Each image is accompanied by ground truth math expression in LaTeX as well as bounding boxes and pixel-level masks per character. All images are synthetically generated with a variety of math, styles and other features designed to cover the range of possible student handwriting and photo qualities.Our goal in releasing this dataset is to provide the data science and machine learning community with resources for undertaking the challenging computer vision task of extracting math expressions from images. The data offers something to all levels, from beginners building simple character recognition models to experts who wish to predict pixel-by-pixel masks and decode the complex structure of math expressions.

View this Dataset
->
Aida
View author website
Task
Text Detection
Annotation Types
Bounding Boxes
100000
Items
10
Classes
100000
Labels
Models using this dataset
Last updated on 
January 20, 2022
Licensed under 
Custom