Calculus Math Handwriting Recognition Dataset
The Aida Calculus Math Handwriting Recognition Dataset consists of 100,000 images of handwritten math expressions. Each expression falls within the topic of limits and is written with a dark utensil on plain paper. Each image is accompanied by its ground-truth math expression in LaTeX, as well as a bounding box and a pixel-level mask for each character. All images are synthetically generated, with the math content, handwriting styles, and other features varied to cover the range of possible student handwriting and photo quality.

Our goal in releasing this dataset is to provide the data science and machine learning community with resources for undertaking the challenging computer vision task of extracting math expressions from images. The data offers something for all levels, from beginners building simple character recognition models to experts who wish to predict pixel-by-pixel masks and decode the complex structure of math expressions.
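As a starting point for the simpler character recognition task, the sketch below shows one way per-character bounding boxes could be used to cut individual character crops out of an image. It does not reflect the dataset's actual file layout or field names: the `CharBox` and `Sample` classes, the `(xmin, ymin, xmax, ymax)` box convention, and the tiny toy image are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple

# NOTE: hypothetical schema for illustration only; the real annotation
# format of the dataset may use different names and conventions.


@dataclass
class CharBox:
    """One character annotation: its LaTeX token and a pixel bounding box."""
    token: str
    xmin: int
    ymin: int
    xmax: int  # exclusive
    ymax: int  # exclusive


@dataclass
class Sample:
    """One image with its ground-truth LaTeX and per-character boxes."""
    latex: str
    image: List[List[int]]  # grayscale rows, 0 = dark ink, 255 = paper
    chars: List[CharBox]


def crop(image: List[List[int]], box: CharBox) -> List[List[int]]:
    """Return the sub-image covered by the bounding box."""
    return [row[box.xmin:box.xmax] for row in image[box.ymin:box.ymax]]


def char_crops(sample: Sample) -> List[Tuple[str, List[List[int]]]]:
    """Pair each character token with its cropped pixels."""
    return [(c.token, crop(sample.image, c)) for c in sample.chars]


def make_toy_sample() -> Sample:
    """Build a 4x8 toy image with two dark 'characters' on white paper."""
    image = [[255] * 8 for _ in range(4)]
    for y in range(1, 3):          # a 2x2 blob standing in for "x"
        for x in range(1, 3):
            image[y][x] = 0
    for y in range(4):             # a vertical stroke standing in for "1"
        image[y][5] = 0
    chars = [CharBox("x", 1, 1, 3, 3), CharBox("1", 5, 0, 6, 4)]
    return Sample(latex="x 1", image=image, chars=chars)


if __name__ == "__main__":
    for token, pixels in char_crops(make_toy_sample()):
        print(token, len(pixels), "rows x", len(pixels[0]), "cols")
```

Crops produced this way, paired with their tokens, could serve as training examples for a simple per-character classifier. Predicting masks or the full LaTeX structure would need the complete image and annotations instead.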