
B-MOD

Brno Mobile OCR Dataset

Brno Mobile OCR Dataset (B-MOD) is a collection of 2 113 templates (pages of scientific papers). The templates were captured with 23 different mobile devices under unrestricted conditions, so the resulting photographs vary in blur, illumination, etc. In total, the dataset contains 19 725 photographs, from which more than 500k lines with precise transcriptions were extracted. The templates were divided into three subsets (training, validation, and testing). Captured photographs and cropped lines follow this division, so photographs of the same template and the lines extracted from them are in the same subset.
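
The template-level division described above can be illustrated with a minimal sketch. It is not the authors' actual procedure: the function name `split_by_template`, the `photo_to_template` mapping, the file names, and the 80/10/10 fractions are all assumptions made for illustration, since the description does not state the split ratios. The point it shows is that subsets are assigned to templates first and each photograph inherits the subset of its template.

```python
import random


def split_by_template(photo_to_template, fractions=(0.8, 0.1, 0.1), seed=0):
    """Assign every photo to a subset through its template.

    photo_to_template: dict mapping a photo name to its template ID
    (hypothetical structure). The fractions are illustrative only and
    are not taken from the dataset description.
    """
    names = ("training", "validation", "testing")

    # Shuffle the unique templates, not the photos, so that a template
    # can never end up in more than one subset.
    templates = sorted(set(photo_to_template.values()))
    random.Random(seed).shuffle(templates)

    n_train = int(len(templates) * fractions[0])
    n_val = int(len(templates) * fractions[1])

    template_subset = {}
    for i, template in enumerate(templates):
        if i < n_train:
            template_subset[template] = names[0]
        elif i < n_train + n_val:
            template_subset[template] = names[1]
        else:
            template_subset[template] = names[2]

    # Each photo inherits the subset of the template it depicts.
    return {photo: template_subset[t] for photo, t in photo_to_template.items()}
```

Cropped lines would be handled the same way, by looking up the subset of the template they were extracted from. Splitting at the photograph level instead would let the same page text appear in both training and testing, which is what the template-level division avoids.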

FIT BUT
Task: Text Detection
Annotation Types: Bounding Boxes
Items: 19725
Classes: 10
Labels: 19725
Last updated on January 20, 2022
Licensed under Unknown