Open Datasets

Browse 500+ open source datasets for your next machine learning project. Our list of free datasets keeps growing, so make sure you visit it frequently.

Oops! Something went wrong while submitting the form.
Filters
Use Cases
Clear
Vehicle-Rear
Vehicle-Rear
A novel dataset for vehicle re-identification
Image Classification
Classification Tags
Federal University of Technology - Parana, Curitiba, Brazil
Items
Classes
Labels
GDXray
GDXray
X-ray images for nondestructive testing
Object Detection
Bounding Boxes
Universidad Catolica de Chile
72
Items
5
Classes
72
Labels
NVGesture
NVGesture
Dynamic gestures of touchless driving control
3D Object Detection
Bounding Boxes
NVIDIA Corporation
1532
Items
25
Classes
1532
Labels
AFHQ
AFHQ
Animal Faces-HQ
Face Recognition
Bounding Boxes
15000
Items
3
Classes
15000
Labels
Volleyball
Volleyball
Volleyball action recognition dataset
Event Detection
Bounding Boxes
4803
Items
17
Classes
4830
Labels
SDSS Galaxies
SDSS Galaxies
Large dataset of galaxy images
GAN / Image Generation
Image Pairs
University of Hertfordshire, Hatfield
306006
Items
10
Classes
306006
Labels
VOT2019
VOT2019
Visual Object Tracking benchmark for short-term tracking
Video Object Tracking
Bounding Boxes
Academic and Research Network of Slovenia (ARNES).
Items
Classes
Labels
AnimalWeb
AnimalWeb
A Large-Scale Hierarchical Dataset of Annotated Animal Faces
Face Keypoint Estimation
Keypoint
Computer Vision Laboratory, School of Computer Science
22400
Items
350
Classes
22400
Labels
IIIT-AR-13K
IIIT-AR-13K
A Dataset for Graphical Object Detection in Documents
Object Detection
Bounding Boxes
USODI
13000
Items
5
Classes
13000
Labels
Transparent Objects Datasets
Transparent Objects Datasets
Synthetic and Real-world Datasets of Transparent Objects for Robotics
3D Object Pose
Semantic Segmentation
ROBOTICS AT GOOGLE
50000
Items
5
Classes
50000
Labels
TrackingNet
TrackingNet
A Large-Scale Dataset and Benchmark for Object Tracking in the Wild
Video Object Tracking
Bounding Boxes
King Abdullah University of Science and Technology (KAUST)
14000000
Items
Classes
Labels
SODA10M
SODA10M
Large-scale 2D dataset for object detection in autonomous driving
Object Detection
Bounding Boxes
The Chinese University of Hong Kong
10
Items
20000
Classes
10000000
Labels
Total Text Dataset
Total Text Dataset
Word-level based English curve text dataset
Text Detection
Bounding Boxes
University of Malaya
1555
Items
5
Classes
1555
Labels
nuScenes Dataset
nuScenes Dataset
A large-scale dataset for autonomous driving
Object Detection
Bounding Boxes
Motional
1400000
Items
23
Classes
1400000
Labels
CrowdFix
CrowdFix
Dataset of Human Eye Fixation over Crowd Videos
Image Classification
Classification Tags
MIT
37493
Items
3
Classes
37493
Labels
Multiple Light Source Dataset
Multiple Light Source Dataset
Dataset for computational color science
Computational Photography
Kharkevich Institute for Information
Items
Classes
24
Labels
TAO
TAO
A Large-Scale Benchmark for Tracking Any Object
Video Object Tracking
2907
Items
833
Classes
2907
Labels
COIN
COIN
A Large-scale Dataset for Comprehensive Instructional Video Analysis
Video Classification
Classification Tags
Tsinghua University
46354
Items
Classes
46354
Labels
QMNIST
QMNIST
MNIST with extended 50K test images
Image Classification
Classification Tags
FACEBOOK RESEARCH
10000
Items
2
Classes
60000
Labels
SCUT-HEAD
SCUT-HEAD
Large-scale head detection dataset
Object Detection
Bounding Boxes
South China University of Technology
111251
Items
2
Classes
4405
Labels
MIT-States
MIT-States
MIT-States
53000
Items
245
Classes
53000
Labels

Frequently Asked Questions

Looking for more materials to build trustworthy AI? Discover our resources page, packed with free guides, webinars, and V7 product updates.
Where to find machine learning datasets?

One of the best places to look for quality open source datasets is our own repository. You can use advanced filtering options and the search box to look for very specific datasets.

For example, if you’re only interested in a specific licence, such as public domain datasets, make sure to select the CC-0 option in the licence filter.

You can combine various filtering options to narrow down your search.

If you haven’t found what you’re looking for among the 500+ open datasets we’ve catalogued here, don’t despair—there are other places you may want to visit.

Start with these articles:

Each of them comes with detailed descriptions and links to online datasets for various purposes.

Can I use all public datasets on the V7 platform?

Yes, all sample datasets in our repository can be imported into V7.

What are the benefits of using open datasets?

Open datasets offer a number of benefits for computer vision projects. Firstly, they allow for easier collaboration between researchers. When data is openly available, researchers can more easily share and build upon each other’s work. This helps to accelerate the pace of research and allows for more innovative solutions to be found.

Secondly, open datasets help to ensure that the data used is of high quality. When data is openly available, it is subject to greater scrutiny from the research community. This helps to ensure that any flaws or errors in the data are quickly identified and corrected.

Finally, open datasets allow for replicability of results. When data is openly available, researchers can more easily check and verify each other’s results. This helps to build confidence in the findings of a study and allows for more reliable conclusions to be drawn.

Are all datasets on the list free?

Yes, all the online datasets in our repository are free to use. The only limitations may involve the scope of usage or requirements to attribute the dataset to its source. You can easily see what licence a given open dataset falls under on each dataset’s dedicated page.

What kinds of free datasets can I find here?

Our repository of open image datasets consists of free public datasets for computer vision projects. For your convenience we’ve divided them into several categories, e.g.:

Computer vision task types

Use cases

In fact, you can use advanced filtering options to browse our open image datasets by tasks, annotation types, use cases, or licence. Additionally, you can look for interesting datasets by typing a keyword or sorting the results alphabetically, by popularity or the number of images in the dataset.

What are open datasets?

An open dataset for machine learning is a dataset that is freely available for anyone. You can use them as datasets for projects to train and test your machine learning models.

Ready to get started?
Try our trial or talk to one of our experts.