Browse 500+ open source datasets for your next machine learning project. Our list of free datasets keeps growing, so make sure you visit it frequently.
An open dataset for machine learning is a dataset that is freely available for anyone. You can use them as datasets for projects to train and test your machine learning models. Some of the public datasets in our library can be easily browsed on the V7 platform with a click of a button. Other datasets need to be downloaded. We’ve marked the browsable ones with a V7 icon.
Our repository of open image datasets consists of free public datasets for computer vision projects. For your convenience we’ve divided them into several categories, e.g.:
Computer vision task types
Use cases
In fact, you can use advanced filtering options to browse our open image datasets by tasks, annotation types, use cases, or licence. Additionally, you can look for interesting datasets by typing a keyword or sorting the results alphabetically, by popularity or the number of images in the dataset.
While most datasets on our list must be downloaded and imported into an image annotation tool of your choice to view the images, you can easily look inside the open datasets hosted on V7.
All you need to do is look for a result card with a blue Browsable tag in the top right corner and a V7 icon in the description.
Click on the card, and go to the open dataset’s page. There, in the right-hand panel, click on the View this Dataset button.
After clicking the button, you’ll see all the images from the dataset.
You can click on any image in the open dataset to see the annotations.
Yes, all the online datasets in our repository are free to use. The only limitations may involve the scope of usage or requirements to attribute the dataset to its source. You can easily see what licence a given open dataset falls under on each dataset’s dedicated page.
Open datasets offer a number of benefits for computer vision projects. Firstly, they allow for easier collaboration between researchers. When data is openly available, researchers can more easily share and build upon each other’s work. This helps to accelerate the pace of research and allows for more innovative solutions to be found.
Secondly, open datasets help to ensure that the data used is of high quality. When data is openly available, it is subject to greater scrutiny from the research community. This helps to ensure that any flaws or errors in the data are quickly identified and corrected.
Finally, open datasets allow for replicability of results. When data is openly available, researchers can more easily check and verify each other’s results. This helps to build confidence in the findings of a study and allows for more reliable conclusions to be drawn.
Yes, all sample datasets in our repository can be imported into V7.
One of the best places to look for quality open source datasets is our own repository. You can use advanced filtering options and the search box to look for very specific datasets.
For example, if you’re only interested in a specific licence, such as public domain datasets, make sure to select the CC-0 option in the licence filter.
You can combine various filtering options to narrow down your search.
If you haven’t found what you’re looking for among the 500+ open datasets we’ve catalogued here, don’t despair—there are other places you may want to visit.
Start with these articles:
Each of them comes with detailed descriptions and links to online datasets for various purposes.