Datasets
A dataset in RiverBench is a well-described, validated, and neatly packaged collection of data that can be easily reused. Each dataset has several distributions of different lengths, with both streaming and flat variants. See the documentation pages linked below for more details.
You can find the list of all datasets in the menu on the left.
Tip
Datasets have machine-readable metadata in RDF. You can find RDF download links for each dataset on its documentation page. You can also use the HTTP content negotation mechanism.
All datasets
Dataset | El. type | El. count | RDF-star | Gen. triples | Gen. datasets |
---|---|---|---|---|---|
assist-iot-weather |
graph | 701,278 | |||
assist-iot-weather-graphs |
timestamped named graph | 701,278 | |||
citypulse-traffic |
graph | 4,382,599 | |||
citypulse-traffic-graphs |
timestamped named graph | 4,382,599 | |||
dbpedia-live |
graph | 166,204 | |||
digital-agenda-indicators |
subject graph | 1,440,415 | |||
linked-spending |
subject graph | 2,477,552 | |||
lod-katrina |
graph | 5,893,763 | |||
muziekweb |
subject graph | 2,450,357 | |||
nanopubs |
dataset | 5,000,000 | |||
openaire-lod |
subject graph | 2,000,000 | |||
politiquices |
graph | 17,773 | |||
yago-annotated-facts |
subject graph | 617,768 |
See also
- Benchmark categories – groupings of datasets and benchmark tasks.
- Dataset release format
- Dataset and profile metadata
- Creating a new dataset