
Datasets

A dataset in RiverBench is a well-described, validated, and neatly packaged collection of data that can be easily reused. Each dataset has several distributions of different lengths, with both streaming and flat variants. See the documentation pages linked below for more details.

You can find the list of all datasets in the menu on the left.

Tip

Datasets have machine-readable metadata in RDF. You can find RDF download links for each dataset on its documentation page. You can also use the HTTP content negotiation mechanism.
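
As a rough illustration of content negotiation, the sketch below builds an HTTP request that asks for a dataset's metadata in a specific RDF serialization through the `Accept` header. The helper names (`build_metadata_request`, `fetch_metadata`) and the example URL are assumptions made for this example, not part of RiverBench. Take the actual metadata URL from the dataset's documentation page, and note that the set of serializations the server offers may differ from the ones listed here.

```python
from urllib.request import Request, urlopen

# Standard IANA media types for common RDF serializations.
# Which of these the server actually offers is an assumption.
RDF_MEDIA_TYPES = {
    "turtle": "text/turtle",
    "ntriples": "application/n-triples",
    "rdfxml": "application/rdf+xml",
}


def build_metadata_request(dataset_url: str, rdf_format: str = "turtle") -> Request:
    """Build a GET request asking for the given RDF serialization.

    Hypothetical helper: `dataset_url` should be the dataset's metadata URL,
    copied from its documentation page.
    """
    return Request(dataset_url, headers={"Accept": RDF_MEDIA_TYPES[rdf_format]})


def fetch_metadata(dataset_url: str, rdf_format: str = "turtle") -> str:
    """Send the request and return the response body as text."""
    request = build_metadata_request(dataset_url, rdf_format)
    with urlopen(request, timeout=30) as response:
        return response.read().decode("utf-8")


if __name__ == "__main__":
    # Assumed URL pattern, shown only for illustration.
    example_url = "https://w3id.org/riverbench/datasets/politiquices/dev"
    print(fetch_metadata(example_url, "turtle")[:500])
```

Building the request is kept separate from sending it, so the headers can be inspected without network access.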

All datasets

| Dataset | Element type | Element count | Statement count | RDF-star |
|---|---|---:|---:|---|
| assist-iot-weather | graph | 701,278 | 80,646,970 | |
| assist-iot-weather-graphs | timestamped named graph | 701,278 | 81,348,248 | |
| citypulse-traffic | graph | 4,382,599 | 157,773,564 | |
| citypulse-traffic-graphs | timestamped named graph | 4,382,599 | 162,156,163 | |
| dbpedia-live | graph | 166,204 | 21,831,109 | |
| digital-agenda-indicators | subject graph | 1,440,415 | 11,669,016 | |
| linked-spending | subject graph | 2,477,552 | 55,097,866 | |
| lod-katrina | graph | 5,893,763 | 179,128,407 | |
| muziekweb | subject graph | 2,450,357 | 36,195,263 | |
| nanopubs | dataset | 5,000,000 | 171,885,662 | |
| officegraph | subject graph | 14,930,478 | 91,378,858 | |
| openaire-lod | subject graph | 2,000,000 | 71,810,467 | |
| osm2rdf-denmark | subject graph | 2,030,923 | 60,608,642 | |
| politiquices | graph | 17,773 | 159,957 | |
| yago-annotated-facts | subject graph | 617,768 | 2,484,547 | |

See also