Skip to content

Datasets

A dataset in RiverBench is a well-described, validated, and neatly packaged collection of data that can be easily reused. Each dataset has several distributions of different lengths, with both streaming and flat variants. See the documentation pages linked below for more details.

You can find the list of all datasets in the menu on the left.

Tip

Datasets have machine-readable metadata in RDF. You can find RDF download links for each dataset on its documentation page. You can also use the HTTP content negotation mechanism.

All datasets

Dataset El. type El. count RDF-star Gen. triples Gen. datasets
assist-iot-weather graph 701,278
assist-iot-weather-graphs timestamped named graph 701,278
citypulse-traffic graph 4,382,599
citypulse-traffic-graphs timestamped named graph 4,382,599
dbpedia-live graph 166,204
digital-agenda-indicators subject graph 1,440,415
linked-spending subject graph 2,477,552
lod-katrina graph 5,893,763
muziekweb subject graph 2,450,357
nanopubs dataset 5,000,000
openaire-lod subject graph 2,000,000
politiquices graph 17,773
yago-annotated-facts subject graph 617,768

See also