Skip to content

Datasets

A dataset in RiverBench is a well-described, validated, and neatly packaged collection of data that can be easily reused. Each dataset has several distributions of different lengths, with both streaming and flat variants. See the documentation pages linked below for more details.

You can find the list of all datasets in the menu on the left.

Tip

Datasets have machine-readable metadata in RDF. You can find RDF download links for each dataset on its documentation page. You can also use the HTTP content negotation mechanism.

All datasets

Dataset El. type El. count RDF-star Gen. triples Gen. datasets
assist-iot-weather triples 701,278
assist-iot-weather-graphs graphs 701,278
citypulse-traffic triples 4,382,599
citypulse-traffic-graphs graphs 4,382,599
dbpedia-live triples 166,204
digital-agenda-indicators triples 1,440,415
linked-spending triples 2,477,552
lod-katrina triples 5,893,763
muziekweb triples 2,450,357
nanopubs quads 5,000,000
politiquices triples 17,773
yago-annotated-facts triples 617,768

See also