Dataset: digital-agenda-indicators (development version)
The dataset of EU Digital Agenda Key Indicators contains statistical information about the European information society. It is composed of a series of statistical observations with various properties and a very regular structure.
This is a large, typical statistical dataset. In contrast to linked-spending, it does not include textual information and is more homogeneous, which makes it an interesting comparison point for aggressive compression algorithms.
Info
 Download this metadata in RDF: Turtle, N-Triples, RDF/XML, Jelly
 Source repository: dataset-digital-agenda-indicators
 Permanent URL: https://w3id.org/riverbench/datasets/digital-agenda-indicators/dev
Stream preview
@prefix ns1:  <http://purl.org/linked-data/cube#> .
@prefix ns10: <http://eurostat.linked-statistics.org/dic/geo#> .
@prefix ns11: <http://semantic.digital-agenda-data.eu/codelist/unit-measure/> .
@prefix ns4:  <http://semantic.digital-agenda-data.eu/dataset/> .
@prefix ns5:  <http://purl.org/linked-data/sdmx/2009/measure#> .
@prefix ns6:  <http://semantic.digital-agenda-data.eu/def/property/> .
@prefix ns7:  <http://semantic.digital-agenda-data.eu/codelist/breakdown/> .
@prefix ns8:  <http://eurostat.linked-statistics.org/dic/flags#> .
@prefix rdf:  <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
<http://semantic.digital-agenda-data.eu/data/digital-agenda-scoreboard-key-indicators/2011/at/bb_fttpcov//2011>
        rdf:type          ns1:Observation;
        ns1:dataSet       ns4:digital-agenda-scoreboard-key-indicators;
        ns5:obsValue      "TOTAL_POPHH";
        ns6:breakdown     ns7:at;
        ns6:flag          ns8:pc_hh_all;
        ns6:note          0.052999999999999998501;
        ns6:ref-area      ns10:;
        ns6:time-period   <http://reference.data.gov.uk/id/gregorian-year/2011>;
        ns6:unit-measure  ns11:bb_fttpcov .
@prefix ns1:  <http://purl.org/linked-data/cube#> .
@prefix ns10: <http://eurostat.linked-statistics.org/dic/geo#> .
@prefix ns11: <http://semantic.digital-agenda-data.eu/codelist/unit-measure/> .
@prefix ns4:  <http://semantic.digital-agenda-data.eu/dataset/> .
@prefix ns5:  <http://purl.org/linked-data/sdmx/2009/measure#> .
@prefix ns6:  <http://semantic.digital-agenda-data.eu/def/property/> .
@prefix ns7:  <http://semantic.digital-agenda-data.eu/codelist/breakdown/> .
@prefix ns8:  <http://eurostat.linked-statistics.org/dic/flags#> .
@prefix rdf:  <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
<http://semantic.digital-agenda-data.eu/data/digital-agenda-scoreboard-key-indicators/2011/be/bb_fttpcov//2011>
        rdf:type          ns1:Observation;
        ns1:dataSet       ns4:digital-agenda-scoreboard-key-indicators;
        ns5:obsValue      "TOTAL_POPHH";
        ns6:breakdown     ns7:be;
        ns6:flag          ns8:pc_hh_all;
        ns6:note          0.0020000000000000000416;
        ns6:ref-area      ns10:;
        ns6:time-period   <http://reference.data.gov.uk/id/gregorian-year/2011>;
        ns6:unit-measure  ns11:bb_fttpcov .
@prefix ns1:  <http://purl.org/linked-data/cube#> .
@prefix ns10: <http://eurostat.linked-statistics.org/dic/geo#> .
@prefix ns11: <http://semantic.digital-agenda-data.eu/codelist/unit-measure/> .
@prefix ns4:  <http://semantic.digital-agenda-data.eu/dataset/> .
@prefix ns5:  <http://purl.org/linked-data/sdmx/2009/measure#> .
@prefix ns6:  <http://semantic.digital-agenda-data.eu/def/property/> .
@prefix ns7:  <http://semantic.digital-agenda-data.eu/codelist/breakdown/> .
@prefix ns8:  <http://eurostat.linked-statistics.org/dic/flags#> .
@prefix rdf:  <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
<http://semantic.digital-agenda-data.eu/data/digital-agenda-scoreboard-key-indicators/2011/es/bb_fttpcov//2011>
        rdf:type          ns1:Observation;
        ns1:dataSet       ns4:digital-agenda-scoreboard-key-indicators;
        ns5:obsValue      "TOTAL_POPHH";
        ns6:breakdown     ns7:es;
        ns6:flag          ns8:pc_hh_all;
        ns6:note          0.097000000000000002887;
        ns6:ref-area      ns10:;
        ns6:time-period   <http://reference.data.gov.uk/id/gregorian-year/2011>;
        ns6:unit-measure  ns11:bb_fttpcov .
@prefix ns1:  <http://purl.org/linked-data/cube#> .
@prefix ns10: <http://eurostat.linked-statistics.org/dic/geo#> .
@prefix ns11: <http://semantic.digital-agenda-data.eu/codelist/unit-measure/> .
@prefix ns4:  <http://semantic.digital-agenda-data.eu/dataset/> .
@prefix ns5:  <http://purl.org/linked-data/sdmx/2009/measure#> .
@prefix ns6:  <http://semantic.digital-agenda-data.eu/def/property/> .
@prefix ns7:  <http://semantic.digital-agenda-data.eu/codelist/breakdown/> .
@prefix ns8:  <http://eurostat.linked-statistics.org/dic/flags#> .
@prefix rdf:  <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
<http://semantic.digital-agenda-data.eu/data/digital-agenda-scoreboard-key-indicators/2014/cy/bb_fttpcov//2014>
        rdf:type          ns1:Observation;
        ns1:dataSet       ns4:digital-agenda-scoreboard-key-indicators;
        ns5:obsValue      "TOTAL_POPHH";
        ns6:breakdown     ns7:cy;
        ns6:flag          ns8:pc_hh_all;
        ns6:ref-area      ns10:;
        ns6:time-period   <http://reference.data.gov.uk/id/gregorian-year/2014>;
        ns6:unit-measure  ns11:bb_fttpcov .
@prefix ns1:  <http://purl.org/linked-data/cube#> .
@prefix ns10: <http://eurostat.linked-statistics.org/dic/geo#> .
@prefix ns11: <http://semantic.digital-agenda-data.eu/codelist/unit-measure/> .
@prefix ns4:  <http://semantic.digital-agenda-data.eu/dataset/> .
@prefix ns5:  <http://purl.org/linked-data/sdmx/2009/measure#> .
@prefix ns6:  <http://semantic.digital-agenda-data.eu/def/property/> .
@prefix ns7:  <http://semantic.digital-agenda-data.eu/codelist/breakdown/> .
@prefix ns9:  <http://semantic.digital-agenda-data.eu/codelist/indicator/> .
@prefix rdf:  <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
<http://semantic.digital-agenda-data.eu/data/digital-agenda-scoreboard-key-indicators/bb_penet/total_fbb/subs_per_100_pop/CZ/2021-06>
        rdf:type          ns1:Observation;
        ns1:dataSet       ns4:digital-agenda-scoreboard-key-indicators;
        ns5:obsValue      36.854580318763886737;
        ns6:breakdown     ns7:total_fbb;
        ns6:indicator     ns9:bb_penet;
        ns6:ref-area      ns10:CZ;
        ns6:time-period   <http://reference.data.gov.uk/id/gregorian-month/2021-06>;
        ns6:unit-measure  ns11:subs_per_100_pop .
General information
- Title: EU Digital Agenda Key Indicators (en)
- Identifier: digital-agenda-indicators
- Has version: dev
- Theme:
    - Digital technology (eurovoc:7219)
    - European Commission (eurovoc:4038)
    - European Union (eurovoc:4060)
    - Information society (eurovoc:6140)
    - Official statistics (eurovoc:4267)
- Creator:
    - European Commission, Directorate-General for Communications Networks, Content and Technology (1)
        - Name: European Commission, Directorate-General for Communications Networks, Content and Technology
        - Homepage: https://commission.europa.eu/index_en
    - Piotr Sowiński (2)
        - Name: Piotr Sowiński
        - Comment: Processing the dataset
        - Nickname: Ostrzyciel
        - Homepage:
- License: https://spdx.org/licenses/CC-BY-4.0
- Rights: According to the European Commission reuse notice, reuse is authorised, provided the source is acknowledged. The reuse policy of the European Commission is implemented by the Decision of 12 December 2011. (en)
- Source: http://semantic.digital-agenda-data.eu/dataset/digital-agenda-scoreboard-key-indicators
- Date Issued: 2023-05-04
- Date Modified: 2024-09-11
- Landing page: digital-agenda-indicators (dev)
- Conforms To: Metadata (https://w3id.org/riverbench/schema/metadata)
Technical metadata
- Has stream type usage:
    - RDF stream type usage (1)
        - Type: RDF stream type usage (stax:RdfStreamTypeUsage)
        - Comment: The dataset can be viewed as a stream of graphs corresponding to statistical observations. Each graph is uniquely identified by its subject IRI. (en)
        - Has stream type: RDF subject graph stream (stax:subjectGraphStream)
    - RDF stream type usage (2)
        - Type: RDF stream type usage (stax:RdfStreamTypeUsage)
        - Comment: The dataset can be viewed as a flattened stream of triples. (en)
        - Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has stream element count: 1,440,415
- Has stream element split:
    - Type: Stream elements split by topic (rb:TopicStreamElementSplit)
    - Comment: Each stream element corresponds to one statistical observation. The elements are ordered alphabetically, so it is likely that similar observations (e.g., from the same country) are next to each other in the stream. (en)
    - Has subject shape:
        - Has subject shape (1)
            - Comment: Some observations have no class assigned. (en)
            - Target subjects of: http://purl.org/linked-data/sdmx/2009/measure#obsValue
        - Has subject shape (2)
            - Target class: http://purl.org/linked-data/cube#Observation
- Uses vocabulary:
- Conforms to W3C RDF 1.1 specification: yes
- Conforms to W3C RDF-star draft specification as of December 17, 2021: yes
- Uses generalized triples: no
- Uses generalized RDF datasets: no
- Uses RDF-star: no
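The two stream type usages listed above are related: grouping the triples of the flat stream by subject gives the subject graph view. The following is a minimal Python sketch of that grouping, not the method used to build the dataset. It assumes that the triples of one observation are adjacent in the flat file (the metadata does not state this) and that every subject is an IRI. The helper names `subject_of` and `subject_graphs` and the example.org IRIs are made up for illustration.

```python
from itertools import groupby
from typing import Iterable, Iterator, List, Tuple


def subject_of(nt_line: str) -> str:
    """Return the subject term of one N-Triples line (its first token)."""
    return nt_line.split(None, 1)[0]


def subject_graphs(nt_lines: Iterable[str]) -> Iterator[Tuple[str, List[str]]]:
    """Group consecutive N-Triples lines that share the same subject.

    Assumption: triples with the same subject are adjacent in the input.
    """
    stripped = (line.strip() for line in nt_lines)
    # Skip blank lines and N-Triples comments.
    triples = (line for line in stripped if line and not line.startswith("#"))
    for subject, group in groupby(triples, key=subject_of):
        yield subject, list(group)


# Hypothetical sample data, not taken from the dataset.
sample = [
    '<http://example.org/obs/1> <http://example.org/value> "1" .',
    '<http://example.org/obs/1> <http://example.org/unit> <http://example.org/pct> .',
    '<http://example.org/obs/2> <http://example.org/value> "2" .',
]

for subject, triples in subject_graphs(sample):
    print(subject, len(triples))
```

If the adjacency assumption does not hold for a given file, a dictionary keyed by subject would be needed instead of `groupby`, at the cost of holding all triples in memory.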
Distributions
Download links
The dataset is published in a few size variants, each containing a specific number of stream elements. For each size, there are three distribution types available: flat (just an N-Triples/N-Quads file), streaming (a .tar.gz archive with Turtle/TriG files, one file per stream element), and Jelly (a native binary format for streaming RDF). See the documentation for more details.
| Distribution size | Statements | Flat | Streaming | Jelly | 
|---|---|---|---|---|
| 10K | 82,424 | 425.3 KB | 316.9 KB | 188.1 KB | 
| 100K | 811,625 | 4.5 MB | 3.6 MB | 1.7 MB | 
| 1M | 8,108,967 | 40.9 MB | 32.4 MB | 16.4 MB | 
| Full | 11,669,016 | 58.2 MB | 46.2 MB | 23.5 MB | 
The full metadata of all distributions can be found below.
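Because a streaming distribution is a .tar.gz archive with one Turtle file per stream element, it can be read element by element without unpacking it to disk. The sketch below uses only Python's standard `tarfile` module. The function name `iter_stream_elements` is illustrative, and the naming and ordering of files inside the archive are not described here, so the code simply yields members in archive order.

```python
import tarfile
from typing import Iterator, Tuple


def iter_stream_elements(archive_path: str) -> Iterator[Tuple[str, str]]:
    """Yield (member name, Turtle text) for each regular file in the archive.

    Members are yielded in the order in which they are stored in the archive.
    """
    with tarfile.open(archive_path, mode="r:gz") as archive:
        for member in archive:
            if not member.isfile():
                continue  # skip directories and other non-file entries
            handle = archive.extractfile(member)
            if handle is None:
                continue
            yield member.name, handle.read().decode("utf-8")
```

For example, `next(iter_stream_elements("stream_10K.tar.gz"))` would return the name and Turtle text of the first element, assuming the 10K streaming distribution has been downloaded to the working directory.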
Full stream distribution
- Title: Full stream distribution
- Identifier: stream-full
- Has file name: stream_full.tar.gz
- Has stream type usage:
    - Type: RDF stream type usage (stax:RdfStreamTypeUsage)
    - Comment: The dataset can be viewed as a stream of graphs corresponding to statistical observations. Each graph is uniquely identified by its subject IRI. (en)
    - Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has distribution type:
    - Full distribution (rb:fullDistribution)
    - Stream distribution (rb:streamDistribution)
- Has stream element count: 1,440,415
- Byte size: 46.2 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Checksum:
    - Checksum (1)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: f03c0befc9e247c76474c556aba53ee2
        - Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
    - Checksum (2)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: 62265bbbf619bd4e3cba4f95911bd224af8040a7
        - Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/digital-agenda-indicators/dev/files/stream_full.tar.gz
- Statistics: statistics-full
Full Jelly distribution
- Title: Full Jelly distribution
- Identifier: jelly-full
- Has file name: jelly_full.jelly.gz
- Has stream type usage:
    - RDF stream type usage (1)
        - Type: RDF stream type usage (stax:RdfStreamTypeUsage)
        - Comment: The dataset can be viewed as a flattened stream of triples. (en)
        - Has stream type: Flat RDF triple stream (stax:flatTripleStream)
    - RDF stream type usage (2)
        - Type: RDF stream type usage (stax:RdfStreamTypeUsage)
        - Comment: The dataset can be viewed as a stream of graphs corresponding to statistical observations. Each graph is uniquely identified by its subject IRI. (en)
        - Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has distribution type:
    - Full distribution (rb:fullDistribution)
    - Jelly distribution (rb:jellyDistribution)
- Has stream element count: 1,440,415
- Byte size: 23.5 MB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Checksum:
    - Checksum (1)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: 8f32104458f739bfa28a0c0308359066
        - Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
    - Checksum (2)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: d8494e3f52942996b3b12b9a555d3cf38528056b
        - Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/digital-agenda-indicators/dev/files/jelly_full.jelly.gz
- Statistics: statistics-full
Full flat distribution
- Title: Full flat distribution
- Identifier: flat-full
- Has file name: flat_full.nt.gz
- Has stream type usage:
    - Type: RDF stream type usage (stax:RdfStreamTypeUsage)
    - Comment: The dataset can be viewed as a flattened stream of triples. (en)
    - Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has distribution type:
    - Flat distribution (rb:flatDistribution)
    - Full distribution (rb:fullDistribution)
- Has stream element count: 1,440,415
- Byte size: 58.2 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Checksum:
    - Checksum (1)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: 424d3aae83c32f8dbd78074403f68217
        - Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
    - Checksum (2)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: 3a09b756ecd09600388b548e866977a651b25df2
        - Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/digital-agenda-indicators/dev/files/flat_full.nt.gz
- Statistics: statistics-full
1M elements stream distribution
- Title: 1M elements stream distribution
- Identifier: stream-1m
- Has file name: stream_1M.tar.gz
- Has stream type usage:
    - Type: RDF stream type usage (stax:RdfStreamTypeUsage)
    - Comment: The dataset can be viewed as a stream of graphs corresponding to statistical observations. Each graph is uniquely identified by its subject IRI. (en)
    - Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has distribution type:
    - Partial distribution (rb:partialDistribution)
    - Stream distribution (rb:streamDistribution)
- Has stream element count: 1,000,000
- Byte size: 32.4 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Checksum:
    - Checksum (1)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: 0d13d0de67b8ccb95d4339466507d5f5
        - Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
    - Checksum (2)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: 908cbcd1ae53d4dab48020ff8e3eca18e4376ac1
        - Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/digital-agenda-indicators/dev/files/stream_1M.tar.gz
- Statistics: statistics-1m
1M elements Jelly distribution
- Title: 1M elements Jelly distribution
- Identifier: jelly-1m
- Has file name: jelly_1M.jelly.gz
- Has stream type usage:
    - RDF stream type usage (1)
        - Type: RDF stream type usage (stax:RdfStreamTypeUsage)
        - Comment: The dataset can be viewed as a stream of graphs corresponding to statistical observations. Each graph is uniquely identified by its subject IRI. (en)
        - Has stream type: RDF subject graph stream (stax:subjectGraphStream)
    - RDF stream type usage (2)
        - Type: RDF stream type usage (stax:RdfStreamTypeUsage)
        - Comment: The dataset can be viewed as a flattened stream of triples. (en)
        - Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has distribution type:
    - Jelly distribution (rb:jellyDistribution)
    - Partial distribution (rb:partialDistribution)
- Has stream element count: 1,000,000
- Byte size: 16.4 MB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Checksum:
    - Checksum (1)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: 3ea986fd90d18c447ece55f67e266ff2
        - Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
    - Checksum (2)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: c706913d161c48fd898281e51476d0853b0bd3b3
        - Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/digital-agenda-indicators/dev/files/jelly_1M.jelly.gz
- Statistics: statistics-1m
1M elements flat distribution
- Title: 1M elements flat distribution
- Identifier: flat-1m
- Has file name: flat_1M.nt.gz
- Has stream type usage:
    - Type: RDF stream type usage (stax:RdfStreamTypeUsage)
    - Comment: The dataset can be viewed as a flattened stream of triples. (en)
    - Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has distribution type:
    - Flat distribution (rb:flatDistribution)
    - Partial distribution (rb:partialDistribution)
- Has stream element count: 1,000,000
- Byte size: 40.9 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Checksum:
    - Checksum (1)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: 0c47eeb8f95bdc2a58fd407f0eb03dc2
        - Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
    - Checksum (2)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: 5f179e4156ab9c26a7206d6dad4e237267d96724
        - Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/digital-agenda-indicators/dev/files/flat_1M.nt.gz
- Statistics: statistics-1m
100K elements stream distribution
- Title: 100K elements stream distribution
- Identifier: stream-100k
- Has file name: stream_100K.tar.gz
- Has stream type usage:
    - Type: RDF stream type usage (stax:RdfStreamTypeUsage)
    - Comment: The dataset can be viewed as a stream of graphs corresponding to statistical observations. Each graph is uniquely identified by its subject IRI. (en)
    - Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has distribution type:
    - Partial distribution (rb:partialDistribution)
    - Stream distribution (rb:streamDistribution)
- Has stream element count: 100,000
- Byte size: 3.6 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Checksum:
    - Checksum (1)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: 34f344d41ea4e8fbf6fe271d3b190370
        - Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
    - Checksum (2)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: c134f889bdcfb887c5e328706bed7fa1ab7fe7f6
        - Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/digital-agenda-indicators/dev/files/stream_100K.tar.gz
- Statistics: statistics-100k
100K elements Jelly distribution
- Title: 100K elements Jelly distribution
- Identifier: jelly-100k
- Has file name: jelly_100K.jelly.gz
- Has stream type usage:
    - RDF stream type usage (1)
        - Type: RDF stream type usage (stax:RdfStreamTypeUsage)
        - Comment: The dataset can be viewed as a flattened stream of triples. (en)
        - Has stream type: Flat RDF triple stream (stax:flatTripleStream)
    - RDF stream type usage (2)
        - Type: RDF stream type usage (stax:RdfStreamTypeUsage)
        - Comment: The dataset can be viewed as a stream of graphs corresponding to statistical observations. Each graph is uniquely identified by its subject IRI. (en)
        - Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has distribution type:
    - Jelly distribution (rb:jellyDistribution)
    - Partial distribution (rb:partialDistribution)
- Has stream element count: 100,000
- Byte size: 1.7 MB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Checksum:
    - Checksum (1)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: 18f37ad88d0046282c5897aba3e17884
        - Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
    - Checksum (2)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: 9d37b9ef54cb1e4a50022ee382fd61cd52329d1d
        - Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/digital-agenda-indicators/dev/files/jelly_100K.jelly.gz
- Statistics: statistics-100k
100K elements flat distribution
- Title: 100K elements flat distribution
- Identifier: flat-100k
- Has file name: flat_100K.nt.gz
- Has stream type usage:
    - Type: RDF stream type usage (stax:RdfStreamTypeUsage)
    - Comment: The dataset can be viewed as a flattened stream of triples. (en)
    - Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has distribution type:
    - Flat distribution (rb:flatDistribution)
    - Partial distribution (rb:partialDistribution)
- Has stream element count: 100,000
- Byte size: 4.5 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Checksum:
    - Checksum (1)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: 7ac28340f6418f8736cdd69afa0e3271
        - Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
    - Checksum (2)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: 66999f31334b5d6d473059a9c6c6edc738cfc31e
        - Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/digital-agenda-indicators/dev/files/flat_100K.nt.gz
- Statistics: statistics-100k
10K elements stream distribution
- Title: 10K elements stream distribution
- Identifier: stream-10k
- Has file name: stream_10K.tar.gz
- Has stream type usage:
    - Type: RDF stream type usage (stax:RdfStreamTypeUsage)
    - Comment: The dataset can be viewed as a stream of graphs corresponding to statistical observations. Each graph is uniquely identified by its subject IRI. (en)
    - Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has distribution type:
    - Partial distribution (rb:partialDistribution)
    - Stream distribution (rb:streamDistribution)
- Has stream element count: 10,000
- Byte size: 316.9 KB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Checksum:
    - Checksum (1)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: 344f6700f3e7b731511d9096d555356b
        - Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
    - Checksum (2)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: fc99a29c2237441c58322d2e77a49ad97fc1a09d
        - Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/digital-agenda-indicators/dev/files/stream_10K.tar.gz
- Statistics: statistics-10k
10K elements Jelly distribution
- Title: 10K elements Jelly distribution
- Identifier: jelly-10k
- Has file name: jelly_10K.jelly.gz
- Has stream type usage:
    - RDF stream type usage (1)
        - Type: RDF stream type usage (stax:RdfStreamTypeUsage)
        - Comment: The dataset can be viewed as a flattened stream of triples. (en)
        - Has stream type: Flat RDF triple stream (stax:flatTripleStream)
    - RDF stream type usage (2)
        - Type: RDF stream type usage (stax:RdfStreamTypeUsage)
        - Comment: The dataset can be viewed as a stream of graphs corresponding to statistical observations. Each graph is uniquely identified by its subject IRI. (en)
        - Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has distribution type:
    - Jelly distribution (rb:jellyDistribution)
    - Partial distribution (rb:partialDistribution)
- Has stream element count: 10,000
- Byte size: 188.1 KB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Checksum:
    - Checksum (1)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: 288c55af9672ed83e266a9ac2bbdb04a
        - Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
    - Checksum (2)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: add4f4a80c6729a56a6a7b225a3ef2b098fa0738
        - Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/digital-agenda-indicators/dev/files/jelly_10K.jelly.gz
- Statistics: statistics-10k
10K elements flat distribution
- Title: 10K elements flat distribution
- Identifier: flat-10k
- Has file name: flat_10K.nt.gz
- Has stream type usage:
    - Type: RDF stream type usage (stax:RdfStreamTypeUsage)
    - Comment: The dataset can be viewed as a flattened stream of triples. (en)
    - Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has distribution type:
    - Flat distribution (rb:flatDistribution)
    - Partial distribution (rb:partialDistribution)
- Has stream element count: 10,000
- Byte size: 425.3 KB
- Media type: application/n-triples
- Compression format: application/gzip
- Checksum:
    - Checksum (1)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: 888d8e17eb951d7153479dd03224279c
        - Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
    - Checksum (2)
        - Type: Checksum (spdx:Checksum)
        - ChecksumValue: 62a677f3c18cee1b54ad9add4ddf46512552fca1
        - Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/digital-agenda-indicators/dev/files/flat_10K.nt.gz
- Statistics: statistics-10k
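Each distribution above lists an MD5 and a SHA-1 checksum, so a downloaded file can be checked before use. The following is a small sketch using Python's standard `hashlib`. The function names are illustrative, and it assumes the checksums were computed over the compressed file exactly as downloaded, which the metadata does not state explicitly.

```python
import hashlib


def file_digest(path: str, algorithm: str = "sha1", chunk_size: int = 1 << 20) -> str:
    """Return the hex digest of a file, reading it in chunks to limit memory use."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path: str, expected_hex: str, algorithm: str = "sha1") -> bool:
    """Compare a file's digest with an expected hex string (case-insensitive)."""
    return file_digest(path, algorithm) == expected_hex.strip().lower()
```

For instance, with the 10K flat distribution saved locally as `flat_10K.nt.gz`, the call `matches_checksum("flat_10K.nt.gz", "62a677f3c18cee1b54ad9add4ddf46512552fca1")` should return `True` if the download is intact and the assumption above holds.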
Statistics
Statistics for full distributions
- Title: Statistics for full distributions
| Statistic | Sum | Unique | Mean | St. dev. | Min. | Max. |
|---|---|---|---|---|---|---|
| IRIs | 23,320,388 | ~ 1,445,216 | 16.19 | 0.61 | 2 | 19 | 
| Blank nodes | 0 | N/A | 0.00 | 0.00 | 0 | 0 | 
| Predicates | 11,668,881 | ~ 10 | 8.10 | 0.38 | 1 | 10 | 
| Objects | 11,669,016 | ~ 754,785 | 8.10 | 0.38 | 1 | 11 | 
| Graphs | 1,440,415 | ~ 1 | 1.00 | 0.00 | 1 | 1 | 
| Statements | 11,669,016 | N/A | 8.10 | 0.38 | 1 | 11 | 
| Literals | 1,457,924 | ~ 753,613 | 1.01 | 0.30 | 0 | 2 | 
| Simple literals | 81,176 | ~ 1,023 | 0.06 | 0.23 | 0 | 1 | 
| Datatype literals | 1,376,748 | ~ 752,828 | 0.96 | 0.21 | 0 | 2 | 
| Language literals | 0 | ~ 0 | 0.00 | 0.00 | 0 | 0 | 
| Datatypes | 1,376,667 | 7 | 0.96 | 0.21 | 0 | 2 | 
| ASCII control chars | 0 | N/A | 0.00 | 0.00 | 0 | 0 | 
| Quoted triples | 0 | N/A | 0.00 | 0.00 | 0 | 0 | 
| Subjects | 1,440,415 | ~ 1,444,337 | 1.00 | 0.00 | 1 | 1 | 
Statistics for 1M distributions
- Title: Statistics for 1M distributions
| Statistic | Sum | Unique | Mean | St. dev. | Min. | Max. |
|---|---|---|---|---|---|---|
| IRIs | 16,203,498 | ~ 998,804 | 16.20 | 0.61 | 2 | 19 | 
| Blank nodes | 0 | N/A | 0.00 | 0.00 | 0 | 0 | 
| Predicates | 8,108,836 | ~ 10 | 8.11 | 0.39 | 1 | 10 | 
| Objects | 8,108,967 | ~ 612,803 | 8.11 | 0.39 | 1 | 11 | 
| Graphs | 1,000,000 | ~ 1 | 1.00 | 0.00 | 1 | 1 | 
| Statements | 8,108,967 | N/A | 8.11 | 0.39 | 1 | 11 | 
| Literals | 1,014,305 | ~ 611,100 | 1.01 | 0.31 | 0 | 2 | 
| Simple literals | 62,275 | ~ 940 | 0.06 | 0.24 | 0 | 1 | 
| Datatype literals | 952,030 | ~ 610,289 | 0.95 | 0.21 | 0 | 2 | 
| Language literals | 0 | ~ 0 | 0.00 | 0.00 | 0 | 0 | 
| Datatypes | 951,952 | 7 | 0.95 | 0.21 | 0 | 2 | 
| ASCII control chars | 0 | N/A | 0.00 | 0.00 | 0 | 0 | 
| Quoted triples | 0 | N/A | 0.00 | 0.00 | 0 | 0 | 
| Subjects | 1,000,000 | ~ 998,307 | 1.00 | 0.00 | 1 | 1 | 
Statistics for 100K distributions
- Title: Statistics for 100K distributions
| Statistic | Sum | Unique | Mean | St. dev. | Min. | Max. |
|---|---|---|---|---|---|---|
| IRIs | 1,618,602 | ~ 100,272 | 16.19 | 0.48 | 2 | 19 | 
| Blank nodes | 0 | N/A | 0.00 | 0.00 | 0 | 0 | 
| Predicates | 811,624 | ~ 10 | 8.12 | 0.35 | 1 | 10 | 
| Objects | 811,625 | ~ 81,870 | 8.12 | 0.35 | 1 | 10 | 
| Graphs | 100,000 | ~ 1 | 1.00 | 0.00 | 1 | 1 | 
| Statements | 811,625 | N/A | 8.12 | 0.35 | 1 | 10 | 
| Literals | 104,647 | ~ 81,315 | 1.05 | 0.41 | 0 | 2 | 
| Simple literals | 12,276 | ~ 376 | 0.12 | 0.33 | 0 | 1 | 
| Datatype literals | 92,371 | ~ 80,959 | 0.92 | 0.27 | 0 | 2 | 
| Language literals | 0 | ~ 0 | 0.00 | 0.00 | 0 | 0 | 
| Datatypes | 92,370 | 6 | 0.92 | 0.27 | 0 | 1 | 
| ASCII control chars | 0 | N/A | 0.00 | 0.00 | 0 | 0 | 
| Quoted triples | 0 | N/A | 0.00 | 0.00 | 0 | 0 | 
| Subjects | 100,000 | ~ 100,048 | 1.00 | 0.00 | 1 | 1 | 
Statistics for 10K distributions
- Title: Statistics for 10K distributions
| Statistic | Sum | Unique | Mean | St. dev. | Min. | Max. |
|---|---|---|---|---|---|---|
| IRIs | 162,375 | ~ 10,197 | 16.24 | 0.60 | 2 | 17 | 
| Blank nodes | 0 | N/A | 0.00 | 0.00 | 0 | 0 | 
| Predicates | 82,424 | ~ 10 | 8.24 | 0.50 | 1 | 9 | 
| Objects | 82,424 | ~ 8,753 | 8.24 | 0.50 | 1 | 9 | 
| Graphs | 10,000 | ~ 1 | 1.00 | 0.00 | 1 | 1 | 
| Statements | 82,424 | N/A | 8.24 | 0.50 | 1 | 9 | 
| Literals | 12,473 | ~ 8,587 | 1.25 | 0.47 | 0 | 2 | 
| Simple literals | 2,975 | ~ 6 | 0.30 | 0.46 | 0 | 1 | 
| Datatype literals | 9,498 | ~ 8,581 | 0.95 | 0.22 | 0 | 1 | 
| Language literals | 0 | ~ 0 | 0.00 | 0.00 | 0 | 0 | 
| Datatypes | 9,498 | 6 | 0.95 | 0.22 | 0 | 1 | 
| ASCII control chars | 0 | N/A | 0.00 | 0.00 | 0 | 0 | 
| Quoted triples | 0 | N/A | 0.00 | 0.00 | 0 | 0 | 
| Subjects | 10,000 | ~ 10,033 | 1.00 | 0.00 | 1 | 1 |
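The Mean column in the tables above appears to be a per-stream-element average, that is, the Sum divided by the number of stream elements. This reading is inferred from the published numbers rather than stated on the page. The short Python check below reproduces the mean number of statements per element for each size variant from the Statements sums and the stream element counts.

```python
# (statements, stream elements) per size variant, copied from the tables above.
totals = {
    "full": (11_669_016, 1_440_415),
    "1M": (8_108_967, 1_000_000),
    "100K": (811_625, 100_000),
    "10K": (82_424, 10_000),
}


def mean_per_element(total: int, elements: int) -> float:
    """Average count per stream element."""
    return total / elements


means = {
    name: round(mean_per_element(statements, elements), 2)
    for name, (statements, elements) in totals.items()
}

for name, value in means.items():
    print(f"{name}: {value:.2f} statements per stream element")
```

The rounded results (8.10, 8.11, 8.12, and 8.24) agree with the Mean values in the Statements rows.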