muziekweb (development version)
The dataset consists of the main graph of Muziekweb, a high-quality Dutch knowledge base about music, containing information about artists, CD, LPs, and more. The knowledge base is richly annotated and contains plentiful links to external resources.
General information
- Title: Muziekweb
- Identifier: muziekweb
- Has version: dev
- Theme:
- Encyclopedic (rbt:encyclopedic)
- Musical (rbt:musical)
- Creator:
- **
- Nederlands instituut voor Beeld & Geluid
- Netherlands Institute for Sound and Vision (1)**
- Name:
- Nederlands instituut voor Beeld & Geluid
- Netherlands Institute for Sound and Vision
- Homepage: https://www.beeldengeluid.nl/
- Name:
- Piotr Sowiński (2)
- Name: Piotr Sowiński
- Nickname: Ostrzyciel
- Homepage:
- Comment: Processing the dataset
- License: https://spdx.org/licenses/ODC-By-1.0
- Source: https://data.muziekweb.nl/MuziekwebOrganization/Muziekweb
- Date Issued: 2023-05-09
- Date Modified: 2023-05-09
- Landing page: muziekweb (dev)
- Conforms To: Metadata (https://w3id.org/riverbench/schema/metadata)
Technical metadata
- Has stream element type: Triples (rb:triples)
- Has stream element count: 2,450,357
- Has stream element split:
- Type: Stream elements split by topic (rb:TopicStreamElementSplit)
- Comment: Each stream element corresponds to a different item in the knowledge base. The size of elements varies depending on how much information is there on a given item.
- Uses ontology:
- Conforms to W3C RDF 1.1 specification: yes
- Conforms to W3C RDF-star draft specification as of December 17, 2021: yes
- Uses generalized triples: no
- Uses generalized RDF datasets: no
- Uses RDF-star: no
- Language: nl
Distributions
Full triple stream distribution
- Title: Full triple stream distribution
- Identifier: stream-full
- Has file name: stream_full.tar.gz
- Has distribution type:
- Full distribution (rb:fullDistribution)
- Triple stream distribution (rb:tripleStreamDistribution)
- Has stream element count: 2,450,357
- Byte size: 252.24 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
05d1f24a2128afd297ade7d2f5548329
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
d07e8a4b85e664cad42a31579698b47a309d467e
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/muziekweb/dev/files/stream_full.tar.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 44,452,562
- Unique count (estimated): 3,453,225
- Mean: 18.14
- Standard deviation: 18.46
- Minimum: 3
- Maximum: 6,638
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 15,040,430
- Unique count (estimated): 4,895,817
- Mean: 6.14
- Standard deviation: 5.00
- Minimum: 0
- Maximum: 338
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 4,975,319
- Unique count (estimated): 2,229,827
- Mean: 2.03
- Standard deviation: 0.91
- Minimum: 0
- Maximum: 336
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 5,383,476
- Unique count (estimated): 1,132,123
- Mean: 2.20
- Standard deviation: 3.03
- Minimum: 0
- Maximum: 7
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 4,681,635
- Unique count (estimated): 1,533,823
- Mean: 1.91
- Standard deviation: 2.04
- Minimum: 0
- Maximum: 12
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 2,450,357
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 22,909,902
- Mean: 9.35
- Standard deviation: 6.79
- Minimum: 1
- Maximum: 25
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 34,132,733
- Mean: 13.93
- Standard deviation: 16.91
- Minimum: 1
- Maximum: 6,659
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 36,195,263
- Mean: 14.77
- Standard deviation: 17.50
- Minimum: 1
- Maximum: 6,660
Full flat distribution
- Title: Full flat distribution
- Identifier: flat-full
- Has file name: flat_full.nt.gz
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Full distribution (rb:fullDistribution)
- Has stream element count: 2,450,357
- Byte size: 299.99 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
f58b557fcbba80a54e793d4f13374724
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
581bcc07703468d3a61f6bfa948ea97dcf74ab90
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/muziekweb/dev/files/flat_full.nt.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 44,452,562
- Unique count (estimated): 3,453,225
- Mean: 18.14
- Standard deviation: 18.46
- Minimum: 3
- Maximum: 6,638
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 15,040,430
- Unique count (estimated): 4,895,817
- Mean: 6.14
- Standard deviation: 5.00
- Minimum: 0
- Maximum: 338
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 4,975,319
- Unique count (estimated): 2,229,827
- Mean: 2.03
- Standard deviation: 0.91
- Minimum: 0
- Maximum: 336
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 5,383,476
- Unique count (estimated): 1,132,123
- Mean: 2.20
- Standard deviation: 3.03
- Minimum: 0
- Maximum: 7
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 4,681,635
- Unique count (estimated): 1,533,823
- Mean: 1.91
- Standard deviation: 2.04
- Minimum: 0
- Maximum: 12
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 2,450,357
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 22,909,902
- Mean: 9.35
- Standard deviation: 6.79
- Minimum: 1
- Maximum: 25
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 34,132,733
- Mean: 13.93
- Standard deviation: 16.91
- Minimum: 1
- Maximum: 6,659
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 36,195,263
- Mean: 14.77
- Standard deviation: 17.50
- Minimum: 1
- Maximum: 6,660
1M elements triple stream distribution
- Title: 1M elements triple stream distribution
- Identifier: stream-1m
- Has file name:
stream_1M.tar.gz
- Has distribution type:
- Partial distribution (rb:partialDistribution)
- Triple stream distribution (rb:tripleStreamDistribution)
- Has stream element count: 1,000,000
- Byte size: 87.39 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
540b745ee9a48e10556f57a787bcd2f4
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
604ddb73971e94e99b8cda8b39d9ac8841ebadd8
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/muziekweb/dev/files/stream_1M.tar.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 9,658,163
- Unique count (estimated): 1,110,239
- Mean: 9.66
- Standard deviation: 7.13
- Minimum: 3
- Maximum: 57
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 4,125,961
- Unique count (estimated): 2,300,962
- Mean: 4.13
- Standard deviation: 2.56
- Minimum: 0
- Maximum: 18
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 2,136,904
- Unique count (estimated): 1,086,807
- Mean: 2.14
- Standard deviation: 0.89
- Minimum: 0
- Maximum: 10
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 783,604
- Unique count (estimated): 386,853
- Mean: 0.78
- Standard deviation: 1.71
- Minimum: 0
- Maximum: 7
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 1,205,453
- Unique count (estimated): 827,230
- Mean: 1.21
- Standard deviation: 1.07
- Minimum: 0
- Maximum: 8
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 1,000,000
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 6,078,202
- Mean: 6.08
- Standard deviation: 3.60
- Minimum: 1
- Maximum: 24
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 6,705,922
- Mean: 6.71
- Standard deviation: 6.08
- Minimum: 1
- Maximum: 50
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 6,916,692
- Mean: 6.92
- Standard deviation: 6.56
- Minimum: 1
- Maximum: 52
1M elements flat distribution
- Title: 1M elements flat distribution
- Identifier: flat-1m
- Has file name:
flat_1M.nt.gz
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 1,000,000
- Byte size: 91.62 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
1011fea0166aff65b58c50494d3e6aca
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
8a5e329b0d7f5c96518a8b7a8919f6192e67aa00
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/muziekweb/dev/files/flat_1M.nt.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 9,658,163
- Unique count (estimated): 1,110,239
- Mean: 9.66
- Standard deviation: 7.13
- Minimum: 3
- Maximum: 57
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 4,125,961
- Unique count (estimated): 2,300,962
- Mean: 4.13
- Standard deviation: 2.56
- Minimum: 0
- Maximum: 18
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 2,136,904
- Unique count (estimated): 1,086,807
- Mean: 2.14
- Standard deviation: 0.89
- Minimum: 0
- Maximum: 10
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 783,604
- Unique count (estimated): 386,853
- Mean: 0.78
- Standard deviation: 1.71
- Minimum: 0
- Maximum: 7
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 1,205,453
- Unique count (estimated): 827,230
- Mean: 1.21
- Standard deviation: 1.07
- Minimum: 0
- Maximum: 8
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 1,000,000
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 6,078,202
- Mean: 6.08
- Standard deviation: 3.60
- Minimum: 1
- Maximum: 24
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 6,705,922
- Mean: 6.71
- Standard deviation: 6.08
- Minimum: 1
- Maximum: 50
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 6,916,692
- Mean: 6.92
- Standard deviation: 6.56
- Minimum: 1
- Maximum: 52
100K elements triple stream distribution
- Title: 100K elements triple stream distribution
- Identifier:
stream-100k
- Has file name:
stream_100K.tar.gz
- Has distribution type:
- Partial distribution (rb:partialDistribution)
- Triple stream distribution (rb:tripleStreamDistribution)
- Has stream element count: 100,000
- Byte size: 8.41 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
68c9b5e2bb4da4fdce021f66a87b9916
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
a5f995701fdfdf9174f198078f39e2a209455e91
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/muziekweb/dev/files/stream_100K.tar.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 778,178
- Unique count (estimated): 110,452
- Mean: 7.78
- Standard deviation: 1.38
- Minimum: 5
- Maximum: 9
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 345,232
- Unique count (estimated): 242,134
- Mean: 3.45
- Standard deviation: 0.60
- Minimum: 2
- Maximum: 12
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 219,197
- Unique count (estimated): 118,937
- Mean: 2.19
- Standard deviation: 0.88
- Minimum: 1
- Maximum: 9
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 33,557
- Unique count (estimated): 33,413
- Mean: 0.34
- Standard deviation: 0.47
- Minimum: 0
- Maximum: 1
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 92,478
- Unique count (estimated): 89,792
- Mean: 0.92
- Standard deviation: 0.29
- Minimum: 0
- Maximum: 3
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 100,000
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 513,894
- Mean: 5.14
- Standard deviation: 0.93
- Minimum: 3
- Maximum: 6
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 509,516
- Mean: 5.10
- Standard deviation: 0.96
- Minimum: 3
- Maximum: 14
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 517,454
- Mean: 5.17
- Standard deviation: 1.01
- Minimum: 3
- Maximum: 14
100K elements flat distribution
- Title: 100K elements flat distribution
- Identifier: flat-100k
- Has file name:
flat_100K.nt.gz
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 100,000
- Byte size: 8.40 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
4ff97f9a0544966689d6ae9f6193840d
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
48d29abfaaf5bb52e00a3b9e43cd2b522ac3235d
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/muziekweb/dev/files/flat_100K.nt.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 778,178
- Unique count (estimated): 110,452
- Mean: 7.78
- Standard deviation: 1.38
- Minimum: 5
- Maximum: 9
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 345,232
- Unique count (estimated): 242,134
- Mean: 3.45
- Standard deviation: 0.60
- Minimum: 2
- Maximum: 12
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 219,197
- Unique count (estimated): 118,937
- Mean: 2.19
- Standard deviation: 0.88
- Minimum: 1
- Maximum: 9
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 33,557
- Unique count (estimated): 33,413
- Mean: 0.34
- Standard deviation: 0.47
- Minimum: 0
- Maximum: 1
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 92,478
- Unique count (estimated): 89,792
- Mean: 0.92
- Standard deviation: 0.29
- Minimum: 0
- Maximum: 3
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 100,000
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 513,894
- Mean: 5.14
- Standard deviation: 0.93
- Minimum: 3
- Maximum: 6
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 509,516
- Mean: 5.10
- Standard deviation: 0.96
- Minimum: 3
- Maximum: 14
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 517,454
- Mean: 5.17
- Standard deviation: 1.01
- Minimum: 3
- Maximum: 14
10K elements triple stream distribution
- Title: 10K elements triple stream distribution
- Identifier: stream-10k
- Has file name:
stream_10K.tar.gz
- Has distribution type:
- Partial distribution (rb:partialDistribution)
- Triple stream distribution (rb:tripleStreamDistribution)
- Has stream element count: 10,000
- Byte size: 862.33 KB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
cb0047b1c8fe6ec68fbc4a28ad7360fd
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
e38822c619c7b066aaf34b88aca3ba818d3a6272
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/muziekweb/dev/files/stream_10K.tar.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 77,802
- Unique count (estimated): 12,476
- Mean: 7.78
- Standard deviation: 1.38
- Minimum: 5
- Maximum: 9
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 34,504
- Unique count (estimated): 24,637
- Mean: 3.45
- Standard deviation: 0.60
- Minimum: 2
- Maximum: 8
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 21,909
- Unique count (estimated): 12,109
- Mean: 2.19
- Standard deviation: 0.87
- Minimum: 1
- Maximum: 6
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 3,332
- Unique count (estimated): 3,332
- Mean: 0.33
- Standard deviation: 0.47
- Minimum: 0
- Maximum: 1
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 9,263
- Unique count (estimated): 9,194
- Mean: 0.93
- Standard deviation: 0.29
- Minimum: 0
- Maximum: 2
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 10,000
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 51,364
- Mean: 5.14
- Standard deviation: 0.93
- Minimum: 3
- Maximum: 6
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 50,942
- Mean: 5.09
- Standard deviation: 0.96
- Minimum: 3
- Maximum: 10
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 51,721
- Mean: 5.17
- Standard deviation: 1.01
- Minimum: 3
- Maximum: 11
10K elements flat distribution
- Title: 10K elements flat distribution
- Identifier: flat-10k
- Has file name:
flat_10K.nt.gz
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 10,000
- Byte size: 860.15 KB
- Media type: application/n-triples
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
731f48d6c977953b93a29538083de4ee
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
b724456290640444197f05b6117f593af99d44fe
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/muziekweb/dev/files/flat_10K.nt.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 77,802
- Unique count (estimated): 12,476
- Mean: 7.78
- Standard deviation: 1.38
- Minimum: 5
- Maximum: 9
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 34,504
- Unique count (estimated): 24,637
- Mean: 3.45
- Standard deviation: 0.60
- Minimum: 2
- Maximum: 8
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 21,909
- Unique count (estimated): 12,109
- Mean: 2.19
- Standard deviation: 0.87
- Minimum: 1
- Maximum: 6
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 3,332
- Unique count (estimated): 3,332
- Mean: 0.33
- Standard deviation: 0.47
- Minimum: 0
- Maximum: 1
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 9,263
- Unique count (estimated): 9,194
- Mean: 0.93
- Standard deviation: 0.29
- Minimum: 0
- Maximum: 2
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 10,000
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 51,364
- Mean: 5.14
- Standard deviation: 0.93
- Minimum: 3
- Maximum: 6
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 50,942
- Mean: 5.09
- Standard deviation: 0.96
- Minimum: 3
- Maximum: 10
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 51,721
- Mean: 5.17
- Standard deviation: 1.01
- Minimum: 3
- Maximum: 11