linked-spending (development version)
This is a subset of the LinkedSpending dataset (LS package 2013-9), which contains government spending information from around the world. The dataset uses the RDF Data Cube vocabulary. Only the spending observations were kept in this subset, extra contextual information was discarded. See the website and the paper for more details.
Info
Download this metadata in RDF: Turtle, N-Triples, RDF/XML
Source repository: linked-spending
General information
- Title: LinkedSpending
- Identifier: linked-spending
- Has version: dev
- Theme:
- Government (rbt:government)
- Statistical (rbt:statistical)
- Temporal (rbt:temporal)
- Creator:
- Konrad Höffner (1)
- Name: Konrad Höffner
- Homepage: https://www.imise.uni-leipzig.de/Mitarbeiter/Konrad_Hoeffner
- Comment: Creator and maintainer of the LinkedSpending dataset.
- AKSW team (2)
- Name: AKSW team
- Homepage: http://aksw.org/Team
- Piotr Sowiński (3)
- Name: Piotr Sowiński
- Nickname: Ostrzyciel
- Homepage:
- Comment: Processing the dataset
- Konrad Höffner (1)
- License: https://spdx.org/licenses/PDDL-1.0
- Source:
- Date Issued: 2023-05-01
- Date Modified: 2023-05-08
- Landing page: linked-spending (dev)
- Conforms To: Metadata (https://w3id.org/riverbench/schema/metadata)
Technical metadata
- Has stream element type: Triples (rb:triples)
- Has stream element count: 2,477,552
- Has stream element split:
- Type: Stream elements split by topic (rb:TopicStreamElementSplit)
- Comment: Each stream element corresponds to one observation in the dataset.
- Uses ontology:
- Conforms to W3C RDF 1.1 specification: yes
- Conforms to W3C RDF-star draft specification as of December 17, 2021: yes
- Uses generalized triples: no
- Uses generalized RDF datasets: no
- Uses RDF-star: no
Distributions
Full triple stream distribution
- Title: Full triple stream distribution
- Identifier: stream-full
- Has file name: stream_full.tar.gz
- Has distribution type:
- Full distribution (rb:fullDistribution)
- Triple stream distribution (rb:tripleStreamDistribution)
- Has stream element count: 2,477,552
- Byte size: 346.65 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
14c38fead675fb2af0eada9524d4ce2c
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
7e583f895cda90908bae1e82f12cf43f2da65252
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/linked-spending/dev/files/stream_full.tar.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 83,873,394
- Unique count (estimated): 3,239,719
- Mean: 33.85
- Standard deviation: 12.84
- Minimum: 3
- Maximum: 84
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 2,583,713
- Mean: 1.04
- Standard deviation: 0.21
- Minimum: 0
- Maximum: 2
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 18,740,789
- Unique count (estimated): 4,846,042
- Mean: 7.56
- Standard deviation: 3.87
- Minimum: 0
- Maximum: 43
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 15,143,397
- Unique count (estimated): 4,838,859
- Mean: 6.11
- Standard deviation: 3.69
- Minimum: 0
- Maximum: 43
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 3,597,392
- Unique count (estimated): 7,200
- Mean: 1.45
- Standard deviation: 0.55
- Minimum: 0
- Maximum: 4
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 0
- Unique count (estimated): 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 2,477,552
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 51,106,554
- Mean: 20.63
- Standard deviation: 7.50
- Minimum: 2
- Maximum: 49
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 51,613,790
- Mean: 20.83
- Standard deviation: 7.97
- Minimum: 1
- Maximum: 86
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 55,097,866
- Mean: 22.24
- Standard deviation: 9.61
- Minimum: 2
- Maximum: 86
Full flat distribution
- Title: Full flat distribution
- Identifier: flat-full
- Has file name: flat_full.nt.gz
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Full distribution (rb:fullDistribution)
- Has stream element count: 2,477,552
- Byte size: 578.85 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
47c17e3a656f1eb667d57f59782c8548
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
38033a10a3fe1170303d18d600df18195af21f46
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/linked-spending/dev/files/flat_full.nt.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 83,873,394
- Unique count (estimated): 3,239,719
- Mean: 33.85
- Standard deviation: 12.84
- Minimum: 3
- Maximum: 84
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 2,583,713
- Mean: 1.04
- Standard deviation: 0.21
- Minimum: 0
- Maximum: 2
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 18,740,789
- Unique count (estimated): 4,846,042
- Mean: 7.56
- Standard deviation: 3.87
- Minimum: 0
- Maximum: 43
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 15,143,397
- Unique count (estimated): 4,838,859
- Mean: 6.11
- Standard deviation: 3.69
- Minimum: 0
- Maximum: 43
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 3,597,392
- Unique count (estimated): 7,200
- Mean: 1.45
- Standard deviation: 0.55
- Minimum: 0
- Maximum: 4
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 0
- Unique count (estimated): 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 2,477,552
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 51,106,554
- Mean: 20.63
- Standard deviation: 7.50
- Minimum: 2
- Maximum: 49
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 51,613,790
- Mean: 20.83
- Standard deviation: 7.97
- Minimum: 1
- Maximum: 86
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 55,097,866
- Mean: 22.24
- Standard deviation: 9.61
- Minimum: 2
- Maximum: 86
1M elements triple stream distribution
- Title: 1M elements triple stream distribution
- Identifier: stream-1m
- Has file name:
stream_1M.tar.gz
- Has distribution type:
- Partial distribution (rb:partialDistribution)
- Triple stream distribution (rb:tripleStreamDistribution)
- Has stream element count: 1,000,000
- Byte size: 140.18 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
96dc6852318ec522aad10e515bbee938
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
ebd7d8a5cc47a45152966252a7541bde96843d3a
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/linked-spending/dev/files/stream_1M.tar.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 35,194,519
- Unique count (estimated): 1,331,576
- Mean: 35.19
- Standard deviation: 13.13
- Minimum: 3
- Maximum: 60
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 1,020,633
- Mean: 1.02
- Standard deviation: 0.15
- Minimum: 0
- Maximum: 2
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 7,242,453
- Unique count (estimated): 2,000,374
- Mean: 7.24
- Standard deviation: 3.04
- Minimum: 0
- Maximum: 26
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 5,666,659
- Unique count (estimated): 1,994,553
- Mean: 5.67
- Standard deviation: 2.79
- Minimum: 0
- Maximum: 26
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 1,575,794
- Unique count (estimated): 5,841
- Mean: 1.58
- Standard deviation: 0.62
- Minimum: 0
- Maximum: 4
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 0
- Unique count (estimated): 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 1,000,000
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 19,928,203
- Mean: 19.93
- Standard deviation: 5.05
- Minimum: 2
- Maximum: 32
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 22,529,402
- Mean: 22.53
- Standard deviation: 9.84
- Minimum: 1
- Maximum: 52
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 23,371,403
- Mean: 23.37
- Standard deviation: 10.26
- Minimum: 2
- Maximum: 52
1M elements flat distribution
- Title: 1M elements flat distribution
- Identifier: flat-1m
- Has file name:
flat_1M.nt.gz
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 1,000,000
- Byte size: 234.35 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
94a35da20e0c4ee7a02d8d180909624e
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
f8d78811882fdabb83df3b880db7f231eda20a7c
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/linked-spending/dev/files/flat_1M.nt.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 35,194,519
- Unique count (estimated): 1,331,576
- Mean: 35.19
- Standard deviation: 13.13
- Minimum: 3
- Maximum: 60
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 1,020,633
- Mean: 1.02
- Standard deviation: 0.15
- Minimum: 0
- Maximum: 2
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 7,242,453
- Unique count (estimated): 2,000,374
- Mean: 7.24
- Standard deviation: 3.04
- Minimum: 0
- Maximum: 26
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 5,666,659
- Unique count (estimated): 1,994,553
- Mean: 5.67
- Standard deviation: 2.79
- Minimum: 0
- Maximum: 26
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 1,575,794
- Unique count (estimated): 5,841
- Mean: 1.58
- Standard deviation: 0.62
- Minimum: 0
- Maximum: 4
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 0
- Unique count (estimated): 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 1,000,000
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 19,928,203
- Mean: 19.93
- Standard deviation: 5.05
- Minimum: 2
- Maximum: 32
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 22,529,402
- Mean: 22.53
- Standard deviation: 9.84
- Minimum: 1
- Maximum: 52
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 23,371,403
- Mean: 23.37
- Standard deviation: 10.26
- Minimum: 2
- Maximum: 52
100K elements triple stream distribution
- Title: 100K elements triple stream distribution
- Identifier:
stream-100k
- Has file name:
stream_100K.tar.gz
- Has distribution type:
- Partial distribution (rb:partialDistribution)
- Triple stream distribution (rb:tripleStreamDistribution)
- Has stream element count: 100,000
- Byte size: 10.11 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
d94095f305b0e08b805f4a789243a222
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
b11cc839f3a2c81fbd067a14972d48dd353966f2
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/linked-spending/dev/files/stream_100K.tar.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 2,497,664
- Unique count (estimated): 210,494
- Mean: 24.98
- Standard deviation: 2.52
- Minimum: 3
- Maximum: 30
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 99,849
- Mean: 1.00
- Standard deviation: 0.04
- Minimum: 0
- Maximum: 1
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 809,395
- Unique count (estimated): 186,976
- Mean: 8.09
- Standard deviation: 2.49
- Minimum: 0
- Maximum: 17
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 709,140
- Unique count (estimated): 184,438
- Mean: 7.09
- Standard deviation: 2.49
- Minimum: 0
- Maximum: 17
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 100,255
- Unique count (estimated): 2,538
- Mean: 1.00
- Standard deviation: 0.07
- Minimum: 0
- Maximum: 2
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 0
- Unique count (estimated): 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 100,000
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 1,716,279
- Mean: 17.16
- Standard deviation: 2.45
- Minimum: 2
- Maximum: 23
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 1,590,629
- Mean: 15.91
- Standard deviation: 2.41
- Minimum: 1
- Maximum: 34
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 1,716,898
- Mean: 17.17
- Standard deviation: 2.44
- Minimum: 2
- Maximum: 34
100K elements flat distribution
- Title: 100K elements flat distribution
- Identifier: flat-100k
- Has file name:
flat_100K.nt.gz
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 100,000
- Byte size: 17.38 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
ce97e7f828d88a7f7d2c11a87a56d666
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
6b961dd25cff74632dec75caf7dc0eca3a40b6e2
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/linked-spending/dev/files/flat_100K.nt.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 2,497,664
- Unique count (estimated): 210,494
- Mean: 24.98
- Standard deviation: 2.52
- Minimum: 3
- Maximum: 30
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 99,849
- Mean: 1.00
- Standard deviation: 0.04
- Minimum: 0
- Maximum: 1
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 809,395
- Unique count (estimated): 186,976
- Mean: 8.09
- Standard deviation: 2.49
- Minimum: 0
- Maximum: 17
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 709,140
- Unique count (estimated): 184,438
- Mean: 7.09
- Standard deviation: 2.49
- Minimum: 0
- Maximum: 17
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 100,255
- Unique count (estimated): 2,538
- Mean: 1.00
- Standard deviation: 0.07
- Minimum: 0
- Maximum: 2
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 0
- Unique count (estimated): 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 100,000
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 1,716,279
- Mean: 17.16
- Standard deviation: 2.45
- Minimum: 2
- Maximum: 23
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 1,590,629
- Mean: 15.91
- Standard deviation: 2.41
- Minimum: 1
- Maximum: 34
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 1,716,898
- Mean: 17.17
- Standard deviation: 2.44
- Minimum: 2
- Maximum: 34
10K elements triple stream distribution
- Title: 10K elements triple stream distribution
- Identifier: stream-10k
- Has file name:
stream_10K.tar.gz
- Has distribution type:
- Partial distribution (rb:partialDistribution)
- Triple stream distribution (rb:tripleStreamDistribution)
- Has stream element count: 10,000
- Byte size: 1.26 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
eacc487177540e482ab48b9a1e57b3a5
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
7385871a3a4137ea11fc6fe6e5dcbe7e51c166a0
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/linked-spending/dev/files/stream_10K.tar.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 226,552
- Unique count (estimated): 10,809
- Mean: 22.66
- Standard deviation: 6.07
- Minimum: 3
- Maximum: 30
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 9,907
- Mean: 0.99
- Standard deviation: 0.10
- Minimum: 0
- Maximum: 1
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 86,396
- Unique count (estimated): 32,505
- Mean: 8.64
- Standard deviation: 5.27
- Minimum: 0
- Maximum: 16
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 76,087
- Unique count (estimated): 31,529
- Mean: 7.61
- Standard deviation: 5.28
- Minimum: 0
- Maximum: 15
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 10,309
- Unique count (estimated): 976
- Mean: 1.03
- Standard deviation: 0.22
- Minimum: 0
- Maximum: 2
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 0
- Unique count (estimated): 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 10,000
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 157,827
- Mean: 15.78
- Standard deviation: 5.84
- Minimum: 2
- Maximum: 23
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 155,028
- Mean: 15.50
- Standard deviation: 5.43
- Minimum: 1
- Maximum: 23
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 158,342
- Mean: 15.83
- Standard deviation: 5.82
- Minimum: 2
- Maximum: 23
10K elements flat distribution
- Title: 10K elements flat distribution
- Identifier: flat-10k
- Has file name:
flat_10K.nt.gz
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 10,000
- Byte size: 2.00 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
a5922a27b5f4a9ef998ed1024814a4d6
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
5f87f86ec2ba6b1983cccc3bf22eb1644c7e1966
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/linked-spending/dev/files/flat_10K.nt.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 226,552
- Unique count (estimated): 10,809
- Mean: 22.66
- Standard deviation: 6.07
- Minimum: 3
- Maximum: 30
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 9,907
- Mean: 0.99
- Standard deviation: 0.10
- Minimum: 0
- Maximum: 1
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 86,396
- Unique count (estimated): 32,505
- Mean: 8.64
- Standard deviation: 5.27
- Minimum: 0
- Maximum: 16
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 76,087
- Unique count (estimated): 31,529
- Mean: 7.61
- Standard deviation: 5.28
- Minimum: 0
- Maximum: 15
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 10,309
- Unique count (estimated): 976
- Mean: 1.03
- Standard deviation: 0.22
- Minimum: 0
- Maximum: 2
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 0
- Unique count (estimated): 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 10,000
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 157,827
- Mean: 15.78
- Standard deviation: 5.84
- Minimum: 2
- Maximum: 23
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 155,028
- Mean: 15.50
- Standard deviation: 5.43
- Minimum: 1
- Maximum: 23
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 158,342
- Mean: 15.83
- Standard deviation: 5.82
- Minimum: 2
- Maximum: 23