yago-annotated-facts (0.0.1)

This is a subset of the YAGO 4 knowledge base, which is based on Wikidata, in its version from February 24, 2020. The dataset includes only the fact annotations expressed in RDF-star, that is, facts about facts. Each stream element corresponds to one item in Wikidata.

General information

Technical metadata

  • Has stream element type: Triples (rb:triples)
  • Has stream element count: 617768
  • Has stream element split:
    • Type: Stream elements split by topic (rb:TopicStreamElementSplit)
    • Comment: Every stream element corresponds to one Wikidata item.
  • Uses ontology: http://schema.org/
  • Conforms to W3C RDF 1.1 specification: no
  • Conforms to W3C RDF-star draft specification as of December 17, 2021: yes
  • Uses generalized triples: no
  • Uses generalized RDF datasets: no
  • Uses RDF-star: yes
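
The metadata above says the dataset uses RDF-star and does not conform to RDF 1.1. The following minimal Python sketch illustrates why: a fact annotation uses a quoted triple as the subject of another triple, whereas RDF 1.1 allows only IRIs and blank nodes in the subject position. The `Triple` class, the helper functions, and the example.org IRIs and year are hypothetical illustrations; they are not taken from the dataset or from any RDF library.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class Triple:
    """A triple whose subject or object may itself be a (quoted) triple."""
    subject: Union[str, "Triple"]
    predicate: str
    obj: Union[str, "Triple"]


def count_quoted(t: Triple) -> int:
    """Count quoted triples nested anywhere inside t (t itself is not counted)."""
    total = 0
    for term in (t.subject, t.obj):
        if isinstance(term, Triple):
            total += 1 + count_quoted(term)
    return total


def is_rdf11_triple(t: Triple) -> bool:
    """True if t contains no quoted triples, so it could be a plain RDF 1.1 triple."""
    return count_quoted(t) == 0


def render(term: Union[str, Triple]) -> str:
    """Render a term, writing quoted triples with the << ... >> syntax."""
    if isinstance(term, Triple):
        return f"<< {render(term.subject)} {term.predicate} {render(term.obj)} >>"
    return term


# Hypothetical base fact (made-up IRIs, not from YAGO).
fact = Triple(
    "<http://example.org/alice>",
    "<http://example.org/worksFor>",
    "<http://example.org/acme>",
)

# Hypothetical annotation: a fact about the fact above.
annotation = Triple(
    fact,
    "<http://example.org/since>",
    '"2015"^^<http://www.w3.org/2001/XMLSchema#gYear>',
)

statement = f"{render(annotation.subject)} {annotation.predicate} {render(annotation.obj)} ."
print(statement)
print("quoted triples:", count_quoted(annotation))
print("plain RDF 1.1:", is_rdf11_triple(annotation))
```

In this sketch the annotation contains exactly one quoted triple, so it falls outside plain RDF 1.1, while the base fact on its own would not.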

Distributions

100K elements flat distribution

Has statistics

Blank node count statistics
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Unique count: 36870
  • Maximum: 49
  • Standard deviation: 0.9759486871757141
  • Sum: 186960
  • Minimum: 1
  • Mean: 1.8696
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Maximum: 0
  • Standard deviation: 0.0
  • Sum: 0
  • Minimum: 0
  • Mean: 0.0
IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Unique count: 2
  • Maximum: 2
  • Standard deviation: 0.49411928559812374
  • Sum: 157646
  • Minimum: 1
  • Mean: 1.57646
Language string count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Unique count: 36870
  • Maximum: 49
  • Standard deviation: 0.9759486871757141
  • Sum: 186960
  • Minimum: 1
  • Mean: 1.8696
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Maximum: 49
  • Standard deviation: 0.9759486871757141
  • Sum: 186960
  • Minimum: 1
  • Mean: 1.8696
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Maximum: 2
  • Standard deviation: 0.49411928559812374
  • Sum: 157646
  • Minimum: 1
  • Mean: 1.57646
Quoted triple count statistics
  • Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
  • Maximum: 1455
  • Standard deviation: 9.275426050031342
  • Sum: 226648
  • Minimum: 1
  • Mean: 2.26648
Simple literal count statistics
  • Type: Simple literal count statistics (rb:PlainLiteralCountStatistics)
  • Unique count: 0
  • Maximum: 0
  • Standard deviation: 0.0
  • Sum: 0
  • Minimum: 0
  • Mean: 0.0
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Maximum: 1455
  • Standard deviation: 9.275426050031342
  • Sum: 226648
  • Minimum: 1
  • Mean: 2.26648
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Maximum: 849
  • Standard deviation: 5.244191199708493
  • Sum: 146103
  • Minimum: 1
  • Mean: 1.46103
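
As a quick sanity check on the figures above, each reported mean should equal the reported sum divided by the number of stream elements. The sketch below assumes that the "100K elements" distribution contains exactly 100,000 elements (an assumption based on the distribution name) and that the statistics are computed per stream element; the dictionary keys are shorthand labels chosen for this example.

```python
import math

# Assumed from the distribution name "100K elements".
ELEMENT_COUNT = 100_000

# (sum, reported mean) pairs copied from the statistics listed above.
reported = {
    "datatype literal": (186_960, 1.8696),
    "IRI": (157_646, 1.57646),
    "quoted triple": (226_648, 2.26648),
    "subject": (146_103, 1.46103),
}

# Recompute each mean as sum / element count.
computed_means = {name: total / ELEMENT_COUNT for name, (total, _) in reported.items()}

for name, (total, mean) in reported.items():
    agrees = math.isclose(computed_means[name], mean, rel_tol=1e-12)
    print(f"{name}: {total} / {ELEMENT_COUNT} = {computed_means[name]} (agrees: {agrees})")
```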

100K elements triple stream distribution

Has statistics

Blank node count statistics
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Unique count: 36870
  • Maximum: 49
  • Standard deviation: 0.9759486871757141
  • Sum: 186960
  • Minimum: 1
  • Mean: 1.8696
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Maximum: 0
  • Standard deviation: 0.0
  • Sum: 0
  • Minimum: 0
  • Mean: 0.0
IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Unique count: 2
  • Maximum: 2
  • Standard deviation: 0.49411928559812374
  • Sum: 157646
  • Minimum: 1
  • Mean: 1.57646
Language string count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Unique count: 36870
  • Maximum: 49
  • Standard deviation: 0.9759486871757141
  • Sum: 186960
  • Minimum: 1
  • Mean: 1.8696
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Maximum: 49
  • Standard deviation: 0.9759486871757141
  • Sum: 186960
  • Minimum: 1
  • Mean: 1.8696
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Maximum: 2
  • Standard deviation: 0.49411928559812374
  • Sum: 157646
  • Minimum: 1
  • Mean: 1.57646
Quoted triple count statistics
  • Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
  • Maximum: 1455
  • Standard deviation: 9.275426050031342
  • Sum: 226648
  • Minimum: 1
  • Mean: 2.26648
Simple literal count statistics
  • Type: Simple literal count statistics (rb:PlainLiteralCountStatistics)
  • Unique count: 0
  • Maximum: 0
  • Standard deviation: 0.0
  • Sum: 0
  • Minimum: 0
  • Mean: 0.0
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Maximum: 1455
  • Standard deviation: 9.275426050031342
  • Sum: 226648
  • Minimum: 1
  • Mean: 2.26648
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Maximum: 849
  • Standard deviation: 5.244191199708493
  • Sum: 146103
  • Minimum: 1
  • Mean: 1.46103

10K elements flat distribution

Has statistics

Blank node count statistics
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Unique count: 7273
  • Maximum: 8
  • Standard deviation: 0.9091924988691885
  • Sum: 19370
  • Minimum: 1
  • Mean: 1.937
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Maximum: 0
  • Standard deviation: 0.0
  • Sum: 0
  • Minimum: 0
  • Mean: 0.0
IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Unique count: 2
  • Maximum: 2
  • Standard deviation: 0.4877499359302877
  • Sum: 16100
  • Minimum: 1
  • Mean: 1.61
Language string count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Unique count: 7273
  • Maximum: 8
  • Standard deviation: 0.9091924988691885
  • Sum: 19370
  • Minimum: 1
  • Mean: 1.937
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Maximum: 8
  • Standard deviation: 0.9091924988691885
  • Sum: 19370
  • Minimum: 1
  • Mean: 1.937
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Maximum: 2
  • Standard deviation: 0.4877499359302877
  • Sum: 16100
  • Minimum: 1
  • Mean: 1.61
Quoted triple count statistics
  • Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
  • Maximum: 10
  • Standard deviation: 1.340027876575708
  • Sum: 22977
  • Minimum: 1
  • Mean: 2.2977
Simple literal count statistics
  • Type: Simple literal count statistics (rb:PlainLiteralCountStatistics)
  • Unique count: 0
  • Maximum: 0
  • Standard deviation: 0.0
  • Sum: 0
  • Minimum: 0
  • Mean: 0.0
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Maximum: 10
  • Standard deviation: 1.340027876575708
  • Sum: 22977
  • Minimum: 1
  • Mean: 2.2977
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Maximum: 6
  • Standard deviation: 0.5261877611651563
  • Sum: 13762
  • Minimum: 1
  • Mean: 1.3762
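
The IRI count statistics above can be cross-checked by hand. The minimum is 1 and the maximum is 2, so, assuming the statistics are computed per stream element and that this distribution holds exactly 10,000 elements (inferred from its name), every element has either 1 or 2 IRIs. A mean of 1.61 then implies that 61% of elements have 2, and the standard deviation of such a two-valued quantity is sqrt(p(1 - p)). The sketch below reproduces the reported value with the population formula, which suggests, but does not confirm, that the page reports population rather than sample standard deviations.

```python
import math

ELEMENT_COUNT = 10_000  # assumed from the distribution name "10K elements"
IRI_SUM = 16_100        # reported sum of the IRI count statistics
REPORTED_STD = 0.4877499359302877

mean = IRI_SUM / ELEMENT_COUNT

# With counts restricted to 1 or 2, the share of elements with 2 is mean - 1.
share_with_two = mean - 1

# Population standard deviation of a two-valued (Bernoulli-like) quantity.
population_std = math.sqrt(share_with_two * (1 - share_with_two))

# Sample standard deviation (n - 1 in the denominator), for comparison.
sample_std = population_std * math.sqrt(ELEMENT_COUNT / (ELEMENT_COUNT - 1))

print("population std:", population_std)
print("sample std:    ", sample_std)
print("reported std:  ", REPORTED_STD)
```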

10K elements triple stream distribution

Has statistics

Blank node count statistics
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Unique count: 7273
  • Maximum: 8
  • Standard deviation: 0.9091924988691885
  • Sum: 19370
  • Minimum: 1
  • Mean: 1.937
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Maximum: 0
  • Standard deviation: 0.0
  • Sum: 0
  • Minimum: 0
  • Mean: 0.0
IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Unique count: 2
  • Maximum: 2
  • Standard deviation: 0.4877499359302877
  • Sum: 16100
  • Minimum: 1
  • Mean: 1.61
Language string count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Unique count: 7273
  • Maximum: 8
  • Standard deviation: 0.9091924988691885
  • Sum: 19370
  • Minimum: 1
  • Mean: 1.937
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Maximum: 8
  • Standard deviation: 0.9091924988691885
  • Sum: 19370
  • Minimum: 1
  • Mean: 1.937
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Maximum: 2
  • Standard deviation: 0.4877499359302877
  • Sum: 16100
  • Minimum: 1
  • Mean: 1.61
Quoted triple count statistics
  • Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
  • Maximum: 10
  • Standard deviation: 1.340027876575708
  • Sum: 22977
  • Minimum: 1
  • Mean: 2.2977
Simple literal count statistics
  • Type: Simple literal count statistics (rb:PlainLiteralCountStatistics)
  • Unique count: 0
  • Maximum: 0
  • Standard deviation: 0.0
  • Sum: 0
  • Minimum: 0
  • Mean: 0.0
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Maximum: 10
  • Standard deviation: 1.340027876575708
  • Sum: 22977
  • Minimum: 1
  • Mean: 2.2977
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Maximum: 6
  • Standard deviation: 0.5261877611651563
  • Sum: 13762
  • Minimum: 1
  • Mean: 1.3762

Full flat distribution

Has statistics

Blank node count statistics
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Unique count: 56980
  • Maximum: 66
  • Standard deviation: 2.4972773185035106
  • Sum: 1735411
  • Minimum: 1
  • Mean: 2.8091629867523085
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Maximum: 0
  • Standard deviation: 0.0
  • Sum: 0
  • Minimum: 0
  • Mean: 0.0
IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Unique count: 2
  • Maximum: 2
  • Standard deviation: 0.48361126276638566
  • Sum: 1005087
  • Minimum: 1
  • Mean: 1.6269651390165887
Language string count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Unique count: 56980
  • Maximum: 66
  • Standard deviation: 2.4972773185035106
  • Sum: 1735411
  • Minimum: 1
  • Mean: 2.8091629867523085
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Maximum: 66
  • Standard deviation: 2.4972773185035106
  • Sum: 1735411
  • Minimum: 1
  • Mean: 2.8091629867523085
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Maximum: 2
  • Standard deviation: 0.48361126276638566
  • Sum: 1005087
  • Minimum: 1
  • Mean: 1.6269651390165887
Quoted triple count statistics
  • Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
  • Maximum: 1455
  • Standard deviation: 6.1013913921407115
  • Sum: 2484547
  • Minimum: 1
  • Mean: 4.021812395591873
Simple literal count statistics
  • Type: Simple literal count statistics (rb:PlainLiteralCountStatistics)
  • Unique count: 0
  • Maximum: 0
  • Standard deviation: 0.0
  • Sum: 0
  • Minimum: 0
  • Mean: 0.0
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Maximum: 1455
  • Standard deviation: 6.1013913921407115
  • Sum: 2484547
  • Minimum: 1
  • Mean: 4.021812395591873
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Maximum: 849
  • Standard deviation: 3.0405068802704873
  • Sum: 1392164
  • Minimum: 1
  • Mean: 2.2535385452143846

Full triple stream distribution

Has statistics

Blank node count statistics
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Unique count: 56980
  • Maximum: 66
  • Standard deviation: 2.4972773185035106
  • Sum: 1735411
  • Minimum: 1
  • Mean: 2.8091629867523085
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Maximum: 0
  • Standard deviation: 0.0
  • Sum: 0
  • Minimum: 0
  • Mean: 0.0
IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Unique count: 2
  • Maximum: 2
  • Standard deviation: 0.48361126276638566
  • Sum: 1005087
  • Minimum: 1
  • Mean: 1.6269651390165887
Language string count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Unique count: 56980
  • Maximum: 66
  • Standard deviation: 2.4972773185035106
  • Sum: 1735411
  • Minimum: 1
  • Mean: 2.8091629867523085
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Maximum: 66
  • Standard deviation: 2.4972773185035106
  • Sum: 1735411
  • Minimum: 1
  • Mean: 2.8091629867523085
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Maximum: 2
  • Standard deviation: 0.48361126276638566
  • Sum: 1005087
  • Minimum: 1
  • Mean: 1.6269651390165887
Quoted triple count statistics
  • Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
  • Maximum: 1455
  • Standard deviation: 6.1013913921407115
  • Sum: 2484547
  • Minimum: 1
  • Mean: 4.021812395591873
Simple literal count statistics
  • Type: Simple literal count statistics (rb:PlainLiteralCountStatistics)
  • Unique count: 0
  • Maximum: 0
  • Standard deviation: 0.0
  • Sum: 0
  • Minimum: 0
  • Mean: 0.0
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Maximum: 1455
  • Standard deviation: 6.1013913921407115
  • Sum: 2484547
  • Minimum: 1
  • Mean: 4.021812395591873
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Maximum: 849
  • Standard deviation: 3.0405068802704873
  • Sum: 1392164
  • Minimum: 1
  • Mean: 2.2535385452143846
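
The full distribution does not state its size in its name, but the size can be recovered from the statistics: dividing each reported sum by its reported mean should give the number of stream elements. The sketch below does this for four of the statistics above, assuming they are computed per stream element; the dictionary keys are shorthand labels chosen for this example. The results can be compared with the stream element count of 617768 given in the technical metadata.

```python
# (sum, mean) pairs copied from the full distribution statistics above.
reported = {
    "datatype literal": (1_735_411, 2.8091629867523085),
    "IRI": (1_005_087, 1.6269651390165887),
    "quoted triple": (2_484_547, 4.021812395591873),
    "subject": (1_392_164, 2.2535385452143846),
}

# Stream element count stated in the technical metadata.
STATED_ELEMENT_COUNT = 617_768

# The element count implied by each statistic is sum / mean, rounded to an integer.
implied_counts = {name: round(total / mean) for name, (total, mean) in reported.items()}

for name, count in implied_counts.items():
    print(f"{name}: implied element count {count}, "
          f"matches metadata: {count == STATED_ELEMENT_COUNT}")
```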