Skip to content

dbpedia-live (1.0.0)

DBpedia Live was a real-time service that monitored edits on Wikipedia and published a stream of changes to the DBpedia knowledge graph. This dataset contains only the "added" triples in the stream, so it does not include deletes or other types of changes. Only one month (January 2014) is covered at the moment, but the dataset can be easily expanded in the future (the service stopped functioning in 2021). The stream's elements are irregular in size, depending on the volume of traffic on Wikipedia at a given moment and how the DBpedia Live service was able to cope with it. See also the paper.

The dataset was extensively cleaned to fix or remove bad IRIs, bad Unicode, and invalid literals.

Info

Download this metadata in RDF: Turtle, N-Triples, RDF/XML
Source repository: dbpedia-live

General information

Technical metadata

  • Has stream element type: Triples (rb:triples)
  • Has stream element count: 166,204
  • Has stream element split:
    • Type: Stream elements split by time (rb:TimeStreamElementSplit)
    • Has temporal property: http://dbpedia.org/ontology/wikiPageExtracted
    • Comment: Each element corresponds to a batch of recent changes from Wikipedia. The size of the batch may have been influenced by the traffic on Wikipedia, the load on the system, and other factors, so the element sizes are irregular.
  • Uses ontology: http://dbpedia.org/ontology/
  • Conforms to W3C RDF 1.1 specification: yes
  • Conforms to W3C RDF-star draft specification as of December 17, 2021: yes
  • Uses generalized triples: no
  • Uses generalized RDF datasets: no
  • Uses RDF-star: no

Distributions

Full triple stream distribution

Has statistics

IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Sum: 16,654,403
  • Unique count (estimated): 11,540,684
  • Mean: 100.20
  • Standard deviation: 205.40
  • Minimum: 2
  • Maximum: 3,727
Blank node count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Sum: 5,911,441
  • Unique count (estimated): 3,597,388
  • Mean: 35.57
  • Standard deviation: 120.44
  • Minimum: 0
  • Maximum: 6,077
Simple literal count statistics
  • Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
  • Sum: 0
  • Unique count (estimated): 0
  • Mean: 0.00
  • Standard deviation: 0.00
  • Minimum: 0
  • Maximum: 0
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Sum: 3,958,152
  • Unique count (estimated): 2,729,961
  • Mean: 23.82
  • Standard deviation: 80.44
  • Minimum: 0
  • Maximum: 3,455
Language string count statistics
  • Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
  • Sum: 1,953,289
  • Unique count (estimated): 866,665
  • Mean: 11.75
  • Standard deviation: 44.75
  • Minimum: 0
  • Maximum: 2,622
Quoted triple count statistics
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Sum: 11,290,262
  • Mean: 67.93
  • Standard deviation: 130.41
  • Minimum: 1
  • Maximum: 1,961
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Sum: 1,881,262
  • Mean: 11.32
  • Standard deviation: 26.04
  • Minimum: 1
  • Maximum: 436
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Sum: 10,060,633
  • Mean: 60.53
  • Standard deviation: 194.28
  • Minimum: 1
  • Maximum: 6,658
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Sum: 0
  • Mean: 0.00
  • Standard deviation: 0.00
  • Minimum: 0
  • Maximum: 0
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Sum: 21,831,109
  • Mean: 131.35
  • Standard deviation: 319.79
  • Minimum: 1
  • Maximum: 8,275

Full flat distribution

Has statistics

IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Sum: 16,654,403
  • Unique count (estimated): 11,540,684
  • Mean: 100.20
  • Standard deviation: 205.40
  • Minimum: 2
  • Maximum: 3,727
Blank node count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Sum: 5,911,441
  • Unique count (estimated): 3,597,388
  • Mean: 35.57
  • Standard deviation: 120.44
  • Minimum: 0
  • Maximum: 6,077
Simple literal count statistics
  • Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
  • Sum: 0
  • Unique count (estimated): 0
  • Mean: 0.00
  • Standard deviation: 0.00
  • Minimum: 0
  • Maximum: 0
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Sum: 3,958,152
  • Unique count (estimated): 2,729,961
  • Mean: 23.82
  • Standard deviation: 80.44
  • Minimum: 0
  • Maximum: 3,455
Language string count statistics
  • Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
  • Sum: 1,953,289
  • Unique count (estimated): 866,665
  • Mean: 11.75
  • Standard deviation: 44.75
  • Minimum: 0
  • Maximum: 2,622
Quoted triple count statistics
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Sum: 11,290,262
  • Mean: 67.93
  • Standard deviation: 130.41
  • Minimum: 1
  • Maximum: 1,961
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Sum: 1,881,262
  • Mean: 11.32
  • Standard deviation: 26.04
  • Minimum: 1
  • Maximum: 436
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Sum: 10,060,633
  • Mean: 60.53
  • Standard deviation: 194.28
  • Minimum: 1
  • Maximum: 6,658
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Sum: 0
  • Mean: 0.00
  • Standard deviation: 0.00
  • Minimum: 0
  • Maximum: 0
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Sum: 21,831,109
  • Mean: 131.35
  • Standard deviation: 319.79
  • Minimum: 1
  • Maximum: 8,275

100K elements triple stream distribution

Has statistics

IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Sum: 13,631,942
  • Unique count (estimated): 9,703,320
  • Mean: 136.32
  • Standard deviation: 224.18
  • Minimum: 2
  • Maximum: 2,681
Blank node count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Sum: 4,561,835
  • Unique count (estimated): 2,808,084
  • Mean: 45.62
  • Standard deviation: 128.93
  • Minimum: 1
  • Maximum: 6,077
Simple literal count statistics
  • Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
  • Sum: 0
  • Unique count (estimated): 0
  • Mean: 0.00
  • Standard deviation: 0.00
  • Minimum: 0
  • Maximum: 0
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Sum: 2,997,501
  • Unique count (estimated): 2,056,214
  • Mean: 29.98
  • Standard deviation: 85.08
  • Minimum: 0
  • Maximum: 3,455
Language string count statistics
  • Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
  • Sum: 1,564,334
  • Unique count (estimated): 751,830
  • Mean: 15.64
  • Standard deviation: 49.61
  • Minimum: 0
  • Maximum: 2,622
Quoted triple count statistics
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Sum: 9,361,620
  • Mean: 93.62
  • Standard deviation: 144.77
  • Minimum: 1
  • Maximum: 1,961
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Sum: 1,471,621
  • Mean: 14.72
  • Standard deviation: 28.34
  • Minimum: 1
  • Maximum: 436
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Sum: 7,860,478
  • Mean: 78.60
  • Standard deviation: 206.43
  • Minimum: 1
  • Maximum: 6,658
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Sum: 0
  • Mean: 0.00
  • Standard deviation: 0.00
  • Minimum: 0
  • Maximum: 0
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Sum: 17,814,033
  • Mean: 178.14
  • Standard deviation: 347.79
  • Minimum: 1
  • Maximum: 8,275

100K elements flat distribution

Has statistics

IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Sum: 13,631,942
  • Unique count (estimated): 9,703,320
  • Mean: 136.32
  • Standard deviation: 224.18
  • Minimum: 2
  • Maximum: 2,681
Blank node count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Sum: 4,561,835
  • Unique count (estimated): 2,808,084
  • Mean: 45.62
  • Standard deviation: 128.93
  • Minimum: 1
  • Maximum: 6,077
Simple literal count statistics
  • Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
  • Sum: 0
  • Unique count (estimated): 0
  • Mean: 0.00
  • Standard deviation: 0.00
  • Minimum: 0
  • Maximum: 0
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Sum: 2,997,501
  • Unique count (estimated): 2,056,214
  • Mean: 29.98
  • Standard deviation: 85.08
  • Minimum: 0
  • Maximum: 3,455
Language string count statistics
  • Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
  • Sum: 1,564,334
  • Unique count (estimated): 751,830
  • Mean: 15.64
  • Standard deviation: 49.61
  • Minimum: 0
  • Maximum: 2,622
Quoted triple count statistics
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Sum: 9,361,620
  • Mean: 93.62
  • Standard deviation: 144.77
  • Minimum: 1
  • Maximum: 1,961
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Sum: 1,471,621
  • Mean: 14.72
  • Standard deviation: 28.34
  • Minimum: 1
  • Maximum: 436
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Sum: 7,860,478
  • Mean: 78.60
  • Standard deviation: 206.43
  • Minimum: 1
  • Maximum: 6,658
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Sum: 0
  • Mean: 0.00
  • Standard deviation: 0.00
  • Minimum: 0
  • Maximum: 0
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Sum: 17,814,033
  • Mean: 178.14
  • Standard deviation: 347.79
  • Minimum: 1
  • Maximum: 8,275

10K elements triple stream distribution

Has statistics

IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Sum: 4,227,641
  • Unique count (estimated): 3,163,701
  • Mean: 422.76
  • Standard deviation: 279.53
  • Minimum: 2
  • Maximum: 2,031
Blank node count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Sum: 1,566,963
  • Unique count (estimated): 1,068,951
  • Mean: 156.70
  • Standard deviation: 256.16
  • Minimum: 1
  • Maximum: 2,120
Simple literal count statistics
  • Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
  • Sum: 0
  • Unique count (estimated): 0
  • Mean: 0.00
  • Standard deviation: 0.00
  • Minimum: 0
  • Maximum: 0
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Sum: 1,080,669
  • Unique count (estimated): 798,768
  • Mean: 108.07
  • Standard deviation: 181.60
  • Minimum: 1
  • Maximum: 1,708
Language string count statistics
  • Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
  • Sum: 486,294
  • Unique count (estimated): 270,141
  • Mean: 48.63
  • Standard deviation: 79.92
  • Minimum: 0
  • Maximum: 1,333
Quoted triple count statistics
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Sum: 2,928,996
  • Mean: 292.90
  • Standard deviation: 121.33
  • Minimum: 1
  • Maximum: 1,449
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Sum: 430,909
  • Mean: 43.09
  • Standard deviation: 49.27
  • Minimum: 1
  • Maximum: 436
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Sum: 2,622,682
  • Mean: 262.27
  • Standard deviation: 416.36
  • Minimum: 1
  • Maximum: 2,750
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Sum: 0
  • Mean: 0.00
  • Standard deviation: 0.00
  • Minimum: 0
  • Maximum: 0
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Sum: 5,575,053
  • Mean: 557.51
  • Standard deviation: 529.45
  • Minimum: 1
  • Maximum: 4,905

10K elements flat distribution

Has statistics

IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Sum: 4,227,641
  • Unique count (estimated): 3,163,701
  • Mean: 422.76
  • Standard deviation: 279.53
  • Minimum: 2
  • Maximum: 2,031
Blank node count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Sum: 1,566,963
  • Unique count (estimated): 1,068,951
  • Mean: 156.70
  • Standard deviation: 256.16
  • Minimum: 1
  • Maximum: 2,120
Simple literal count statistics
  • Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
  • Sum: 0
  • Unique count (estimated): 0
  • Mean: 0.00
  • Standard deviation: 0.00
  • Minimum: 0
  • Maximum: 0
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Sum: 1,080,669
  • Unique count (estimated): 798,768
  • Mean: 108.07
  • Standard deviation: 181.60
  • Minimum: 1
  • Maximum: 1,708
Language string count statistics
  • Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
  • Sum: 486,294
  • Unique count (estimated): 270,141
  • Mean: 48.63
  • Standard deviation: 79.92
  • Minimum: 0
  • Maximum: 1,333
Quoted triple count statistics
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Sum: 2,928,996
  • Mean: 292.90
  • Standard deviation: 121.33
  • Minimum: 1
  • Maximum: 1,449
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Sum: 430,909
  • Mean: 43.09
  • Standard deviation: 49.27
  • Minimum: 1
  • Maximum: 436
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Sum: 2,622,682
  • Mean: 262.27
  • Standard deviation: 416.36
  • Minimum: 1
  • Maximum: 2,750
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Sum: 0
  • Mean: 0.00
  • Standard deviation: 0.00
  • Minimum: 0
  • Maximum: 0
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Sum: 5,575,053
  • Mean: 557.51
  • Standard deviation: 529.45
  • Minimum: 1
  • Maximum: 4,905