nanopubs (0.1.0)

Nanopublications are small units of publishable information, used for scientific results and more. This dataset is based on a subset of a dump of all nanopublications available as of April 5, 2018. Only the first 5M freely licensed nanopubs were included. Each nanopub consists of several RDF graphs and is therefore an RDF dataset. The included data is primarily from the biomedical domain. More information: paper, website.
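As a rough illustration of what "several RDF graphs" means here, the sketch below models one nanopub as a list of quads and groups them by named graph. It assumes the usual nanopublication layout of four graphs (head, assertion, provenance, publication info). All IRIs, prefixed names, and the `group_by_graph` helper are hypothetical and are not taken from the dataset.

```python
from collections import defaultdict

# Hypothetical base IRI, used only for illustration.
NP = "http://example.org/np1"

# Each quad is (subject, predicate, object, graph).
# Prefixed names such as np: and prov: are abbreviations, not real dataset content.
quads = [
    (NP, "rdf:type", "np:Nanopublication", NP + "#Head"),
    (NP, "np:hasAssertion", NP + "#assertion", NP + "#Head"),
    (NP, "np:hasProvenance", NP + "#provenance", NP + "#Head"),
    (NP, "np:hasPublicationInfo", NP + "#pubinfo", NP + "#Head"),
    ("ex:geneA", "ex:associatedWith", "ex:diseaseB", NP + "#assertion"),
    (NP + "#assertion", "prov:wasDerivedFrom", "ex:study1", NP + "#provenance"),
    (NP, "dct:created", '"2018-04-05"', NP + "#pubinfo"),
]


def group_by_graph(quads):
    """Group (s, p, o, g) quads into a mapping of graph name to triples."""
    graphs = defaultdict(list)
    for s, p, o, g in quads:
        graphs[g].append((s, p, o))
    return dict(graphs)


graphs = group_by_graph(quads)
graph_count = len(graphs)
statement_count = sum(len(triples) for triples in graphs.values())

print(graph_count, statement_count)
```

In this toy example the nanopub has four named graphs, which is in line with the graph count statistics listed for every distribution of this dataset.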

Info

Download this metadata in RDF: Turtle, N-Triples, RDF/XML
Source repository: nanopubs

General information

Technical metadata

Distributions

Full quad stream distribution

Has statistics

IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Sum: 236,997,015
  • Unique count (estimated): 47,162,247
  • Mean: 47.40
  • Standard deviation: 5.68
  • Minimum: 25
  • Maximum: 142
Blank node count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Sum: 35,643,551
  • Unique count (estimated): 3,277,169
  • Mean: 7.13
  • Standard deviation: 0.60
  • Minimum: 3
  • Maximum: 34
Simple literal count statistics
  • Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
  • Sum: 18,591,732
  • Unique count (estimated): 2,222,348
  • Mean: 3.72
  • Standard deviation: 2.05
  • Minimum: 1
  • Maximum: 20
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Sum: 8,423,981
  • Unique count (estimated): 21,459
  • Mean: 1.68
  • Standard deviation: 0.59
  • Minimum: 0
  • Maximum: 4
Language string count statistics
  • Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
  • Sum: 8,627,838
  • Unique count (estimated): 1,033,269
  • Mean: 1.73
  • Standard deviation: 1.48
  • Minimum: 0
  • Maximum: 32
Quoted triple count statistics
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Sum: 46,588,350
  • Mean: 9.32
  • Standard deviation: 4.43
  • Minimum: 3
  • Maximum: 68
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Sum: 97,782,581
  • Mean: 19.56
  • Standard deviation: 1.81
  • Minimum: 11
  • Maximum: 22
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Sum: 159,866,280
  • Mean: 31.97
  • Standard deviation: 6.03
  • Minimum: 14
  • Maximum: 157
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Sum: 20,000,000
  • Mean: 4.00
  • Standard deviation: 0.00
  • Minimum: 4
  • Maximum: 4
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Sum: 171,885,662
  • Mean: 34.38
  • Standard deviation: 9.93
  • Minimum: 16
  • Maximum: 196
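
The figures above can be cross-checked with a little arithmetic. The sketch below assumes that each mean is the sum divided by the number of stream elements, and derives that number from the graph count statistics (a sum of 20,000,000 with exactly 4 graphs per element). The `mean_matches` helper and the rounding tolerance are illustrative choices, not part of the published metadata.

```python
# Sums and means copied from the "Full quad stream distribution" statistics.
stats = {
    "IRI": (236_997_015, 47.40),
    "Literal": (35_643_551, 7.13),
    "Simple literal": (18_591_732, 3.72),
    "Datatype literal": (8_423_981, 1.68),
    "Language string": (8_627_838, 1.73),
    "Subject": (46_588_350, 9.32),
    "Predicate": (97_782_581, 19.56),
    "Object": (159_866_280, 31.97),
    "Statement": (171_885_662, 34.38),
}

GRAPH_SUM = 20_000_000
GRAPHS_PER_ELEMENT = 4  # minimum and maximum are both 4

# Number of stream elements implied by the graph statistics.
elements = GRAPH_SUM // GRAPHS_PER_ELEMENT


def mean_matches(total, reported_mean, n, tolerance=0.005):
    """True if total / n agrees with the reported mean to two decimals."""
    return abs(total / n - reported_mean) <= tolerance


checks = {
    name: mean_matches(total, mean, elements)
    for name, (total, mean) in stats.items()
}

# The three literal kinds should add up to the overall literal sum.
literal_parts_total = (
    stats["Simple literal"][0]
    + stats["Datatype literal"][0]
    + stats["Language string"][0]
)

print(elements, all(checks.values()), literal_parts_total == stats["Literal"][0])
```

Under these assumptions the implied element count is 5,000,000, which agrees with the "first 5M" nanopubs mentioned in the dataset description.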

Full flat distribution

Has statistics

IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Sum: 236,997,015
  • Unique count (estimated): 47,162,247
  • Mean: 47.40
  • Standard deviation: 5.68
  • Minimum: 25
  • Maximum: 142
Blank node count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Sum: 35,643,551
  • Unique count (estimated): 3,277,169
  • Mean: 7.13
  • Standard deviation: 0.60
  • Minimum: 3
  • Maximum: 34
Simple literal count statistics
  • Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
  • Sum: 18,591,732
  • Unique count (estimated): 2,222,348
  • Mean: 3.72
  • Standard deviation: 2.05
  • Minimum: 1
  • Maximum: 20
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Sum: 8,423,981
  • Unique count (estimated): 21,459
  • Mean: 1.68
  • Standard deviation: 0.59
  • Minimum: 0
  • Maximum: 4
Language string count statistics
  • Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
  • Sum: 8,627,838
  • Unique count (estimated): 1,033,269
  • Mean: 1.73
  • Standard deviation: 1.48
  • Minimum: 0
  • Maximum: 32
Quoted triple count statistics
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Sum: 46,588,350
  • Mean: 9.32
  • Standard deviation: 4.43
  • Minimum: 3
  • Maximum: 68
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Sum: 97,782,581
  • Mean: 19.56
  • Standard deviation: 1.81
  • Minimum: 11
  • Maximum: 22
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Sum: 159,866,280
  • Mean: 31.97
  • Standard deviation: 6.03
  • Minimum: 14
  • Maximum: 157
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Sum: 20,000,000
  • Mean: 4.00
  • Standard deviation: 0.00
  • Minimum: 4
  • Maximum: 4
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Sum: 171,885,662
  • Mean: 34.38
  • Standard deviation: 9.93
  • Minimum: 16
  • Maximum: 196

1M elements quad stream distribution

Has statistics

IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Sum: 48,861,863
  • Unique count (estimated): 7,017,131
  • Mean: 48.86
  • Standard deviation: 3.03
  • Minimum: 40
  • Maximum: 100
Blank node count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Sum: 7,030,328
  • Unique count (estimated): 620,603
  • Mean: 7.03
  • Standard deviation: 0.24
  • Minimum: 7
  • Maximum: 22
Simple literal count statistics
  • Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
  • Sum: 2,321,606
  • Unique count (estimated): 86,968
  • Mean: 2.32
  • Standard deviation: 1.11
  • Minimum: 2
  • Maximum: 20
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Sum: 1,944,837
  • Unique count (estimated): 2,895
  • Mean: 1.94
  • Standard deviation: 0.27
  • Minimum: 1
  • Maximum: 3
Language string count statistics
  • Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
  • Sum: 2,763,885
  • Unique count (estimated): 530,740
  • Mean: 2.76
  • Standard deviation: 0.81
  • Minimum: 0
  • Maximum: 3
Quoted triple count statistics
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Sum: 9,350,387
  • Mean: 9.35
  • Standard deviation: 2.33
  • Minimum: 6
  • Maximum: 52
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Sum: 20,738,344
  • Mean: 20.74
  • Standard deviation: 0.94
  • Minimum: 17
  • Maximum: 22
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Sum: 32,151,723
  • Mean: 32.15
  • Standard deviation: 3.09
  • Minimum: 27
  • Maximum: 89
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Sum: 4,000,000
  • Mean: 4.00
  • Standard deviation: 0.00
  • Minimum: 4
  • Maximum: 4
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Sum: 33,423,542
  • Mean: 33.42
  • Standard deviation: 4.64
  • Minimum: 28
  • Maximum: 135

1M elements flat distribution

Has statistics

IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Sum: 48,861,863
  • Unique count (estimated): 7,017,131
  • Mean: 48.86
  • Standard deviation: 3.03
  • Minimum: 40
  • Maximum: 100
Blank node count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Sum: 7,030,328
  • Unique count (estimated): 620,603
  • Mean: 7.03
  • Standard deviation: 0.24
  • Minimum: 7
  • Maximum: 22
Simple literal count statistics
  • Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
  • Sum: 2,321,606
  • Unique count (estimated): 86,968
  • Mean: 2.32
  • Standard deviation: 1.11
  • Minimum: 2
  • Maximum: 20
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Sum: 1,944,837
  • Unique count (estimated): 2,895
  • Mean: 1.94
  • Standard deviation: 0.27
  • Minimum: 1
  • Maximum: 3
Language string count statistics
  • Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
  • Sum: 2,763,885
  • Unique count (estimated): 530,740
  • Mean: 2.76
  • Standard deviation: 0.81
  • Minimum: 0
  • Maximum: 3
Quoted triple count statistics
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Sum: 9,350,387
  • Mean: 9.35
  • Standard deviation: 2.33
  • Minimum: 6
  • Maximum: 52
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Sum: 20,738,344
  • Mean: 20.74
  • Standard deviation: 0.94
  • Minimum: 17
  • Maximum: 22
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Sum: 32,151,723
  • Mean: 32.15
  • Standard deviation: 3.09
  • Minimum: 27
  • Maximum: 89
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Sum: 4,000,000
  • Mean: 4.00
  • Standard deviation: 0.00
  • Minimum: 4
  • Maximum: 4
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Sum: 33,423,542
  • Mean: 33.42
  • Standard deviation: 4.64
  • Minimum: 28
  • Maximum: 135

100K elements quad stream distribution

Has statistics

IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Sum: 4,907,266
  • Unique count (estimated): 671,463
  • Mean: 49.07
  • Standard deviation: 1.94
  • Minimum: 44
  • Maximum: 50
Blank node count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Sum: 700,000
  • Unique count (estimated): 60,262
  • Mean: 7.00
  • Standard deviation: 0.00
  • Minimum: 7
  • Maximum: 7
Simple literal count statistics
  • Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
  • Sum: 200,000
  • Unique count (estimated): 4
  • Mean: 2.00
  • Standard deviation: 0.00
  • Minimum: 2
  • Maximum: 2
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Sum: 200,000
  • Unique count (estimated): 154
  • Mean: 2.00
  • Standard deviation: 0.00
  • Minimum: 2
  • Maximum: 2
Language string count statistics
  • Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
  • Sum: 300,000
  • Unique count (estimated): 60,104
  • Mean: 3.00
  • Standard deviation: 0.00
  • Minimum: 3
  • Maximum: 3
Quoted triple count statistics
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Sum: 925,880
  • Mean: 9.26
  • Standard deviation: 1.55
  • Minimum: 6
  • Maximum: 10
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Sum: 2,100,000
  • Mean: 21.00
  • Standard deviation: 0.00
  • Minimum: 21
  • Maximum: 21
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Sum: 3,207,266
  • Mean: 32.07
  • Standard deviation: 1.94
  • Minimum: 27
  • Maximum: 33
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Sum: 400,000
  • Mean: 4.00
  • Standard deviation: 0.00
  • Minimum: 4
  • Maximum: 4
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Sum: 3,307,350
  • Mean: 33.07
  • Standard deviation: 1.94
  • Minimum: 29
  • Maximum: 34

100K elements flat distribution

Has statistics

IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Sum: 4,907,266
  • Unique count (estimated): 671,463
  • Mean: 49.07
  • Standard deviation: 1.94
  • Minimum: 44
  • Maximum: 50
Blank node count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Sum: 700,000
  • Unique count (estimated): 60,262
  • Mean: 7.00
  • Standard deviation: 0.00
  • Minimum: 7
  • Maximum: 7
Simple literal count statistics
  • Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
  • Sum: 200,000
  • Unique count (estimated): 4
  • Mean: 2.00
  • Standard deviation: 0.00
  • Minimum: 2
  • Maximum: 2
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Sum: 200,000
  • Unique count (estimated): 154
  • Mean: 2.00
  • Standard deviation: 0.00
  • Minimum: 2
  • Maximum: 2
Language string count statistics
  • Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
  • Sum: 300,000
  • Unique count (estimated): 60,104
  • Mean: 3.00
  • Standard deviation: 0.00
  • Minimum: 3
  • Maximum: 3
Quoted triple count statistics
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Sum: 925,880
  • Mean: 9.26
  • Standard deviation: 1.55
  • Minimum: 6
  • Maximum: 10
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Sum: 2,100,000
  • Mean: 21.00
  • Standard deviation: 0.00
  • Minimum: 21
  • Maximum: 21
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Sum: 3,207,266
  • Mean: 32.07
  • Standard deviation: 1.94
  • Minimum: 27
  • Maximum: 33
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Sum: 400,000
  • Mean: 4.00
  • Standard deviation: 0.00
  • Minimum: 4
  • Maximum: 4
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Sum: 3,307,350
  • Mean: 33.07
  • Standard deviation: 1.94
  • Minimum: 29
  • Maximum: 34

10K elements quad stream distribution

Has statistics

IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Sum: 500,000
  • Unique count (estimated): 73,219
  • Mean: 50.00
  • Standard deviation: 0.00
  • Minimum: 50
  • Maximum: 50
Blank node count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Sum: 70,000
  • Unique count (estimated): 6,874
  • Mean: 7.00
  • Standard deviation: 0.00
  • Minimum: 7
  • Maximum: 7
Simple literal count statistics
  • Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
  • Sum: 20,000
  • Unique count (estimated): 2
  • Mean: 2.00
  • Standard deviation: 0.00
  • Minimum: 2
  • Maximum: 2
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Sum: 20,000
  • Unique count (estimated): 15
  • Mean: 2.00
  • Standard deviation: 0.00
  • Minimum: 2
  • Maximum: 2
Language string count statistics
  • Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
  • Sum: 30,000
  • Unique count (estimated): 6,857
  • Mean: 3.00
  • Standard deviation: 0.00
  • Minimum: 3
  • Maximum: 3
Quoted triple count statistics
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Sum: 100,000
  • Mean: 10.00
  • Standard deviation: 0.00
  • Minimum: 10
  • Maximum: 10
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Sum: 210,000
  • Mean: 21.00
  • Standard deviation: 0.00
  • Minimum: 21
  • Maximum: 21
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Sum: 330,000
  • Mean: 33.00
  • Standard deviation: 0.00
  • Minimum: 33
  • Maximum: 33
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Sum: 40,000
  • Mean: 4.00
  • Standard deviation: 0.00
  • Minimum: 4
  • Maximum: 4
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Sum: 340,000
  • Mean: 34.00
  • Standard deviation: 0.00
  • Minimum: 34
  • Maximum: 34

10K elements flat distribution

Has statistics

IRI count statistics
  • Type: IRI count statistics (rb:IriCountStatistics)
  • Sum: 500,000
  • Unique count (estimated): 73,219
  • Mean: 50.00
  • Standard deviation: 0.00
  • Minimum: 50
  • Maximum: 50
Blank node count statistics
Literal count statistics
  • Type: Literal count statistics (rb:LiteralCountStatistics)
  • Sum: 70,000
  • Unique count (estimated): 6,874
  • Mean: 7.00
  • Standard deviation: 0.00
  • Minimum: 7
  • Maximum: 7
Simple literal count statistics
  • Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
  • Sum: 20,000
  • Unique count (estimated): 2
  • Mean: 2.00
  • Standard deviation: 0.00
  • Minimum: 2
  • Maximum: 2
Datatype literal count statistics
  • Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
  • Sum: 20,000
  • Unique count (estimated): 15
  • Mean: 2.00
  • Standard deviation: 0.00
  • Minimum: 2
  • Maximum: 2
Language string count statistics
  • Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
  • Sum: 30,000
  • Unique count (estimated): 6,857
  • Mean: 3.00
  • Standard deviation: 0.00
  • Minimum: 3
  • Maximum: 3
Quoted triple count statistics
Subject count statistics
  • Type: Subject count statistics (rb:SubjectCountStatistics)
  • Sum: 100,000
  • Mean: 10.00
  • Standard deviation: 0.00
  • Minimum: 10
  • Maximum: 10
Predicate count statistics
  • Type: Predicate count statistics (rb:PredicateCountStatistics)
  • Sum: 210,000
  • Mean: 21.00
  • Standard deviation: 0.00
  • Minimum: 21
  • Maximum: 21
Object count statistics
  • Type: Object count statistics (rb:ObjectCountStatistics)
  • Sum: 330,000
  • Mean: 33.00
  • Standard deviation: 0.00
  • Minimum: 33
  • Maximum: 33
Graph count statistics
  • Type: Graph count statistics (rb:GraphCountStatistics)
  • Sum: 40,000
  • Mean: 4.00
  • Standard deviation: 0.00
  • Minimum: 4
  • Maximum: 4
Statement count statistics
  • Type: Statement count statistics (rb:StatementCountStatistics)
  • Sum: 340,000
  • Mean: 34.00
  • Standard deviation: 0.00
  • Minimum: 34
  • Maximum: 34