
Dataset: linked-spending (development version)

This is a subset of the LinkedSpending dataset (LS package 2013-9), which contains government spending information from around the world. The dataset uses the RDF Data Cube vocabulary. Only the spending observations were kept in this subset; extra contextual information was discarded. See the website and the paper for more details.

Stream preview
0000000000.ttl
<http://linkedspending.aksw.org/resource/observation-2011saiki_budget-/935a15a2195545d040e64b8b486a3f4a833cbb7c>
        a       <http://purl.org/linked-data/cube#Observation>;
        <http://www.w3.org/2000/01/rdf-schema#label>
                "2011saiki_budget, observation /935a15a2195545d040e64b8b486a3f4a833cbb7c";
        <http://dbpedia.org/ontology/currency>
                <http://dbpedia.org/resource/Japanese_yen>;
        <http://dublincore.org/documents/2012/06/14/dcmi-terms/source>
                [] ;
        <http://linkedspending.aksw.org/ontology/amount>
                "0.0";
        <http://linkedspending.aksw.org/ontology/category>
                <http://openspending.org/2011saiki_budget/category/8>;
        <http://linkedspending.aksw.org/ontology/subcategory>
                <http://openspending.org/2011saiki_budget/subcategory/8-1>;
        <http://linkedspending.aksw.org/ontology/uniquekey>
                "627";
        <http://purl.org/linked-data/cube#dataSet>
                <http://linkedspending.aksw.org/resource/2011saiki_budget>;
        <http://purl.org/linked-data/sdmx/2009/attribute#refArea>
                <http://linkedgeodata.org/triplify/node424313451>;
        <http://purl.org/linked-data/sdmx/2009/dimension#refPeriod>
                "2011-01-01"^^<http://www.w3.org/2001/XMLSchema#date> .
0000000010.ttl
<http://linkedspending.aksw.org/resource/observation-2011saiki_budget-/93b403bcc1b4323e65a08e0a941116ddb35d971e>
        a       <http://purl.org/linked-data/cube#Observation>;
        <http://www.w3.org/2000/01/rdf-schema#label>
                "2011saiki_budget, observation /93b403bcc1b4323e65a08e0a941116ddb35d971e";
        <http://dbpedia.org/ontology/currency>
                <http://dbpedia.org/resource/Japanese_yen>;
        <http://dublincore.org/documents/2012/06/14/dcmi-terms/source>
                [] ;
        <http://linkedspending.aksw.org/ontology/amount>
                "23800.0";
        <http://linkedspending.aksw.org/ontology/category>
                <http://openspending.org/2011saiki_budget/category/2>;
        <http://linkedspending.aksw.org/ontology/subcategory>
                <http://openspending.org/2011saiki_budget/subcategory/2-2>;
        <http://linkedspending.aksw.org/ontology/uniquekey>
                "2730";
        <http://purl.org/linked-data/cube#dataSet>
                <http://linkedspending.aksw.org/resource/2011saiki_budget>;
        <http://purl.org/linked-data/sdmx/2009/attribute#refArea>
                <http://linkedgeodata.org/triplify/node424313451>;
        <http://purl.org/linked-data/sdmx/2009/dimension#refPeriod>
                "2011-01-01"^^<http://www.w3.org/2001/XMLSchema#date> .
0000000100.ttl
<http://linkedspending.aksw.org/resource/observation-2011saiki_budget-/165df673595e4987e590097a4b307bad389a864a>
        a       <http://purl.org/linked-data/cube#Observation>;
        <http://www.w3.org/2000/01/rdf-schema#label>
                "2011saiki_budget, observation /165df673595e4987e590097a4b307bad389a864a";
        <http://dbpedia.org/ontology/currency>
                <http://dbpedia.org/resource/Japanese_yen>;
        <http://dublincore.org/documents/2012/06/14/dcmi-terms/source>
                [] ;
        <http://linkedspending.aksw.org/ontology/amount>
                "4701946.0";
        <http://linkedspending.aksw.org/ontology/category>
                <http://openspending.org/2011saiki_budget/category/2>;
        <http://linkedspending.aksw.org/ontology/subcategory>
                <http://openspending.org/2011saiki_budget/subcategory/2-2>;
        <http://linkedspending.aksw.org/ontology/uniquekey>
                "3478";
        <http://purl.org/linked-data/cube#dataSet>
                <http://linkedspending.aksw.org/resource/2011saiki_budget>;
        <http://purl.org/linked-data/sdmx/2009/attribute#refArea>
                <http://linkedgeodata.org/triplify/node424313451>;
        <http://purl.org/linked-data/sdmx/2009/dimension#refPeriod>
                "2011-01-01"^^<http://www.w3.org/2001/XMLSchema#date> .
0000001000.ttl
<http://linkedspending.aksw.org/resource/observation-2011saiki_budget-/b410f194c507d51d7fa43d983cec8fa3eb76f921>
        a       <http://purl.org/linked-data/cube#Observation>;
        <http://www.w3.org/2000/01/rdf-schema#label>
                "2011saiki_budget, observation /b410f194c507d51d7fa43d983cec8fa3eb76f921";
        <http://dbpedia.org/ontology/currency>
                <http://dbpedia.org/resource/Japanese_yen>;
        <http://dublincore.org/documents/2012/06/14/dcmi-terms/source>
                [] ;
        <http://linkedspending.aksw.org/ontology/amount>
                "11313.0";
        <http://linkedspending.aksw.org/ontology/category>
                <http://openspending.org/2011saiki_budget/category/2>;
        <http://linkedspending.aksw.org/ontology/subcategory>
                <http://openspending.org/2011saiki_budget/subcategory/2-2>;
        <http://linkedspending.aksw.org/ontology/uniquekey>
                "3161";
        <http://purl.org/linked-data/cube#dataSet>
                <http://linkedspending.aksw.org/resource/2011saiki_budget>;
        <http://purl.org/linked-data/sdmx/2009/attribute#refArea>
                <http://linkedgeodata.org/triplify/node424313451>;
        <http://purl.org/linked-data/sdmx/2009/dimension#refPeriod>
                "2011-01-01"^^<http://www.w3.org/2001/XMLSchema#date> .
0000010000.ttl
<http://linkedspending.aksw.org/resource/observation-aide-publique-au-developpement-france-2011-/757555294af72fb53f0434bba6a71b8a9ae9692c>
        a       <http://purl.org/linked-data/cube#Observation>;
        <http://www.w3.org/2000/01/rdf-schema#label>
                "aide-publique-au-developpement-france-2011, observation /757555294af72fb53f0434bba6a71b8a9ae9692c";
        <http://dbpedia.org/ontology/currency>
                <http://dbpedia.org/resource/Euro>;
        <http://dublincore.org/documents/2012/06/14/dcmi-terms/source>
                [] ;
        <http://linkedspending.aksw.org/ontology/amount>
                "504520.0";
        <http://linkedspending.aksw.org/ontology/amount-paid>
                "168810";
        <http://linkedspending.aksw.org/ontology/bimulti>
                "bilatérale";
        <http://linkedspending.aksw.org/ontology/canal>
                "Public sector";
        <http://linkedspending.aksw.org/ontology/code>
                "2011-6627";
        <http://linkedspending.aksw.org/ontology/date-debut>
                "26/09/2011";
        <http://linkedspending.aksw.org/ontology/date-fin>
                "30/06/2017";
        <http://linkedspending.aksw.org/ontology/description>
                "Systèmes de production agricoles durables  à Madagascar";
        <http://linkedspending.aksw.org/ontology/from>
                <http://openspending.org/aide-publique-au-developpement-france-2011/from/afd>;
        <http://linkedspending.aksw.org/ontology/nature-operation>
                "activité déjà notifiée antérieurement (augmentation/diminution d'un engagement antérieur, versement d'un engagement antérieur)";
        <http://linkedspending.aksw.org/ontology/secteur>
                "INDUSTRIES MANUFACTURIERES";
        <http://linkedspending.aksw.org/ontology/titre>
                "EXTENS.SECT.AGRO-IND.PALM.HUIL          ";
        <http://linkedspending.aksw.org/ontology/to>
                <http://openspending.org/aide-publique-au-developpement-france-2011/to/cate-d-ivoire>;
        <http://linkedspending.aksw.org/ontology/type-aide>
                "Fonds communs/financements groupés";
        <http://linkedspending.aksw.org/ontology/type-financement>
                "Prêt d'aide sauf réorganisation de la dette";
        <http://linkedspending.aksw.org/ontology/type-ressource>
                "APD (aide publique au développement)";
        <http://purl.org/linked-data/cube#dataSet>
                <http://linkedspending.aksw.org/resource/aide-publique-au-developpement-france-2011>;
        <http://purl.org/linked-data/sdmx/2009/attribute#refArea>
                <http://linkedgeodata.org/triplify/node1363947712>;
        <http://purl.org/linked-data/sdmx/2009/dimension#refPeriod>
                "1986-12-06"^^<http://www.w3.org/2001/XMLSchema#date> .

General information

  1. BibTeX citation:
    @article{H_ffner_2015,
      title={LinkedSpending: OpenSpending becomes Linked Open Data},
      volume={7},
      ISSN={1570-0844},
      url={http://dx.doi.org/10.3233/SW-150172},
      DOI={10.3233/sw-150172},
      number={1},
      journal={Semantic Web},
      publisher={SAGE Publications},
      author={Höffner, Konrad and Martin, Michael and Lehmann, Jens},
      editor={Noy, Natasha},
      year={2015},
      month=Mar,
      pages={95–104}
    }
    

Technical metadata

  • Has stream type usage:
    • RDF stream type usage (1)
    • RDF stream type usage (2)
      • Type: RDF stream type usage (stax:RdfStreamTypeUsage)
      • Comment: The dataset can be viewed as a stream of graphs corresponding to statistical observations or other subjects (e.g., statistical properties). Each graph is uniquely identified by its subject IRI. (en)
      • Has stream type: RDF subject graph stream (stax:subjectGraphStream)
  • Has stream element count: 2,477,552
  • Has stream element split:
    • Type: Stream elements split by topic (rb:TopicStreamElementSplit)
    • Comment: Each stream element corresponds to one subject (usually an observation). (en)
    • Has subject shape:
      • Has subject shape (1)
        • Comment: Some elements have their class missing. (en)
        • Target subjects of: Label (rdfs:label)
      • Has subject shape (2)
        • Comment: Target instances of any class. (en)
        • Target subjects of: Type (rdf:type)
  • Uses vocabulary:
  • Conforms to W3C RDF 1.1 specification: yes
  • Conforms to W3C RDF-star draft specification as of December 17, 2021: yes
  • Uses generalized triples: no
  • Uses generalized RDF datasets: no
  • Uses RDF-star: no
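
The subject graph stream described above (one stream element per subject IRI) can be illustrated with a minimal sketch. It groups consecutive N-Triples lines that share a subject into one element. The helper names and the example.org IRIs are hypothetical, the sketch assumes that all triples of a subject are contiguous in the input, and it is not a reader for any of the published distributions.

```python
from itertools import groupby


def subject_of(line: str) -> str:
    """Return the subject term of one N-Triples line.

    In N-Triples the subject is an IRI or a blank node label, neither of
    which can contain whitespace, so it is the first token on the line.
    """
    return line.split(None, 1)[0]


def subject_graphs(lines):
    """Yield (subject, [triple lines]) for consecutive lines sharing a subject."""
    stripped = (ln.strip() for ln in lines)
    # Skip blank lines and comment lines.
    triples = (ln for ln in stripped if ln and not ln.startswith("#"))
    for subject, group in groupby(triples, key=subject_of):
        yield subject, list(group)


# Hypothetical sample data: two subjects, three triples.
sample = """\
<http://example.org/obs/1> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://purl.org/linked-data/cube#Observation> .
<http://example.org/obs/1> <http://linkedspending.aksw.org/ontology/amount> "0.0" .
<http://example.org/obs/2> <http://linkedspending.aksw.org/ontology/amount> "23800.0" .
""".splitlines()

elements = list(subject_graphs(sample))
for subject, triples in elements:
    print(subject, len(triples))
```

Because `groupby` only merges adjacent lines, a subject that reappears later in the input would produce a second element; input that is not grouped by subject would need sorting or a dictionary-based grouping instead.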

Distributions

The dataset is published in four size variants (10K, 100K, 1M, and full), each containing a specific number of stream elements. For each size, three distribution types are available: flat (an N-Triples/N-Quads file in the RDF Message Log format), streaming (a .tar.gz archive with Turtle/TriG files, one file per stream element), and Jelly (a native binary format for streaming RDF). See the documentation for more details.
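
As a rough illustration of how the streaming distribution type could be consumed, the sketch below walks a .tar.gz archive with Python's standard `tarfile` module and yields one text payload per file, in archive order. The function names and the in-memory demo archive are hypothetical; the actual directory layout inside the published archives is not described here, so treat the member names as an assumption.

```python
import io
import tarfile


def iter_stream_elements(fileobj):
    """Yield (member name, decoded text) for each regular file in a .tar.gz archive."""
    with tarfile.open(fileobj=fileobj, mode="r:gz") as archive:
        for member in archive:
            if not member.isfile():
                continue  # skip directories and other non-file entries
            payload = archive.extractfile(member).read()
            yield member.name, payload.decode("utf-8")


def build_demo_archive(files):
    """Build a small in-memory .tar.gz archive (demo data only, not the real dataset)."""
    buffer = io.BytesIO()
    with tarfile.open(fileobj=buffer, mode="w:gz") as archive:
        for name, text in files.items():
            data = text.encode("utf-8")
            info = tarfile.TarInfo(name=name)
            info.size = len(data)
            archive.addfile(info, io.BytesIO(data))
    buffer.seek(0)
    return buffer


# Hypothetical two-element archive, named after the pattern seen in the stream preview.
demo = build_demo_archive({
    "0000000000.ttl": "<http://example.org/s> a <http://purl.org/linked-data/cube#Observation> .\n",
    "0000000001.ttl": "<http://example.org/t> a <http://purl.org/linked-data/cube#Observation> .\n",
})

elements = list(iter_stream_elements(demo))
for name, text in elements:
    print(name, len(text))
```

To read a downloaded archive instead of the demo buffer, open the file in binary mode and pass the file object to `iter_stream_elements`. Each yielded text would then be handed to a Turtle/TriG parser of your choice.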

Distribution size  Statements  Flat      Streaming  Jelly
10K                158,342     2.0 MB    1.3 MB     1.2 MB
100K               1,716,898   17.0 MB   10.0 MB    11.4 MB
1M                 23,371,403  226.2 MB  139.9 MB   131.9 MB
Full               55,097,866  561.0 MB  346.0 MB   326.6 MB
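
The figures in the table lend themselves to a quick sanity check. The sketch below derives the average number of statements per stream element and the size of the streaming and Jelly distributions relative to the flat one. The numbers are copied from the table; the element counts are taken from the distribution names and from the stream element count in the technical metadata. The dictionary layout and function name are illustrative only.

```python
# name: (stream elements, statements, flat MB, streaming MB, Jelly MB)
DISTRIBUTIONS = {
    "10K": (10_000, 158_342, 2.0, 1.3, 1.2),
    "100K": (100_000, 1_716_898, 17.0, 10.0, 11.4),
    "1M": (1_000_000, 23_371_403, 226.2, 139.9, 131.9),
    "Full": (2_477_552, 55_097_866, 561.0, 346.0, 326.6),
}


def summarize(elements, statements, flat_mb, streaming_mb, jelly_mb):
    """Derive per-element and relative-size figures for one distribution size."""
    return {
        "statements_per_element": statements / elements,
        "streaming_vs_flat": streaming_mb / flat_mb,
        "jelly_vs_flat": jelly_mb / flat_mb,
    }


summary = {name: summarize(*row) for name, row in DISTRIBUTIONS.items()}

for name, s in summary.items():
    print(
        f"{name:>4}: {s['statements_per_element']:.2f} statements/element, "
        f"streaming/flat {s['streaming_vs_flat']:.2f}, "
        f"Jelly/flat {s['jelly_vs_flat']:.2f}"
    )
```

The statements-per-element values should agree with the mean of the "Statements" row in the Statistics section (for example, 22.24 for the full distribution).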

The full metadata of all distributions can be found below.

Full flat distribution

Full stream distribution

Full Jelly distribution

1M elements flat distribution

1M elements stream distribution

1M elements Jelly distribution

100K elements flat distribution

100K elements stream distribution

100K elements Jelly distribution

10K elements flat distribution

10K elements stream distribution

10K elements Jelly distribution

Statistics

Statistics for full distributions

  • Title: Statistics for full distributions
Statistic            Sum         Unique      Mean    St. dev.  Min.  Max.
IRIs                 83,873,394  ~3,232,432  33.85   12.84     3     84
Blank nodes          2,583,713   N/A         1.04    0.21      0     2
Literals             18,740,789  ~4,836,067  7.56    3.87      0     43
Simple literals      15,143,397  ~4,843,751  6.11    3.69      0     43
Datatype literals    3,597,392   ~7,209      1.45    0.55      0     4
Language literals    0           ~0          0.00    0.00      0     0
Datatypes            3,409,585   3           1.38    0.49      0     2
ASCII control chars  0           N/A         0.00    0.00      0     0
Quoted triples       0           N/A         0.00    0.00      0     0
Subjects             2,477,552   ~2,468,698  1.00    0.00      1     1
Predicates           51,106,554  ~708        20.63   7.50      2     49
Objects              51,613,790  ~8,039,135  20.83   7.97      1     86
Graphs               2,477,552   ~1          1.00    0.00      1     1
Statements           55,097,866  N/A         22.24   9.61      2     86
Bytes per statement  N/A         N/A         215.17  50.49     2.09  16,200.00

Statistics for 1M distributions

  • Title: Statistics for 1M distributions
Statistic            Sum         Unique      Mean    St. dev.  Min.  Max.
IRIs                 35,194,519  ~1,336,113  35.19   13.13     3     60
Blank nodes          1,020,633   N/A         1.02    0.15      0     2
Literals             7,242,453   ~2,005,026  7.24    3.04      0     26
Simple literals      5,666,659   ~1,995,706  5.67    2.79      0     26
Datatype literals    1,575,794   ~5,841      1.58    0.62      0     4
Language literals    0           ~0          0.00    0.00      0     0
Datatypes            1,481,292   3           1.48    0.50      0     2
ASCII control chars  0           N/A         0.00    0.00      0     0
Quoted triples       0           N/A         0.00    0.00      0     0
Subjects             1,000,000   ~998,186    1.00    0.00      1     1
Predicates           19,928,203  ~383        19.93   5.05      2     32
Objects              22,529,402  ~3,328,758  22.53   9.84      1     52
Graphs               1,000,000   ~1          1.00    0.00      1     1
Statements           23,371,403  N/A         23.37   10.26     2     52
Bytes per statement  N/A         N/A         218.21  67.92     2.09  16,200.00

Statistics for 100K distributions

  • Title: Statistics for 100K distributions
Statistic            Sum         Unique      Mean    St. dev.  Min.  Max.
IRIs                 2,497,664   ~210,282    24.98   2.52      3     30
Blank nodes          99,849      N/A         1.00    0.04      0     1
Literals             809,395     ~187,264    8.09    2.49      0     17
Simple literals      709,140     ~185,894    7.09    2.49      0     17
Datatype literals    100,255     ~2,538      1.00    0.07      0     2
Language literals    0           ~0          0.00    0.00      0     0
Datatypes            100,255     3           1.00    0.07      0     2
ASCII control chars  0           N/A         0.00    0.00      0     0
Quoted triples       0           N/A         0.00    0.00      0     0
Subjects             100,000     ~100,086    1.00    0.00      1     1
Predicates           1,716,279   ~56         17.16   2.45      2     23
Objects              1,590,629   ~396,718    15.91   2.41      1     34
Graphs               100,000     ~1          1.00    0.00      1     1
Statements           1,716,898   N/A         17.17   2.44      2     34
Bytes per statement  N/A         N/A         228.15  30.31     2.09  2,517.50

Statistics for 10K distributions

  • Title: Statistics for 10K distributions
Statistic            Sum         Unique      Mean    St. dev.  Min.  Max.
IRIs                 226,552     ~10,827     22.66   6.07      3     30
Blank nodes          9,907       N/A         0.99    0.10      0     1
Literals             86,396      ~32,579     8.64    5.27      0     16
Simple literals      76,087      ~31,639     7.61    5.28      0     15
Datatype literals    10,309      ~976        1.03    0.22      0     2
Language literals    0           ~0          0.00    0.00      0     0
Datatypes            10,309      3           1.03    0.22      0     2
ASCII control chars  0           N/A         0.00    0.00      0     0
Quoted triples       0           N/A         0.00    0.00      0     0
Subjects             10,000      ~9,970      1.00    0.00      1     1
Predicates           157,827     ~48         15.78   5.84      2     23
Objects              155,028     ~43,291     15.50   5.43      1     23
Graphs               10,000      ~1          1.00    0.00      1     1
Statements           158,342     N/A         15.83   5.82      2     23
Bytes per statement  N/A         N/A         226.19  72.83     2.09  2,517.50