Skip to content

Dataset: openaire-lod (development version)

OpenAIRE LOD was a service that exported data from the OpenAIRE information space in RDF format, using Linked Open Data principles. The data was exported to Zenodo, with the last dump dated at March 3, 2021. This dataset consists of the "result" subset of the OpenAIRE LOD graph, including scientific results such as publications.

Only records that were valid RDF and had the "dateofcollection" property were included here. They were then sorted by the date of collection in ascending order. The first 2 (out of 28) million records that were obtained in this way are a part of this dataset.

See also the project documentation and the used ontology.

Stream preview (click to expand)
0000000000.ttl
<http://lod.openaire.eu/data/result/crossref____::a3c372b7108d0c714c08e4a8378f5f90>
        a       <http://lod.openaire.eu/vocab/ResultEntity>;
        <http://lod.openaire.eu/vocab/author>
                "Gomes, José R. B." , "Illas, Francesc" , "Prates Ramalho, João P.";
        <http://lod.openaire.eu/vocab/bestaccessright>
                "UNKNOWN";
        <http://lod.openaire.eu/vocab/collectedfrom>
                "Crossref";
        <http://lod.openaire.eu/vocab/dateofcollection>
                "2017-01-31T15:48:12.213Z";
        <http://lod.openaire.eu/vocab/outcome>
                <http://lod.openaire.eu/data/project/sgov________::6f7b8585c55b9c26b545708f3452e217> , <http://lod.openaire.eu/data/project/corda__h2020::805040189e1fd94da068ccacec7d8c99>;
        <http://lod.openaire.eu/vocab/pid>
                "10.1039/c6cp06971a";
        <http://lod.openaire.eu/vocab/resourcetype>
                "UNKNOWN";
        <http://lod.openaire.eu/vocab/resulttype>
                "publication";
        <http://lod.openaire.eu/vocab/title>
                "Adsorption of CO on the rutile TiO 2 (110) surface: a dispersion-corrected density functional theory study" .
0000000010.ttl
<http://lod.openaire.eu/data/result/doiboost____::96adf775f366c2857399ff0710f0d9c8>
        a       <http://lod.openaire.eu/vocab/ResultEntity>;
        <http://lod.openaire.eu/vocab/affiliation>
                <http://lod.openaire.eu/data/organization/grid________::14fc4b9a5bf19a1d373537a2a3c16e44>;
        <http://lod.openaire.eu/vocab/author>
                "Rosidi Rosidi" , "Padma Adriana" , "Zaki Baridwan";
        <http://lod.openaire.eu/vocab/bestaccessright>
                "OPEN";
        <http://lod.openaire.eu/vocab/collectedfrom>
                "Microsoft Academic Graph";
        <http://lod.openaire.eu/vocab/dateofacceptance>
                "2014-02-20";
        <http://lod.openaire.eu/vocab/dateofcollection>
                "2017-05-11T18:50:59Z";
        <http://lod.openaire.eu/vocab/description>
                "<jats:p>&lt;p&gt;&lt;strong&gt;&lt;em&gt;Abstract&lt;/em&gt;&lt;/strong&gt;&lt;/p&gt; &lt;p&gt;&lt;em&gt;The purpose of this study is to examine the determinant of tax practitioners ethical decision making behaviour. The factors that were examined in this study were individual factors; PRESOR, &lt;/em&gt;Machiavellian&lt;em&gt;, and situational factors; risk preference, importance of tax to practice, exposure to current tax practice, closeness of client relationship. &lt;/em&gt;&lt;em&gt; &lt;/em&gt;&lt;/p&gt; &lt;p&gt;&lt;em&gt;This study used survey method in gathering the data. Population of this study were tax practitioners joined in IKPI (Ikatan Konsultan Pajak Indonesia) in Jawa Timur, Indonesia. A total of 38 &lt;/em&gt;&lt;em&gt;samples&lt;/em&gt;&lt;em&gt; were processed using Logistic Regression. The model of this study explained &lt;/em&gt;&lt;em&gt;45&lt;/em&gt;&lt;em&gt;% determinants of tax practitioners ethical decision making. &lt;/em&gt;&lt;/p&gt; &lt;p&gt;&lt;em&gt;The results of this study showed that PRESOR and &lt;/em&gt;Machiavellian&lt;em&gt; as individual factors affects tax practitioners ethical decision making. Situational factors in this study, which were risk preference, importance of tax to practice, exposure to current tax practice, closeness of client relationship was proven not to have a significant effect to ethical decision making. &lt;/em&gt;&lt;/p&gt; &lt;p&gt;&lt;em&gt; &lt;/em&gt;&lt;/p&gt; &lt;p&gt;&lt;em&gt;Keywords:  Ethical Decision Making, Individual Factors, PRESOR, &lt;/em&gt;Machiavellian&lt;em&gt;, Situational Factors.&lt;/em&gt;&lt;/p&gt; &lt;p&gt;&lt;strong&gt; &lt;/strong&gt;&lt;/p&gt; &lt;p&gt; &lt;/p&gt;&lt;p&gt;&lt;strong&gt;Abstrak&lt;/strong&gt;&lt;/p&gt; &lt;p&gt;Studi ini bertujuan untuk menguji determinan pengambilan keputusan etis konsultan pajak. Faktor-faktor yang diteliti pada studi ini adalah faktor individu, yaitu PRESOR dan Machiavellian, dan faktor situasional, yaitu preferensi risiko&lt;em&gt;, &lt;/em&gt;dominasi profesional&lt;em&gt;, &lt;/em&gt;kekinian informasi&lt;em&gt;, &lt;/em&gt;dan&lt;em&gt; &lt;/em&gt;hubungan profesional&lt;em&gt;. &lt;/em&gt;&lt;em&gt; &lt;/em&gt;&lt;/p&gt; &lt;p&gt;Studi ini menggunakan metode survei dalam pengambilan data. Populasi yang digunakan adalah konsultan pajak yang terdaftar di Ikatan Konsultan Pajak Indonesia (IKPI) Jawa Timur. Sebanyak 38 sampel yang dapat diolah dengan menggunakan regresi logistik dan hasilnya adalah model studi dapat menjelaskan 45% determinan pengambilan keputusan etis konsultan pajak.&lt;/p&gt; &lt;p&gt;Hasil studi ini menunjukkan bahwa faktor individu yaitu PRESOR dan Machiavellian memberikan pengaruh signifikan terhadap pengambilan keputusan etis konsultan pajak, sedangkan faktor situasional yaitu preferensi risiko&lt;em&gt;, &lt;/em&gt;dominasi profesional&lt;em&gt;, &lt;/em&gt;kekinian informasi&lt;em&gt;, &lt;/em&gt;dan&lt;em&gt; &lt;/em&gt;hubungan profesional tidak berpengaruh secara signifikan terhadap pengambilan keputusan etis.&lt;/p&gt; &lt;p&gt; &lt;/p&gt; &lt;p&gt;Kata kunci: Pengambilan Keputusan Etis, Faktor Individu, PRESOR, Machiavellian, Faktor Situasional.&lt;/p&gt;</jats:p>";
        <http://lod.openaire.eu/vocab/originalid>
                "10.18860/em.v4i2.2456";
        <http://lod.openaire.eu/vocab/pid>
                "10.18860/em.v4i2.2456";
        <http://lod.openaire.eu/vocab/publisher>
                "Maulana Malik Ibrahim State Islamic University";
        <http://lod.openaire.eu/vocab/resourcetype>
                "0001";
        <http://lod.openaire.eu/vocab/resulttype>
                "publication";
        <http://lod.openaire.eu/vocab/similarity>
                <http://lod.openaire.eu/data/result/doiboost____::c2c2ee8a8be966c01a566cb71364c5c2>;
        <http://lod.openaire.eu/vocab/subject>
                "Ethical decision" , "Client relationship" , "Population" , "education.field_of_study" , "education" , "Social psychology" , "Psychology";
        <http://lod.openaire.eu/vocab/title>
                "FAKTOR INDIVIDU DAN FAKTOR SITUASIONAL : DETERMINAN PEMBUATAN KEPUTUSAN ETIS KONSULTAN PAJAK" .
0000000100.ttl
<http://lod.openaire.eu/data/result/doiboost____::a9f5ad97308f4c8a6a7a76968a207b0e>
        a       <http://lod.openaire.eu/vocab/ResultEntity>;
        <http://lod.openaire.eu/vocab/affiliation>
                <http://lod.openaire.eu/data/organization/grid________::fac7d9cacecf503f429d64db4534e837>;
        <http://lod.openaire.eu/vocab/author>
                "Jeremy A. Greenwood";
        <http://lod.openaire.eu/vocab/bestaccessright>
                "RESTRICTED";
        <http://lod.openaire.eu/vocab/collectedfrom>
                "Microsoft Academic Graph";
        <http://lod.openaire.eu/vocab/dateofacceptance>
                "2016-01-27T20:54:57Z";
        <http://lod.openaire.eu/vocab/dateofcollection>
                "2017-08-09T00:55:12Z";
        <http://lod.openaire.eu/vocab/originalid>
                "10.2118/178819-ms";
        <http://lod.openaire.eu/vocab/pid>
                "10.2118/178819-ms";
        <http://lod.openaire.eu/vocab/publisher>
                "Society of Petroleum Engineers";
        <http://lod.openaire.eu/vocab/resourcetype>
                "0004";
        <http://lod.openaire.eu/vocab/resulttype>
                "publication";
        <http://lod.openaire.eu/vocab/subject>
                "Root cause analysis" , "Forensic engineering" , "Structural engineering" , "business.industry" , "business" , "Vibration" , "Engineering";
        <http://lod.openaire.eu/vocab/title>
                "Improvements in the Root Cause Analysis of Drillstring Vibration" .
0000001000.ttl
<http://lod.openaire.eu/data/result/snsf_p3_pubs::4059d2b097867d754474a46a55588dc1>
        a       <http://lod.openaire.eu/vocab/ResultEntity>;
        <http://lod.openaire.eu/vocab/author>
                "Briki M Monnin JHaffen E Sechter D Favrod J Netillard C Cheraitia E Marin K &amp; al.";
        <http://lod.openaire.eu/vocab/bestaccessright>
                "CLOSED";
        <http://lod.openaire.eu/vocab/collectedfrom>
                "SNSF P3 Database";
        <http://lod.openaire.eu/vocab/dateofacceptance>
                "2014-01-01";
        <http://lod.openaire.eu/vocab/dateofcollection>
                "2017-10-27T10:12:22.533Z";
        <http://lod.openaire.eu/vocab/description>
                "A psychotherapeutic approach for schizophrenia is now recommended as an adjuvant for psychopharmacology since antipsychotic medications only have a partial impact especially as regards positive symptoms and insight. In addition cognitive distortions and the lack of metacognitive skills might increase positive symptoms leading to poor social functioning. This underlines the need for speci?c approaches which target cognitive processes relevant for insight and abilities in metacognition. Metacognitive training (MCT) is a structured group intervention which enhances a patient's re?ection on cognitive biases and improves problem solving. The aim of our study was to assess MCTs' short term impact on insight symptoms and quality of life. Fiftypatients with schizophrenia or schizoaffective disorders and persistent positive symptoms (delusions or hallucinations) were enrolled in the study. After Baseline assessment participants were randomised either to supportive therapy or MCT. Both groups used the same design (1 h session twice a week during 8 weeks) although the basic knowledge given to  participants was different between interventions. Participants were assessed at eight weeks based on the Scale to Assess Unawareness of Mental Disorder Positive and Negative Syndrome Scale (PANSS) Psychotic Symptom Rating Scales the Calgary Depression Scale for Schizophrenia and the Quality of Life Scale.Between group différences were signi?cant in favour of MCT on the PANSS positive scale. Between group différences in post  and pretest values showed a trend in favour of MCT for insight on hallucinations. Results of our study indicate that the MCT has an effect on reducing positive symptomatology and a trend impact on insight and social functioning.";
        <http://lod.openaire.eu/vocab/outcome>
                <http://lod.openaire.eu/data/project/snsf________::1c0c8a8a454b595c6f1ec2e4cddaca20>;
        <http://lod.openaire.eu/vocab/pid>
                "10.1016/j.schres.2014.06.005.";
        <http://lod.openaire.eu/vocab/resourcetype>
                "UNKNOWN";
        <http://lod.openaire.eu/vocab/resulttype>
                "publication";
        <http://lod.openaire.eu/vocab/title>
                "Metacognitive training for schizophrenia : a multicentric randomized controlled trial" .
0000010000.ttl
<http://lod.openaire.eu/data/result/scholix_____::500e9076171a764cecd908815abd1fdc>
        a       <http://lod.openaire.eu/vocab/ResultEntity>;
        <http://lod.openaire.eu/vocab/author>
                "Friedrich, R." , "Panizzi, P." , "Fuentes-Prior, P." , "Richter, K." , "Verhamme, I." , "Anderson, P.J." , "Kawabata, S." , "Huber, R." , "Bode, W." , "Bock, P.E.";
        <http://lod.openaire.eu/vocab/bestaccessright>
                "UNKNOWN";
        <http://lod.openaire.eu/vocab/collectedfrom>
                "RCSB";
        <http://lod.openaire.eu/vocab/dateofacceptance>
                "2003-10-07";
        <http://lod.openaire.eu/vocab/dateofcollection>
                "2017-10-31T10:42:43.436+01:00";
        <http://lod.openaire.eu/vocab/originalid>
                "1nu7";
        <http://lod.openaire.eu/vocab/pid>
                "1nu7";
        <http://lod.openaire.eu/vocab/relationship>
                <http://lod.openaire.eu/data/result/scholix_____::9dcf560f2ef1d98e7afef98eff314993>;
        <http://lod.openaire.eu/vocab/resourcetype>
                "UNKNOWN";
        <http://lod.openaire.eu/vocab/resulttype>
                "dataset";
        <http://lod.openaire.eu/vocab/title>
                "Staphylocoagulase-Thrombin Complex" .

General information

  1. BibTeX citation:
    @misc{https://doi.org/10.5281/zenodo.4587369,
      doi = {10.5281/ZENODO.4587369},
      url = {https://zenodo.org/record/4587369},
      author = {Alexiou, Giorgos and Papastefanatos, George and Vahdati, Sahar and Lange, Christoph},
      language = {en},
      title = {OpenAIRE LOD Dump},
      publisher = {Zenodo},
      year = {2021},
      copyright = {Creative Commons Zero v1.0 Universal}
    }
    

Technical metadata

  • Has stream type usage:
    • RDF stream type usage (​1)
      • Type: RDF stream type usage (stax:RdfStreamTypeUsage)
      • Comment: The dataset can be viewed as a stream of graphs, with each graph corresponding to one scientific result from OpenAIRE. Each graph is uniquely identified by its subject IRI. (en)
      • Has stream type: RDF subject graph stream (stax:subjectGraphStream)
    • RDF stream type usage (​2)
  • Has stream element count: 2,000,000
  • Has stream element split:
  • Uses vocabulary: http://lod.openaire.eu/vocab
  • Conforms to W3C RDF 1.1 specification: yes
  • Conforms to W3C RDF-star draft specification as of December 17, 2021: yes
  • Uses generalized triples: no
  • Uses generalized RDF datasets: no
  • Uses RDF-star: no
  • Language: en

Distributions

The dataset is published in a few size variants, each containing a specific number of stream elements. For each size, there are three distribution types available: flat (an N-Triples/N-Quads file in the RDF Message Log format), streaming (a .tar.gz archive with Turtle/TriG files, one file per stream element), and Jelly (a native binary format for streaming RDF). See the documentation for more details.

Distribution size Statements Flat Streaming Jelly
10K 193,178 3.4 MB 3.0 MB 2.9 MB
100K 2,267,185 48.1 MB 42.8 MB 42.6 MB
1M 42,913,544 1.1 GB 1.1 GB 1.2 GB
Full 71,810,467 1.7 GB 1.6 GB 1.6 GB

The full metadata of all distributions can be found below.

Full flat distribution

Full stream distribution

Full Jelly distribution

1M elements flat distribution

1M elements stream distribution

1M elements Jelly distribution

100K elements flat distribution

100K elements stream distribution

100K elements Jelly distribution

10K elements flat distribution

10K elements stream distribution

10K elements Jelly distribution

Statistics

Statistics for full distributions

  • Title: Statistics for full distributions
Sum Unique Mean St. dev. Min. Max.
IRIs 44,830,559 ~5,910,457 22.42 48.04 10 8,988
Blank nodes 0 N/A 0.00 0.00 0 0
Literals 55,180,959 ~9,575,401 27.59 132.67 5 5,121
Simple literals 55,180,959 ~9,648,186 27.59 132.67 5 5,121
Datatype literals 0 ~0 0.00 0.00 0 0
Language literals 0 ~0 0.00 0.00 0 0
Datatypes 0 0 0.00 0.00 0 0
ASCII control chars 5,234 N/A 0.00 0.94 0 503
Quoted triples 0 N/A 0.00 0.00 0 0
Subjects 2,000,000 ~2,001,658 1.00 0.00 1 1
Predicates 28,476,853 ~24 14.24 0.95 8 19
Objects 69,535,884 ~14,095,419 34.77 141.46 7 8,985
Graphs 2,000,000 ~1 1.00 0.00 1 1
Statements 71,810,467 N/A 35.91 141.51 8 8,987
Bytes per statement N/A N/A 282.06 1,382.31 0.28 298,514.27

Statistics for 1M distributions

  • Title: Statistics for 1M distributions
Sum Unique Mean St. dev. Min. Max.
IRIs 26,480,270 ~4,673,971 26.48 67.66 10 8,988
Blank nodes 0 N/A 0.00 0.00 0 0
Literals 29,668,659 ~4,846,089 29.67 187.48 5 5,121
Simple literals 29,668,659 ~4,863,808 29.67 187.48 5 5,121
Datatype literals 0 ~0 0.00 0.00 0 0
Language literals 0 ~0 0.00 0.00 0 0
Datatypes 0 0 0.00 0.00 0 0
ASCII control chars 2 N/A 0.00 0.00 0 2
Quoted triples 0 N/A 0.00 0.00 0 0
Subjects 1,000,000 ~999,168 1.00 0.00 1 1
Predicates 13,660,885 ~24 13.66 0.93 8 19
Objects 41,489,263 ~9,092,423 41.49 199.70 7 8,985
Graphs 1,000,000 ~1 1.00 0.00 1 1
Statements 42,913,544 N/A 42.91 199.75 8 8,987
Bytes per statement N/A N/A 380.97 1,949.04 0.28 298,514.27

Statistics for 100K distributions

  • Title: Statistics for 100K distributions
Sum Unique Mean St. dev. Min. Max.
IRIs 1,700,241 ~210,594 17.00 3.85 12 227
Blank nodes 0 N/A 0.00 0.00 0 0
Literals 1,849,721 ~823,894 18.50 28.73 7 3,037
Simple literals 1,849,721 ~832,846 18.50 28.73 7 3,037
Datatype literals 0 ~0 0.00 0.00 0 0
Language literals 0 ~0 0.00 0.00 0 0
Datatypes 0 0 0.00 0.00 0 0
ASCII control chars 0 N/A 0.00 0.00 0 0
Quoted triples 0 N/A 0.00 0.00 0 0
Subjects 100,000 ~99,963 1.00 0.00 1 1
Predicates 1,284,078 ~22 12.84 1.10 10 18
Objects 2,165,891 ~935,600 21.66 29.45 10 3,101
Graphs 100,000 ~1 1.00 0.00 1 1
Statements 2,267,185 N/A 22.67 29.35 10 3,101
Bytes per statement N/A N/A 189.88 171.91 0.95 25,737.47

Statistics for 10K distributions

  • Title: Statistics for 10K distributions
Sum Unique Mean St. dev. Min. Max.
IRIs 170,859 ~29,919 17.09 6.29 13 207
Blank nodes 0 N/A 0.00 0.00 0 0
Literals 138,454 ~56,963 13.85 5.07 7 202
Simple literals 138,454 ~57,082 13.85 5.07 7 202
Datatype literals 0 ~0 0.00 0.00 0 0
Language literals 0 ~0 0.00 0.00 0 0
Datatypes 0 0 0.00 0.00 0 0
ASCII control chars 0 N/A 0.00 0.00 0 0
Quoted triples 0 N/A 0.00 0.00 0 0
Subjects 10,000 ~9,985 1.00 0.00 1 1
Predicates 121,432 ~22 12.14 0.71 10 16
Objects 177,888 ~76,896 17.79 8.41 10 239
Graphs 10,000 ~1 1.00 0.00 1 1
Statements 193,178 N/A 19.32 8.53 10 240
Bytes per statement N/A N/A 185.05 103.82 1.77 2,641.00