Dataset: openaire-lod (development version)
OpenAIRE LOD was a service that exported data from the OpenAIRE information space in RDF format, following Linked Open Data principles. The data was exported to Zenodo, with the last dump dated March 3, 2021. This dataset consists of the "result" subset of the OpenAIRE LOD graph, which covers scientific results such as publications.
Only records that were valid RDF and had the "dateofcollection" property were included. The records were then sorted by date of collection in ascending order, and the first 2 million (out of 28 million) of them make up this dataset.
See also the project documentation and the ontology used.
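The selection described above (keep records that have a collection date, sort them oldest first, take the first N) can be sketched as follows. This is only an illustration, not the pipeline actually used to build the dataset: representing records as Python dictionaries, the `select_records` helper, and treating offset-less timestamps as UTC are all assumptions, and the RDF validity check is left out.

```python
from datetime import datetime, timezone

DATE_PROP = "http://lod.openaire.eu/vocab/dateofcollection"


def parse_timestamp(value):
    """Parse an ISO 8601 timestamp like those shown in the stream preview."""
    # Older Python versions reject a trailing "Z", so spell out the UTC offset.
    if value.endswith("Z"):
        value = value[:-1] + "+00:00"
    parsed = datetime.fromisoformat(value)
    # Assumption: a timestamp without an offset is treated as UTC.
    if parsed.tzinfo is None:
        parsed = parsed.replace(tzinfo=timezone.utc)
    return parsed


def select_records(records, limit):
    """Keep records with a collection date, oldest first, up to `limit`."""
    dated = [r for r in records if DATE_PROP in r]
    dated.sort(key=lambda r: parse_timestamp(r[DATE_PROP]))
    return dated[:limit]


# Hypothetical records; the timestamps are copied from the stream preview.
records = [
    {"id": "c", DATE_PROP: "2017-10-31T10:42:43.436+01:00"},
    {"id": "no-date"},
    {"id": "a", DATE_PROP: "2017-01-31T15:48:12.213Z"},
    {"id": "b", DATE_PROP: "2017-05-11T18:50:59Z"},
]
selected = select_records(records, limit=2)
print([r["id"] for r in selected])
```

Parsing the values into timezone-aware datetimes before sorting matters here because the timestamps in the data mix "Z" and explicit offsets such as "+01:00", so a plain string sort could order them incorrectly.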
Info
Download this metadata in RDF: Turtle, N-Triples, RDF/XML, Jelly
Source repository: dataset-openaire-lod
Permanent URL: https://w3id.org/riverbench/datasets/openaire-lod/dev
Stream preview
<http://lod.openaire.eu/data/result/crossref____::a3c372b7108d0c714c08e4a8378f5f90>
a <http://lod.openaire.eu/vocab/ResultEntity>;
<http://lod.openaire.eu/vocab/author>
"Prates Ramalho, João P." , "Gomes, José R. B." , "Illas, Francesc";
<http://lod.openaire.eu/vocab/bestaccessright>
"UNKNOWN";
<http://lod.openaire.eu/vocab/collectedfrom>
"Crossref";
<http://lod.openaire.eu/vocab/dateofcollection>
"2017-01-31T15:48:12.213Z";
<http://lod.openaire.eu/vocab/outcome>
<http://lod.openaire.eu/data/project/sgov________::6f7b8585c55b9c26b545708f3452e217> , <http://lod.openaire.eu/data/project/corda__h2020::805040189e1fd94da068ccacec7d8c99>;
<http://lod.openaire.eu/vocab/pid>
"10.1039/c6cp06971a";
<http://lod.openaire.eu/vocab/resourcetype>
"UNKNOWN";
<http://lod.openaire.eu/vocab/resulttype>
"publication";
<http://lod.openaire.eu/vocab/title>
"Adsorption of CO on the rutile TiO 2 (110) surface: a dispersion-corrected density functional theory study" .
<http://lod.openaire.eu/data/result/doiboost____::96adf775f366c2857399ff0710f0d9c8>
a <http://lod.openaire.eu/vocab/ResultEntity>;
<http://lod.openaire.eu/vocab/affiliation>
<http://lod.openaire.eu/data/organization/grid________::14fc4b9a5bf19a1d373537a2a3c16e44>;
<http://lod.openaire.eu/vocab/author>
"Padma Adriana" , "Zaki Baridwan" , "Rosidi Rosidi";
<http://lod.openaire.eu/vocab/bestaccessright>
"OPEN";
<http://lod.openaire.eu/vocab/collectedfrom>
"Microsoft Academic Graph";
<http://lod.openaire.eu/vocab/dateofacceptance>
"2014-02-20";
<http://lod.openaire.eu/vocab/dateofcollection>
"2017-05-11T18:50:59Z";
<http://lod.openaire.eu/vocab/description>
"<jats:p><p><strong><em>Abstract</em></strong></p> <p><em>The purpose of this study is to examine the determinant of tax practitioners ethical decision making behaviour. The factors that were examined in this study were individual factors; PRESOR, </em>Machiavellian<em>, and situational factors; risk preference, importance of tax to practice, exposure to current tax practice, closeness of client relationship. </em><em> </em></p> <p><em>This study used survey method in gathering the data. Population of this study were tax practitioners joined in IKPI (Ikatan Konsultan Pajak Indonesia) in Jawa Timur, Indonesia. A total of 38 </em><em>samples</em><em> were processed using Logistic Regression. The model of this study explained </em><em>45</em><em>% determinants of tax practitioners ethical decision making. </em></p> <p><em>The results of this study showed that PRESOR and </em>Machiavellian<em> as individual factors affects tax practitioners ethical decision making. Situational factors in this study, which were risk preference, importance of tax to practice, exposure to current tax practice, closeness of client relationship was proven not to have a significant effect to ethical decision making. </em></p> <p><em> </em></p> <p><em>Keywords: Ethical Decision Making, Individual Factors, PRESOR, </em>Machiavellian<em>, Situational Factors.</em></p> <p><strong> </strong></p> <p> </p><p><strong>Abstrak</strong></p> <p>Studi ini bertujuan untuk menguji determinan pengambilan keputusan etis konsultan pajak. Faktor-faktor yang diteliti pada studi ini adalah faktor individu, yaitu PRESOR dan Machiavellian, dan faktor situasional, yaitu preferensi risiko<em>, </em>dominasi profesional<em>, </em>kekinian informasi<em>, </em>dan<em> </em>hubungan profesional<em>. </em><em> </em></p> <p>Studi ini menggunakan metode survei dalam pengambilan data. Populasi yang digunakan adalah konsultan pajak yang terdaftar di Ikatan Konsultan Pajak Indonesia (IKPI) Jawa Timur. 
Sebanyak 38 sampel yang dapat diolah dengan menggunakan regresi logistik dan hasilnya adalah model studi dapat menjelaskan 45% determinan pengambilan keputusan etis konsultan pajak.</p> <p>Hasil studi ini menunjukkan bahwa faktor individu yaitu PRESOR dan Machiavellian memberikan pengaruh signifikan terhadap pengambilan keputusan etis konsultan pajak, sedangkan faktor situasional yaitu preferensi risiko<em>, </em>dominasi profesional<em>, </em>kekinian informasi<em>, </em>dan<em> </em>hubungan profesional tidak berpengaruh secara signifikan terhadap pengambilan keputusan etis.</p> <p> </p> <p>Kata kunci: Pengambilan Keputusan Etis, Faktor Individu, PRESOR, Machiavellian, Faktor Situasional.</p></jats:p>";
<http://lod.openaire.eu/vocab/originalid>
"10.18860/em.v4i2.2456";
<http://lod.openaire.eu/vocab/pid>
"10.18860/em.v4i2.2456";
<http://lod.openaire.eu/vocab/publisher>
"Maulana Malik Ibrahim State Islamic University";
<http://lod.openaire.eu/vocab/resourcetype>
"0001";
<http://lod.openaire.eu/vocab/resulttype>
"publication";
<http://lod.openaire.eu/vocab/similarity>
<http://lod.openaire.eu/data/result/doiboost____::c2c2ee8a8be966c01a566cb71364c5c2>;
<http://lod.openaire.eu/vocab/subject>
"Ethical decision" , "Social psychology" , "education" , "education.field_of_study" , "Psychology" , "Population" , "Client relationship";
<http://lod.openaire.eu/vocab/title>
"FAKTOR INDIVIDU DAN FAKTOR SITUASIONAL : DETERMINAN PEMBUATAN KEPUTUSAN ETIS KONSULTAN PAJAK" .
<http://lod.openaire.eu/data/result/doiboost____::a9f5ad97308f4c8a6a7a76968a207b0e>
a <http://lod.openaire.eu/vocab/ResultEntity>;
<http://lod.openaire.eu/vocab/affiliation>
<http://lod.openaire.eu/data/organization/grid________::fac7d9cacecf503f429d64db4534e837>;
<http://lod.openaire.eu/vocab/author>
"Jeremy A. Greenwood";
<http://lod.openaire.eu/vocab/bestaccessright>
"RESTRICTED";
<http://lod.openaire.eu/vocab/collectedfrom>
"Microsoft Academic Graph";
<http://lod.openaire.eu/vocab/dateofacceptance>
"2016-01-27T20:54:57Z";
<http://lod.openaire.eu/vocab/dateofcollection>
"2017-08-09T00:55:12Z";
<http://lod.openaire.eu/vocab/originalid>
"10.2118/178819-ms";
<http://lod.openaire.eu/vocab/pid>
"10.2118/178819-ms";
<http://lod.openaire.eu/vocab/publisher>
"Society of Petroleum Engineers";
<http://lod.openaire.eu/vocab/resourcetype>
"0004";
<http://lod.openaire.eu/vocab/resulttype>
"publication";
<http://lod.openaire.eu/vocab/subject>
"business" , "Vibration" , "business.industry" , "Root cause analysis" , "Forensic engineering" , "Structural engineering" , "Engineering";
<http://lod.openaire.eu/vocab/title>
"Improvements in the Root Cause Analysis of Drillstring Vibration" .
<http://lod.openaire.eu/data/result/snsf_p3_pubs::4059d2b097867d754474a46a55588dc1>
a <http://lod.openaire.eu/vocab/ResultEntity>;
<http://lod.openaire.eu/vocab/author>
"Briki M Monnin JHaffen E Sechter D Favrod J Netillard C Cheraitia E Marin K & al.";
<http://lod.openaire.eu/vocab/bestaccessright>
"CLOSED";
<http://lod.openaire.eu/vocab/collectedfrom>
"SNSF P3 Database";
<http://lod.openaire.eu/vocab/dateofacceptance>
"2014-01-01";
<http://lod.openaire.eu/vocab/dateofcollection>
"2017-10-27T10:12:22.533Z";
<http://lod.openaire.eu/vocab/description>
"A psychotherapeutic approach for schizophrenia is now recommended as an adjuvant for psychopharmacology since antipsychotic medications only have a partial impact especially as regards positive symptoms and insight. In addition cognitive distortions and the lack of metacognitive skills might increase positive symptoms leading to poor social functioning. This underlines the need for speci?c approaches which target cognitive processes relevant for insight and abilities in metacognition. Metacognitive training (MCT) is a structured group intervention which enhances a patient's re?ection on cognitive biases and improves problem solving. The aim of our study was to assess MCTs' short term impact on insight symptoms and quality of life. Fiftypatients with schizophrenia or schizoaffective disorders and persistent positive symptoms (delusions or hallucinations) were enrolled in the study. After Baseline assessment participants were randomised either to supportive therapy or MCT. Both groups used the same design (1 h session twice a week during 8 weeks) although the basic knowledge given to participants was different between interventions. Participants were assessed at eight weeks based on the Scale to Assess Unawareness of Mental Disorder Positive and Negative Syndrome Scale (PANSS) Psychotic Symptom Rating Scales the Calgary Depression Scale for Schizophrenia and the Quality of Life Scale.Between group différences were signi?cant in favour of MCT on the PANSS positive scale. Between group différences in post and pretest values showed a trend in favour of MCT for insight on hallucinations. Results of our study indicate that the MCT has an effect on reducing positive symptomatology and a trend impact on insight and social functioning.";
<http://lod.openaire.eu/vocab/outcome>
<http://lod.openaire.eu/data/project/snsf________::1c0c8a8a454b595c6f1ec2e4cddaca20>;
<http://lod.openaire.eu/vocab/pid>
"10.1016/j.schres.2014.06.005.";
<http://lod.openaire.eu/vocab/resourcetype>
"UNKNOWN";
<http://lod.openaire.eu/vocab/resulttype>
"publication";
<http://lod.openaire.eu/vocab/title>
"Metacognitive training for schizophrenia : a multicentric randomized controlled trial" .
<http://lod.openaire.eu/data/result/scholix_____::500e9076171a764cecd908815abd1fdc>
a <http://lod.openaire.eu/vocab/ResultEntity>;
<http://lod.openaire.eu/vocab/author>
"Fuentes-Prior, P." , "Friedrich, R." , "Verhamme, I." , "Richter, K." , "Anderson, P.J." , "Bock, P.E." , "Huber, R." , "Kawabata, S." , "Panizzi, P." , "Bode, W.";
<http://lod.openaire.eu/vocab/bestaccessright>
"UNKNOWN";
<http://lod.openaire.eu/vocab/collectedfrom>
"RCSB";
<http://lod.openaire.eu/vocab/dateofacceptance>
"2003-10-07";
<http://lod.openaire.eu/vocab/dateofcollection>
"2017-10-31T10:42:43.436+01:00";
<http://lod.openaire.eu/vocab/originalid>
"1nu7";
<http://lod.openaire.eu/vocab/pid>
"1nu7";
<http://lod.openaire.eu/vocab/relationship>
<http://lod.openaire.eu/data/result/scholix_____::9dcf560f2ef1d98e7afef98eff314993>;
<http://lod.openaire.eu/vocab/resourcetype>
"UNKNOWN";
<http://lod.openaire.eu/vocab/resulttype>
"dataset";
<http://lod.openaire.eu/vocab/title>
"Staphylocoagulase-Thrombin Complex" .
General information
- Title: OpenAIRE LOD graph (en)
- Identifier:
openaire-lod
- Has version:
dev
- Theme:
- Bibliography (eurovoc:4864)
- Open data (eurovoc:c_5ea6e5c4)
- Open science (eurovoc:c_99a79cea)
- Research report (eurovoc:2896)
- Research results (eurovoc:6306)
- Scientific research (eurovoc:2924)
- Creator:
- Giorgos Alexiou (1)
- Name: Giorgos Alexiou
- Homepage: https://orcid.org/0000-0003-2244-4916
- George Papastefanatos (2)
- Name: George Papastefanatos
- Homepage: https://orcid.org/0000-0002-9273-9843
- Sahar Vahdati (3)
- Name: Sahar Vahdati
- Christoph Lange (4)
- Name: Christoph Lange
- Piotr Sowiński (5)
- Name: Piotr Sowiński
- Comment: Processing the dataset for RiverBench
- Nickname: Ostrzyciel
- Homepage:
- License: https://spdx.org/licenses/CC0-1.0
- Source:
- Date Issued: 2024-07-12
- Date Modified: 2024-08-29
- Landing page: openaire-lod (dev)
- Conforms To: Metadata (https://w3id.org/riverbench/schema/metadata)
Technical metadata
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs, with each graph corresponding to one scientific result from OpenAIRE. Each graph is uniquely identified by its subject IRI. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has stream element count: 2,000,000
- Has stream element split:
- Type:
- Stream elements split by time (rb:TimeStreamElementSplit)
- Stream elements split by topic (rb:TopicStreamElementSplit)
- Comment: Each stream element corresponds to exactly one scientific result in OpenAIRE, each with an assigned timestamp (time of collection). (en)
- Has temporal property: http://lod.openaire.eu/vocab/dateofcollection
- Has subject shape:
- Comment: Target instances of class ResultEntity. (en)
- Target class: http://lod.openaire.eu/vocab/ResultEntity
- Type:
- Uses vocabulary: http://lod.openaire.eu/vocab
- Conforms to W3C RDF 1.1 specification: yes
- Conforms to W3C RDF-star draft specification as of December 17, 2021: yes
- Uses generalized triples: no
- Uses generalized RDF datasets: no
- Uses RDF-star: no
- Language:
en
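The two stream type usages listed above are related: a flat triple stream can be regrouped into subject graphs by collecting triples that share a subject. The sketch below illustrates this for N-Triples lines using only the standard library. It assumes that all triples of one subject are adjacent in the input, which is not confirmed here for the published files, and the `subject_of` and `subject_graphs` helpers and the example.org IRIs are hypothetical.

```python
from itertools import groupby


def subject_of(line):
    """Return the subject term of one N-Triples line (its first token)."""
    return line.split(None, 1)[0]


def subject_graphs(lines):
    """Group consecutive N-Triples lines that share the same subject."""
    triples = (ln.strip() for ln in lines)
    # Skip blank lines and comment lines.
    triples = (ln for ln in triples if ln and not ln.startswith("#"))
    for subject, group in groupby(triples, key=subject_of):
        yield subject, list(group)


# Hypothetical sample data using predicates from the OpenAIRE vocabulary.
sample = [
    '<http://example.org/r1> <http://lod.openaire.eu/vocab/resulttype> "publication" .',
    '<http://example.org/r1> <http://lod.openaire.eu/vocab/title> "First" .',
    '<http://example.org/r2> <http://lod.openaire.eu/vocab/resulttype> "dataset" .',
]
for subject, triples in subject_graphs(sample):
    print(subject, len(triples))
```

Because `groupby` only merges adjacent items, a subject whose triples are interleaved with another subject's would be split into several groups; in that case a dictionary keyed by subject would be the safer choice.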
Distributions
Download links
The dataset is published in a few size variants, each containing a specific number of stream elements. For each size, there are three distribution types available: flat (just an N-Triples/N-Quads file), streaming (a .tar.gz archive with Turtle/TriG files, one file per stream element), and Jelly (a native binary format for streaming RDF). See the documentation for more details.
Distribution size | Statements | Flat | Streaming | Jelly |
---|---|---|---|---|
10K | 193,178 | 3.4 MB | 3.0 MB | 3.1 MB |
100K | 2,267,185 | 48.8 MB | 43.2 MB | 45.0 MB |
1M | 42,913,544 | 1.2 GB | 1.1 GB | 1.2 GB |
Full | 71,810,467 | 1.7 GB | 1.6 GB | 1.7 GB |
The full metadata of all distributions can be found below.
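As a rough sanity check on a downloaded flat distribution, the statements can be counted directly from the compressed file. The sketch below assumes the file is gzip-compressed N-Triples with one statement per line; `count_statements` is a hypothetical helper, not part of any RiverBench tooling.

```python
import gzip


def count_statements(path):
    """Count N-Triples statements in a gzip-compressed file, line by line."""
    count = 0
    # Text mode decompresses and decodes lazily, so large files are not
    # loaded into memory at once.
    with gzip.open(path, "rt", encoding="utf-8") as handle:
        for line in handle:
            stripped = line.strip()
            # Blank lines and comment lines are not statements.
            if stripped and not stripped.startswith("#"):
                count += 1
    return count
```

Under these assumptions, calling `count_statements("flat_10K.nt.gz")` on the 10K flat distribution would be expected to give the figure in the Statements column of the table above.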
Full stream distribution
- Title: Full stream distribution
- Identifier:
stream-full
- Has file name:
stream_full.tar.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs, with each graph corresponding to one scientific result from OpenAIRE. Each graph is uniquely identified by its subject IRI. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has distribution type:
- Full distribution (rb:fullDistribution)
- Stream distribution (rb:streamDistribution)
- Has stream element count: 2,000,000
- Byte size: 1.6 GB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
6c33584c1fce650dd4ef1fc0c8c6732c
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
ca89a5fca2e67b6f76d4270f52ca3ac0f5636278
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/openaire-lod/dev/files/stream_full.tar.gz
- Statistics: statistics-full
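A stream distribution is a .tar.gz archive of Turtle files, one per stream element, so it can be read without unpacking it to disk. The following is a minimal sketch using Python's standard `tarfile` module. The `iter_stream_elements` helper is hypothetical, and it is an assumption that the order of members in the archive matches the intended stream order.

```python
import tarfile


def iter_stream_elements(path):
    """Yield (member name, Turtle text) for each file in a .tar.gz archive."""
    with tarfile.open(path, mode="r:gz") as archive:
        for member in archive:
            # Skip directories and other non-file entries.
            if not member.isfile():
                continue
            handle = archive.extractfile(member)
            yield member.name, handle.read().decode("utf-8")
```

Each yielded text could then be passed to a Turtle parser of your choice. Iterating over the archive rather than extracting it avoids creating a very large number of small files for the bigger distributions.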
Full Jelly distribution
- Title: Full Jelly distribution
- Identifier:
jelly-full
- Has file name:
jelly_full.jelly.gz
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs, with each graph corresponding to one scientific result from OpenAIRE. Each graph is uniquely identified by its subject IRI. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has distribution type:
- Full distribution (rb:fullDistribution)
- Jelly distribution (rb:jellyDistribution)
- Has stream element count: 2,000,000
- Byte size: 1.7 GB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
ebdfdb6a9fce776da8db88d5d186cda0
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
c72b45c6800c2e11b979eaacf078467d6cebbd23
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/openaire-lod/dev/files/jelly_full.jelly.gz
- Statistics: statistics-full
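The MD5 and SHA-1 values listed for each distribution can be used to check a download for corruption. The sketch below uses Python's standard `hashlib` module; the helper names `file_digest` and `matches_checksum` are hypothetical.

```python
import hashlib


def file_digest(path, algorithm, chunk_size=1 << 20):
    """Return the hex digest of a file, reading it in fixed-size chunks."""
    digest = hashlib.new(algorithm)
    with open(path, "rb") as handle:
        # Reading in chunks keeps memory use constant for multi-GB files.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_checksum(path, algorithm, expected):
    """Compare a file's digest with a published checksum value."""
    return file_digest(path, algorithm) == expected.strip().lower()
```

For example, `matches_checksum("jelly_full.jelly.gz", "md5", "ebdfdb6a9fce776da8db88d5d186cda0")` should return `True` for an intact copy of the full Jelly distribution, assuming the value listed on this page is current.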
Full flat distribution
- Title: Full flat distribution
- Identifier:
flat-full
- Has file name:
flat_full.nt.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Full distribution (rb:fullDistribution)
- Has stream element count: 2,000,000
- Byte size: 1.7 GB
- Media type: application/n-triples
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
d2b2c55272f4e622b39ac0e07cf95fd1
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
c9b92ddf7a65fefbb45329c3f110e47d33d1611f
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/openaire-lod/dev/files/flat_full.nt.gz
- Statistics: statistics-full
1M elements stream distribution
- Title: 1M elements stream distribution
- Identifier:
stream-1m
- Has file name:
stream_1M.tar.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs, with each graph corresponding to one scientific result from OpenAIRE. Each graph is uniquely identified by its subject IRI. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has distribution type:
- Partial distribution (rb:partialDistribution)
- Stream distribution (rb:streamDistribution)
- Has stream element count: 1,000,000
- Byte size: 1.1 GB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
c40ab8234ad663599619cb3b573516b0
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
79d367cd51002a6fdeb4490b86be209abeafc336
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/openaire-lod/dev/files/stream_1M.tar.gz
- Statistics: statistics-1m
1M elements Jelly distribution
- Title: 1M elements Jelly distribution
- Identifier:
jelly-1m
- Has file name:
jelly_1M.jelly.gz
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs, with each graph corresponding to one scientific result from OpenAIRE. Each graph is uniquely identified by its subject IRI. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has distribution type:
- Jelly distribution (rb:jellyDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 1,000,000
- Byte size: 1.2 GB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
445d395989d1efa696f1a6ce6782aafe
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
2a409f16f3921491137ce40fc40a8a788b05db28
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/openaire-lod/dev/files/jelly_1M.jelly.gz
- Statistics: statistics-1m
1M elements flat distribution
- Title: 1M elements flat distribution
- Identifier:
flat-1m
- Has file name:
flat_1M.nt.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 1,000,000
- Byte size: 1.2 GB
- Media type: application/n-triples
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
d526502a8f818cc2922079b747758db7
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
de9a9467557fe639d1f273a2da627f91b9eb8970
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/openaire-lod/dev/files/flat_1M.nt.gz
- Statistics: statistics-1m
100K elements stream distribution
- Title: 100K elements stream distribution
- Identifier:
stream-100k
- Has file name:
stream_100K.tar.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs, with each graph corresponding to one scientific result from OpenAIRE. Each graph is uniquely identified by its subject IRI. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has distribution type:
- Partial distribution (rb:partialDistribution)
- Stream distribution (rb:streamDistribution)
- Has stream element count: 100,000
- Byte size: 43.2 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
61a7336fae7448720075fa7644f728a9
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
4b063a58d22c0cc2cc4097bd510d0d7baead0440
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/openaire-lod/dev/files/stream_100K.tar.gz
- Statistics: statistics-100k
100K elements Jelly distribution
- Title: 100K elements Jelly distribution
- Identifier:
jelly-100k
- Has file name:
jelly_100K.jelly.gz
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs, with each graph corresponding to one scientific result from OpenAIRE. Each graph is uniquely identified by its subject IRI. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has distribution type:
- Jelly distribution (rb:jellyDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 100,000
- Byte size: 45.0 MB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
1321af283ff7a5a344bd9e1872d1ef15
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
97fb24e35b4f37a41da5e2d0294789eedae43698
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/openaire-lod/dev/files/jelly_100K.jelly.gz
- Statistics: statistics-100k
100K elements flat distribution
- Title: 100K elements flat distribution
- Identifier:
flat-100k
- Has file name:
flat_100K.nt.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 100,000
- Byte size: 48.8 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
8769034bb78ce37ff92d23e58c8dc252
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
a0a2851ebd5cddcdf90fa0cbcd63caf05f889a04
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/openaire-lod/dev/files/flat_100K.nt.gz
- Statistics: statistics-100k
10K elements stream distribution
- Title: 10K elements stream distribution
- Identifier:
stream-10k
- Has file name:
stream_10K.tar.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs, with each graph corresponding to one scientific result from OpenAIRE. Each graph is uniquely identified by its subject IRI. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has distribution type:
- Partial distribution (rb:partialDistribution)
- Stream distribution (rb:streamDistribution)
- Has stream element count: 10,000
- Byte size: 3.0 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
4a35a0fcdcd560391fc1cff0105a178b
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
b72a8b01db7a4d9734661bcc1caad4405af0994c
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/openaire-lod/dev/files/stream_10K.tar.gz
- Statistics: statistics-10k
10K elements Jelly distribution
- Title: 10K elements Jelly distribution
- Identifier:
jelly-10k
- Has file name:
jelly_10K.jelly.gz
- Has stream type usage:
- RDF stream type usage (1)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- RDF stream type usage (2)
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a stream of graphs, with each graph corresponding to one scientific result from OpenAIRE. Each graph is uniquely identified by its subject IRI. (en)
- Has stream type: RDF subject graph stream (stax:subjectGraphStream)
- Has distribution type:
- Jelly distribution (rb:jellyDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 10,000
- Byte size: 3.1 MB
- Media type: application/x-jelly-rdf
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
b9521ae71cda6a8b377ab3bc899eb639
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
ed4052973f23c2edb25a9e383304abc53fd8c4fa
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/openaire-lod/dev/files/jelly_10K.jelly.gz
- Statistics: statistics-10k
10K elements flat distribution
- Title: 10K elements flat distribution
- Identifier:
flat-10k
- Has file name:
flat_10K.nt.gz
- Has stream type usage:
- Type: RDF stream type usage (stax:RdfStreamTypeUsage)
- Comment: The dataset can be viewed as a flattened stream of triples. (en)
- Has stream type: Flat RDF triple stream (stax:flatTripleStream)
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 10,000
- Byte size: 3.4 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
5877803471aa03f29796811dbebb76a2
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
475a5838568e7dffa47a3b2abcee281963d84685
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Download URL: https://w3id.org/riverbench/datasets/openaire-lod/dev/files/flat_10K.nt.gz
- Statistics: statistics-10k
Statistics
Statistics for full distributions
- Title: Statistics for full distributions
Statistic | Sum | Unique (approx.) | Mean | St. dev. | Min. | Max. |
---|---|---|---|---|---|---|
IRIs | 44,830,559 | 5,938,140 | 22.42 | 48.04 | 10 | 8,988 |
Blank nodes | 0 | N/A | 0.00 | 0.00 | 0 | 0 |
Objects | 69,535,884 | 14,154,385 | 34.77 | 141.46 | 7 | 8,985 |
Graphs | 2,000,000 | 1 | 1.00 | 0.00 | 1 | 1 |
Statements | 71,810,467 | N/A | 35.91 | 141.51 | 8 | 8,987 |
Literals | 55,180,959 | 9,633,580 | 27.59 | 132.67 | 5 | 5,121 |
Simple literals | 55,180,959 | 9,633,580 | 27.59 | 132.67 | 5 | 5,121 |
Datatype literals | 0 | 0 | 0.00 | 0.00 | 0 | 0 |
Language literals | 0 | 0 | 0.00 | 0.00 | 0 | 0 |
ASCII control chars | 5,234 | N/A | 0.00 | 0.94 | 0 | 503 |
Quoted triples | 0 | N/A | 0.00 | 0.00 | 0 | 0 |
Subjects | 2,000,000 | 2,000,131 | 1.00 | 0.00 | 1 | 1 |
Predicates | 28,476,853 | 24 | 14.24 | 0.95 | 8 | 19 |
Statistics for 1M distributions
- Title: Statistics for 1M distributions
Statistic | Sum | Unique (approx.) | Mean | St. dev. | Min. | Max. |
---|---|---|---|---|---|---|
IRIs | 26,480,270 | 4,695,708 | 26.48 | 67.66 | 10 | 8,988 |
Blank nodes | 0 | N/A | 0.00 | 0.00 | 0 | 0 |
Objects | 41,489,263 | 9,106,902 | 41.49 | 199.70 | 7 | 8,985 |
Graphs | 1,000,000 | 1 | 1.00 | 0.00 | 1 | 1 |
Statements | 42,913,544 | N/A | 42.91 | 199.75 | 8 | 8,987 |
Literals | 29,668,659 | 4,871,239 | 29.67 | 187.48 | 5 | 5,121 |
Simple literals | 29,668,659 | 4,871,239 | 29.67 | 187.48 | 5 | 5,121 |
Datatype literals | 0 | 0 | 0.00 | 0.00 | 0 | 0 |
Language literals | 0 | 0 | 0.00 | 0.00 | 0 | 0 |
ASCII control chars | 2 | N/A | 0.00 | 0.00 | 0 | 2 |
Quoted triples | 0 | N/A | 0.00 | 0.00 | 0 | 0 |
Subjects | 1,000,000 | 1,000,059 | 1.00 | 0.00 | 1 | 1 |
Predicates | 13,660,885 | 24 | 13.66 | 0.93 | 8 | 19 |
Statistics for 100K distributions
- Title: Statistics for 100K distributions
Statistic | Sum | Unique (approx.) | Mean | St. dev. | Min. | Max. |
---|---|---|---|---|---|---|
IRIs | 1,700,241 | 210,791 | 17.00 | 3.85 | 12 | 227 |
Blank nodes | 0 | N/A | 0.00 | 0.00 | 0 | 0 |
Objects | 2,165,891 | 940,274 | 21.66 | 29.45 | 10 | 3,101 |
Graphs | 100,000 | 1 | 1.00 | 0.00 | 1 | 1 |
Statements | 2,267,185 | N/A | 22.67 | 29.35 | 10 | 3,101 |
Literals | 1,849,721 | 828,050 | 18.50 | 28.73 | 7 | 3,037 |
Simple literals | 1,849,721 | 828,050 | 18.50 | 28.73 | 7 | 3,037 |
Datatype literals | 0 | 0 | 0.00 | 0.00 | 0 | 0 |
Language literals | 0 | 0 | 0.00 | 0.00 | 0 | 0 |
ASCII control chars | 0 | N/A | 0.00 | 0.00 | 0 | 0 |
Quoted triples | 0 | N/A | 0.00 | 0.00 | 0 | 0 |
Subjects | 100,000 | 100,006 | 1.00 | 0.00 | 1 | 1 |
Predicates | 1,284,078 | 22 | 12.84 | 1.10 | 10 | 18 |
Statistics for 10K distributions
- Title: Statistics for 10K distributions
Statistic | Sum | Unique (approx.) | Mean | St. dev. | Min. | Max. |
---|---|---|---|---|---|---|
IRIs | 170,859 | 29,880 | 17.09 | 6.29 | 13 | 207 |
Blank nodes | 0 | N/A | 0.00 | 0.00 | 0 | 0 |
Objects | 177,888 | 76,837 | 17.79 | 8.41 | 10 | 239 |
Graphs | 10,000 | 1 | 1.00 | 0.00 | 1 | 1 |
Statements | 193,178 | N/A | 19.32 | 8.53 | 10 | 240 |
Literals | 138,454 | 56,917 | 13.85 | 5.07 | 7 | 202 |
Simple literals | 138,454 | 56,917 | 13.85 | 5.07 | 7 | 202 |
Datatype literals | 0 | 0 | 0.00 | 0.00 | 0 | 0 |
Language literals | 0 | 0 | 0.00 | 0.00 | 0 | 0 |
ASCII control chars | 0 | N/A | 0.00 | 0.00 | 0 | 0 |
Quoted triples | 0 | N/A | 0.00 | 0.00 | 0 | 0 |
Subjects | 10,000 | 9,999 | 1.00 | 0.00 | 1 | 1 |
Predicates | 121,432 | 22 | 12.14 | 0.71 | 10 | 16 |
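The Mean column of the Statements row appears to be the statement total divided by the number of stream elements. The short check below recomputes it for each distribution size from the figures reported on this page; the interpretation of the column is an assumption, and `mean_statements` is a hypothetical helper.

```python
# (statements, stream elements) per distribution size, as reported above.
distributions = {
    "10K": (193_178, 10_000),
    "100K": (2_267_185, 100_000),
    "1M": (42_913_544, 1_000_000),
    "Full": (71_810_467, 2_000_000),
}


def mean_statements(statements, elements):
    """Average number of statements per stream element, to two decimals."""
    return round(statements / elements, 2)


for name, (statements, elements) in distributions.items():
    print(name, mean_statements(statements, elements))
```

The recomputed values (19.32, 22.67, 42.91 and 35.91) agree with the tables. The average peaks in the 1M distribution and is lower for the full one, so the second million elements are on average smaller than the first.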