politiquices (development version)
Support and opposition relations extracted from news articles archived in Arquivo.pt. The dataset describes news articles in Portuguese and the presented political stances. Dataset source, more information about the project (Portuguese).
Info
Download this metadata in RDF: Turtle, N-Triples, RDF/XML
Source repository: politiquices
General information
- Title: Politiquices
- Identifier: politiquices
- Has version: dev
- Theme:
- News (rbt:news)
- Political (rbt:political)
- Temporal (rbt:temporal)
- Creator:
- David Soares Batista (1)
- Name: David Soares Batista
- Homepage: https://www.politiquices.pt/about
- Comment: Dataset creator
- Piotr Sowiński (2)
- Name: Piotr Sowiński
- Nickname: Ostrzyciel
- Homepage:
- Comment: Processing the dataset
- David Soares Batista (1)
- License: https://spdx.org/licenses/CC-BY-4.0
- Source:
- Date Issued: 2023-05-01
- Date Modified: 2023-05-08
- Landing page: politiquices (dev)
- Conforms To: Metadata (https://w3id.org/riverbench/schema/metadata)
Technical metadata
- Has stream element type: Triples (rb:triples)
- Has stream element count: 17,773
- Has stream element split:
- Type: Stream elements split by topic (rb:TopicStreamElementSplit)
- Comment: Each stream element corresponds to one news article.
- Uses ontology:
- Conforms to W3C RDF 1.1 specification: yes
- Conforms to W3C RDF-star draft specification as of December 17, 2021: yes
- Uses generalized triples: no
- Uses generalized RDF datasets: no
- Uses RDF-star: no
Distributions
Full triple stream distribution
- Title: Full triple stream distribution
- Identifier: stream-full
- Has file name: stream_full.tar.gz
- Has distribution type:
- Full distribution (rb:fullDistribution)
- Triple stream distribution (rb:tripleStreamDistribution)
- Has stream element count: 17,773
- Byte size: 2.46 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
cf99ca91cd1d5b80c7649a5893d1a3f2
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
860b32e53a780997d9e66497618f788c166d67fd
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/politiquices/dev/files/stream_full.tar.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 213,274
- Unique count (estimated): 18,557
- Mean: 12.00
- Standard deviation: 0.01
- Minimum: 11
- Maximum: 12
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 17,773
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 106,638
- Unique count (estimated): 36,172
- Mean: 6.00
- Standard deviation: 0.00
- Minimum: 6
- Maximum: 6
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 53,319
- Unique count (estimated): 1,295
- Mean: 3.00
- Standard deviation: 0.00
- Minimum: 3
- Maximum: 3
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 35,546
- Unique count (estimated): 17,310
- Mean: 2.00
- Standard deviation: 0.00
- Minimum: 2
- Maximum: 2
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 17,773
- Unique count (estimated): 17,594
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 35,546
- Mean: 2.00
- Standard deviation: 0.00
- Minimum: 2
- Maximum: 2
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 159,957
- Mean: 9.00
- Standard deviation: 0.00
- Minimum: 9
- Maximum: 9
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 159,955
- Mean: 9.00
- Standard deviation: 0.01
- Minimum: 8
- Maximum: 9
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 159,957
- Mean: 9.00
- Standard deviation: 0.00
- Minimum: 9
- Maximum: 9
Full flat distribution
- Title: Full flat distribution
- Identifier: flat-full
- Has file name: flat_full.nt.gz
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Full distribution (rb:fullDistribution)
- Has stream element count: 17,773
- Byte size: 2.93 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
a6cb52b58b9ed7f7da812801a474dece
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
a8def7cd204c4ac4cdb07d1427ef116b442fedac
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/politiquices/dev/files/flat_full.nt.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 213,274
- Unique count (estimated): 18,557
- Mean: 12.00
- Standard deviation: 0.01
- Minimum: 11
- Maximum: 12
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 17,773
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 106,638
- Unique count (estimated): 36,172
- Mean: 6.00
- Standard deviation: 0.00
- Minimum: 6
- Maximum: 6
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 53,319
- Unique count (estimated): 1,295
- Mean: 3.00
- Standard deviation: 0.00
- Minimum: 3
- Maximum: 3
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 35,546
- Unique count (estimated): 17,310
- Mean: 2.00
- Standard deviation: 0.00
- Minimum: 2
- Maximum: 2
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 17,773
- Unique count (estimated): 17,594
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 35,546
- Mean: 2.00
- Standard deviation: 0.00
- Minimum: 2
- Maximum: 2
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 159,957
- Mean: 9.00
- Standard deviation: 0.00
- Minimum: 9
- Maximum: 9
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 159,955
- Mean: 9.00
- Standard deviation: 0.01
- Minimum: 8
- Maximum: 9
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 159,957
- Mean: 9.00
- Standard deviation: 0.00
- Minimum: 9
- Maximum: 9
10K elements triple stream distribution
- Title: 10K elements triple stream distribution
- Identifier: stream-10k
- Has file name:
stream_10K.tar.gz
- Has distribution type:
- Partial distribution (rb:partialDistribution)
- Triple stream distribution (rb:tripleStreamDistribution)
- Has stream element count: 10,000
- Byte size: 1.38 MB
- Media type: text/turtle
- Packaging format: application/tar
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
7b70a6045fa872d570b7778985fcff7b
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
69b85505404a0d9a1638a24c9c53d021cdc8f2d0
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/politiquices/dev/files/stream_10K.tar.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 119,998
- Unique count (estimated): 10,681
- Mean: 12.00
- Standard deviation: 0.01
- Minimum: 11
- Maximum: 12
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 10,000
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 60,000
- Unique count (estimated): 22,263
- Mean: 6.00
- Standard deviation: 0.00
- Minimum: 6
- Maximum: 6
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 30,000
- Unique count (estimated): 1,064
- Mean: 3.00
- Standard deviation: 0.00
- Minimum: 3
- Maximum: 3
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 20,000
- Unique count (estimated): 11,248
- Mean: 2.00
- Standard deviation: 0.00
- Minimum: 2
- Maximum: 2
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 10,000
- Unique count (estimated): 9,954
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 20,000
- Mean: 2.00
- Standard deviation: 0.00
- Minimum: 2
- Maximum: 2
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 90,000
- Mean: 9.00
- Standard deviation: 0.00
- Minimum: 9
- Maximum: 9
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 89,998
- Mean: 9.00
- Standard deviation: 0.01
- Minimum: 8
- Maximum: 9
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 90,000
- Mean: 9.00
- Standard deviation: 0.00
- Minimum: 9
- Maximum: 9
10K elements flat distribution
- Title: 10K elements flat distribution
- Identifier: flat-10k
- Has file name:
flat_10K.nt.gz
- Has distribution type:
- Flat distribution (rb:flatDistribution)
- Partial distribution (rb:partialDistribution)
- Has stream element count: 10,000
- Byte size: 1.65 MB
- Media type: application/n-triples
- Compression format: application/gzip
- Checksum:
- Checksum (1)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
038abdb182c03bddd3dfacc1441ee5bd
- Algorithm: ChecksumAlgorithm_md5 (spdx:checksumAlgorithm_md5)
- Checksum (2)
- Type: Checksum (spdx:Checksum)
- ChecksumValue:
75d1906fd99cf7a385c4bf7bcd6f90ff600210b2
- Algorithm: ChecksumAlgorithm_sha1 (spdx:checksumAlgorithm_sha1)
- Checksum (1)
- Download URL: https://w3id.org/riverbench/datasets/politiquices/dev/files/flat_10K.nt.gz
Has statistics
IRI count statistics
- Type: IRI count statistics (rb:IriCountStatistics)
- Sum: 119,998
- Unique count (estimated): 10,681
- Mean: 12.00
- Standard deviation: 0.01
- Minimum: 11
- Maximum: 12
Blank node count statistics
- Type: Blank node count statistics (rb:BlankNodeCountStatistics)
- Sum: 10,000
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Literal count statistics
- Type: Literal count statistics (rb:LiteralCountStatistics)
- Sum: 60,000
- Unique count (estimated): 22,263
- Mean: 6.00
- Standard deviation: 0.00
- Minimum: 6
- Maximum: 6
Simple literal count statistics
- Type: Simple literal count statistics (rb:SimpleLiteralCountStatistics)
- Sum: 30,000
- Unique count (estimated): 1,064
- Mean: 3.00
- Standard deviation: 0.00
- Minimum: 3
- Maximum: 3
Datatype literal count statistics
- Type: Datatype literal count statistics (rb:DatatypeLiteralCountStatistics)
- Sum: 20,000
- Unique count (estimated): 11,248
- Mean: 2.00
- Standard deviation: 0.00
- Minimum: 2
- Maximum: 2
Language string count statistics
- Type: Language string count statistics (rb:LanguageLiteralCountStatistics)
- Sum: 10,000
- Unique count (estimated): 9,954
- Mean: 1.00
- Standard deviation: 0.00
- Minimum: 1
- Maximum: 1
Quoted triple count statistics
- Type: Quoted triple count statistics (rb:QuotedTripleCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Subject count statistics
- Type: Subject count statistics (rb:SubjectCountStatistics)
- Sum: 20,000
- Mean: 2.00
- Standard deviation: 0.00
- Minimum: 2
- Maximum: 2
Predicate count statistics
- Type: Predicate count statistics (rb:PredicateCountStatistics)
- Sum: 90,000
- Mean: 9.00
- Standard deviation: 0.00
- Minimum: 9
- Maximum: 9
Object count statistics
- Type: Object count statistics (rb:ObjectCountStatistics)
- Sum: 89,998
- Mean: 9.00
- Standard deviation: 0.01
- Minimum: 8
- Maximum: 9
Graph count statistics
- Type: Graph count statistics (rb:GraphCountStatistics)
- Sum: 0
- Mean: 0.00
- Standard deviation: 0.00
- Minimum: 0
- Maximum: 0
Statement count statistics
- Type: Statement count statistics (rb:StatementCountStatistics)
- Sum: 90,000
- Mean: 9.00
- Standard deviation: 0.00
- Minimum: 9
- Maximum: 9