Property | Value(s) |
---|---|
acdh:hasSubject |
Austrian press releases, Austrian media landscape, Austrian Press Agency, Austrian magazines, Austrian newspapers, journalistic prose
|
acdh:hasAppliedMethod |
Part of Speech Tagging
|
acdh:hasContributor | |
acdh:hasCreatedStartDate |
2013-01-01
|
acdh:hasCurator | |
acdh:hasCoverageStartDate |
1986-01-02
|
acdh:hasBinaryUpdatedRole |
admin
|
acdh:hasUsedSoftware |
SketchEngine v2.36.5-SkE-2.151.5-3.99.9, TreeTagger 3.2, RFTagger
|
acdh:hasMetadataCreator | |
acdh:hasIdentifier |
https://arche.acdh.oeaw.ac.at/api/38513, https://id.acdh.oeaw.ac.at/amc, https://hdl.handle.net/21.11115/0000-0012-2113-1
|
acdh:hasDescription |
The AMC was created as part of a public-private cooperation between the Austrian Academy of Sciences and the Austria Press Agency (APA).
Thanks to the efforts of APA, the AMC covers a great portion of the Austrian media landscape of the past three decades, comprising a wide range of text types which can be classified as journalistic prose (Austrian newspapers, magazines, press releases). The texts in the AMC are lemmatized and tagged with part-of-speech labels. The corpus can be accessed via a corpus search engine (SketchEngine) on the premises of the institute. Versioning in the collection: Identical major version numbers (e.g. 2.x) indicate, that these versions have been processed using the very same tools and parameters. Increasing minor version numbers (e.g. 2.1, 2.2 ...) indicate the addition of new data. I.e. with increasing version numbers the corpora are monotonically growing. |
acdh:hasUrl | |
acdh:hasSpatialCoverage | |
acdh:hasLicensor | |
acdh:hasOwner | |
acdh:hasUpdatedRole |
uczeitschner
|
acdh:hasCreatedEndDate |
2017-02-01
|
acdh:hasAccessRestrictionSummary |
public: 5 / restricted: 4
|
acdh:hasBinarySize |
0.08 MB
|
acdh:hasPid | |
acdh:hasCreator | |
acdh:hasAvailableDate |
2013-06-01T00:00:00Z
|
acdh:hasTechnicalInfo |
Source data in XML is annotated with lemma and PoS and indexed with SketchEngine
|
acdh:hasContact | |
acdh:hasCompleteness |
The collection is actively maintained. The corpus is brought up to date (i.e. the most recent publications are added) at least once a year.
|
acdh:hasNumberOfItems |
10
|
acdh:hasLifeCycleStatus | |
acdh:hasCollectedStartDate |
2013-01-01T00:00:00Z
|
acdh:hasLanguage | |
acdh:aclRead |
public
|
rdf:type | |
acdh:hasCollectedEndDate |
2013-01-01T00:00:00Z
|
acdh:hasTitle |
amc: Austrian Media Corpus
|
acdh:hasLicenseSummary |
CC BY 4.0: 6 / InC: 4
|
acdh:hasHosting | |
acdh:hasUpdatedDate |
2022-10-25T08:05:46.974026
|
acdh:hasCoverageEndDate |
2016-12-31
|
acdh:createdBy |
admin
|
acdh:hasRightsHolder | |
acdh:hasExtent |
100910000000 tokens, 667000000 sentences, 305580000 paragraphs, 40159000 documents
|
acdh:hasOaiSet | |
acdh:hasDepositor |
Available since 01 06 2013
TopCollection
The AMC was created as part of a public-private cooperation between the Austrian Academy of Sciences and the Austria Press Agency (APA).
Thanks to the efforts of APA, the AMC covers a great portion of the Austrian media landscape of the past three decades, comprising a wide range of text types which can be classified as journalistic prose (Austrian newspapers, magazines, press releases). The texts in the AMC are lemmatized and tagged with part-of-speech labels. The corpus can be accessed via a corpus search engine (SketchEngine) on the premises of the institute. Versioning in the collection: Identical major version numbers (e.g. 2.x) indicate, that these versions have been processed using the very same tools and parameters. Increasing minor version numbers (e.g. 2.1, 2.2 ...) indicate the addition of new data. I.e. with increasing version numbers the corpora are monotonically growing.
The AMC was created as part of a public-private cooperation between the Austrian Academy of Sciences and the Austria Press Agency (APA).
Thanks to the efforts of APA, the AMC covers a great portion of the Austrian media landscape of the past three decades, comprising a wide range of text types which can be classified as journalistic prose (Austrian newspapers, magazines, press releases). The texts in the AMC are lemmatized and tagged with part-of-speech labels. The corpus can be accessed via a corpus search engine (SketchEngine) on the premises of the institute. Versioning in the collection: Identical major version numbers (e.g. 2.x) indicate, that these versions have been processed using the very same tools and parameters. Increasing minor version numbers (e.g. 2.1, 2.2 ...) indicate the addition of new data. I.e. with increasing version numbers the corpora are monotonically growing.
Show Less
Citation / Title | Relation Type |
---|
Title | Relation type | Type |
---|