| Property | Value(s) |
|---|---|
| rdf:type | |
| acdh:aclRead |
public
|
| acdh:createdBy |
admin
|
| acdh:hasAccessRestrictionSummary |
public: 5 / restricted: 4
|
| acdh:hasAppliedMethod |
Part of Speech Tagging
|
| acdh:hasAvailableDate |
2013-06-01T00:00:00Z
|
| acdh:hasBinarySize |
0.08 MB
|
| acdh:hasBinaryUpdatedRole |
admin
|
| acdh:hasCollectedEndDate |
2013-01-01T00:00:00Z
|
| acdh:hasCollectedStartDate |
2013-01-01T00:00:00Z
|
| acdh:hasCompleteness |
The collection is actively maintained. The corpus is brought up to date (i.e. the most recent publications are added) at least once a year.
|
| acdh:hasContact | |
| acdh:hasContributor | |
| acdh:hasCoverageEndDate |
2016-12-31
|
| acdh:hasCoverageStartDate |
1986-01-02
|
| acdh:hasCreatedEndDate |
2017-02-01
|
| acdh:hasCreatedStartDate |
2013-01-01
|
| acdh:hasCurator | |
| acdh:hasDepositor | |
| acdh:hasDescription |
The AMC was created as part of a public-private cooperation between the Austrian Academy of Sciences and the Austria Press Agency (APA).
Thanks to the efforts of APA, the AMC covers a great portion of the Austrian media landscape of the past three decades, comprising a wide range of text types which can be classified as journalistic prose (Austrian newspapers, magazines, press releases). The texts in the AMC are lemmatized and tagged with part-of-speech labels. The corpus can be accessed via a corpus search engine (SketchEngine) on the premises of the institute. Versioning in the collection: Identical major version numbers (e.g. 2.x) indicate, that these versions have been processed using the very same tools and parameters. Increasing minor version numbers (e.g. 2.1, 2.2 ...) indicate the addition of new data. I.e. with increasing version numbers the corpora are monotonically growing. |
| acdh:hasEditor | |
| acdh:hasExtent |
100910000000 tokens, 667000000 sentences, 305580000 paragraphs, 40159000 documents
|
| acdh:hasHosting | |
| acdh:hasIdentifier |
https://arche.acdh.oeaw.ac.at/api/38513, https://hdl.handle.net/21.11115/0000-0012-2113-1, https://id.acdh.oeaw.ac.at/amc
|
| acdh:hasLanguage | |
| acdh:hasLicenseSummary |
CC BY 4.0: 5 / InC: 4
|
| acdh:hasLicensor | |
| acdh:hasLifeCycleStatus | |
| acdh:hasMetadataCreator | |
| acdh:hasNumberOfItems |
10
|
| acdh:hasOaiSet | |
| acdh:hasOwner | |
| acdh:hasPid | |
| acdh:hasRightsHolder | |
| acdh:hasSpatialCoverage | |
| acdh:hasSubject |
Austrian Press Agency, Austrian magazines, Austrian media landscape, Austrian newspapers, Austrian press releases, journalistic prose
|
| acdh:hasTechnicalInfo |
Source data in XML is annotated with lemma and PoS and indexed with SketchEngine
|
| acdh:hasTitle |
amc: Austrian Media Corpus
|
| acdh:hasUpdatedDate |
2022-10-25T08:05:46.974026
|
| acdh:hasUpdatedRole |
uczeitschner
|
| acdh:hasUrl | |
| acdh:hasUsedSoftware |
RFTagger, SketchEngine v2.36.5-SkE-2.151.5-3.99.9, TreeTagger 3.2
|
Available since 01 06 2013
TopCollection
The AMC was created as part of a public-private cooperation between the Austrian Academy of Sciences and the Austria Press Agency (APA).
Thanks to the efforts of APA, the AMC covers a great portion of the Austrian media landscape of the past three decades, comprising a wide range of text types which can be classified as journalistic prose (Austrian newspapers, magazines, press releases). The texts in the AMC are lemmatized and tagged with part-of-speech labels. The corpus can be accessed via a corpus search engine (SketchEngine) on the premises of the institute. Versioning in the collection: Identical major version numbers (e.g. 2.x) indicate, that these versions have been processed using the very same tools and parameters. Increasing minor version numbers (e.g. 2.1, 2.2 ...) indicate the addition of new data. I.e. with increasing version numbers the corpora are monotonically growing.
The AMC was created as part of a public-private cooperation between the Austrian Academy of Sciences and the Austria Press Agency (APA).
Thanks to the efforts of APA, the AMC covers a great portion of the Austrian media landscape of the past three decades, comprising a wide range of text types which can be classified as journalistic prose (Austrian newspapers, magazines, press releases). The texts in the AMC are lemmatized and tagged with part-of-speech labels. The corpus can be accessed via a corpus search engine (SketchEngine) on the premises of the institute. Versioning in the collection: Identical major version numbers (e.g. 2.x) indicate, that these versions have been processed using the very same tools and parameters. Increasing minor version numbers (e.g. 2.1, 2.2 ...) indicate the addition of new data. I.e. with increasing version numbers the corpora are monotonically growing.
Show Less
| Citation / Title | Relation Type |
|---|
| Title | Relation type | Type |
|---|


