Overview
amc: Austrian Media Corpus
Type: Collection
Contact(s): Hannes Pirker
Creator(s): Hannes Pirker, Matej Ďurčo, Jutta Ransmayr, Daniel Schopper
Curator(s): Hannes Pirker
Contributor(s): Austria Press Agency
Available Date: 2013-06-01
Extent: 40159000 documents, 305580000 paragraphs, 667000000 sentences, 100910000000 tokens
Number of Item(s): 4
Binary Size: 56.19 KB
License: restricted
Access Restriction: public
amc: Austrian Media Corpus
Property | Value(s) |
---|---|
rdf:type | http://www.w3.org/ns/ldp#RDFSource, http://www.w3.org/ns/ldp#Container, http://fedora.info/definitions/v4/repository#Container, http://fedora.info/definitions/v4/repository#Resource, https://vocabs.acdh.oeaw.ac.at/schema#Collection |
acdh:hasCoverageEndDate | 2016-12-31 |
acdh:hasBinarySize | 56.19 KB |
acdh:hasCreatedEndDate | 2017-02-01 |
acdh:hasAppliedMethods | Part of Speech Tagging |
acdh:hasCollectedEndDate | 2013-01-01 |
acdh:hasSpatialCoverage | Austria |
acdh:hasLifeCycleStatus | active |
acdh:hasExtent | 40159000 documents, 305580000 paragraphs, 667000000 sentences, 100910000000 tokens |
acdh:hasIdentifier | https://id.acdh.oeaw.ac.at/amc, https://id.acdh.oeaw.ac.at/uuid/6ad20f66-1160-c599-bcea-6b2073095576 |
acdh:Owner | Austrian Centre for Digital Humanities |
acdh:hasDepositor | Hannes Pirker |
acdh:hasCreator | Hannes Pirker, Matej Ďurčo, Jutta Ransmayr, Daniel Schopper |
acdh:hasAccessRestriction | public |
acdh:hasTitleImage | AMC Logo |
acdh:hasMetadataCreator | Hannes Pirker |
acdh:hasLicense | restricted |
acdh:hasRightsHolder | Austrian Centre for Digital Humanities |
acdh:hasFunder | Austrian Centre for Digital Humanities |
acdh:hasLocationURL | https://www.oeaw.ac.at/acdh/tools/amc-austria-media-corpus/ |
acdh:hasCollectedStartDate | 2013-01-01 |
acdh:hasHosting | ARCHE |
acdh:hasCompleteness | The collection is actively maintained. The corpus is brought up to date (i.e. the most recent publications are added) at least once a year. |
acdh:hasContributor | Austria Press Agency |
acdh:hasCoverageStartDate | 1986-01-02 |
acdh:hasAvailableDate | 2013-06-01 |
acdh:hasNumberOfItems | 4 |
acdh:hasCreatedStartDate | 2013-01-01 |
acdh:hasCurator | Hannes Pirker |
acdh:hasTitle | amc: Austrian Media Corpus |
acdh:hasLicensor | Austrian Centre for Digital Humanities |
acdh:hasContact | Hannes Pirker |
acdh:hasLanguage | deu |
acdh:hasTechnicalInfo | Source data in XML is annotated with lemma and PoS and indexed with SketchEngine |
acdh:hasDescription | The AMC was created as part of a public-private cooperation between the Austrian Academy of Sciences and the Austria Press Agency (APA). Thanks to the efforts of APA, the AMC covers a large portion of the Austrian media landscape of the past three decades, comprising a wide range of text types that can be classified as journalistic prose (Austrian newspapers, magazines, press releases). The texts in the AMC are lemmatized and tagged with part-of-speech labels. The corpus can be accessed via a corpus search engine (SketchEngine) on the premises of the institute. Versioning in the collection: identical major version numbers (e.g. 2.x) indicate that these versions have been processed using the very same tools and parameters. Increasing minor version numbers (e.g. 2.1, 2.2 ...) indicate the addition of new data. That is, the corpora grow monotonically as version numbers increase. |
http://www.w3.org/ns/auth/acl#accessControl | https://arche.acdh.oeaw.ac.at/rest/acls/7a/ec/71/70/7aec7170-9a45-439f-898f-4ef7f08a65d2 |
acdh:hasUsedSoftware | SketchEngine v2.36.5-SkE-2.151.5-3.99.9, TreeTagger 3.2, RFTagger |
Summary
Spatial Coverage: Austria
Description:
The AMC was created as part of a public-private cooperation between the Austrian Academy of Sciences and the Austria Press Agency (APA).
Thanks to the efforts of APA, the AMC covers a large portion of the Austrian media landscape of the past three decades, comprising a wide range of text types that can be classified as journalistic prose (Austrian newspapers, magazines, press releases). The texts in the AMC are lemmatized and tagged with part-of-speech labels. The corpus can be accessed via a corpus search engine (SketchEngine) on the premises of the institute.
Versioning in the collection: identical major version numbers (e.g. 2.x) indicate that these versions have been processed using the very same tools and parameters. Increasing minor version numbers (e.g. 2.1, 2.2 ...) indicate the addition of new data. That is, the corpora grow monotonically as version numbers increase.
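The versioning convention in the description can be illustrated with a minimal Python sketch. It assumes version labels have the plain form "major.minor" (as in the examples 2.1 and 2.2); the function names are hypothetical and are not part of any AMC or ARCHE tooling.

```python
from typing import NamedTuple


class CorpusVersion(NamedTuple):
    """Hypothetical representation of a version label such as "2.1"."""
    major: int
    minor: int


def parse_version(label: str) -> CorpusVersion:
    # Assumption: labels consist of exactly two integers separated by a dot.
    major, minor = label.strip().split(".")
    return CorpusVersion(int(major), int(minor))


def same_processing(a: str, b: str) -> bool:
    # Identical major numbers mean the same tools and parameters were used.
    return parse_version(a).major == parse_version(b).major


def is_data_extension(older: str, newer: str) -> bool:
    # Within one major version, a higher minor number means data was added,
    # so the newer corpus is expected to contain the older one.
    old, new = parse_version(older), parse_version(newer)
    return old.major == new.major and new.minor > old.minor
```

Under these assumptions, `same_processing("2.1", "2.2")` holds, while nothing in the description implies a containment relation between versions with different major numbers, so `is_data_extension` returns `False` for such pairs.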