Available Date: 01 Jun 2013
The AMC was created as part of a public-private cooperation between the Austrian Academy of Sciences and the Austria Press Agency (APA). Thanks to the efforts of APA, the AMC covers a great portion of the Austrian media landscape of the past three decades, comprising a wide range of text types which can be classified as journalistic prose (Austrian newspapers, magazines, press releases). The texts in the AMC are lemmatized and tagged with part-of-speech labels. The corpus can be accessed via a corpus search engine (SketchEngine) on the premises of the institute. Versioning in the collection: Identical major version numbers (e.g. 2.x) indicate, that these versions have been processed using the very same tools and parameters. Increasing minor version numbers (e.g. 2.1, 2.2 ...) indicate the addition of new data. I.e. with increasing version numbers the corpora are monotonically growing.