Property | Value(s) |
---|---|
acdh:hasBinaryUpdatedRole |
admin
|
acdh:hasSpatialCoverage | |
acdh:hasCoverageEndDate |
1914-01-01
|
acdh:hasCoverageStartDate |
1875-01-01
|
acdh:isSourceOf | |
acdh:hasAppliedMethod |
OCR, lemmatization, XML/TEI structural and semantic annotation, part-of-speech-tagging, tokenization
|
acdh:hasArrangement |
1 xml-file represents one printed volume
|
acdh:hasSubject |
historical travel guides, Karl Baedeker
|
acdh:hasTechnicalInfo |
A TEI schema documenting applied annotations is available
|
acdh:hasTitle |
travel!digital Corpus
|
acdh:hasIdentifier |
https://hdl.handle.net/21.11115/0000-000C-29F7-0, https://id.acdh.oeaw.ac.at/traveldigital/Corpus, https://arche.acdh.oeaw.ac.at/api/14006, https://id.acdh.oeaw.ac.at/uuid/268e2e95-d387-f53d-a4ca-041ece2c2f0c
|
acdh:hasDescription |
A collection of XML/TEI transcriptions of early German travel guides on non-European countries which were released by the Baedeker publishing house between 1875 and 1914 (5 volumes, first editions). OCR is based on the collection travel!digital Facsimiles. The texts are tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204. |
acdh:hasCreator | |
acdh:isPartOf | |
acdh:hasPid | |
acdh:hasAvailableDate |
2018-09-03
|
acdh:hasRelatedDiscipline | |
acdh:hasFunder | |
acdh:hasCreatedStartDate |
2014-12-01
|
acdh:hasPrincipalInvestigator | |
acdh:hasDepositor | |
acdh:hasFilename |
Corpus
|
acdh:hasCompleteness |
project completed, no further changes
|
acdh:hasContact | |
acdh:hasLifeCycleStatus | |
acdh:hasAppliedMethodDescription |
Czeitschner: OCR, XML/TEI tagging, manual correction of lemmatization and part-of-speech tagging;
Eisenheld: manual correction of lemmatization and part-of-speech tagging; Krautgartner: tokenization, automatically lemmatization and part-of-speech tagging, interlinking of corpus and thesaurus; Lanig: manual correction of lemmatization and part-of-speech tagging |
acdh:hasLicensor | |
acdh:hasAccessRestrictionSummary |
public: 5
|
acdh:createdBy |
admin
|
acdh:hasHosting | |
acdh:hasLicenseSummary |
CC BY 4.0: 6
|
acdh:hasRightsHolder | |
acdh:hasOaiSet | |
rdf:type | |
acdh:hasCustomCitation |
author = {Czeitschner, Ulrike and Krautgartner, Barbara and Eisenheld, Victoria and Lanig, Laura}
|
acdh:hasNamingScheme |
file names consist of Editor-Title_Year, e.g. Baedeker-Indien_1914.xml
|
acdh:hasUpdatedDate |
2021-07-15T08:54:15.123856
|
acdh:hasUrl | |
acdh:hasActor | |
acdh:hasCreatedEndDate |
2018-01-26
|
acdh:hasVersion |
1
|
acdh:hasExtent |
5 XML-files (5 volumes comprising 3089 printed pages, 1,51 million tokens, 1,21 million running words)
|
acdh:hasCurator | |
acdh:hasUsedSoftware |
<oXygen/> XML Editor 17.1, TreeTagger, ABBYY FineReader 8.0
|
acdh:hasLicense | |
acdh:hasUpdatedRole |
uczeitschner
|
acdh:hasCollectedEndDate |
2017-11-30T00:00:00
|
acdh:hasCollectedStartDate |
2004-04-28T00:00:00
|
acdh:hasNumberOfItems |
5
|
acdh:aclRead |
public
|
acdh:hasLanguage | |
acdh:hasBinarySize |
0.12 GB
|
acdh:hasOwner | |
acdh:hasMetadataCreator |
Available since 03 09 2018
Collection
A collection of XML/TEI transcriptions of early German travel guides on non-European countries which were released by the Baedeker publishing house between 1875 and 1914 (5 volumes, first editions). OCR is based on the collection travel!digital Facsimiles. The texts are tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
A collection of XML/TEI transcriptions of early German travel guides on non-European countries which were released by the Baedeker publishing house between 1875 and 1914 (5 volumes, first editions). OCR is based on the collection travel!digital Facsimiles. The texts are tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
Show Less
Citation / Title | Relation Type |
---|
Title | Relation type | Type |
---|