travel!digital Collection /

Overview
Switch to Expert-View
Copy Resource Link

travel!digital Corpus
Type: Collection
device_hub Principal Investigator(s): Ulrike Czeitschner
person_add Contact(s): Barbara Krautgartner , Ulrike Czeitschner
today Created Start Date: 01 Dec 2014
today Created End Date: 26 Jan 2018
today Available Date: 03 Sep 2018
dehaze Extent: 5 XML-files (5 volumes comprising 3089 printed pages, 1,51 million tokens, 1,21 million running words)
attachment Number of Items: 5
attachment Binary Size: 0.12 GB
copyright License: 48509
copyright Access Restriction: public
device_hub Part of: travel!digital Collection
travel!digital Corpus
Property Value(s)
acdh:aclRead
public
acdh:createdBy
admin
acdh:hasAccessRestriction
acdh:hasActor
acdh:hasAppliedMethod
XML/TEI structural and semantic annotation , lemmatization , tokenization , part-of-speech-tagging , OCR
acdh:hasAppliedMethodDescription
Czeitschner: OCR, XML/TEI tagging, manual correction of lemmatization and part-of-speech tagging;
Eisenheld: manual correction of lemmatization and part-of-speech tagging;
Krautgartner: tokenization, automatically lemmatization and part-of-speech tagging, interlinking of corpus and thesaurus;
Lanig: manual correction of lemmatization and part-of-speech tagging
acdh:hasArrangement
1 xml-file represents one printed volume
acdh:hasAvailableDate
2018-09-03
acdh:hasBinarySize
0.12 GB
acdh:hasBinaryUpdatedDate
2019-12-13T17:11:37.585Z
acdh:hasBinaryUpdatedRole
admin
acdh:hasCollectedEndDate
2017-11-30T00:00:00
acdh:hasCollectedStartDate
2004-04-28T00:00:00
acdh:hasCompleteness
project completed, no further changes
acdh:hasContact
acdh:hasCoverageEndDate
1914-01-01
acdh:hasCoverageStartDate
1875-01-01
acdh:hasCreatedEndDate
2018-01-26
acdh:hasCreatedStartDate
2014-12-01
acdh:hasCreator
acdh:hasDepositor
acdh:hasDerivedPublication
acdh:hasDescription
A collection of XML/TEI transcriptions of early German travel guides on non-European countries which were released by the Baedeker publishing house between 1875 and 1914 (5 volumes, first editions). OCR is based on the collection travel!digital Facsimiles. The texts are tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
acdh:hasExtent
5 XML-files (5 volumes comprising 3089 printed pages, 1,51 million tokens, 1,21 million running words)
acdh:hasFunder
acdh:hasHosting
acdh:hasLanguage
acdh:hasLicense
acdh:hasLicensor
acdh:hasLifeCycleStatus
acdh:hasLocationPath
Corpus
acdh:hasMetadataCreator
acdh:hasNamingScheme
file names consist of Editor-Title_Year, e.g. Baedeker-Indien_1914.xml
acdh:hasNumberOfItems
5
acdh:hasOwner
acdh:hasPid
acdh:hasPrincipalInvestigator
acdh:hasRelatedDiscipline
acdh:hasRightsHolder
acdh:hasSource
acdh:hasSpatialCoverage
acdh:hasSubject
historical travel guides , Karl Baedeker
acdh:hasTechnicalInfo
A TEI schema documenting applied annotations is available
acdh:hasTitle
travel!digital Corpus
acdh:hasUpdatedDate
2019-12-13T17:11:37.585Z
acdh:hasUpdatedRole
admin
acdh:hasUrl
acdh:hasUsedSoftware
TreeTagger , ABBYY FineReader 8.0 , <oXygen/> XML Editor 17.1
acdh:hasVersion
1
acdh:isPartOf
acdh:isReferencedBy
rdf:type
acdh:hasIdentifier

Summary

info_outline Subject(s): historical travel guides , Karl Baedeker
info_outline Spatial Coverage: Mediterranean Sea , Palestine , North America , Constantinople , India , Syria , Asia Minor
info_outline Coverage Start Date: 1875
info_outline Coverage End Date: 1914
info_outline Description: A collection of XML/TEI transcriptions of early German travel guides on non-European countries which were released by the Baedeker publishing house between 1875 and 1914 (5 volumes, first editions). OCR is based on the collection travel!digital Facsimiles. The texts are tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.

Cite Resource

MLA
Copy Citation information copied!
Victoria Eisenheld, Barbara Krautgartner, Laura Lanig, Ulrike Czeitschner. travel!digital Corpus. ARCHE, http://hdl.handle.net/21.11115/0000-000C-29F7-0. Accessed on 25 Nov 2020.

Child Resource(s)

Switch to Tree-View
5 Result(s) Page 1 of 1 Items Sort by
Type: Resource
info A XML/TEI transcription of: Karl Baedeker: Das Mittelmeer. Leipzig, 1909.
OCR is based on the collection Facsimiles: Baedeker, Mittelmeer 1909. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
Show Summary
Type: Resource
info A XML/TEI transcription of: Karl Baedeker: Indien. Leipzig, 1914.
OCR is based on the collection Facsimiles: Baedeker, Indien 1914. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
Show Summary
Type: Resource
info A XML/TEI transcription of: Karl Baedeker: Konstantinopel und Kleinasien. Leipzig, 1905.
OCR is based on the collection Facsimiles: Baedeker, Konstantinopel und Kleinasien 1905. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
Show Summary
Type: Resource
info A XML/TEI transcription of: Karl Baedeker: Nordamerika. Leipzig, 1893.
OCR is based on the collection Facsimiles: Baedeker, Nordamerika 1893. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
Show Summary
Type: Resource
info A XML/TEI transcription of: Karl Baedeker: Palästina und Syrien. Leipzig, 1875.
OCR is based on the collection Facsimiles: Baedeker, Palästina und Syrien 1875. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
Show Summary
5 Result(s) Page 1 of 1 Items Sort by