Overview
Switch to Expert-View
Copy Resource Link

travel!digital Corpus
Type(s): acdh:Collection
device_hub Principal Investigator(s): Ulrike Czeitschner
person_add Contact(s): Ulrike Czeitschner , Barbara Krautgartner
today Created Start Date: 1 Dec 2014
today Created End Date: 26 Jan 2018
today Available Date: 3 Sep 2018
dehaze Extent: 5 XML-files (5 volumes comprising 3089 printed pages, 1,51 million tokens, 1,21 million running words)
attachment Number of Items: 5
attachment Binary Size: 0.12 GB
copyright License: CC BY 4.0
device_hub Part of: travel!digital Collection
travel!digital Corpus
Property Value(s)
acdh:aclRead
public
acdh:createdBy
admin
acdh:hasAccessRestrictionSummary
public 5
acdh:hasActor
acdh:hasAppliedMethod
lemmatization , OCR , part-of-speech-tagging , tokenization , XML/TEI structural and semantic annotation
acdh:hasAppliedMethodDescription
Czeitschner: OCR, XML/TEI tagging, manual correction of lemmatization and part-of-speech tagging;
Eisenheld: manual correction of lemmatization and part-of-speech tagging;
Krautgartner: tokenization, automatically lemmatization and part-of-speech tagging, interlinking of corpus and thesaurus;
Lanig: manual correction of lemmatization and part-of-speech tagging
acdh:hasArrangement
1 xml-file represents one printed volume
acdh:hasAvailableDate
2018-09-03
acdh:hasBinarySize
0.12 GB
acdh:hasBinaryUpdatedRole
admin
acdh:hasCollectedEndDate
2017-11-30T00:00:00
acdh:hasCollectedStartDate
2004-04-28T00:00:00
acdh:hasCompleteness
project completed, no further changes
acdh:hasContact
acdh:hasCoverageEndDate
1914-01-01
acdh:hasCoverageStartDate
1875-01-01
acdh:hasCreatedEndDate
2018-01-26
acdh:hasCreatedStartDate
2014-12-01
acdh:hasCreator
acdh:hasCurator
acdh:hasCustomCitation
author = {Czeitschner, Ulrike and Krautgartner, Barbara and Eisenheld, Victoria and Lanig, Laura}
acdh:hasDepositor
acdh:hasDescription
A collection of XML/TEI transcriptions of early German travel guides on non-European countries which were released by the Baedeker publishing house between 1875 and 1914 (5 volumes, first editions). OCR is based on the collection travel!digital Facsimiles. The texts are tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
acdh:hasExtent
5 XML-files (5 volumes comprising 3089 printed pages, 1,51 million tokens, 1,21 million running words)
acdh:hasFilename
Corpus
acdh:hasFunder
acdh:hasHosting
acdh:hasLanguage
acdh:hasLicense
acdh:hasLicenseSummary
Attribution 4.0 International (CC BY 4.0) 6
acdh:hasLicensor
acdh:hasLifeCycleStatus
acdh:hasMetadataCreator
acdh:hasNamingScheme
file names consist of Editor-Title_Year, e.g. Baedeker-Indien_1914.xml
acdh:hasNumberOfItems
5
acdh:hasOaiSet
acdh:hasOwner
acdh:hasPid
acdh:hasPrincipalInvestigator
acdh:hasRelatedDiscipline
acdh:hasRightsHolder
acdh:hasSpatialCoverage
acdh:hasSubject
historical travel guides , Karl Baedeker
acdh:hasTechnicalInfo
A TEI schema documenting applied annotations is available
acdh:hasTitle
travel!digital Corpus
acdh:hasUpdatedDate
2021-07-15T08:54:15.123856
acdh:hasUpdatedRole
uczeitschner
acdh:hasUrl
acdh:hasUsedSoftware
ABBYY FineReader 8.0 , <oXygen/> XML Editor 17.1 , TreeTagger
acdh:hasVersion
1
acdh:isPartOf
acdh:isSourceOf
rdf:type
acdh:hasIdentifier

Inverse Data

Property Value(s)

Summary

info_outline Subject(s): historical travel guides , Karl Baedeker
info_outline Spatial Coverage: Asia Minor , Constantinople , India , Mediterranean Sea , North America , Palestine , Syria
info_outline Coverage Start Date: 1875
info_outline Coverage End Date: 1914
info_outline Description: A collection of XML/TEI transcriptions of early German travel guides on non-European countries which were released by the Baedeker publishing house between 1875 and 1914 (5 volumes, first editions). OCR is based on the collection travel!digital Facsimiles. The texts are tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.

Cite Resource

Child Resource(s)

Switch to Tree-View
5 Result(s) Page 1 of 1 Items Sort by
Type: acdh:Resource
today Available Date: 3 Sep 2018
Show Summary Hide Summary
info A XML/TEI transcription of: Karl Baedeker: Das Mittelmeer. Leipzig, 1909.
OCR is based on the collection Facsimiles: Baedeker, Mittelmeer 1909. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
Type: acdh:Resource
today Available Date: 3 Sep 2018
Show Summary Hide Summary
info A XML/TEI transcription of: Karl Baedeker: Indien. Leipzig, 1914.
OCR is based on the collection Facsimiles: Baedeker, Indien 1914. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
Type: acdh:Resource
today Available Date: 3 Sep 2018
Show Summary Hide Summary
info A XML/TEI transcription of: Karl Baedeker: Konstantinopel und Kleinasien. Leipzig, 1905.
OCR is based on the collection Facsimiles: Baedeker, Konstantinopel und Kleinasien 1905. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
Type: acdh:Resource
today Available Date: 3 Sep 2018
Show Summary Hide Summary
info A XML/TEI transcription of: Karl Baedeker: Nordamerika. Leipzig, 1893.
OCR is based on the collection Facsimiles: Baedeker, Nordamerika 1893. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
Type: acdh:Resource
today Available Date: 3 Sep 2018
Show Summary Hide Summary
info A XML/TEI transcription of: Karl Baedeker: Palästina und Syrien. Leipzig, 1875.
OCR is based on the collection Facsimiles: Baedeker, Palästina und Syrien 1875. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
5 Result(s) Page 1 of 1 Items Sort by