Overview
travel!digital Corpus
Type(s):
acdh:Collection
fingerprint
PID:
https://hdl.handle.net/21.11115/0000-000C-29F7-0
device_hub
Principal Investigator(s):
Ulrike Czeitschner
person_add
Contact(s):
Ulrike Czeitschner
,
Barbara Krautgartner
people
Creator(s):
Ulrike Czeitschner
,
Victoria Eisenheld
,
Barbara Krautgartner
,
Laura Lanig
today
Created Start Date:
1 Dec 2014
today
Created End Date:
26 Jan 2018
today
Available Date:
3 Sep 2018
dehaze
Extent:
5 XML-files (5 volumes comprising 3089 printed pages, 1,51 million tokens, 1,21 million running words)
attachment
Number of Items:
5
attachment
Binary Size:
0.12 GB
copyright
Licensor:
Austrian Centre for Digital Humanities and Cultural Heritage
copyright
License:
CC BY 4.0
copyright
Owner:
Austrian Centre for Digital Humanities and Cultural Heritage
label
Identifier(s):
https://arche.acdh.oeaw.ac.at/api/14006
,
https://hdl.handle.net/21.11115/0000-000C-29F7-0 ,
https://id.acdh.oeaw.ac.at/traveldigital/Corpus ,
https://id.acdh.oeaw.ac.at/uuid/268e2e95-d387-f53d-a4ca-041ece2c2f0c
https://hdl.handle.net/21.11115/0000-000C-29F7-0 ,
https://id.acdh.oeaw.ac.at/traveldigital/Corpus ,
https://id.acdh.oeaw.ac.at/uuid/268e2e95-d387-f53d-a4ca-041ece2c2f0c
device_hub
Part of:
travel!digital Collection
travel!digital Corpus
Property | Value(s) |
---|---|
acdh:aclRead |
public
|
acdh:createdBy |
admin
|
acdh:hasAccessRestrictionSummary
|
public 5
|
acdh:hasActor
|
|
acdh:hasAppliedMethod
|
lemmatization
,
OCR
,
part-of-speech-tagging
,
tokenization
,
XML/TEI structural and semantic annotation
|
acdh:hasAppliedMethodDescription
|
Czeitschner: OCR, XML/TEI tagging, manual correction of lemmatization and part-of-speech tagging;
Eisenheld: manual correction of lemmatization and part-of-speech tagging; Krautgartner: tokenization, automatically lemmatization and part-of-speech tagging, interlinking of corpus and thesaurus; Lanig: manual correction of lemmatization and part-of-speech tagging |
acdh:hasArrangement
|
1 xml-file represents one printed volume
|
acdh:hasAvailableDate
|
2018-09-03
|
acdh:hasBinarySize
|
0.12 GB
|
acdh:hasBinaryUpdatedRole |
admin
|
acdh:hasCollectedEndDate
|
2017-11-30T00:00:00
|
acdh:hasCollectedStartDate
|
2004-04-28T00:00:00
|
acdh:hasCompleteness
|
project completed, no further changes
|
acdh:hasContact
|
|
acdh:hasCoverageEndDate
|
1914-01-01
|
acdh:hasCoverageStartDate
|
1875-01-01
|
acdh:hasCreatedEndDate
|
2018-01-26
|
acdh:hasCreatedStartDate
|
2014-12-01
|
acdh:hasCreator
|
|
acdh:hasCurator
|
|
acdh:hasCustomCitation
|
author = {Czeitschner, Ulrike and Krautgartner, Barbara and Eisenheld, Victoria and Lanig, Laura}
|
acdh:hasDepositor
|
|
acdh:hasDescription
|
A collection of XML/TEI transcriptions of early German travel guides on non-European countries which were released by the Baedeker publishing house between 1875 and 1914 (5 volumes, first editions). OCR is based on the collection travel!digital Facsimiles. The texts are tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204. |
acdh:hasExtent
|
5 XML-files (5 volumes comprising 3089 printed pages, 1,51 million tokens, 1,21 million running words)
|
acdh:hasFilename |
Corpus
|
acdh:hasFunder
|
|
acdh:hasHosting
|
|
acdh:hasLanguage
|
|
acdh:hasLicense
|
|
acdh:hasLicenseSummary
|
Attribution 4.0 International (CC BY 4.0) 6
|
acdh:hasLicensor
|
|
acdh:hasLifeCycleStatus
|
|
acdh:hasMetadataCreator
|
|
acdh:hasNamingScheme
|
file names consist of Editor-Title_Year, e.g. Baedeker-Indien_1914.xml
|
acdh:hasNumberOfItems
|
5
|
acdh:hasOaiSet
|
|
acdh:hasOwner
|
|
acdh:hasPid
|
|
acdh:hasPrincipalInvestigator
|
|
acdh:hasRelatedDiscipline
|
|
acdh:hasRightsHolder
|
|
acdh:hasSpatialCoverage
|
|
acdh:hasSubject
|
historical travel guides
,
Karl Baedeker
|
acdh:hasTechnicalInfo
|
A TEI schema documenting applied annotations is available
|
acdh:hasTitle
|
travel!digital Corpus
|
acdh:hasUpdatedDate
|
2021-07-15T08:54:15.123856
|
acdh:hasUpdatedRole |
uczeitschner
|
acdh:hasUrl
|
|
acdh:hasUsedSoftware
|
ABBYY FineReader 8.0
,
<oXygen/> XML Editor 17.1
,
TreeTagger
|
acdh:hasVersion
|
1
|
acdh:isPartOf
|
|
acdh:isSourceOf
|
|
rdf:type | |
acdh:hasIdentifier
|
Inverse Data
Property | Value(s) |
---|
Dissemination Services
Summary
info_outline
Related Discipline(s):
Cultural anthropology
,
Cultural history
,
Digital humanities
,
Linguistics and Literature
info_outline
Subject(s):
historical travel guides
,
Karl Baedeker
info_outline
Spatial Coverage:
Asia Minor
,
Constantinople
,
India
,
Mediterranean Sea
,
North America
,
Palestine
,
Syria
info_outline
Coverage Start Date:
1875
info_outline
Coverage End Date:
1914
info_outline
Description:
A collection of XML/TEI transcriptions of early German travel guides on non-European countries which were released by the Baedeker publishing house between 1875 and 1914 (5 volumes, first editions). OCR is based on the collection travel!digital Facsimiles. The texts are tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
Cite Resource
Copy
Citation information copied!
Type:
acdh:Resource
today
Available Date:
3 Sep 2018
info
A XML/TEI transcription of: Karl Baedeker: Das Mittelmeer. Leipzig, 1909.
OCR is based on the collection Facsimiles: Baedeker, Mittelmeer 1909. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
OCR is based on the collection Facsimiles: Baedeker, Mittelmeer 1909. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
Type:
acdh:Resource
today
Available Date:
3 Sep 2018
info
A XML/TEI transcription of: Karl Baedeker: Indien. Leipzig, 1914.
OCR is based on the collection Facsimiles: Baedeker, Indien 1914. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
OCR is based on the collection Facsimiles: Baedeker, Indien 1914. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
Type:
acdh:Resource
today
Available Date:
3 Sep 2018
info
A XML/TEI transcription of: Karl Baedeker: Konstantinopel und Kleinasien. Leipzig, 1905.
OCR is based on the collection Facsimiles: Baedeker, Konstantinopel und Kleinasien 1905. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
OCR is based on the collection Facsimiles: Baedeker, Konstantinopel und Kleinasien 1905. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
Type:
acdh:Resource
today
Available Date:
3 Sep 2018
info
A XML/TEI transcription of: Karl Baedeker: Nordamerika. Leipzig, 1893.
OCR is based on the collection Facsimiles: Baedeker, Nordamerika 1893. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
OCR is based on the collection Facsimiles: Baedeker, Nordamerika 1893. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
Type:
acdh:Resource
today
Available Date:
3 Sep 2018
info
A XML/TEI transcription of: Karl Baedeker: Palästina und Syrien. Leipzig, 1875.
OCR is based on the collection Facsimiles: Baedeker, Palästina und Syrien 1875. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.
OCR is based on the collection Facsimiles: Baedeker, Palästina und Syrien 1875. The text is tokenized, lemmatized, labelled with part-of-speech-tags (STTS tagset produced with TreeTagger), and manually corrected. Semantic annotation includes personal names of historical, mythological/religious and literary figures (except occurences in street names and company names), dates, selected designations of groups and sights.
Basic XML annotation was done at the Austrian Academy of Sciences (AAC-Austrian Academy Corpus). Transformation to TEI/P5, linguistic and semantic annotation was done within the GO!DIGITAL 1.0 project "travel!digital. Exploring People and Monuments in Baedeker Guidebooks (1875-1914)", Project-Nr.: ÖAW0204.