Overview
Switch to Expert-View
Copy Resource Link

DIGITARIUM
Type(s): acdh:TopCollection
title Alternative Title: Wienerisches DIGITARIUM
person_add Contact(s): Claudia Resch
people Creator(s): Dario Kampkaspar , Claudia Resch
person Contributor(s): Nora Fischer , Nina Claudia Rastinger
today Created Start Date: 1 Mar 2017
today Created End Date: 29 Feb 2020
today Available Date: 23 Dec 2022
attachment Number of Items: 5783
attachment Binary Size: 13.43 GB
DIGITARIUM
Property Value(s)
acdh:aclRead
pandorfer
acdh:aclWrite
pandorfer
acdh:createdBy
pandorfer
acdh:hasAccessRestrictionSummary
public 5781
acdh:hasAlternativeTitle
Wienerisches DIGITARIUM
acdh:hasAppliedMethodDescription
The methodical approach used for this collection follows a sequence of steps to be taken for every issue: Once digital pages provided by ANNO - Austrian Newspaper Online have been obtained, they were subjected to preprocessing and automatic deskewing. The recognition phase with layout analysis and text recognition relied on the Transkribus software. As results in recognizing broken scripts like German blackletter typeface are usually far from satisfactory, the project started out with a few issues that were transcribed completely by hand and used to train an initial model. Subsequently, the following issues to be processed were first recognized by the software and then corrected to an accuracy of around 99.7%. Each such corrected set then served as a training and test data set for a new model that was applied to the next batch of issues. The current model specifically trained for the Diarium generates text with an error rate of less than 1 per 100 characters within a standard paragraph from a good quality image.
In a next step the transcribed full-text was exported as a single TEI XML file per issue. Some post-processing was then required to prepare the text for publishing, e.g.the application of a basic whitespace tokenization in order to be able to address every “word” in the text with a unique identifier. Additionally, the pixel coordinates of the text regions were used to find their relative position within a page. This was done by applying a series of XSLT transformations to the files exported from Transkribus. As soon as all automated processes were completed, the results were checked visually and uploaded to the project’s web application DIGITARIUM.
acdh:hasAvailableDate
2022-12-23
acdh:hasBinarySize
13.43 GB
acdh:hasCollectedEndDate
2020-02-29
acdh:hasCollectedStartDate
2017-03-01
acdh:hasCompleteness
Project completed, no further changes.
acdh:hasContact
acdh:hasContributor
acdh:hasCoverageEndDate
1799-12-31
acdh:hasCoverageStartDate
1703-01-01
acdh:hasCreatedEndDate
2020-02-29
acdh:hasCreatedStartDate
2017-03-01
acdh:hasCreator
acdh:hasCurator
acdh:hasCustomCitation
author = {Resch, Claudia and Kampkaspar, Dario}
acdh:hasDepositor
acdh:hasDescription
Wienerisches DIGITARIUM is a digital collection of more than 300 transcribed full text issues of the “Wien[n]erisches Diarium”, a historical newspaper that was founded in Vienna in 1703 and is still published under the title “Wiener Zeitung”. The issues provided as facsimiles and in XML/TEI P5 including different layers of annotation are evenly distributed over the 18th century and offer a reliable basis for a wide range of research interests.
The collection was created within the project “Das Wien[n]erische Diarium: Digitaler Datenschatz für die geisteswissenschaftlichen Disziplinen“ (PI: Claudia Resch) which was funded by the “go!digital2.0” program of the Austrian Academy of Sciences and carried out at the Austrian Centre for Digital Humanities and Cultural Heritage (ACDH-CH) from 1 March 2017 to 29 February 2020 (Project-Nr. GD 2016/16, ÖAW 0704).
acdh:hasDigitisingAgent
acdh:hasEditorialPractice
In order to create a scientifically sound basis for philological interpretation and allow as many research interests as possible, only complete issues have been included. Normalising interventions were preferably avoided and the historical language was reproduced as close to the printed original as possible. Hence, the typography of the original was largely retained, i.e. "u" and "v" or "i" and "j" have been preserved as well as ligatures, small caps or the change between Fraktur and Antiqua printing – only the differentiation of the two variants of "s" and "r" (so-called "long s" and "round r") was omitted. Consonantal ligatures, such as those found in "tz", "ct", "st" or "ff", are resolved in the transcribed text. Double hyphens (in compositions such as "Reichs=Raht") are represented by equal signs, since these come closest to the print image of the time. Unreadable passages as well as uncertain passages added by the editors have been marked in the transcription with angle brackets.
acdh:hasHosting
acdh:hasLanguage
acdh:hasLicenseSummary
CC0 1.0 Universal (CC0 1.0) Public Domain Dedication 5447
Attribution 4.0 International (CC BY 4.0) 336
acdh:hasLicensor
acdh:hasLifeCycleStatus
acdh:hasMetadataCreator
acdh:hasNonLinkedIdentifier
Austrian Academy of Sciences programme "go!digital 2.0": GD 2016/16
acdh:hasNumberOfItems
5783
acdh:hasOwner
acdh:hasPid
acdh:hasRelatedDiscipline
acdh:hasRightsHolder
acdh:hasSubject
historical newspapers
acdh:hasTitle
DIGITARIUM
acdh:hasUpdatedDate
2023-01-12T13:53:42.199033
acdh:hasUpdatedRole
pandorfer
rdf:type
acdh:hasIdentifier

Inverse Data

Property Value(s)

Summary

info_outline Subject(s): historical newspapers
info_outline Coverage Start Date: 1703
info_outline Coverage End Date: 1799
info_outline Description: Wienerisches DIGITARIUM is a digital collection of more than 300 transcribed full text issues of the “Wien[n]erisches Diarium”, a historical newspaper that was founded in Vienna in 1703 and is still published under the title “Wiener Zeitung”. The issues provided as facsimiles and in XML/TEI P5 including different layers of annotation are evenly distributed over the 18th century and offer a reliable basis for a wide range of research interests.
The collection was created within the project “Das Wien[n]erische Diarium: Digitaler Datenschatz für die geisteswissenschaftlichen Disziplinen“ (PI: Claudia Resch) which was funded by the “go!digital2.0” program of the Austrian Academy of Sciences and carried out at the Austrian Centre for Digital Humanities and Cultural Heritage (ACDH-CH) from 1 March 2017 to 29 February 2020 (Project-Nr. GD 2016/16, ÖAW 0704).

Cite Resource

Child Resource(s)

Switch to Tree-View
3 Result(s) Page 1 of 1 Items Sort by
Type: acdh:Collection
today Available Date: 23 Dec 2022
Show Summary Hide Summary
info The subcollection contains the image files of the newspaper pages scanned by the ÖNB and aligned and/or cropped by the project team as required.
Type: acdh:Resource
today Available Date: 23 Dec 2022
Type: acdh:Collection
today Available Date: 23 Dec 2022
Show Summary Hide Summary
info The subcollection contains the full texts, annotated by the project team in XML/TEI format.
3 Result(s) Page 1 of 1 Items Sort by