Overview
Switch to Expert-View
Copy Resource Link

VOICE 2.0 POS XML
Type(s): acdh:Collection
VOICE 2.0 POS XML
Property Value(s)
acdh:createdBy
dstoxreiter
acdh:hasAccessRestrictionSummary
public 153
acdh:hasAppliedMethod
VOICE, the Vienna-Oxford International Corpus of English, is a one-million-word corpus of naturally-occurring, non-scripted, face-to-face interactions carried out using English as a lingua franca (ELF), i.e. English used as a common means of communication among speakers from different first-language backgrounds. The interactions recorded and transcribed are complete speech events from different domains (educational, leisure, professional) and represent different speech event types (conversation, interview, meeting, panel, press conference, question-answer session, seminar discussion, service encounter, working group discussion, workshop discussion).
acdh:hasAppliedMethodDescription
VOICE is based on audio-recordings carried out between July 2001 and November 2007, usually using portable mini-disc recorders with external microphones. These audio-recordings capture 151 naturally-occurring, non-scripted, face-to-face interactions involving 753 identified individuals from 49 different first language backgrounds using English as a lingua franca (ELF), i.e. English used as a common means of communication among speakers from different first-language backgrounds. Most of the audio-recordings are supplemented by detailed field notes including information about the nature of the speech event and the interaction taking place as well as about the participants engaging in these ELF interactions. The audio-recordings were transcribed, checked and proof-read by trained transcribers and researchers in accordance with the VOICE mark-up and spelling conventions. See sub-collection Documentation for more information on mark-up and spelling conventions. Details for each electronic text are given in the individual text headers. The principles and practices underlying the selection and design of the corpus are documented in the project and sampling description of the Corpus Header.
acdh:hasArrangement
VOICE 2.0 POS XML is stored in a TEI-based XML format. Each sub-collection represents one of five domains in VOICE (ED: educational, LE: leisure, PB: professional buisness, PO: professional organizational, PR: professional research and science). Domains in VOICE denote socially defined situations or areas of activity. The domain collections contain all resources (i.e. the individual corpus texts, which are transcripts of the speech events) in a TEI-based XML format.
acdh:hasAvailableDate
2022-04-06
acdh:hasBinarySize
0.06 GB
acdh:hasContact
acdh:hasCreatedEndDate
2013-01-31
acdh:hasCreatedStartDate
2005-06-01
acdh:hasCreator
acdh:hasCurator
acdh:hasCustomCitation
year = {2013},
date = {2013-01-31T00:00:00.000000},
author = {Seidlhofer, Barbara and Majewski, Stefan and Osimk-Teasdale, Ruth and Pitzl, Marie-Luise and Radeka, Michael and Dorn, Nora},
version = {},
bookauthor = {},
acdh:hasDepositor
acdh:hasDescription
The most wide-spread contemporary use of English throughout the world is that of English as a lingua franca (ELF), i.e. English used as a common means of communication among speakers from different first-language backgrounds (Seidlhofer 2011). Nevertheless, linguistic descriptions before the mid-2000s focused almost entirely on English as spoken and written by its native speakers. Starting in 2005, the VOICE project sought to redress the balance by compiling the first general corpus capturing spoken ELF interactions as they happen naturally in various contexts. VOICE was designed and compiled to make possible linguistic descriptions of this most common contemporary use of English by providing a corpus of spoken ELF interactions which has been freely accessible to linguistic researchers all over the world since 2009. The Vienna-Oxford International Corpus of English (VOICE) was initially created by Barbara Seidlhofer (founding director) and Angelika Breiteneder, Theresa Klimpfinger, Stefan Majewski, Marie-Luise Pitzl (project researchers) from 2005 to 2011 at the English Department at the University of Vienna. VOICE 1.0 Online was released in 2009, VOICE 1.0 XML in 2011. VOICE POS XML 2.0 was the first part-of-speech tagged version of VOICE and was based on the same data as VOICE 2.0 XML. VOICE 2.0 XML and VOICE POS XML 2.0 included minor revisions with regard to previous versions and were released in 2013. Additional researchers centrally involved in the creation of VOICE 2.0 POS XML were Ruth Osimk-Teasdale, Michael Radeka and Nora Dorn.
acdh:hasHosting
acdh:hasLicense
acdh:hasLicenseSummary
Attribution-NonCommercial-ShareAlike 3.0 Austria (CC BY-NC-SA 3.0 AT) 159
acdh:hasLicensor
acdh:hasMetadataCreator
acdh:hasNote
VOICE 2.0 POS XML metadata is stored in a separate corpus header in TEI-based XML format with a version specific binary. Furthermore, each resource binary (i.e. each XML corpus text) containes metadata relevant to the specific speech event in its TEI header.
acdh:hasNumberOfItems
158
acdh:hasOwner
acdh:hasPid
acdh:hasPrincipalInvestigator
acdh:hasRelatedDiscipline
acdh:hasRightsHolder
acdh:hasSubject
conversation , educational , English as a lingua franca , interaction , interculturality , interview , leisure , lemmatization , meeting , multilingualism , panel , part-of-speech tagging , press conference , professional business , professional organizational , professional research and science , question-answer session , seminar discussion , service encounter , working group discussion , workshop discussion
acdh:hasTitle
VOICE 2.0 POS XML
acdh:hasUpdatedDate
2022-04-06T11:02:09.780325
acdh:hasUpdatedRole
dstoxreiter
acdh:hasUrl
acdh:hasVersion
POS 2.0
acdh:isNewVersionOf
acdh:isPartOf
rdf:type
acdh:hasIdentifier

Inverse Data

Property Value(s)

Summary

info_outline Subject(s): conversation , educational , English as a lingua franca , interaction , interculturality , interview , leisure , lemmatization , meeting , multilingualism , panel , part-of-speech tagging , press conference , professional business , professional organizational , professional research and science , question-answer session , seminar discussion , service encounter , working group discussion , workshop discussion
info_outline Description: The most wide-spread contemporary use of English throughout the world is that of English as a lingua franca (ELF), i.e. English used as a common means of communication among speakers from different first-language backgrounds (Seidlhofer 2011). Nevertheless, linguistic descriptions before the mid-2000s focused almost entirely on English as spoken and written by its native speakers. Starting in 2005, the VOICE project sought to redress the balance by compiling the first general corpus capturing spoken ELF interactions as they happen naturally in various contexts. VOICE was designed and compiled to make possible linguistic descriptions of this most common contemporary use of English by providing a corpus of spoken ELF interactions which has been freely accessible to linguistic researchers all over the world since 2009. The Vienna-Oxford International Corpus of English (VOICE) was initially created by Barbara Seidlhofer (founding director) and Angelika Breiteneder, Theresa Klimpfinger, Stefan Majewski, Marie-Luise Pitzl (project researchers) from 2005 to 2011 at the English Department at the University of Vienna. VOICE 1.0 Online was released in 2009, VOICE 1.0 XML in 2011. VOICE POS XML 2.0 was the first part-of-speech tagged version of VOICE and was based on the same data as VOICE 2.0 XML. VOICE 2.0 XML and VOICE POS XML 2.0 included minor revisions with regard to previous versions and were released in 2013. Additional researchers centrally involved in the creation of VOICE 2.0 POS XML were Ruth Osimk-Teasdale, Michael Radeka and Nora Dorn.
info_outline Note: VOICE 2.0 POS XML metadata is stored in a separate corpus header in TEI-based XML format with a version specific binary. Furthermore, each resource binary (i.e. each XML corpus text) containes metadata relevant to the specific speech event in its TEI header.

Cite Resource

Child Resource(s)

Switch to Tree-View
7 Result(s) Page 1 of 1 Items Sort by
Type: acdh:Collection
today Available Date: 6 Apr 2022 Version: POS 2.0
Show Summary Hide Summary
info The educational domain includes all social situations connected with institutions or people involved in teaching, training or studying.
Type: acdh:Collection
today Available Date: 6 Apr 2022 Version: POS 2.0
Show Summary Hide Summary
info The leisure domain includes all social situations occurring during the time that is spent doing something one chooses to do when one is not working or studying.
Type: acdh:Collection
today Available Date: 6 Apr 2022 Version: POS 2.0
Show Summary Hide Summary
info The professional business domain includes all social situations connected with activities of making, buying, selling or supplying goods or services for money.
Type: acdh:Collection
today Available Date: 6 Apr 2022 Version: POS 2.0
Show Summary Hide Summary
info The professional organizational domain includes all social situations connected with activities of international organizations or networks which are not doing research or business.
Type: acdh:Collection
today Available Date: 6 Apr 2022 Version: POS 2.0
Show Summary Hide Summary
info The professional research/science domain includes all social situations connected with the careful study of a subject, especially in order to discover new facts or information about it.
Type: acdh:Metadata
today Available Date: 6 Apr 2022 Version: POS 2.0
Show Summary Hide Summary
info TEI/XML conform corpus header containing metadata of the underlying TEI/XML resources.
Type: acdh:Resource
today Available Date: 6 Apr 2022
7 Result(s) Page 1 of 1 Items Sort by