International Corpus of English


15 September, 2020
The current VOICE Online interface was moved to a new server provided by the Austrian Center for Digital Humanities and Cultural Heritage at the Austrian Academy of Sciences. For users of VOICE, this means continued, more stable and reliable access to VOICE Online and VOICE POS Online until the new VOICE Online interface that is developed as part of the VOICE CLARIAH project is ready in summer 2021. To access VOICE Online click HERE.
30 May, 2014
The Part-Of-Speech Tagging and Lemmatization Manual has been updated to correct some errata (1st revised version, May 2014).
1 October, 2013
We are delighted to announce that Cornelia Hülmbauer, who has collaborated with the team from the very beginning in various ways, has completed her PhD studies and successfully defended her PhD thesis The real, the virtual and the plurilingual. Congratulations!
17 September, 2013
The VOICE team announces that, after eight years, the VOICE project as funded by the Austrian Science Fund has come to an end. The corpus will of course continue to be available, online and for download, and we hope that you will continue to use it. Please do let us know when you produce and/or publish studies for which you have used VOICE. You are welcome to contact VOICE. The end of the VOICE project was celebrated with many friends and supporters. For this occasion, Prof. Henry Widdowson composed a poem, which is available here.

The VOICE team celebrates 8 years of successful project work

May 28, 2013
The Austrian newspaper "der Standard" reports on a selection of the ‘most important’ publications and researchers of the last 10 years in Austria, as nominated by the Austrian Science Fund. Barbara Seidlhofer’s monograph “Understanding English as a lingua franca” and her research are among those listed and are commented on in the newspaper article. For those who read German, here is the link :-)
February 18, 2013
An updated version of VOICE 2.0 XML is available for download. The name of the download package has changed to "". This version includes some corrections to the tagset description of the headers, as well as a number of previously missing tokens for pronunciation variations and coinages. We recommend replacing previously downloaded versions with this version of VOICE XML. Many thanks to Ray Carey for his helpful feedback.
February 5, 2013
The VOICE XML download package has been updated to version 2.0.1. This update includes minor changes which concern the format of the derived text files.
January 22, 2013
The VOICE Team are delighted to announce further developments of the VOICE Corpus.
The following new versions are now available:
VOICE POS Online 2.0 and VOICE POS XML are the first part-of-speech tagged and lemmatized versions of VOICE. VOICE is thus the first publicly accessible corpus of spoken ELF interactions to be annotated in this way. VOICE POS Online 2.0 is made available online for all registered corpus users. VOICE POS XML 2.0 is available for download as part of the XML download package, together with VOICE 2.0 XML.
VOICE 2.0 Online and VOICE 2.0 XML are updated versions of the corpus with further revisions in the corpus texts.

The VOICE Team celebrates the release of VOICE POS and the update to versions 2.0

April 24, 2012
Barbara Seidlhofer gives an interview on the Austrian radio station "Ö1"'s programme "Von Tag zu Tag" on the topic of the ownership of English and ELF (listen here).
November 15, 2011
The VOICE team are delighted to announce that VOICE XML has been incorporated into the Oxford Text Archive (OTA) and is now available for download there.
October 21, 2011
Barbara Seidlhofer gives an invited keynote address on English as a Lingua Franca as negotiable language at a conference entitled 'Inglese Lingua Franca nell’ambito delle Relazioni Internazionali' in Venice.
September 25, 2011
Stefan Majewski has completed his Master's degree with the thesis entitled Design and implementation of a research infrastructure for a corpus of spoken ELF. Congratulations!
September 10, 2011
Barbara Seidlhofer gives an invited plenary address with the title '"Proficiency in English": the ELF perspective' at the ATECR/IATEFL Conference in Brno.
August 30-31, 2011
Barbara Seidlhofer gives an invited lecture and seminar entitled 'ELF (English as a Lingua Franca): new perspectives on English' at Waseda University in Tokyo.
August 23-28, 2011
Barbara Seidlhofer gives an invited plenary address entitled 'Anglophone-centric attitudes and the globalization of English' at AILA in Beijing. Stefan Majewski, Ruth Osimk-Teasdale and Marie-Luise Pitzl also attend AILA Beijing and give paper presentations on VOICE and ELF within the symposia 'East-West dialogue on English as a Lingua Franca, Part Two: ELF in Europe', convened by Jennifer Jenkins and Anna Mauranen and 'The VOICEs of Europe and Asia: diversity of data but harmony in approach', convened by Barbara Seidlhofer and Andy Kirkpatrick (see Presentations).
July 26-27, 2011
Barbara Seidlhofer gives an invited plenary address on translanguaging in ELF and takes part in a Round Table on 'Literacy, language teaching and social inclusion' at the IX Congresso Brasileiro de Linguistica Aplicada in Rio de Janeiro (see Presentations).
July 3-8, 2011
Marie-Luise Pitzl attends the 12th International Pragmatics Conference in Manchester and gives an individual paper presentation on the functions of creative idioms and metaphors in ELF (see Presentations).
June 1-5, 2011
Stefan Majewski, Ruth Osimk and Michael Radeka attend ICAME 32 and give individual papers on VOICE XML and Part of Speech Tagging ELF data with TBL (see Presentations).
May 26-28, 2011
The VOICE Team attend the 4th International Conference of English as a Lingua Franca and give individual presentations and poster presentations there (see Presentations).
May 9, 2011
The Austrian news magazine Profil publishes an article entitled "Whatever works", based on an interview with Barbara Seidlhofer.
May 5, 2011
Barbara Seidlhofer, Stefan Majewski, Ruth Osimk, Marie-Luise Pitzl and Michael Radeka are pleased to announce that VOICE XML is now available for download. VOICE XML is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 3.0 Unported License and includes all corpus texts in XML format as well as derived HTML and TXT versions of the corpus with reduced mark-up. For more information on VOICE XML see Availability.
May 5, 2011
The release of VOICE XML also marks the release of VOICE 1.1, an updated version of the corpus that includes minor revisions in some of the corpus texts. As of 5 May 2011, the web interface of VOICE Online is therefore updated to VOICE 1.1 Online.
April 27, 2011
Barbara Seidlhofer participates in a TV discussion on the role of English in the scientific community on the science forum (alpha-Forum Wissenschaft) of the German TV channel BR alpha.
March 23, 2011
Marie-Luise Pitzl has completed her PhD studies and successfully defended her PhD thesis Creativity in English as a lingua franca: Idiom and metaphor. Congratulations!
December 29, 2010
VoiceScribe 1.0.2 has been released. This version of VoiceScribe fixes several bugs, most notably a bug that prevented audio files with non-English characters from being played.
November 24, 2010
Barbara Seidlhofer and her team members Stefan Majewski, Ruth Osimk, Marie-Luise Pitzl and Michael Radeka are pleased to announce that anonymised audio material is now available for 23 speech events. This material is freely accessible from within the VOICE Online interface after free registration for the VOICE Online services. The audio material covers approximately 22 hours of field recordings, which equals about 20% of the entire corpus. We trust that this new feature will further increase the value of VOICE for research. For detailed information on using the new audio features, please refer to the subsection audio files in Using VOICE Online.
August 31, 2010
Angelika Breiteneder changes her professional focus, but will continue to work freelance for the VOICE project.
August 17, 2010
Michael Radeka joins the VOICE Team. His area of expertise is computational and corpus linguistics. Michael will focus on developing a methodology for annotating VOICE with POS tags by applying different classifier combination systems.
July 1, 2010
Ulrich Greiner publishes an article in Die Zeit entitled "Ist Deutsch noch zu retten?". The article focuses on the future of German, but also addresses the role of English as a lingua franca in this context. Parts of the article are based on an interview with Barbara Seidlhofer.
June 30 - July 3, 2010
Angelika Breiteneder and Ruth Osimk attend the 9th Teaching and Language Corpora Conference in Brno, Czech Republic. Angelika Breiteneder gives a paper on ‘ELF and ELT: Insights from VOICE’ (see Presentations).
May 26-30, 2010
Stefan Majewski and Ruth Osimk attend ICAME 31. Stefan Majewski gives an individual paper on CorpusQuery, the infrastructure behind VOICE. Ruth Osimk gives a work-in-progress paper on the theoretical considerations of POS tagging VOICE (see Presentations).
May 22-25, 2010
The 3rd International Conference of English as a Lingua Franca takes place at the University of Vienna. Angelika Breiteneder and Marie-Luise Pitzl give invited papers at young researchers' plenary panels; Stefan Majewski and Ruth Osimk give individual visual presentations (for all, see Presentations).
March 12-14, 2010
Stefan Majewski attends the xmlprague conference and wins the FLWOR Foundation's XQuery competition.
March 7, 2010
Lucy Pickering and Barbara Seidlhofer chair a colloquium on "English as a Lingua Franca: Recent Theoretical and Empirical Developments" at the AAAL 2010 Annual Conference in Atlanta, Georgia. The colloquium includes presentations by Angelika Breiteneder and Marie-Luise Pitzl (see Presentations).
February 25-27, 2010
Barbara Seidlhofer and Henry Widdowson give an invited keynote on the use of English as a lingua franca in education at the APAC ELT Convention in Barcelona.
February 2, 2010
Austrian Radio Ö1 broadcasts an interview with Barbara Seidlhofer and Marie-Luise Pitzl as part of the programme 'Dimensions – The world of science'.
December 1st-2nd, 2009
Marie-Luise Pitzl gives two lectures on ‘English as a Lingua Franca: exploiting a linguistic resource’ and ‘Researching English as a Lingua Franca’ at the University of Verona, Department for English, German and Slavic Studies.
November, 2009
The VOICE Team is currently working on adding features to the corpus, especially investigating the possibility of extending the mark-up with POS-tagging and preparing selected sound files for release.
October 12th-16th, 2009
Marie-Luise Pitzl and Stefan Majewski hold a VOICE workshop on 'ELF Corpus Building' at the Research Centre into Language Education and Acquisition in Multilingual Societies (RCLEAMS) at the Hong Kong Institute of Education.
September 23rd, 2009
Barbara Seidlhofer gives a lecture on English as a Lingua Franca in relation to the conference theme ‘More security by means of multilingualism’ at the Linguistic Symposium of the Language Institute of the Austrian Armed Forces (AAF) in Vienna (see Presentations).
June 26th, 2009

Barbara Seidlhofer gives a lecture about ELF and the creation of VOICE at the faculty of philology and culture (see Presentations).

This event marks the opening of the exhibition of the faculty of philology and culture "Die Philologisch-Kulturwissenschaftliche Fakultät im Spiegel ihrer Forschung (Projekte - Plattformen - Kollegs)", where the VOICE-project is represented with a poster.

The exhibition can be visited from June 26th until July 17th in the main university building.

June 15th, 2009

Ruth Osimk joins the VOICE team. She focuses on POS tagging for ELF with the aim of extending the mark-up of the VOICE corpus.

June 1st, 2009

The 3rd International Conference of English as a Lingua Franca will take place in Vienna, May 22-25, 2010. Please visit the conference website for updates.

May 22nd, 2009

Barbara Seidlhofer and her team members Angelika Breiteneder, Theresa Klimpfinger, Stefan Majewski and Marie-Luise Pitzl are delighted to announce that the Vienna-Oxford International Corpus of English (VOICE) 1.0 Online is now available as a free-of-charge resource for non-commercial research purposes.

VOICE 1.0 Online, with (just over) one million words, breaks new ground in that it is the first corpus of English as a lingua franca (ELF) to become publicly available.

In order to access VOICE 1.0 Online, register for your personal user account and log in to search or browse VOICE.

The publication of VOICE 1.0 Online is also accompanied by corpus documentation, statistics and detailed user instructions for the online search interface, all of which can be accessed via this website.

If you are looking for older news items, please see our News History