DEVELOPING SOFTWARE FOR CORPUS RESEARCH

Oliver Mason

DEVELOPING SOFTWARE FOR CORPUS RESEARCH

Authors

Oliver Mason

Keywords: corpus linguistics, corpus analysis, developing software, programming

Abstract

Despite the central role of the computer in corpus research, programming is generally not seen as a core skill within corpus linguistics. As a consequence, limitations in software for text and corpus analysis slow down the progress of research while analysts often have to rely on third party software or even manual data analysis if no suitable software is available. Apart from software itself, data formats are also of great importance for text processing. But again, many practitioners are not very aware of the options available to them, and thus idiosyncratic text formats often make sharing of resources difficult if not impossible. This article discusses some issues relating to both data and processing which should aid researchers to become more aware of the choices available to them when it comes to using computers in linguistic research. It also describes an easy way towards automating some common text processing tasks that can easily be acquired without knowledge of actual computer programming.

Downloads

Download data is not yet available.

Metrics

Views/Downloads

Abstract
354
PDF
264

Author Biography

Oliver Mason

University of Birmingham

How to Cite

Mason, O. (2008). DEVELOPING SOFTWARE FOR CORPUS RESEARCH. International Journal of English Studies, 8(1), 141–156. Retrieved from https://revistas.um.es/ijes/article/view/49141

Download Citation

Issue

Vol. 8 No. 1 (2008): Monograph: Software-aided Analysis of Language

Section

Articles

The works published in this journal are subject to the following terms:

1. The Publications Services at the University of Murcia (the publisher) retains the property rights (copyright) of published works, and encourages and enables the reuse of the same under the license specified in item 2.

2. The works are published in the electronic edition of the magazine under a Creative Commons Attribution Non-commercial Share Alike 4.0.

3.Conditions of self-archiving. Authors are encouraged to disseminate pre-print (draft papers prior to being assessed) and/or post-print versions (those reviewed and accepted for publication) of their papers before publication, because it encourages distribution earlier and thus leads to a possible increase in citations and circulation among the academic community.

RoMEO color: green

SUMMARY IN SPANISH

La Universidad de Murcia se reserva los derechos de copyright del material publicado en esta revista bajo la licencia Creative Commons Attribution Non-commercial Share Alike 4.0., aplicable tanto para la reproducción como la reutilización de los materiales contenidos en ella.

DEVELOPING SOFTWARE FOR CORPUS RESEARCH

Authors

Abstract

Downloads

Author Biography

Oliver Mason

jcr

sjr

fecyt

links

dialnetmetricas

Information