
Contents
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
20.1 Introduction 20.1 Introduction
-
20.2 What Is a Corpus? 20.2 What Is a Corpus?
-
20.2.1 Monolingual, Comparable, and Parallel Corpora 20.2.1 Monolingual, Comparable, and Parallel Corpora
-
20.2.2 Spoken Corpora 20.2.2 Spoken Corpora
-
20.2.3 Research Questions and Corpora 20.2.3 Research Questions and Corpora
-
-
20.3 A History of Corpus Linguistics 20.3 A History of Corpus Linguistics
-
20.4 What Corpora Are in Existence? 20.4 What Corpora Are in Existence?
-
20.5 Web as Corpus 20.5 Web as Corpus
-
20.6 The Exploitation of Corpora in NLP 20.6 The Exploitation of Corpora in NLP
-
20.7 Conclusion 20.7 Conclusion
-
Further Reading and Relevant Resources Further Reading and Relevant Resources
-
References References
-
-
-
-
-
-
-
-
-
-
-
20 Corpora
Get accessTony McEnery is Distinguished Professor of English Language and Linguistics at Lancaster University and Changjiang Chair at Xi’an Jiaotong University, China. He is the author of many books and papers on corpus linguistics, including Corpus Linguistics: Method, Theory and Practice (with Andrew Hardie, Cambridge University Press, 2011). He was founding Director of the ESRC Corpus Approaches to Social Science (CASS) Centre, which was awarded the Queen’s Anniversary Prize for its work on corpus linguistics in 2015.
-
Published:02 May 2016
Cite
Abstract
In this chapter the use of corpora in natural-language processing (NLP) is overviewed. The chapter begins by defining what a corpus is. In doing so it introduces different types of corpora such as monolingual, parallel and comparable corpora. It also discusses key issues in corpus design, notably balance and representativeness. The chapter then overviews the history of corpus linguistics, from its early beginnings in the pre computer age to its current digital form. Following this there is a brief survey of the current state of corpora, taking into account recent innovations in corpus construction, notably the development of the notion of the ‘Web as corpus’. The chapter concludes by briefly considering the use of corpora in a range of NLP systems.
Sign in
Personal account
- Sign in with email/username & password
- Get email alerts
- Save searches
- Purchase content
- Activate your purchase/trial code
- Add your ORCID iD
Purchase
Our books are available by subscription or purchase to libraries and institutions.
Purchasing informationMonth: | Total Views: |
---|---|
October 2022 | 11 |
November 2022 | 18 |
December 2022 | 10 |
January 2023 | 14 |
February 2023 | 9 |
March 2023 | 12 |
April 2023 | 5 |
May 2023 | 4 |
June 2023 | 13 |
July 2023 | 6 |
August 2023 | 6 |
September 2023 | 11 |
October 2023 | 11 |
November 2023 | 12 |
December 2023 | 3 |
January 2024 | 10 |
February 2024 | 8 |
March 2024 | 18 |
April 2024 | 19 |
May 2024 | 19 |
June 2024 | 9 |
July 2024 | 5 |
August 2024 | 11 |
September 2024 | 11 |
October 2024 | 6 |
December 2024 | 6 |
January 2025 | 11 |
February 2025 | 2 |
March 2025 | 6 |
April 2025 | 10 |
May 2025 | 4 |
Get help with access
Institutional access
Access to content on Oxford Academic is often provided through institutional subscriptions and purchases. If you are a member of an institution with an active account, you may be able to access content in one of the following ways:
IP based access
Typically, access is provided across an institutional network to a range of IP addresses. This authentication occurs automatically, and it is not possible to sign out of an IP authenticated account.
Sign in through your institution
Choose this option to get remote access when outside your institution. Shibboleth/Open Athens technology is used to provide single sign-on between your institution’s website and Oxford Academic.
If your institution is not listed or you cannot sign in to your institution’s website, please contact your librarian or administrator.
Sign in with a library card
Enter your library card number to sign in. If you cannot sign in, please contact your librarian.
Society Members
Society member access to a journal is achieved in one of the following ways:
Sign in through society site
Many societies offer single sign-on between the society website and Oxford Academic. If you see ‘Sign in through society site’ in the sign in pane within a journal:
If you do not have a society account or have forgotten your username or password, please contact your society.
Sign in using a personal account
Some societies use Oxford Academic personal accounts to provide access to their members. See below.
Personal account
A personal account can be used to get email alerts, save searches, purchase content, and activate subscriptions.
Some societies use Oxford Academic personal accounts to provide access to their members.
Viewing your signed in accounts
Click the account icon in the top right to:
Signed in but can't access content
Oxford Academic is home to a wide variety of products. The institutional subscription may not cover the content that you are trying to access. If you believe you should have access to that content, please contact your librarian.
Institutional account management
For librarians and administrators, your personal account also provides access to institutional account management. Here you will find options to view and activate subscriptions, manage institutional settings and access options, access usage statistics, and more.