Corpus Linguistics



         Jesus Guerrero Perez

    Corpus linguistics is a means of exploring
    actual patterns of language use and a tool for
    developing materials for classroom language
    instruction. It provides an extremely powerful
    tool for the analysis of natural language and
    can offer considerable insight into how
    language use varies across situations, such as
    spoken versus written language or formal
    interactions versus casual conversation.
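
    A minimal sketch of the kind of comparison this makes possible. The two
    mini-samples are invented for illustration (they are not real corpus
    data), and the helper names `tokenize` and `per_thousand` are
    hypothetical. The idea is to count a word in a spoken-style and a
    written-style sample and normalize by sample length, so that samples of
    different sizes can be compared.

```python
import re
from collections import Counter

def tokenize(text: str) -> list[str]:
    """Lowercase the text and split it into simple word tokens."""
    return re.findall(r"[a-z']+", text.lower())

def per_thousand(word: str, tokens: list[str]) -> float:
    """Frequency of `word` normalized per 1,000 tokens."""
    return 1000 * Counter(tokens)[word] / len(tokens)

# Invented mini-samples for illustration only; a real study needs a real corpus.
spoken = "well I mean you know it was well kind of late you know"
written = "The meeting was postponed because the participants arrived late"

for name, text in (("spoken", spoken), ("written", written)):
    tokens = tokenize(text)
    print(name, round(per_thousand("well", tokens), 1))
```

    Normalizing matters because raw counts from samples of different sizes
    are not directly comparable.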
    A corpus is a large, principled collection
    of natural texts. Creating written
    transcripts of spoken language can be quite
    time-consuming and involves a series of
    choices based on the research interests of
    the corpus compilers.
    Corpus design and compilation
    A corpus, as defined above, is a large and
    principled collection of texts stored in
    electronic format.

    Types of corpora
    There are as many types of corpora as there
    are research topics in linguistics. Common
    types include general corpora, specialized
    corpora, and learner corpora.
    Issues in corpus design
    One of the most important factors in corpus
    linguistics is the design of the corpus. A
    corpus of one million words will not be large
    enough to provide reliable information about
    less frequent lexical items. Another issue to
    consider in devising a representative sample
    is whether it should be based on production
    or reception.
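
    The size issue can be illustrated with simple arithmetic. In this rough
    sketch, the rates per million words are hypothetical values chosen for
    illustration, not measurements, and `expected_occurrences` is a
    hypothetical helper. It assumes a word occurs at a constant rate, so the
    expected number of hits scales linearly with corpus size.

```python
def expected_occurrences(rate_per_million: float, corpus_size: int) -> float:
    """Expected number of hits for a word with the given rate per million words."""
    return rate_per_million * corpus_size / 1_000_000

# Hypothetical rates per million words, chosen only for illustration.
hypothetical_rates = {"frequent word": 500.0, "less frequent word": 2.0}

for label, rate in hypothetical_rates.items():
    for size in (1_000_000, 100_000_000):
        hits = expected_occurrences(rate, size)
        print(f"{label}: about {hits:.0f} hits in {size:,} words")
```

    Under these assumptions, a word occurring twice per million words yields
    only a couple of examples in a one-million-word corpus, which is too few
    to generalize from.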

    Corpus compilation
    When creating a corpus, data collection
    involves obtaining or creating electronic
    versions of the target texts. Collecting written
    data is far less labor-intensive than collecting
    spoken data. The data collection phase of
    building a spoken corpus is lengthy and
    expensive. Most spoken corpora use an
    orthographic transcription system that does not
    attempt to capture prosodic details or phonetic
    variation.
