Hopp til hovedinnhold
Omslagsbilde

Natural Language Processing for Corpus Linguistics

Dunn, Jonathan

Produseres på bestilling

Leveringstid: 3-10 dager

Handlinger

Beskrivelse

Omtale

Corpus analysis can be expanded and scaled up by incorporating computational methods from natural language processing. This Element shows how text classification and text similarity models can extend our ability to undertake corpus linguistics across very large corpora. These computational methods are becoming increasingly important as corpora grow too large for more traditional types of linguistic analysis. We draw on five case studies to show how and why to use computational methods, ranging from usage-based grammar to authorship analysis to using social media for corpus-based sociolinguistics. Each section is accompanied by an interactive code notebook that shows how to implement the analysis in Python. A stand-alone Python package is also available to help readers use these methods with their own data. Because large-scale analysis introduces new ethical problems, this Element pairs each new methodology with a discussion of potential ethical implications.

  • Utgivelsesdato:

    31.03.2022

  • ISBN/Varenr:

    9781009074438

  • Språk:

    Engelsk

  • Forlag:

    Cambridge University Press

  • Innbinding:

    Heftet

  • Fagtema:

    Språk og lingvistikk

  • Serie:

    Elements in Corpus Linguistics

  • Litteraturtype:

    Faglitteratur

  • Sider:

    75

  • Høyde:

    15.2 cm

  • Bredde:

    22.8 cm

Automatic Image Tagging for Corpus Linguistics : A Multimodal Study of News Representations of Islam

Automatic Image Tagging for Corpus Linguistics : A Multimodal Study of News Representations of Islam

Schmuck, Hanna • Qian, Yufang • Baker, Paul
9781009581257 Heftet
31.07.2025
Engelsk

Forventes utgitt
Lexical Multidimensional Analysis : Identifying Discourses and Ideologies

Lexical Multidimensional Analysis : Identifying Discourses and Ideologies

Fitzsimmons-Doolan, Shannon • Sardinha, Tony Berber
9781009335690 Heftet
27.02.2025
Engelsk

Produseres på bestilling
Social Group Representation in a Diachronic News Corpus

Social Group Representation in a Diachronic News Corpus

Elmerot, Irene
9781009014212 Heftet
06.02.2025
Engelsk

Produseres på bestilling
Programming for Corpus Linguistics with Python and Dataframes

Programming for Corpus Linguistics with Python and Dataframes

Keller, Daniel
9781108822589 Heftet
20.06.2024
Engelsk

Produseres på bestilling
Collocations, Corpora and Language Learning

Collocations, Corpora and Language Learning

Szudarski, Pawel
9781108994798 Heftet
20.07.2023
Engelsk

Produseres på bestilling
Corpus-Assisted Discourse Studies

Corpus-Assisted Discourse Studies

Mautner, Gerlinde • Baker, Paul • Gillings, Mathew
9781009168151 Heftet
06.04.2023
Engelsk

Produseres på bestilling
Shaping Writing Grades : Collocation and Writing Context Effects

Shaping Writing Grades : Collocation and Writing Context Effects

McCallum, Lee • Durrant, Philip
9781009074445 Heftet
08.09.2022
Engelsk

Produseres på bestilling
Analysing Language, Sex and Age in a Corpus of Patient Feedback : A Comparison of Approaches

Analysing Language, Sex and Age in a Corpus of Patient Feedback : A Comparison of Approaches

Brookes, Gavin • Baker, Paul
9781009013772 Heftet
21.07.2022
Engelsk

Produseres på bestilling
The Impact of Everyday Language Change on the Practices of Visual Artists

The Impact of Everyday Language Change on the Practices of Visual Artists

Hocking, Darryl
9781009225731 Heftet
19.05.2022
Engelsk

Produseres på bestilling
Computational Construction Grammar : A Usage-Based Approach

Computational Construction Grammar : A Usage-Based Approach

Dunn, Jonathan
9781009507608 Innbundet
06.06.2024
Engelsk

Produseres på bestilling
Computational Construction Grammar : A Usage-Based Approach

Computational Construction Grammar : A Usage-Based Approach

Dunn, Jonathan
9781009233767 Heftet
06.06.2024
Engelsk

Produseres på bestilling