Udvidet returret til d. 31. januar 2025

Programming for Corpus Linguistics with Python and Dataframes - Daniel (Western Kentucky University) Keller - Bog

Bag om Programming for Corpus Linguistics with Python and Dataframes

This Element offers intermediate or experienced programmers algorithms for Corpus Linguistic (CL) programming in the Python language using dataframes that provide a fast, efficient, intuitive set of methods for working with large, complex datasets such as corpora. This Element demonstrates principles of dataframe programming applied to CL analyses, as well as complete algorithms for creating concordances; producing lists of collocates, keywords, and lexical bundles; and performing key feature analysis. An additional algorithm for creating dataframe corpora is presented including methods for tokenizing, part-of-speech tagging, and lemmatizing using spaCy. This Element provides a set of core skills that can be applied to a range of CL research questions, as well as to original analyses not possible with existing corpus software.

Vis mere
  • Sprog:
  • Engelsk
  • ISBN:
  • 9781009486781
  • Indbinding:
  • Hardback
  • Sideantal:
  • 114
  • Udgivet:
  • 20. juni 2024
  • Størrelse:
  • 152x229x8 mm.
  • Vægt:
  • 306 g.
  • 8-11 hverdage.
  • 9. december 2024
På lager

Normalpris

  • BLACK WEEK

Medlemspris

Prøv i 30 dage for 45 kr.
Herefter fra 79 kr./md. Ingen binding.

Beskrivelse af Programming for Corpus Linguistics with Python and Dataframes

This Element offers intermediate or experienced programmers algorithms for Corpus Linguistic (CL) programming in the Python language using dataframes that provide a fast, efficient, intuitive set of methods for working with large, complex datasets such as corpora. This Element demonstrates principles of dataframe programming applied to CL analyses, as well as complete algorithms for creating concordances; producing lists of collocates, keywords, and lexical bundles; and performing key feature analysis. An additional algorithm for creating dataframe corpora is presented including methods for tokenizing, part-of-speech tagging, and lemmatizing using spaCy. This Element provides a set of core skills that can be applied to a range of CL research questions, as well as to original analyses not possible with existing corpus software.

Brugerbedømmelser af Programming for Corpus Linguistics with Python and Dataframes



Find lignende bøger
Bogen Programming for Corpus Linguistics with Python and Dataframes findes i følgende kategorier:

Gør som tusindvis af andre bogelskere

Tilmeld dig nyhedsbrevet og få gode tilbud og inspiration til din næste læsning.