Udvidet returret til d. 31. januar 2025

Characteristics of Contemporary Printed Turkish - Gkhan Dalkili - Bog

Bag om Characteristics of Contemporary Printed Turkish

Models of natural languages and language characteristics are widely used in many computer science applications such as data security, language identification, spell checking, data compression, authorship attribution and speech recognition. In the scope of this study, a large scale corpus is created and used to discover language characteristics of Turkish. Word and letter based analyses are made on this corpus to build a base for several NLP studies. In the author identification part, we used two different methods based on word n- grams to identify author of an anonymous text. For 16 authors, training and test set articles are collected, and mentioned two methods are applied on these article sets. Finally, obtained results from two methods are compared with each other and most successful method is determined. This study can help professionals working on author identification, corpus linguistics, n-gram analysis, cryptanalysis, and speech recognition.

Vis mere
  • Sprog:
  • Engelsk
  • ISBN:
  • 9783838385075
  • Indbinding:
  • Paperback
  • Sideantal:
  • 80
  • Udgivet:
  • 13. juli 2010
  • Størrelse:
  • 152x229x5 mm.
  • Vægt:
  • 127 g.
  • 2-3 uger.
  • 17. december 2024
Forlænget returret til d. 31. januar 2025

Normalpris

  • BLACK WEEK

Medlemspris

Prøv i 30 dage for 45 kr.
Herefter fra 79 kr./md. Ingen binding.

Beskrivelse af Characteristics of Contemporary Printed Turkish

Models of natural languages and language characteristics are widely used in many computer science applications such as data security, language identification, spell checking, data compression, authorship attribution and speech recognition. In the scope of this study, a large scale corpus is created and used to discover language characteristics of Turkish. Word and letter based analyses are made on this corpus to build a base for several NLP studies. In the author identification part, we used two different methods based on word n- grams to identify author of an anonymous text. For 16 authors, training and test set articles are collected, and mentioned two methods are applied on these article sets. Finally, obtained results from two methods are compared with each other and most successful method is determined. This study can help professionals working on author identification, corpus linguistics, n-gram analysis, cryptanalysis, and speech recognition.

Brugerbedømmelser af Characteristics of Contemporary Printed Turkish



Find lignende bøger
Bogen Characteristics of Contemporary Printed Turkish findes i følgende kategorier:

Gør som tusindvis af andre bogelskere

Tilmeld dig nyhedsbrevet og få gode tilbud og inspiration til din næste læsning.