Word List

Return to Concordance, Concordance (publishing), Compendium

Snippet from Wikipedia: Word list

A word list is a list of words in a lexicon, generally sorted by frequency of occurrence (either by graded levels, or as a ranked list). A word list is compiled by lexical frequency analysis within a given text corpus, and is used in corpus linguistics to investigate genealogies and evolution of languages and texts. A word which appears only once in the corpus is called a hapax legomena. In pedagogy, word lists are used in curriculum design for vocabulary acquisition. A lexicon sorted by frequency "provides a rational basis for making sure that learners get the best return for their vocabulary learning effort" (Nation 1997), but is mainly intended for course writers, not directly for learners. Frequency lists are also made for lexicographical purposes, serving as a sort of checklist to ensure that common words are not left out. Some major pitfalls are the corpus content, the corpus register, and the definition of "word". While word counting is a thousand years old, with still gigantic analysis done by hand in the mid-20th century, natural language electronic processing of large corpora such as movie subtitles (SUBTLEX megastudy) has accelerated the research field.

In computational linguistics, a frequency list is a sorted list of words (word types) together with their frequency, where frequency here usually means the number of occurrences in a given corpus, from which the rank can be derived as the position in the list.

Writing: Famous Kin of Authors, Writers and Novelists, Cloud Monk's Books (Cloud Monk's Package Manager Book, DevOps for 20 Languages by Cloud Monk and Functional Programming Compare and Contrast 10 Languages by Cloud Monk), Cloud Monk Library, Cloud Monk, Technical Writing, Technical Writing Bibliography, Technical Writing Courses, Writing Tools (AWESOME DokuWiki, Leanpub, DokuWiki, Spell Checker, Grammar Checker - Grammarly, Mind Maps, Outlining), 75 Greatest Books Ever Written, Technical Writing Glossary, Writing Topics, Awesome Writing, GitHub Writing. (navbar_writing - see also navbar_art)


Cloud Monk is Retired ( for now). Buddha with you. © 2025 and Beginningless Time - Present Moment - Three Times: The Buddhas or Fair Use. Disclaimers

SYI LU SENG E MU CHYWE YE. NAN. WEI LA YE. WEI LA YE. SA WA HE.