Zipf's law plot (word frequency as a function of frequency rank) for various texts.

The frequency tables are available at the [https://www.ic.unicamp.br/~stolfi/EXPORT/projects/voynich/Notes/tr-stats/dat/ UNICAMP website]. The languages, texts, and frequency files are:

[[English language|English]]. Text of [[H. G. Wells]]'s novel ''[[The War of the Worlds]]'' (1898), excluding numbers, mapped to lowercase.

* Whole text. Sample: <nowiki>no one would have believed in the last years of the nineteenth century [...] there were already a couple of score of passengers aboard some of</nowiki>. File: engl/wow/tot.1/gud.wfr (original 60293 words, truncated/filtered to 35027 words, ''N'' = 4869 distinct).


The original annotated full texts, before truncation/filtering, are in the companion files */*/org/main.src. The truncated/filtered texts (one word per line, without punctuation) are in */*/gud.tlw.
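
A rank/frequency table of the kind plotted here can be rebuilt from a one-word-per-line text such as the gud.tlw files. The following is a minimal sketch, assuming only that layout; the function name <code>rank_frequency</code> is hypothetical and is not part of the dataset's own tooling, and the layout of the .wfr files is not assumed.

```python
from collections import Counter


def rank_frequency(lines):
    """Return (rank, count, word) tuples, most frequent word first.

    `lines` is any iterable of strings holding one word each, which is
    the layout assumed here for the truncated/filtered texts.
    """
    # Strip line endings and skip blank lines before counting.
    counts = Counter(w for w in (line.strip() for line in lines) if w)
    # Sort by descending count; ties are broken alphabetically so the
    # ranking is deterministic.
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [(rank, count, word)
            for rank, (word, count) in enumerate(ordered, start=1)]


# Small illustration using the opening words of the sample quoted above.
sample = ("no one would have believed in the last years "
          "of the nineteenth century").split()
table = rank_frequency(sample)
```

An open file object can be passed directly as <code>lines</code>. Plotting the count against the rank on logarithmic axes then gives a curve of the kind shown in the image.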