Zipf law plot (frequency as function of frequency rank) for various texts. The languages, texts and the word frequency files are: [[Hebrew language|Hebrew]]. The first five books (''[[Torah]]'', ''Pentateuch'') of the Hebrew Bible (''Tanak''). From the 10th century version (the [[Masoretic text]]) of the original, probably composed mainly around ~500 BCE from earlier texts. From the ''Sacred Texts'' site, maintained by John B. Hare. In an ad-hoc single-byte encoding designed to look vaguely phonetic under an ISO-Latin-1 font. '''With''' vowel points but '''without''' cantillation marks. * Whole text. Sample: ''<nowiki>bĪ°rëĄsđïyą bĪâr⥠Ą°ęlöhïym Ąëą häsĪđâmäyïm w°Ąëą hâĄâręþ w°hâĄâręþ</nowiki>'' [...] ''<nowiki>kĪâlhäyĪâmïym</nowiki>''. File hebr/tav/tot.1/gud.wfr (original 66311 words, truncated/filtered to 35027 words, ''N'' = 12487 distinct). The first five books (''[[Torah]]'', ''Pentateuch'') of the Hebrew Bible (''Tanak''). From the 10th century version (the [[Masoretic text]]) of the original, composed mainly around ~500 BCE from earlier texts. From the ''Sacred Texts'' site, maintained by John B. Hare. In an ad-hoc single-byte encoding designed to look vaguely phonetic under an ISO-Latin-1 font. '''Without''' vowel points and catillation marks. * Whole text. Sample: ''<nowiki>bĪrĄsđyą bĪrĄ Ąlhym Ąą hsĪđmym wĄą hĄrþ whĄrþ hyąh ąhwĪ wbhwĪ wįsđk</nowiki>'' [...] ''<nowiki>lsĪēŋr hþĪhb tmĄ hwĪĄ wĄmbĪŋynyw ŋmd hnĪąq wsēŋr sđįr þmįbĪw</nowiki>''. File hebr/tad/tot.1/gud.wfr (original 66311 words, truncated/filtered to 35027 words, ''N'' = 11856 distinct). The word frequency files '*/*/*/gud.wfr' are available at the [https://www.ic.unicamp.br/~stolfi/EXPORT/projects/voynich/Notes/tr-stats/dat/ UNICAMP website]. The original annotated full texts, before truncation/filtering, are in the companion files */*/org/main.src. The truncated/filtered texts -- one word per line, without punctuation -- are in */*/*/gud.tlw.