Zipf law plot (frequency as function of frequency rank) for various texts.

The languages, texts and the word frequency files are:

[[Latin]]. The first five books (the ''Pentateuch'') from the Latin version (''Vulgate'') of the ''Old Testament'', edited by [[St. Jerome]] around 400 CE.  Converted to lowercase.

* Book 1, ''Genesis''. Sample: ''<nowiki>in principio creavit deus caelum et terram terra autem erat inanis et</nowiki>'' [...] ''<nowiki>conditus aromatibus repositus est in loculo in aegypto</nowiki>''. File latn/ptt/gen.1/gud.wfr (25217 words, ''N'' = 5713 distinct).


* Book 2, ''Exodus''. Sample: ''<nowiki>haec sunt nomina filiorum israhel qui ingressi sunt aegyptum cum iacob</nowiki>'' [...] ''<nowiki>videntibus populis israhel per cunctas mansiones suas</nowiki>''. File latn/ptt/exo.1/gud.wfr (20060 words, ''N'' = 4701 distinct).


* Book 4, ''Numeri''. Sample: ''<nowiki>locutusque est dominus ad mosen in deserto sinai in tabernaculo foederis</nowiki>'' [...] ''<nowiki>iordanem contra hiericho</nowiki>''. File latn/ptt/num.1/gud.wfr (19316 words, ''N'' = 4340 distinct).


* Book 3, ''Leviticus''. Sample: ''<nowiki>vocavit autem mosen et locutus est ei dominus de tabernaculo testimonii</nowiki>'' [...] ''<nowiki>mandavit dominus mosi ad filios israhel in monte sinai</nowiki>''. File latn/ptt/lev.1/gud.wfr (13775 words, ''N'' = 3233 distinct).


* Book 5, ''Deuteronomium''. Sample: ''<nowiki>haec sunt verba quae locutus est moses ad omnem israhel trans iordanem</nowiki>'' [...] ''<nowiki>coram universo israhel</nowiki>''. File latn/ptt/deu.1/gud.wfr (18502 words, ''N'' = 4466 distinct).

The word frequency files '*/*/*/gud.wfr' are available at the [https://www.ic.unicamp.br/~stolfi/EXPORT/projects/voynich/Notes/tr-stats/dat/ UNICAMP website].  The original annotated full texts are in the companion files */*/org/main.src.  The extracted texts -- one word per line, without punctuation -- are in */*/*/gud.tlw.