Zipf law plot (frequency as function of frequency rank) for various texts.

The frequency tables are available at the website [https://www.ic.unicamp.br/~stolfi/EXPORT/projects/voynich/Notes/tr-stats/dat/ UNICAMP website]. The languages, texts and the frequency files are:

Voynichese, the language of the ''[[Voynich Manuscript]]''. Prose-like parts from Majority Vote version of the text, excluding 'labels'. Extracted from the Landini/Zandbergen Interlinear Transcription 1.6e6.

* 'Cosmological' section, part 1 (page f57v). Sample: <nowiki>sa l y saeos ar okees o d soefchees sos okey defo f o rkedam sh ofol sar [...] d f s y l k l r ar o r t l s d y dar teodar otadal sheky otchody r l</nowiki> File voyn/prs/cos.1/gud.wfr (original 168 words, truncated/filtered to 146 words, ''N'' = 63 distinct).

Labels, titles, word lists, and other isolated words from the Majority Vote version extracted from the Landini/Zandbergen Interlinear Transcription 1.6e6.

* Whole list. Sample: <nowiki>ytoain dairol olkchdal oparairdly otardaly otodaram aralarar ocfhor [...] okeody daiisaly ypary opchytch ypcholdy loralody opchdard oror sheey</nowiki> File voyn/lab/tot.1/gud.wfr (original 1021 words, truncated/filtered to 1003 words, ''N'' = 721 distinct).

The original annotated full texts, before truncation/filtering, are in the companion files */*/org/main.src.  The truncated/filtered texts -- one word per line, without punctuation -- are in */*/gud.tlw.