Hacking at the Voynich manuscript Notebook - volume 6 Warning: these notebooks aren't strictly chronological logs. Sometimes I go back and redo things, clarify comments, delete garbage, etc. Summary of previous notebooks ============================= On 97-07-05 I obtained Landini's interlinear transcription of the VMs, version 1.6 (landini-interln16.evt) from http://sun1.bham.ac.uk/G.Landini/evmt/intrln16.zip I manually extracted from it a homogeneous, full-text sample bio-m-evt.evt, consisting of pages 147-166 (f75r--f84v) of the "biological" section, in Currier's Language B, hand 2. This section includes Currier's and Friedman's transcriptions. Currier's seems to be the most complete of them. The two versions have many differences (affecting 5-10% of the words), and often disagree even in the grouping of symbols: where one sees two words the other sees a single word, what is [A] for one may be [CI] for the other, and so on. So I decided to break all characters doen to individual "logical" strokes, and use one (computer) character to encode each stroke. I called this new encoding "jsa" (Jorge's Super-Analytic). After mapping to jsa, I generated a "consensus" version of the biological section, and got these digraph counts: q o c i l g y s x j u TOT ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- . 1398 965 1877 361 60 . . . . . . 4661 q 1 . 1229 18 . 1 154 . . . 700 . 2103 o 21 486 1 63 1087 1071 . . . . . . 2729 c 4 167 176 6137 1209 232 2114 2921 1019 . . . 13979 i 4 1 1 8 1997 2 . . 560 1616 37 457 4683 l . . . . . . 16 . . . 1566 . 1582 g 52 . 74 2150 4 4 . . . . . . 2284 y 2790 26 2 47 13 43 . . . . . . 2921 s 463 1 99 1013 1 2 . . . . . . 1579 x 827 24 105 488 5 167 . . . . . . 1616 j 46 . 76 2175 6 . . . . . . . 2303 u 453 . 1 3 . . . . . . . . 457 ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 4661 2103 2729 13979 4683 1582 2284 2921 1579 1616 2303 457 40897 Some conclusions we get from this and other data: The valid \i/ sequences are \ij/ \is/ \iis/ \iiu/ \iiiu/ \ix/; the others are likely to be scription or transcription errors. \ci/ and \o/ are lexically similar but distinct glyphs. The suffixes \ij/, \is/, \iiu/, and \iiiu/ are preceded almost exclusively by \ci/ and strictly word-final. It seems plausible that these are errors: \oij/ (4 occurrences) should be \ciij/ ( 32 occurrences) \oiiu/ (2 occurrences) should be \ciiiu/ (109 occurrences) \ciiu/ (4 occurrences) should be \ciiiu/ (109 occurrences) \oiiiu/ (9 occurrences) should be \ciiiiu/ (329 occurrences) \ciiiiiu/ (4 occurrences) should be \ciiiiu/ (329 occurrences) \ciiix/ (2 occurrences) should be \ciix/ (403 occurrences) \ciiis/ (19 occurrences) may also be a misreading of \ciis/ (291 occurrences). \cg/ is always a glyph. \qo/ is a combination that occurs only in word-initial position. \qc/ is likely to be a misreading/miswriting of \qo/. \cy/ is always a glyph, almost certainly a final form of \ci/. \qj/, \lj/, \qg/, \lg/ are glyphs. \cs/ is a glyph closely related to (but distinct from) \c/. \ccg/ is almost always followed by \ci/ or \cy/. Here "glyph" means a group of strokes that can be treated as a single symbol for analysis; it may actually be part of a larger, still unrecognized symbol. Summarizing again: \iiiu/, \iiu/, \iis/, \ij/ The ziggies: strictly final, preceded always by \ci/ or, more rarely, by \o/. \ix/ Usually initial or preceded by \ci/ or \o/; followed by any letter except ziggies and \qo/, \ix/, \is/ \is/ Similar to \ix/ except that it cannot be followed by capitals or \cg/, either. \cy/ Almost always final, but occasionaly followed by other letters. Preceded by about the same letters as \ci/; indeed, it is probably the final form of \ci/. \cg/ May be followed by many letters, most often \cy/ and \ci/. Almost always prededed by \c/, or initial; rarely by \ix/ or \o/. \cs/ Most often followed by \c/, somewhat less often by \o/, \ci/, or word break. Most often initial, but also preceded by \ix/, gallows, \c/, \cy/, \cg/, \is/. \lj/, \qj/ The H-gallows: Very similar to each other, different from the rest, but somewhat similar to the P-gallows. They probably combine with \c/ on both sides to make glyphs. It is very likely that \l/ and \q/ are exactly equivalent. \lg/, \qg/ The P-gallows: Very similar to each other, different from the rest, but somewhat similar to the P-gallows. They probably combine with \c/ on both sides to make glyphs. It is very likely that \l/ and \q/ are exactly equivalent. They may be merely ornate forms of some letter, or several letters (\cg/, perhaps), used mainly in the first line of each paragraph (and perhaps of each page?) \qo/ Strictly initial, almost always followed by a capital. Sometimes misread as \qc/? \ci/ May be followed only by the ziggies, \ix/, or \ir/ only. Often follows a capital, but also \cg/, \cs/, \c/, \ix/, \is/, or word break. \o/ Similar to \a/, but is very often word-initial. Other conclusions: * The manuscript does not appear to use any hyphenation mark. Either words are not broken across lines, which would be unusual, or they are broken without any extra marks. Such word breaks may result in statistical anomalies at the beginning and end of lines. Could this explain Currier's claim that lines are "functional units"? * Note that parsing sequences like \cij/, \ciis/, and \ciiis/ requires some care: the right parsings are c+ij, c+iis, ci+iis. * The parsing of \ciis/ is ambiguous: ci+is or c+iis. Declaring \ciiis/ to be a misreading of \ciis/ would remove the ambiguity. * The parsing of \ciiiu/ is ambiguous, too; but since the \iu/ series does not seem to follow a bare \c/, it seems safe to parse it as ci+iiu. * The gallows characters \qj/ and \lj/ appear to be closely related: for every common word with \lj/, there appears to be a a word with \qj/ that occurs with about 1/4 the frequency. * There seems to be a kinship between the glyphs \cs/ (when not attached to the following \c/s) \ir/, and the gallows \lj/ and \qj/ (also, when unattached). * The same phenomenon can be noted with respect to prefixes containing \cc/ and \csc/: for every word beginning with \cc/, there is a word where the first \cc/ is replaced by \csc/, and practically the same frequency. * There apepars to be much confusion between the suffixes \iu/ and \iiiu/. They are almost surely distinct letters, but in about one half of the cases, Currier sees \iiu/ where Friedman has \iiiu/. * There appears to be much confusions between \o/ and \ci/. The strings of \c/, \cs/, \lj/, \qj/, \lg/, \qg/ must be treated together, after collapsing the glyphs listed above, since there seem to be glyphs consisting of gallows preceded and followed by \c/ or \cc/. When this is taken into account, we can see that a single \c/ is not a glyph, but \cs/ is. In fact, after shrinking \ci/ to `a', \cs/ to `z', the gallows to `H' or `P', the only possible glyphs of the form [czHp]* with length at most 3 are freq glyph ---- ----- 795 H 52 P 152 z 138 cc 70 zc 482 Hc 484 ccc 439 zcc ? 493 Hcc ? 19 cHc 4 cPc The ones marked `?' may be composite, z+cc and H+cc, but this hypothesis does not seem very likely (perhaps they are *sometimes* composite?) The significant strings of length 4 that cannot be parsed into the glyphs above are 20 cHcc 4 cPcc Strings with 4 or more [czHP]'s tend to be quite ambiguous. Looking at the raw texts, it seems that the main source of "?"s is the confusion between "M" and "N" by Currier and/or Friedman. So I decided to map both [N] and [M] (and other lookalikes) to "m". I christened the new encoding "hop". --- fsg2hop ------------------------ #! /n/gnu/bin/gawk -f # Recoding an interlinear file from the FSG alphabet to # my Lossy Ad-hoc Semi-Analytic Fault-Tolerant encoding BEGIN { print "# Output of fsg2hop - Stolfi's Semi-Analytic Fault-Tolerant alphabet" } /^ *$/ { print; next } /^ *#/ { print; next } /^<[^>.;]*>/ { print; next } /^<[^>]*\.[^>]*;[A-Z]> / { curtxt = substr($0,20) # We discard "%" and "!" since the conversion # will destroy synchronism anyway. gsub(/[%!]/, "", curtxt); # First, the conversion from FSG to JSA (Stolfi's super-analytic) gsub(/IIIK/, "iiiij", curtxt); gsub(/IIIL/, "iiiiu", curtxt); gsub(/IIIR/, "iiiis", curtxt); gsub(/IIIE/, "iiiix", curtxt); gsub(/IIE/, "iiix", curtxt); gsub(/IIR/, "iiis", curtxt); gsub(/IIK/, "iiij", curtxt); gsub(/HZ/, "cqjc", curtxt); gsub(/PZ/, "cqgc", curtxt); gsub(/DZ/, "cljc", curtxt); gsub(/FZ/, "clgc", curtxt); gsub(/IE/, "iix", curtxt); gsub(/IR/, "iis", curtxt); gsub(/IK/, "iij", curtxt); gsub(/2/, "cs", curtxt); gsub(/4/, "q", curtxt); gsub(/6/, "cj", curtxt); gsub(/7/, "ig", curtxt); gsub(/8/, "cg", curtxt); gsub(/A/, "ci", curtxt); gsub(/C/, "c", curtxt); gsub(/D/, "lj", curtxt); gsub(/E/, "ix", curtxt); gsub(/F/, "lg", curtxt); gsub(/G/, "cy", curtxt); gsub(/H/, "qj", curtxt); gsub(/I/, "i", curtxt); gsub(/K/, "ij", curtxt); gsub(/L/, "iu", curtxt); gsub(/M/, "iiiu", curtxt); gsub(/N/, "iiu", curtxt); gsub(/O/, "o", curtxt); gsub(/P/, "qg", curtxt); gsub(/R/, "is", curtxt); gsub(/S/, "csc", curtxt); gsub(/T/, "cc", curtxt); gsub(/V/, "?", curtxt); gsub(/Y/, "?", curtxt); # Now, the conversion from JSA to HOP: gsub(/[ql]j/, "H", curtxt); gsub(/[ql]g/, "P", curtxt); gsub(/cs/, "z", curtxt); gsub(/ij/, "k", curtxt); gsub(/ix/, "e", curtxt); gsub(/is/, "r", curtxt); gsub(/iiu/, "n", curtxt); gsub(/y/, "i", curtxt); gsub(/ci/, "a", curtxt); gsub(/cg/, "8", curtxt); gsub(/ir/, "w", curtxt); gsub(/i*n/, "m", curtxt); print (substr($0,1,19) curtxt); next } ------------------------------------ After mapping Currier and Friedman to the "hop" encoding, I created a consensus bio-j-hop.evt. I also created by hand a file bio-j-hop.evj, which is like bio-j-hop.evt except that it has " " instead of "." as word-space, and " //" instead of "-" for end-of-line, and " =" instead of "=" for end-of-paragraph. It allows me to find the page and line numbers of a word, given its "hop" encoding. Extracted the text files: extract-words-from-interlin \ -chars "aocz8HPerqkmw" \ bio-j-hop.evt \ bio-j-hop lines words bytes file ------ ------- --------- ------------ 7670 7670 41815 bio-j-hop.wds 1510 1510 9982 bio-j-hop.dic 5894 5894 33804 bio-j-hop-gut.wds 949 949 6236 bio-j-hop-gut.dic 843 843 2464 bio-j-hop-fun.wds 5 5 24 bio-j-hop-fun.dic 933 933 5547 bio-j-hop-bad.wds 556 556 3722 bio-j-hop-bad.dic Digraph counts: a o c z 8 H P e r q k m w TT ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- . 251 1235 757 912 472 276 86 313 103 1489 . . . 5894 a 3196 2 4 19 26 14 78 2 491 345 5 39 802 23 5046 o 28 5 1 39 6 21 1776 68 1173 240 6 5 19 1 3388 c 10 1059 226 4047 44 1865 408 33 15 4 . . 5 . 7716 z 58 109 90 957 10 3 4 1 1 . . . . . 1233 8 64 2245 50 45 32 1 5 . 5 1 . . . . 2448 H 12 1125 98 1479 47 5 . . 9 . . . 1 . 2776 P 2 20 43 116 17 3 . . . . . . . . 201 e 1121 130 117 216 122 61 227 10 4 2 1 . . . 2011 r 514 90 48 24 15 3 1 . . . . . . . 695 q 1 5 1474 17 2 . 1 1 . . . . . . 1501 k 43 . 1 . . . . . . . . . . . 44 m 822 4 1 . . . . . . . . . . . 827 w 23 1 . . . . . . . . . . . . 24 ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 5894 5046 3388 7716 1233 2448 2776 201 2011 695 1501 44 827 24 33804 Next-symbol probability (× 99): a o c z 8 H P e r q k m w TT -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- . 4 21 13 15 8 5 1 5 2 25 . . . 99 a 63 . . . 1 . 2 . 10 7 . 1 16 . 99 o 1 . . 1 . 1 52 2 34 7 . . 1 . 99 c . 14 3 52 1 24 5 . . . . . . . 99 z 5 9 7 77 1 . . . . . . . . . 99 8 3 91 2 2 1 . . . . . . . . . 99 H . 40 3 53 2 . . . . . . . . . 99 P 1 10 21 57 8 1 . . . . . . . . 99 e 55 6 6 11 6 3 11 . . . . . . . 99 r 73 13 7 3 2 . . . . . . . . . 99 q . . 97 1 . . . . . . . . . . 99 k 97 . 2 . . . . . . . . . . . 99 m 98 . . . . . . . . . . . . . 99 w 95 4 . . . . . . . . . . . . 99 -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- TOT 17 15 10 23 4 7 8 1 6 2 4 0 2 0 99 Previous-symbol probability (× 99): a o c z 8 H P e r q k m w TT -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- . 5 36 10 73 19 10 42 15 15 98 . . . 17 a 54 . . . 2 1 3 1 24 49 . 88 96 95 15 o . . . 1 . 1 63 33 58 34 . 11 2 4 10 c . 21 7 52 4 75 15 16 1 1 . . 1 . 23 z 1 2 3 12 1 . . . . . . . . . 4 8 1 44 1 1 3 . . . . . . . . . 7 H . 22 3 19 4 . . . . . . . . . 8 P . . 1 1 1 . . . . . . . . . 1 e 19 3 3 3 10 2 8 5 . . . . . . 6 r 9 2 1 . 1 . . . . . . . . . 2 q . . 43 . . . . . . . . . . . 4 k 1 . . . . . . . . . . . . . 0 m 14 . . . . . . . . . . . . . 2 w . . . . . . . . . . . . . . 0 -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- TOT 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 Rebuilt .fix.wds, .fix.dic: cat bio-j-hop.wds \ | sed -e '/?/s/^.*$/???/g' \ > .fix.wds cat .fix.wds \ | sort | uniq \ > .fix.dic cat .fix.wds \ | wfreq \ > .fix.frq lines words bytes file ------ ------- --------- ------------ 955 955 6264 .fix.dic 957 2871 17757 .fix.frq 7670 7670 40000 .fix.wds 97-09-10 stolfi =============== Wrote a small script to generate KWIC index for a given word. Here is the result: cat .fix.wds \ | kwic-index -v key=oHae cccHcc8a ??? 8arok // zoeHc8a oHae 8ar oHa oHar oHar oe z // zor zccHa qoHam oHae 8ae oeccc8a 8am ??? ??? ccc8a qoe zccoe oeccc8a // oHae oPae oHcca eoe ??? Hc8a // oHc8a qcHa ??? oea oHae zccc8a qoHcc8a qoHcca ea // zae 8oe zc8a oeHc8a qoHam oHae ??? // = qHor zcc8a ccc8or ccae zcc8 qoHcc8a ??? oHae ccc8ae // 8ae or ccc8a ccca qoHcc8a oeHc8a ??? oHcc8a oHae // zaeHcca zor zcccHca 8am oecca // eccc8a zcc8ae qoHc8a oHae ccc8a qoHae zcc8a ccc8a qoe // 8zcc8a 8cc8a qoHcc8a oHc8a oHae Hc8a oHca ??? // qoHa // zoe Hcc8a zcccHca ??? oHae zcc8a oHar oe ccc8a cHc8a 8am aHc8a aoe zccca qoHcca oHae oe Hcc8a 8aHa // 8zcc8a oe cczca oe ccccHa 8ar oHae 8ae oeccc8a // cHa ??? // ??? zcca oe zcc8a oHae oHccca oHcoe oeccc8a qoe // zcoe zam zcccHa or ??? oHae oeHae qoHar ??? // zor or zccH oHar Pcc8a ??? oHae zcc8a // ??? ezc8a ??? Hzcca 8zc8a oHccar zccH ??? oHae ora // qoHcca qoHccca ??? // zcam zccHca qoea Hzcc8a oHae zccca qoHam zcca ??? // qoHam ccca qoHam oHam oHam oHae oe 8ak // ??? ccoe zcc8a qoHcc8a qoHam o8a ??? oHae // zoe zcccoe oe cccca oeoHccc8a qoHam qoHam ??? qoHam oHae qoHam oHc8a ??? // ??? qoHae 8a cccHca eccca ??? oHae // qoHc8a qoHcc8a ccccHca oeccc8a oe // zoeccca qoHcca eHar oHae oeHcca oHam zcccHca oe // qoHc8a 8aea // Hoe Ham oHae ccc8a qoHar oe zcc8a ccccHca roea // Hzcc8a qoHc8a oeHam oHae cccHca qoHa 8am ??? // = Hzcc8a qoHam zcc8a qoHaz oHae qoPzcc8a qoHa eccc8a ??? // eccc8a ??? Hoe ccca qoHcc8a oHae 8am oe 8ae // cccoe oe oHam // Hor Ham oHae a rccca qoHae oeor am a rccca qoHae oeor am oHae ??? // ??? 8ccc8a qoHae ??? zcc8a qoe ??? oHaw oHae ccc8a 8a // Pccc8a qoHca zccHa // qoHc8a oHam Haw oHae zar oe ??? oeHam ae aHc8a 8ar ??? qoHa aHc8a oHae // 8zcc8a aHcc8a ??? 8am ??? zcc8a qoe zcc8am zcccHca oHae zccHa qoHam ccc8oe // 8zcc8a cccca ram zccz qoHccca qoHam oHae 8a ??? // oram zcc8a ccc8a oHa cc8a 8a ccccHca oHae ccez // Hoe zccoe qoHc8a ccca qzam qoHcca qoHam oHam oHae oHcc8a qoHae 8ak // zam ??? // ??? qocHcoe zcccHca oHae qoHcae zcccHc8a cHae oHc8a qoHc8a 8aezc8a // qoHam ccc8a qoHae oHae ccc8a qoHaez // oazcca qoe qoHam ccoe cccHa ??? ccca oHae ccc8a Hora oHzcc8a qoHca ezcc8a // Poe Har zcc8a qoHc8a oHae zcca qoHar cccHca oHccca qoHccc8a ??? ??? qoHcc8a 8am ccccHca oHae oe Hc8a ccc8a 8oeHc8a oHc8ak // ??? ??? ??? ??? oHae ccc8a ccc8a // ??? oe qoHca qoHcoea // ccae 8am oHae cc8a oHae cccHcor aea // // ccae 8am oHae cc8a oHae cccHcor aea // qoHc8a zcca It seems that "oHae" is often preceded or followed by "ccc8a" or "zcc8a". cat .fix.wds \ | kwic-index -v key=roe oHae8a 8ar oHar oHc8a 8a roe // Hccc8a Pccc8a qoHcca ??? PaHc8a oePccc8a qoHc8a zPcca ccc8a roe ??? oPccc8a qoHc8a // oezcc8a ??? ??? qoHcc8a qoHar oHcc8a roe ??? // ??? oeHcca oe ear // eoe ccca ??? roe 8am oHa qoHa oHaeor zcccHca ror cccca zcccHca qoHam ccc8a roe // acccoe Ham zcca qoHam aHam oeHam zcc8a qoHa 8ccc8a roe oe cHc8a // aHca oHccc8a ccccHa ??? 8ae ??? Pcc8a roe qoHc8a roe // 8ae zcoe 8ae ??? Pcc8a roe qoHc8a roe // 8ae zcoe 8ar oe cccHc8a qoHoe8a // 8oeccc8a eccca roe roe cccca zam ??? // qoHoe8a // 8oeccc8a eccca roe roe cccca zam ??? // = cat .fix.wds \ | kwic-index -v key=zar qoe zcc8a qoe oHam ccar zar oea // qoHzcc8a qoe zcca qa oe Hcca 8am zam zar zcc8a qoHcor oHcc8a // qoHcc8a ??? oHca 8ae aHae ccc8a zar // z ccor zcc8a qoHc8 zcc8a ??? HzccoHcc8a ozccPoez // zar oeHcca zcoHam zccoeoe oHc8a qcHcc8a zaw zcccHca eHcc8a eccc8a // zar zccc8a qoHcc8a qoeHca ecc8a ??? // oeccca qoHca qoHam oecccc8a zar ??? // zam ??? qoHcc8a 8am ccc8a qoe Hcc8a qoHccc8a zar ??? ccccHa 8a // qoHam qoHam oHaecca Hae or ccccHca zar // qoHa cccaqa Ham ccca oe or ??? // ??? zar oe ??? oHcca zcor qoHcca // qoHc8a oHam Haw oHae zar oe ??? oeHam ae oe qoHae zc oe zcaeza // zar oe eoram ccca qoHam o ccc8a eHc8a qoHccc8a qoHc8a cccHc8a zar // zccc8aw oHccc8a qoHcc8a ccc8am cat .fix.wds \ | kwic-index -v key=oeHam ??? 8c8a Hc8a // qoqoHcca oeHam qoe zccc8a qoHcor zccc8a qoHae // ??? qoe ccc8a oqoHam oeHam cccHca qoHae 8ar // qoHae qoHc8a oHc8a 8ae or oHcc8 oeHam // qoHoe oHc8 oHam ccc8 oe // qoHcc8a qoHca 8am oeHam 8ae ccc8a oeoe 8ae ccccHa Ham oe // qor oeHcca oeHam oe cczca oe ccccHa 8ar ??? zccca oe ccc8a oeHar oeHam // oeHam zcca qoHca oHar oe ccc8a oeHar oeHam // oeHam zcca qoHca oHar oe oHcc8a // qoHcc8a zccca Haz cccca oeHam oe ora ccoe ??? oHa zcca qoHc8a qoHc8a qoHc8a 8ar oeHam ccak // 8ccc8a eccca qcHa qoHcca oe oezc8a qoHam oHcc8a oeHam oHzcca zam oe // aHcc8a cccHca oHoe Hcoe zccoe qoHae oeHam cccHca // qoHcc8a qoe ??? Hcoe qoHa qoHae zcc8a zae oeHam ??? qoHe // azcca e ??? // ??? zcar oHam oeHam oe oeHam oram oeor ccccHca ??? zcar oHam oeHam oe oeHam oram oeor ccccHca oeor // ??? eoeok // qoHam ??? oeHam zcc8a qoHam oroe z // Pccc8a roea // Hzcc8a qoHc8a oeHam oHae cccHca qoHa 8am ??? Ham zcca qoHam ccc8a qoHoe oeHam zccHccae // eor ar oe Ham oeHar zcca qoHam ??? oeHam oHam zcae qoHa // oecccoe oHcc8a qoHca ea // qoHam oeHam oeHam oe Hccoe // Poe qoHca ea // qoHam oeHam oeHam oe Hccoe // Poe or oe Hccoe // Poe or oeHam ocHca qoHam oHam oHar zcca qoeHa // qoe oe ??? oeHam oe ccc8a qoHam oezcca ??? qoeHam zcc8a // qoHcc8a qoHcr oeHam oezcc8a qoHca qoHcca ??? ??? zccar zcca oezcca cHcor aecc8a oeHam zccHca qoea // Hoe ??? azccc8a zca ccca ecccHca aHar oeHam oe // e zcc8a qoHc8a Haw oHae zar oe ??? oeHam ae oe ??? 8e // Har // ??? ??? aHam oeHam zcc8a qoHa 8ccc8a roe oe ccca qoHccca qoHa oHam oHam oeHam oHam oHar aea // qoHam qoHam zccae qoe ccc8a qoHaiw oeHam zcc8a // = Hccae oeHaw qo8a zcar ??? qoHccca Hccca oeHam oPccc8a qPoe zcHa orae // ccoe ??? cccHca 8ae cccHca oeHam oeHccar // Pcccoe zcc8a qoHam cat .fix.wds \ | kwic-index -v key=oeHar ??? qca Ham zcccHa eHam oeHar or // 8acHca eHako aHcca oe ??? zccca oe ccc8a oeHar oeHam // oeHam zcca qoHca zcHcoe qoHar zccHa cccHam ??? oeHar oHam zccHa qoHae ??? // ??? // = Pzcoe Ham oeHar zcca qoHam ??? oeHam oHam zccHca qoea // Hoe ??? oeHar a qoe qoe Ham ??? // e zcc8a qoHc8a zor oeHar oe Ham oe Hzc8a // qocca ee8ar cccca qoae qoHcc8a oeHar zccc8a qoHam oeae // qoe 97-09-10 stolfi =============== Let's look at the ladies-in-tubes labels collected by Jim Reeds on page f77v: Locus Currier FSG HOP Comments --------- ----------- ----------- ----------- ----------------------------- 152 N1 1 OFAN/AFOE ODAN ADOE oHam aHoe ; ladies with hands in tubing 152 N2 1 OPOE/ZC89 OHOE SC89 oHoe zcc8a ; under N1 152 N3 1 OEFS8OE OEDT8OE oeHcc8oe ; center top 152 N4 1 OPOEOR OHOEOR oHoeor ; 152 N5 1 ORSC8AE ORTC8AE orccc8ae ; under N4 152 W1 1 2ORORAE 2ORORAE zororae ; above lady's head 152 W2 1 OECOC8N OECOC8N oecoc8m ; on her vascular boat's hull 152 E1 1 OFA ODA oHa ; "oHam aHoe": The word "oHam" is very common: 75v 76v 78v 79v 105 30.4 14.6 oHam .1.121223223.3.12.1..21.133422653535141114..3..43.33.12.111 The word "aHoe" doesnt occur at all, but it may be "oHoe", which does; see below. "oHoe zcc8a": The word "oHoe" occurs 11 times: 4 times followed by "ccc8a", once preceded by "zcc8a", once followed by "8ae zcc8a". The word "zcc8a" is very common. 75v 76v 78v 79v 11 33.9 16.2 oHoe ......1......1..........1.1...111.....1.............1....11 233 31.0 18.3 zcc8a 3362314964524.854555723254313.11335332325362432445686576476 qoHcc8a ??? oeHoe ocHcca zcc8a oHoe ??? e8a // oHczcca oeHc8a ccc8a qoHam ccc8a Ham cccHcc8a oHoe oHa // zam oHam zccHcc8a // ??? qoe zcc8a oeHc8a oHoe ccc8a qoHam 8ae // cccca // ??? qoHzcc8a qoHam ??? oHoe zcccoe ??? ccoHa oHcccaz // 8aHam oHc8a 8Hcca Har oe oHoe ??? // ??? ??? Har qor cccca Ham cce oe oHoe 8am oHam oe oHcc8a qoHcm zcc8a qoHca ??? qoHar cccHca oHoe Hcoe zccoe qoHae oeHam cccHca ??? oe ar zccHca eHoe oHoe ccc8a qoHam cccHca qoHae // // a zcca oHccc8 ar oHoe 8ae zcc8a qoHaea // ??? oe zcc8a qoHc8a // qoHcca oHoe ccc8a ??? // 8zccca qoHcc8a // zor ??? 8oe Hc8a oHoe ccc8a // = "oeHcc8oe": The word "oeHcc8oe" does not occur at all. However, "oeHcc" occurs once, right after the picture, on the facing page. The similar words "oeHcc8", "oeHcc8a", "oeHcc8ae" occur a few times later on (except for one early occurence of "oeHcc8a" on f75r). The word "8oe" appears first on f76r, and is fairly frequent later on. 75r 76r 78r 79r 84r 1 21.5 16.7 oeHcc .....................1..................................... 1 55.5 16.7 oeHcc8 .......................................................1... 15 40.1 13.4 oeHcc8a ..1........................1.11....11......2..121...11..1.. 1 52.5 16.7 oeHcc8ae ....................................................1...... 26 36.5 15.2 8oe ..........1.1..1.1..11.2...111...1..1.1.1.1.111....2....221 "oHoeor": The word "oHoeor" doesn't occur. However, "oHoe" occurs 11 times (see above), and "or" is quite common. Its distribution is lumpy, and there is a cluster beginning on f77v. 75r 76r 77v 66 28.5 16.3 or 13..3.2..14.1.....1.231235312.15....1412.2..1.....1..213112 "orccc8ae": The word "orccc8ae" does not occur. But "or" is fairly common (see above), and "ccc8ae" occurs thrice, roughly at the right place: 76r 77v 3 14.2 6.8 ccc8ae ..........1.1......1....................................... ccae zcc8 qoHcc8a ??? oHae ccc8ae // 8ae or ccc8a qoeam qoccHcc8a 8cc8a // Hccc8a ezccc8a ccc8ae ccc8a ccccHcca // Poezc8ae oHc8aw qoHcca ??? // Hzcc8a qoHam ccc8ae Pccc8a cccHa zam ccc8a qoHam "zororae": The word "zororae" does not occur. The words "zor" occurs 13 times, with no obvious clustering, and "orae" occurs once. The similar word "oroe" occurs 6 times, but not particularly in the right spot. But "or" occurs in the right place (see above) and "ae" do occurs in three clusters, nearby: 76r 77v 78v 80r 13 30.2 17.1 zor ...11......1......1..1...1........2....11........11.......1 66 28.5 16.3 or 13..3.2..14.1.....1.231235312.15....1412.2..1.....1..213112 20 21.5 15.2 ae ......311.22.111.........11..........112.1................1 1 48.5 16.7 orae ................................................1.......... 6 29.2 18.4 oroe ....1.1..........................1.1...1...............1... "oecoc8m": The word "oecoc8m" does not occur. Indeed, the sequence "coc" occurs only twice in the bio section, and "oc8" doesn't occur at all. So, the "o" may be a misreading. The sequences "cac" and "ac8" don't occur, either. If we read it as "oeccc8m", we get some close matches: 76r 77v 1 39.5 16.7 oeccc8 .......................................1................... 1 44.5 16.7 Poeccc8 ............................................1.............. 6 18.3 19.2 qoeccc8a 1....1.11..........................1................1...... 24 26.5 19.3 oeccc8a ...2.31...2.2....11....11.......1.....1....11..11.1...1...2 "oHa": The word "oHa" is fairly common, with a cluster of occurrences at the right spot. Its "inflected" forms "oHam", "oHae", "oHar" are even more common, with suggestive clustering. 76r 77v 31 25.9 14.6 oHa ..11...1.231.2......1111...1.12.12..1..2.1.1..21......1.... 105 30.4 14.6 oHam .1.121223223.3.12.1..21.133422653535141114..3..43.33.12.111 43 32.8 15.9 oHae ..11.11...21......1.1.21123.1..1111122..1111..2.11...12..22 40 20.4 16.7 oHar 214211.113121.......2.11.3...12.1.1..1.2..1....1......11.1. I made this report into an HTML page in my Voynich site. I take these statistics as being mildly encouraging: most of the labels can be found in the text, and in several cases there is a cluster of occurrences on page f77v or soon thereafter. On the positive side, I think this data supports my belief that Voynichese is a natural language (and not a complex cypher or random text), and that the "words" are indeed words (i.e. units of meaning). On the negative side, I am worried by the apparent inconsistency in spelling and word spacing in the manuscript itself. These observations give some more weight to the "ignorant scribe" hypothesis (that the Beinecke VMs is a copy, made by two or more scribes who could not understand the original). The apparent confusion between "a" and "o" in the original manuscript suggests that I should identify those two letters in my next error-tolerant encoding... 97-09-11 stolfi =============== Created a script "jsa2pak" that tries to raise the entropy of Voynichese by condensing the [czH] strings into shorter letters. --- jsa2pak ------------------------ #! /n/gnu/bin/sed -f # An attempt to raise the entropy of Voynichese s/qj/H/g s/lj/K/g s/[ql]g/P/g s/ij/k/g s/ix/e/g s/is/r/g s/iiu/n/g s/y/i/g s/ci/a/g s/cg/8/g s/cs/z/g s/zccccHcc/VXY/g s/zccccKcc/VXX/g s/ccHzccc/CHZC/g s/ccKzccc/CKZC/g s/cccHccc/UHU/g s/cccKccc/UKU/g s/ccccHcc/CCY/g s/ccccKcc/CCX/g s/zccHccc/VHU/g s/zccKccc/VKU/g s/zcccHcc/ZCY/g s/zcccKcc/ZCX/g s/zzcccHc/zVT/g s/zzcccKc/zVS/g s/ccHccc/CI/g s/ccKccc/CJ/g s/cccHcc/UY/g s/cccKcc/UX/g s/ccccHc/CCB/g s/ccccKc/CCA/g s/zccHcc/VY/g s/zccKcc/VX/g s/zcccHc/ZCB/g s/zcccKc/ZCA/g s/zzcHcc/zZY/g s/zzcKcc/zZX/g s/Hcccc/YC/g s/Hczcc/BV/g s/Kcccc/XC/g s/Kczcc/AV/g s/cHccc/WU/g s/cHccz/Mz/g s/cKccc/MU/g s/cKccz/Lz/g s/ccHcc/CY/g s/ccKcc/CX/g s/cccHc/UB/g s/cccKc/UA/g s/ccccH/UW/g s/ccccK/UM/g s/ccccc/CU/g s/ccccz/CCz/g s/zcHcc/ZY/g s/zcKcc/ZX/g s/zccHc/VB/g s/zccKc/VA/g s/zcccH/VW/g s/zcccK/VM/g s/zcccc/ZU/g s/zcccz/ZCz/g s/zzccH/zVH/g s/zzccK/zVK/g s/Hccc/I/g s/Hccz/Yz/g s/Hczc/BZ/g s/Hzcc/HV/g s/Kccc/J/g s/Kccz/Xz/g s/Kczc/AZ/g s/Kzcc/KV/g s/cHcc/M/g s/cKcc/L/g s/ccHc/CB/g s/ccKc/CA/g s/cccH/UH/g s/cccK/UK/g s/cccc/F/g s/cccz/Uz/g s/zHcc/zY/g s/zKcc/zX/g s/zcHc/ZB/g s/zcKc/ZA/g s/zccH/VH/g s/zccK/VK/g s/zccc/G/g s/zccz/Vz/g s/zzcc/zV/g s/Hcc/Y/g s/Hcz/Bz/g s/Hzc/HZ/g s/Kcc/X/g s/Kcz/Az/g s/Kzc/KZ/g s/cHc/T/g s/cKc/S/g s/ccH/CH/g s/ccK/CK/g s/ccc/U/g s/ccz/Cz/g s/czc/cZ/g s/zcH/ZH/g s/zcK/ZK/g s/zcc/V/g s/zcz/Zz/g s/Hc/B/g s/Kc/A/g s/cH/W/g s/cK/M/g s/cc/C/g s/zH/E/g s/zK/O/g s/zc/Z/g s/ir/w/g s/qo/q/g s/in/m/g ------------------------------------ Created a packed text file: cat bio-m-evt.evt \ | fsg2jsa \ | grep ';C>' \ | sed \ -e 's/{[^}]*}//g' \ > .tmp-c-jsa.evt extract-words-from-interlin \ -recode jsa2pak \ -chars "a8eoqKUrVAXnCmHzBPZYGFckiJwMIWLTSjuOE" \ .tmp-c-jsa.evt \ .tmp-c-pak lines words bytes file ------ ------- --------- ------------ 7227 7227 32048 .tmp-c-pak.wds 1575 1575 8526 .tmp-c-pak.dic 6420 6420 29572 .tmp-c-pak-gut.wds 1541 1541 8362 .tmp-c-pak-gut.dic 807 807 2476 .tmp-c-pak-fun.wds 34 34 164 .tmp-c-pak-fun.dic 0 0 0 .tmp-c-pak-bad.wds 0 0 0 .tmp-c-pak-bad.dic Digraph counts: TT a 8 e o q K U r V A X n C m H z B P Z Y G F c k i J w M I W L T S j u O E ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- 6420 . 271 514 365 1363 1646 59 529 134 486 15 18 . 179 . 67 260 29 98 135 24 85 65 20 . 3 9 . 8 16 1 3 10 4 . . 3 1 a 5742 3516 7 25 574 12 10 33 11 412 17 12 25 479 4 401 22 12 16 4 4 19 6 5 8 44 27 3 28 1 . . 1 1 1 . 2 . . 8 2728 73 2466 2 10 72 1 2 23 2 26 4 2 . 19 . . 1 1 . 7 . 5 3 7 . 2 . . . . . . . . . . . . e 2349 1085 161 84 7 159 2 124 224 13 116 54 82 . 69 . 19 16 11 17 38 6 27 20 2 2 . 10 . . 1 . . . . . . . . o 2273 24 12 22 1151 1 7 192 4 285 5 73 71 6 5 14 154 7 77 52 1 52 1 1 5 7 8 11 2 3 11 . 3 2 4 . . . . q 1668 15 11 9 191 3 1 554 6 15 2 233 268 1 14 . 138 2 63 31 . 58 1 . 3 . . 18 . 6 7 3 9 3 3 . . . . K 1024 14 884 2 11 79 . . 2 . 23 . . . 1 1 . . . . 5 . . . . . 1 . 1 . . . . . . . . . . U 926 1 205 475 . 66 . 22 . 2 . 48 18 . . . 8 16 32 14 . 3 . . . . . . . 12 . 4 . . . . . . . r 874 588 128 5 1 82 . 1 27 . 16 . . . 7 . 1 . . 2 1 . 5 4 5 . 1 . . . . . . . . . . . . V 739 1 163 390 . 61 1 22 . . . 30 14 1 . . 10 3 15 5 . 5 . . . . . . . 8 . 9 . . 1 . . . . A 536 1 197 305 . 18 . . . 1 3 . . 5 . . . 2 . . 4 . . . . . . . . . . . . . . . . . . X 519 1 186 311 2 5 . . . 2 . . . 1 7 . . 3 . . . 1 . . . . . . . . . . . . . . . . . n 498 482 4 3 . 7 . . 1 . . . . . . . . 1 . . . . . . . . . . . . . . . . . . . . . C 473 1 88 91 12 43 . 11 3 3 . 65 17 . 65 . 5 18 36 6 3 6 . . . . . . . . . . . . . . . . . m 439 429 6 1 . 2 . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . H 430 4 331 3 3 57 . . 1 . 19 . 1 . . . . 1 . . 6 . . . . . 4 . . . . . . . . . . . . z 351 73 142 3 1 114 . . . . 10 . 1 . 2 . . . . 1 2 . 1 . . . 1 . . . . . . . . . . . . B 281 1 103 160 1 11 . . . . . . . 1 . . . 2 . . 2 . . . . . . . . . . . . . . . . . . P 251 5 22 3 . 51 . . 89 . 16 . . . 37 . . . . . 9 . 2 2 15 . . . . . . . . . . . . . . Z 218 4 51 40 5 37 . 4 2 1 . 2 1 1 61 . 6 2 . . . 1 . . . . . . . . . . . . . . . . . Y 175 . 59 100 1 8 . . . 1 . . . . 2 . . 3 . . . . . . . . . . . . . . . . . 1 . . . G 133 1 52 69 . 5 . . . 2 . . . . . . . . . 4 . . . . . . . . . . . . . . . . . . . F 100 1 57 35 . 3 . . . 1 . . . . . . . . . 2 . . . . . . . . . . . . . . . 1 . . . c 65 2 13 14 1 2 . . 1 . . . 1 3 1 . . . 1 15 1 . . . . . . . . . . . . . . 10 . . . k 57 55 . . 1 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . i 56 . . . 9 . . . . . . . . . . 23 . . . . . . . . . 4 8 . 8 . . . . . . . 4 . . J 51 . 26 25 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . w 39 31 7 . . . . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . M 38 . 31 3 . 1 . . . . . . . . . . . 2 . . . . . . . . 1 . . . . . . . . . . . . I 35 . 10 22 . 3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . W 17 . 16 . . . . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . L 16 . 11 5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . T 16 . 7 5 . 4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . S 13 . 10 2 . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . j 12 11 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . u 6 1 1 . 3 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . O 3 . 3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . E 1 . . . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 29572 6420 5742 2728 2349 2273 1668 1024 926 874 739 536 519 498 473 439 430 351 281 251 218 175 133 100 65 57 56 51 39 38 35 17 16 16 13 12 6 3 1 Next-symbol probability (× 99): a 8 e o q K U r V A X n C m H z B P Z Y G F c k i J w M I W L T S j u O E TT -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- . 4 8 6 21 25 1 8 2 7 . . . 3 . 1 4 . 2 2 . 1 1 . . . . . . . . . . . . . . . 99 a 61 . . 10 . . 1 . 7 . . . 8 . 7 . . . . . . . . . 1 . . . . . . . . . . . . . 99 8 3 89 . . 3 . . 1 . 1 . . . 1 . . . . . . . . . . . . . . . . . . . . . . . . 99 e 46 7 4 . 7 . 5 9 1 5 2 3 . 3 . 1 1 . 1 2 . 1 1 . . . . . . . . . . . . . . . 99 o 1 1 1 50 . . 8 . 12 . 3 3 . . 1 7 . 3 2 . 2 . . . . . . . . . . . . . . . . . 99 q 1 1 1 11 . . 33 . 1 . 14 16 . 1 . 8 . 4 2 . 3 . . . . . 1 . . . . 1 . . . . . . 99 K 1 85 . 1 8 . . . . 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 U . 22 51 . 7 . 2 . . . 5 2 . . . 1 2 3 1 . . . . . . . . . 1 . . . . . . . . . 99 r 67 14 1 . 9 . . 3 . 2 . . . 1 . . . . . . . 1 . 1 . . . . . . . . . . . . . . 99 V . 22 52 . 8 . 3 . . . 4 2 . . . 1 . 2 1 . 1 . . . . . . . 1 . 1 . . . . . . . 99 A . 36 56 . 3 . . . . 1 . . 1 . . . . . . 1 . . . . . . . . . . . . . . . . . . 99 X . 35 59 . 1 . . . . . . . . 1 . . 1 . . . . . . . . . . . . . . . . . . . . . 99 n 96 1 1 . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 C . 18 19 3 9 . 2 1 1 . 14 4 . 14 . 1 4 8 1 1 1 . . . . . . . . . . . . . . . . . 99 m 97 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 H 1 76 1 1 13 . . . . 4 . . . . . . . . . 1 . . . . . 1 . . . . . . . . . . . . 99 z 21 40 1 . 32 . . . . 3 . . . 1 . . . . . 1 . . . . . . . . . . . . . . . . . . 99 B . 36 56 . 4 . . . . . . . . . . . 1 . . 1 . . . . . . . . . . . . . . . . . . 99 P 2 9 1 . 20 . . 35 . 6 . . . 15 . . . . . 4 . 1 1 6 . . . . . . . . . . . . . . 99 Z 2 23 18 2 17 . 2 1 . . 1 . . 28 . 3 1 . . . . . . . . . . . . . . . . . . . . . 99 Y . 33 57 1 5 . . . 1 . . . . 1 . . 2 . . . . . . . . . . . . . . . . . 1 . . . 99 G 1 39 51 . 4 . . . 1 . . . . . . . . . 3 . . . . . . . . . . . . . . . . . . . 99 F 1 56 35 . 3 . . . 1 . . . . . . . . . 2 . . . . . . . . . . . . . . . 1 . . . 99 c 3 20 21 2 3 . . 2 . . . 2 5 2 . . . 2 23 2 . . . . . . . . . . . . . . 15 . . . 99 k 96 . . 2 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 i . . . 16 . . . . . . . . . . 41 . . . . . . . . . 7 14 . 14 . . . . . . . 7 . . 99 J . 50 49 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 w 79 18 . . . . . 3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 M . 81 8 . 3 . . . . . . . . . . . 5 . . . . . . . . 3 . . . . . . . . . . . . 99 I . 28 62 . 8 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 W . 93 . . . . . 6 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 L . 68 31 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 T . 43 31 . 25 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 S . 76 15 . 8 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 j 91 8 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 u 17 17 . 50 17 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 O . 99 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 E . . . . 99 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- TOT 21 19 9 8 8 6 3 3 3 2 2 2 2 2 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 0 99 Previous-symbol probability (× 99): a 8 e o q K U r V A X n C m H z B P Z Y G F c k i J w M I W L T S j u O E TT -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- . 5 19 15 59 98 6 57 15 65 3 3 . 37 . 15 73 10 39 61 14 63 64 30 . 5 17 . 21 45 6 19 62 30 . . 99 99 21 a 54 . 1 24 1 1 3 1 47 2 2 5 95 1 90 5 3 6 2 2 11 4 5 12 76 48 6 71 3 . . 6 6 8 . 33 . . 19 8 1 43 . . 3 . . 2 . 3 1 . . 4 . . . . . 3 . 4 3 11 . 4 . . . . . . . . . . . . 9 e 17 3 3 . 7 . 12 24 1 16 10 16 . 14 . 4 5 4 7 17 3 20 20 3 3 . 19 . . 3 . . . . . . . . 8 o . . 1 49 . . 19 . 32 1 13 14 1 1 3 35 2 27 21 . 29 1 1 8 12 14 21 5 8 31 . 19 12 30 . . . . 8 q . . . 8 . . 54 1 2 . 43 51 . 3 . 32 1 22 12 . 33 1 . 5 . . 35 . 16 20 17 56 19 23 . . . . 6 K . 15 . . 3 . . . . 3 . . . . . . . . . 2 . . . . . 2 . 3 . . . . . . . . . . 3 U . 4 17 . 3 . 2 . . . 9 3 . . . 2 5 11 6 . 2 . . . . . . . 31 . 23 . . . . . . . 3 r 9 2 . . 4 . . 3 . 2 . . . 1 . . . . 1 . . 4 4 8 . 2 . . . . . . . . . . . . 3 V . 3 14 . 3 . 2 . . . 6 3 . . . 2 1 5 2 . 3 . . . . . . . 21 . 52 . . 8 . . . . 2 A . 3 11 . 1 . . . . . . . 1 . . . 1 . . 2 . . . . . . . . . . . . . . . . . . 2 X . 3 11 . . . . . . . . . . 1 . . 1 . . . 1 . . . . . . . . . . . . . . . . . 2 n 7 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 C . 2 3 1 2 . 1 . . . 12 3 . 14 . 1 5 13 2 1 3 . . . . . . . . . . . . . . . . . 2 m 7 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 H . 6 . . 2 . . . . 3 . . . . . . . . . 3 . . . . . 7 . . . . . . . . . . . . 1 z 1 2 . . 5 . . . . 1 . . . . . . . . . 1 . 1 . . . 2 . . . . . . . . . . . . 1 B . 2 6 . . . . . . . . . . . . . 1 . . 1 . . . . . . . . . . . . . . . . . . 1 P . . . . 2 . . 10 . 2 . . . 8 . . . . . 4 . 1 2 23 . . . . . . . . . . . . . . 1 Z . 1 1 . 2 . . . . . . . . 13 . 1 1 . . . 1 . . . . . . . . . . . . . . . . . 1 Y . 1 4 . . . . . . . . . . . . . 1 . . . . . . . . . . . . . . . . . 8 . . . 1 G . 1 3 . . . . . . . . . . . . . . . 2 . . . . . . . . . . . . . . . . . . . 0 F . 1 1 . . . . . . . . . . . . . . . 1 . . . . . . . . . . . . . . . 8 . . . 0 c . . 1 . . . . . . . . . 1 . . . . . 6 . . . . . . . . . . . . . . . 83 . . . 0 k 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 0 i . . . . . . . . . . . . . . 5 . . . . . . . . . 7 14 . 20 . . . . . . . 66 . . 0 J . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 0 w . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 0 M . 1 . . . . . . . . . . . . . . 1 . . . . . . . . 2 . . . . . . . . . . . . 0 I . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 0 W . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 0 L . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 0 T . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 0 S . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 0 j . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 0 u . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 0 O . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 0 E . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 0 -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- TOT 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 Taking only the "pak" symbols with frequency >0.5%, we have the following "alphabet": JSA ci cg ix o lj qj lg qg qo is iiu iiiu cs ; ccc zcc cHcc cKcc Hcc Kcc Hc Kc cc zc PEK a 8 e o H K P Q q r n m z ; U V M N X Y A B C Z We should also merge "8a", "KA", "HA", "oe" into single letters: PEK 8a Ha Ka oe JSA d f p y Defined a new script "jsa2pek" --- jsa2pek ------------------------ #! /n/gnu/bin/sed -f # A second attempt to raise the entropy of Voynichese #-- first stage: HOP-like encoding --- s/lj/H/g s/qj/K/g s/lg/P/g s/qg/Q/g s/ij/k/g s/ix/e/g s/is/r/g s/iiu/n/g s/y/i/g s/ci/a/g s/cg/8/g s/cs/z/g s/ir/w/g s/qo/q/g s/in/m/g #-- second stage: zcHK strings --- s/zccccHcc/ZUX/g s/zccccKcc/ZUY/g s/ccHzccc/CHZC/g s/ccKzccc/CKZC/g s/cccHccc/UAC/g s/cccKccc/UBC/g s/ccccHcc/CCX/g s/ccccKcc/CCY/g s/zccHccc/VAC/g s/zccKccc/VBC/g s/zcccHcc/ZCX/g s/zcccKcc/ZCY/g s/zzcccHc/zZCA/g s/zzcccKc/zZCB/g s/ccHccc/CAC/g s/ccKccc/CBC/g s/cccHcc/UX/g s/cccKcc/UY/g s/ccccHc/CCA/g s/ccccKc/CCB/g s/zccHcc/VX/g s/zccKcc/VY/g s/zcccHc/ZCA/g s/zcccKc/ZCB/g s/zzcHcc/zZX/g s/zzcKcc/zZY/g s/Hcccc/AU/g s/Hczcc/AV/g s/Kcccc/BU/g s/Kczcc/BV/g s/cHccc/cAC/g s/cHccz/cXz/g s/cKccc/cBC/g s/cKccz/cYz/g s/ccHcc/CX/g s/ccKcc/CY/g s/cccHc/UA/g s/cccKc/UB/g s/ccccH/CCH/g s/ccccK/CCK/g s/ccccc/CU/g s/ccccz/CCz/g s/zcHcc/ZX/g s/zcKcc/ZY/g s/zccHc/VA/g s/zccKc/VB/g s/zcccH/ZCH/g s/zcccK/ZCK/g s/zcccc/ZU/g s/zcccz/ZCz/g s/zzccH/zVH/g s/zzccK/zVK/g s/Hccc/HU/g s/Hccz/Xz/g s/Hczc/AZ/g s/Hzcc/HV/g s/Kccc/KU/g s/Kccz/Yz/g s/Kczc/BZ/g s/Kzcc/KV/g s/cHcc/M/g s/cKcc/N/g s/ccHc/CA/g s/ccKc/CB/g s/cccH/UH/g s/cccK/UK/g s/cccc/CC/g s/cccz/Uz/g s/zHcc/zX/g s/zKcc/zY/g s/zcHc/ZA/g s/zcKc/ZB/g s/zccH/VH/g s/zccK/VK/g s/zccc/ZC/g s/zccz/Vz/g s/zzcc/zV/g s/Hcc/X/g s/Hcz/Az/g s/Hzc/HZ/g s/Kcc/Y/g s/Kcz/Bz/g s/Kzc/KZ/g s/cHc/cA/g s/cKc/cB/g s/ccH/CH/g s/ccK/CK/g s/ccc/U/g s/ccz/Cz/g s/czc/cZ/g s/zcH/ZH/g s/zcK/ZK/g s/zcc/zC/g s/zcz/Zz/g s/Hc/A/g s/Kc/B/g s/cH/cH/g s/cK/cK/g s/cc/C/g s/zc/Z/g #-- third stage: common digraphs --- s/8a/d/g s/Ha/f/g s/Ka/p/g s/oe/y/g ------------------------------------ Reran it all: extract-words-from-interlin \ -recode jsa2pek \ -chars "aeoydHKfpPQqrnmcz8UVMNXYABCZijkuw" \ .tmp-c-jsa.evt \ .tmp-c-pek lines words bytes file ------ ------- --------- ------------ 7227 7227 28127 .tmp-c-pek.wds 1586 1586 7627 .tmp-c-pek.dic 6420 6420 25662 .tmp-c-pek-gut.wds 1552 1552 7474 .tmp-c-pek-gut.dic 807 807 2465 .tmp-c-pek-fun.wds 34 34 153 .tmp-c-pek-fun.dic 0 0 0 .tmp-c-pek-bad.wds 0 0 0 .tmp-c-pek-bad.dic Digraph counts: TT a e o y d H K f p P Q q r n m c z 8 U V M N X Y A B C Z i j k u w ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- 6420 . 271 365 799 564 368 35 52 33 31 6 92 1646 134 . . 37 642 146 515 91 3 6 17 22 16 31 258 237 3 . . . . a 2019 1281 4 178 3 2 9 8 1 18 13 2 . 6 133 83 114 3 22 5 11 . 1 1 21 16 10 11 8 10 13 . 20 2 10 e 1198 537 90 2 53 46 35 12 . 32 4 2 5 1 9 . . 1 71 6 143 6 . . 26 3 16 5 51 40 . . 2 . . o 1122 24 12 . 1 . 18 39 42 164 123 8 44 7 285 6 14 11 12 4 4 . 3 3 70 52 74 77 6 2 8 . 7 . 2 y 1151 548 71 5 42 18 41 12 7 78 9 4 6 1 4 . . 1 54 2 81 . . . 56 3 38 6 38 26 . . . . . d 2466 1995 1 133 1 1 1 2 1 8 5 . 1 3 122 54 97 2 1 2 . . . . 3 3 3 4 . . 4 . 11 . 8 H 190 14 . 11 26 53 1 . . . . . . . . . 1 . . 1 51 23 . . . . . . 1 5 2 . . . 1 K 135 4 . 3 15 43 2 . . . . . . . . . . . 1 1 36 19 . . 1 . . . . 6 4 . . . . f 910 161 2 176 3 1 5 . . . . . 1 . 98 275 153 4 3 . . . . . . . . 1 1 . 8 . 11 . 7 p 347 79 . 87 1 . 2 . 1 . 1 . . 1 59 67 37 1 3 1 . . . . . . . . . . 2 . 2 . 3 P 36 1 2 . . 3 . . . . . . . . . . . 3 . . 19 . . . . . . . 6 2 . . . . . Q 215 4 20 . 11 37 2 . . . . . . . . . . 12 16 1 69 . . . . . . . 34 9 . . . . . q 1668 15 11 191 1 2 5 74 25 498 120 6 25 1 15 1 . 16 4 4 6 . 9 2 265 58 236 63 14 1 . . . . . r 874 588 128 1 31 51 4 1 . . 1 . 2 . . . . 5 16 1 27 . . . . . . . 11 6 1 . . . . n 498 482 4 . 1 6 3 . . . . . . . . . . . 1 . 1 . . . . . . . . . . . . . . m 439 429 6 . 1 1 . . . . . . . . . . . . . 1 1 . . . . . . . . . . . . . . c 103 2 13 1 1 1 12 1 . 3 3 3 12 . . 3 . . . 2 . . . . . 2 13 18 2 1 . 10 . . . z 924 73 142 1 34 80 3 . 1 3 . . 1 . . . . . . . . 9 . . 1 . . . 571 4 1 . . . . 8 262 73 . 10 18 54 2 1 . 1 . . . 1 2 . . 7 27 . 23 . . . 2 . 4 1 22 12 2 . . . . U 1003 1 246 . 14 55 498 1 . 20 8 2 12 . 2 . . . 16 25 . . . . 18 4 49 32 . . . . . . . V 151 . 20 . 1 3 28 4 1 17 9 . . . . . . . 3 1 . . . . 13 5 31 15 . . . . . . . M 16 . 11 . . . 5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . N 12 . 8 . . 1 3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . X 511 1 186 2 1 4 298 . . . . . . . 2 1 . . 3 13 . . . . . . . . . . . . . . . Y 175 . 59 1 5 3 94 . . . . . . . 1 . . . 5 6 . . . . . . . . . . . 1 . . . A 558 1 207 . 7 12 299 . . . . . . . 1 5 . . 2 8 7 3 . . . . . . 2 4 . . . . . B 300 1 110 1 5 10 160 . . . . . . . . 1 . . 2 5 2 . . . . . . . 1 2 . . . . . C 1419 4 335 12 33 75 530 . 1 31 17 3 14 1 6 1 . . 18 25 3 . . . 17 6 66 36 181 3 . 1 . . . Z 370 4 51 5 12 25 38 . 3 4 3 . . . 1 1 . . 2 2 3 . . . 1 1 2 . 212 . . . . . . i 56 . . 9 . . . . . . . . . . . . 23 . . . . . . . . . . . . . 8 . 4 4 8 j 12 11 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . k 57 55 . 1 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . u 6 1 1 3 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . w 39 31 7 . . . . . . . . . . . . . . . . . 1 . . . . . . . . . . . . . . ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 25662 6420 2019 1198 1122 1151 2466 190 135 910 347 36 215 1668 874 498 439 103 924 262 1003 151 16 12 511 175 558 300 1419 370 56 12 57 6 39 Next-symbol probability (× 99): a e o y d H K f p P Q q r n m c z 8 U V M N X Y A B C Z i j k u w TT -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- . 4 6 12 9 6 1 1 1 . . 1 25 2 . . 1 10 2 8 1 . . . . . . 4 4 . . . . . 99 a 63 . 9 . . . . . 1 1 . . . 7 4 6 . 1 . 1 . . . 1 1 . 1 . . 1 . 1 . . 99 e 44 7 . 4 4 3 1 . 3 . . . . 1 . . . 6 . 12 . . . 2 . 1 . 4 3 . . . . . 99 o 2 1 . . . 2 3 4 14 11 1 4 1 25 1 1 1 1 . . . . . 6 5 7 7 1 . 1 . 1 . . 99 y 47 6 . 4 2 4 1 1 7 1 . 1 . . . . . 5 . 7 . . . 5 . 3 1 3 2 . . . . . 99 d 80 . 5 . . . . . . . . . . 5 2 4 . . . . . . . . . . . . . . . . . . 99 H 7 . 6 14 28 1 . . . . . . . . . 1 . . 1 27 12 . . . . . . 1 3 1 . . . 1 99 K 3 . 2 11 32 1 . . . . . . . . . . . 1 1 26 14 . . 1 . . . . 4 3 . . . . 99 f 18 . 19 . . 1 . . . . . . . 11 30 17 . . . . . . . . . . . . . 1 . 1 . 1 99 p 23 . 25 . . 1 . . . . . . . 17 19 11 . 1 . . . . . . . . . . . 1 . 1 . 1 99 P 3 6 . . 8 . . . . . . . . . . . 8 . . 52 . . . . . . . 17 6 . . . . . 99 Q 2 9 . 5 17 1 . . . . . . . . . . 6 7 . 32 . . . . . . . 16 4 . . . . . 99 q 1 1 11 . . . 4 1 30 7 . 1 . 1 . . 1 . . . . 1 . 16 3 14 4 1 . . . . . . 99 r 67 14 . 4 6 . . . . . . . . . . . 1 2 . 3 . . . . . . . 1 1 . . . . . 99 n 96 1 . . 1 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 m 97 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 c 2 12 1 1 1 12 1 . 3 3 3 12 . . 3 . . . 2 . . . . . 2 12 17 2 1 . 10 . . . 99 z 8 15 . 4 9 . . . . . . . . . . . . . . . 1 . . . . . . 61 . . . . . . 99 8 28 . 4 7 20 1 . . . . . . . 1 . . 3 10 . 9 . . . 1 . 2 . 8 5 1 . . . . 99 U . 24 . 1 5 49 . . 2 1 . 1 . . . . . 2 2 . . . . 2 . 5 3 . . . . . . . 99 V . 13 . 1 2 18 3 1 11 6 . . . . . . . 2 1 . . . . 9 3 20 10 . . . . . . . 99 M . 68 . . . 31 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 N . 66 . . 8 25 . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 X . 36 . . 1 58 . . . . . . . . . . . 1 3 . . . . . . . . . . . . . . . 99 Y . 33 1 3 2 53 . . . . . . . 1 . . . 3 3 . . . . . . . . . . . 1 . . . 99 A . 37 . 1 2 53 . . . . . . . . 1 . . . 1 1 1 . . . . . . . 1 . . . . . 99 B . 36 . 2 3 53 . . . . . . . . . . . 1 2 1 . . . . . . . . 1 . . . . . 99 C . 23 1 2 5 37 . . 2 1 . 1 . . . . . 1 2 . . . . 1 . 5 3 13 . . . . . . 99 Z 1 14 1 3 7 10 . 1 1 1 . . . . . . . 1 1 1 . . . . . 1 . 57 . . . . . . 99 i . . 16 . . . . . . . . . . . . 41 . . . . . . . . . . . . . 14 . 7 7 14 99 j 91 8 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 k 96 . 2 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 u 17 17 50 17 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 w 79 18 . . . . . . . . . . . . . . . . . 3 . . . . . . . . . . . . . . 99 -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- TOT 25 8 5 4 4 10 1 1 4 1 0 1 6 3 2 2 0 4 1 4 1 0 0 2 1 2 1 5 1 0 0 0 0 0 99 It seems that "M = cHcc" and "N = cKcc" are not useful letters; better parse them as "cX" and "cY" and let them drop out because of the single "c". Should map "8a" to "u" early on, then handle "u" like "a". Defined new encoding "PIK": --- jsa2pik ------------------------ #! /n/gnu/bin/sed -f # A second attempt to raise the entropy of Voynichese #-- first stage: HOP-like encoding --- s/\*/?/g s/lj/H/g s/qj/K/g s/lg/P/g s/qg/Q/g s/ij/k/g s/ix/e/g s/is/r/g s/iiu/n/g s/y/i/g s/ci/a/g s/cg/8/g s/cs/z/g s/ir/w/g s/qo/q/g s/in/m/g s/u/?/g s/g/?/g #-- second stage: --- s/8a/u/g s/oe/y/g #-- third stage: zcHK strings --- s/zccccHcc/ZUX/g s/zccccKcc/ZUY/g s/ccHzccc/CHZC/g s/ccKzccc/CKZC/g s/cccHccc/UAC/g s/cccKccc/UBC/g s/ccccHcc/CCX/g s/ccccKcc/CCY/g s/zccHccc/VAC/g s/zccKccc/VBC/g s/zcccHcc/ZCX/g s/zcccKcc/ZCY/g s/zzcccHc/zZCA/g s/zzcccKc/zZCB/g s/ccHccc/CAC/g s/ccKccc/CBC/g s/cccHcc/UX/g s/cccKcc/UY/g s/ccccHc/CCA/g s/ccccKc/CCB/g s/zccHcc/VX/g s/zccKcc/VY/g s/zcccHc/ZCA/g s/zcccKc/ZCB/g s/zzcHcc/zZX/g s/zzcKcc/zZY/g s/Hcccc/AU/g s/Hczcc/AV/g s/Kcccc/BU/g s/Kczcc/BV/g s/cHccc/cAC/g s/cHccz/cXz/g s/cKccc/cBC/g s/cKccz/cYz/g s/ccHcc/CX/g s/ccKcc/CY/g s/cccHc/UA/g s/cccKc/UB/g s/ccccH/CCH/g s/ccccK/CCK/g s/ccccc/CU/g s/ccccz/CCz/g s/zcHcc/ZX/g s/zcKcc/ZY/g s/zccHc/VA/g s/zccKc/VB/g s/zcccH/ZCH/g s/zcccK/ZCK/g s/zcccc/ZU/g s/zcccz/ZCz/g s/zzccH/zVH/g s/zzccK/zVK/g s/Hccc/HU/g s/Hccz/Xz/g s/Hczc/AZ/g s/Hzcc/HV/g s/Kccc/KU/g s/Kccz/Yz/g s/Kczc/BZ/g s/Kzcc/KV/g s/cHcc/cX/g s/cKcc/cY/g s/ccHc/CA/g s/ccKc/CB/g s/cccH/UH/g s/cccK/UK/g s/cccc/CC/g s/cccz/Uz/g s/zHcc/zX/g s/zKcc/zY/g s/zcHc/ZA/g s/zcKc/ZB/g s/zccH/VH/g s/zccK/VK/g s/zccc/ZC/g s/zccz/Vz/g s/zzcc/zV/g s/Hcc/X/g s/Hcz/Az/g s/Hzc/HZ/g s/Kcc/Y/g s/Kcz/Bz/g s/Kzc/KZ/g s/cHc/cA/g s/cKc/cB/g s/ccH/CH/g s/ccK/CK/g s/ccc/U/g s/ccz/Cz/g s/czc/cZ/g s/zcH/ZH/g s/zcK/ZK/g s/zcc/V/g s/zcz/Zz/g s/Hc/A/g s/Kc/B/g s/cH/cH/g s/cK/cK/g s/cc/C/g s/zc/Z/g #--- fourth stage: HK-vowel combos --- s/Ha/f/g s/Ka/b/g s/Aa/J/g s/Au/L/g s/Ba/F/g s/Bu/G/g s/Xa/R/g s/Xu/S/g s/Ya/M/g s/Yu/N/g ------------------------------------ extract-words-from-interlin \ -recode jsa2pik \ -chars "aueoyqUVCZHKfbPQrnmcz8JLFGRSMNXYABijkw" \ .tmp-c-jsa.evt \ .tmp-c-pik lines words bytes file ------ ------- --------- ------------ 7227 7227 26714 .tmp-c-pik.wds 1586 1586 7391 .tmp-c-pik.dic 6414 6414 24216 .tmp-c-pik-gut.wds 1546 1546 7205 .tmp-c-pik-gut.dic 773 773 2311 .tmp-c-pik-fun.wds 2 2 5 .tmp-c-pik-fun.dic 40 40 187 .tmp-c-pik-bad.wds 38 38 181 .tmp-c-pik-bad.dic Digraph counts: TT a u e o y q U V C Z H K f b P Q r n m c z 8 J L F G R S M N X Y A B i j k w ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- 6414 . 271 368 365 797 564 1646 515 469 258 236 35 52 33 31 6 92 134 . . 46 264 145 2 12 2 22 . 12 10 8 5 4 2 7 1 . . . a 1435 731 3 9 163 3 2 5 11 17 8 10 7 1 18 13 2 . 123 82 114 5 4 4 . 8 2 9 8 12 6 5 1 4 2 . 13 . 20 10 u 1607 1177 1 . 122 1 . 3 . . . . 2 1 7 3 . . 108 54 95 1 . 2 1 2 1 2 . 2 1 . . 1 . . 4 . 9 7 e 1195 535 90 35 2 53 46 1 143 66 51 40 12 . 32 4 2 5 9 . . 1 11 6 2 11 . 5 9 14 . 3 2 . 3 . . . 2 . o 1118 24 12 18 . 1 . 7 4 5 6 2 39 41 164 123 7 44 285 6 14 17 7 4 12 53 14 56 19 43 22 26 8 4 9 7 7 . 6 2 y 1151 548 71 41 5 42 18 1 81 49 38 26 12 7 78 9 4 6 4 . . 1 5 2 7 27 . 4 28 27 . 3 1 . 4 2 . . . . q 1668 15 11 5 191 1 2 1 6 2 14 1 74 25 498 120 6 25 15 1 . 27 2 4 46 168 8 49 93 165 10 43 7 5 22 6 . . . . U 1003 1 246 498 . 14 55 . . . . . 1 . 20 8 2 12 2 . . . 16 25 40 5 27 3 9 7 2 2 2 . 4 2 . . . . V 720 1 163 372 . 18 43 1 . . . . 4 1 17 9 1 4 . 1 . . 3 18 28 2 14 1 6 6 4 1 1 . 1 . . . . . C 849 3 191 186 12 16 35 . 3 . 181 3 . 1 31 17 2 10 6 . . . 18 8 59 6 34 2 10 7 3 2 . 1 1 . . 1 . . Z 369 4 50 38 5 12 25 . 3 . 212 . . 3 4 3 . . 1 1 . . 2 2 1 . . . . 1 1 . . . 1 . . . . . H 190 14 . 1 11 26 53 . 51 23 1 5 . . . . . . . . 1 . . 1 . . . . . . . . . . . . 2 . . 1 K 134 4 . 2 3 14 43 . 36 19 . 6 . . . . . . . . . . 1 1 . . . . . 1 . . . . . . 4 . . . f 910 161 2 5 176 3 1 . . . 1 . . . . . . 1 98 275 153 4 3 . . . . 1 . . . . . . . . 8 . 11 7 b 347 79 . 2 87 1 . 1 . . . . . 1 . 1 . . 59 67 37 1 3 1 . . . . . . . . . . . . 2 . 2 3 P 35 1 2 . . . 3 . 19 . 5 2 . . . . . . . . . 3 . . . . . . . . . . . . . . . . . . Q 215 4 20 2 . 11 37 . 69 16 34 9 . . . . . . . . . 12 . 1 . . . . . . . . . . . . . . . . r 874 588 128 4 1 31 51 . 27 16 11 6 1 . . 1 . 2 . . . 5 . 1 . . . . . . . . . . . . 1 . . . n 498 482 4 3 . 1 6 . 1 . . . . . . . . . . . . . 1 . . . . . . . . . . . . . . . . . m 439 429 6 . . 1 1 . 1 . . . . . . . . . . . . . . 1 . . . . . . . . . . . . . . . . c 131 2 13 12 1 1 1 . . . 2 1 1 . 3 3 3 12 . 3 . . . 2 9 2 7 5 11 5 8 3 . 3 2 6 . 10 . . z 354 72 142 3 1 34 80 . . 9 2 4 . 1 3 . . 1 . . . . . . . . . . 1 . . . . . . . 1 . . . 8 261 73 . 2 10 18 54 1 23 26 22 12 1 . 1 . . . 2 . . 7 1 . . 3 1 . 2 . . . . . 1 . 1 . . . J 207 193 1 . 7 . . . . . . . . . . . . . 5 1 . . . . . . . . . . . . . . . . . . . . L 299 284 . 1 4 . . . . . . . . . 1 . . 1 6 . . 1 . . . . . . . . . . . . . . . . 1 . F 110 104 . . 2 . . 1 . . . . 1 . . . . . 1 . . . . . . . . . . . . . . 1 . . . . . . G 160 147 . . 2 . 1 . . . . . . . . 1 . . 5 . 2 . 1 . . . . . . . . . . . . . . . . 1 R 197 193 . . 2 . . . . . . . . . . . . . 2 . . . . . . . . . . . . . . . . . . . . . S 302 294 . . 5 . . . . . . . . . . 1 . . 1 . . . . . . . . . . . . 1 . . . . . . . . M 67 60 . . 4 . . . . . . . . . . . . . 2 . . . . 1 . . . . . . . . . . . . . . . . N 97 92 . . . . . . . . . . . . . . . . 2 . . . . . . . . 1 1 . . . . . . . . . 1 . X 27 1 . . 2 1 4 . . . . . . . . . . . 2 1 . . 3 13 . . . . . . . . . . . . . . . . Y 23 . . . 1 5 4 . . . . . . . . . . . 1 . . . 5 6 . . . . . . . . . . . . . 1 . . A 52 1 . . . 7 12 . 7 3 2 4 . . . . . . 1 5 . . 2 8 . . . . . . . . . . . . . . . . B 30 1 . . 1 5 10 . 2 . 1 2 . . . . . . . 1 . . 2 5 . . . . . . . . . . . . . . . . i 52 . . . 9 . . . . . . . . . . . . . . . 23 . . . . . . . . . . . . . . . 8 . 4 8 j 12 11 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . k 56 54 . . 1 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . w 39 31 7 . . . . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 23647 6414 1435 1607 1195 1118 1151 1668 1003 720 849 369 190 134 910 347 35 215 874 498 439 131 354 261 207 299 110 160 197 302 67 97 27 23 52 30 52 12 56 39 Next-symbol probability (× 99): TT a u e o y q U V C Z H K f b P Q r n m c z 8 J L F G R S M N X Y A B i j k w -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- 99 . 4 6 6 12 9 25 8 7 4 4 1 1 1 . . 1 2 . . 1 4 2 . . . . . . . . . . . . . . . . a 99 50 . 1 11 . . . 1 1 1 1 . . 1 1 . . 8 6 8 . . . . 1 . 1 1 1 . . . . . . 1 . 1 1 u 99 73 . . 8 . . . . . . . . . . . . . 7 3 6 . . . . . . . . . . . . . . . . . 1 . e 99 44 7 3 . 4 4 . 12 5 4 3 1 . 3 . . . 1 . . . 1 . . 1 . . 1 1 . . . . . . . . . . o 99 2 1 2 . . . 1 . . 1 . 3 4 15 11 1 4 25 1 1 2 1 . 1 5 1 5 2 4 2 2 1 . 1 1 1 . 1 . y 99 47 6 4 . 4 2 . 7 4 3 2 1 1 7 1 . 1 . . . . . . 1 2 . . 2 2 . . . . . . . . . . q 99 1 1 . 11 . . . . . 1 . 4 1 30 7 . 1 1 . . 2 . . 3 10 . 3 6 10 1 3 . . 1 . . . . . U 99 . 24 49 . 1 5 . . . . . . . 2 1 . 1 . . . . 2 2 4 . 3 . 1 1 . . . . . . . . . . V 99 . 22 51 . 2 6 . . . . . 1 . 2 1 . 1 . . . . . 2 4 . 2 . 1 1 1 . . . . . . . . . C 99 . 22 22 1 2 4 . . . 21 . . . 4 2 . 1 1 . . . 2 1 7 1 4 . 1 1 . . . . . . . . . . Z 99 1 13 10 1 3 7 . 1 . 57 . . 1 1 1 . . . . . . 1 1 . . . . . . . . . . . . . . . . H 99 7 . 1 6 14 28 . 27 12 1 3 . . . . . . . . 1 . . 1 . . . . . . . . . . . . 1 . . 1 K 99 3 . 1 2 10 32 . 27 14 . 4 . . . . . . . . . . 1 1 . . . . . 1 . . . . . . 3 . . . f 99 18 . 1 19 . . . . . . . . . . . . . 11 30 17 . . . . . . . . . . . . . . . 1 . 1 1 b 99 23 . 1 25 . . . . . . . . . . . . . 17 19 11 . 1 . . . . . . . . . . . . . 1 . 1 1 P 99 3 6 . . . 8 . 54 . 14 6 . . . . . . . . . 8 . . . . . . . . . . . . . . . . . . Q 99 2 9 1 . 5 17 . 32 7 16 4 . . . . . . . . . 6 . . . . . . . . . . . . . . . . . . r 99 67 14 . . 4 6 . 3 2 1 1 . . . . . . . . . 1 . . . . . . . . . . . . . . . . . . n 99 96 1 1 . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . m 99 97 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . c 99 2 10 9 1 1 1 . . . 2 1 1 . 2 2 2 9 . 2 . . . 2 7 2 5 4 8 4 6 2 . 2 2 5 . 8 . . z 99 20 40 1 . 10 22 . . 3 1 1 . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . 8 99 28 . 1 4 7 20 . 9 10 8 5 . . . . . . 1 . . 3 . . . 1 . . 1 . . . . . . . . . . . J 99 92 . . 3 . . . . . . . . . . . . . 2 . . . . . . . . . . . . . . . . . . . . . L 99 94 . . 1 . . . . . . . . . . . . . 2 . . . . . . . . . . . . . . . . . . . . . F 99 94 . . 2 . . 1 . . . . 1 . . . . . 1 . . . . . . . . . . . . . . 1 . . . . . . G 99 91 . . 1 . 1 . . . . . . . . 1 . . 3 . 1 . 1 . . . . . . . . . . . . . . . . 1 R 99 97 . . 1 . . . . . . . . . . . . . 1 . . . . . . . . . . . . . . . . . . . . . S 99 96 . . 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . M 99 89 . . 6 . . . . . . . . . . . . . 3 . . . . 1 . . . . . . . . . . . . . . . . N 99 94 . . . . . . . . . . . . . . . . 2 . . . . . . . . 1 1 . . . . . . . . . 1 . X 99 4 . . 7 4 15 . . . . . . . . . . . 7 4 . . 11 48 . . . . . . . . . . . . . . . . Y 99 . . . 4 22 17 . . . . . . . . . . . 4 . . . 22 26 . . . . . . . . . . . . . 4 . . A 99 2 . . . 13 23 . 13 6 4 8 . . . . . . 2 10 . . 4 15 . . . . . . . . . . . . . . . . B 99 3 . . 3 17 33 . 7 . 3 7 . . . . . . . 3 . . 7 17 . . . . . . . . . . . . . . . . i 99 . . . 17 . . . . . . . . . . . . . . . 44 . . . . . . . . . . . . . . . 15 . 8 15 j 99 91 8 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . k 99 95 . . 2 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . w 99 79 18 . . . . . 3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- TOT 99 27 6 7 5 5 5 7 4 3 4 2 1 1 4 1 0 1 4 2 2 1 1 1 1 1 0 1 1 1 0 0 0 0 0 0 0 0 0 0 Previous-symbol probability (× 99): TT a u e o y q U V C Z H K f b P Q r n m c z 8 J L F G R S M N X Y A B i j k w -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- 27 . 19 23 30 71 49 98 51 64 30 63 18 38 4 9 17 42 15 . . 35 74 55 1 4 2 14 . 4 15 8 18 17 4 23 2 . . . a 6 11 . 1 14 . . . 1 2 1 3 4 1 2 4 6 . 14 16 26 4 1 2 . 3 2 6 4 4 9 5 4 17 4 . 25 . 35 25 u 7 18 . . 10 . . . . . . . 1 1 1 1 . . 12 11 21 1 . 1 . 1 1 1 . 1 1 . . 4 . . 8 . 16 18 e 5 8 6 2 . 5 4 . 14 9 6 11 6 . 3 1 6 2 1 . . 1 3 2 1 4 . 3 5 5 . 3 7 . 6 . . . 4 . o 5 . 1 1 . . . . . 1 1 1 20 30 18 35 20 20 32 1 3 13 2 2 6 18 13 35 10 14 33 27 29 17 17 23 13 . 11 5 y 5 8 5 3 . 4 2 . 8 7 4 7 6 5 8 3 11 3 . . . 1 1 1 3 9 . 2 14 9 . 3 4 . 8 7 . . . . q 7 . 1 . 16 . . . 1 . 2 . 39 18 54 34 17 12 2 . . 20 1 2 22 56 7 30 47 54 15 44 26 22 42 20 . . . . U 4 . 17 31 . 1 5 . . . . . 1 . 2 2 6 6 . . . . 4 9 19 2 24 2 5 2 3 2 7 . 8 7 . . . . V 3 . 11 23 . 2 4 . . . . . 2 1 2 3 3 2 . . . . 1 7 13 1 13 1 3 2 6 1 4 . 2 . . . . . C 4 . 13 11 1 1 3 . . . 21 1 . 1 3 5 6 5 1 . . . 5 3 28 2 31 1 5 2 4 2 . 4 2 . . 8 . . Z 2 . 3 2 . 1 2 . . . 25 . . 2 . 1 . . . . . . 1 1 . . . . . . 1 . . . 2 . . . . . H 1 . . . 1 2 5 . 5 3 . 1 . . . . . . . . . . . . . . . . . . . . . . . . 4 . . 3 K 1 . . . . 1 4 . 4 3 . 2 . . . . . . . . . . . . . . . . . . . . . . . . 8 . . . f 4 2 . . 15 . . . . . . . . . . . . . 11 55 35 3 1 . . . . 1 . . . . . . . . 15 . 19 18 b 1 1 . . 7 . . . . . . . . 1 . . . . 7 13 8 1 1 . . . . . . . . . . . . . 4 . 4 8 P 0 . . . . . . . 2 . 1 1 . . . . . . . . . 2 . . . . . . . . . . . . . . . . . . Q 1 . 1 . . 1 3 . 7 2 4 2 . . . . . . . . . 9 . . . . . . . . . . . . . . . . . . r 4 9 9 . . 3 4 . 3 2 1 2 1 . . . . 1 . . . 4 . . . . . . . . . . . . . . 2 . . . n 2 7 . . . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . m 2 7 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . c 1 . 1 1 . . . . . . . . 1 . . 1 8 6 . 1 . . . 1 4 1 6 3 6 2 12 3 . 13 4 20 . 83 . . z 1 1 10 . . 3 7 . . 1 . 1 . 1 . . . . . . . . . . . . . . 1 . . . . . . . 2 . . . 8 1 1 . . 1 2 5 . 2 4 3 3 1 . . . . . . . . 5 . . . 1 1 . 1 . . . . . 2 . 2 . . . J 1 3 . . 1 . . . . . . . . . . . . . 1 . . . . . . . . . . . . . . . . . . . . . L 1 4 . . . . . . . . . . . . . . . . 1 . . 1 . . . . . . . . . . . . . . . . 2 . F 0 2 . . . . . . . . . . 1 . . . . . . . . . . . . . . . . . . . . 4 . . . . . . G 1 2 . . . . . . . . . . . . . . . . 1 . . . . . . . . . . . . . . . . . . . . 3 R 1 3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . S 1 5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 . . . . . . . . M 0 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . N 0 1 . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 . . . . . . . . . 2 . X 0 . . . . . . . . . . . . . . . . . . . . . 1 5 . . . . . . . . . . . . . . . . Y 0 . . . . . . . . . . . . . . . . . . . . . 1 2 . . . . . . . . . . . . . 8 . . A 0 . . . . 1 1 . 1 . . 1 . . . . . . . 1 . . 1 3 . . . . . . . . . . . . . . . . B 0 . . . . . 1 . . . . 1 . . . . . . . . . . 1 2 . . . . . . . . . . . . . . . . i 0 . . . 1 . . . . . . . . . . . . . . . 5 . . . . . . . . . . . . . . . 15 . 7 20 j 0 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . k 0 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . w 0 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- TOT 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 Next-symbol entropy: TT a u e o y q U V C Z H K f b P Q r n m c z 8 J L F G R S M N X Y A B i j k w ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- 3.659 . 0.193 0.237 0.235 0.374 0.308 0.504 0.292 0.276 0.186 0.175 0.041 0.056 0.039 0.037 0.009 0.088 0.117 . . 0.051 0.189 0.124 0.004 0.017 0.004 0.028 . 0.017 0.015 0.012 0.008 0.007 0.004 0.011 0.002 . . . a 2.806 0.496 0.019 0.046 0.356 0.019 0.013 0.028 0.054 0.076 0.042 0.050 0.037 0.007 0.079 0.061 0.013 . 0.304 0.236 0.290 0.028 0.024 0.024 . 0.042 0.013 0.046 0.042 0.058 0.033 0.028 0.007 0.024 0.013 . 0.061 . 0.086 0.050 u 1.558 0.329 0.007 . 0.282 0.007 . 0.017 . . . . 0.012 0.007 0.034 0.017 . . 0.262 0.164 0.241 0.007 . 0.012 0.007 0.012 0.007 0.012 . 0.012 0.007 . . 0.007 . . 0.022 . 0.042 0.034 e 3.067 0.519 0.281 0.149 0.015 0.199 0.181 0.009 0.367 0.231 0.194 0.164 0.067 . 0.140 0.028 0.015 0.033 0.053 . . 0.009 0.062 0.038 0.015 0.062 . 0.033 0.053 0.075 . 0.022 0.015 . 0.022 . . . 0.015 . o 3.957 0.119 0.070 0.096 . 0.009 . 0.046 0.029 0.035 0.040 0.016 0.169 0.175 0.406 0.350 0.046 0.184 0.503 0.040 0.079 0.092 0.046 0.029 0.070 0.209 0.079 0.216 0.100 0.181 0.112 0.126 0.051 0.029 0.056 0.046 0.046 . 0.040 0.016 y 3.109 0.510 0.248 0.171 0.034 0.174 0.094 0.009 0.269 0.194 0.162 0.124 0.069 0.045 0.263 0.055 0.028 0.040 0.028 . . 0.009 0.034 0.016 0.045 0.127 . 0.028 0.130 0.127 . 0.022 0.009 . 0.028 0.016 . . . . q 3.597 0.061 0.048 0.025 0.358 0.006 0.012 0.006 0.029 0.012 0.058 0.006 0.199 0.091 0.521 0.273 0.029 0.091 0.061 0.006 . 0.096 0.012 0.021 0.143 0.334 0.037 0.150 0.232 0.330 0.044 0.136 0.033 0.025 0.082 0.029 . . . . U 2.446 0.010 0.497 0.502 . 0.086 0.230 . . . . . 0.010 . 0.113 0.056 0.018 0.076 0.018 . . . 0.095 0.133 0.185 0.038 0.140 0.025 0.061 0.050 0.018 0.018 0.018 . 0.032 0.018 . . . . V 2.401 0.013 0.485 0.492 . 0.133 0.243 0.013 . . . . 0.042 0.013 0.128 0.079 0.013 0.042 . 0.013 . . 0.033 0.133 0.182 0.024 0.111 0.013 0.058 0.058 0.042 0.013 0.013 . 0.013 . . . . . C 3.277 0.029 0.484 0.480 0.087 0.108 0.190 . 0.029 . 0.475 0.029 . 0.011 0.174 0.113 0.021 0.075 0.050 . . . 0.118 0.063 0.267 0.050 0.186 0.021 0.075 0.057 0.029 0.021 . 0.011 0.011 . . 0.011 . . Z 2.227 0.071 0.391 0.338 0.084 0.161 0.263 . 0.056 . 0.459 . . 0.056 0.071 0.056 . . 0.023 0.023 . . 0.041 0.041 0.023 . . . . 0.023 0.023 . . . 0.023 . . . . . H 2.706 0.277 . 0.040 0.238 0.393 0.514 . 0.509 0.369 0.040 0.138 . . . . . . . . 0.040 . . 0.040 . . . . . . . . . . . . 0.069 . . 0.040 K 2.650 0.151 . 0.091 0.123 0.340 0.526 . 0.509 0.400 . 0.201 . . . . . . . . . . 0.053 0.053 . . . . . 0.053 . . . . . . 0.151 . . . f 2.585 0.442 0.019 0.041 0.458 0.027 0.011 . . . 0.011 . . . . . . 0.011 0.346 0.522 0.432 0.034 0.027 . . . . 0.011 . . . . . . . . 0.060 . 0.077 0.054 b 2.617 0.486 . 0.043 0.500 0.024 . 0.024 . . . . . 0.024 . 0.024 . . 0.435 0.458 0.344 0.024 0.059 0.024 . . . . . . . . . . . . 0.043 . 0.043 0.059 P 2.106 0.147 0.236 . . . 0.304 . 0.478 . 0.401 0.236 . . . . . . . . . 0.304 . . . . . . . . . . . . . . . . . . Q 2.831 0.107 0.319 0.063 . 0.219 0.437 . 0.526 0.279 0.421 0.192 . . . . . . . . . 0.232 . 0.036 . . . . . . . . . . . . . . . . r 1.744 0.385 0.406 0.036 0.011 0.171 0.239 . 0.155 0.106 0.079 0.049 0.011 . . 0.011 . 0.020 . . . 0.043 . 0.011 . . . . . . . . . . . . 0.011 . . . n 0.277 0.046 0.056 0.044 . 0.018 0.077 . 0.018 . . . . . . . . . . . . . 0.018 . . . . . . . . . . . . . . . . . m 0.197 0.032 0.085 . . 0.020 0.020 . 0.020 . . . . . . . . . . . . . . 0.020 . . . . . . . . . . . . . . . . c 4.325 0.092 0.331 0.316 0.054 0.054 0.054 . . . 0.092 0.054 0.054 . 0.125 0.125 0.125 0.316 . 0.125 . . . 0.092 0.265 0.092 0.226 0.180 0.300 0.180 0.246 0.125 . 0.125 0.092 0.204 . 0.283 . . z 2.292 0.467 0.529 0.058 0.024 0.325 0.485 . . 0.135 0.042 0.073 . 0.024 0.058 . . 0.024 . . . . . . . . . . 0.024 . . . . . . . 0.024 . . . 8 3.167 0.514 . 0.054 0.180 0.266 0.470 0.031 0.309 0.331 0.301 0.204 0.031 . 0.031 . . . 0.054 . . 0.140 0.031 . . 0.074 0.031 . 0.054 . . . . . 0.031 . 0.031 . . . J 0.464 0.094 0.037 . 0.165 . . . . . . . . . . . . . 0.130 0.037 . . . . . . . . . . . . . . . . . . . . L 0.404 0.071 . 0.028 0.083 . . . . . . . . . 0.028 . . 0.028 0.113 . . 0.028 . . . . . . . . . . . . . . . . 0.028 . F 0.428 0.077 . . 0.105 . . 0.062 . . . . 0.062 . . . . . 0.062 . . . . . . . . . . . . . . 0.062 . . . . . . G 0.610 0.112 . . 0.079 . 0.046 . . . . . . . . 0.046 . . 0.156 . 0.079 . 0.046 . . . . . . . . . . . . . . . . 0.046 R 0.163 0.029 . . 0.067 . . . . . . . . . . . . . 0.067 . . . . . . . . . . . . . . . . . . . . . S 0.217 0.038 . . 0.098 . . . . . . . . . . 0.027 . . 0.027 . . . . . . . . . . . . 0.027 . . . . . . . . M 0.627 0.143 . . 0.243 . . . . . . . . . . . . . 0.151 . . . . 0.091 . . . . . . . . . . . . . . . . N 0.392 0.072 . . . . . . . . . . . . . . . . 0.115 . . . . . . . . 0.068 0.068 . . . . . . . . . 0.068 . X 2.353 0.176 . . 0.278 0.176 0.408 . . . . . . . . . . . 0.278 0.176 . . 0.352 0.508 . . . . . . . . . . . . . . . . Y 2.492 . . . 0.197 0.479 0.439 . . . . . . . . . . . 0.197 . . . 0.479 0.506 . . . . . . . . . . . . . 0.197 . . A 3.110 0.110 . . . 0.389 0.488 . 0.389 0.237 0.181 0.285 . . . . . . 0.110 0.325 . . 0.181 0.415 . . . . . . . . . . . . . . . . B 2.826 0.164 . . 0.164 0.431 0.528 . 0.260 . 0.164 0.260 . . . . . . . 0.164 . . 0.260 0.431 . . . . . . . . . . . . . . . . i 2.074 . . . 0.438 . . . . . . . . . . . . . . . 0.521 . . . . . . . . . . . . . . . 0.415 . 0.285 0.415 j 0.414 0.115 0.299 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . k 0.258 0.051 . . 0.104 0.104 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . w 0.844 0.263 0.445 . . . . . 0.136 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 2.761 0.511 0.245 0.264 0.218 0.208 0.212 0.270 0.193 0.153 0.172 0.094 0.056 0.042 0.181 0.089 0.014 0.062 0.176 0.117 0.107 0.042 0.091 0.072 0.060 0.080 0.036 0.049 0.058 0.080 0.024 0.033 0.011 0.010 0.019 0.012 0.019 0.006 0.021 0.015 Previous-symbol entropy: TT a u e o y q U V C Z H K f b P Q r n m c z 8 J L F G R S M N X Y A B i j k w ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- 0.511 . 0.454 0.487 0.523 0.348 0.504 0.019 0.494 0.403 0.522 0.412 0.450 0.530 0.174 0.311 0.436 0.524 0.415 . . 0.530 0.316 0.471 0.065 0.186 0.105 0.394 . 0.185 0.410 0.297 0.451 0.439 0.181 0.490 0.110 . . . a 0.245 0.357 0.019 0.042 0.392 0.023 0.016 0.025 0.071 0.128 0.063 0.141 0.175 0.053 0.112 0.178 0.236 . 0.398 0.429 0.505 0.180 0.073 0.092 . 0.140 0.105 0.234 0.188 0.185 0.312 0.221 0.176 0.439 0.181 . 0.500 . 0.531 0.503 u 0.264 0.449 0.007 . 0.336 0.009 . 0.016 . . . . 0.069 0.053 0.054 0.059 . . 0.373 0.348 0.478 0.054 . 0.054 0.037 0.048 0.062 0.079 . 0.048 0.091 . . 0.197 . . 0.285 . 0.424 0.445 e 0.218 0.299 0.251 0.120 0.015 0.209 0.186 0.006 0.401 0.316 0.244 0.347 0.252 . 0.170 0.074 0.236 0.126 0.068 . . 0.054 0.156 0.125 0.065 0.175 . 0.156 0.203 0.205 . 0.155 0.278 . 0.237 . . . 0.172 . o 0.208 0.030 0.058 0.073 . 0.009 . 0.033 0.032 0.050 0.050 0.041 0.469 0.523 0.446 0.530 0.464 0.468 0.527 0.077 0.159 0.382 0.112 0.092 0.238 0.442 0.379 0.530 0.325 0.400 0.528 0.509 0.520 0.439 0.438 0.490 0.389 . 0.345 0.220 y 0.212 0.303 0.215 0.135 0.033 0.178 0.094 0.006 0.293 0.264 0.201 0.270 0.252 0.222 0.304 0.137 0.358 0.144 0.036 . . 0.054 0.087 0.054 0.165 0.313 . 0.133 0.400 0.311 . 0.155 0.176 . 0.285 0.260 . . . . q 0.270 0.020 0.054 0.026 0.423 0.009 0.016 0.006 0.044 0.024 0.098 0.023 0.530 0.452 0.476 0.530 0.436 0.361 0.101 0.018 . 0.470 0.042 0.092 0.482 0.467 0.275 0.523 0.511 0.476 0.410 0.520 0.505 0.479 0.525 0.464 . . . . U 0.193 0.002 0.436 0.524 . 0.079 0.210 . . . . . 0.040 . 0.121 0.125 0.236 0.232 0.020 . . . 0.202 0.324 0.458 0.099 0.497 0.108 0.203 0.126 0.151 0.115 0.278 . 0.285 0.260 . . . . V 0.153 0.002 0.356 0.489 . 0.096 0.177 0.006 . . . . 0.117 0.053 0.107 0.137 0.147 0.107 . 0.018 . . 0.058 0.266 0.390 0.048 0.379 0.046 0.153 0.112 0.243 0.068 0.176 . 0.110 . . . . . C 0.172 0.005 0.387 0.360 0.067 0.088 0.153 . 0.025 . 0.475 0.056 . 0.053 0.166 0.213 0.236 0.206 0.049 . . . 0.219 0.154 0.516 0.113 0.524 0.079 0.218 0.126 0.201 0.115 . 0.197 0.110 . . 0.299 . . Z 0.094 0.007 0.169 0.128 0.033 0.070 0.120 . 0.025 . 0.500 . . 0.123 0.034 0.059 . . 0.011 0.018 . . 0.042 0.054 0.037 . . . . 0.027 0.091 . . . 0.110 . . . . . H 0.056 0.019 . 0.007 0.062 0.126 0.204 . 0.219 0.159 0.011 0.084 . . . . . . . . 0.020 . . 0.031 . . . . . . . . . . . . 0.181 . . 0.136 K 0.042 0.007 . 0.012 0.022 0.079 0.177 . 0.172 0.138 . 0.097 . . . . . . . . . . 0.024 0.031 . . . . . 0.027 . . . . . . 0.285 . . . f 0.181 0.133 0.013 0.026 0.407 0.023 0.009 . . . 0.011 . . . . . . 0.036 0.354 0.473 0.530 0.154 0.058 . . . . 0.046 . . . . . . . . 0.415 . 0.461 0.445 b 0.089 0.078 . 0.012 0.275 0.009 . 0.006 . . . . . 0.053 . 0.024 . . 0.263 0.389 0.301 0.054 0.058 0.031 . . . . . . . . . . . . 0.181 . 0.172 0.285 P 0.014 0.002 0.013 . . . 0.022 . 0.108 . 0.044 0.041 . . . . . . . . . 0.125 . . . . . . . . . . . . . . . . . . Q 0.062 0.007 0.086 0.012 . 0.066 0.159 . 0.266 0.122 0.186 0.131 . . . . . . . . . 0.316 . 0.031 . . . . . . . . . . . . . . . . r 0.176 0.316 0.311 0.022 0.009 0.143 0.199 . 0.140 0.122 0.081 0.097 0.040 . . 0.024 . 0.063 . . . 0.180 . 0.031 . . . . . . . . . . . . 0.110 . . . n 0.117 0.281 0.024 0.017 . 0.009 0.040 . 0.010 . . . . . . . . . . . . . 0.024 . . . . . . . . . . . . . . . . . m 0.107 0.261 0.033 . . 0.009 0.009 . 0.010 . . . . . . . . . . . . . . 0.031 . . . . . . . . . . . . . . . . c 0.042 0.004 0.061 0.053 0.009 0.009 0.009 . . . 0.021 0.023 0.040 . 0.027 0.059 0.304 0.232 . 0.044 . . . 0.054 0.197 0.048 0.253 0.156 0.232 0.098 0.366 0.155 . 0.383 0.181 0.464 . 0.219 . . z 0.091 0.073 0.330 0.017 0.009 0.153 0.267 . . 0.079 0.021 0.071 . 0.053 0.027 . . 0.036 . . . . . . . . . . 0.039 . . . . . . . 0.110 . . . 8 0.072 0.073 . 0.012 0.058 0.096 0.207 0.006 0.125 0.173 0.137 0.161 0.040 . 0.011 . . . 0.020 . . 0.226 0.024 . . 0.067 0.062 . 0.067 . . . . . 0.110 . 0.110 . . . J 0.060 0.152 0.007 . 0.043 . . . . . . . . . . . . . 0.043 0.018 . . . . . . . . . . . . . . . . . . . . L 0.080 0.199 . 0.007 0.028 . . . . . . . . . 0.011 . . 0.036 0.049 . . 0.054 . . . . . . . . . . . . . . . . 0.104 . F 0.036 0.096 . . 0.015 . . 0.006 . . . . 0.040 . . . . . 0.011 . . . . . . . . . . . . . . 0.197 . . . . . . G 0.049 0.125 . . 0.015 . 0.009 . . . . . . . . 0.024 . . 0.043 . 0.035 . 0.024 . . . . . . . . . . . . . . . . 0.136 R 0.058 0.152 . . 0.015 . . . . . . . . . . . . . 0.020 . . . . . . . . . . . . . . . . . . . . . S 0.080 0.204 . . 0.033 . . . . . . . . . . 0.024 . . 0.011 . . . . . . . . . . . . 0.068 . . . . . . . . M 0.024 0.063 . . 0.028 . . . . . . . . . . . . . 0.020 . . . . 0.031 . . . . . . . . . . . . . . . . N 0.033 0.088 . . . . . . . . . . . . . . . . 0.020 . . . . . . . . 0.046 0.039 . . . . . . . . . 0.104 . X 0.011 0.002 . . 0.015 0.009 0.028 . . . . . . . . . . . 0.020 0.018 . . 0.058 0.216 . . . . . . . . . . . . . . . . Y 0.010 . . . 0.009 0.035 0.028 . . . . . . . . . . . 0.011 . . . 0.087 0.125 . . . . . . . . . . . . . 0.299 . . A 0.019 0.002 . . . 0.046 0.069 . 0.050 0.033 0.021 0.071 . . . . . . 0.011 0.067 . . 0.042 0.154 . . . . . . . . . . . . . . . . B 0.012 0.002 . . 0.009 0.035 0.059 . 0.018 . 0.011 0.041 . . . . . . . 0.018 . . 0.042 0.109 . . . . . . . . . . . . . . . . i 0.019 . . . 0.053 . . . . . . . . . . . . . . . 0.223 . . . . . . . . . . . . . . . 0.415 . 0.272 0.469 j 0.006 0.016 0.007 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . k 0.021 0.058 . . 0.009 0.009 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . w 0.015 0.037 0.037 . . . . . 0.010 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 2.761 3.924 3.329 2.578 2.943 1.974 2.973 0.138 2.513 2.010 2.697 2.106 2.513 2.166 2.239 2.510 3.089 2.572 2.894 1.934 2.251 2.830 1.748 2.653 2.651 2.148 2.639 2.529 2.580 2.328 2.800 2.379 2.560 2.769 2.751 2.429 3.090 0.817 2.584 2.637 Modified again the encoding, mapping Vu and Uu to separate letters: --- jsa2pok ------------------------ #! /n/gnu/bin/sed -f # A second attempt to raise the entropy of Voynichese #-- first stage: HOP-like encoding --- s/\*/?/g s/lj/H/g s/qj/K/g s/lg/P/g s/qg/Q/g s/ij/k/g s/ix/e/g s/is/r/g s/iiu/n/g s/y/i/g s/ci/a/g s/cg/8/g s/cs/z/g s/ir/w/g s/qo/q/g s/in/m/g s/u/?/g s/g/?/g #-- second stage: --- s/8a/u/g s/oe/y/g #-- third stage: zcHK strings --- s/zccccHcc/ZUX/g s/zccccKcc/ZUY/g s/ccHzccc/CHZC/g s/ccKzccc/CKZC/g s/cccHccc/UAC/g s/cccKccc/UBC/g s/ccccHcc/CCX/g s/ccccKcc/CCY/g s/zccHccc/VAC/g s/zccKccc/VBC/g s/zcccHcc/ZCX/g s/zcccKcc/ZCY/g s/zzcccHc/zZCA/g s/zzcccKc/zZCB/g s/ccHccc/CAC/g s/ccKccc/CBC/g s/cccHcc/UX/g s/cccKcc/UY/g s/ccccHc/CCA/g s/ccccKc/CCB/g s/zccHcc/VX/g s/zccKcc/VY/g s/zcccHc/ZCA/g s/zcccKc/ZCB/g s/zzcHcc/zZX/g s/zzcKcc/zZY/g s/Hcccc/AU/g s/Hczcc/AV/g s/Kcccc/BU/g s/Kczcc/BV/g s/cHccc/cAC/g s/cHccz/cXz/g s/cKccc/cBC/g s/cKccz/cYz/g s/ccHcc/CX/g s/ccKcc/CY/g s/cccHc/UA/g s/cccKc/UB/g s/ccccH/CCH/g s/ccccK/CCK/g s/ccccc/CU/g s/ccccz/CCz/g s/zcHcc/ZX/g s/zcKcc/ZY/g s/zccHc/VA/g s/zccKc/VB/g s/zcccH/ZCH/g s/zcccK/ZCK/g s/zcccc/ZU/g s/zcccz/ZCz/g s/zzccH/zVH/g s/zzccK/zVK/g s/Hccc/HU/g s/Hccz/Xz/g s/Hczc/AZ/g s/Hzcc/HV/g s/Kccc/KU/g s/Kccz/Yz/g s/Kczc/BZ/g s/Kzcc/KV/g s/cHcc/cX/g s/cKcc/cY/g s/ccHc/CA/g s/ccKc/CB/g s/cccH/UH/g s/cccK/UK/g s/cccc/CC/g s/cccz/Uz/g s/zHcc/zX/g s/zKcc/zY/g s/zcHc/ZA/g s/zcKc/ZB/g s/zccH/VH/g s/zccK/VK/g s/zccc/ZC/g s/zccz/Vz/g s/zzcc/zV/g s/Hcc/X/g s/Hcz/Az/g s/Hzc/HZ/g s/Kcc/Y/g s/Kcz/Bz/g s/Kzc/KZ/g s/cHc/cA/g s/cKc/cB/g s/ccH/CH/g s/ccK/CK/g s/ccc/U/g s/ccz/Cz/g s/czc/cZ/g s/zcH/ZH/g s/zcK/ZK/g s/zcc/V/g s/zcz/Zz/g s/Hc/A/g s/Kc/B/g s/cH/cH/g s/cK/cK/g s/cc/C/g s/zc/Z/g #--- fourth stage: HK-vowel and UV-u combos --- s/Ha/f/g s/Ka/b/g s/Aa/J/g s/Au/L/g s/Ba/F/g s/Bu/G/g s/Xa/R/g s/Xu/S/g s/Ya/M/g s/Yu/N/g s/Uu/6/g s/Vu/7/g ------------------------------------ extract-words-from-interlin \ -recode jsa2pok \ -chars "aueoyqUVCZHKfbPQ67rnmcz8JLFGRSMNXYABijkw" \ .tmp-c-jsa.evt \ .tmp-c-pok Digraph counts: TT a u e o y q U V C Z H K f b P Q 6 7 r n m c z 8 J L F G R S M N X Y A B i j k w ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- 6414 . 271 368 365 797 564 1646 299 240 258 236 35 52 33 31 6 92 216 229 134 . . 46 264 145 2 12 2 22 . 12 10 8 5 4 2 7 1 . . . q 1668 15 11 5 191 1 2 1 2 . 14 1 74 25 498 120 6 25 4 2 15 1 . 27 2 4 46 168 8 49 93 165 10 43 7 5 22 6 . . . . a 1435 731 3 9 163 3 2 5 5 8 8 10 7 1 18 13 2 . 6 9 123 82 114 5 4 4 . 8 2 9 8 12 6 5 1 4 2 . 13 . 20 10 e 1195 535 90 35 2 53 46 1 60 23 51 40 12 . 32 4 2 5 83 43 9 . . 1 11 6 2 11 . 5 9 14 . 3 2 . 3 . . . 2 . y 1151 548 71 41 5 42 18 1 34 29 38 26 12 7 78 9 4 6 47 20 4 . . 1 5 2 7 27 . 4 28 27 . 3 1 . 4 2 . . . . o 1118 24 12 18 . 1 . 7 1 4 6 2 39 41 164 123 7 44 3 1 285 6 14 17 7 4 12 53 14 56 19 43 22 26 8 4 9 7 7 . 6 2 f 910 161 2 5 176 3 1 . . . 1 . . . . . . 1 . . 98 275 153 4 3 . . . . 1 . . . . . . . . 8 . 11 7 r 874 588 128 4 1 31 51 . 18 6 11 6 1 . . 1 . 2 9 10 . . . 5 . 1 . . . . . . . . . . . . 1 . . . C 849 3 191 186 12 16 35 . 3 . 181 3 . 1 31 17 2 10 . . 6 . . . 18 8 59 6 34 2 10 7 3 2 . 1 1 . . 1 . . u 737 361 1 . 107 . . 2 . . . . 1 . 4 3 . . . . 92 52 85 1 . 2 1 1 1 1 . 2 1 . . . . . 4 . 8 7 U 505 1 246 . . 14 55 . . . . . 1 . 20 8 2 12 . . 2 . . . 16 25 40 5 27 3 9 7 2 2 2 . 4 2 . . . . 6 498 460 . . 7 1 . 1 . . . . . 1 3 . . . . . 15 2 6 . . . . 1 . . . . . . . 1 . . . . . . n 498 482 4 3 . 1 6 . 1 . . . . . . . . . . . . . . . 1 . . . . . . . . . . . . . . . . . m 439 429 6 . . 1 1 . . . . . . . . . . . 1 . . . . . . 1 . . . . . . . . . . . . . . . . 7 372 356 . . 8 . . . . . . . 1 . . . . . . . 1 . 4 . . . . . . 1 . . . . . . . . . . 1 . Z 369 4 50 38 5 12 25 . 2 . 212 . . 3 4 3 . . 1 . 1 1 . . 2 2 1 . . . . 1 1 . . . 1 . . . . . z 354 72 142 3 1 34 80 . . 4 2 4 . 1 3 . . 1 . 5 . . . . . . . . . . 1 . . . . . . . 1 . . . V 348 1 163 . . 18 43 1 . . . . 4 1 17 9 1 4 . . . 1 . . 3 18 28 2 14 1 6 6 4 1 1 . 1 . . . . . b 347 79 . 2 87 1 . 1 . . . . . 1 . 1 . . . . 59 67 37 1 3 1 . . . . . . . . . . . . 2 . 2 3 S 302 294 . . 5 . . . . . . . . . . 1 . . . . 1 . . . . . . . . . . . . 1 . . . . . . . . L 299 284 . 1 4 . . . . . . . . . 1 . . 1 . . 6 . . 1 . . . . . . . . . . . . . . . . 1 . 8 261 73 . 2 10 18 54 1 11 6 22 12 1 . 1 . . . 12 20 2 . . 7 1 . . 3 1 . 2 . . . . . 1 . 1 . . . Q 215 4 20 2 . 11 37 . 17 6 34 9 . . . . . . 52 10 . . . 12 . 1 . . . . . . . . . . . . . . . . J 207 193 1 . 7 . . . . . . . . . . . . . . . 5 1 . . . . . . . . . . . . . . . . . . . . R 197 193 . . 2 . . . . . . . . . . . . . . . 2 . . . . . . . . . . . . . . . . . . . . . H 190 14 . 1 11 26 53 . 27 11 1 5 . . . . . . 24 12 . . 1 . . 1 . . . . . . . . . . . . 2 . . 1 G 160 147 . . 2 . 1 . . . . . . . . 1 . . . . 5 . 2 . 1 . . . . . . . . . . . . . . . . 1 K 134 4 . 2 3 14 43 . 14 8 . 6 . . . . . . 22 11 . . . . 1 1 . . . . . 1 . . . . . . 4 . . . c 131 2 13 12 1 1 1 . . . 2 1 1 . 3 3 3 12 . . . 3 . . . 2 9 2 7 5 11 5 8 3 . 3 2 6 . 10 . . F 110 104 . . 2 . . 1 . . . . 1 . . . . . . . 1 . . . . . . . . . . . . . . 1 . . . . . . N 97 92 . . . . . . . . . . . . . . . . . . 2 . . . . . . . . 1 1 . . . . . . . . . 1 . M 67 60 . . 4 . . . . . . . . . . . . . . . 2 . . . . 1 . . . . . . . . . . . . . . . . k 56 54 . . 1 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A 52 1 . . . 7 12 . 5 3 2 4 . . . . . . 2 . 1 5 . . 2 8 . . . . . . . . . . . . . . . . i 52 . . . 9 . . . . . . . . . . . . . . . . . 23 . . . . . . . . . . . . . . . 8 . 4 8 w 39 31 7 . . . . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . P 35 1 2 . . . 3 . 4 . 5 2 . . . . . . 15 . . . . 3 . . . . . . . . . . . . . . . . . . B 30 1 . . 1 5 10 . 1 . 1 2 . . . . . . 1 . . 1 . . 2 5 . . . . . . . . . . . . . . . . X 27 1 . . 2 1 4 . . . . . . . . . . . . . 2 1 . . 3 13 . . . . . . . . . . . . . . . . Y 23 . . . 1 5 4 . . . . . . . . . . . . . 1 . . . 5 6 . . . . . . . . . . . . . 1 . . j 12 11 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 22777 6414 1435 737 1195 1118 1151 1668 505 348 849 369 190 134 910 347 35 215 498 372 874 498 439 131 354 261 207 299 110 160 197 302 67 97 27 23 52 30 52 12 56 39 Next-symbol probability (× 99): TT a u e o y q U V C Z H K f b P Q 6 7 r n m c z 8 J L F G R S M N X Y A B i j k w -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- 99 . 4 6 6 12 9 25 5 4 4 4 1 1 1 . . 1 3 4 2 . . 1 4 2 . . . . . . . . . . . . . . . . a 99 50 . 1 11 . . . . 1 1 1 . . 1 1 . . . 1 8 6 8 . . . . 1 . 1 1 1 . . . . . . 1 . 1 1 u 99 48 . . 14 . . . . . . . . . 1 . . . . . 12 7 11 . . . . . . . . . . . . . . . 1 . 1 1 e 99 44 7 3 . 4 4 . 5 2 4 3 1 . 3 . . . 7 4 1 . . . 1 . . 1 . . 1 1 . . . . . . . . . . o 99 2 1 2 . . . 1 . . 1 . 3 4 15 11 1 4 . . 25 1 1 2 1 . 1 5 1 5 2 4 2 2 1 . 1 1 1 . 1 . y 99 47 6 4 . 4 2 . 3 2 3 2 1 1 7 1 . 1 4 2 . . . . . . 1 2 . . 2 2 . . . . . . . . . . q 99 1 1 . 11 . . . . . 1 . 4 1 30 7 . 1 . . 1 . . 2 . . 3 10 . 3 6 10 1 3 . . 1 . . . . . U 99 . 48 . . 3 11 . . . . . . . 4 2 . 2 . . . . . . 3 5 8 1 5 1 2 1 . . . . 1 . . . . . V 99 . 46 . . 5 12 . . . . . 1 . 5 3 . 1 . . . . . . 1 5 8 1 4 . 2 2 1 . . . . . . . . . C 99 . 22 22 1 2 4 . . . 21 . . . 4 2 . 1 . . 1 . . . 2 1 7 1 4 . 1 1 . . . . . . . . . . Z 99 1 13 10 1 3 7 . 1 . 57 . . 1 1 1 . . . . . . . . 1 1 . . . . . . . . . . . . . . . . H 99 7 . 1 6 14 28 . 14 6 1 3 . . . . . . 13 6 . . 1 . . 1 . . . . . . . . . . . . 1 . . 1 K 99 3 . 1 2 10 32 . 10 6 . 4 . . . . . . 16 8 . . . . 1 1 . . . . . 1 . . . . . . 3 . . . f 99 18 . 1 19 . . . . . . . . . . . . . . . 11 30 17 . . . . . . . . . . . . . . . 1 . 1 1 b 99 23 . 1 25 . . . . . . . . . . . . . . . 17 19 11 . 1 . . . . . . . . . . . . . 1 . 1 1 P 99 3 6 . . . 8 . 11 . 14 6 . . . . . . 42 . . . . 8 . . . . . . . . . . . . . . . . . . Q 99 2 9 1 . 5 17 . 8 3 16 4 . . . . . . 24 5 . . . 6 . . . . . . . . . . . . . . . . . . 6 99 91 . . 1 . . . . . . . . . 1 . . . . . 3 . 1 . . . . . . . . . . . . . . . . . . . 7 99 95 . . 2 . . . . . . . . . . . . . . . . . 1 . . . . . . . . . . . . . . . . . . . r 99 67 14 . . 4 6 . 2 1 1 1 . . . . . . 1 1 . . . 1 . . . . . . . . . . . . . . . . . . n 99 96 1 1 . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . m 99 97 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . c 99 2 10 9 1 1 1 . . . 2 1 1 . 2 2 2 9 . . . 2 . . . 2 7 2 5 4 8 4 6 2 . 2 2 5 . 8 . . z 99 20 40 1 . 10 22 . . 1 1 1 . . 1 . . . . 1 . . . . . . . . . . . . . . . . . . . . . . 8 99 28 . 1 4 7 20 . 4 2 8 5 . . . . . . 5 8 1 . . 3 . . . 1 . . 1 . . . . . . . . . . . J 99 92 . . 3 . . . . . . . . . . . . . . . 2 . . . . . . . . . . . . . . . . . . . . . L 99 94 . . 1 . . . . . . . . . . . . . . . 2 . . . . . . . . . . . . . . . . . . . . . F 99 94 . . 2 . . 1 . . . . 1 . . . . . . . 1 . . . . . . . . . . . . . . 1 . . . . . . G 99 91 . . 1 . 1 . . . . . . . . 1 . . . . 3 . 1 . 1 . . . . . . . . . . . . . . . . 1 R 99 97 . . 1 . . . . . . . . . . . . . . . 1 . . . . . . . . . . . . . . . . . . . . . S 99 96 . . 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . M 99 89 . . 6 . . . . . . . . . . . . . . . 3 . . . . 1 . . . . . . . . . . . . . . . . N 99 94 . . . . . . . . . . . . . . . . . . 2 . . . . . . . . 1 1 . . . . . . . . . 1 . X 99 4 . . 7 4 15 . . . . . . . . . . . . . 7 4 . . 11 48 . . . . . . . . . . . . . . . . Y 99 . . . 4 22 17 . . . . . . . . . . . . . 4 . . . 22 26 . . . . . . . . . . . . . 4 . . A 99 2 . . . 13 23 . 10 6 4 8 . . . . . . 4 . 2 10 . . 4 15 . . . . . . . . . . . . . . . . B 99 3 . . 3 17 33 . 3 . 3 7 . . . . . . 3 . . 3 . . 7 17 . . . . . . . . . . . . . . . . i 99 . . . 17 . . . . . . . . . . . . . . . . . 44 . . . . . . . . . . . . . . . 15 . 8 15 j 99 91 8 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . k 99 95 . . 2 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . w 99 79 18 . . . . . 3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- TOT 99 28 6 3 5 5 5 7 2 2 4 2 1 1 4 2 0 1 2 2 4 2 2 1 2 1 1 1 0 1 1 1 0 0 0 0 0 0 0 0 0 0 Previous-symbol probability (× 99): TT a u e o y q U V C Z H K f b P Q 6 7 r n m c z 8 J L F G R S M N X Y A B i j k w -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- 28 . 19 49 30 71 49 98 59 68 30 63 18 38 4 9 17 42 43 61 15 . . 35 74 55 1 4 2 14 . 4 15 8 18 17 4 23 2 . . . a 6 11 . 1 14 . . . 1 2 1 3 4 1 2 4 6 . 1 2 14 16 26 4 1 2 . 3 2 6 4 4 9 5 4 17 4 . 25 . 35 25 u 3 6 . . 9 . . . . . . . 1 . . 1 . . . . 10 10 19 1 . 1 . . 1 1 . 1 1 . . . . . 8 . 14 18 e 5 8 6 5 . 5 4 . 12 7 6 11 6 . 3 1 6 2 17 11 1 . . 1 3 2 1 4 . 3 5 5 . 3 7 . 6 . . . 4 . o 5 . 1 2 . . . . . 1 1 1 20 30 18 35 20 20 1 . 32 1 3 13 2 2 6 18 13 35 10 14 33 27 29 17 17 23 13 . 11 5 y 5 8 5 6 . 4 2 . 7 8 4 7 6 5 8 3 11 3 9 5 . . . 1 1 1 3 9 . 2 14 9 . 3 4 . 8 7 . . . . q 7 . 1 1 16 . . . . . 2 . 39 18 54 34 17 12 1 1 2 . . 20 1 2 22 56 7 30 47 54 15 44 26 22 42 20 . . . . U 2 . 17 . . 1 5 . . . . . 1 . 2 2 6 6 . . . . . . 4 9 19 2 24 2 5 2 3 2 7 . 8 7 . . . . V 2 . 11 . . 2 4 . . . . . 2 1 2 3 3 2 . . . . . . 1 7 13 1 13 1 3 2 6 1 4 . 2 . . . . . C 4 . 13 25 1 1 3 . 1 . 21 1 . 1 3 5 6 5 . . 1 . . . 5 3 28 2 31 1 5 2 4 2 . 4 2 . . 8 . . Z 2 . 3 5 . 1 2 . . . 25 . . 2 . 1 . . . . . . . . 1 1 . . . . . . 1 . . . 2 . . . . . H 1 . . . 1 2 5 . 5 3 . 1 . . . . . . 5 3 . . . . . . . . . . . . . . . . . . 4 . . 3 K 1 . . . . 1 4 . 3 2 . 2 . . . . . . 4 3 . . . . . . . . . . . . . . . . . . 8 . . . f 4 2 . 1 15 . . . . . . . . . . . . . . . 11 55 35 3 1 . . . . 1 . . . . . . . . 15 . 19 18 b 2 1 . . 7 . . . . . . . . 1 . . . . . . 7 13 8 1 1 . . . . . . . . . . . . . 4 . 4 8 P 0 . . . . . . . 1 . 1 1 . . . . . . 3 . . . . 2 . . . . . . . . . . . . . . . . . . Q 1 . 1 . . 1 3 . 3 2 4 2 . . . . . . 10 3 . . . 9 . . . . . . . . . . . . . . . . . . 6 2 7 . . 1 . . . . . . . . 1 . . . . . . 2 . 1 . . . . . . . . . . . . 4 . . . . . . 7 2 5 . . 1 . . . . . . . 1 . . . . . . . . . 1 . . . . . . 1 . . . . . . . . . . 2 . r 4 9 9 1 . 3 4 . 4 2 1 2 1 . . . . 1 2 3 . . . 4 . . . . . . . . . . . . . . 2 . . . n 2 7 . . . . 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . m 2 7 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . c 1 . 1 2 . . . . . . . . 1 . . 1 8 6 . . . 1 . . . 1 4 1 6 3 6 2 12 3 . 13 4 20 . 83 . . z 2 1 10 . . 3 7 . . 1 . 1 . 1 . . . . . 1 . . . . . . . . . . 1 . . . . . . . 2 . . . 8 1 1 . . 1 2 5 . 2 2 3 3 1 . . . . . 2 5 . . . 5 . . . 1 1 . 1 . . . . . 2 . 2 . . . J 1 3 . . 1 . . . . . . . . . . . . . . . 1 . . . . . . . . . . . . . . . . . . . . . L 1 4 . . . . . . . . . . . . . . . . . . 1 . . 1 . . . . . . . . . . . . . . . . 2 . F 0 2 . . . . . . . . . . 1 . . . . . . . . . . . . . . . . . . . . . . 4 . . . . . . G 1 2 . . . . . . . . . . . . . . . . . . 1 . . . . . . . . . . . . . . . . . . . . 3 R 1 3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . S 1 5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 . . . . . . . . M 0 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . N 0 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 1 . . . . . . . . . 2 . X 0 . . . . . . . . . . . . . . . . . . . . . . . 1 5 . . . . . . . . . . . . . . . . Y 0 . . . . . . . . . . . . . . . . . . . . . . . 1 2 . . . . . . . . . . . . . 8 . . A 0 . . . . 1 1 . 1 1 . 1 . . . . . . . . . 1 . . 1 3 . . . . . . . . . . . . . . . . B 0 . . . . . 1 . . . . 1 . . . . . . . . . . . . 1 2 . . . . . . . . . . . . . . . . i 0 . . . 1 . . . . . . . . . . . . . . . . . 5 . . . . . . . . . . . . . . . 15 . 7 20 j 0 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . k 0 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . w 0 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- TOT 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 Next-symbol entropy: TT a u e o y q U V C Z H K f b P Q 6 7 r n m c z 8 J L F G R S M N X Y A B i j k w ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- 3.811 . 0.193 0.237 0.235 0.374 0.308 0.504 0.206 0.177 0.186 0.175 0.041 0.056 0.039 0.037 0.009 0.088 0.165 0.172 0.117 . . 0.051 0.189 0.124 0.004 0.017 0.004 0.028 . 0.017 0.015 0.012 0.008 0.007 0.004 0.011 0.002 . . . a 2.826 0.496 0.019 0.046 0.356 0.019 0.013 0.028 0.028 0.042 0.042 0.050 0.037 0.007 0.079 0.061 0.013 . 0.033 0.046 0.304 0.236 0.290 0.028 0.024 0.024 . 0.042 0.013 0.046 0.042 0.058 0.033 0.028 0.007 0.024 0.013 . 0.061 . 0.086 0.050 u 2.334 0.504 0.013 . 0.404 . . 0.023 . . . . 0.013 . 0.041 0.032 . . . . 0.375 0.270 0.359 0.013 . 0.023 0.013 0.013 0.013 0.013 . 0.023 0.013 . . . . . 0.041 . 0.071 0.064 e 3.236 0.519 0.281 0.149 0.015 0.199 0.181 0.009 0.217 0.110 0.194 0.164 0.067 . 0.140 0.028 0.015 0.033 0.267 0.173 0.053 . . 0.009 0.062 0.038 0.015 0.062 . 0.033 0.053 0.075 . 0.022 0.015 . 0.022 . . . 0.015 . o 3.963 0.119 0.070 0.096 . 0.009 . 0.046 0.009 0.029 0.040 0.016 0.169 0.175 0.406 0.350 0.046 0.184 0.023 0.009 0.503 0.040 0.079 0.092 0.046 0.029 0.070 0.209 0.079 0.216 0.100 0.181 0.112 0.126 0.051 0.029 0.056 0.046 0.046 . 0.040 0.016 y 3.219 0.510 0.248 0.171 0.034 0.174 0.094 0.009 0.150 0.134 0.162 0.124 0.069 0.045 0.263 0.055 0.028 0.040 0.188 0.102 0.028 . . 0.009 0.034 0.016 0.045 0.127 . 0.028 0.130 0.127 . 0.022 0.009 . 0.028 0.016 . . . . q 3.601 0.061 0.048 0.025 0.358 0.006 0.012 0.006 0.012 . 0.058 0.006 0.199 0.091 0.521 0.273 0.029 0.091 0.021 0.012 0.061 0.006 . 0.096 0.012 0.021 0.143 0.334 0.037 0.150 0.232 0.330 0.044 0.136 0.033 0.025 0.082 0.029 . . . . U 2.872 0.018 0.505 . . 0.143 0.348 . . . . . 0.018 . 0.184 0.095 0.032 0.128 . . 0.032 . . . 0.158 0.215 0.290 0.066 0.226 0.044 0.104 0.086 0.032 0.032 0.032 . 0.055 0.032 . . . . V 2.900 0.024 0.513 . . 0.221 0.373 0.024 . . . . 0.074 0.024 0.213 0.136 0.024 0.074 . . . 0.024 . . 0.059 0.221 0.293 0.043 0.186 0.024 0.101 0.101 0.074 0.024 0.024 . 0.024 . . . . . C 3.277 0.029 0.484 0.480 0.087 0.108 0.190 . 0.029 . 0.475 0.029 . 0.011 0.174 0.113 0.021 0.075 . . 0.050 . . . 0.118 0.063 0.267 0.050 0.186 0.021 0.075 0.057 0.029 0.021 . 0.011 0.011 . . 0.011 . . Z 2.234 0.071 0.391 0.338 0.084 0.161 0.263 . 0.041 . 0.459 . . 0.056 0.071 0.056 . . 0.023 . 0.023 0.023 . . 0.041 0.041 0.023 . . . . 0.023 0.023 . . . 0.023 . . . . . H 3.095 0.277 . 0.040 0.238 0.393 0.514 . 0.400 0.238 0.040 0.138 . . . . . . 0.377 0.252 . . 0.040 . . 0.040 . . . . . . . . . . . . 0.069 . . 0.040 K 3.048 0.151 . 0.091 0.123 0.340 0.526 . 0.340 0.243 . 0.201 . . . . . . 0.428 0.296 . . . . 0.053 0.053 . . . . . 0.053 . . . . . . 0.151 . . . f 2.585 0.442 0.019 0.041 0.458 0.027 0.011 . . . 0.011 . . . . . . 0.011 . . 0.346 0.522 0.432 0.034 0.027 . . . . 0.011 . . . . . . . . 0.060 . 0.077 0.054 b 2.617 0.486 . 0.043 0.500 0.024 . 0.024 . . . . . 0.024 . 0.024 . . . . 0.435 0.458 0.344 0.024 0.059 0.024 . . . . . . . . . . . . 0.043 . 0.043 0.059 P 2.509 0.147 0.236 . . . 0.304 . 0.358 . 0.401 0.236 . . . . . . 0.524 . . . . 0.304 . . . . . . . . . . . . . . . . . . Q 3.160 0.107 0.319 0.063 . 0.219 0.437 . 0.289 0.144 0.421 0.192 . . . . . . 0.495 0.206 . . . 0.232 . 0.036 . . . . . . . . . . . . . . . . 6 0.588 0.106 . . 0.086 0.018 . 0.018 . . . . . 0.018 0.044 . . . . . 0.152 0.032 0.077 . . . . 0.018 . . . . . . . 0.018 . . . . . . 7 0.342 0.061 . . 0.119 . . . . . . . 0.023 . . . . . . . 0.023 . 0.070 . . . . . . 0.023 . . . . . . . . . . 0.023 . r 1.790 0.385 0.406 0.036 0.011 0.171 0.239 . 0.115 0.049 0.079 0.049 0.011 . . 0.011 . 0.020 0.068 0.074 . . . 0.043 . 0.011 . . . . . . . . . . . . 0.011 . . . n 0.277 0.046 0.056 0.044 . 0.018 0.077 . 0.018 . . . . . . . . . . . . . . . 0.018 . . . . . . . . . . . . . . . . . m 0.197 0.032 0.085 . . 0.020 0.020 . . . . . . . . . . . 0.020 . . . . . . 0.020 . . . . . . . . . . . . . . . . c 4.325 0.092 0.331 0.316 0.054 0.054 0.054 . . . 0.092 0.054 0.054 . 0.125 0.125 0.125 0.316 . . . 0.125 . . . 0.092 0.265 0.092 0.226 0.180 0.300 0.180 0.246 0.125 . 0.125 0.092 0.204 . 0.283 . . z 2.317 0.467 0.529 0.058 0.024 0.325 0.485 . . 0.073 0.042 0.073 . 0.024 0.058 . . 0.024 . 0.087 . . . . . . . . . . 0.024 . . . . . . . 0.024 . . . 8 3.333 0.514 . 0.054 0.180 0.266 0.470 0.031 0.193 0.125 0.301 0.204 0.031 . 0.031 . . . 0.204 0.284 0.054 . . 0.140 0.031 . . 0.074 0.031 . 0.054 . . . . . 0.031 . 0.031 . . . J 0.464 0.094 0.037 . 0.165 . . . . . . . . . . . . . . . 0.130 0.037 . . . . . . . . . . . . . . . . . . . . L 0.404 0.071 . 0.028 0.083 . . . . . . . . . 0.028 . . 0.028 . . 0.113 . . 0.028 . . . . . . . . . . . . . . . . 0.028 . F 0.428 0.077 . . 0.105 . . 0.062 . . . . 0.062 . . . . . . . 0.062 . . . . . . . . . . . . . . 0.062 . . . . . . G 0.610 0.112 . . 0.079 . 0.046 . . . . . . . . 0.046 . . . . 0.156 . 0.079 . 0.046 . . . . . . . . . . . . . . . . 0.046 R 0.163 0.029 . . 0.067 . . . . . . . . . . . . . . . 0.067 . . . . . . . . . . . . . . . . . . . . . S 0.217 0.038 . . 0.098 . . . . . . . . . . 0.027 . . . . 0.027 . . . . . . . . . . . . 0.027 . . . . . . . . M 0.627 0.143 . . 0.243 . . . . . . . . . . . . . . . 0.151 . . . . 0.091 . . . . . . . . . . . . . . . . N 0.392 0.072 . . . . . . . . . . . . . . . . . . 0.115 . . . . . . . . 0.068 0.068 . . . . . . . . . 0.068 . X 2.353 0.176 . . 0.278 0.176 0.408 . . . . . . . . . . . . . 0.278 0.176 . . 0.352 0.508 . . . . . . . . . . . . . . . . Y 2.492 . . . 0.197 0.479 0.439 . . . . . . . . . . . . . 0.197 . . . 0.479 0.506 . . . . . . . . . . . . . 0.197 . . A 3.226 0.110 . . . 0.389 0.488 . 0.325 0.237 0.181 0.285 . . . . . . 0.181 . 0.110 0.325 . . 0.181 0.415 . . . . . . . . . . . . . . . . B 2.892 0.164 . . 0.164 0.431 0.528 . 0.164 . 0.164 0.260 . . . . . . 0.164 . . 0.164 . . 0.260 0.431 . . . . . . . . . . . . . . . . i 2.074 . . . 0.438 . . . . . . . . . . . . . . . . . 0.521 . . . . . . . . . . . . . . . 0.415 . 0.285 0.415 j 0.414 0.115 0.299 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . k 0.258 0.051 . . 0.104 0.104 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . w 0.844 0.263 0.445 . . . . . 0.136 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 2.847 0.515 0.251 0.160 0.223 0.213 0.218 0.276 0.122 0.092 0.177 0.096 0.058 0.044 0.186 0.092 0.014 0.063 0.121 0.097 0.180 0.121 0.110 0.043 0.093 0.074 0.062 0.082 0.037 0.050 0.059 0.083 0.025 0.034 0.012 0.010 0.020 0.013 0.020 0.006 0.021 0.016 Previous-symbol entropy: TT a u e o y q U V C Z H K f b P Q 6 7 r n m c z 8 J L F G R S M N X Y A B i j k w ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- 0.515 . 0.454 0.500 0.523 0.348 0.504 0.019 0.448 0.370 0.522 0.412 0.450 0.530 0.174 0.311 0.436 0.524 0.523 0.431 0.415 . . 0.530 0.316 0.471 0.065 0.186 0.105 0.394 . 0.185 0.410 0.297 0.451 0.439 0.181 0.490 0.110 . . . a 0.251 0.357 0.019 0.078 0.392 0.023 0.016 0.025 0.066 0.125 0.063 0.141 0.175 0.053 0.112 0.178 0.236 . 0.077 0.130 0.398 0.429 0.505 0.180 0.073 0.092 . 0.140 0.105 0.234 0.188 0.185 0.312 0.221 0.176 0.439 0.181 . 0.500 . 0.531 0.503 u 0.160 0.234 0.007 . 0.312 . . 0.012 . . . . 0.040 . 0.034 0.059 . . . . 0.342 0.340 0.459 0.054 . 0.054 0.037 0.028 0.062 0.046 . 0.048 0.091 . . . . . 0.285 . 0.401 0.445 e 0.223 0.299 0.251 0.209 0.015 0.209 0.186 0.006 0.365 0.259 0.244 0.347 0.252 . 0.170 0.074 0.236 0.126 0.431 0.360 0.068 . . 0.054 0.156 0.125 0.065 0.175 . 0.156 0.203 0.205 . 0.155 0.278 . 0.237 . . . 0.172 . o 0.213 0.030 0.058 0.131 . 0.009 . 0.033 0.018 0.074 0.050 0.041 0.469 0.523 0.446 0.530 0.464 0.468 0.044 0.023 0.527 0.077 0.159 0.382 0.112 0.092 0.238 0.442 0.379 0.530 0.325 0.400 0.528 0.509 0.520 0.439 0.438 0.490 0.389 . 0.345 0.220 y 0.218 0.303 0.215 0.232 0.033 0.178 0.094 0.006 0.262 0.299 0.201 0.270 0.252 0.222 0.304 0.137 0.358 0.144 0.321 0.227 0.036 . . 0.054 0.087 0.054 0.165 0.313 . 0.133 0.400 0.311 . 0.155 0.176 . 0.285 0.260 . . . . q 0.276 0.020 0.054 0.049 0.423 0.009 0.016 0.006 0.032 . 0.098 0.023 0.530 0.452 0.476 0.530 0.436 0.361 0.056 0.041 0.101 0.018 . 0.470 0.042 0.092 0.482 0.467 0.275 0.523 0.511 0.476 0.410 0.520 0.505 0.479 0.525 0.464 . . . . U 0.122 0.002 0.436 . . 0.079 0.210 . . . . . 0.040 . 0.121 0.125 0.236 0.232 . . 0.020 . . . 0.202 0.324 0.458 0.099 0.497 0.108 0.203 0.126 0.151 0.115 0.278 . 0.285 0.260 . . . . V 0.092 0.002 0.356 . . 0.096 0.177 0.006 . . . . 0.117 0.053 0.107 0.137 0.147 0.107 . . . 0.018 . . 0.058 0.266 0.390 0.048 0.379 0.046 0.153 0.112 0.243 0.068 0.176 . 0.110 . . . . . C 0.177 0.005 0.387 0.501 0.067 0.088 0.153 . 0.044 . 0.475 0.056 . 0.053 0.166 0.213 0.236 0.206 . . 0.049 . . . 0.219 0.154 0.516 0.113 0.524 0.079 0.218 0.126 0.201 0.115 . 0.197 0.110 . . 0.299 . . Z 0.096 0.007 0.169 0.221 0.033 0.070 0.120 . 0.032 . 0.500 . . 0.123 0.034 0.059 . . 0.018 . 0.011 0.018 . . 0.042 0.054 0.037 . . . . 0.027 0.091 . . . 0.110 . . . . . H 0.058 0.019 . 0.013 0.062 0.126 0.204 . 0.226 0.158 0.011 0.084 . . . . . . 0.211 0.160 . . 0.020 . . 0.031 . . . . . . . . . . . . 0.181 . . 0.136 K 0.044 0.007 . 0.023 0.022 0.079 0.177 . 0.143 0.125 . 0.097 . . . . . . 0.199 0.150 . . . . 0.024 0.031 . . . . . 0.027 . . . . . . 0.285 . . . f 0.186 0.133 0.013 0.049 0.407 0.023 0.009 . . . 0.011 . . . . . . 0.036 . . 0.354 0.473 0.530 0.154 0.058 . . . . 0.046 . . . . . . . . 0.415 . 0.461 0.445 b 0.092 0.078 . 0.023 0.275 0.009 . 0.006 . . . . . 0.053 . 0.024 . . . . 0.263 0.389 0.301 0.054 0.058 0.031 . . . . . . . . . . . . 0.181 . 0.172 0.285 P 0.014 0.002 0.013 . . . 0.022 . 0.055 . 0.044 0.041 . . . . . . 0.152 . . . . 0.125 . . . . . . . . . . . . . . . . . . Q 0.063 0.007 0.086 0.023 . 0.066 0.159 . 0.165 0.101 0.186 0.131 . . . . . . 0.340 0.140 . . . 0.316 . 0.031 . . . . . . . . . . . . . . . . 6 0.121 0.273 . . 0.043 0.009 . 0.006 . . . . . 0.053 0.027 . . . . . 0.101 0.032 0.085 . . . . 0.028 . . . . . . . 0.197 . . . . . . 7 0.097 0.232 . . 0.048 . . . . . . . 0.040 . . . . . . . 0.011 . 0.062 . . . . . . 0.046 . . . . . . . . . . 0.104 . r 0.180 0.316 0.311 0.041 0.009 0.143 0.199 . 0.171 0.101 0.081 0.097 0.040 . . 0.024 . 0.063 0.105 0.140 . . . 0.180 . 0.031 . . . . . . . . . . . . 0.110 . . . n 0.121 0.281 0.024 0.032 . 0.009 0.040 . 0.018 . . . . . . . . . . . . . . . 0.024 . . . . . . . . . . . . . . . . . m 0.110 0.261 0.033 . . 0.009 0.009 . . . . . . . . . . . 0.018 . . . . . . 0.031 . . . . . . . . . . . . . . . . c 0.043 0.004 0.061 0.097 0.009 0.009 0.009 . . . 0.021 0.023 0.040 . 0.027 0.059 0.304 0.232 . . . 0.044 . . . 0.054 0.197 0.048 0.253 0.156 0.232 0.098 0.366 0.155 . 0.383 0.181 0.464 . 0.219 . . z 0.093 0.073 0.330 0.032 0.009 0.153 0.267 . . 0.074 0.021 0.071 . 0.053 0.027 . . 0.036 . 0.084 . . . . . . . . . . 0.039 . . . . . . . 0.110 . . . 8 0.074 0.073 . 0.023 0.058 0.096 0.207 0.006 0.120 0.101 0.137 0.161 0.040 . 0.011 . . . 0.130 0.227 0.020 . . 0.226 0.024 . . 0.067 0.062 . 0.067 . . . . . 0.110 . 0.110 . . . J 0.062 0.152 0.007 . 0.043 . . . . . . . . . . . . . . . 0.043 0.018 . . . . . . . . . . . . . . . . . . . . L 0.082 0.199 . 0.013 0.028 . . . . . . . . . 0.011 . . 0.036 . . 0.049 . . 0.054 . . . . . . . . . . . . . . . . 0.104 . F 0.037 0.096 . . 0.015 . . 0.006 . . . . 0.040 . . . . . . . 0.011 . . . . . . . . . . . . . . 0.197 . . . . . . G 0.050 0.125 . . 0.015 . 0.009 . . . . . . . . 0.024 . . . . 0.043 . 0.035 . 0.024 . . . . . . . . . . . . . . . . 0.136 R 0.059 0.152 . . 0.015 . . . . . . . . . . . . . . . 0.020 . . . . . . . . . . . . . . . . . . . . . S 0.083 0.204 . . 0.033 . . . . . . . . . . 0.024 . . . . 0.011 . . . . . . . . . . . . 0.068 . . . . . . . . M 0.025 0.063 . . 0.028 . . . . . . . . . . . . . . . 0.020 . . . . 0.031 . . . . . . . . . . . . . . . . N 0.034 0.088 . . . . . . . . . . . . . . . . . . 0.020 . . . . . . . . 0.046 0.039 . . . . . . . . . 0.104 . X 0.012 0.002 . . 0.015 0.009 0.028 . . . . . . . . . . . . . 0.020 0.018 . . 0.058 0.216 . . . . . . . . . . . . . . . . Y 0.010 . . . 0.009 0.035 0.028 . . . . . . . . . . . . . 0.011 . . . 0.087 0.125 . . . . . . . . . . . . . 0.299 . . A 0.020 0.002 . . . 0.046 0.069 . 0.066 0.059 0.021 0.071 . . . . . . 0.032 . 0.011 0.067 . . 0.042 0.154 . . . . . . . . . . . . . . . . B 0.013 0.002 . . 0.009 0.035 0.059 . 0.018 . 0.011 0.041 . . . . . . 0.018 . . 0.018 . . 0.042 0.109 . . . . . . . . . . . . . . . . i 0.020 . . . 0.053 . . . . . . . . . . . . . . . . . 0.223 . . . . . . . . . . . . . . . 0.415 . 0.272 0.469 j 0.006 0.016 0.007 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . k 0.021 0.058 . . 0.009 0.009 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . w 0.016 0.037 0.037 . . . . . 0.018 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 2.847 4.213 3.329 2.290 3.011 1.974 2.973 0.140 2.266 1.845 2.697 2.106 2.523 2.166 2.247 2.510 3.089 2.572 2.674 2.112 2.975 1.959 2.378 2.830 1.748 2.653 2.651 2.154 2.639 2.541 2.580 2.328 2.800 2.379 2.560 2.769 2.751 2.429 3.090 0.817 2.664 2.637 The entropy H2 is still a bit too low, almost surely because of the initial/final constraints, So we get this alphabet, with approximate frogguy and FSG correpondences pok count jsaz guy2 FSG --- ----- --------- ------------------- ------------------ q 1668 qo *4o *4O a 1435 ci a, 9* A, G* e 1195 ix x+ E+ y 1151 oix +ox+ +OE+ o 1118 o +o +O f 910 ljci lpa DA r 874 is 2* R* C 849 cc -et -T u 737 cgci +8a, +89* +8A, +8G* U 505 ccc +etc, +cet, +ccc +TC, +CT, +CCC 6 498 ccccgci +etc89*, +cet89* +TC8G*, +CT8G* n 498 iiu iv* N* m 439 iiiu iiv* M* 7 372 zcccgci +e'tc89*, +set89* +SC8G*, +2T8G* Z 369 zc +e't, +sc +S, +2C z 354 z =s =2 V 348 zcc +e'tc, +set, +scc +SC, +2T, +2CC b 347 qjci qpa, qp9* HA, HG* S 302 ljcccgci lpet89* DT8G* L 299 ljccgci lpc89* DC8G* 8 261 cg +8 +8 Q 215 qg +dj +P J 207 ljcci lpc9* DCG* R 197 ljccci lpet9= DTG= H 190 lj lp D G 160 qjccgci qpc89* HC8G* K 134 qj -qp -H c 131 c -c -C F 110 qjci qp9* HG* N 97 qjcccgci qpet89* HT8G* M 67 qjccci qpet9* HTG* k 56 ij ig* K* A 52 ljc lpc DC i 52 i i I w 39 iis i2 IR P 35 lg fj F B 30 qjc qpc HC X 27 ljcc lpet, lpcc DT, DCC Y 23 qjcc qpet, qpcc HT, HCC j 12 j ? ? * = always space. = = space 3/4 of the time. + = space half the time. - = space 1/4 of the time. 97-09-12 stolfi =============== Dennis "Ixohoxi" sent me a sample of latin text. Let's see if I can lower its entropy to that of Voynichese, with the substitutions above: cat latn.txt \ | count-digraph-freqs \ -v showentropy=1 \ -v chars=' iaueonrpclgbshdtmfvxzq' Digraph counts: TT i a u e o n r p c l g b s h d t m f v x z q ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- 1318 . 122 111 17 212 17 47 77 54 65 12 9 25 153 13 99 63 63 56 51 . 1 51 i 815 107 8 66 49 27 36 121 12 13 35 36 5 25 73 8 22 115 20 3 19 12 . 3 a 570 50 11 1 19 37 . 51 16 5 24 44 8 34 29 9 62 67 60 . 40 . . 3 u 513 11 49 33 12 46 27 38 26 15 8 25 1 4 84 . 11 22 93 1 1 6 . . e 888 152 28 7 27 5 12 61 138 4 18 19 52 11 66 . 26 172 56 3 1 27 . 3 o 422 70 7 4 . 1 . 88 32 13 28 15 2 5 49 1 12 8 71 . 4 2 . 10 n 443 81 61 38 41 41 32 4 . 4 8 . 15 . 24 . 17 63 . 4 5 2 . 3 r 380 45 37 58 24 106 30 6 12 . 4 . 9 1 7 1 14 4 8 2 11 . 1 . p 118 . 12 7 8 27 20 . 20 . . 5 . . 7 9 . 3 . . . . . . c 212 23 37 22 38 52 19 . 2 . 7 3 . . . 4 . 5 . . . . . . l 174 10 52 21 9 17 39 . . . 3 12 . . . . 1 9 . . 1 . . . g 103 2 27 3 7 22 12 17 9 . . 2 2 . . . . . . . . . . . b 108 7 26 19 23 19 7 1 1 . . 1 . . 3 . . . 1 . . . . . s 515 219 24 72 46 46 15 . 9 7 12 . . . 15 . . 45 . . . . . 5 h 82 3 13 21 3 25 7 . 5 . . . . . 5 . . . . . . . . . d 267 38 66 24 11 42 78 . 1 1 . . . . . 3 3 . . . . . . . t 579 288 53 32 58 73 9 . 17 . . . . . . 34 . . . . . . . 15 m 374 189 58 12 9 41 43 9 . 2 . . . 3 . . . 1 2 . . . . 5 f 70 . 36 8 9 9 4 . 3 . . . . . . . . . . 1 . . . . v 133 . 73 11 5 30 14 . . . . . . . . . . . . . . . . . x 49 23 15 . . 9 . . . . . . . . . . . 2 . . . . . . z 2 . . . . 1 1 . . . . . . . . . . . . . . . . . q 98 . . . 98 . . . . . . . . . . . . . . . . . . . ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 8233 1318 815 570 513 888 422 443 380 118 212 174 103 108 515 82 267 579 374 70 133 49 2 98 Next-symbol probability (× 99): TT i a u e o n r p c l g b s h d t m f v x z q -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- 99 . 9 8 1 16 1 4 6 4 5 1 1 2 11 1 7 5 5 4 4 . . 4 i 99 13 1 8 6 3 4 15 1 2 4 4 1 3 9 1 3 14 2 . 2 1 . . a 99 9 2 . 3 6 . 9 3 1 4 8 1 6 5 2 11 12 10 . 7 . . 1 u 99 2 9 6 2 9 5 7 5 3 2 5 . 1 16 . 2 4 18 . . 1 . . e 99 17 3 1 3 1 1 7 15 . 2 2 6 1 7 . 3 19 6 . . 3 . . o 99 16 2 1 . . . 21 8 3 7 4 . 1 11 . 3 2 17 . 1 . . 2 n 99 18 14 8 9 9 7 1 . 1 2 . 3 . 5 . 4 14 . 1 1 . . 1 r 99 12 10 15 6 28 8 2 3 . 1 . 2 . 2 . 4 1 2 1 3 . . . p 99 . 10 6 7 23 17 . 17 . . 4 . . 6 8 . 3 . . . . . . c 99 11 17 10 18 24 9 . 1 . 3 1 . . . 2 . 2 . . . . . . l 99 6 30 12 5 10 22 . . . 2 7 . . . . 1 5 . . 1 . . . g 99 2 26 3 7 21 12 16 9 . . 2 2 . . . . . . . . . . . b 99 6 24 17 21 17 6 1 1 . . 1 . . 3 . . . 1 . . . . . s 99 42 5 14 9 9 3 . 2 1 2 . . . 3 . . 9 . . . . . 1 h 99 4 16 25 4 30 8 . 6 . . . . . 6 . . . . . . . . . d 99 14 24 9 4 16 29 . . . . . . . . 1 1 . . . . . . . t 99 49 9 5 10 12 2 . 3 . . . . . . 6 . . . . . . . 3 m 99 50 15 3 2 11 11 2 . 1 . . . 1 . . . . 1 . . . . 1 f 99 . 51 11 13 13 6 . 4 . . . . . . . . . . 1 . . . . v 99 . 54 8 4 22 10 . . . . . . . . . . . . . . . . . x 99 46 30 . . 18 . . . . . . . . . . . 4 . . . . . . z 99 . . . . 50 50 . . . . . . . . . . . . . . . . . q 99 . . . 99 . . . . . . . . . . . . . . . . . . . -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- TOT 99 16 10 7 6 11 5 5 5 1 3 2 1 1 6 1 3 7 4 1 2 1 0 1 Previous-symbol probability (× 99): TT i a u e o n r p c l g b s h d t m f v x z q -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- 16 . 15 19 3 24 4 11 20 45 30 7 9 23 29 16 37 11 17 79 38 . 50 52 i 10 8 1 11 9 3 8 27 3 11 16 20 5 23 14 10 8 20 5 4 14 24 . 3 a 7 4 1 . 4 4 . 11 4 4 11 25 8 31 6 11 23 11 16 . 30 . . 3 u 6 1 6 6 2 5 6 8 7 13 4 14 1 4 16 . 4 4 25 1 1 12 . . e 11 11 3 1 5 1 3 14 36 3 8 11 50 10 13 . 10 29 15 4 1 55 . 3 o 5 5 1 1 . . . 20 8 11 13 9 2 5 9 1 4 1 19 . 3 4 . 10 n 5 6 7 7 8 5 8 1 . 3 4 . 14 . 5 . 6 11 . 6 4 4 . 3 r 5 3 4 10 5 12 7 1 3 . 2 . 9 1 1 1 5 1 2 3 8 . 50 . p 1 . 1 1 2 3 5 . 5 . . 3 . . 1 11 . 1 . . . . . . c 3 2 4 4 7 6 4 . 1 . 3 2 . . . 5 . 1 . . . . . . l 2 1 6 4 2 2 9 . . . 1 7 . . . . . 2 . . 1 . . . g 1 . 3 1 1 2 3 4 2 . . 1 2 . . . . . . . . . . . b 1 1 3 3 4 2 2 . . . . 1 . . 1 . . . . . . . . . s 6 16 3 13 9 5 4 . 2 6 6 . . . 3 . . 8 . . . . . 5 h 1 . 2 4 1 3 2 . 1 . . . . . 1 . . . . . . . . . d 3 3 8 4 2 5 18 . . 1 . . . . . 4 1 . . . . . . . t 7 22 6 6 11 8 2 . 4 . . . . . . 41 . . . . . . . 15 m 4 14 7 2 2 5 10 2 . 2 . . . 3 . . . . 1 . . . . 5 f 1 . 4 1 2 1 1 . 1 . . . . . . . . . . 1 . . . . v 2 . 9 2 1 3 3 . . . . . . . . . . . . . . . . . x 1 2 2 . . 1 . . . . . . . . . . . . . . . . . . z 0 . . . . . . . . . . . . . . . . . . . . . . . q 1 . . . 19 . . . . . . . . . . . . . . . . . . . -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- TOT 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 Symbol entropy: 3.996 Next-symbol entropy: count ntrpy i a u e o n r p c l g b s h d t m f v x z q ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- 1318 3.928 . 0.318 0.301 0.081 0.424 0.081 0.172 0.239 0.189 0.214 0.062 0.049 0.109 0.361 0.066 0.281 0.210 0.210 0.194 0.182 . 0.008 0.182 i 815 3.859 0.385 0.065 0.294 0.244 0.163 0.199 0.409 0.090 0.095 0.195 0.199 0.045 0.154 0.312 0.065 0.141 0.399 0.131 0.030 0.126 0.090 . 0.030 a 570 3.852 0.308 0.110 0.016 0.164 0.256 . 0.312 0.145 0.060 0.192 0.285 0.086 0.243 0.219 0.094 0.348 0.363 0.342 . 0.269 . . 0.040 u 513 3.681 0.119 0.324 0.255 0.127 0.312 0.224 0.278 0.218 0.149 0.094 0.212 0.018 0.055 0.427 . 0.119 0.195 0.447 0.018 0.018 0.075 . . e 888 3.554 0.436 0.157 0.055 0.153 0.042 0.084 0.265 0.417 0.035 0.114 0.119 0.240 0.078 0.279 . 0.149 0.459 0.251 0.028 0.011 0.153 . 0.028 o 422 3.361 0.430 0.098 0.064 . 0.021 . 0.472 0.282 0.155 0.260 0.171 0.037 0.076 0.361 0.021 0.146 0.108 0.433 . 0.064 0.037 . 0.128 n 443 3.475 0.448 0.394 0.304 0.318 0.318 0.274 0.061 . 0.061 0.105 . 0.165 . 0.228 . 0.181 0.400 . 0.061 0.073 0.035 . 0.049 r 380 3.333 0.365 0.327 0.414 0.252 0.514 0.289 0.094 0.157 . 0.069 . 0.128 0.023 0.106 0.023 0.175 0.069 0.117 0.040 0.148 . 0.023 . p 118 3.048 . 0.335 0.242 0.263 0.487 0.434 . 0.434 . . 0.193 . . 0.242 0.283 . 0.135 . . . . . . c 212 2.929 0.348 0.440 0.339 0.445 0.497 0.312 . 0.063 . 0.162 0.087 . . . 0.108 . 0.127 . . . . . . l 174 2.832 0.237 0.521 0.368 0.221 0.328 0.484 . . . 0.101 0.266 . . . . 0.043 0.221 . . 0.043 . . . g 103 2.823 0.110 0.506 0.149 0.264 0.476 0.361 0.429 0.307 . . 0.110 0.110 . . . . . . . . . . . b 108 2.757 0.256 0.495 0.441 0.475 0.441 0.256 0.063 0.063 . . 0.063 . . 0.144 . . . 0.063 . . . . . s 515 2.732 0.525 0.206 0.397 0.311 0.311 0.149 . 0.102 0.084 0.126 . . . 0.149 . . 0.307 . . . . . 0.065 h 82 2.591 0.175 0.421 0.503 0.175 0.522 0.303 . 0.246 . . . . . 0.246 . . . . . . . . . d 267 2.545 0.400 0.498 0.312 0.190 0.420 0.519 . 0.030 0.030 . . . . . 0.073 0.073 . . . . . . . t 579 2.377 0.501 0.316 0.231 0.333 0.377 0.093 . 0.149 . . . . . . 0.240 . . . . . . . 0.137 m 374 2.284 0.498 0.417 0.159 0.129 0.350 0.359 0.129 . 0.040 . . . 0.056 . . . 0.023 0.040 . . . . 0.083 f 70 2.130 . 0.493 0.358 0.380 0.380 0.236 . 0.195 . . . . . . . . . . 0.088 . . . . v 133 1.777 . 0.475 0.297 0.178 0.485 0.342 . . . . . . . . . . . . . . . . . x 49 1.672 0.512 0.523 . . 0.449 . . . . . . . . . . . 0.188 . . . . . . z 2 1.000 . . . . 0.500 0.500 . . . . . . . . . . . . . . . . . q 98 0.000 . . . . . . . . . . . . . . . . . . . . . . . ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 8233 3.261 0.423 0.330 0.267 0.250 0.347 0.220 0.227 0.205 0.088 0.136 0.118 0.079 0.082 0.250 0.066 0.160 0.269 0.203 0.058 0.096 0.044 0.003 0.076 Previous-symbol entropy: count ntrpy i a u e o n r p c l g b s h d t m f v x z q ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- 1318 0.423 . 0.410 0.460 0.163 0.493 0.187 0.343 0.467 0.516 0.523 0.266 0.307 0.489 0.520 0.421 0.531 0.348 0.433 0.258 0.530 . 0.500 0.490 i 815 0.330 0.294 0.065 0.360 0.324 0.153 0.303 0.511 0.157 0.351 0.429 0.470 0.212 0.489 0.400 0.328 0.297 0.463 0.226 0.195 0.401 0.497 . 0.154 a 570 0.267 0.179 0.084 0.016 0.176 0.191 . 0.359 0.192 0.193 0.356 0.502 0.286 0.525 0.234 0.350 0.489 0.360 0.424 . 0.521 . . 0.154 u 513 0.250 0.058 0.244 0.238 0.127 0.221 0.254 0.304 0.265 0.378 0.178 0.402 0.065 0.176 0.427 . 0.190 0.179 0.499 0.088 0.053 0.371 . . e 888 0.347 0.359 0.167 0.078 0.224 0.042 0.146 0.394 0.531 0.166 0.302 0.349 0.498 0.336 0.380 . 0.327 0.520 0.410 0.195 0.053 0.474 . 0.154 o 422 0.220 0.225 0.059 0.050 . 0.011 . 0.463 0.301 0.351 0.386 0.305 0.110 0.205 0.323 0.078 0.201 0.085 0.455 . 0.152 0.188 . 0.336 n 443 0.227 0.247 0.280 0.260 0.291 0.205 0.282 0.061 . 0.166 0.178 . 0.405 . 0.206 . 0.253 0.348 . 0.236 0.178 0.188 . 0.154 r 380 0.205 0.166 0.203 0.335 0.207 0.366 0.271 0.084 0.157 . 0.108 . 0.307 0.063 0.084 0.078 0.223 0.050 0.119 0.147 0.297 . 0.500 . p 118 0.088 . 0.090 0.078 0.094 0.153 0.208 . 0.224 . . 0.147 . . 0.084 0.350 . 0.039 . . . . . . c 212 0.136 0.102 0.203 0.181 0.278 0.240 0.201 . 0.040 . 0.162 0.101 . . . 0.213 . 0.059 . . . . . . l 174 0.118 0.053 0.253 0.175 0.102 0.109 0.318 . . . 0.087 0.266 . . . . 0.030 0.093 . . 0.053 . . . g 103 0.079 0.014 0.163 0.040 0.085 0.132 0.146 0.181 0.128 . . 0.074 0.110 . . . . . . . . . . . b 108 0.082 0.040 0.159 0.164 0.201 0.119 0.098 0.020 0.023 . . 0.043 . . 0.043 . . . 0.023 . . . . . s 515 0.250 0.430 0.150 0.377 0.312 0.221 0.171 . 0.128 0.242 0.235 . . . 0.149 . . 0.286 . . . . . 0.219 h 82 0.066 0.020 0.095 0.175 0.043 0.145 0.098 . 0.082 . . . . . 0.065 . . . . . . . . . d 267 0.160 0.148 0.294 0.192 0.119 0.208 0.450 . 0.023 0.058 . . . . . 0.175 0.073 . . . . . . . t 579 0.269 0.479 0.256 0.233 0.356 0.296 0.118 . 0.201 . . . . . . 0.527 . . . . . . . 0.414 m 374 0.203 0.402 0.271 0.117 0.102 0.205 0.336 0.114 . 0.100 . . . 0.144 . . . 0.016 0.040 . . . . 0.219 f 70 0.058 . 0.199 0.086 0.102 0.067 0.064 . 0.055 . . . . . . . . . . 0.088 . . . . v 133 0.096 . 0.312 0.110 0.065 0.165 0.163 . . . . . . . . . . . . . . . . . x 49 0.044 0.102 0.106 . . 0.067 . . . . . . . . . . . 0.028 . . . . . . z 2 0.003 . . . . 0.011 0.021 . . . . . . . . . . . . . . . . . q 98 0.076 . . . 0.456 . . . . . . . . . . . . . . . . . . . ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 8233 3.261 3.319 4.062 3.728 3.826 3.822 3.835 2.835 2.972 2.520 2.944 2.925 2.301 2.425 2.914 2.517 2.613 2.876 2.629 1.205 2.239 1.719 1.000 2.295 So, let's try these substitutions: pok count latn count jsaz guy2 FSG --- ----- ---- ----- --------- ------------------- ------------------ q 1668 e 888 qo *4o *4O a 1435 i 815 ci a+ A+ e 1195 t 579 ix x+ E+ y 1151 a 570 oix +ox+ +OE+ o 1118 s 515 o +o +O f 910 u 513 ljci lpa DA r 874 n 443 is 2* R* C 849 o 422 cc -et -T u 737 r 380 cgci +8a+ +8A+ U 505 m 374 ccc +etc +TC 6 498 d 267 ccccgci +etc89* +TC8G* n 498 c 212 iiu iv* N* m 439 l 174 iiiu iiv* M* 7 372 v 133 zcccgci +e'tc89* +SC8G* Z 369 p 118 zc +e't +S z 354 b 108 z =s =2 V 348 g 103 zcc +e'tc +SC b 347 q 98 qjci qpa- HA- S 302 h 82 ljcccgci lpet89* DT8G* L 299 f 70 ljccgci lpc89* DC8G* 8 261 x 49 cg +8 +8 Q 215 z 2 qg +dj +P J 207 ljcci lpc9* DCG* R 197 ljccci lpet9= DTG= H 190 lj lp D G 160 qjccgci qpc89* HC8G* K 134 qj -qp -H c 131 c -c -C F 110 qjci qp9* HG* N 97 qjcccgci qpet89* HT8G* M 67 qjccci qpet9* HTG* k 56 ij ig* K* A 52 ljc lpc DC i 52 i i I w 39 iis i2 IR P 35 lg fj F B 30 qjc qpc HC X 27 ljcc lpet DT Y 23 qjcc qpet HT j 12 j ? ? * = always space. = = space 3/4 of the time. + = space half the time. - = space 1/4 of the time. Final FSG "A" (Guy2 "a") should be replaced by "G" (Guy2 "9") [ Oops! found a bug in this code that would erase 4 bytes in every 8 and replace them by the other 4. Listing and results below are as fixed on 97-09-13. ] In "sed" notation: --- lat2voy ------------------------ #! /n/gnu/bin/sed -f # remove all spaces: s/ //g # Pad line with #s s/$/\#\#\#\#\#\#\#\#/g # Insert dice throws after each character: s/\(.\)\(.\)\(.\)\(.\)\(.\)\(.\)\(.\)\(.\)/\11\22\33\40\52\61\70\83/g # Now do the substitutions. Use J instead of FSG 2 for now: s/e[01]/A/g s/e[23]/A_/g # s/i0/_T/g s/i[123]/T/g # s/t[03]/_SC/g s/t[12]/SC/g # s/a0/OE_/g s/a1/_OE/g s/a2/_OE_/g s/a3/OE/g # s/s[12]/_O/g s/s[03]/O/g # s/u[0-3]/DA/g # s/n[01]/R_/g s/n[23]/R/g # s/o[02]/E_/g s/o[13]/E/g # s/r[03]/8A/g s/r1/_8A/g s/r2/8A_/g # s/m[1]/_TC/g s/m[023]/TC/g # s/d[012]/TC8A_/g s/d[3]/_TC8A_/g # s/c[0-3]/N_/g # s/l[0-3]/M_/g # s/v[123]/SC8A_/g s/v[0]/_SC8A_/g # s/p[123]/S/g s/p[0]/_S/g # s/b[23]/_J/g s/b[01]/J/g # s/g[0-3]/_4O/g # s/q[123]/HA/g s/q0/HA_/g # s/h[0-3]/DT8A_/g # s/f[0-3]/DC8A_/g # s/x[12]/_8/g s/x[03]/8/g # s/z[01]/_P/g s/z[23]/P/g # Remove padding: s/\#[0-3]//g s/\#//g # Replace J by 2: s/J/2/g # Replace underscores by blanks: s/__*/ /g # Reduce isolated letters: s/ M / AM /g s/ N / AN /g s/ A / AR /g s/ 8 / 8G /g s/ E / AE /g s/ T / TCG /g s/ O / OEC /g s/ S / POE /g # Change final A into G: s/A /G /g # Remove line-leading and line-trailing blanks: s/^ *//g s/ *$//g s/A$/G/g ------------------------------------ OK, let's try it: cat latn.txt | lat2voy > pseudo-voynich.fsg ASC8AG 8TC8G OE SC8G TTC8G OARDAA8G OESCDT8G OE 2A2OESCHADAG OE AR SCOESCTO SM DA8AT TCE OTC8G TAO N DATCHG DAAE SA8G T8AG SCDA8ASC8G AR OEC SCT2DAOR AE RN OE AM ADC8G TG 2OE SC TC8G T8A8G DAR SCA8G 4OE AR TOG 8ASC8G TODATHG DG OEG 8AOE TCDAO TC8G ETCTR AE R AE OSC8G E8AG 4O T OETC8G DAM AR ON AR R SCDAM OE TC SC8G TCG 8G 4OTR AR TCG SC OSCG SCN E8AOE TC8G AR 4OG AR SCDC8G ESC8G AR OE SCADATC TC8G AE 8ATCT OE SCHADAG TR OTR DASCDAEASCN OE AM ADC8G OEN TCG OE SC TC8G ETCTR DG TCR AE OSC8ADATC 8AG 4OATC cat pseudo-voynich.fsg \ | count-digraph-freqs \ -v showentropy=1 \ -v chars=' OT98AE4DC2RNMSHPG' Digraph counts: TT O T 8 A E 4 D C 2 R N M S H P G ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- 3296 . 699 780 187 538 70 103 185 . 70 104 15 5 497 29 8 6 O 1194 85 64 61 28 38 671 . 63 . 1 24 18 3 46 15 . 77 T 1538 99 82 57 109 15 36 . 68 752 12 123 40 36 76 7 . 26 8 981 14 . 13 . 229 . . . . . . 1 . 3 2 . 719 A 1446 . 107 224 109 37 179 . 129 . 4 263 87 95 97 5 . 110 E 1064 601 31 96 23 22 . . 16 66 12 75 22 27 46 14 . 13 4 103 . 103 . . . . . . . . . . . . . . . D 665 . . 82 . 391 . . . 70 . . . . . . . 122 C 1600 270 71 163 505 70 58 . 132 . 9 26 22 1 38 25 . 210 2 108 21 12 21 . 9 7 . 25 . . 1 . 1 . . . 11 R 620 471 12 30 2 13 22 . 30 . . 4 7 1 20 1 . 7 N 212 212 . . . . . . . . . . . . . . . . M 174 174 . . . . . . . . . . . . . . . . S 824 6 7 11 18 12 20 . 17 712 . . . 5 1 . . 15 H 98 . . . . 72 . . . . . . . . . . . 26 P 8 . 6 . . . 1 . . . . . . . . . . 1 G 1343 1343 . . . . . . . . . . . . . . . . ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 15274 3296 1194 1538 981 1446 1064 103 665 1600 108 620 212 174 824 98 8 1343 Next-symbol probability (× 99): TT O T 8 A E 4 D C 2 R N M S H P G -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- 99 . 21 23 6 16 2 3 6 . 2 3 . . 15 1 . . O 99 7 5 5 2 3 56 . 5 . . 2 1 . 4 1 . 6 T 99 6 5 4 7 1 2 . 4 48 1 8 3 2 5 . . 2 8 99 1 . 1 . 23 . . . . . . . . . . . 73 A 99 . 7 15 7 3 12 . 9 . . 18 6 7 7 . . 8 E 99 56 3 9 2 2 . . 1 6 1 7 2 3 4 1 . 1 4 99 . 99 . . . . . . . . . . . . . . . D 99 . . 12 . 58 . . . 10 . . . . . . . 18 C 99 17 4 10 31 4 4 . 8 . 1 2 1 . 2 2 . 13 2 99 19 11 19 . 8 6 . 23 . . 1 . 1 . . . 10 R 99 75 2 5 . 2 4 . 5 . . 1 1 . 3 . . 1 N 99 99 . . . . . . . . . . . . . . . . M 99 99 . . . . . . . . . . . . . . . . S 99 1 1 1 2 1 2 . 2 86 . . . 1 . . . 2 H 99 . . . . 73 . . . . . . . . . . . 26 P 99 . 74 . . . 12 . . . . . . . . . . 12 G 99 99 . . . . . . . . . . . . . . . . -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- TOT 99 21 8 10 6 9 7 1 4 10 1 4 1 1 5 1 0 9 Previous-symbol probability (× 99): TT O T 8 A E 4 D C 2 R N M S H P G -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- 21 . 58 50 19 37 7 99 28 . 64 17 7 3 60 29 99 . O 8 3 5 4 3 3 62 . 9 . 1 4 8 2 6 15 . 6 T 10 3 7 4 11 1 3 . 10 47 11 20 19 20 9 7 . 2 8 6 . . 1 . 16 . . . . . . . . . 2 . 53 A 9 . 9 14 11 3 17 . 19 . 4 42 41 54 12 5 . 8 E 7 18 3 6 2 2 . . 2 4 11 12 10 15 6 14 . 1 4 1 . 9 . . . . . . . . . . . . . . . D 4 . . 5 . 27 . . . 4 . . . . . . . 9 C 10 8 6 10 51 5 5 . 20 . 8 4 10 1 5 25 . 15 2 1 1 1 1 . 1 1 . 4 . . . . 1 . . . 1 R 4 14 1 2 . 1 2 . 4 . . 1 3 1 2 1 . 1 N 1 6 . . . . . . . . . . . . . . . . M 1 5 . . . . . . . . . . . . . . . . S 5 . 1 1 2 1 2 . 3 44 . . . 3 . . . 1 H 1 . . . . 5 . . . . . . . . . . . 2 P 0 . . . . . . . . . . . . . . . . . G 9 40 . . . . . . . . . . . . . . . . -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- TOT 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 Symbol entropy: 3.514 Next-symbol entropy: TT O T 8 A E 4 D C 2 R N M S H P G ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- 2.970 . 0.474 0.492 0.235 0.427 0.118 0.156 0.233 . 0.118 0.157 0.035 0.014 0.412 0.060 0.021 0.017 O 2.443 0.271 0.226 0.219 0.127 0.158 0.467 . 0.224 . 0.009 0.113 0.091 0.022 0.181 0.079 . 0.255 T 2.782 0.255 0.225 0.176 0.271 0.065 0.127 . 0.199 0.505 0.055 0.291 0.137 0.127 0.214 0.035 . 0.100 8 1.043 0.087 . 0.083 . 0.490 . . . . . . 0.010 . 0.026 0.018 . 0.329 A 3.341 . 0.278 0.417 0.281 0.135 0.373 . 0.311 . 0.024 0.447 0.244 0.258 0.261 0.028 . 0.283 E 2.451 0.465 0.149 0.313 0.120 0.116 . . 0.091 0.249 0.073 0.270 0.116 0.135 0.196 0.082 . 0.078 4 0.000 . . . . . . . . . . . . . . . . . D 1.614 . . 0.372 . 0.450 . . . 0.342 . . . . . . . 0.449 C 2.998 0.433 0.199 0.336 0.525 0.198 0.173 . 0.297 . 0.042 0.097 0.085 0.007 0.128 0.094 . 0.385 2 2.775 0.459 0.352 0.459 . 0.299 0.256 . 0.489 . . 0.063 . 0.063 . . . 0.336 R 1.531 0.301 0.110 0.211 0.027 0.117 0.171 . 0.211 . . 0.047 0.073 0.015 0.160 0.015 . 0.073 N 0.000 . . . . . . . . . . . . . . . . . M 0.000 . . . . . . . . . . . . . . . . . S 0.992 0.052 0.058 0.083 0.121 0.089 0.130 . 0.116 0.182 . . . 0.045 0.012 . . 0.105 H 0.835 . . . . 0.327 . . . . . . . . . . . 0.508 P 1.061 . 0.311 . . . 0.375 . . . . . . . . . . 0.375 G 0.000 . . . . . . . . . . . . . . . . . ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 2.192 0.477 0.287 0.333 0.254 0.322 0.268 0.049 0.197 0.341 0.051 0.188 0.086 0.074 0.227 0.047 0.006 0.308 Now let's see who are the culprits for the low H2 of FSG-encoded Voynichese: cat bio-m-evt.evt \ | grep ';C>' \ | sed \ -e 's/{[^}]*}//g' \ -e 's/[\!%]//g' \ > .tmp-c-fsg.evt extract-words-from-interlin \ -chars 'COG8EDA4TSHRNM2ZPIKLFG' \ .tmp-c-fsg.evt \ .tmp-c-fsg Digraph counts: TT C O G 8 E D A 4 T S H R N M 2 Z P I K L F ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- 6408 . 24 1362 138 513 364 108 133 1643 759 692 152 132 . . 278 . 99 1 . 2 8 C 4275 7 951 172 837 1895 4 155 55 1 15 9 80 8 . 8 45 . 17 11 . 2 3 O 3889 34 19 4 13 31 1342 1427 3 8 7 9 566 300 7 14 7 . 68 9 7 1 13 G 3763 3510 1 7 . 17 21 71 2 10 20 25 54 14 . . 6 . 1 1 1 . 2 8 2727 73 19 72 2045 2 10 8 417 1 36 38 1 2 . . 1 . . 1 . 1 . E 2347 1085 9 157 106 84 7 270 55 2 306 181 37 13 . . 16 . 11 . 2 . 6 D 2181 14 871 79 169 2 11 . 736 . 69 28 . . . 1 . 198 . 3 . . . A 1961 6 . 5 4 8 550 4 1 . . 1 4 394 471 399 7 . 2 51 42 12 . 4 1665 5 19 1619 3 . . 4 4 . . 1 5 . . . 2 . 2 . . . 1 T 1445 1 1049 49 62 96 13 82 26 . 1 2 39 4 . . 6 . 12 . . . 3 S 1073 4 864 37 27 40 5 45 21 . 3 . 25 1 . . 1 . . . . . . H 966 4 341 58 88 3 3 1 257 . 60 25 . . . . 1 121 . 4 . . . R 911 619 4 82 44 5 1 1 92 . 37 22 1 . . . . . 2 1 . . . N 478 462 . 7 2 3 . . 2 . 1 . . . . . 1 . . . . . . M 422 412 . 2 5 1 . . 1 . 1 . . . . . . . . . . . . 2 372 73 4 114 10 3 1 5 131 . 14 13 2 . . . . . 1 1 . . . Z 344 2 96 10 203 21 . . 9 . 2 . . . . . 1 . . . . . . P 215 4 3 48 6 3 . . 14 . 91 25 . . . . . 21 . . . . . I 152 . . . . . 11 . . . . . . 43 . . . . . 69 4 25 . K 56 54 . 1 . . 1 . . . . . . . . . . . . . . . . L 43 38 . 1 1 . 3 . . . . . . . . . . . . . . . . F 36 1 1 3 . . . . 2 . 23 2 . . . . . 4 . . . . . ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 35729 6408 4275 3889 3763 2727 2347 2181 1961 1665 1445 1073 966 911 478 422 372 344 215 152 56 43 36 Next-symbol probability (× 99): TT C O G 8 E D A 4 T S H R N M 2 Z P I K L F -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- 99 . . 21 2 8 6 2 2 25 12 11 2 2 . . 4 . 2 . . . . C 99 . 22 4 19 44 . 4 1 . . . 2 . . . 1 . . . . . . O 99 1 . . . 1 34 36 . . . . 14 8 . . . . 2 . . . . G 99 92 . . . . 1 2 . . 1 1 1 . . . . . . . . . . 8 99 3 1 3 74 . . . 15 . 1 1 . . . . . . . . . . . E 99 46 . 7 4 4 . 11 2 . 13 8 2 1 . . 1 . . . . . . D 99 1 40 4 8 . . . 33 . 3 1 . . . . . 9 . . . . . A 99 . . . . . 28 . . . . . . 20 24 20 . . . 3 2 1 . 4 99 . 1 96 . . . . . . . . . . . . . . . . . . . T 99 . 72 3 4 7 1 6 2 . . . 3 . . . . . 1 . . . . S 99 . 80 3 2 4 . 4 2 . . . 2 . . . . . . . . . . H 99 . 35 6 9 . . . 26 . 6 3 . . . . . 12 . . . . . R 99 67 . 9 5 1 . . 10 . 4 2 . . . . . . . . . . . N 99 96 . 1 . 1 . . . . . . . . . . . . . . . . . M 99 97 . . 1 . . . . . . . . . . . . . . . . . . 2 99 19 1 30 3 1 . 1 35 . 4 3 1 . . . . . . . . . . Z 99 1 28 3 58 6 . . 3 . 1 . . . . . . . . . . . . P 99 2 1 22 3 1 . . 6 . 42 12 . . . . . 10 . . . . . I 99 . . . . . 7 . . . . . . 28 . . . . . 45 3 16 . K 99 95 . 2 . . 2 . . . . . . . . . . . . . . . . L 99 87 . 2 2 . 7 . . . . . . . . . . . . . . . . F 99 3 3 8 . . . . 6 . 63 6 . . . . . 11 . . . . . -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- TOT 99 18 12 11 10 8 7 6 5 5 4 3 3 3 1 1 1 1 1 0 0 0 0 Previous-symbol probability (× 99): TT C O G 8 E D A 4 T S H R N M 2 Z P I K L F -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- 18 . 1 35 4 19 15 5 7 98 52 64 16 14 . . 74 . 46 1 . 5 22 C 12 . 22 4 22 69 . 7 3 . 1 1 8 1 . 2 12 . 8 7 . 5 8 O 11 1 . . . 1 57 65 . . . 1 58 33 1 3 2 . 31 6 12 2 36 G 10 54 . . . 1 1 3 . 1 1 2 6 2 . . 2 . . 1 2 . 6 8 8 1 . 2 54 . . . 21 . 2 4 . . . . . . . 1 . 2 . E 7 17 . 4 3 3 . 12 3 . 21 17 4 1 . . 4 . 5 . 4 . 17 D 6 . 20 2 4 . . . 37 . 5 3 . . . . . 57 . 2 . . . A 5 . . . . . 23 . . . . . . 43 98 94 2 . 1 33 74 28 . 4 5 . . 41 . . . . . . . . 1 . . . 1 . 1 . . . 3 T 4 . 24 1 2 3 1 4 1 . . . 4 . . . 2 . 6 . . . 8 S 3 . 20 1 1 1 . 2 1 . . . 3 . . . . . . . . . . H 3 . 8 1 2 . . . 13 . 4 2 . . . . . 35 . 3 . . . R 3 10 . 2 1 . . . 5 . 3 2 . . . . . . 1 1 . . . N 1 7 . . . . . . . . . . . . . . . . . . . . . M 1 6 . . . . . . . . . . . . . . . . . . . . . 2 1 1 . 3 . . . . 7 . 1 1 . . . . . . . 1 . . . Z 1 . 2 . 5 1 . . . . . . . . . . . . . . . . . P 1 . . 1 . . . . 1 . 6 2 . . . . . 6 . . . . . I 0 . . . . . . . . . . . . 5 . . . . . 45 7 58 . K 0 1 . . . . . . . . . . . . . . . . . . . . . L 0 1 . . . . . . . . . . . . . . . . . . . . . F 0 . . . . . . . . . 2 . . . . . . 1 . . . . . -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- TOT 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 99 Next-symbol entropy: TT C O G 8 E D A 4 T S H R N M 2 Z P I K L F ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- 3.131 . 0.030 0.475 0.119 0.292 0.235 0.099 0.116 0.503 0.365 0.347 0.128 0.115 . . 0.196 . 0.093 0.002 . 0.004 0.012 C 2.256 0.015 0.482 0.187 0.461 0.520 0.009 0.174 0.081 0.003 0.029 0.019 0.107 0.017 . 0.017 0.069 . 0.032 0.022 . 0.005 0.007 O 2.235 0.060 0.038 0.010 0.027 0.056 0.530 0.531 0.008 0.018 0.016 0.020 0.405 0.285 0.016 0.029 0.016 . 0.102 0.020 0.016 0.003 0.027 G 0.563 0.094 0.003 0.017 . 0.035 0.042 0.108 0.006 0.023 0.040 0.048 0.088 0.030 . . 0.015 . 0.003 0.003 0.003 . 0.006 8 1.313 0.140 0.050 0.138 0.311 0.008 0.030 0.025 0.414 0.004 0.082 0.086 0.004 0.008 . . 0.004 . . 0.004 . 0.004 . E 2.620 0.515 0.031 0.261 0.202 0.172 0.025 0.359 0.127 0.009 0.383 0.285 0.094 0.042 . . 0.049 . 0.036 . 0.009 . 0.022 D 2.182 0.047 0.529 0.173 0.286 0.009 0.038 . 0.529 . 0.158 0.081 . . . 0.005 . 0.314 . 0.013 . . . A 2.427 0.026 . 0.022 0.018 0.032 0.514 0.018 0.006 . . 0.006 0.018 0.465 0.494 0.467 0.029 . 0.010 0.137 0.119 0.045 . 4 0.258 0.025 0.074 0.039 0.016 . . 0.021 0.021 . . 0.006 0.025 . . . 0.012 . 0.012 . . . 0.006 T 1.657 0.007 0.335 0.166 0.195 0.260 0.061 0.235 0.104 . 0.007 0.013 0.141 0.024 . . 0.033 . 0.057 . . . 0.019 S 1.268 0.030 0.252 0.168 0.134 0.177 0.036 0.192 0.111 . 0.024 . 0.126 0.009 . . 0.009 . . . . . . H 2.496 0.033 0.530 0.244 0.315 0.026 0.026 0.010 0.508 . 0.249 0.136 . . . . 0.010 0.375 . 0.033 . . . R 1.692 0.379 0.034 0.313 0.211 0.041 0.011 0.011 0.334 . 0.188 0.130 0.011 . . . . . 0.019 0.011 . . . N 0.286 0.047 . 0.089 0.033 0.046 . . 0.033 . 0.019 . . . . . 0.019 . . . . . . M 0.208 0.034 . 0.037 0.076 0.021 . . 0.021 . 0.021 . . . . . . . . . . . . 2 2.321 0.461 0.070 0.523 0.140 0.056 0.023 0.084 0.530 . 0.178 0.169 0.041 . . . . . 0.023 0.023 . . . Z 1.606 0.043 0.514 0.148 0.449 0.246 . . 0.138 . 0.043 . . . . . 0.024 . . . . . . P 2.376 0.107 0.086 0.483 0.144 0.086 . . 0.257 . 0.525 0.361 . . . . . 0.328 . . . . . I 1.873 . . . . . 0.274 . . . . . . 0.515 . . . . . 0.517 0.138 0.428 . K 0.258 0.051 . 0.104 . . 0.104 . . . . . . . . . . . . . . . . L 0.678 0.158 . 0.126 0.126 . 0.268 . . . . . . . . . . . . . . . . F 1.814 0.144 0.144 0.299 . . . . 0.232 . 0.413 0.232 . . . . . 0.352 . . . . . ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 1.972 0.445 0.367 0.348 0.342 0.283 0.258 0.246 0.230 0.206 0.187 0.152 0.141 0.135 0.083 0.076 0.069 0.064 0.044 0.034 0.015 0.012 0.010 Previous-symbol entropy: TT C O G 8 E D A 4 T S H R N M 2 Z P I K L F ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- 0.445 . 0.042 0.530 0.175 0.453 0.417 0.215 0.263 0.019 0.488 0.408 0.420 0.404 . . 0.314 . 0.515 0.048 . 0.206 0.482 C 0.367 0.011 0.482 0.199 0.482 0.365 0.016 0.271 0.145 0.006 0.068 0.058 0.298 0.060 . 0.108 0.369 . 0.289 0.274 . 0.206 0.299 O 0.348 0.040 0.035 0.010 0.028 0.073 0.461 0.400 0.014 0.037 0.037 0.058 0.452 0.528 0.089 0.163 0.108 . 0.525 0.241 0.375 0.126 0.531 G 0.342 0.476 0.003 0.016 . 0.046 0.061 0.161 0.010 0.044 0.085 0.126 0.233 0.093 . . 0.096 . 0.036 0.048 0.104 . 0.232 8 0.283 0.074 0.035 0.107 0.478 0.008 0.034 0.030 0.475 0.006 0.133 0.171 0.010 0.019 . . 0.023 . . 0.048 . 0.126 . E 0.258 0.434 0.019 0.187 0.145 0.155 0.025 0.373 0.145 0.012 0.474 0.433 0.180 0.087 . . 0.195 . 0.219 . 0.172 . 0.431 D 0.246 0.019 0.468 0.114 0.201 0.008 0.036 . 0.531 . 0.210 0.137 . . . 0.021 . 0.459 . 0.112 . . . A 0.230 0.009 . 0.012 0.010 0.025 0.491 0.017 0.006 . . 0.009 0.033 0.523 0.021 0.076 0.108 . 0.063 0.529 0.311 0.514 . 4 0.206 0.008 0.035 0.526 0.008 . . 0.017 0.018 . . 0.009 0.039 . . . 0.041 . 0.063 . . . 0.144 T 0.187 0.002 0.497 0.080 0.098 0.170 0.042 0.178 0.083 . 0.007 0.017 0.187 0.034 . . 0.096 . 0.232 . . . 0.299 S 0.152 0.007 0.466 0.064 0.051 0.089 0.019 0.116 0.070 . 0.019 . 0.136 0.011 . . 0.023 . . . . . . H 0.141 0.007 0.291 0.090 0.127 0.011 0.012 0.005 0.384 . 0.191 0.126 . . . . 0.023 0.530 . 0.138 . . . R 0.135 0.326 0.009 0.117 0.075 0.017 0.005 0.005 0.207 . 0.135 0.115 0.010 . . . . . 0.063 0.048 . . . N 0.083 0.274 . 0.016 0.006 0.011 . . 0.010 . 0.007 . . . . . 0.023 . . . . . . M 0.076 0.255 . 0.006 0.013 0.004 . . 0.006 . 0.007 . . . . . . . . . . . . 2 0.069 0.074 0.009 0.149 0.023 0.011 0.005 0.020 0.261 . 0.065 0.077 0.018 . . . . . 0.036 0.048 . . . Z 0.064 0.004 0.123 0.022 0.227 0.054 . . 0.036 . 0.013 . . . . . 0.023 . . . . . . P 0.044 0.007 0.007 0.078 0.015 0.011 . . 0.051 . 0.251 0.126 . . . . . 0.246 . . . . . I 0.034 . . . . . 0.036 . . . . . . 0.208 . . . . . 0.517 0.272 0.455 . K 0.015 0.058 . 0.003 . . 0.005 . . . . . . . . . . . . . . . . L 0.012 0.044 . 0.003 0.003 . 0.012 . . . . . . . . . . . . . . . . F 0.010 0.002 0.003 0.008 . . . . 0.010 . 0.095 0.017 . . . . . 0.075 . . . . . ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 1.972 2.127 2.524 2.339 2.165 1.510 1.676 1.807 2.724 0.125 2.286 1.889 2.017 1.967 0.110 0.369 1.441 1.310 2.042 2.050 1.234 1.633 2.416 Dennis complained that H1 of pseudo-voynich.fsg was too low. Let's try to fix it: latn count jsaz guy2 FSG ---- ----- --------- ------------------- ------------------ e 888 ci a+ A+ i 815 cc -et -T t 579 zcc +e'tc +SC a 570 oix +ox+ +OE+ s 515 o +o +O u 513 ljci lpa DA n 443 is 2+ R+ o 422 ix x+ E+ r 380 cgci +8a,8a+ +8A,8A+ m 374 ccc -etc -TC d 267 ccccgci -etc8a* -TC8A* c 212 iiu iv* N* l 174 iiiu iiv* M* v 133 zcccgci -e'tc8a* -SC8A* p 118 zc -e't -S b 108 z +s +2 g 103 qo *4o *4O q 98 qjci qpa- HA- h 82 ljcccgci lpet8a* DT8A* f 70 ljccgci lpc8a* DC8A* x 49 cg +8 +8 z 2 qg +dj +P ljcci lpca* DCA* ljccci lpeta= DTA= lj lp D qjccgci qpc8a* HC8A* qj -qp -H c -c -C qjci qpa* HA* qjcccgci qpet8a* HT8A* qjccci qpeta* HTA* ij ig* K* ljc lpc DC i i I iis i2 IR lg fj F qjc qpc HC ljcc lpet DT qjcc qpet HT j ? ? * = always space. = = space 3/4 of the time. + = space half the time. - = space 1/4 of the time. [ See fixed lat2voy script and results above. ]