Shall we try computing digraphs? First, ignoring word and line breaks:

  --- glp2let ------------------------
  #! /n/gnu/bin/sed -f
  # Maps the most popular glyphs to single letters

  # Discard ordinary blanks:
  s/ //g
  # Map line, paragraph, and word breaks to blanks:
  s@(_)@ @g
  s@(//)@ @g
  s@(=)@ @g
  # Collapse runs of blanks into one:
  s/  */ /g
  # Any other garbage goes to "?":
  s/([^)]*[^)a-z][^)]*)/\?/g
  # Now the glyphs:
  s/(qo)/W/g
  s/(o)/Y/g
  s/(t)/T/g
  s/(tc)/D/g
  s/(tcc)/J/g
  s/(s)/S/g
  s/(sc)/Z/g
  s/(scc)/X/g
  s/(b)/A/g
  s/(e)/U/g
  s/(r)/I/g
  s/(z)/O/g
  s/(g)/E/g
  s/(bg)/Æ/g
  s/([dh])/C/g
  s/([dh]c)/G/g
  s/([dh]cc)/Q/g
  s/([dh]z)/F/g
  s/([dh]zc)/V/g
  s/([pf])/@/g
  s/([pf]z)/\#/g
  s/([pf]zc)/\$/g
  s/(qoe)/Ñ/g
  s/(qor)/P/g
  s/(oe)/H/g
  s/(or)/B/g
  s/(ae)/L/g
  s/(am)/M/g
  s/(an)/N/g
  s/(ar)/R/g
  s/(ak)/K/g
  # Anything else is a program error:
  s/([^)]*)/\*/g
  ------------------------------------

  cat .voyn.glp \
    | glp2let \
    | tr -d ' \012' \
    | enum-digraphs \
    | grep -v ' ' \
    | count-transition-freqs \
        -v chars=' ÆEAIOULMNRKHBÑPWYTDJSZXCGQFV@#$?*'

  State entropy:      4.405
  Transition entropy: 4.405

  Transition counts:

count freq ntrpy pntpy Æ E A I O U L M N R K H B Ñ P W Y T D J S Z X C G Q F V @ # $ ? - ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- N 495 0.028 3.680 0.104 6 12 28 1 11 4 4 1 1 4 . 79 6 4 . 20 83 38 66 12 15 72 15 1 . . 3 1 3 1 1 3 A 685 0.039 3.729 0.145 2 4 3 2 3 16 125 97 56 117 10 61 18 4 . 20 25 14 24 3 8 29 6 4 5 2 . . 1 . . 26 B 286 0.016 3.947 0.064 . 21 9 . 1 1 11 15 10 10 1 49 9 1 . 11 31 8 22 7 3 39 10 6 . . . 1 7 . . 3 Ñ 193 0.011 4.059 0.045 2 9 1 4 3 3 2 4 1 2 . 11 1 5 . 3 13 5 40 6 4 20 13 14 6 15 . . 3 . . 3 O 365 0.021 3.685 0.076 3 10 1 . 5 2 35 48 34 26 1 90 19 . . 5 28 5 10 3 3 13 4 8 . 1 1 1 2 . . 7 P 17 0.001 3.102 0.003 . . . . . 1 . 3 2 . . 2 1 . . . 1 1 1 1 . . . . . . . . . . .
4 C 1654 0.094 3.334 0.313 2 258 4 1 1 14 263 194 349 161 13 98 25 . . . 39 66 66 2 11 45 . . . 1 . . . . . 41 Q 569 0.032 1.395 0.045 339 192 19 2 4 1 2 . . 1 . 6 1 . . . 1 1 . . . . . . . . . . . . . . Æ 2054 0.117 3.534 0.413 20 37 127 58 74 149 4 2 1 6 . 104 16 94 8 744 203 35 96 7 32 70 10 59 21 12 4 3 34 2 . 22 D 927 0.053 2.430 0.128 461 206 50 2 14 . 11 . 1 8 1 55 10 . . . 5 2 . . . . . 30 7 1 45 8 4 . 1 5 E 1726 0.098 3.858 0.378 23 33 151 65 92 174 . 2 . 2 1 93 23 63 6 438 146 32 60 11 24 54 11 98 30 41 1 4 27 1 . 20 R 405 0.023 3.949 0.091 6 28 11 1 7 3 15 4 6 9 1 68 17 2 . 27 54 17 33 7 11 56 8 1 2 . 1 . 4 1 1 4 F 238 0.014 1.128 0.015 15 199 1 . 1 . 6 . 1 1 . 2 4 . . . 3 1 1 . . 1 . . . . . . . . . 2 S 212 0.012 3.547 0.043 36 28 5 1 1 5 9 1 2 9 1 26 4 1 . 1 8 1 2 . . . . 11 1 . 41 17 . . . 1 G 622 0.035 1.678 0.059 425 104 31 1 3 1 1 . 1 5 . 20 4 . . . 3 6 5 . 6 3 . . . . . . . . . 3 T 400 0.023 3.636 0.083 80 62 16 4 6 13 14 . 1 8 4 31 9 . . . 9 1 . . 2 1 . 17 6 2 78 19 5 5 5 2 H 1152 0.065 4.279 0.280 35 58 43 10 25 25 6 14 8 4 1 77 41 6 2 58 68 50 139 24 25 99 30 140 61 65 2 1 19 1 . 15 U 456 0.026 3.995 0.104 12 14 16 7 10 5 5 3 2 4 2 42 19 . . 9 23 25 105 10 10 52 13 33 12 13 . . 6 . . 4 V 82 0.005 1.389 0.006 28 48 1 . 3 1 . . . . . 1 . . . . . . . . . . . . . . . . . . . . I 186 0.011 4.009 0.042 2 11 1 1 2 2 16 19 12 9 3 27 10 1 . 1 11 3 24 3 3 13 4 2 . . . . 3 . . 3 # 15 0.001 2.683 0.002 3 5 2 . . . 1 . . 1 . 1 . . . . 1 . . . . . . . . . . . . . . 1 J 126 0.007 2.458 0.018 33 55 2 1 7 . 2 1 . 1 . 2 2 . . . . . . . . . . 14 2 . . 1 1 . . 2 W 1436 0.082 1.857 0.152 4 1 6 1 . 1 1 1 . 1 . 4 1 . . 4 2 2 2 . . 1 1 754 300 301 2 7 28 . . 11 $ 10 0.001 0.971 0.001 4 6 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . X 150 0.009 2.372 0.020 57 51 5 2 5 . 1 . . . . 4 . 1 . . . . . . . . . 17 2 1 1 1 2 . . . K 43 0.002 2.937 0.007 . 3 8 . 12 . . . . . . 1 . . 1 7 5 . 2 . . 2 . . . . . . 1 . . 
1 L 557 0.032 4.145 0.131 28 46 61 7 32 16 3 2 2 2 1 42 19 9 . 45 28 28 62 9 19 56 7 12 6 2 1 . 7 . . 5 ? 291 0.017 4.161 0.069 51 39 11 13 16 15 2 . 4 3 2 15 3 . . 12 18 2 11 5 2 13 4 12 1 2 6 3 8 . 1 17 Y 885 0.050 2.696 0.136 14 18 16 1 9 1 1 . . . . 2 1 1 . 9 3 . 5 1 2 6 1 386 155 109 8 6 53 1 1 75 M 414 0.024 3.700 0.087 6 12 21 1 15 3 2 . . 2 . 55 5 1 . 22 61 30 65 12 22 56 11 4 . . 2 . 3 1 . 2 Z 716 0.041 2.438 0.099 357 149 33 . 3 . 8 . . 5 1 43 13 . . . 5 . . . . . . 31 5 1 42 9 5 2 . 4 @ 227 0.013 2.978 0.038 1 7 2 . . . 7 3 1 4 . 41 5 . . . 8 27 86 3 10 15 2 . . . . . . . . 5 ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 17594 1.000 3.198 3.198 2055 1726 685 186 365 456 557 414 495 405 43 1152 286 193 17 1436 885 400 927 126 212 716 150 1654 622 569 238 82 226 15 10 291 Transition probabilities (× 99): count freq ntrpy pntpy Æ E A I O U L M N R K H B Ñ P W Y T D J S Z X C G Q F V @ # $ ? - ----- ----- ----- ----- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- P 17 0.001 3.102 0.003 . . . . . 6 . 17 12 . . 12 6 . . . 6 6 6 6 . . . . . . . . . . . 23 A 685 0.039 3.729 0.145 . 1 . . . 2 18 14 8 17 1 9 3 1 . 3 4 2 3 . 1 4 1 1 1 . . . . . . 4 C 1654 0.094 3.334 0.313 . 15 . . . 1 16 12 21 10 1 6 1 . . . 2 4 4 . 1 3 . . . . . . . . . 2 F 238 0.014 1.128 0.015 6 83 . . . . 2 . . . . 1 2 . . . 1 . . . . . . . . . . . . . . 1 $ 10 0.001 0.971 0.001 40 59 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . # 15 0.001 2.683 0.002 20 33 13 . . . 7 . . 7 . 7 . . . . 7 . . . . . . . . . . . . . . 7 G 622 0.035 1.678 0.059 68 17 5 . . . . . . 1 . 3 1 . . . . 1 1 . 1 . . . . . . . . . . . Q 569 0.032 1.395 0.045 59 33 3 . 1 . . . . . . 1 . . . . . . . . . . . . . . . . . . . . T 400 0.023 3.636 0.083 20 15 4 1 1 3 3 . . 2 1 8 2 . . . 2 . . . . 
. . 4 1 . 19 5 1 1 1 . S 212 0.012 3.547 0.043 17 13 2 . . 2 4 . 1 4 . 12 2 . . . 4 . 1 . . . . 5 . . 19 8 . . . . D 927 0.053 2.430 0.128 49 22 5 . 1 . 1 . . 1 . 6 1 . . . 1 . . . . . . 3 1 . 5 1 . . . 1 Z 716 0.041 2.438 0.099 49 21 5 . . . 1 . . 1 . 6 2 . . . 1 . . . . . . 4 1 . 6 1 1 . . 1 J 126 0.007 2.458 0.018 26 43 2 1 6 . 2 1 . 1 . 2 2 . . . . . . . . . . 11 2 . . 1 1 . . 2 X 150 0.009 2.372 0.020 38 34 3 1 3 . 1 . . . . 3 . 1 . . . . . . . . . 11 1 1 1 1 1 . . . V 82 0.005 1.389 0.006 34 58 1 . 4 1 . . . . . 1 . . . . . . . . . . . . . . . . . . . . @ 227 0.013 2.978 0.038 . 3 1 . . . 3 1 . 2 . 18 2 . . . 3 12 38 1 4 7 1 . . . . . . . . 2 O 365 0.021 3.685 0.076 1 3 . . 1 1 9 13 9 7 . 24 5 . . 1 8 1 3 1 1 4 1 2 . . . . 1 . . 2 B 286 0.016 3.947 0.064 . 7 3 . . . 4 5 3 3 . 17 3 . . 4 11 3 8 2 1 13 3 2 . . . . 2 . . 1 N 495 0.028 3.680 0.104 1 2 6 . 2 1 1 . . 1 . 16 1 1 . 4 17 8 13 2 3 14 3 . . . 1 . 1 . . 1 R 405 0.023 3.949 0.091 1 7 3 . 2 1 4 1 1 2 . 17 4 . . 7 13 4 8 2 3 14 2 . . . . . 1 . . 1 M 414 0.024 3.700 0.087 1 3 5 . 4 1 . . . . . 13 1 . . 5 15 7 16 3 5 13 3 1 . . . . 1 . . . I 186 0.011 4.009 0.042 1 6 1 1 1 1 9 10 6 5 2 14 5 1 . 1 6 2 13 2 2 7 2 1 . . . . 2 . . 2 H 1152 0.065 4.279 0.280 3 5 4 1 2 2 1 1 1 . . 7 4 1 . 5 6 4 12 2 2 9 3 12 5 6 . . 2 . . 1 U 456 0.026 3.995 0.104 3 3 3 2 2 1 1 1 . 1 . 9 4 . . 2 5 5 23 2 2 11 3 7 3 3 . . 1 . . 1 L 557 0.032 4.145 0.131 5 8 11 1 6 3 1 . . . . 7 3 2 . 8 5 5 11 2 3 10 1 2 1 . . . 1 . . 1 Ñ 193 0.011 4.059 0.045 1 5 1 2 2 2 1 2 1 1 . 6 1 3 . 2 7 3 21 3 2 10 7 7 3 8 . . 2 . . 2 Æ 2054 0.117 3.534 0.413 1 2 6 3 4 7 . . . . . 5 1 5 . 36 10 2 5 . 2 3 . 3 1 1 . . 2 . . 1 E 1726 0.098 3.858 0.378 1 2 9 4 5 10 . . . . . 5 1 4 . 25 8 2 3 1 1 3 1 6 2 2 . . 2 . . 1 K 43 0.002 2.937 0.007 . 7 18 . 28 . . . . . . 2 . . 2 16 12 . 5 . . 5 . . . . . . 2 . . 2 W 1436 0.082 1.857 0.152 . . . . . . . . . . . . . . . . . . . . . . . 52 21 21 . . 2 . . 1 Y 885 0.050 2.696 0.136 2 2 2 . 1 . . . . . . . . . . 1 . . 1 . . 
1 . 43 17 12 1 1 6 . . 8 ? 291 0.017 4.161 0.069 17 13 4 4 5 5 1 . 1 1 1 5 1 . . 4 6 1 4 2 1 4 1 4 . 1 2 1 3 . . 6 ----- ----- ----- ----- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- TOT 17594 1.000 3.198 3.198 12 10 4 1 2 3 3 2 3 2 0 6 2 1 0 8 5 2 5 1 1 4 1 9 3 3 1 0 1 0 0 2

Note that C=([dh]) on the right is very different from G=([dh]c) and Q=([dh]cc), which are more similar to Z=(sc), D=(tc), X=(scc), and J=(tcc); and the latter are in turn different from S=(s) and T=(t). The difference has to do with the forward distribution of Æ=(bg), which almost never occurs after C (2 of 1654 transitions); and also of L, M, N, R, K, which hardly ever occur after G and Q. Also, F=([dh]z) is common after S=(s) and T=(t) but less so after Z, D, X, J. Thus perhaps we should join C, G, Q, S, T, D, Z, X, J, $, # with the letters that follow them.

Reverse transition probabilities.

Note that these tables are transposed relative to the ones I have created before: the row is the SECOND letter, the column is the FIRST letter.

  cat .voyn.glp \
    | glp2let \
    | tr -d ' \012' \
    | enum-digraphs \
    | grep -v ' ' \
    | revbytes \
    | count-transition-freqs \
        -v chars=' ÆEAIOULMNRKHBÑPWYTDJSZXCGQFV@#$?*'

  State entropy:      4.405
  Transition entropy: 4.405

  Transition counts:

count freq ntrpy pntpy Æ E A I O U L M N R K H B Ñ P W Y T D J S Z X C G Q F V @ # $ ? - ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- N 495 0.028 1.723 0.048 1 . 56 12 34 2 2 . 1 6 . 8 10 1 2 . . 1 1 . 2 . . 349 1 . 1 . 1 . . 4 A 685 0.039 3.780 0.147 127 151 3 1 1 16 61 21 28 11 8 43 9 1 . 6 16 16 50 2 5 33 5 4 31 19 1 1 2 2 . 11 B 286 0.016 4.203 0.068 16 23 18 10 19 19 19 5 6 17 . 41 9 1 1 1 1 9 10 2 4 13 . 25 4 1 4 . 5 . . 3 Ñ 193 0.011 2.067 0.023 94 63 4 1 . . 9 1 4 2 . 6 1 5 . . 1 . . . 1 . 1 . . . . . . . . . O 365 0.021 3.673 0.076 74 92 3 2 5 10 32 15 11 7 12 25 1 3 . .
9 6 14 7 1 3 5 1 3 4 1 3 . . . 16 P 17 0.001 1.646 0.002 8 6 . . . . . . . . 1 2 . . . . . . . . . . . . . . . . . . . . C 1654 0.094 2.583 0.243 59 98 4 2 8 33 12 4 1 1 . 140 6 14 . 754 386 17 30 14 11 31 17 . . . . . . . . 12 Q 569 0.032 2.149 0.069 12 41 2 . 1 13 2 . . . . 65 . 15 . 301 109 2 1 . . 1 1 1 . . . . . . . 2 Æ 2055 0.117 3.209 0.375 20 23 2 2 3 12 28 6 6 6 . 35 . 2 . 4 14 80 461 33 36 357 57 2 425 339 15 28 1 3 4 51 D 927 0.053 3.826 0.202 96 60 24 24 10 105 62 65 66 33 2 139 22 40 1 2 5 . . . 2 . . 66 5 . 1 . 86 . . 11 E 1726 0.098 4.069 0.399 37 33 4 11 10 14 46 12 12 28 3 58 21 9 . 1 18 62 206 55 28 149 51 258 104 192 199 48 7 5 6 39 R 405 0.023 2.843 0.065 6 2 117 9 26 4 2 2 4 9 . 4 10 2 . 1 . 8 8 1 9 5 . 161 5 1 1 . 4 1 . 3 F 238 0.014 2.677 0.036 4 1 . . 1 . 1 2 3 1 . 2 . . . 2 8 78 45 . 41 42 1 . . . . . . . . 6 S 212 0.012 3.795 0.046 32 24 8 3 3 10 19 22 15 11 . 25 3 4 . . 2 2 . . . . . 11 6 . . . 10 . . 2 G 622 0.035 2.310 0.082 21 30 5 . . 12 6 . . 2 . 61 . 6 . 300 155 6 7 2 1 5 2 . . . . . . . . 1 T 400 0.023 3.787 0.086 35 32 14 3 5 25 28 30 38 17 . 50 8 5 1 2 . 1 2 . 1 . . 66 6 1 1 . 27 . . 2 H 1152 0.065 4.321 0.283 104 93 61 27 90 42 42 55 79 68 1 77 49 11 2 4 2 31 55 2 26 43 4 98 20 6 2 1 41 1 . 15 U 456 0.026 2.639 0.068 149 174 16 2 2 5 16 3 4 3 . 25 1 3 1 1 1 13 . . 5 . . 14 1 1 . 1 . . . 15 V 82 0.005 3.243 0.015 3 4 . . 1 . . . 1 . . 1 1 . . 7 6 19 8 1 17 9 1 . . . . . . . . 3 I 186 0.011 2.830 0.030 58 65 2 1 . 7 7 1 1 1 . 10 . 4 . 1 1 4 2 1 1 . 2 1 1 2 . . . . . 13 # 15 0.001 2.866 0.002 2 1 . . . . . 1 1 1 . 1 . . . . 1 5 . . . 2 . . . . . . . . . . J 126 0.007 3.779 0.027 7 11 3 3 3 10 9 12 12 7 . 24 7 6 1 . 1 . . . . . . 2 . . . . 3 . . 5 W 1436 0.082 2.055 0.168 744 438 20 1 5 9 45 22 20 27 7 58 11 3 . 4 9 . . . 1 . . . . . . . . . . 12 $ 10 0.001 2.161 0.001 . . . . . . . . 1 1 . . . . . . 1 5 1 . . . . . . . . . . . . 1 X 150 0.009 3.697 0.032 10 11 6 4 4 13 7 11 15 8 . 30 10 13 . 1 1 . . . . . . . . . . . 2 . . 
4 K 43 0.002 3.145 0.008 . 1 10 3 1 2 1 . . 1 . 1 1 . . . . 4 1 . 1 1 . 13 . . . . . . . 2 L 557 0.032 2.726 0.086 4 . 125 16 35 5 3 2 4 15 . 6 11 2 . 1 1 14 11 2 9 8 1 263 1 2 6 . 7 1 . 2 ? 291 0.017 3.793 0.063 22 20 26 3 7 4 5 2 3 4 1 15 3 3 4 11 75 2 5 2 1 4 . 41 3 . 2 . 5 1 . 17 Y 885 0.050 3.734 0.188 203 146 25 11 28 23 28 61 83 54 5 68 31 13 1 2 3 9 5 . 8 5 . 39 3 1 3 . 8 1 . 18 M 414 0.024 2.385 0.056 2 2 97 19 48 3 2 . 1 4 . 14 15 4 3 1 . . . 1 1 . . 194 . . . . 3 . . . @ 226 0.013 3.567 0.046 34 27 1 3 2 6 7 3 3 4 1 19 7 3 . 28 53 5 4 1 . 5 2 . . . . . . . . 8 Z 716 0.041 3.872 0.158 70 54 29 13 13 52 56 56 72 56 2 99 39 20 . 1 6 1 . . . . . 45 3 . 1 . 15 . . 13 ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- TOT 17594 1.000 3.198 3.198 2054 1726 685 186 365 456 557 414 495 405 43 1152 286 193 17 1436 885 400 927 126 212 716 150 1654 622 569 238 82 227 15 10 291 Transition probabilities (× 99): count freq ntrpy pntpy Æ E A I O U L M N R K H B Ñ P W Y T D J S Z X C G Q F V @ # $ ? - ----- ----- ----- ----- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- C 1654 0.094 2.583 0.243 4 6 . . . 2 1 . . . . 8 . 1 . 45 23 1 2 1 1 2 1 . . . . . . . . 1 Q 569 0.032 2.149 0.069 2 7 . . . 2 . . . . . 11 . 3 . 52 19 . . . . . . . . . . . . . . . G 622 0.035 2.310 0.082 3 5 1 . . 2 1 . . . . 10 . 1 . 48 25 1 1 . . 1 . . . . . . . . . . @ 226 0.013 3.567 0.046 15 12 . 1 1 3 3 1 1 2 . 8 3 1 . 12 23 2 2 . . 2 1 . . . . . . . . 4 N 495 0.028 1.723 0.048 . . 11 2 7 . . . . 1 . 2 2 . . . . . . . . . . 70 . . . . . . . 1 M 414 0.024 2.385 0.056 . . 23 5 11 1 . . . 1 . 3 4 1 1 . . . . . . . . 46 . . . . 1 . . . K 43 0.002 3.145 0.008 . 2 23 7 2 5 2 . . 2 . 2 2 . . . . 9 2 . 2 2 . 30 . . . . . . . 5 L 557 0.032 2.726 0.086 1 . 22 3 6 1 1 . 1 3 . 1 2 . . . . 2 2 . 
2 1 . 47 . . 1 . 1 . . . R 405 0.023 2.843 0.065 1 . 29 2 6 1 . . 1 2 . 1 2 . . . . 2 2 . 2 1 . 39 1 . . . 1 . . 1 # 15 0.001 2.866 0.002 13 7 . . . . . 7 7 7 . 7 . . . . 7 33 . . . 13 . . . . . . . . . . $ 10 0.001 2.161 0.001 . . . . . . . . 10 10 . . . . . . 10 50 10 . . . . . . . . . . . . 10 F 238 0.014 2.677 0.036 2 . . . . . . 1 1 . . 1 . . . 1 3 32 19 . 17 17 . . . . . . . . . 2 V 82 0.005 3.243 0.015 4 5 . . 1 . . . 1 . . 1 1 . . 8 7 23 10 1 21 11 1 . . . . . . . . 4 Æ 2055 0.117 3.209 0.375 1 1 . . . 1 1 . . . . 2 . . . . 1 4 22 2 2 17 3 . 20 16 1 1 . . . 2 E 1726 0.098 4.069 0.399 2 2 . 1 1 1 3 1 1 2 . 3 1 1 . . 1 4 12 3 2 9 3 15 6 11 11 3 . . . 2 B 286 0.016 4.203 0.068 6 8 6 3 7 7 7 2 2 6 . 14 3 . . . . 3 3 1 1 5 . 9 1 . 1 . 2 . . 1 H 1152 0.065 4.321 0.283 9 8 5 2 8 4 4 5 7 6 . 7 4 1 . . . 3 5 . 2 4 . 8 2 1 . . 4 . . 1 T 400 0.023 3.787 0.086 9 8 3 1 1 6 7 7 9 4 . 12 2 1 . . . . . . . . . 16 1 . . . 7 . . . S 212 0.012 3.795 0.046 15 11 4 1 1 5 9 10 7 5 . 12 1 2 . . 1 1 . . . . . 5 3 . . . 5 . . 1 D 927 0.053 3.826 0.202 10 6 3 3 1 11 7 7 7 4 . 15 2 4 . . 1 . . . . . . 7 1 . . . 9 . . 1 Z 716 0.041 3.872 0.158 10 7 4 2 2 7 8 8 10 8 . 14 5 3 . . 1 . . . . . . 6 . . . . 2 . . 2 X 150 0.009 3.697 0.032 7 7 4 3 3 9 5 7 10 5 . 20 7 9 . 1 1 . . . . . . . . . . . 1 . . 3 J 126 0.007 3.779 0.027 6 9 2 2 2 8 7 9 9 6 . 19 6 5 1 . 1 . . . . . . 2 . . . . 2 . . 4 W 1436 0.082 2.055 0.168 51 30 1 . . 1 3 2 1 2 . 4 1 . . . 1 . . . . . . . . . . . . . . 1 Ñ 193 0.011 2.067 0.023 48 32 2 1 . . 5 1 2 1 . 3 1 3 . . 1 . . . 1 . 1 . . . . . . . . . P 17 0.001 1.646 0.002 47 35 . . . . . . . . 6 12 . . . . . . . . . . . . . . . . . . . . I 186 0.011 2.830 0.030 31 35 1 1 . 4 4 1 1 1 . 5 . 2 . 1 1 2 1 1 1 . 1 1 1 1 . . . . . 7 Y 885 0.050 3.734 0.188 23 16 3 1 3 3 3 7 9 6 1 8 3 1 . . . 1 1 . 1 1 . 4 . . . . 1 . . 2 A 685 0.039 3.780 0.147 18 22 . . . 2 9 3 4 2 1 6 1 . . 1 2 2 7 . 1 5 1 1 4 3 . . . . . 2 O 365 0.021 3.673 0.076 20 25 1 1 1 3 9 4 3 2 3 7 . 1 . . 2 2 4 2 . 
1 1 . 1 1 . 1 . . . 4 U 456 0.026 2.639 0.068 32 38 3 . . 1 3 1 1 1 . 5 . 1 . . . 3 . . 1 . . 3 . . . . . . . 3 ? 291 0.017 3.793 0.063 7 7 9 1 2 1 2 1 1 1 . 5 1 1 1 4 26 1 2 1 . 1 . 14 1 . 1 . 2 . . 6 ----- ----- ----- ----- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- TOT 17594 1.000 3.198 3.198 12 10 4 1 2 3 3 2 3 2 0 6 2 1 0 8 5 2 5 1 1 4 1 9 3 3 1 0 1 0 0 2

Coincidence: if we keep the spaces that come from the line breaks, we get exactly 20000 digraphs.

It seems that (p) is basically a version of (d) (dc) (dcc) (h) (hc) (hcc), which are themselves very similar to each other. Let's try to see if there is any difference between the location of "H" words and that of the corresponding "D" words.

  cat .voyn.fsg \
    | tr ' ' '\012' \
    | egrep '^[A-Z248][A-Z248]*$' \
    | head -6400 \
    | enum-words-in-blocks -v WPB=100 \
    | sort +0.5 -0.99 \
    | make-word-location-map \
        -v MAXLEN=12 \
        -v CTWD=1 \
        -v NBLOCKS=64 \
        -v PERCENT=0 \
    > .voyn.map

After some manual editing, I extracted the pairs of words that (1) differ only by an "H"/"D" switch, and (2) together have at least 5 occurrences. Here are their location maps:

  1 2OEHCC8G ................1...............................................
  5 2OEDCC8G ........................1............................11.2.......
             ----------------------------------------------------------------
  1 4OEHCC8G .........................................................1......
  8 4OEDCC8G ..............1...1..........1.2............1...........1.....1.
             ----------------------------------------------------------------
 23 4OHAE    .............1...11..12.....21.1..11...1.1.1...1.2112..1........
106 4ODAE    211.4111..41.....214562....131..112341142111.261.524211275112..1
             ----------------------------------------------------------------
  1 4OHAEG   ........1.......................................................
  6 4ODAEG   ....2............1......1.....................1..........1......
             ----------------------------------------------------------------
  1 4OHAL    ..1.............................................................
  4 4ODAL    1......................1.........1....1.........................
             ----------------------------------------------------------------
 15 4OHAM    ................213.....1...1......1.2.........12...1...........
 85 4ODAM    .........3.2..12536.341.4...21.11.1111431...1.243..352.15421...1
             ----------------------------------------------------------------
 22 4OHAN    .....1..2......2.11...........312.3....2..1..1.........2........
150 4ODAN    4843481..3.....1142.2525231.523612746723731.1.11.25511.1131.2.21
             ----------------------------------------------------------------
 14 4OHAR    1.1...1........1.....1.....11.....11...1...........1...1...11...
 47 4ODAR    311.122.122......1........1231...14112.1...1.2.1..311...111...2.
             ----------------------------------------------------------------
 46 4OHC8G   ...11111.....11.2432.11..2.....1.1....2...2...11131...11..3222..
161 4ODC8G   1419322311.2.55.31.55793452..1.53.1.112...1414.4143.247211632553
             ----------------------------------------------------------------
  2 4OHCC8   ............1................................................1..
  5 4ODCC8   ....1..1.....................................1...........1...1..
             ----------------------------------------------------------------
 40 4OHCC8G  ...21.1.....131.522...1...12..32.......11..1..1..1..4.11....1.1.
152 4ODCC8G  3.48..313.1134437942213314...1.91252111..22...25723.213252523311
             ----------------------------------------------------------------
  7 4OHCCG   ...1.............1.........1...................111...........1..
 85 4ODCCG   .12..3.531..112..31.1.21...2..13532.31..1131.11242.12.3.413.1223
             ----------------------------------------------------------------
  8 4OHCG    ........1........1.............1....1.....2......11.............
 40 4ODCG    ..22..21133....1....2..1.112.....1.1....11.1.1.1.1....2...2..114
             ----------------------------------------------------------------
 26 4OHG     ..1....1...1......1..1......2.1....22..14....2..22..1......11...
 58 4ODG     33.1221...11...4.1..2.2.......1113.12.1.2...1221.1223.31...1.22.
             ----------------------------------------------------------------
  6 4OHOE    ................1..1....................2........1.........1....
 18 4ODOE    ......1.......1.2.....11....1.111.....11.11......111........1...
             ----------------------------------------------------------------
  2 4OHSC8G  ..............................1....................1............
  7 4ODSC8G  .1............1.............11.1....................11..........
             ----------------------------------------------------------------
  2 4OHT8G   ......1................1........................................
 10 4ODT8G   1...1..1...............1.....1........2......1.......2..........
             ----------------------------------------------------------------
  3 4OHTC8G  ........................1............................1.....1....
  7 4ODTC8G  ....................1......................1.........12.1..1....
             ----------------------------------------------------------------
  2 4OHTCG   ............................11..................................
  4 4ODTCG   ..............................1...................11...........1
             ----------------------------------------------------------------
  3 4OHTG    .....................1............11............................
  6 4ODTG    ..............1.....1...1...1.........................1.....1...
             ----------------------------------------------------------------
  1 4OHZCG   .............................................1..................
  4 4ODZCG   ...........1........................1...........1......1........
             ----------------------------------------------------------------
  1 EHAM     ..................1.............................................
  5 EDAM     ...........1..........1...............2.........1...............
             ----------------------------------------------------------------
  1 EHAN     1...............................................................
  5 EDAN     .......11...............1.............1........................1
             ----------------------------------------------------------------
  2 EHC8G    ..1..........................................1..................
  7 EDC8G    .......1...1.1..................................1.1...11........
             ----------------------------------------------------------------
  2 EHCC8G   .............1.................................1................
  6 EDCC8G   .......11..1..1.........1......................1................
             ----------------------------------------------------------------
  2 GHAN     1..........1....................................................
  3 GDAN     ....1...................1...................................1...
             ----------------------------------------------------------------
  6 GDC8G    ............1.........1.1....................1..............1.1.
  8 GHC8G    ......1.................1..1.................4.............1....
             ----------------------------------------------------------------
  3 GHCC8G   ......................1...1....1................................
 10 GDCC8G   ......1.................1.......11........1..1....1.......11.1..
             ----------------------------------------------------------------
  4 GHCCG    ............1................1...1...........1..................
  7 GDCCG    1........................1...1..11...1.........1................
             ----------------------------------------------------------------
  1 OEHAE    ...........................1....................................
  4 OEDAE    ..............................1..........11...................1.
             ----------------------------------------------------------------
  1 OEHAM    ........................................1.......................
  9 OEDAM    ....................1......1.1......1......21......1..........1.
             ----------------------------------------------------------------
  2 OEHAN    ...........................1.............1......................
 24 OEDAN    .....1.................2.1......2..1111142..1.1...121.........1.
             ----------------------------------------------------------------
  1 OEHAR    ...........................1....................................
  6 OEDAR    1..................................1....11.1............1.......
             ----------------------------------------------------------------
  3 OEHC8G   ..1..................1....................1.....................
 19 OEDC8G   .....12....11.1.......22.1.............1..111.............121...
             ----------------------------------------------------------------
  2 OEHCC8G  .............................1........1.........................
 15 OEDCC8G  ...1....................11......2.....1.......2...121...1.1..1..
             ----------------------------------------------------------------
 20 OHAE     ......1..1.1..........1....13.1...1.1112..........1.11.........2
 21 ODAE     ..11..1.....2......1....1111......1.....2..111.1..1........1.11.
             ----------------------------------------------------------------
  2 ODAEG    ........................1...........1...........................
  3 OHAEG    .........................1............1............1............
             ----------------------------------------------------------------
 13 OHAM     ...............1..........111....1..1.1.1..........1..21...1....
 33 ODAM     ...1.1..11..1..2.12.1...1.....2.3...12.2....3.....1....1211.1..1
             ----------------------------------------------------------------
 25 OHAN     .1...11....2..............1122..13.1.11....12..1..111...........
 41 ODAN     ......1112.21..........2......1.212323.121211..11.11..1....1..21
             ----------------------------------------------------------------
 19 ODAR     1..1.1..11.31.........1....2....11......1.1.......2...........1.
 22 OHAR     .133..1...1.1.1.......1.12.....1..1.1.....1...1...........11....
             ----------------------------------------------------------------
  2 OHC8AR   ............1................................................1..
  3 ODC8AR   .............1..............................................2...
             ----------------------------------------------------------------
 44 ODC8G    ...21.12..121..1.1.1...213.....1..1.........111......1....113644
 50 OHC8G    ..133.1...122411.1..1.3.13......1..1......211..1....1.21.112241.
             ----------------------------------------------------------------
 16 ODCCG    1.....1.2......1.........1...111.2........1.111............1....
 18 OHCCG    ......11..1..................1..21..11....21..1.1......11..1.1..
             ----------------------------------------------------------------
 11 ODCG     1......1..................11.....1..............1............113
 13 OHCG     ...1..111......1......2..........1....1...12................1...
             ----------------------------------------------------------------
  3 OHCOE    ..............1............................1...................1
  3 ODCOE    ..........................1...................1.........1.......
             ----------------------------------------------------------------
 12 ODG      ...........3..........11.......1...1......1..11...11............
 17 OHG      ..11...1..2.1..1.......1.1......1111...1..1.......1........1....
             ----------------------------------------------------------------
  6 ODOE     ........1.............1...1.1............................1....1.
  8 OHOE     ......1........1................11.1.....1...................11.
             ----------------------------------------------------------------
  3 ODSC8G   .1..........................................1...............1...
  4 OHSC8G   ...........................1.1.....................1......1.....
             ----------------------------------------------------------------
  2 OHSCG    .............................1..1...............................
  3 ODSCG    ......1......................1...........1......................
             ----------------------------------------------------------------
  2 ODTC8G   ..................................1...........1.................
  3 OHTC8G   .............................1........................1.....1...
             ----------------------------------------------------------------
  4 OHTCG    ...........................1.1........................1....1....
  6 ODTCG    1.........................1...1.....1...........1.......1.......
             ----------------------------------------------------------------
  2 ODTG     ..........1....................1................................
  2 OHTG     ............1.............................................1.....
             ----------------------------------------------------------------
  7 SCCHG    ..11............2.....1....1..........1.........................
  8 SCCDG    1...1....1.......................1....1......1.....1.........1..
             ----------------------------------------------------------------
  6 SCHG     ......1........1..................2..11.........................
 11 SCDG     ...1...........1..............1.1.21.1.........1.1............1.
             ----------------------------------------------------------------
 13 SCHZG    ..1......1..1.....................131..2.............11....1....
 19 SCDZG    ....22........2.........1.........1122.1......11.11..1..........
             ----------------------------------------------------------------
  1 SHZC8G   .............1..................................................
  6 SDZC8G   ...............1......1.......................1.......2...1.....
             ----------------------------------------------------------------
  3 SHZCG    ................1....................................1.....1....
  5 SDZCG    ..............1....................1............1....1...1......
             ----------------------------------------------------------------
 12 SHZG     .........11..........1.............2.1.1.1...........2....11....
 24 SDZG     ...1..1.2.22.............11.1........11..1...21.........1.112.11
             ----------------------------------------------------------------
  2 TCCHG    ..................1...............1.............................
 11 TCCDG    ..1............1.1.....111.........1........1....1....1..1......
             ----------------------------------------------------------------
  4 TCHG     ....1....................1.........1..................1.........
 12 TCDG     ..1..1...............21..............1.......11....1..1...1..1..
             ----------------------------------------------------------------
 17 TCHZG    .1......1....1.................1...123.11............11...2..1..
 21 TCDZG    ......1.1.....4...11.1.11...1....11..2......1.....1.1....1.1....
             ----------------------------------------------------------------
  1 THZ8G    .......................................................1........
  5 TDZ8G    .......................2..........................1...1...1.....
             ----------------------------------------------------------------
  2 THZC8G   ..1.........1...................................................
  5 TDZC8G   ...............1......1..................................1...2..
             ----------------------------------------------------------------
  1 THZCG    ......................1.........................................
  9 TDZCG    ...11.......1....1........11......1..............1..........1...
             ----------------------------------------------------------------
 20 THZG     1.........2......1.......1.121....121.1...2.......1....1...11...
 35 TDZG     1.1....11222..2..1..2..1......11..12..11.2..211...1.1...1.2....1
             ----------------------------------------------------------------
  3 DAE      1........1.....................................................1
  8 HAE      .............................1...1....211............1.1........
             ----------------------------------------------------------------
  4 HAM      ...............1...........................1...1.1..............
  6 DAM      ....................1.............1..1..1...1.....1.............
             ----------------------------------------------------------------
  4 HAN      ...............1...1...1............1...........................
 14 DAN      ..2.1...11.......................1....1132.............1........
             ----------------------------------------------------------------
  3 DAR      .......................1...1....1...............................
  5 HAR      ................................1.............1...1..1.....1....
             ----------------------------------------------------------------
 11 DC8G     .....1.................31.1...........1........1.............2.1
 20 HC8G     1...2.1.........1.....2.1.1......1........1...1...1......112..21
             ----------------------------------------------------------------
  4 HCC8G    ..............1....1........................1...............1...
 10 DCC8G    .1.21......11..1......1..1.............1........................
             ----------------------------------------------------------------
  3 DOE      .......1..1..............................................1......
  6 HOE      .....1................................12.1............1.........
             ----------------------------------------------------------------
  2 DSCG     .1.................1............................................
  3 HSCG     ............................1...1...............1...............
             ----------------------------------------------------------------
  1 DTC8G    .....................1..........................................
 11 HTC8G    ..1..........2.1......1..........11...............1.111.........
             ----------------------------------------------------------------
  2 HTCG     ......................................2.........................
  4 DTCG     .....1.........................1......1............1............
             ----------------------------------------------------------------
  2 DZCG     1................................................1..............
  3 HZCG     .1.......................................................1..1...
             ----------------------------------------------------------------

While these data do not prove the equivalence of "H" and "D", they seem basically compatible with it.
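
To put a rough number on "compatible", one could compare the two maps of a pair as distributions over the 64 blocks. The following is only a sketch, not part of the pipeline above: the helper names parse_map and total_variation are hypothetical, and total variation distance is my own choice of measure. It assumes each map character is a per-block count, with "." meaning zero; for the 4OHAE/4ODAE pair the digits do sum to the listed counts, 23 and 106.

```python
# Sketch: compare the location maps of an "H" word and its "D" partner.
# Assumption: each character is the count for one block ('.' = 0).

def parse_map(row):
    """Turn a location-map string into a list of per-block counts."""
    return [0 if ch == "." else int(ch) for ch in row]

def total_variation(a, b):
    """Half the L1 distance between the two maps, each normalised to sum 1.

    0 means identical relative distributions, 1 means disjoint blocks.
    """
    sa, sb = sum(a), sum(b)
    if sa == 0 or sb == 0:
        raise ValueError("empty map")
    return 0.5 * sum(abs(x / sa - y / sb) for x, y in zip(a, b))

# Maps copied from the 4OHAE / 4ODAE pair in the listing above.
H_4OHAE = ".............1...11..12.....21.1..11...1.1.1...1.2112..1........"
D_4ODAE = "211.4111..41.....214562....131..112341142111.261.524211275112..1"

h = parse_map(H_4OHAE)
d = parse_map(D_4ODAE)

print("occurrences:", sum(h), sum(d))
print("total variation distance: %.3f" % total_variation(h, d))
```

With counts this small the distance is noisy, so it is at best a way to rank pairs for a closer look, not a test of equivalence.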