Prefix/suffix decomposition for the Biological section

[ This page has been made obsolete by the new prefix-midfix-suffix decomposition schema. ]

I remember seeing in the mail archives a simple "railroad" syntactic diagram by Mike Roe and Rene Zandbergen that generated most of the words in the VMs. This is a first attempt to reconstruct such a diagram for the Biological section, using my "blurred" stroke-level encoding.

Contents

Source text

The counts were obtained from the entire Biological section of the VMs (f75r--f84v), which is in Currier's "Language B". The version used was a mechanical stroke-level "consensus" of the Currier and FSG transcriptions. Only "good" words (where the two versions agreed) were used.

Character encoding

The text was encoded with an ad-hoc stroke-level encoding, with identification of some easily confused letters. It is basically the Frogguy encoding, with the following changes:

    Frogguy      8    9    a    s    e    e'   t    ig   iiiv iiv  iv  
    -----------  ---- ---- ---- ---- ---- ---- ---- ---- ---- ---- ----
    This table   c3   ci   ci   c    c    c    c    f    m    m    n  
    
    Frogguy      qp   lp   dj   fj   eQPt eLPt eDJt eFJt
    -----------  ---- ---- ---- ---- ---- ---- ---- ----
    This table   H    H    P    P    cHc  cHc  cPc  cPc 

The full table

In the table below, each entry is the count of all words that can be parsed as PREFIX·SUFFIX, where PREFIX is any of the 18 strings

    2 4oH 4oP 4ox H P c c3 cH ci ciH xH cccH ccccH oH oP ox oxH x
and SUFFIX is any string of the form [co][^HP4]* (that is, one \c/ or \o/, followed by zero or more symbols that are neither a "tall letter" or a \4/.)

For clarity, the most common suffixes have been grouped into similar classes.

The entries in this table account for 4104 out of the 4742 "good" words in that section (that is, 87%).

                     PREFIX
                     -----------------------------------------------------------------------------------------------------------------
               TOTAL   4oH    oH     c     H   oxH   ciH    cH    xH  cccH ccccH    c3     P     x    ox   4ox     2    oP    ci   4oP
               ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- -----
  SUFFIX \ TOT  4104  1038   446   998   148   105    53    21    43   173   109   338    63   212   150    38    73    40    34    22
  ------------ ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- -----
  ci             257    79    28     1     2    10     1     1     2    31    26    36     0     6    22     7     2     2     0     1
  cci            247    43    22    14     2     4     2     1     1    90    68     0     0     0     0     0     0     0     0     0
  ccci           333    87    35   139     5    19    10     6     3    11     4     2     0     5     4     2     1     0     0     0
  cccci          171    12    13    61    10     3     0     0     1     1     0     3     3    20    19     9     5     4     5     2
  ccccci          26     1     1     1     1     1     1     0     0     0     0     3     0     6     4     0     1     2     4     0
  cccccci          1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     1     0

  c3ci            21     1     0     0     0     0     0     0     0     0     0     0     0     8     9     1     0     0     1     1
  cc3ci          392   201    85    21    28    20    12     3     8    10     2     2     0     0     0     0     0     0     0     0
  ccc3ci         736   191    57   387    16    15    12     4     7    15     5     5     3     7     6     1     1     2     0     2
  cccc3ci        325    19    10    60    18     2     1     0     2     0     0    27    17    73    38     7    10    18    12    11
  ccccc3ci        18     2     0     0     0     0     0     0     0     0     0     2     0     6     1     3     2     0     2     0

  cim            283    94    41    30     7     7     4     0     6     2     0    72     0     2     3     2    13     0     0     0
  ccccim           1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0

  cix            254   116    40    14    10     5     3     1     0     1     0    50     0     3     2     2     6     1     0     0
  ccix            12     1     0     7     0     0     0     1     0     1     2     0     0     0     0     0     0     0     0     0
  cccix           10     0     1     7     1     0     0     0     0     1     0     0     0     0     0     0     0     0     0     0
  ccccix           2     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0

  ci2            184    50    36    11     8     7     2     1     4     3     0    51     1     2     1     1     3     2     1     0
  cci2            11     1     0    10     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ccci2           12     0     1     8     1     1     0     0     0     0     0     0     0     1     0     0     0     0     0     0
  cccci2           1     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0

  cin             98    54    16     0     2     5     2     0     2     0     0    12     0     0     3     1     1     0     0     0
  ccin             1     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0

  cif             18     4     0     0     1     1     0     0     0     2     0     6     0     1     0     0     3     0     0     0
  ccif             3     0     0     3     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0

  ox             124    21    10    25     6     1     0     0     1     0     0    17     8    16     7     1     9     1     0     1
  cox             35     1     6    23     3     1     0     1     0     0     0     0     0     0     0     0     0     0     0     0
  ccox            45     1     1    33     4     0     0     0     0     0     0     3     2     0     0     0     0     0     0     1
  cccox           25     0     0     4     3     0     1     0     0     0     0     4     4     3     3     1     0     0     1     1

  o2              51     4     2    10     4     0     0     0     0     0     0     3     0    10    13     0     3     1     0     1
  co2              7     4     0     1     0     0     0     1     0     1     0     0     0     0     0     0     0     0     0     0
  cco2            25     0     1    17     0     0     0     0     0     0     0     2     2     3     0     0     0     0     0     0
  ccco2            2     0     0     1     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0
  cccco2           1     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0

  om              10     1     0     5     0     1     0     0     0     0     0     0     1     0     0     0     2     0     0     0

  ccc3            18     5     3     7     2     0     0     0     1     0     0     0     0     0     0     0     0     0     0     0
  cccc3           15     1     0     1     0     0     0     0     0     1     0     0     0     5     3     0     2     1     1     0

  c3ci2            1     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0
  cc3ci2          11     4     4     3     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ccc3ci2          7     0     0     4     0     0     2     0     0     0     0     0     0     0     1     0     0     0     0     0
  cccc3ci2         8     0     0     1     1     0     0     0     0     0     0     0     4     1     0     0     0     0     1     0

  c                5     0     0     1     0     0     0     0     0     2     0     0     0     0     2     0     0     0     0     0
  cc               3     1     0     0     0     0     0     0     0     0     0     0     0     2     0     0     0     0     0     0
  ccc             13     1     0    10     0     0     0     0     0     0     0     0     0     1     0     0     1     0     0     0
  cccc             5     0     0     4     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0
  ccccc            4     0     0     0     0     0     0     0     0     0     0     1     0     1     1     0     0     0     1     0

  c3cix            1     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cc3cix           6     2     2     1     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ccc3cix         13     2     1     9     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     1
  cccc3cix         3     0     0     0     0     0     0     0     0     0     0     0     1     1     0     0     0     1     0     0
  ccccc3cix        1     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0

  cc3o2            1     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ccc3o2           1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0

  c3cixo2          1     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0
  cc3cixo2         1     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0

  cixo2            5     0     4     0     0     0     0     0     1     0     0     0     0     0     0     0     0     0     0     0
  cccixo2          1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0

  oxo2             2     0     0     1     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0
  coxo2            1     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0

  c3               1     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0
  cc3              6     3     1     0     0     1     0     0     0     1     0     0     0     0     0     0     0     0     0     0
  ccccc3           1     0     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0

  cic              3     2     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ccccic           1     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0

  cc3cif           2     0     1     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ccc3cif          2     0     0     1     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0

  ci2ci            8     1     0     0     1     0     0     0     0     0     0     5     0     0     0     0     1     0     0     0
  cci2ci           1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0

  oxc3ci           5     1     1     1     1     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0
  coxc3ci          5     0     1     3     0     0     0     0     1     0     0     0     0     0     0     0     0     0     0     0
  ccoxc3ci         1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cccoxc3ci        1     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0

  cc3ccci          1     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ccc3ccci         2     0     0     1     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0     0

  oxcccci          5     0     0     3     0     0     0     0     0     0     0     0     1     0     1     0     0     0     0     0
  coxcccci         1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ccoxcccci        1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0

  c3cim            2     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     1     0
  cc3cim           2     1     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ccc3cim          9     0     0     9     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0

  o                5     0     0     0     0     0     0     0     0     0     0     0     0     4     1     0     0     0     0     0
  co               1     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0     0
  cco              1     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0

  coxo             1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ccoxo            1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0

  cx               6     0     0     6     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ccx              2     0     0     0     0     0     0     0     0     0     0     0     0     2     0     0     0     0     0     0

  cixci           21     7     4     1     1     0     0     0     0     0     0     6     0     0     0     0     1     1     0     0
  ccixci           1     0     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0
  cccixci          3     0     0     2     0     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0

  c3ox             2     0     0     0     0     0     0     0     0     0     0     0     1     1     0     0     0     0     0     0
  cc3ox            1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ccc3ox           3     0     0     3     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0

  oxox             1     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ccoxox           1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0

  oxci2ci2         1     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0
  oc3ci2           1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cixci2           1     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0
  cci2o2           1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  c3cixcco2        1     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0
  cx2              1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cixcccc3         1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  oxcccc3          1     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0
  oxccc3           1     0     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ccoxcic3         1     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0
  oc3              1     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0
  ccccixc3         1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0
  cixci28          1     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0
  cioc             1     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cxc              1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cixc             1     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ci2cif           1     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ccccif           2     0     0     0     0     0     0     0     0     0     0     0     0     1     1     0     0     0     0     0
  ci2of            1     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0
  oxof             1     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0
  ci2oxof          1     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cccc3ci2ci       1     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0
  cixci2ci         1     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0
  ci2o2ci          1     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0
  ci2c3ci          1     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cixc3c3ci        1     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cc3cc3ci         1     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cc3cccc3ci       1     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  oxc3cccc3ci      1     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0
  oxccccc3ci       1     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0
  occcc3ci         1     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0
  cixcccc3ci       4     0     1     0     0     0     0     0     0     0     0     2     0     0     0     0     0     1     0     0
  oxcccc3ci        6     1     0     2     0     0     0     0     0     0     0     1     2     0     0     0     0     0     0     0
  o2oxcccc3ci      1     0     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ciccc3ci         1     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cixccc3ci        3     0     1     0     0     0     0     0     0     0     0     2     0     0     0     0     0     0     0     0
  oxcixccc3ci      1     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0
  oxccc3ci         3     0     0     0     1     0     0     0     0     0     0     0     2     0     0     0     0     0     0     0
  ccoxccc3ci       1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cic3ci           5     3     1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ccoc3ci          1     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cxc3ci           1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cixc3ci          8     3     2     0     0     0     0     0     0     0     0     3     0     0     0     0     0     0     0     0
  ccc3cixc3ci      1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ccccixc3ci       1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     1     0
  cc3cci           1     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0     0     0     0
  o2ccci           1     0     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ci2cccci         1     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0
  c3ci2cccci       1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     1     0
  cixcccci         1     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  occci            1     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0
  cixccci          1     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  oxccci           2     0     0     0     0     0     0     0     0     0     0     0     2     0     0     0     0     0     0     0
  cimci            1     0     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0
  oxci             8     0     0     1     0     0     0     0     0     0     0     0     0     4     0     0     1     2     0     0
  coxci            2     1     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  ccixoxci         1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  o2cim            2     0     0     0     0     0     0     0     0     0     0     1     0     1     0     0     0     0     0     0
  coc3cim          1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cxc3cim          1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cixcim           1     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0
  oxcim            1     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0
  o2cin            1     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0
  cc3cin           1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cifo             1     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0     0     0     0
  cino             1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cixo             2     0     0     0     0     0     0     0     0     0     0     2     0     0     0     0     0     0     0     0
  cixccx           1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0
  ci2cix           2     0     0     1     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  c3ci2cix         1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     1     0
  o2cix            1     0     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0
  ci2ccccc3cix     1     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0
  oxccc3cix        1     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0
  oc3cix           1     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0     0     0     0
  oxc3cix          1     0     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cixcix           1     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0
  ciix             2     1     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0
  ccci2ox          1     0     0     0     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0
  o2ox             2     0     0     0     0     0     0     0     0     0     0     0     0     0     1     0     1     0     0     0
  ccccox           2     0     0     1     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0
  oxcccox          1     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cc3ciox          1     0     0     0     1     0     0     0     0     0     0     0     0     0     0     0     0     0     0     0
  cixox            4     1     2     0     0     0     0     0     0     0     0     1     0     0     0     0     0     0     0     0

Significant suffixes

Only 19 suffixes occur in significant numbers; namely,

       746  0.18  ccc3ci
       398  0.09  cc3ci
       343  0.08  ccci
       325  0.08  cccc3ci
       285  0.07  cim
       271  0.06  ci
       260  0.06  cci
       257  0.06  cix
       185  0.04  ci2
       172  0.04  cccci
       127  0.03  ox
        98  0.02  cin
        51  0.01  o2
        45  0.01  ccox
        36  0.01  cox
        26  0.01  ccccci
        25  0.01  cco2
        25  0.01  cccox
These 19 suffixes and the 18 prefixes listed above still acount for 3611 good words (76%) of the Biological section.

Reduced table

The following table shows the same information as the full one, but only for the significant suffixes, sorted by frequency:

                     PREFIX
                     -----------------------------------------------------------------------------------------------------------------
               TOTAL   4oH    oH     c     H   oxH   ciH    cH    xH  cccH ccccH     c3     P     x    ox   4ox     2    oP    ci   4oP
               ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- -----
  SUFFIX \ TOT  3611   974   851   404   292   156   164   125   129   105   100    55    51    40    37    34    33    23    18    20
  ------------ ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- ----- -----
  ccc3ci         736   191    57   387    16    15    12     4     7    15     5     5     3     7     6     1     1     2     0     2
  cc3ci          392   201    85    21    28    20    12     3     8    10     2     2     0     0     0     0     0     0     0     0
  ccci           333    87    35   139     5    19    10     6     3    11     4     2     0     5     4     2     1     0     0     0
  cccc3ci        325    19    10    60    18     2     1     0     2     0     0    27    17    73    38     7    10    18    12    11
  cim            283    94    41    30     7     7     4     0     6     2     0    72     0     2     3     2    13     0     0     0
  ci             257    79    28     1     2    10     1     1     2    31    26    36     0     6    22     7     2     2     0     1
  cix            254   116    40    14    10     5     3     1     0     1     0    50     0     3     2     2     6     1     0     0
  cci            247    43    22    14     2     4     2     1     1    90    68     0     0     0     0     0     0     0     0     0
  ci2            184    50    36    11     8     7     2     1     4     3     0    51     1     2     1     1     3     2     1     0
  cccci          171    12    13    61    10     3     0     0     1     1     0     3     3    20    19     9     5     4     5     2
  ox             124    21    10    25     6     1     0     0     1     0     0    17     8    16     7     1     9     1     0     1
  cin             98    54    16     0     2     5     2     0     2     0     0    12     0     0     3     1     1     0     0     0
  o2              51     4     2    10     4     0     0     0     0     0     0     3     0    10    13     0     3     1     0     1
  ccox            45     1     1    33     4     0     0     0     0     0     0     3     2     0     0     0     0     0     0     1
  cox             35     1     6    23     3     1     0     1     0     0     0     0     0     0     0     0     0     0     0     0
  ccccci          26     1     1     1     1     1     1     0     0     0     0     3     0     6     4     0     1     2     4     0
  cccox           25     0     0     4     3     0     1     0     0     0     0     4     4     3     3     1     0     0     1     1
  cco2            25     0     1    17     0     0     0     0     0     0     0     2     2     3     0     0     0     0     0     0

Unparseable words

Here are the words that cannot be parsed with the above prefix/suffix set. Some are too short; they may be function words (articles, preposiitons, etc.). Others could be covered by adding a few more suffixes and/or prefixes---but they may also be errors, and sometimes it is hard to tell where to split the word...

   131  0.12  ox                   3  0.00  c3cixc3ci            2  0.00  4occci               1  0.00  oPcccc3
    81  0.07  4ox                  3  0.00  ccoxc3ci             2  0.00  4occcc3ci            1  0.00  oHoxc3ci
    41  0.04  o2                   3  0.00  cccif                2  0.00  4oHx                 1  0.00  oHxo2
    21  0.02  cim                  3  0.00  ccc3ci2              2  0.00  4oHcic               1  0.00  oHxox
    12  0.01  cix                  3  0.00  cccc3ox              2  0.00  4oHcc3cix            1  0.00  oHcoxo2
    10  0.01  ccci2                3  0.00  ccHox                2  0.00  4oHccc3cix           1  0.00  oHcoxc3ci
    10  0.01  cccc                 3  0.00  ccHcc3ci             2  0.00  4oHccccc3ci          1  0.00  oHci2oxof
     9  0.01  oxc3ci               3  0.00  4oxccccc3ci          2  0.00  4oH                  1  0.00  oHci2cif
     9  0.01  cccc3cim             3  0.00  4oxHccci             1  0.00  2o2ox                1  0.00  oHcioc
     9  0.01  cccc3cix             3  0.00  4oc3ci               1  0.00  2oxci                1  0.00  oHcixccci
     8  0.01  xc3ci                3  0.00  4oHcixc3ci           1  0.00  2ci2ci               1  0.00  oHcixccc3ci
     8  0.01  cccci2               3  0.00  4oHcic3ci            1  0.00  2cixci               1  0.00  oHcixcccc3ci
     7  0.01  ci2                  3  0.00  4oHcc3               1  0.00  2cixccx              1  0.00  oHcixHci
     7  0.01  cccix                2  0.00  2om                  1  0.00  2cicHcci             1  0.00  oHcic3ci
     7  0.01  ccccix               2  0.00  2cccc3               1  0.00  2ccccixc3            1  0.00  oHc3cix
     7  0.01  cccc3                2  0.00  2ccccc3ci            1  0.00  2ccccix              1  0.00  oHccoc3ci
     7  0.01  4oHcixci             2  0.00  2cccHci              1  0.00  2cccc                1  0.00  oHccoHci2
     6  0.01  xccccc3ci            2  0.00  o2oxci               1  0.00  2ccc                 1  0.00  oHcc3cim
     6  0.01  c3cif                2  0.00  om                   1  0.00  o2ci2                1  0.00  oHcc3cif
     6  0.01  c3cixci              2  0.00  oxoHci               1  0.00  o2cin                1  0.00  oHcc3cixo2
     6  0.01  ccx                  2  0.00  oxxccci              1  0.00  o2cix                1  0.00  oHcc3ccci
     6  0.01  ccHci                2  0.00  oxcccHcci            1  0.00  o2cccci              1  0.00  oHcc3
     6  0.01  c                    2  0.00  oxc                  1  0.00  o2ccccc3ci           1  0.00  oHccci2
     6  0.01  4o2                  2  0.00  oc3ci                1  0.00  on                   1  0.00  oHcccix
     6  0.01  4occc3ci             2  0.00  occcci               1  0.00  oxo2ox               1  0.00  oHccc3cix
     5  0.00  2                    2  0.00  occcc3ci             1  0.00  oxoxcccci            1  0.00  oHccccic
     5  0.00  o2ox                 2  0.00  ocHccci              1  0.00  oxoxHccci            1  0.00  o4ox
     5  0.00  o2ci                 2  0.00  oPoxci               1  0.00  oxoHci2              1  0.00  o4oPcccc3ci
     5  0.00  xcccc3               2  0.00  oHcixox              1  0.00  oxoHcccc3ci          1  0.00  o4oHci
     5  0.00  xccccHcci            2  0.00  oHcixc3ci            1  0.00  oxo                  1  0.00  o4o
     5  0.00  com                  2  0.00  oHcc3cix             1  0.00  oxxof                1  0.00  o
     5  0.00  c3ci2ci              2  0.00  xccx                 1  0.00  oxcimci              1  0.00  x2
     5  0.00  cccPccci             2  0.00  xccccHccc3ci         1  0.00  oxccixci             1  0.00  xo2cin
     5  0.00  4oHccc3              2  0.00  xcccHci              1  0.00  oxccco2              1  0.00  xo2cim
     4  0.00  oxPcccc3ci           2  0.00  xcc                  1  0.00  oxccc3ci2            1  0.00  xo2cix
     4  0.00  oHcixo2              2  0.00  coxcccc3ci           1  0.00  oxccccif             1  0.00  xoxo2
     4  0.00  oHcixci              2  0.00  coHccc3ci            1  0.00  oxccccc3ci           1  0.00  xoxof
     4  0.00  oHcc3ci2             2  0.00  ci2ox                1  0.00  oxccccc3             1  0.00  xoxc3ci
     4  0.00  xoxci                2  0.00  ci2ci                1  0.00  oxccccc              1  0.00  xoxHcim
     4  0.00  xo                   2  0.00  cimci                1  0.00  oxccccHcci           1  0.00  xoc3
     4  0.00  x                    2  0.00  cixccc3ci            1  0.00  oxccHci              1  0.00  xoccci
     4  0.00  coxHccc3ci           2  0.00  ciccccc3ci           1  0.00  oxPocHcci            1  0.00  xocccc3ci
     4  0.00  cin                  2  0.00  ciHccc3ci2           1  0.00  oxHom                1  0.00  xoHcif
     4  0.00  c3x                  2  0.00  c3cixo               1  0.00  oxHcif               1  0.00  xxc3ci2
     4  0.00  cccc3ci2             2  0.00  c3cixccc3ci          1  0.00  oxHcic3ci            1  0.00  xcif
     4  0.00  ccccc                2  0.00  c3cixcccc3ci         1  0.00  oxHcc3               1  0.00  xc3ox
     4  0.00  cccH                 2  0.00  c3ciHcc3ci           1  0.00  oxHccci2             1  0.00  xc3ci2
     4  0.00  cPccci               2  0.00  c3ciHccc3ci          1  0.00  ox4o                 1  0.00  xc3cim
     4  0.00  Pcccc3ci2            2  0.00  c3ccccc3ci           1  0.00  ocicccci             1  0.00  xc3cixo2
     4  0.00  4oxHci               2  0.00  c3Hcc3ci             1  0.00  oc3ci2               1  0.00  xc3
     4  0.00  4ocHccci             2  0.00  ccoHci               1  0.00  oc3cim               1  0.00  xcco
     4  0.00  4oHco2               2  0.00  cccoHci              1  0.00  oc3cix               1  0.00  xccci2
     4  0.00  4oHcif               2  0.00  ccccixci             1  0.00  oc3cccc3ci           1  0.00  xccc3ci4o
     4  0.00  4oHcc3ci2            2  0.00  ccccPcci             1  0.00  occix                1  0.00  xcccco2
     4  0.00  4o                   2  0.00  ccccHccix            1  0.00  occc3ci              1  0.00  xcccci2
     3  0.00  2cif                 2  0.00  cccPcccc3ci          1  0.00  occccin              1  0.00  xccccif
     3  0.00  o2cim                2  0.00  cccHcif              1  0.00  occccci              1  0.00  xcccc3ci2ci
     3  0.00  oxcccc3              2  0.00  cccHc                1  0.00  occcPoxc             1  0.00  xcccc3ci2
     3  0.00  ocHcci               2  0.00  ccHcix               1  0.00  ocHco2               1  0.00  xcccc3cix
     3  0.00  oHccc3               2  0.00  ccHccci              1  0.00  ocHccox              1  0.00  xccccc
     3  0.00  oH                   2  0.00  Poxccci              1  0.00  ocHccc3ci            1  0.00  xccccHcix
     3  0.00  xcccHcci             2  0.00  Poxccc3ci            1  0.00  oPcixci              1  0.00  xccccHccci
     3  0.00  coxcccci             2  0.00  Poxcccc3ci           1  0.00  oPcixcccc3ci         1  0.00  xcccPccc3ci
     3  0.00  coxHccci             2  0.00  HoHox                1  0.00  oPcixHcim            1  0.00  xccc
     3  0.00  cif                  2  0.00  Hccc3                1  0.00  oPcccixci            1  0.00  xPox
     3  0.00  cixci                2  0.00  4oxHccc3ci           1  0.00  oPcccc3cix           1  0.00  xPccc3ci
     1  0.00  xPcccc3ci            1  0.00  c3cixcim             1  0.00  cccccHcci            1  0.00  Hcic
     1  0.00  xHoc3cix             1  0.00  c3cixcix             1  0.00  cccccHccci           1  0.00  Hci4oHci
     1  0.00  xHxcccci             1  0.00  c3cixHci             1  0.00  ccccPccc3ci          1  0.00  Hcc3ciox
     1  0.00  xHx                  1  0.00  c3cicHcci            1  0.00  ccccHco              1  0.00  Hcc3cix
     1  0.00  xHcoxc3ci            1  0.00  c3ciHcim             1  0.00  ccccHccc3ccci        1  0.00  HcccoHccc3ci
     1  0.00  xHcifo               1  0.00  c3ciHci              1  0.00  cccPox               1  0.00  Hccci2
     1  0.00  xHcixo2              1  0.00  c3ciHcci             1  0.00  cccPci               1  0.00  Hcccix
     1  0.00  xHcc3cci             1  0.00  c3ciHccci            1  0.00  cccPccc3ci           1  0.00  Hccc3oxHc3ci
     1  0.00  xHccc3               1  0.00  c3ccoxcic3           1  0.00  cccP                 1  0.00  Hccc3cif
     1  0.00  coxo2                1  0.00  c3ccccox             1  0.00  cccHco2              1  0.00  Hcccc3ci2
     1  0.00  coxci                1  0.00  c3ccccc3cix          1  0.00  cccHccix             1  0.00  H
     1  0.00  coxc3ci              1  0.00  c3ccccc              1  0.00  cccHcc3              1  0.00  4o2cim
     1  0.00  coxcccox             1  0.00  c3cccHccc3ci         1  0.00  cccHcccix            1  0.00  4oox
     1  0.00  coxccHcix            1  0.00  c3Hccci              1  0.00  cccHcccc3            1  0.00  4on
     1  0.00  coxHci               1  0.00  c3Hcccci             1  0.00  ccPcim               1  0.00  4oxc3ci
     1  0.00  coxHcci              1  0.00  cxc                  1  0.00  ccPcccc3ci           1  0.00  4oxHci2ci
     1  0.00  coxHcc3ci            1  0.00  cco2                 1  0.00  ccPccccc3ci          1  0.00  4oxHccoxci
     1  0.00  coc3ci2              1  0.00  ccoxo                1  0.00  ccHcox               1  0.00  4oxHcci
     1  0.00  cocHccci             1  0.00  ccoxci               1  0.00  ccHci2ox             1  0.00  4oxHcccci
     1  0.00  coHox                1  0.00  ccoxcccci            1  0.00  ccHci2               1  0.00  4oci2
     1  0.00  ci2o2                1  0.00  ccoxHcccci           1  0.00  ccHcim               1  0.00  4ocim
     1  0.00  ci2oHcci             1  0.00  ccoc3cim             1  0.00  cc                   1  0.00  4ocix
     1  0.00  ci2ci2               1  0.00  ccocPccc3ci          1  0.00  cPcox                1  0.00  4ociHci
     1  0.00  cioHci               1  0.00  ccocHcci             1  0.00  cPcci2               1  0.00  4oc3ciHcci
     1  0.00  cixo2ci              1  0.00  ccoHcim              1  0.00  cPccix               1  0.00  4oc3ccci
     1  0.00  cixo2                1  0.00  ccoHcix              1  0.00  cPcc3o2              1  0.00  4oc3ccc3ci
     1  0.00  cixox                1  0.00  ccoHccc3ci           1  0.00  cPcccci              1  0.00  4oc3cccc3
     1  0.00  cixcixc3ci           1  0.00  cci2cix              1  0.00  cPcccc3ci            1  0.00  4occcci
     1  0.00  cixcccc3ci           1  0.00  ccino                1  0.00  cHco2                1  0.00  4occHccc3ci
     1  0.00  cixccccc             1  0.00  ccixci               1  0.00  cHccin               1  0.00  4ocHcci
     1  0.00  cic3ci2cix           1  0.00  ccixcccc3            1  0.00  cHccix               1  0.00  4ocHccc3ci
     1  0.00  cic3ci2cccci         1  0.00  ccixHccci            1  0.00  Pom                  1  0.00  4ocHccc
     1  0.00  cic3cim              1  0.00  cciHcim              1  0.00  Poxci2ci2            1  0.00  4oPc3ci
     1  0.00  cic3ci               1  0.00  ccx2                 1  0.00  Poxcim               1  0.00  4oPccc3cix
     1  0.00  ciccccixc3ci         1  0.00  ccxc3cim             1  0.00  Poxcixccc3ci         1  0.00  4oP
     1  0.00  cicccc3ci2           1  0.00  ccxc3ci              1  0.00  Poxc3ciHci           1  0.00  4oHom
     1  0.00  cicccc3              1  0.00  ccxccPccccci         1  0.00  Poxc3cccc3ci         1  0.00  4oHoxox
     1  0.00  cicccccci            1  0.00  ccxc                 1  0.00  Poxccc3cix           1  0.00  4oHoxc3ci
     1  0.00  ciccccc              1  0.00  cccoxox              1  0.00  Poxcccci             1  0.00  4oHoxcccc3ci
     1  0.00  ciccccHcci           1  0.00  cccoxo               1  0.00  Poxcccc3             1  0.00  4oHoPci
     1  0.00  ciccccHccci          1  0.00  cccoxc3ci            1  0.00  PoxHcccox            1  0.00  4oHcoxci
     1  0.00  cicPcim              1  0.00  cccoxccc3ci          1  0.00  PoxHccci             1  0.00  4oHci2ci
     1  0.00  cicHccci             1  0.00  cccoxcccci           1  0.00  PoHcin               1  0.00  4oHci2c3ci
     1  0.00  ciPcccci             1  0.00  ccci2o2              1  0.00  PciHcc3ci            1  0.00  4oHciix
     1  0.00  ciPcccc3ci           1  0.00  ccci2ci              1  0.00  Pc3ox                1  0.00  4oHcixox
     1  0.00  ci                   1  0.00  cccixoxci            1  0.00  Pc3cixcco2           1  0.00  4oHcixc3c3ci
     1  0.00  c32ox                1  0.00  ccciHccci            1  0.00  Pcccoxc3ci           1  0.00  4oHcixcccci
     1  0.00  c3o2cim              1  0.00  ccc3ox               1  0.00  Pccci2ox             1  0.00  4oHcixc
     1  0.00  c3oxcccc3ci          1  0.00  ccc3cin              1  0.00  Pcccc3cix            1  0.00  4oHciccc3ci
     1  0.00  c3oxccccc3ci         1  0.00  ccc3cif              1  0.00  Ho2oxcccc3ci         1  0.00  4oHc3ci
     1  0.00  c3oxPcccc3ci         1  0.00  ccc3cix              1  0.00  Ho2ccci              1  0.00  4oHcci2
     1  0.00  c3oxHcc3ci           1  0.00  cccco2               1  0.00  Hoxc3cix             1  0.00  4oHccix
     1  0.00  c3oHcc3ci            1  0.00  ccccixo2             1  0.00  Hoxc3ci              1  0.00  4oHcc3o2
     1  0.00  c3oHccci             1  0.00  cccciHci             1  0.00  Hoxccc3ci            1  0.00  4oHcc3cim
     1  0.00  c3xcccc3ci           1  0.00  cccc3o2              1  0.00  Hoxccc3              1  0.00  4oHcc3cc3ci
     1  0.00  c3ci2o2ci            1  0.00  cccc3cif             1  0.00  HoxPci               1  0.00  4oHcc3cccc3ci
     1  0.00  c3ci2of              1  0.00  cccc3cixc3ci         1  0.00  HoxHcci              1  0.00  4oHcccc3
     1  0.00  c3ci2cccci           1  0.00  cccc3ciHci2          1  0.00  HocHccci             1  0.00  4oHccccHcci
     1  0.00  c3ci2ccccc3cix       1  0.00  cccc3ccci            1  0.00  HoHcixci             1  0.00  4oHccc
     1  0.00  c3ciix               1  0.00  cccccox              1  0.00  Hci2cix              1  0.00  4oHcc
     1  0.00  c3cixox              1  0.00  cccccim              1  0.00  Hci2ci               1  0.00  4o4oHccci
     1  0.00  c3cixci2ci           1  0.00  cccccix              1  0.00  Hcif             -----  ----  ----
     1  0.00  c3cixci2c3           1  0.00  ccccc3ci2            1  0.00  Hcixci            1131  1.00  TOT
     1  0.00  c3cixci2             1  0.00  ccccc3

Last edited on 97-11-10 by stolfi
Also edited on 97-07-17 by stolfi