Hacking at the Voynich manuscript - Side notes
028 Page-specific words

Last edited on 1998-07-16 18:43:02 by stolfi

In Note 015 I prepared colorized versions of the VMs pages,
highlighting "special" words - those that are exceptionally
concentrated in one or more pages.  Let's list those special
words and look at them...

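Let's get the sec-*.pwct files from Note 015, with the f-numbers in
field 2 mapped to p-numbers and the lines sorted by field 4 and then
numerically by field 2, and then run compute-chunkiness on them: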

  foreach sec ( hea heb bio pha ) 
    cat ../015/sec-${sec}.pwct \
      | map-field \
          -v table=fnum-to-pnum.tbl \
          -v inField=2 \
          -v outField=2 \
      | gawk '/./{printf "%4d %s %-6s %s\n", $1,$2,$3,$4}' \
      | sort -b -k4,4 -k2,2n \
      > sec-${sec}.pwct 
  end
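
The map-field script itself is not reproduced in this note. As a rough
sketch of what that step presumably does, here is a stand-in written in
plain sh and awk (not the csh used above). The function name
map_field_sketch, the two-column "key value" table layout, and the choice
to pass unknown keys through unchanged are all my assumptions, and the
demo line is made up, not VMs data.

```shell
#!/bin/sh
# Hypothetical stand-in for map-field (the real script is not shown here).
# Reads lines on stdin, looks up field inField in a two-column table
# ("key value" per line, an assumed layout), and stores the translation
# in field outField.  Keys missing from the table are left unchanged.
map_field_sketch() {
  # $1 = table file, $2 = input field number, $3 = output field number
  awk -v table="$1" -v inField="$2" -v outField="$3" '
    BEGIN {
      # Load the translation table into an associative array.
      while ((getline line < table) > 0) {
        n = split(line, kv, " ")
        if (n >= 2) map[kv[1]] = kv[2]
      }
      close(table)
    }
    /./ {
      key = $inField
      if (key in map) $outField = map[key]
      print
    }
  '
}

# Demo with invented data: one f-number to p-number table, one input line.
tbl=$(mktemp)
printf 'f1r 1\nf1v 2\n' > "$tbl"
printf '3 f1v x aaa\n' | map_field_sketch "$tbl" 2 2   # prints: 3 2 x aaa
rm -f "$tbl"
```

Assigning to a field makes awk rebuild the line with single spaces, which
is harmless here because the gawk printf step reformats the columns anyway.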

  foreach sec ( hea heb bio pha ) 
    cat sec-${sec}.pwct \
      | compute-chunkiness \
      | sort -k1,1g \
      > sec-${sec}.pgen
  end
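
The compute-chunkiness script is not shown in this note either, so its
actual formula is unknown here. Purely as an illustration of the kind of
score that would single out page-specific words, the sketch below uses a
made-up measure: the largest share of a word's occurrences that falls on
a single page. The function name chunkiness_sketch, the measure itself,
the output layout, and the reading of the input fields (count in field 1,
page in field 2, word in field 4) are all assumptions; the sample words
are placeholders.

```shell
#!/bin/sh
# Hypothetical stand-in for compute-chunkiness; NOT the real formula.
# Assumed input fields: $1 = count, $2 = page number, $4 = word.
# Output: "score total word", where score = (largest per-page count) / total.
chunkiness_sketch() {
  LC_ALL=C awk '
    /./ {
      cnt = $1; page = $2; word = $4
      total[word] += cnt
      perpage[word, page] += cnt
      # Track the biggest single-page count seen so far for this word.
      if (perpage[word, page] > peak[word]) peak[word] = perpage[word, page]
    }
    END {
      for (w in total)
        printf "%.3f %d %s\n", peak[w] / total[w], total[w], w
    }
  '
}

# Demo with invented counts: "aaa" is concentrated on page 1,
# "bbb" is spread evenly over pages 1 and 2.
printf '4 1 x aaa\n1 2 x aaa\n2 1 x bbb\n2 2 x bbb\n' \
  | chunkiness_sketch \
  | sort -k1,1n
# prints:
# 0.500 4 bbb
# 0.800 5 aaa
```

With this toy score, a word that occurs on only one page gets 1.000
whatever its frequency, so a real measure would probably have to
discount rare words somehow.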