Hacking at the Voynich manuscript - Side notes
110 Preparing the word rank-frequency (Zipf's law) plots

Last edited on 2001-01-05 10:24:40 by stolfi

INTRODUCTION

  In this note we produce the rank-frequency (Zipf's law)
  plots for the words of Voynichese and of other languages.
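
  For reference, a rank-frequency table lists the distinct words of a
  sample in order of decreasing count, numbered 1, 2, 3, and so on.
  Zipf's law says that frequency falls off roughly as the inverse of
  rank, so the points lie near a straight line on log-log axes. The
  Python sketch below is only an illustration of that computation: it
  is not one of the scripts linked in this note, it does not read or
  write the ".wfr" format, and the function name and sample tokens are
  made up for the example.

```python
from collections import Counter


def rank_frequency(words):
    """Return (rank, word, count, relative frequency) tuples, most frequent first.

    Hypothetical helper for illustration; not part of the note's toolchain.
    """
    counts = Counter(words)
    total = sum(counts.values())
    # Sort by decreasing count; break ties alphabetically so the order is reproducible.
    ordered = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [
        (rank, word, count, count / total)
        for rank, (word, count) in enumerate(ordered, start=1)
    ]


# Toy sample (made-up tokens, not taken from the data files).
sample = "daiin ol daiin chedy ol daiin".split()
table = rank_frequency(sample)
for rank, word, count, freq in table:
    print(rank, word, count, f"{freq:.3f}")
```

  Plotting the relative frequency against the rank, both on
  logarithmic scales, gives the kind of curve compared in the
  figures prepared in this note.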
  
SETTING UP THE ENVIRONMENT

  Links:
  
    ln -s ../../capitalize-ligatures
    ln -s ../../compute-cum-cum-freqs
    ln -s ../../compute-cum-freqs
    ln -s ../../compute-freqs
    ln -s ../../combine-counts
    ln -s ../../remove-freqs
    ln -s ../../totalize-fields
    ln -s ../../select-units
    ln -s ../../words-from-evt
    ln -s ../../format-counts-packed
    
    ln -s ../100/data
    ln -s ../101/lang

  Paper directories (tables and figures):

    set tbldir = "/home/staff/stolfi/papers/voynich-words/techrep/tables/auto"
    set figdir = "/home/staff/stolfi/papers/voynich-words/techrep/figures/auto"

COMPARING THE ZIPF PLOTS WITH OTHER LANGUAGES

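  Preparing plots of word frequency vs. word frequency rank, one for
  the running text and one for the labels. Each loop item has the
  form "kind/const"; the csh modifiers ":h" and ":t" split it into the
  sample kind ("text" or "labs") and the numeric argument that is
  passed through to compare-freq-vs-rank-distribs. The resulting EPS
  files are moved to the figures directory set above: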
  
    foreach kc ( text/0.20 labs/0.22 )
      set kind = "${kc:h}"; set const = "${kc:t}"
      set title = "text"; set opt = ( )
      if ( "${kind}" == "labs" ) then
        set title = "labels"; set opt = ( "-noylabels" )
      endif
      set ofile = "langs-${kind}-zipf.eps"; echo "${ofile}"
      compare-freq-vs-rank-distribs ${opt} ${const} \
          lang/${kind}/voyn.wfr "Voynichese (${title})" \
          lang/${kind}/engl.wfr "English"    \
          lang/${kind}/latn.wfr "Latin"      \
        > ${ofile}
      mv -b ${ofile} ${figdir}/
    end

  
  
