# Last edited on 2002-01-15 17:38:06 by stolfi
# The ACIP-JS Tibetan script transcription alphabet
# 
# The Tibetan script encoding described below is a slight variant of the 
# ACIP encoding used by the ACIP (Asian Classics Input Project),
# as described at http://www.asianclassics.org/ on 2002-01-04
# 
# The encoding was slightly modified by J. Stolfi to 
# simplify the use of existing statistical analysis tools.
# 
# Consonants
# 
#    K  G  T  D  P  B  t  d  C  S  DZ  Z  
#    KH GH TH DH PH BH th dH CH SH DZH ZH 
#    N M n J W Y R L NG NY TZ TS sh H Ksh ' 
#
# Vowels after consonants
# 
#    \   backlash below (sanskrit virama)
#    A   no mark
#    I   left-hook above
#    U   pipe below
#    Ri  right-hook above and spout below
#    Li  "21" below
#    E   spout above
#    EE  double-spout above
#    O   wings above
#    OO  double-wings above
# 
#    'A  "2" below
#    'I  "2" below and left-hook above
#    'U  "2" below and pipe below
#    R'i right-hook above and spout below and "2" below  
#    L'i right-hook above and "21" below and "2" below
# 
# Vowels in initial position:
# 
#    AA   "6V"
#    AI   "6V" with left-hook above
#    AU   "6V" with pipe below
#    Ri   "T"  with righ-hook above and spout below
#    Li   "21" with right-hook above
#    AE   "6V" with spout above
#    AEE  "6V" with double-spout above
#    AO   "6V" with wings above
#    AOO  "6V" with double-wings above
# 
#    A'A  "6V" with "2" below
#    A'I  "6V" with "2" below and left-hook above
#    A'U  "6V" with "2" below and pipe below
#    R'i  "T"  with spout below and "2" below
#    L'i  "21" with "2" below
# 
# Vowel modifiers (postfix)
# 
#    m  circle above
#    :  colon with open circles at left (sanskrit visarga )
#    
# Punctuation: 
# 
#   &      sanskrit apostrophe - said to look like a "NYA"
#   ,      nail (expression delimiter, shad)
#   ,,     double nail (section delimiter) 
#   ~~     double scroll
#   ^      like Greek lowercase upsilon
#   `      nail with corn-dots (rin-chen-sprung-shad?)
#   o      small circle under preceding syllable
#   x      small "x" under preceding syllable
#   ~~~    triple scroll
#   %      small circle-on-bowl under preceding syllable 
#   ;      nail-like stroke with three dots above (sbrul-shad?)
#   /      open or close tibetan brace  (ang khang gton/gyas?)
#   space  top dot separator syllables (tsheg)
#   *      unreadable, dubious, or unrepresentable character
#   X-Y    X and Y are separate letters, side by side
#   X+Y    sanskrit stack of letter X over letter Y
# 
# Formatting codes:
# 
#   รท             line break in original book.
#   ## page NNNX  original page numbers (NNN = number, X = "A" front,"B" back)
#   #XXXX         comment line (not part of the original text).
#   {=XXXX}       preceding char/word was XXXX in the ACIP file
#   {XXXX}        embedded comment (not part of the original text).