# Last edited on 2002-01-15 17:38:06 by stolfi
# The ACIP-JS Tibetan script transcription alphabet
#
# The Tibetan script encoding described below is a slight variant of the
# ACIP encoding used by the ACIP (Asian Classics Input Project),
# as described at http://www.asianclassics.org/ on 2002-01-04
#
# The encoding was slightly modified by J. Stolfi to
# simplify the use of existing statistical analysis tools.
#
# Consonants
#
# K G T D P B t d C S DZ Z
# KH GH TH DH PH BH th dH CH SH DZH ZH
# N M n J W Y R L NG NY TZ TS sh H Ksh '
#
# Vowels after consonants
#
# \ backlash below (sanskrit virama)
# A no mark
# I left-hook above
# U pipe below
# Ri right-hook above and spout below
# Li "21" below
# E spout above
# EE double-spout above
# O wings above
# OO double-wings above
#
# 'A "2" below
# 'I "2" below and left-hook above
# 'U "2" below and pipe below
# R'i right-hook above and spout below and "2" below
# L'i right-hook above and "21" below and "2" below
#
# Vowels in initial position:
#
# AA "6V"
# AI "6V" with left-hook above
# AU "6V" with pipe below
# Ri "T" with righ-hook above and spout below
# Li "21" with right-hook above
# AE "6V" with spout above
# AEE "6V" with double-spout above
# AO "6V" with wings above
# AOO "6V" with double-wings above
#
# A'A "6V" with "2" below
# A'I "6V" with "2" below and left-hook above
# A'U "6V" with "2" below and pipe below
# R'i "T" with spout below and "2" below
# L'i "21" with "2" below
#
# Vowel modifiers (postfix)
#
# m circle above
# : colon with open circles at left (sanskrit visarga )
#
# Punctuation:
#
# & sanskrit apostrophe - said to look like a "NYA"
# , nail (expression delimiter, shad)
# ,, double nail (section delimiter)
# ~~ double scroll
# ^ like Greek lowercase upsilon
# ` nail with corn-dots (rin-chen-sprung-shad?)
# o small circle under preceding syllable
# x small "x" under preceding syllable
# ~~~ triple scroll
# % small circle-on-bowl under preceding syllable
# ; nail-like stroke with three dots above (sbrul-shad?)
# / open or close tibetan brace (ang khang gton/gyas?)
# space top dot separator syllables (tsheg)
# * unreadable, dubious, or unrepresentable character
# X-Y X and Y are separate letters, side by side
# X+Y sanskrit stack of letter X over letter Y
#
# Formatting codes:
#
# รท line break in original book.
# ## page NNNX original page numbers (NNN = number, X = "A" front,"B" back)
# #XXXX comment line (not part of the original text).
# {=XXXX} preceding char/word was XXXX in the ACIP file
# {XXXX} embedded comment (not part of the original text).