<?xml version="1.0" encoding="utf-8"?>
<tei.2><teiheader><filedesc><titlestmt><title>The Tibetan and Himalayan Digital
 Library<ent value="rsquo"></ent>s Tibetan Literature Archive at the University
 of Virginia</title><author>Germano, David</author><respstmt><resp>Consultant</resp><name>Hillis,
 Greg</name><resp>Consultant</resp><name>Weinberger, Steve</name><resp>Consultant</resp><name>Pitti,
 Daniel</name><resp>Consultant</resp><name>Martin, Worthy</name><resp>Consultant</resp><name>Bingler,
 Robert</name><resp>Consultant, Revisor and SGML Markup</resp><name>Garson,
 Than</name></respstmt></titlestmt><publicationstmt><publisher>Institute for
 Advanced Technology in the Humanities</publisher><pubplace>University of
Virginia</pubplace><pubplace>Charlottesville,  Va.</pubplace></publicationstmt><sourcedesc><p></p></sourcedesc></filedesc></teiheader><text><front
><head>THDL<ent value="rsquo"></ent>s Tibetan Literature Archive: Technical
Notes</head><titlepage><titlepart>The  Tibetan and Himalayan Digital Library<ent
value="rsquo"></ent>s Tibetan Literature Archive: Technical Notes</titlepart><byline>Prepared
by David Germano with  the assistance of Gregory Hillis, Nathaniel Garson,
Steven Weinberger, Daniel  Pitti, Worthy Martin and Robert Bingler</byline><byline>Institute
of Advanced  Technology in the Humanities, the University of Virginia</byline><docdate>June
 1, 1999 (Revised February, 2001)</docdate></titlepage></front>

<body><head>Technical Documentation for Tibetan Literary Collections</head><div1 type="chapter"><head>I. Digital Cataloging
 and Representation of Tibetan Texts: The Rise Of Thematic Research Collections</head><p>The
 Tibetan Literature Archive section of the <xref n="http://faculty.virginia.edu/tibet-initiative/library/frameset.html">Tibetan
and Himalayan Digital Library</xref> is devoted  to in-depth digital cataloging
of Tibetan literature as part of a broader  thematic research collection.
 This entails that the cataloging operates as  an integrated means of accessing
the digital versions of the texts themselves  in the original Tibetan and
in translation, as well scholarly analysis and  commentary on those texts.</p><p>Digital
technology has enabled the creation  of what we term <ent value="ldquo"></ent>thematic
research collection<ent value="rdquo"></ent>.  An archive is traditionally
understood as a secondary by-product accumulating  around the life of a person
or organization.  We are using this terminology  to distinguish it from what
archivists and manuscript librarians usually mean  when they use the terms <ent
value="ldquo"></ent>collection<ent value="rdquo"></ent>  and <ent value="ldquo"></ent>archives<ent
value="rdquo"></ent>.  An <ent value="ldquo"></ent>archive<ent value="rdquo"></ent>
in this sense thus coheres by provenance and not by intention,  in contrast
to an ordinary <ent value="ldquo"></ent>artificial collection<ent value="rdquo"></ent>
 which is the result of intentional acquisition of materials for the purpose
 of building a collection. In contrast to a traditional archive, <ent value="ldquo"></ent>a
 thematic research collection<ent value="rdquo"></ent> signifies a relatively
new  model that has been proliferating under the stimulation of the incorporation
 of technology into humanities scholarship. </p><p>A thematic research collection
 begins with the selection of a scholarly theme, which could be an author,
 a genre, a movement, a city, a historical period, or a canonical body of
literature,  such as in our own case. It is thus intentional in that one formulates
a criteria  and then builds a collection based on that criteria. Following
the selection  of the theme, all primary, secondary and reference resources
relevant to the  study of that theme must be identified.  The collection then
accumulates the  relevant resources in primary, secondary and reference literature,
and then  actively integrates them in a single, interlinked medium, unlike
a traditional  print collection. </p><p>Another characteristic is that in
addition to simultaneously  providing direct access and intellectual access
to these resources, a thematic  research collection also functions as a publisher.
In this aspect, it facilitates  and organizes focused research on a particular
theme or area, such that it  integrates the activities and interests of archivists,
librarians, publishers,  and scholars.  The nature of electronic on-line publishing
entails that the  products of its publishing activities are intimately interlinked,
exegetical  literature to the primary literature it comments on, reference
works to the  exegetical literature, and so on. In addition, the publishing
can be open-ended,  with products amenable to constant update, thus allowing
for dialogues and  conversations to emerge precisely due to the extended temporality
of published  texts, or texts in the process of publication. Another element
of publishing  in a thematic research collection is the possibility of proactively
soliciting  and organizing interlinked scholarship in ways enabled uniquely
by this extended  temporality. What we have in mind is projects that constitute
collaborative  reference works without traditional limitations on length,
which are directly  linked into the primary resources and scholarship rather
than standing alone,  and which can be constantly updated with reference to
the original and revised  entries. This might take the form of an encyclopedia
of terminology, a biographical  database, and so forth. </p><p>This unitary
and interlinked character of the  publishing enables a thematic research collection
to integrate the traditionally  distinct work of philologists and interpretive
scholars, text scholars and  art historians, and in general scholars from
different disciplines. It also  attempts to bridge the traditional distances
between modern Euro-American  scholarship and traditional Asian scholarship.
Most interestingly, the integration  of these different scholars into a single
interlinked site, as well as the  ability to respond to other works through
revision of one's original work,  stimulates unprecedented discussions across
disciplinary and other boundaries.  Such conversations are often largely implicitly
present in the academy, but  the revisable and interlinked nature of a thematic
research collection renders  such discussions explicit. In addition, the ability
to publish short contributions,  as well as eliminate temporal lag between
composition and distribution, further  encourages the development of such
dialogues across boundaries. The ultimate  result is that a thematic research
collection can become the foundation for  the emergence of a new community
of actively interacting scholars bound together  by a common subject area
across disciplinary and regional boundaries. The  collection becomes a site
where primary resources are consulted, reference  materials are accessed,
scholars publish articles, scholars read new cutting  edge scholarship on
their subject, and explicit real time exchange take place. </p><p>Technology
 has offered new possibilities of collaboration that did not exist previously
 - rather than taking extant models and digitizing them, the new medium is
 enabling us to come up with entirely new models.  We would like to briefly
 point out some of the obvious elements of technology which enable new collaborative
 possibilities:</p><head rend="list">1. Open-ended publishing</head><p>The
 digital nature of the medium enables an open-ended publishing process capable
 of continual revision and expansion. Projects no longer have to have finite
 time lines that terminate an irrevocably fixed form. This open-ended quality
 of digital publication enables the gradual evolution of dialogue and synergy
 across all types of scholarly boundaries, thereby enabling collaboration
as  well as stimulating it. </p><head rend="list">2. Granularity of publishing</head><p>Electronic
 publication enables small, granular contributions such as chapter summaries,
 or even mere footnotes, to be formally published as part of a larger venture.
 This is a function of the integration of publication, collection maintenance,
 and scholarship. This not only widens the participatory nature of a project,
 but it also helps dissolve the strict boundaries of publications and authorship.
  </p><head rend="list">3. Immediacy of publishing</head><p>The Internet enables
 the incorporation of on-line communication channels into scholarly projects,
 as well as the collapsing of time lags between composition and circulation.
 It also expands access in terms of financial and geographical barriers.</p><head
rend="list">4. Publishing links between diverse types of texts, images and
 sounds</head><p>Hypertext enables direct connections between diverse textual
 work as well as images: (i) text critical work, (ii) translations, (iii)
reference  materials, (iv) interpretive scholarship, and (v) visual images
and sounds. </p><head rend="list">5. Searching links across domains in publishing</head><p>The
availability  of sophisticated searching operations, that operate simultaneously
across  diverse textual domains, there by establishing direct links between
them within  the medium of publishing itself.</p><head rend="list">6. Integration
of scholarship,  publication and collection maintenance</head><p>In short,
a thematic research  collection both demands and enables collaboration, as
outlined above.</p><p>While  at present the main focus is on cataloging the
collections, such as <hi rend="italics">The  Collected Tantras of the Ancients</hi>,
preliminary work is also thus being  done on digitally representing these
texts in a variety of formats. There  will be five different presentations
for each text: extended Wylie transliteration,  typed in Tibetan script representations,
translations into English and other  non-Tibetan languages, scanned digital
images of the original Tibetan manuscripts,  and electronic critical editions.
Each type of representation presents its  own unique problems and issues both
on technical and intellectual fronts.  This section discussions these issues,
solutions we have devised for some,  and issues on which we continue to work.
The topics are:<list n="" rend="u"><item>Issues  in Tibetan Literature</item><item>Identifying
the Boundaries of a Text</item><item>Cataloging  the Structure of a Text</item><item>Interfacing
a Catalog and a Textual Representation</item><item>Digital  Images of Folios</item><item>Extended
Wylie Transliteration</item><item>Tibetan  Script Representations</item><item>Critical
Editions</item><item>Translations</item></list></p><div2 type="section"><head>Issues
in Tibetan Literature</head><div3><head>Authorship,  Text, and Classification
Schemes in a Tibetan Buddhist Canon</head><p>Religious  canons of literature
often involve texts attributed to divine voices or inspiration  rather than
to the actual human person who delivered the text into this world  as the
agent of at least its dissemination.  In a Buddhist context, these  divine
voices are most typically the various Buddhas inhabiting the cosmos,  though
they can also be Bodhisattvas or other divinities. In addition, reincarnation
 raises additional problems, since people might claim to be now revealing
texts  they authored or received from famous saints in previous lifetimes.
In a related  but different practice, Buddhist texts are often attributed
to much earlier  saints in order to gain legitimacy. This results in complicated
questions  of authorship for non-divine texts as well.  Thus visionary practices
and  assumptions concerning the production of texts complicates issues of
authorship  compared to the much more simple practices of modern literary
production.    Finally, even at the very mundane level of ordinary composition,
we often  find <ent value="ldquo"></ent>group<ent value="rdquo"></ent> composition
where a number  of hands are deeply involved in the actual formation of the
text - the purported <ent value="ldquo"></ent>author<ent value="rdquo"></ent>
may have simply given a lecture  which inspired the composition, or even simply
have commanded the composition,  which is actually written by several other
people, and then further revised  by still others.</p><p>From a modern critical
point of view, such are thus  often understood as a compilation of heterogeneous
materials belonging to  different authorial hands and different temporal periods.
 In this way, from  modern and traditional perspectives, religious canons,
and in particular Buddhist  canons, can be quite complicated in terms of identifying
the precise identity  and character of an <ent value="ldquo"></ent>author<ent
value="rdquo"></ent>. Modern  notions of <ent value="ldquo"></ent>authorship<ent
value="rdquo"></ent> are just not  adequate with their assumptions of a single
author producing a easily defined  text with his/her name stamped on it. Instead
we must identify an entire set  of figures, including <ent value="ldquo"></ent>divine
author<ent value="rdquo"></ent>, <ent value="ldquo"></ent>original redactor<ent
value="rdquo"></ent>, <ent value="ldquo"></ent>concealer<ent value="rdquo"></ent>, <ent
value="ldquo"></ent>revealer<ent value="rdquo"></ent>, and so  forth, each
of whom may have had a hand in the actual shape of the received  text. We
must also be sensitive to how <ent value="ldquo"></ent>reincarnation<ent value="rdquo"></ent>
or notions of inspiration can blur historical differences  between figures
involved in this process. We are thus dealing with this problem  by not simply
documenting the <ent value="ldquo"></ent>author<ent value="rdquo"></ent>;
 instead, utilizing traditional nomenclature, we document every single figure
 involved in this process of creating the text, both human and divine.  We
 thus track authorial pedigree across domains - the human and the divine -
 and lives - the multiple lives of any given individual during which s/he
may  have had a hand in the text<ent value="rsquo"></ent>s final form.</p><p>When
we  turn to notions of what constitutes a text, we also face problems that
cannot  be dealt with when we assume modern notions of a single, stable text
that  view unacknowledged textual borrowings as simple plagiarism. At the
simplest  level, we have the Tibetan compositional practice of writing literary
cycles  of many short and long interdependent texts. We have to recall this
was before  the days of journals to which authors could submit shorter pieces,
or edited  volumes to which authors submitted individual chapters. In addition,
the same  authors would pen texts that were ritual, yogic, devotional, and
philosophical  in character.</p><p>Many anthologies and/or canons contain
texts which are  often short and group together many such texts of different
authorship. In  addition, there is the problem of extensive intertextuality:
entire sections  are shared by texts which claim different authorship and
titles. Often a given  text seems clearly to function as an anthology of shorter
texts previously  circulated independently but now used as chapters of this
longer text <ent value="ndash"></ent> either in their original form or in
somewhat modified form.  Finally, there is a problem with <ent value="ldquo"></ent>root<ent
value="rdquo"></ent>  (<hi rend="italics">rtsa ba</hi>) texts being gradually
expanded with additional  sections often labeled <ent value="ldquo"></ent>supplements<ent
value="rdquo"></ent>  (<hi rend="italics">phyi ma</hi>) or <ent value="ldquo"></ent>secondary
supplements<ent value="rdquo"></ent> (<hi rend="italics">phyi ma<ent value="rsquo"></ent>i
phyi ma</hi>).  These supplements sometimes have chapters numbered from <ent
value="ldquo"></ent>one<ent value="rdquo"></ent> but other times have chapters
whose enumeration begins where  the previous text<ent value="rsquo"></ent>s
enumeration ended. In some cases, it  seems likely that these different textual
units were written by different  authors.</p><p>Thus in addition to the problem
of identifying the author,  we have a central problem of defining the boundaries
of a text in Tibetan  literature<ent value="mdash"></ent>what constitutes
a Tibetan text? Is a <ent value="ldquo"></ent>supplement<ent value="rdquo"></ent>
a distinct text or the same text as the <ent value="ldquo"></ent>root<ent
value="rdquo"></ent> and <ent value="ldquo"></ent>secondary supplement<ent
value="rdquo"></ent>?  Stipulating the constitution and boundaries of a text
is often a second order  act that is a function of commentary<ent value="mdash"></ent>this
stretch of material  is what constitutes a text in this case, and so forth.
We can see the changing  definitions of <ent value="ldquo"></ent>text<ent
value="rdquo"></ent> in Western Tibetological  literature, for example, in
how scholars initially cited the <hi rend="italics">sNying  thig ya bzhi</hi>
as if it were a single text. However, the <hi rend="italics">sNying  thig
ya bzhi</hi> is actually an eleven volume anthology of five quite distinct
 collections of texts, three of which consist of texts of varying authors.
 The next stage occurred when Western Tibetanists began to refer to the distinct
 collections which they cited a passage, such as saying <ent value="ldquo"></ent>from
 the <hi rend="italics">Bi ma snying thig</hi><ent value="rdquo"></ent>. Yet,
for  example, in this case the collection contains many individual texts authored
 by completely different authors. Finally, a third phase was when Tibetanists
 began to refer to the individual titles within each collection, along with
 the individual author(s). </p><p>After a detailed analysis, we have decided
 to provisionally define a separate Tibetan text as that which is terminated
 by a Tibetan term meaning <ent value="ldquo"></ent>(It is) completed.<ent
value="rdquo"></ent>  (<hi rend="italics">rdzogs so</hi>) and is framed by
traditional <ent value="ldquo"></ent>front<ent value="rdquo"></ent> elements
(title line, homage, etc.) and <ent value="ldquo"></ent>back<ent value="rdquo"></ent>
elements (closing comments followed by <hi rend="italics">rdzogs  so</hi>,
colophons, etc.). On this principle, we have found that in general <ent value="ldquo"></ent>supplements<ent
value="rdquo"></ent> (<hi rend="italics">phyi ma</hi>)  and <ent value="ldquo"></ent>secondary
supplements<ent value="rdquo"></ent> (<hi rend="italics">phyi  ma<ent value="rsquo"></ent>i
phyi ma</hi>) are all classified as separate texts,  even when chapter enumeration
is directly continuous with the preceding text.  In these ways, we have examined
indigenous issues of how <ent value="ldquo"></ent>authorship<ent value="rdquo"></ent>
and <ent value="ldquo"></ent>text<ent value="rdquo"></ent> are understood
 and throughout our work indicated how we are dealing with these issues in
 the cataloging and describing of the texts.</p><p>A third critical issue
pertains  to the editorial organization of religious canons of literature
in terms of  the classification of texts under different rubrics. These rubrics
are themselves  sometimes organized into nested hierarchies, as well as internally
divided  into further nested hierarchies of sub-rubrics. These intellectual
classifications  can be found in a cycle of texts, or <ent value="ldquo"></ent>collected
works<ent value="rdquo"></ent> authored by a single author, as well as in
a <ent value="ldquo"></ent>canon<ent value="rdquo"></ent> or <ent value="ldquo"></ent>anthology<ent
value="rdquo"></ent> of texts  by very diverse authors. In the former case,
for example, a cycle of texts  might be divided up into a series of trilogies;
or it may consist of one hundred <ent value="ldquo"></ent>instructions<ent
value="rdquo"></ent>, which themselves are classified  into ten sets of ten
instructions apiece. </p><p>When we consider canons of  diverse authorship,
we find that often the intellectual schemes for organizing  the grouping and
sequencing of texts are both helpful in understanding the  relationships between
texts and highly debatable in terms of their attempts  to connect texts into
coherent canons (or sub-canonical formations). In short,  editorial activity
on this level involves trying to retroactively construct <ent value="ldquo"></ent>traditions<ent
value="rdquo"></ent> that the texts themselves seem  unaware of, and/or constructing
an artificial unity by grouping together diverse  texts for reasons that are
not always clear or valid. It is thus not surprising  that a given canon can
have the same texts grouped in different ways by different  editions or, in
some editions, can lack any apparent editorial organization  at all.  Still,
these different editorial schemes often yield invaluable information  on how
a given text should be understood in relationship to other texts and  movements,
as well as how distinct <ent value="ldquo"></ent>movements<ent value="rdquo"></ent>
 were historically constructed. This process of construction sometimes occurred
 as texts were produced, sometimes stimulated the production of texts, and
 in some cases long postdated the production of the very texts it claims constitute
 a distinct <ent value="ldquo"></ent>tradition<ent value="rdquo"></ent>, or <ent
value="ldquo"></ent>sect<ent value="rdquo"></ent>.</p><p>It may be helpful
to examine this issue in the context  of a particular case. In the case of
the <hi rend="italics">rNying ma rgyud <ent value="rsquo"></ent>bum</hi> (NGB),
all five of the mainstream editions<ent value="mdash"></ent>the  mTshams brak
(Tb), gTing skye (Tk), sKyid grong (Kg), sDe dge (Dg) and Waddell  (Wa)<ent
value="mdash"></ent>are organized into three overarching groups of texts 
that correspond to three distinct literary, ritual and philosophical traditions
 known as Atiyoga, Anuyoga, and Mah<ent value="amacr"></ent>yoga. In each
case,  the beginning and end of the sections correspond to the beginning and
end  of individual volumes. The sixth edition<ent value="mdash"></ent><hi
rend="italics">The  Collected Tantras of Vairocana</hi><ent value="mdash"></ent>is
unique in that it  contains only the Atiyoga texts. Other editions of these
texts are either  individual editions of single texts, or published as part
of a much smaller  subset of materials. When we look at the internal organization
of these three  sections<ent value="mdash"></ent>Atiyoga, Anuyoga and Mah<ent
value="amacr"></ent>yoga<ent value="mdash"></ent>we find that the sDe dge
edition has precisely organized each  of the three into a system of subdivisions
and sub-subdivisions. Each of these  corresponds to supposed sub-traditions
within the broader traditions. While  the beginning and end of these subsections
often correspond to the beginning  and end of volumes, sometimes they are
found within a given volume. This intellectual  scheme of organization within
the sDe dge edition is made explicit in the  catalog for the edition written
by the editor himself. He not only outlines  the categories and subcategories
but also provides a list of titles found  in each. Finally, these editions
often have a final volume or two of miscellaneous  titles which are not precisely
classified<ent value="mdash"></ent>these are often  texts which were only
obtained after the first printing.</p><p>The gTing skyes  and sKyid grong
editions seem to have internal organizations of texts groupings  and sequencing
which are closely related to those of the sDe dge, though there  are some
variations. Indeed, the classification scheme underlying our own  digital <ent
value="ldquo"></ent>master<ent value="rdquo"></ent> edition of the canon 
is closely based on the sDe dge<ent value="rsquo"></ent>s. We are as of yet
unsure  about the Waddell edition<ent value="rsquo"></ent>s organization.
The mTshams brag  edition, however, is quite different. Its Atiyoga section
seems to lack any  internal organization, though the Anuyoga and Mah<ent value="amacr"></ent>yoga
 section do seem structured in ways generally parallel to sDe dge.</p><p>For
 all these reasons, it is important that our canonical cataloging and descriptive
 schemes account for these different intellectual organizations. In reality,
 a canon, or an edition of a canon, creates for itself a new textual identity:
  in some sense all the individual texts become subsumed as individual pieces
 of a far larger <ent value="ldquo"></ent>text<ent value="rdquo"></ent>. We
can thus  focus on the individual texts apart from their existence in the
canon, or  we can focus on the canon per se and see the texts as the interlocking
pieces  which constitute it. This ambiguity is evident in how Tibetan scholars
will  at times focus on the title of a particular texts within <hi rend="italics">The
 Collected Tantras of the Ancients</hi>, and at other points just cite something
 as being from the <ent value="ldquo"></ent><hi rend="italics">The Collected
Tantras  of the Ancients</hi><ent value="rdquo"></ent> without any concern
for the particular  textual and authorial source. </p><p>Thus between the
collection-level and  the individual text-level we have what we might term
the <ent value="ldquo"></ent>classification<ent value="rdquo"></ent> level,
i.e. the ways in which individual texts within a given  collection/canon/edition
are grouped into sequential and sometimes hierarchical  classification schemes.
These classification schemes can thus have several  different nested layers.
In the case of <hi rend="italics">The Collected Tantras  of the Ancients</hi>,
it appears at present that three layers is the deepest  that any scheme has:
the Atiyoga layer, then its sub-divisions, and then its  sub-subdivisions.
Given the above considerations, these classification schemes<ent value="mdash"></ent>which
differ from edition to edition<ent value="mdash"></ent>are  as important for
the edition-as-a-text perspective as chapters are to an individual  text.
They are critical to understanding how a given edition of a canon is  structured <hi
rend="italics">as a canon</hi>.</p><p>Our objective is thus  to offer two
different ways to access and view the individual titles grouped  together
for printed editions: (i) by means of the individual volumes, and  (ii) by
means of the intellectual classifications. In both cases, they terminate 
with the individual text titles. This also provides us with a template for
 dealing with other Tibetan collections that have quite diverse ways of organizing
 the individual texts. For our own master digital edition, which lacks any
 physical <ent value="ldquo"></ent>volumes<ent value="rdquo"></ent>, access
will be purely  via the intellectual organization of the texts which classifies
them into  traditions.</p></div3></div2><div2 type="section"><head>Identifying
the Boundaries  of a Text</head><p>A text is not simply a seamless flow of
words; rather it  is typically divided <ent value="mdash"></ent> either explicitly
or implicitly <ent value="mdash"></ent> into different sections or units of
varying types and sizes.  For example, prose can be divided into sections,
chapters, paragraphs and  sentences; a verse text can be divided into cantos,
stanzas and lines. These  structural textual units thus must be identified
and rendered in markup in  order to accurately represent a text in SGML-based
electronic form. </p><p>One  of the great values of using SGML is that as
long as one accurately represents  the structure of the document in question
with a coherent tagging scheme,  the internal structure of the original document
is always accessible for searches,  various displays, and other manipulations.
The display is a secondary issue,  and if the SGML is solid, one can write
a variety of programs to view it in  various ways. Our aim in representing
a Tibetan text in an SGML template is  thus to identify and label all the
structural elements of the text. In terms  of display, we will <ent value="ldquo"></ent>publish<ent
value="rdquo"></ent> the Tibetan  text not in a visual reproduction of the
original rectangular artefact but  rather in a reformatted form that closely
approximates Western publishing  conventions currently in use by many modern
 Tibetan publications. In other  words, we will mark outline headers by different
font sizes and bold face,  break prose into paragraphs, indent citations,
give separate verse lines,  and so forth. </p><p>Because electronic catalogs
allow one to search all parts  of the texts which have been identified as
separate components, it is imperative  that each structural component of the
text be marked as such with precision  and lack of ambiguity. For example,
one should not refer to different types  of closing materials simply as a <ent
value="ldquo"></ent>colophon<ent value="rdquo"></ent>;  instead, one should
divide materials into their component parts, each of which  can then be searched
separately. This enables a user to search only on printing  colophons, or
only on author<ent value="rsquo"></ent>s colophons, and so forth.  For these
reasons, electronic catalogs require us to think differently and  more precisely
about the structure of the text than print catalogs. These  components can
also be formatted in particular ways with style sheets, if  they have been
structurally identified in the SGML.</p><p>In addition to (i)  searches and
(ii) display issues, the scheme of structural division outlined  below is
also intended to enable (iii) a system of identification numbers  that gives
each line in the text a unique identifying tag. See the <ent value="ldquo"></ent><ref
type="link" target="2.6">Text Identification Numbers</ref><ent value="rdquo"></ent>
 document for details. </p><p>The scheme given below is thus intended to analyze
 the logical structure of a text, dividing it into its component elements.
 This is distinct from tagging the intellectual content of the text, such
as  all ritual implements, funerary ritual terms, etc. We are trying to make
a  template that is flexible enough to handle all the different literary genres
 within standard scholastic literature and s<ent value="umacr"></ent>tras/tantras.
 It is necessary to create different templates for other genres of texts. </p><p>In
 summary, our principle is that <ent value="ldquo"></ent><hi rend="italics">rdzogs
 so</hi><ent value="rdquo"></ent> marks the end boundaries of a text. We also
look  to the presence of traditional front matter (prefatory) and back matter
(concluding)  as supporting evidence. Such things as whether the enumeration
of chapters  crosses the divide between textual units is also relevant, but
we do not see  it as important as the preceding considerations. This is most
relevant in  the problem of <ent value="ldquo"></ent>root<ent value="rdquo"></ent>
(<hi rend="italics">rtsa  ba</hi>) tantras which have <ent value="ldquo"></ent>supplements<ent
value="rdquo"></ent>  (<hi rend="italics">phyi ma</hi>) and <ent value="ldquo"></ent>secondary
supplements<ent value="rdquo"></ent> (<hi rend="italics">phyi ma<ent value="rsquo"></ent>i
phyi ma</hi>).  In such cases, often the chapter enumeration is a single series
across all  three textual units. However, the <ent value="ldquo"></ent><hi
rend="italics">rdzogs  so</hi><ent value="rdquo"></ent> tends to mark the
conclusion of each textual unit,  which is also bound by traditional front
and back materials. We have thus  followed the general principle of considering
each of the three to be separate,  albeit linked, texts.</p><p>There are also
more complex cases where a series  of short and clearly linked texts are found
which seem to lay on the boundaries  between individual texts and unusual
internal arrangements of sections in  a single text. Here too we have tended
towards identifying separate units  as separate texts. Our general sense is
that the identification of such units  as separate texts is a more elegant
and robust solution since it allows users  to immediately discern textual
boundaries inherent to the texts themselves.  In addition, the linkage of
texts can be easily documented, but it is more  difficult to break apart a
compilation of smaller textual units treated as  a single text. Finally, the
tradition itself<ent value="mdash"></ent>or at least  the editor(s) of the
canons<ent value="mdash"></ent>seem to be ambivalent about  whether these
should together be considered a given text. Given the often  strong indications
that indeed different authorial hands, and even different  historical periods,
are behind the distinct elements, it would thus seem prudent  to err on the
side of granularity.</p></div2><div2 type="section"><head>Cataloging  the
Structure of a Tibetan Text</head><p>While Tibetan tantras and scholastic
 commentaries have characteristic structural divisions that are not common
 in Western literature, they do have a general structure which is familiar
 to Western readers. A Tibetan text can be divided into three Divisions which
 we term Front, Body and Back. A <ent value="ldquo"></ent>Body<ent value="rdquo"></ent>
 can be defined as the enumerated sequence of chapters, while <ent value="ldquo"></ent>Front<ent
value="rdquo"></ent> is all that precedes it, and <ent value="ldquo"></ent>Back<ent
value="rdquo"></ent> all that follows it. Each of the divisions can then be
divided  into chapter-level elements with the chapters of the Body taken as
the model.  Finally, the chapters have sub-chapter level elements of various
types. The  following is a list of considerations respect to the analysis
of Tibetan texts<ent value="rsquo"></ent> structure:</p><list n="1" rend="1"><item>The
texts have prefatory  sections with indigenous labels (which our project labels <ent
value="ldquo"></ent>Front<ent value="rdquo"></ent>), a body (which our project
labels <ent value="ldquo"></ent>Body<ent value="rdquo"></ent>), and concluding
sections with indigenous labels (which our  project labels <ent value="ldquo"></ent>Back<ent
value="rdquo"></ent>). </item><item>The <ent value="ldquo"></ent>Front<ent
value="rdquo"></ent>, <ent value="ldquo"></ent>Body<ent value="rdquo"></ent>,
 and <ent value="ldquo"></ent>Back<ent value="rdquo"></ent> themselves often
consist  of a number of structurally distinct sections<ent value="mdash"></ent>the <ent
value="ldquo"></ent>Front<ent value="rdquo"></ent> can be divided into title
page, title  line, homage, etc., the <ent value="ldquo"></ent>Body<ent value="rdquo"></ent>
into  the chapters, and the <ent value="ldquo"></ent>Back<ent value="rdquo"></ent>
into closing  section, author<ent value="rsquo"></ent>s colophon, translator<ent
value="rsquo"></ent>s  colophon, and so forth.</item><item>The bulk of the
texts lies in the <ent value="ldquo"></ent>Body<ent value="rdquo"></ent> (<hi
rend="italics">lus</hi>), which  is internally divided in different ways.
The most common division is <ent value="ldquo"></ent>chapters<ent value="rdquo"></ent>.
But chapters can also be grouped  together into broader <ent value="ldquo"></ent>sections<ent
value="rdquo"></ent> or <ent value="ldquo"></ent>books<ent value="rdquo"></ent>.</item><item>Chapters
are often divided  into a nested hierarchy of thematic sections and subsections
(<hi rend="italics">sa  bcad</hi>), each with an explicit header or title;
these sections<ent value="rsquo"></ent>  own enumeration and structure can
often span multiple chapters. Such sections  and subsections can also be presented
in a non-hierarchical fashion, where  a given chapter might have groups of
separate sequences of thematically titled  sections, or nested hierarchies,
that are not integrated with each other.  Classifying Tibetan texts according
to these types of structural divisions  results in three general types: <list
n="1" rend="i"><item>A text has only  chapters (no chapters is considered
one chapter), and no internal outline  at all.</item><item>A text has chapters
and an internal outline which is enumerated  anew within each chapter.</item><item>A
text has a complicated, nested hierarchy  of outlines which span multiple
chapters. This third type presents the problem  of how to preserve the chapters
and outlines as mutual parts of a nested hierarchy  of structural divisions,
since the sequence of chapters and sequence of outline  levels constitute
overlapping organizations.</item></list></item><item>Chapters  can consist
of prose as well as verse text.  The next structural level within  chapters
- putting aside the issue of internal thematic outlines - is what  we term <ent
value="ldquo"></ent> paragraph-level elements<ent value="rdquo"></ent>:  
 namely paragraphs within prose, or, analogously, stanzas within verse. Paragraphs
 are sometimes explicitly indicated by a the punctuation mark of a double
shad,  though this can also indicate the end of a section containing multiple
paragraphs.   Paragraphs can also be explicitly indicated by the convention
of repeating  the final letter of the preceding syllable, and adding a <hi
rend="italics">na  ro</hi> vowel; however, this can also simply mean a sentence
within a paragraph.   In summary, there is no simple and straightforward sub-chapter
textual division  understood as the equivalent to the English <ent value="ldquo"></ent>paragraph<ent
value="rdquo"></ent> in Tibetan literature, but there are various ways of
marking  textual transitions within prose.  It is thus necessary, though possible,
 to make scholarly determinations as to how prose can be divided into subsections
 analogous to <ent value="ldquo"></ent>paragraphs<ent value="rdquo"></ent>,
though at  times there is admitted lack of clarity and hence degree of arbitrariness
 about precisely where these divisions are to be made.<p>Other paragraph-level
 elements in prose include the distinction between text written by the author
 and the author<ent value="rsquo"></ent>s citations of other works. Within
verse,  also, there are a variety of explicit indicators of the division of
stretches  of verse into sections, as well as less clear indicators that require
human  - and debatable judgement.  Explicit indicators include a set pattern
of changing  syllabic lines, a set pattern like quatrains, or terminative
particles (most  commonly, the repeated letter of the preceding syllable with
an added <hi rend="italics">na ro</hi> vowel).  Implicit indicators include
more thematic  breaks, or transitions.</p></item><item>The next level is sub-paragraph-level
 elements, or sentences, which in Tibetan we define as shad-delimited lines.
 A <ent value="ldquo"></ent>shad<ent value="rdquo"></ent> is the only common
element  of Tibetan punctuation (aside from a <hi rend="italics">tsheg</hi> <ent
value="ndash"></ent>  see below) used at multiple points on a single folio,
and it can correspond  functionally to the English comma, semicolon, colon
and period. </item><item>Finally,  within any given <ent value="ldquo"></ent>line<ent
value="rdquo"></ent>, each syllable  is separated by a <ent value="ldquo"></ent>tsheg<ent
value="rdquo"></ent>, beyond which  there is no punctuation or formatting
marker which marks words or phrases. </item></list><p>The  following provides
an exhaustive list of the possible internal subdivisions  of the Front, Body
and Back of a Tibetan text:<list n="1" rend="I"><item>Front<list n="1" rend="1"><item><hi
rend="underline">Title page </hi>(<hi rend="italics">mtshan</hi>):  this applies
if there is a separate page with boxed-in title.</item><item><hi rend="underline">Title
line</hi>: this applies if there are opening lines  giving the title in Tibetan
or various other languages. </item><item><hi rend="underline">Homage/Invocation/Praise</hi>
 (<hi rend="italics">mchod par brjod pa</hi> - marked by <hi rend="italics">phyag <ent
value="rsquo"></ent>tshal</hi>): this is a supplication to a Buddha, Bodhisattva,
 teacher, or whomever is held in high religious esteem. Can be prose or verse.</item><item><hi
rend="underline">Statement of Intent</hi>, or <ent value="ldquo"></ent>promise
 to compose<ent value="rdquo"></ent> (<hi rend="italics">rtsom par dam bca<ent
value="rsquo"></ent>  ba</hi>): this is a verse stating the author<ent value="rsquo"></ent>s
intention  in composing this work.</item><item><hi rend="underline">Untitled
introduction</hi>:  this encompasses any materials that follows the preceding
that clearly are  not included within the first chapter of the text. </item><item><hi
rend="underline"><ent value="ldquo"></ent>Ordinary Introductory scene<ent
value="rdquo"></ent></hi> (<hi rend="italics">thun  mong gleng gzhi</hi>):
this type of section is typically found in s<ent value="umacron"></ent>tras
 (diacritics) and tantras. They describe the narrative context occasioning
 the verbal lectures and/or dialogues between the Buddha(s) and retinue which
 constitute the bulk of the text.</item><item><hi rend="underline"><ent value="ldquo"></ent>Extraordinary
 Introductory scene<ent value="rdquo"></ent></hi> (<hi rend="italics">thon
mong  ma yin pa<ent value="rsquo"></ent>i gleng gzhi</hi>): tantras transmitted
by the  rNying ma tradition in Tibet often are characterized by dual introductory
 scene, one termed <ent value="ldquo"></ent>ordinary<ent value="rdquo"></ent>,
and one <ent value="ldquo"></ent>extraordinary<ent value="rdquo"></ent></item><item><hi
rend="underline">Outline</hi>  (<hi rend="italics">sa bcad</hi>): some texts
have an initial section which  provides a detailed outline of the entire text.
 If, however, the outline  is part of chapter one, then it is a subdivision
of the chapter division of  the body.</item></list></item><item>Body<list
n="1" rend="1"><item><hi rend="underline">Section  divisions</hi>: some texts
have overarching divisions of the text which are  then subdivided into chapters.</item><item><hi
rend="underline">Chapters</hi>:  this usually consists of the individual chapters.
If there are no individual  chapters, then the whole body of the text is considered
a single chapter-level  element. <list n="1" rend="i"><item><hi rend="underline">Chapter
title</hi>:  the digital reproduction of each chapter is framed in the beginning
by its  title (bold face, large font size, centered), which is generally only
specified  at the end of the chapter in the actual text.</item><item><hi rend="underline">Chapter<ent
value="rsquo"></ent>s colophon</hi>: Each chapter usually ends with a <ent
value="ldquo"></ent>colophon<ent value="rdquo"></ent>, i.e. a reiteration
of the text title, and then the name of  the chapter and its enumeration (i.e.
the 5th chapter, etc.).</item></list></item></list></item><item>Back <p>A
 Colophon is traditionally defined as: The inscription or device formerly
placed  at the end of a book or manuscript and containing the title, the scribe
or  printer<ent value="rsquo"></ent>s name, date and place of printing, etc.
(In early  times, the colophon gave the information now given on the title
page). For  analytical clarity, we have separated out a number of different
types of sections  that we refer to as <ent value="ldquo"></ent>colophons<ent
value="rdquo"></ent> of various  types, as well as a few additional sections
found at the end of texts which  we have labeled otherwise: </p><list n="1"
rend="1"><item><hi rend="underline">Closing  section</hi>: this comprises
everything following the final chapter but preceding  the reiteration of the
title followed by <ent value="ldquo"></ent><hi rend="italics">rdzogs  so</hi><ent
value="rdquo"></ent>. This is often labelled a <ent value="ldquo"></ent>colophon<ent
value="rdquo"></ent>, but we have chosen to separate it out as a <ent value="ldquo"></ent>closing
 section<ent value="rdquo"></ent> since it is distinct from the following
sections  and at times contains considerably more information that merely
the title. </item><item><hi rend="underline">Author<ent value="rsquo"></ent>s
colophon</hi>: (<hi rend="italics">mjug</hi>):  sometimes, after the <ent
value="ldquo"></ent><hi rend="italics">rdzogs so</hi><ent value="rdquo"></ent>,
there is a short colophon by the author. This <ent value="ndash"></ent>  as
with all chapter-level elements <ent value="ndash"></ent> is marked by <ent
value="lt"></ent>DIV2> tags. They are distinguished by their type attributes.
Thus,  the author<ent value="rsquo"></ent>s colophon is marked by <ent value="lt"></ent>DIV2
 TYPE = <ent value="ldquo"></ent>author<ent value="rsquo"></ent>s colophon<ent
value="rdquo"></ent>>  tags.</item><item><hi rend="underline">Redactor<ent
value="rsquo"></ent>s colophon</hi>:   self evident.</item><item><hi rend="underline">Translator<ent
value="rsquo"></ent>s  colophon</hi>:  self evident.</item><item><hi rend="underline">Lineage
transmission</hi>:   self evident.</item><item><hi rend="underline">Reviser<ent
value="rsquo"></ent>s  colophon</hi>:  self evident.</item><item><hi rend="underline">Editorial
colophon</hi>:   self evident.</item><item><hi rend="underline">Scribal colophon</hi>:
 self  evident.</item><item><hi rend="underline">Printing colophon</hi>: self
evident</item><item><hi rend="underline">Concluding prayer</hi> (<hi rend="italics">mjug
byang smon  lam</hi>): There is a great deal of variability from text to text
concerning  the number of concluding prayers and where they typically fall
in the sequence  of <ent value="ldquo"></ent>Back<ent value="rdquo"></ent>
textual sections. </item><item><hi rend="underline">Closing invocations</hi>:
this signifies final mantric particles  or invocations like <ent value="ldquo"></ent><hi
rend="italics">rgya rgya rgya</hi><ent value="rdquo"></ent>, <ent value="ldquo"></ent><hi
rend="italics">sarva mangalam</hi><ent value="rdquo"></ent>, etc.</item><item><hi
rend="underline">Instructional colophon</hi>:  this refers to an appended
set  of instruction on performing offerings or  other rituals in relation
to  the text.</item><item><hi rend="underline">Undetermined  colophon</hi>:
this is for situations were a precise determination of the  type of colophon
cannot be made by the editor but requires further scholarly  research and
discussion.</item></list></item></list></p></div2><div2><head>Interfacing
 a Catalog and a Textual Representation</head><p>The catalog of a collection
 is the interface for users accessing digital representations of the texts
 themselves. This aspect is still in its developmental stage and the actual
 dynamics of it are presently being worked out. There are at least two methods
 for connecting to representations of the text itself. Each text<ent value="rsquo"></ent>s
 bibliographic record will include a link to representations of that text
at  the top of the record. This page contains a menu on the right and a content
 frame on the left, each formatted according to the Library<ent value="rsquo"></ent>s
 display standards. The right-hand menu displays all the available options
 for viewing that particular text<ent value="mdash"></ent>scanned images,
Wylie  transliteration, and so forth. The content frame displays the choice
selected.  This is the most basic format for viewing representations of a
text. In addition,  we are providing simultaneous views of the text<ent value="rsquo"></ent>s
hierarchical  structure for navigational purposes, and the corresponding part
of the text  highlighted in that structural outline.</p><p>Finally, we are
developing a  functionality which allows for viewing of the Tibetan text and
corresponding  translation(s) in sync with each other, so that as one moves
through one,  the other scrolls down automatically in tandem with it.  In
this regards,  <xref n="http://www.iath.virginia.edu/">IATH</xref> is developing
another program called IBabble, which allows one to view  multiple representations
in diverse languages in different windows at the  same time. The text-representations
displayed in the various windows are marked  up so that as the viewer scrolls
down one, the others scroll in a synchronized  fashion. This allows a user
to simultaneously view different representations  of a single text, whether
that be different editions of a canon, such as the  mTshams brag and the gTing
skyes editions of <hi rend="italics">The Collected  Tantras of the Ancients</hi>,
or different modes, such as transliteration,  Tibetan Script, and translation,
or different languages, such as Tibetan,  Sanskrit and Chinese. While IBabble
has been tested on Greek and Aramaic,  further work needs to be done to implement
IBabble with Tibetan, most notably  it requires the use of a Unicode font
in XML markup.</p><p>The second way  to access views of a text will be through
the citations in its bibliographic  record. Because all paginations for titles,
chapters, and so forth are specially  marked up in <ent value="lt"></ent>PAGINATION>
tags, these can be displayed as  hyperlinks through transformations initiated
by the style sheets. The links  will go to anchor-tags called <ent value="lt"></ent>MILESTONE>
tags, which record  physical locations in the text either page boundaries
or line boundaries.  Thus, any pagination or title can act as a link to that
point in the digital  text. <ent value="lt"></ent>MILESTONE> tags can easily
be inserted in text files  containing either transliteration or Tibetan script.
It is however much more  difficult to <ent value="ldquo"></ent>markup<ent
value="rdquo"></ent> scanned images  to reflect page and line boundaries.</p><p>As
certain innovations become available,  these will also be implemented for
text-representation viewing. For instance,  software is being developed to <ent
value="ldquo"></ent>tag<ent value="rdquo"></ent>  images as described above.
IATH is developing an image annotating program  known as I-Note, which possibly
could be used for this purpose with some modification.</p><p>A  text<ent value="rsquo"></ent>s
bibliographic record will act as a front-end for  accessing various representations
of the text itself. These representations  will include digital images of
its pages, typed in version in transliteration  and Tibetan script, and  translations.
 Some scholars and students prefer  to look at transliteration; others would
rather see Tibetan script. Of course,  those who do not know Tibetan would
need translations, though our translating  tools allows for automated parsing
and definition of Tibetan passages to provide  a rough suggestion of the meaning
of an untranslated passage. Each of these  formats will be briefly described
in the following sections. </p></div2><div2 type="section"><head>Digital Images
of Manuscript Folios</head><p>Because  input versions of the texts will invariably
produce some mistakes, and because  text critical scholars will need to see
the original manuscript, it is essential  to have photographic reproductions
of the original manuscripts in the archives.  Such a development revolutionizes
some aspects of making critical editions,  because it allows the interested
parties to verify the work of text editors  in preparing diplomatic and critical
editions. In addition, since the original  manuscripts have limited circulation,
our publishing of the digital images <ent value="ndash"></ent> which is far
easier than typing in the texts <ent value="ndash"></ent>  will vastly enhance
access to these collections, and encourage scholarly contributions  to the
archives.</p><p>We have yet to finalize standards and procedures for  digitizing
manuscripts. This will include both miniatures on illustrated manuscripts
 and the manuscripts themselves. In all probability two images of each page
 will be stored to facilitate access. One will be a smaller GIF file to provide
 easier and quicker initial access to view the page and the other a larger
 TIFF file for a magnified view; we will also likely implement Mr. Sid so
that  viewers can see the TIFF at multiple resolutions corresponding to their
access  speed. We are also looking into developing or obtaining software for
viewing  and manipulating images, possibly collaborating with Dr. Janet Bragg<ent
value="rsquo"></ent>s  project to create a Facsimile Editor that allows for
the SGML/XML markup of  the manuscript image itself (as well as videos) with
all search functionalities.  Being able to associate SGML/XML markup with
an image will allow for quicker  and more accurate interfacing between a text<ent
value="rsquo"></ent>s bibliographic  record and it images. Paginations in
its bibliographic information can be  directly linked to lines in the page<ent
value="rsquo"></ent>s image. </p></div2><div2 type="section"><head>Digital
Representations of Extended Wylie Transliteration</head><p>The  Tibetan and
Himalayan Digital Library has decided that the best format for  storing Tibetan
information is in Latin-font transliteration. These characters  are most easily
processed on all levels of the digital technology structure,  and especially
easier to store and retrieve without any data-corruption or  loss. Furthermore,
with relative ease the Latin-font transliteration can be  converted into digital
Tibetan script through software that is presently being  developed by the
Library. Thus, transliteration allows for the broadest possible  access and
dissemination. </p><p>As with many non-Latin-based languages, Tibetan  transliteration
is not a simple one-for-one correspondence between their letters  and Latin
characters. Transliteration of a single Tibetan character may involve  three
or more Latin equivalents. Since the beginning of Tibetan Studies, scholars
 have devised different methods for transliterating the Tibetan language.
These  generally share a common core of principles but differ radically in
crucial  aspects. Most of these methods use a variety of different diacritical
marks  to represent Tibetan characters. This results in difficulties of entry
and  display in digital mediums, not to mention the confusion caused by the
lack  of correspondence between these different systems<ent value="rsquo"></ent>
equivalents.  Furthermore, none of these systems fully account for the breadth
and depth  of the Tibetan language, which has religious, colloquial, and foreign
(Sanskrit  and Chinese) components and can include punctuation symbols for
which no transliteration  equivalents have until now been devised. In order
to comprehensively transliterate  the Tibetan information before us in a way
that is easy to understand, enter,  and display, it was necessary to come
up with a comprehensive scheme. To this  end, we have developed the Extended
Wylie Transliteration Scheme, which fills  in the gaps of an earlier commonly
accepted system. The details of our enhanced  scheme can be read in <ref type="link"
target="2.2">the transliteration section  below</ref>.</p><p>At issue with
a transliteration of a text is the format  for display. Traditional Tibetan
texts are printed on long narrow pages, front  and back, which are stacked
vertically and unbound aside from often being  contained between two similarly
shaped boards or pieces of cardboard on the  top and bottom of the stack.
While useful in terms of staying flat without  any need to hold them down <ent
value="ndash"></ent> an especially useful aspect  for uses in performing rituals <ent
value="ndash"></ent> these long, narrow pages  do not lend themselves well
to digital representation. Of course, the digital  images of the pages will
retain that shape, but to impose it on digital text  would serve to hinder
access rather than enhance it. It has therefore been  decided that both the
transliteration and the digital Tibetan script will  be formatted as most
text is formatted on the web please use capital for Web  unless you can tell
me no one does that and in <ent value="ldquo"></ent>Western<ent value="rdquo"></ent>
style books. Tibetans themselves have been publishing Tibetan  in works in
such a format for several decades. Formatting of these representations  will
be along standard lines with headers, indented verse, and so forth. Hyperlinks
 are certainly possible and will be used in a helpful but discrete manner.</p></div2><div2
type="section"><head>Digital Representations in Tibetan Script</head><p>There
 are several hurdles to mount before representations of texts in Tibetan script
 become easily accessible over the World Wide Web. These fall under two broad
 categories, those associated with the use of Tibetan fonts and those associated
 with converting between transliteration and Tibetan fonts. For optimum portability,
 we have decided to store all of our Tibetan data in extended Wylie transliteration.
 Such a format allows for the possibility of displaying the information in
 the format of the user<ent value="rsquo"></ent>s choice, either in straight
Wylie  transliteration or in any number of proprietary Tibetan fonts through
the  use of appropriate conversion routines. As extended Wylie uses only the
Roman  characters encoded in the first half of the standard ASCII character
set,  there is no chance for data corruption, since these are identical for
all  Roman fonts. Wylie also allows for ease of editing. In addition, it allows
 users to download RTF files in augmented Wylie divested of all SGML. They
 can then view the files with ease or easily convert them into their own Tibetan 
script. Finally, the Wylie remains available for future revisions, whereas
 until better software tools are available for working directly with Tibetan
 script, such would be very difficult with Tibetan characters. For display
 purpose, Wylie can be converted to Tibetan fonts. (On fonts, see the next
section.) It must then be decided  which Tibetan font or fonts to use, how
these will be displayed over the web,  and how the Wylie will be converted
to Tibetan script.</p>  <div3 id="TibFontsGen"><head>Tibetan Fonts</head><p>As
mentioned in previous sections, extended Wylie transliteration is used to
encode and  store Tibetan data. This provides an unambiguous means for recording
the information contained  in the texts. However, for this information to
be useful to the broadest possible audience, it  is absolutely necessary that
it can be displayed in the Tibetan script as well as in transliteration  and
translation. This requires the appropriate Tibetan fonts.</p><p>Tibetan fonts
present unique problems in the digital milieu. The Tibetan syllabary  contains
only 30 consonants, 6 vowels, a small assortment of characters for transliterating
Sanskrit,  and a variety of punctuation<ent value="mdash"></ent>totalling
less than 100 "glyphs". However, in written  Tibetan these characters are
often stack one on top of the other. Standard stacks often have up to  four
different characters vertically aligned: the root letter, a superscript, a
subscript, and a vowel. This is where the difficulty arises for fonts and
display.</p><p>There are two different means to handle this issue of stacking
characters. The character glyphs  can be drawn in a full size as well as compacted
stack size. Stacks are then built as the data is  entered or interpreted.
Or, a glyph can be drawn for each unique stack, which is then given its own
encoding. The data is then interpreted and the appropriate stack glyph is
called. To date, most  proprietary Tibetan fonts have chosen the latter method,
which significantly increases the number of required glyphs. When stacks used
for the transliteration of Sanskrit are included, the number of  required
glyphs is on the order of 1,000. This requires that the set of glyphs be spread
over several font files, as the upper limit of ASCII encoding is 256 glyphs.
Thus, the average, proprietary Tibetan font is in actuality a set of 5 or
more fonts. On the other hand, the Unicode Consortium has chosen the former
route of assigning each character two forms, a base form and a subscripted
form, implying that stacks are to be built on the fly. In the long run, Unicode
will provide the standard system of font encoding used by all. However, the
software and fonts required to successfully implement Tibetan Unicode have
yet to be created. For this reason, the most viable solution at present is
to utilize a public-domain Tibetan font based on an earlier, widely-spread
proprietary encoding scheme. Through the combined efforts of the Tibetan and
Himalayan Digital Library, the Tibetan Computer Company, and the Trace Foundation,
such has been achieved and is being prepared for public release!</p><p>The
next two sub-sections will cover 1) the issues of a Unicode Tibetan font and
its implementation and 2) the public domain font and its underlying encoding
and use.</p></div3><div3 id="TibUniFont"><head>The Goal: Unicode Tibetan Fonts</head><p>Our
ultimate goal is to use Unicode Tibetan fonts,  though presently these are
not available. The promise of Unicode is that it  will provide a standard
encoding for use in representing all scripts in the  world. Presently, non-Latin
fonts map their glyphs onto the 256 places in  the ASCII character set and
call those glyphs using those codes. This requires  that the user has downloaded
the specific font and installed it in their system.  In order to view it on
the web, the browser must be specifically told which  font to use and when
to switch back to the Latin font. The problems with this  are 1. the different
Tibetan fonts use different mappings of their glyphs  onto the ASCII codes
and therefore are not interchangeable, 2. not having  the appropriate font
installed will result in a display of meaningless strings,  and 3. in order
to display English and Tibetan on the same page, the browser  must be told
to repeatedly switch fonts. By reserving a unique range of codes  for each
of the world<ent value="rsquo"></ent>s languages, Unicode solves all these
 problems. Though at present no workable versions exist, every Tibetan Unicode
 font would have the exact same encoding and would thus be interchangeable.
 Furthermore, Unicode makes it possible to create a single font to display
 multiple languages. For instance, English (with the Extended Latin character
 set to include diacritics), Sanskrit, Tibetan, Chinese, and Japanese glyphs
 could all be included in a single font, each character having a unique reference 
code. Finally, use of the Unicode encoding would disambiguate the call to
 the browser so that the browser would automatically know what character to
 display without being told to change fonts. The immense benefits of such
a  development are obvious<ent value="mdash"></ent>e-mails could be easily
sent in  Tibetan, documents easily traded back and forth, and so on.</p><p>However,
significant barriers remain to implementing Unicode Tibetan as a functional
 solution. The Unicode Standard for Tibetan script takes an on-the-fly stack-construction
 approach. Written Tibetan words often include a number of silent characters,
 much like the English word, knight. Unlike English, however, some these letters
 are frequently stacked vertically on top of each other. Stacks of four distinct
 characters (including the vowel) are common in Tibetan. The Unicode Consortium<ent
value="rsquo"></ent>s approach to this problem as embodied in the 3.0 standard
 is to encode two forms for each consonant<ent value="mdash"></ent>the form
it takes  when on top of a stack and the form it takes when subjoined to another
character.  Such an encoding requires software to build these stacks on the
fly. This  is equivalent to having to build a complex Chinese ideogram from
the simpler  ideograms that compose it. With Chinese, the encoding did not
take this approach,  but rather all the Chinese ideograms were given a unique
code. As such was  not done with Tibetan, some preprocessing of the Unicode
Tibetan is required  before the glyphs can be displayed. The recent advent
of OpenType fonts has  made this more possible, but there needs to be a corresponding
commitment  from the side of both font and software developers. Other issues
such as formatting  difficulties<ent value="mdash"></ent>line breaks, spacing
and so forth<ent value="mdash"></ent>must  also be addressed on the software
side. In addition, Unicode works well for  modern languages, but it runs into
trouble with ancient languages because  of expanded character sets for which
Unicode is not necessarily adequate.  Thus there is a Unicode standard for
Tibetan, but the character set for all  glyphs is not complete. In other words,
it may be sufficient for ordinary  Tibetan, but transliterated Sanskrit stacks
and bizarre stacks like found  in Atiyoga literature pose special problems.
Because of the present lack of  tools, there is at present no functioning
Unicode Tibetan font, though with  the rapid advance of technology this lacuna
will no doubt be eventually filled.</p></div3><div3 id="PubTibFonts"><head>Present
Implementation: Public Domain Tibetan Font</head><p>While an operable Unicode
Tibetan font is still several years off, there is an  immediate, interim solution,
namely the use of a public-domain, ASCII Tibetan font. With funding from the
Trace Foundation, Tony Duff of the Tibetan Computer Company worked in collaboration
with various members of the Tibetan and Himalayan Digital Library to create
such a font: <b>Tibetan Machine Web</b>. This font will soon be made available
on both the THDL and TCC web-sites for free distribution and use. There is
also work being done on various conversion routines to convert back and forth
between other proprietary fonts and Tibetan Machine Web, in order to enhance
the usability of Tibetan fonts in the digital medium.</p><p><b>Tibetan Machine
Web</b> is a cross-platform, web-usable, ASCII font based on the Tibetan Computer
Company<ent value="rsquo"></ent>s Tibetan Machine font. In collaborating with
the TCC and the Trace foundation, we desired to create a font that had the
artistic grace of the Tibetan Machine but was also functional on a variety
of platforms. Tibetan Machine was only available for Windows. Furthermore,
it was absolutely necessary that the public-domain font be viewable in web-browsers,
as this was to be our main use for it. Tony Duff visited the THDL for several
weeks, over the course of which we assisted him in creating this new font.
In order to meet the requirements that the font be cross-platform and web-accessible,
it was necessary only to use the lower half of the ASCII range. Thus, what
was once a five file font, became a ten file font. These fonts, however, can
be used on the Mac OS, Windows, and possibly Linux platforms, and therefore
also in the standard Web browsers. Due to the Trace Foundations financial
support, this new font set has been made public-domain so that users will
be free to download the font and view literary material in Tibetan script.
The site for downloading the font is presently under development.</p><p>We
are also developing means for entering data directly into Tibetan script.
There are a number of prototypes being worked on. There is a Java Applet and
Servlet that allow one to type Wylie but view the resultant Tibetan script
over the web. This is already being connected to our dictionary project and
it also allows one to view the dictionary entries in Tibetan script as well.
As mentioned above, work is being done on converters from various encodings
as well as from Wylie transliteration into Tibetan Machine Web. We are almost
finished developing the first release of a macro for Microsoft Word that converts
entered Wylie into Tibetan Machine Web, either on the fly or after entry,
and mechanisms are being developed for creating web-pages using Tibetan Machine
Web that will assist in the speed of web-based publication in Tibetan.</p></div3><div3><head>Conversion
 from Extended Wylie to Tibetan Script</head><p>The problem of converting
from  Wylie transcription to Tibetan script is related to the Unicode issue.
Until  a Unicode Tibetan font becomes available and workable, one is required
to  use specific ASCII-based fonts. As mentioned above, there has been no
standardization  for overlaying Tibetan glyphs on the ASCII codes, so that
each font has a  different encoding scheme. There are also differences between
fonts that build  their stacks on the fly and those that have preformed stack.
The latter generally  use more than one set of glyphs in order to represent
the upwards of 600 different  Tibetan stacks. Given such a variety, conversion
programs must take into account  for various fonts or be tied to a specific
one. The Library has been developing  various technologies to convert our
Extended Wylie transliteration into Tibetan  script. We have developed a word-processor
macro which when activated turns  Wylie into Tibetan as it is typed. A similar
JAVA program has been written  for a database of Tibetan words and phrases,
and finally a PERL script has  been written for batch conversions of text
files from Wylie to Tibetan encoding.  All of these are in their beta-phase
and are not presently available to the  public. Work is also being done on
conversion from Tibetan script to Wylie  transliteration. Our ultimate goal
is to allow the user both to see and enter  information<ent value="mdash"></ent>such
as queries and so forth<ent value="mdash"></ent>directly  in Tibetan script
through the use of either Wylie or a special Tibetan keyboard.  Further documentation
of the algorithms for conversion will be made available  upon the public release
of the free software.</p></div3></div2><div2 type="section"><head>Critical
 Editions</head><p>TEI has many resources for dealing with critical editions.</p><p>We
 are working with previous efforts at Virginia to create a XML template and
 DTD for use in representing critical editions of Tibetan literature. In addition,
 we need to establish a set of guidelines for preparation of a critical edition
 in a word processor with footnotes/endnotes. We can then accept such a document
 and run a simple conversion routine on it to convert it into an XML document.</p><div3><head>Conventions
 for Noting Variant Readings</head><p>We are also considering the creative
 uses of colors, fonts, and so forth.  For example, we could make all sigla
 red, all variant readings blue, and so forth. Or we could color-code editions <ent
value="mdash"></ent> all mTshams brag readings are blue, sDe dge reading are
red,  etc. </p><p> In the variant element, we will have a mixture of English
and  Tibetan. Even if we separated the sigla and the variants, the variants
can  use English words like <ent value="ldquo"></ent>omit<ent value="rdquo"></ent>
or <ent value="ldquo"></ent>insert<ent value="rdquo"></ent>. </p><p>We note
variants at the  syllabic level, not the word or phrase level. Thus if a two
syllable word  only varies in one syllable, only that one syllable is noted
in the annotation  giving variants. </p><p>In listing sigla that share a common
reading, the  sigla are noted together without intervening spaces or commas.
Thus <ent value="ldquo"></ent>TbTkKg<ent value="rdquo"></ent>. This is possible
because upper case are only used in our  Sigla in the first letter slot.</p><p>After
sigla and before the variant,  our convention is the insertion of a colon
and space.</p><p>If an insertion  is being documented, it is indicated as
follows: <ent value="ldquo"></ent>Tb inserts <hi rend="italics">yang</hi>.<ent
value="rdquo"></ent></p><p>If an omission is being  documented, it is indicated
as follows: <ent value="ldquo"></ent>Tb omits <hi rend="italics">yang</hi>.<ent
value="rdquo"></ent></p><p>Usually critical editions will separate variants
into  two groups<ent value="mdash"></ent>the significant and the insignificant<ent
value="mdash"></ent>in  order to prevent the reader from being overwhelmed
by the masses of variants.  Thus the significant group is footnoted, and the
insignificant are end-noted.  Significant readings are variants which are
shared across editions that show  common patterns of errors that are useful
for stemmatic analysis or single  readings which are semantically significant;
insignificant readings are those  that are single readings of trivial difference
that do not show a pattern  of shared errors. We thus want two types of variant
reading <ent value="ldquo"></ent>holders<ent value="rdquo"></ent><ent value="mdash"></ent>the <ent
value="ldquo"></ent>significant<ent value="rdquo"></ent> and <ent value="ldquo"></ent>insignificant<ent
value="rdquo"></ent>.  Then we can specify whether we want to see one, the
other, or both. In addition,  the marker in the text that a variant exists
for that word should be colored  coded so that one color indicates a significant
variant and one color indicates  an insignificant variation<ent value="mdash"></ent>if
it has both, both colors  appear. </p></div3></div2><div2 type="section"><head>Translations</head><p>The
 structure of the Tibetan Literature Archive is set up incorporate non-Tibetan
language  translations of the texts. Original Tibetan literature in an integrated
fashion.  While our present focus is on cataloging, we finalizing this aspect <ent
value="ndash"></ent>  including on-line submission features <ent value="ndash"></ent>
and incorporating  initial translations during the fall, winter and spring
of 2001-2. This includes  integrating the translation resources into searching
and analytical tools  we are developing. </p><p>Several issues need to be
addressed before a larger  scale translation of texts can be possible. One
issue is ethical in character,  namely that many bodies of Tibetan Buddhist
literature, including the collected  tantras of the ancients, are traditionally
considered to be esoteric.  As  such, although the texts themselves are sold
publicly to anyone, traditional  representatives of the corresponding religious
lineages expect that study  of the texts will be preceded by initiatory rituals
and be accompanied by  oral commentary.  Given that literacy is much widespread
in many modern countries  compared to traditional Tibet and other cultural
differences, public dissemination  of the texts in translated form is not
precisely analogous to the public sale  of these texts in their original Tibetan
form.   At the same time, it remains  the case that the Tibetan texts can
be purchased by anyone.  In addition,  high quality and extensive scholarship
will only be enabled when translators  can do in a public context, since the
modern financing of translation is far  more dependent upon public sales than
on private patronage in most literary  contexts.  Finally, many of these texts
are centuries old and rarely even  consulted <ent value="ndash"></ent> if
every <ent value="ndash"></ent> by modern representatives  of corresponding
lineages, suggesting the need to balance responsibility to  the past with
responsibility to the present.  We are thus taking a deliberate  approach
to translation, and are slowly consulting a variety of relevant parties  to
formulate a reasonable policy.  For the time being, translators with concerns
 are offered a variety of possibilities, including password access to their
 work.</p><p>Furthermore, there are other relevant issues to be worked on
with  regards to translation.  We believe it is quite untenable to think that
a  series of translation standards can be developed for terminology used in
the  translation of Tibetan literature into English.  However, we do aim to
develop  translation standards for use with titles, chapter titles, and colophons,
 which will be used in <ent value="ldquo"></ent>official<ent value="rdquo"></ent>
fields  for each.  This will allow standardized searching abilities across
different  texts.  We plan to establish an editorial board, or series of boards,
for  establishing such internal standards for different types of terminology.
 </p></div2></div1>
<!--STANDARDS-->
<div1
type="chapter"><head>II. Standards</head><p>In order to create a working, useable
 resource, certain standards need to be established for the Library as a whole
 and the Tibetan Literature Archive in particular. Standards need  to be set
for cataloging, transliteration, the fonts used in connection with  the Library,
doxographical classification, terminology of various sorts, identification
 numbers for texts/volumes/editions, file naming conventions, and HTML display.
 Some of these standards are more or less important depending on the phase
 of the encoding process. The Tibetan Literature Archive is presently (February
2001)  focused on cataloging and displaying the catalog on the web. Therefore,
 cataloging, transliteration, text identification, and HTML display are the
 more important issues. Some of these standards, such as the cataloging records
 and transliteration scheme, have been somewhat firmly established; others
 remain to be hashed out, though all of them need to be vetted by a thorough
 scholarly review. Each of the most important areas where standards are needed
 will be discussed below.</p><div2 n="" type=""><head>Standards for Cataloging
 Records</head><p>Each textual collection is composed of a number of different
 types of files for holding specific types of information, from digital images
 to bibliographic records to text representations. While most of the files
 will use similar SGML markup schemes, primarily the TIBBIBL TEI-base DTD,
 each type of file has or will have its own unique structure. This section
 describes the structural make-up of each file type without going into the
 specifics of the SGML markup. What follows are descriptions of each file-type
 in its fullest possible form, i.e. including all the fields it could possibly
 contain. However, actual records in the catalog may not contain some of 
these fields because the information is presently unavailable or unknown due
 to lack of adequate research in the field. These fields can always be added
 whenever the information becomes available.</p><p>The types of files have
 been organized into two distinct sets: those dealing with multiple texts
and  those dealing with individual texts. Here, text refers to a text within
the  particular collection being cataloged. A file dealing with multiple texts
 would be, for instance, an edition catalog file, while one dealing with 
an individual text would be a text bibliographic record.</p><p>The distinction
 needs to be made here between a catalog file and a bibliographic record.
 The two are interrelated though distinct. Bibliographic records are just
that<ent value="mdash"></ent>files that contain the bibliographic information
for a particular  component of a collection, which could be a text, a volume,
or an edition.  A catalog file, on the other hand, is a file that contains
a description  of the structure of a multi-part component of the collection,
such as a volume  or an edition. Embedded in this structure are references
to the bibliographic  information for each part contained in the component.
Thus, each element of  a collection (each volume, text, and edition) has its
own separate bibliographic  record, while the catalog records serve to concatenate
these into a hierarchy  that mirrors the actual physical structure of the
collection<ent value="mdash"></ent>discrete  editions containing discrete
volumes containing discrete texts. For instance,  a volume contains a number
of texts. The body of the volume catalog file  contains at its uppermost level
a reference to that volume<ent value="rsquo"></ent>s  bibliographic record
followed by a series of subdivisions equal to the number  of texts it contains.
These subdivisions in turn contain pointers to the bibliographic  records
for the individual texts themselves. The catalog records therefore  mirror
the structure; the bibliographic records record the bibliographic information
 for each individual item.</p><p>The project has developed a set of templates
 to act as standard guidelines for constructing the files mentioned above.
 Together these templates represent a powerful set of tools for cataloging,
 describing, reproducing, translating, and analyzing canons of Tibetan literature.
 They have been designed to cover not just editions of <hi>The Collected Tantras
 of the Ancients</hi>, but also the broader literary category of Tibetan religious
 literature, especially tantras, s<ent value="umacr"></ent>tras and commentarial
 literature. We have not, however, tried to make it so broad as to cover wholly
 distinct literary genres such as Gesar epics, and so forth. We anticipate
 new versions will be necessary for other such genres.</p><p>These templates
 can be divided into two groups: those that deal with multiple texts and those
 focused on single texts:<list n="1" rend="I"><item>SGML files dealing with
 multiple texts<list n="1" rend="1"><item>Authority records</item><item>General
 scholarship template</item><item>Edition catalog template</item><item>Edition
 scholarship template</item><item>Doxographical catalog template</item><item>Volume
 catalog template</item></list></item><item>SGML files dealing with individual
 texts<list n="1" rend="1"><item>Text catalog template</item><item>Text representation
 template</item><item>Text translation template</item><item>Text chapter summaries
 template</item><item>Text images</item><item>Text scholarship template</item><item>Text
 intertextuality template</item></list></item></list></p><p>The first set
of  templates thus begins with the overarching <ent value="ldquo"></ent>authority
records<ent value="rdquo"></ent>, which function as general databases of information
available  at all points in the Tibetan Literature Archive and applicable
to all individual  catalogs. These include biographies, maps, and so forth.
Thus, users at  any point can ask for biographical information about a given
author, see the  location of a mentioned place on a map, or request a detailed
history of an  intellectual movement referred to in a text.  Likewise, the <ent
value="ldquo"></ent>general  scholarship template<ent value="rdquo"></ent>
contains Tibetan and non-Tibetan  scholarship which ranges over a series of
texts without being focused on a  given edition or text. </p><p>The <ent value="ldquo"></ent>edition
catalog template<ent value="rdquo"></ent> and <ent value="ldquo"></ent>edition
scholarship template<ent value="rdquo"></ent>  are instead focused on particular
editions of the catalog. The <ent value="ldquo"></ent>doxographical  catalog
template<ent value="ldquo"></ent> deals with individual groupings of  texts <hi
rend="italics">within</hi> a given edition based on intellectual  affiliations,
while the <ent value="ldquo"></ent>volume catalog template<ent value="rdquo"></ent>
does the same based on the individual physical volumes <hi rend="italics">qua
volume</hi> within a given edition. These three catalog  templates thus perform
two distinct functions. On the one hand, they function  as avenues to accessing
the catalogs of the individual texts; on the other  hand, they provide bibliographical
information at their respective levels  (the level of the edition overall,
etc.).</p><p>We then have seven templates  which are geared towards individual
texts. The <ent value="ldquo"></ent>text catalog  template<ent value="rdquo"></ent>
is the root of the whole cataloging project  since it provides bibliographical <hi
rend="italics">and</hi> descriptive  information about a given text. Three
files then contain representations of  the Tibetan text via an electronic
Tibetan script version, a digital photographic  version of the original Tibetan
document (this, however, is a non-SGML file),  and English language documents.
At the level of scholarly analysis of the  individual text, there are three
templates: one for chapter summaries, one  for scholarly essays in Tibetan
and non-Tibetan languages, and one for the  analysis of intertextual relationships. </p><p>What
follows are details on  these different categories of files and detailed outlines
of their contents. </p><div3 n="" type=""><head>Files Dealing with Multiple
Texts</head><p>The first of  the two sections covers templates for multiple
texts. </p><div4 n="" type=""><head>Authority  Records</head><div5><head>Summary</head><p>These
are a series of independent  databases that can be accessed from anywhere
in the Library for further information  about a particular person, place,
and so forth. The exact structure and format  of these databases is presently
being worked out with an emphasis on the geo-referencing  and terminological
databases.</p><p>Our Authority Records fulfill two distinct  functions. First,
they provide authority files for items with multiple names,  such as a scholar
who goes by more than one name. The Authority Record allows  one to identify
all known variants, and access all materials using one of  those variant names.</p><p>Second,
authority records allow us to access all  parts of the database from anywhere
within the Tibetan Literature Archive. For example,  we have a biographical
database that allows the user to go from a personal  name in the Tibetan Literature
Archive to the biographical database and a geo-referencing  database that
goes from a place name to a map locating it. In contrast to  the first function,
this requires extensive information being stored for each  item. Thus, the
general principle is that if something will occur more than  once, it makes
sense to store it in an authority file; however, if it is unique  and only
in that text, the overhead is too burdensome. The only exception  occurs when
one wants to search the authority files themselves.</p><p>The  Authority Records
will be SGML files and will have their own DTDs. The two  functions of the
authority records can be managed by hiding the extensive  information from
view when working with it simply as an authority file, but  then displaying
it when working with it as a large reference source. They  can be accessed
in the Tibetan Literature Archive by clicking on them, though the  exact procedure
has yet to be worked out. Finally, these Authority Records  can also integrate
sound, images, video and text. </p><p>We currently are  designing the following
seven types of authority records:<list n="1" rend="1"><item>Biographies</item><item>Institutions</item><item>Movements</item><item>Geo-references</item
><item>Encyclopedia of Terminology</item><item>Images</item><item>Clans</item></list></p></div5><div5><head>Structure
 of Authority Files</head><list n="1" rend="1"><item>Biography Database: <p>This
 will be a list of personal names, with type-attributes such as Indian, Tibetan,
 historical/mythical, Buddha, Bodhisattva, and so forth. Each name will have
 associated with it a list of variant names, analytical biographical information,
 and an extended biography. These names will include all people involved with
 the authorship, translation, redaction, transmission, and editing of all
texts  and editions in the Collections. It will also include mythical figures
and  deities referred to in the texts. </p><p>Visually, this would have lists
of  names if accessed in general. Clicking on a given name would provide a
window  of the following information.</p><list n="1" rend="i"><item>Normative
name:  In the biographical database every form of every name found in the
Collections  will be entered. However, one of these will be chosen as the
normative name,  the one most commonly used or most generally accepted among
Tibetans themselves.  It will be under this rubric that the other names will
be stored.</item><item>Variant  names: figures, both historical and mythical,
often have several widely different  names. Not only are there different forms
of the same name, omitting one or  more syllables, but new names are often
given at important rites of passage  in a persons life, such as birth names,
ordination names, or realizational  names. On top of this, there are the usual
nicknames, epithets, titles. This  database will contain a list of each person<ent
value="rsquo"></ent>s alternate  names. Thus, no matter which name is searched,
the biographical data may be  obtained.</item><item>Life dates</item><item>Biographical
sketch and historical  significance. Or do a chronological sketch: date/event/place,
date/event/place,  etc. Sources will be provided for all information.</item><item>Birth
place,  principal sphere of activity.</item><item>Images of the people</item><item>Estates,
 family names, clan names.</item></list><p>If you entered the biographical
 database from a particular name in the Tibetan Literature Archive, then the
Authority  Record screen would come up by itself with an option to access
that particular  Authority Record. All names would be available in Tibetan
script, Wylie, Romanized  Sanskrit with diacritics, and phonetic renderings.</p></item><item>Institution
 Database: <list n="1" rend="i"><item>Normative name</item><item>Variant names</item><item>Organization
 type: the type is entered as an attribute<ent value="mdash"></ent><ent value="ldquo"></ent>monastic<ent
value="rdquo"></ent>, <ent value="ldquo"></ent>governmental<ent value="rdquo"></ent>, <ent
value="ldquo"></ent>administrative<ent value="rdquo"></ent>, etc.</item><item>Life
dates</item><item>Chronological  sketch</item></list></item><item>Movement
Database: <p>This refers to schools  of thought, different sectarian traditions,
and so forth. It will include  a nested hierarchy or tree relating these different
traditions to each other.</p><list n="1" rend="i"><item>Normative name</item><item>Variant
names</item><item>Movement  type</item><item>Life dates</item><item>Chronological
sketch</item></list></item><item>Geo-referencing  Database: <list n="1" rend="i"><item>Normative
name of place</item><item>All  variant names/spellings</item><item>Location
on a map. We would like to include  searches for a certain geographical range,
the results of which would locate  all place names within the chosen region.
The map should go from the name  of the place to a blinking light on the relevant
map where the place is. This  should include details about the terrain, etc.
We want to classify types of  places<ent value="mdash"></ent>monasteries,
temples, towns, rivers, etc. We can  then shift between different versions
of a map to see only all the monasteries  in a region, etc. We will assign
different colors and symbols for each genre  of place.</item><item>Pictures
of the place</item><item>Geographical description</item><item>Chronological
 sketch of the place in history</item><item>List of all passages which refer
 to this place.</item></list><p>We plan to deal with mythical geography as
 well as historical geography. Problems emerge because boundaries are not
necessarily  clearly fixed and they change over time. Jurisdictional names
have boundaries  and governmental agencies that run them, but other place
names are more amorphous.</p></item><item>Encyclopedia  of Terminology Database: <p>This
will include translation standards, as well  as detailed discussions of individual
terms with histories of words and extended  citations. </p><p>It will also
incorporate illustrations which help to explain  the sense of a word. Examples
would be illustrations of ritual items, and  so forth. In most cases, a representative
image will be given, and if relevant,  there will be links to variant images
in the <ent value="ldquo"></ent>Images Database<ent value="rdquo"></ent>.
It will also include sounds, such as musical instruments,  chanting styles,
etc.</p></item><item>Images Database: <p>This will incorporate  artwork, as
well as various pictures. There could be a video of a ritual located  here. </p></item><item>Clan
database: <p>This will represent Tibetan clans  over time, with the specifics
to emerge as we work with the data. </p></item></list></div5></div4><div4
n="" type=""><head>General Scholarship Files</head><p>These files concern
 all Tibetan and Western scholarship pertaining to the Collections that is
 not dedicated to a particular edition or a particular text. The Western scholarship
 includes a bibliography of published materials and also our own collection
 of electronic essays. See the Scholarship template below for a model for
its  basic design. These documents can have a simple structure in which the
body  of the document is divided into thematic sections containing articles
of different  topics with different scopes. Depending on the number of submissions
received  this categorical, subject hierarchy can be expanded several levels.</p></div4><div4
n="" type=""><head>Edition Catalog and Bibliographic Files</head><div5 n=""
type="subsection"><head>Summary</head><p>Each edition of a Textual Collection
 has its own <ent value="ldquo"></ent>Edition Catalog<ent value="rdquo"></ent>
SGML  file. These files consist of two parts: the bibliographic record for
the edition  and references to the catalog records for each of the volumes
within that  edition. Since each volume<ent value="rsquo"></ent>s catalog
record in turn includes  references to the individual texts<ent value="rsquo"></ent>
bibliographic information,  this will form an Edition\Volume\Text hierarchy
that is the skeletal structure  of the digital edition. Other than this structure,
the main component of the  edition catalog is the edition bibliographic record.
This contains analytical  information about the edition as a whole.  Each
edition is described in terms  of the number of volumes and texts it contains,
its publishing information,  etc.</p><p>In the current project of cataloging <hi>The
Collected Tantras  of the Ancients</hi>, we will deal with 6 editions of the <hi>Collected
Tantras</hi>:   gTing skyes, mTshams brag, sDe dge, Waddell, sKyid grong,
and Bai ro rgyud <ent value="rsquo"></ent>bum. An Edition Catalog will be
generated for each. There  is also a separate <ent value="ldquo"></ent>Edition
Catalog<ent value="rdquo"></ent>  for our <ent value="ldquo"></ent>Master
Catalog of <hi>The Collected Tantras of  the Ancients</hi><ent value="rdquo"></ent>
that has critical editions of each text  drawn from the individual editions.
This catalog will provide tables that  analyze comparatively the separate
information from each edition. Since the <ent value="ldquo"></ent>Master Catalog<ent
value="rdquo"></ent> lacks any physical volumes,  texts are grouped according
to three layers of doxographical categories: (i)  overall tradition (Atiyoga,
Anuyoga, Mah<ent value="amacr"></ent>yoga), (ii) category  (such as <hi rend="italics">sems
sde</hi> within Atiyoga), and (iii) subcategory  (such as <hi rend="italics">sems
sde bco brgyad</hi> within <hi rend="italics">sems  sde</hi>). The first layer
will be delineated by the initial sigla being tripartite  (Ng1=Atiyoga, Ng2=Anuyoga,
Ng3=Mah<ent value="amacr"></ent>yoga). </p><p>Finally,  we are including indexes
of lost editions, which at present is limited to  the Pad ma <ent value="rsquo"></ent>od
gling index by <ent value="rsquo"></ent>Jigs  med gling pa.  These indexes
will be listed under <ent value="ldquo"></ent>Individual  Edition Catalogs<ent
value="rdquo"></ent> in the web site along with the gTing  skyes, etc, but
they will only have the <ent value="ldquo"></ent>Edition Catalog<ent value="rdquo"></ent>, <ent
value="ldquo"></ent>Volume Catalog<ent value="rdquo"></ent>  and <ent value="ldquo"></ent>Text
Catalog<ent value="rdquo"></ent> templates. They  will of course lack the <ent
value="ldquo"></ent>Text Representation<ent value="rdquo"></ent>  and <ent
value="ldquo"></ent>Manuscript Images<ent value="rdquo"></ent> files. </p><p>For
 the four bibliographic records<ent value="mdash"></ent>the Edition, Volume,
Doxographical  and Text<ent value="mdash"></ent>the information can be grouped
into seven classes:</p><list n="1" rend="I"><item>ACKNOWLEDGEMENTS: This details,
in the header of the  SGML, the people who did the work as well as the donors
who paid for it.</item><item>IDENTIFICATION:  This includes all identification
schemes.</item><item>PUBLICATION: This includes  everything relating to the
physical reproduction of the text, i.e. publishing.</item><item>PHYSICAL 
DESCRIPTION: This includes the description of the text as a physical object.</item><item>INTELLECTUAL
 DESCRIPTION: This describes the intellectual content of the texts.</item><item>ORIGINATION:
 This describes the creation of the intellectual content, as opposed to its
 physical reproduction.</item><item>TITLES: This includes all titles of various
 types (text titles, chapter titles, titles of groups of texts).</item></list></div5><div5
n="" type=""><head>Outline</head><div6 n="" type=""><head>Header</head><p>The
 following is in the header of the SGML file rather than the actual body of
 the file. </p><p><hi rend="underline">ACKNOWLEDGEMENTS</hi><list n="1" rend="i"><item><hi
rend="underline">Virtual Donors</hi>: This details individuals or organizations
 that paid for the cataloging of this text in particular rather than for work
 done on an entire canon of which this may be part. <ent value="ldquo"></ent>Virtual<ent
value="rdquo"></ent> plays off the virtual nature of these texts and the traditional
 association of <ent value="ldquo"></ent>virtue<ent value="rdquo"></ent> with
the act  of sponsoring the reproduction of scripture.<list n="1" rend="1"><item><hi
rend="underline">Virtual Donor(s) #1</hi><list n="1" rend="i"><item><hi rend="underline">Name
 of Donor</hi></item><item><hi rend="underline">Date of Donation</hi></item><item><hi
rend="underline">Dedication</hi></item></list></item><item><hi rend="underline">Virtual
 Donor(s) #2</hi> etc.</item></list></item><item><hi rend="underline">Electronic
 Scribes</hi>: This identifies the individual(s) responsible for entering
 and proofreading the text in question.<list n="1" rend="1"><item><hi rend="underline">Scribe
 #1</hi> (or a team of scribes who work together)<list n="1" rend="i"><item><hi
rend="underline">Role</hi></item><item><hi rend="underline">Date of work submission</hi>:
  This is the date when the work is submitted in finished form.</item><item><hi
rend="underline">Name of scribe (s)</hi></item></list></item><item><hi rend="underline">Scribe
 #2</hi> and so forth ...</item></list></item></list></p></div6><div6 n=""
type=""><head>Identification</head><list n="1" rend="i"><item><hi rend="underline">Edition
Sigla</hi></item><item><hi rend="underline">LOC  number new</hi></item><item><hi
rend="underline">LOC number old</hi></item><item><hi rend="underline">Library
class number</hi></item><item><hi rend="underline">ISBN  number</hi></item><item><hi
rend="underline">OCLC number</hi></item></list></div6><div6 n="8.3.3.2" type="subsection"><head>Publication</head><list
n="1" rend="i"><item><hi rend="underline">Contemporary publishing information</hi>:<list><item><hi
rend="underline">Publisher</hi></item><item><hi rend="underline">Publisher
 place</hi></item><item><hi rend="underline">Address of publisher</hi></item><item><hi
rend="underline">Date of publication</hi></item><item><hi rend="underline">Printer</hi>:
 i.e., Jayyed Press, etc.</item><item><hi rend="underline">Discussion of publishing/editing
 process</hi></item><item><hi rend="underline">Discussion of contemporary
publication</hi></item></list></item><item><hi rend="underline">Traditional
publishing information for wood block print</hi>:<list><item><hi rend="underline">Publishing
house</hi></item><item><hi rend="underline">Location  of publishing house</hi></item><item><hi
rend="underline">Tibetan date</hi></item><item><hi rend="underline">Western
date</hi></item><item><hi rend="underline">Editor</hi></item><item><hi rend="underline">Printer</hi></item><item><hi
rend="underline">Patron/sponsor</hi></item><item><hi rend="underline">Discussion
of traditional publishing</hi></item></list></item><item><hi rend="underline">Manuscript
information</hi>:<list><item><hi rend="underline">Archive  where currently
located</hi></item><item><hi rend="underline">Archive where  originally located</hi></item><item><hi
rend="underline">Editor</hi></item><item><hi rend="underline">Scribe</hi></item><item><hi
rend="underline">Tibetan date  of writing</hi></item><item><hi rend="underline">Western
date of writing</hi></item><item><hi rend="underline">Location of manuscript
creation</hi></item><item><hi rend="underline">Patron/sponsor</hi></item><item><hi
rend="underline">Discussion of reproduction of manuscript</hi></item><item><hi
rend="underline">Discussion of manuscript</hi></item></list></item><item><hi
rend="underline">Associated editions</hi></item></list></div6><div6 n="" type=""><head>Physical
 Description</head><list n="1" rend="i"><item><hi rend="underline">Number
of  total volumes</hi>: In the master catalog, this will summarize the information
 on this issue for the individual editions with tables. <list><item>Number
 of volumes associated with Atiyoga</item><item>Number of volumes associated
 with Anuyoga</item><item>Number of volumes associated with Mah<ent value="amacr"></ent>yoga</item><item>Number
 of volumes unclassified</item></list></item><item><hi rend="underline">Number
 of total texts all together</hi>: In the master catalog, this and the other
 categories below will include additional tables of analytical comparisons
 of the different editions indicating the number of texts that are unique,
 shared by X number of editions, and so forth.<list><item>Number of texts
associated  with Atiyoga</item><item>Number of texts associated with Anuyoga</item><item>Number
 of texts associated with Mah<ent value="amacr"></ent>yoga</item><item>Number
of  texts unclassified</item></list></item><item><hi rend="underline">Number
of  total folio sides</hi>: In the master catalog, this will summarize the
information  on this issue for the individual editions with tables. <list><item>Number
 of folio sides associated with Atiyoga</item><item>Number of folio sides
associated  with Anuyoga</item><item>Number of folio sides associated with
Mah<ent value="amacr"></ent>yoga</item><item>Number  of folio sides unclassified</item></list></item><item><hi
rend="underline">Composition  and appearance of the manuscript</hi><list n="1"
rend="1"><item>Genre form  original</item><item>Genre form reproduction</item><item>Paper
and ink</item><item>Dimension</item><item>Standard  lines per page</item><item>Page
formatting</item><item>Illustrations</item><item>Script</item></list></item></list></div6><div6
n="" type=""><head>Intellectual Description</head><list n="1" rend="i"><item><hi
rend="underline">Relation to canon</hi>:  This also requires its own separate
 DISCUSSION field. </item><item><hi rend="underline">Edition doxographical
 category</hi>:  This requires its own separate DISCUSSION field.</item><item><hi
rend="underline">Master doxographical category</hi></item><item> <hi rend="underline">Literary
 genre</hi>: This requires its own separate DISCUSSION field.</item><item> <hi
rend="underline">Subject keywords</hi> </item><item><hi rend="underline">Summary
 of contents, significance and history</hi>: This is a DISCUSSION FIELD.</item></list></div6><div6
n="" type=""><head>Origination</head><list n="1" rend="i"><item><hi rend="underline">Authorship</hi><list
n="1" rend="1"><item><hi rend="underline">Author</hi>: This set of elements
 as a whole requires its own separate DISCUSSION field.<list n="1" rend="i"><item><hi
rend="underline">Author(s)/Speaker(s)</hi></item><item><hi rend="underline">Authorship
 type</hi></item><item><hi rend="underline">Affiliation of author</hi></item><item><hi
rend="underline">Place of composition/delivery</hi></item><item><hi rend="underline">Tibetan
 date of composition</hi></item><item><hi rend="underline">Western date of
 composition</hi></item></list></item><item><hi rend="underline">Others</hi>:
 This set of elements as a whole requires its own separate DISCUSSION field.<list
n="1" rend="i"><item><hi rend="underline">Audience</hi></item><item><hi rend="underline">Requester(s)
 of composition</hi></item><item><hi rend="underline">Redactor(s) of original
 composition</hi></item></list></item></list></item><item><hi rend="underline">Translation</hi>:
 This set of elements as a whole requires its own separate DISCUSSION field.<list
n="1" rend="1"><item><hi rend="underline">Language(s) from which translated</hi></item><item><hi
rend="underline">Translator(s)</hi></item><item><hi rend="underline">Place(s)
 of translation</hi></item><item><hi rend="underline">Tibetan date(s) of translation</hi></item><item><hi
rend="underline">Western date(s) of translation</hi></item><item><hi rend="underline">Redactor(s)
 of translation</hi></item><item><hi rend="underline">Place(s) of redaction
 of translation</hi></item><item><hi rend="underline">Tibetan Date(s) of redaction
 of translation</hi></item><item><hi rend="underline">Western Date(s) of redaction
 of translation</hi></item></list></item><item><hi rend="underline">Transmission</hi><list
n="1" rend="1"><item><hi rend="underline">Transmission status</hi>: This requires
 its own separate DISCUSSION field.</item><item><hi rend="underline">Concealment</hi>:
 This set of elements as a whole requires its own separate DISCUSSION field.<list
n="1" rend="i"><item><hi rend="underline">Concealer(s)</hi></item><item><hi
rend="underline">Place(s) of concealment</hi></item><item><hi rend="underline">Date(s)
 of concealment</hi></item></list></item><item><hi rend="underline">Revelation</hi>:
 In addition to an entry, this has a button for <ent value="ldquo"></ent>further
 discussion<ent value="rdquo"></ent>. <list n="1" rend="i"><item><hi rend="underline">Revealer(s)</hi>(<hi
rend="italics">gter ston</hi>): This set of elements as a whole requires its
 own separate DISCUSSION field.</item><item> <hi rend="underline">Place(s)
 of revelation</hi></item><item> <hi rend="underline">Date(s) of revelation</hi></item></list></item><item><hi
rend="underline">Lineage History</hi>: This is a DISCUSSION field.</item><item><hi
rend="underline">Editors</hi>: This requires its own separate DISCUSSION field.</item></list></item></list></div6><div6
n="" type=""><head>Titles</head><p>The cataloging process involves recording
 a veritable plethora of titles for the different components of a collection.
 The kinds of titles vary depending on the level of the catalog. </p><list
n="1" rend="I"><item>Edition<list n="1" rend="1"><item>Title of the edition</item><item>Oral
 title of the edition</item><item>Title of the edition in secondary literature</item></list></item><item>Volume<list
n="1" rend="1"><item>Volume<ent value="rsquo"></ent>s number</item><item>Volume<ent
value="rsquo"></ent>s Tibetan letter</item><item>Volume<ent value="rsquo"></ent>s
title  or designation (such as, <hi rend="italics">bam po gnyis pa</hi>)</item></list></item><item>Text<list
n="1" rend="1"><item>Normalized title</item><item>Title page title</item><item>Title
 line title</item><item>Titles from ends of chapters</item><item>Titles from
 colophons</item><item>Titles in margins</item><item>Titles from oral sources</item><item>Titles
 from secondary Tibetan literature</item><item>Non-Tibetan titles (Sanskrit,
 Chinese, Khotanese, etc.)</item></list></item><item>Doxography Index</item></list></div6><div6><head>Indexes</head><list
n="1" rend="1"><item>Doxography: This will be a list of all doxographical
 classifications in an edition which will function as links to the Doxography
 Catalog Template.</item><item>Edition: This will be a list of all the volumes
 in an edition and can serve as a link to a particular volume.</item><item>Volume:
 This will be a list of all texts in a particular volume with links to that
 text<ent value="rsquo"></ent>s bibliographic record.</item><item>Text: The
index  of a text is included within its bibliographic record as a list of
its chapter-level  elements, namely the sections of its front, body, and back.
Ultimately, these  will serve as links to digital representations of that
particular section.</item></list></div6></div5></div4><div4 n="" type=""><head>Edition
Scholarship Files</head><div5 n="" type=""><head>Summary</head><p>The  Scholarship
Template is for Tibetan and non-Tibetan scholarship focusing on  the edition
as an edition. This will include all indexes or catalogs of  the edition done
by others.<list n="1" rend="i"><item>Indexes of the edition:  Tibetan or non-Tibetan.</item><item>History
of Edition: This will include  all essays about the history of this edition.
In the master catalog, this  will contain all essays about the history of
the canon. For individual edition  collections, this will have summarized
information about the edition in question. </item></list></p></div5><div5
n="" type=""><head>Outline</head><p>The outline of the scholarship files are
 presently open for consideration. As structured documents, they can be outlined
 according to the needs of scholars. Two of the subcategories, indexes and
 history, are mentioned above; other categories and subcategories could be
 included as needed.</p></div5></div4><div4 n="" type=""><head>Doxographical
 Catalog Files</head><div5 n="8.5.2" type="subsection"><head>Summary</head><p>Each
 doxographical category within a given edition has its own separate <ent value="ldquo"></ent>doxographical
 catalog<ent value="rdquo"></ent> file. This file will be different from the
other  catalog files in that it will not record the physical structure of
a canon  but its intellectual structure. The list of works in the canon will
be divided  according to doxographical category and these categories divided
into sub-categories  and so on. The level corresponding to the volume catalog
level in physical  canons will be a category or sub-category level. Each of
these will have analytical  information about that doxographical classification
as a whole, as well as  a list of sub-categories or texts contained within
that classification.</p><p>Since  the master catalog will lack <ent value="ldquo"></ent>volume<ent
value="rdquo"></ent>  catalogs, its organization will be purely based on doxographical
classifications:  (i) the first layer is the tripartite division into Atiyoga,
Anuyoga and Mah<ent value="amacr"></ent>yoga; (ii) the second layer is divisions
of each of those three;  and (iii) the third layer is subdivisions of those
divisions in turn. This  catalog includes discussion fields for each doxographical
category (from  Atiyoga down to <ent value="ldquo"></ent>The Eighteen Texts<ent
value="rdquo"></ent>  subdivision) that discusses the nature and history of
the category. </p><p>Each  of the  categories and subcategories requires the
fields for <ent value="ldquo"></ent>physical  description<ent value="rdquo"></ent>, <ent
value="ldquo"></ent>intellectual description<ent value="rdquo"></ent>, <ent
value="ldquo"></ent>origination<ent value="rdquo"></ent>, and <ent value="ldquo"></ent>titles<ent
value="rdquo"></ent> as given above for the edition catalog.  It may seem
that some of these are superfluous for subclassifications, but  in fact it
will preserve the ability to document changes that might happen  across subclassifications.
Information in those categories will be filled  in as they apply in a synoptic
way to the given doxographical class as a whole. </p><p>However, <ent value="ldquo"></ent>titles<ent
value="rdquo"></ent> will have a list of all text titles  contained within
that doxographical class that function as links to the texts<ent value="rsquo"></ent>
master records. The master text record will contain the same  information
as a text record for a particular version but generalized information  that
does not relate to specific editions or printings. It will also contain  a
list of cross-references which will act as links to the catalog records  of
the text in individual editions.</p><p>For the master catalogs, there  are
of course no volume bibliographic records. Instead, they will have files 
describing the nature of that category or sub-category containing all information,
 historical and doxographical, that may pertain to it. Thus each category
(such  as Atiyoga), subcategory, and sub-subcategory would have an element
to enter  an official description of it, and then an element to enter individual
essays  on that category. These will be accessible on the Internet site from
the initial  doxographical Table Of Contents by which people enter into lists
of texts<ent value="mdash"></ent>instead of going on to titles, they could
instead go to discussions  of the doxographical classifications. </p><p>The
following is the master doxographical  scheme; each edition will have a partial
variant on this:</p></div5><div5 n="8.5.3" type="subsection"><head>Doxographical
classes</head><list n="1" rend="I"><item>Atiyoga Literature<list n="1" rend="1"><item><hi
rend="underline">Mind  Series (<hi rend="italics">Sems sde</hi>)</hi>: all
of these and the sub-genres  are hot linked.<list n="1" rend="i"><item><hi
rend="underline">The Eighteen  Mind Series Texts</hi></item><item><hi rend="underline">The
All Creating Cycle  (<hi rend="italics">Kun byed skor</hi>)</hi></item><item><hi
rend="underline">Miscellaneous</hi></item></list></item><item><hi rend="underline">Space
Series (<hi rend="italics">kLong sde</hi>)</hi></item><item><hi rend="underline">Experiential
Precept Series (<hi rend="italics">Man ngag  sde</hi>)</hi><list n="1" rend="i"><item><hi
rend="underline">External and  internal cycles (<hi rend="italics">Phyi nang
skor</hi>)</hi></item><item><hi rend="underline">Esoteric Cycles (<hi rend="italics">gSang
skor</hi>)</hi></item><item><hi rend="underline">Seminal Heart (<hi rend="italics">sNying
thig</hi>)</hi></item><item><hi rend="underline">Miscellaneous</hi></item></list></item><item><hi
rend="underline">Crown  Pith (<hi rend="italics">sPyi ti</hi>)</hi></item><item><hi
rend="underline">Ultra  Pith (<hi rend="italics">Yang ti</hi>)</hi><list n="1"
rend="i"><item><hi rend="underline">Brahmin cycles (<hi rend="italics">Bram
ze<ent value="rsquo"></ent>i  skor</hi>)</hi></item><item><hi rend="underline">Padma-related</hi></item><item><hi
rend="underline">Miscellaneous</hi></item></list></item><item><hi rend="underline">Miscellaneous</hi></item></list><p>For
 this and the next two TOCs, clicking directly on a given category will take
 one to a list of texts (see below). However, there will also be a <ent value="ldquo"></ent>discussion<ent
value="rdquo"></ent> option for the main rubric (Atiyoga), and each division
and  subdivision within it. These <ent value="ldquo"></ent>discussion<ent
value="rdquo"></ent>  options take the user to a list of essays on the nature
of these categories  and their contents. </p></item><item>Anuyoga Literature<list
n="1" rend="1"><item><hi rend="underline">The four root s<ent value="umacr"></ent>tras
(<hi rend="italics">rtsa  ba<ent value="rsquo"></ent>i mdo bzhi</hi>)</hi></item><item><hi
rend="underline">The  six tantras clarifying the six limits (<hi rend="italics">mtha<ent
value="rsquo"></ent>  drug gsal bar byed pa<ent value="rsquo"></ent>i rgyud
drug</hi>)</hi></item><item><hi rend="underline">The twelve rare tantras (<hi
rend="italics">dkon rgyud</hi>)</hi></item><item><hi rend="underline">Miscellaneous</hi></item></list></item><item>Mah<ent
value="amacr"></ent>yoga  Literature<list n="1" rend="1"><item><hi rend="underline">Tantra
Series (<hi rend="italics">rGyud sde</hi>)</hi><list n="1" rend="i"><item><hi
rend="underline">The eightfold  set of root <hi rend="italics">Magical Emanation
Tantras</hi> (<hi rend="italics">rtsa  bar gyur sgyu <ent value="rsquo"></ent>phrul
sde brgyad</hi>)</hi></item><item><hi rend="underline">The eighteenfold  set
of explanatory tantras(<hi rend="italics">bshad pa dang cha mthun gyi  rgyud
sde bco brgyad</hi>)</hi> <list n="1" rend="1"><item><hi rend="underline">Enlightened
Body (<hi rend="italics">sku</hi>)</hi></item><item><hi rend="underline">Enlightened
Speech (<hi rend="italics">gsung</hi>)</hi></item><item><hi rend="underline">Enlightened
 Mind (<hi rend="italics">thugs</hi>)</hi></item><item><hi rend="underline">Enlightened
Qualities (<hi rend="italics">yon tan</hi>)</hi></item><item><hi rend="underline">Enlightened
Activities (<hi rend="italics">phrin  las</hi>)</hi></item></list></item><item><hi
rend="underline">Miscellaneous</hi></item></list></item><item><hi rend="underline">Practice
Series of the Eight Proclamation Deities (<hi rend="italics">sGrub  sde bka<ent
value="rsquo"></ent> brgyad</hi>)</hi><list n="1" rend="i"><item><hi rend="underline">The
Practice Series (<hi rend="italics">sgrub sde</hi>)</hi><list n="1" rend="1"><item><hi
rend="underline">Summary of the Highest Intention  (<hi rend="italics">bla
ma dgongs pa <ent value="rsquo"></ent>dus pa</hi>)</hi></item><item><hi rend="underline">Consortium
of Sugatas (<hi rend="italics">bde gshegs <ent value="rsquo"></ent>dus pa</hi>)</hi></item><item><hi
rend="underline">Miscellaneous</hi></item></list></item><item><hi rend="underline">The
Eight Proclamation Deities (<hi rend="italics">bka<ent value="rsquo"></ent>
brgyad</hi>)</hi><list n="1" rend="1"><item><hi rend="underline">The  Ma<ent
value="ntilde"></ent>ju<ent value="sacute"></ent>r<ent value="imacr"></ent>
Cycle on Enlightened Form (<hi rend="italics"><ent value="rsquo"></ent>jam
dpal sku<ent value="rsquo"></ent>i skor</hi>)</hi></item><item><hi rend="underline">The
Lotus Tantras on Enlightened Communication (<hi rend="italics">pad  ma gsung
gi rgyud</hi>)</hi></item><item><hi rend="underline">The Real Tantras  on
Enlightened Mind (<hi rend="italics">yang dag thugs kyi rgyud</hi>)</hi></item><item><hi
rend="underline">The Nectar Tantras on Enlightened Qualities (<hi rend="italics">bdud
 rtsi yon tan gyi rgyud</hi>)</hi></item><item><hi rend="underline">The Sacred
 Dagger Cycle on Enlightened Activities (<hi rend="italics">phrin las phur
 pa<ent value="rsquo"></ent>i skor</hi>)</hi></item><item><hi rend="underline">The
 Cycle on Invoking the Fierce Ma-mo Deities (<hi rend="italics">ma mo rbod
 gtong skor</hi>)</hi></item><item><hi rend="underline">Offerings and Praises
 to Protect the Teachings (<hi rend="italics">bstan srung mchod bstod</hi>)</hi></item><item><hi
rend="underline">The Cycle on Fierce Mantras (<hi rend="italics">drag sngags
 skor</hi>)</hi></item><item><hi rend="underline">Miscellaneous</hi></item></list></item></list></item><item><hi
rend="underline">Miscellaneous</hi></item></list></item><item>Unclassified
 Literature</item></list></div5></div4><div4 n="" type=""><head>Volume Catalog
 Files</head><div5 n="" type=""><head>Summary</head><p>Each volume within
a  given edition has its own separate <ent value="ldquo"></ent>Volume Catalog<ent
value="rdquo"></ent> SGML file. This file has analytical information about
the  volume as a whole (Volume Bibliographic Record), as well as a list of
texts  contained within that volume. The titles for the texts are the normalized
 titles, shared or pulled in from the Text Bibliographic Record itself. Thus,
 if the normalized title is changed for any reason, the change will be reflected
 in the TOC of the volume.</p><p>Since it is a purely electronic edition,
the  master catalogs of course have no <ent value="ldquo"></ent>volumes<ent
value="rdquo"></ent>  and hence lack any <ent value="ldquo"></ent>volume<ent
value="rdquo"></ent> catalog  files. In their stead will be a catalog of the
different doxographical categories,  as discussed in the previous section
on doxographic catalog files.</p></div5><div5 n="" type=""><head>Outline</head><p>Individual
 volumes require the fields for <ent value="ldquo"></ent>physical description<ent
value="rdquo"></ent>, <ent value="ldquo"></ent>intellectual description<ent
value="rdquo"></ent>, <ent value="ldquo"></ent>origination<ent value="rdquo"></ent>,
and <ent value="ldquo"></ent>titles<ent value="rdquo"></ent> as given above
for the Edition catalog. Information in those  categories will be filled in
as they apply in a synoptic way to the volume  as a whole. However, <ent value="ldquo"></ent>titles<ent
value="rdquo"></ent> will have  a list of all text titles contained within
that volume class that function  as links to the corresponding individual
text catalogs. </p></div5></div4></div3><div3 n="" type=""><head>Files Dealing
with Individual Texts</head><div4 n="" type=""><head>Text  Bibliographic Files</head><div5
n="" type=""><head>Summary</head><p>Each individual  text in the Collections
will have five separate SGML documents recording specific  information pertaining
to it: <list n="1" rend="1"><item>Text Bibliographic  File: this is the bibliographical
information concerning the text.</item><item>Text  Representation File: this
has the following subdivisions<list n="1" rend="i"><item>Tibetan  script representation</item><item>Extended
Wylie transliteration</item></list></item><item>Chapter  Summaries</item><item>Scholarship
Template: this contains traditional and  modern scholarship on the text aside
from the chapter summaries mentioned  above. </item></list></p><p>Each text
thus has a separate <ent value="ldquo"></ent>Text  Catalog Entry<ent value="rdquo"></ent>
SGML file. This file has all the bibliographical  information about a given
individual text (TEXT BIBL)<ent value="mdash"></ent>physical  description
of the manuscript, authorship, etc. It also has a separate list  of chapter
titles and pagination. It is thus DESCRIPTIVE in the bibliographical  sense.
Cataloging is a scholarly activity that normalizes data; <ent value="ldquo"></ent>representation<ent
value="rdquo"></ent> however is a scholarly activity that represents data
with  all its original flaws/errors. We have thus kept our bibliographical
description  and representation of a given text in separate files.</p><p>At
present, we  are entering the <hi rend="italics">mTshams <ent value="rsquo"></ent>brag</hi>
 (Tb) edition of <hi>The Collected Tantras of the Ancients</hi>, and we will
 be using those texts as the starting point for the master catalog as well.
 Thus, we are creating these three files for each text in Tb at present. </p></div5><div5
n="" type=""><head>Outline</head><div6><head>Header</head><p>The following
 is in the header of the SGML file rather than the actual body of the file. </p><p><hi
rend="underline">ACKNOWLEDGEMENTS</hi><list n="1" rend="i"><item><hi rend="underline">Virtual
 Donors</hi>: this details individuals or organizations that paid for the
cataloging  of this text in particular, rather than for work done on an entire
canon of  which this may be part. <ent value="ldquo"></ent>Virtual<ent value="rdquo"></ent>
plays  off the virtual nature of these texts and the traditional association
of <ent value="ldquo"></ent>virtue<ent value="rdquo"></ent> with the act of
sponsoring the reproduction  of scripture.<list n="1" rend="1"><item><hi rend="underline">Virtual
Donor(s)  #1</hi><list n="1" rend="i"><item><hi rend="underline">Name of Donor</hi></item><item><hi
rend="underline">Date of Donation</hi></item><item><hi rend="underline">Dedication</hi></item></list></item><item><hi
rend="underline">Virtual Donor(s) #2</hi> etc.</item></list></item><item><hi
rend="underline">Electronic Scribes</hi> : this identifies the individual(s)
 responsible for entering and proofreading the text in question.<list n="1"
rend="1"><item><hi rend="underline">Scribe #1</hi> (or a team of scribes who
 work together)<list n="1" rend="i"><item><hi rend="underline">Role</hi></item><item><hi
rend="underline">Date of work submission</hi>:  this is the date when the
 work is submitted in finished form.</item><item><hi rend="underline">Name
 of scribe (s)</hi></item></list></item><item><hi rend="underline">Scribe
#2</hi>  and so forth ...</item></list></item></list></p></div6><div6><head>Identification</head><list><item><hi
rend="underline">Edition Sigla</hi></item><item><hi rend="underline">Edition
 ID number</hi></item><item><hi rend="underline">Edition doxographical class</hi>:
 this is the numbering scheme we have for representing doxographical class
 according to the edition itself.</item><item><hi rend="underline">Master
doxographical  class</hi>: this is the numbering scheme we have for representing
doxographical  class according to the master catalog.</item><item><hi rend="underline">Volume
 number</hi>: this is a sequential number we assign to each volume in an edition,
 beginning from <ent value="ldquo"></ent>1<ent value="rdquo"></ent>.</item><item><hi
rend="underline">Text number within the volume</hi>: this is a sequential
 number we assign to each text within a given volume beginning from <ent value="ldquo"></ent>1<ent
value="rdquo"></ent>. See <ent value="ldquo"></ent>Authorship, text and classification
 schemes in a Tibetan Buddhist canon<ent value="rdquo"></ent> for a discussion
of  the process by which a given textual unit can be identified as either
an independent  text or a textual division of a larger text which precedes/follows
it.</item><item><hi rend="underline">Volume letter</hi>: this is the Tibetan
letter assigned to  this volume (ka, kha, etc.).</item><item><hi rend="underline">LOC
number old</hi></item><item><hi rend="underline">Library class number</hi></item><item><hi
rend="underline">ISBN  number</hi></item><item><hi rend="underline">OCLC number</hi></item></list></div6><div6><head>Publication</head><note><hi
rend="boldface"><hi rend="italics">NOTE: this section is unnecessary for texts
 within a canon since the publishing information will be in the edition catalog;
 for all other texts, the text catalog will need to record the publishing
 information.</hi></hi></note><p><ent value="ldquo"></ent>Publication<ent
value="rdquo"></ent>  usually contains three elements: where the text was
published, the date it  was published, and who published it. These texts have
complicated pedigrees,  in that in many cases manuscripts or traditional monastic
blockprints were  reproduced in India by <ent value="ldquo"></ent>modern<ent
value="rdquo"></ent> publishers.  Our publication record will document the
whole history of the text. The primary  form available today will be recorded
under a <ent value="ldquo"></ent>msspubldecl<ent value="rdquo"></ent> in SGML
given first, while the previous forms it reproduces  will be recorded under
a <ent value="ldquo"></ent>msspublhist<ent value="rdquo"></ent>  given subsequently.
The latter forms will only record the basic bibliographic  information for
those texts, not all the details of physical description and  so forth.</p><list
n="1" rend="i"><item><hi rend="underline">Contemporary  publishing information</hi>:
this applies when the edition has been published  by contemporary standards
with ISBN, etc., as opposed to just being a local  monastic production. In
the master catalog, this will summarize publishing  of individual editions
overall.<list><item><hi rend="underline">Publisher</hi></item><item><hi rend="underline">Place
of publication</hi></item><item><hi rend="underline">Address  of publisher</hi></item><item><hi
rend="underline">Date of publication</hi></item><item><hi rend="underline">Printer</hi>:
i.e., Jayyed Press, etc.</item><item><hi rend="underline">Discussion  of publishing/editing
process</hi></item><item><hi rend="underline">Discussion  of contemporary
publication</hi></item></list></item><item><hi rend="underline">Traditional
 publishing information for wood block prints</hi>: this applies to texts
published  over time, including the present, by traditional Tibetan monastic
publishing  houses without reference to contemporary publishing norms. It
may be that  (i) and (ii) apply simultaneously, for instance, if a traditionally
published  text is simply reproduced in some fashion in a reprint in India.<list><item><hi
rend="underline">Publishing house</hi></item><item><hi rend="underline">Location
 of publishing house</hi></item><item><hi rend="underline">Tibetan date</hi></item><item><hi
rend="underline">Western date</hi></item><item><hi rend="underline">Editor</hi></item><item><hi
rend="underline">Printer</hi></item><item><hi rend="underline">Patron/sponsor</hi></item><item><hi
rend="underline">Discussion of traditional publishing</hi></item></list></item><item><hi
rend="underline">Manuscript information</hi>: this applies to all handwritten
 manuscripts and to printed texts which have been obtained through an archive.<list><item><hi
rend="underline">Archive where currently located</hi></item><item><hi rend="underline">Archive
 where originally located</hi>: often contemporary archives projects have
gathered  together the contents of many traditional archives. Thus, while
we might have  obtained a copy from the contemporary archive, the important
historical fact  is the identity of the traditional archive.</item><item><hi
rend="underline">Archive  ID number</hi></item><item><hi rend="underline">Editor</hi></item><item><hi
rend="underline">Scribe</hi></item><item><hi rend="underline">Tibetan date
 of writing</hi></item><item><hi rend="underline">Western date of writing</hi></item><item><hi
rend="underline">Location of manuscript creation</hi></item><item><hi rend="underline">Patron/sponsor</hi></item><item><hi
rend="underline">Reproduction of manuscript</hi></item><item><hi rend="underline">Discussion
 of manuscript</hi></item></list></item><item><hi rend="underline"> Associated
 editions</hi>: this is a discussion field that summarizes how this edition
 is related to other editions that it reproduces, or in turn, that reproduce
 it, with identical pagination and so forth. This also discusses any possible
 alterations of the edition in the reproduction processes.</item></list></div6><div6><head>Physical
 Description</head><list n="1" rend="i"><item><hi rend="underline">Pagination</hi><list
n="1" rend="1"><item><hi rend="underline">Pagination</hi>: this specifies
 the page and line number of the text<ent value="rsquo"></ent>s beginning
and end  in the form: 12.3-55.6. This includes all back material of all types.
In general, <ent value="ldquo"></ent>page<ent value="rdquo"></ent> in the
Tibetan Literature Archive refers  to one folio side and uses the pagination
specified by a Western style numeral.  If the text in question lacks such
numbers, it will generally still have a  Tibetan enumeration on the margin,
which, however, considers the front and  back of a folio as a single page.
It is in those cases necessary to specify  if the page in question is the
front or back side (we have not determined  a convention for this yet). For
the Master catalog, the pagination field  will be replaced by the number of <hi>shad</hi>-delimited
lines in that section. </item><item><hi rend="underline">Extent</hi>: this
is the length of the entire text in terms  of the total number of pages. Again,
for traditional Tibetan publications,  a page is one side of a folio. In the
master catalog, this will summarize  the information on this issue for the
individual editions with tables.</item></list></item><item><hi rend="underline">
Composition and appearance of the manuscript</hi>: In the  master catalog,
this will integrate in tabular form all such descriptions  of individual editions.</item><item><hi
rend="underline">Genre form original</hi>:  in the EAD, <ent value="ldquo"></ent>genre
form<ent value="rdquo"></ent> refers to whether  the object is a novel, video,
and so forth. We are adapting this to the following  options: oral form, inscription,
fragment, scroll, handwritten <hi>dpe cha</hi>,  xylograph <hi>dpe cha</hi>,
typeset codex. <ent value="ldquo"></ent>Original<ent value="rdquo"></ent>
refers to the form of the original document.</item><item><hi rend="underline">Genre
form reproduction</hi>: this applies if we don<ent value="rsquo"></ent>t have
the original edition but only some reproduction of it.  This field has the
following options: handwritten copy, electronic copy, microfilm,  microfiche,
35 mm slides/prints, photocopy, audio tape, video tape.</item><item><hi rend="underline">Paper
and ink</hi>: will specify type of paper, color of  ink, etc. The remaining
items are all classified under the SGML element <ent value="ldquo"></ent>physical
facets<ent value="rdquo"></ent> drawn from the EAD. Each  is distinguished
by a type attribute specification.</item><item><hi rend="underline">Dimension</hi>:
 will specify measurements of the page with metric standards.</item><item><hi
rend="underline">Standard lines per page</hi>: this specifies the number of
 lines per standard page (usually 5-8). This requires a separate discussion
 field that discusses how lineation may vary from volume to volume (typically
 it does not); for example, opening pages may have a reduced number of lines
 per page, and so forth.</item><item><hi rend="underline">Page formatting</hi>:
 this details formatting borders, the nature and content of page margins and
 so forth. This will itself be a DISCUSSION field without requiring a discrete
 search on any particular item.</item><item><hi rend="underline">Illustrations</hi>:
 this will have a discrete yes/no option, which will allow people to search
 for all illuminated manuscripts. It requires a separate DISCUSSION field
to  discuss the nature and significance of the illuminations. This would be
summary  in fashion and defer any elaborate discussion to a separate essay
in the scholarship  catalog.</item><item><hi rend="underline">Script</hi>:
this identifies the  script the manuscript is written in. We need a scheme
of options so that this  field can be searched to yield all manuscripts in
a given script. This requires  its own separate DISCUSSION field.</item></list></div6><div6><head>Intellectual
 Description</head><list n="1" rend="i"><item><hi rend="underline">Relation
 to Canon</hi>: Options are as follows: Canon text proper, Dedicated commentary
 on a single canonical text, Dedicated commentary on multiple canonical texts,
 Text containing citations of canonical texts, Anuyoga Tantra, Mah<ent value="amacr"></ent>yoga
 Tantra, Atiyoga Tantra, Atiyoga text (i.e. no citations), Anuyoga text (i.e.
 no citations), Mah<ent value="amacr"></ent>yoga text (i.e. no citations),
General  Canonical Tantric Text (i.e. ranges over more than one of the three
tantric  traditions), Text related to the canonical tradition (i.e. any religious
text  which bears some relationship to the canonical tradition), Canonical
background  (the broadest category, i.e. historical, biographical and other
works which  provide background for understanding the particular canon). This
field will  be of essential use for text catalogs of texts not in the canonical
collection  in question and will also require its own separate DISCUSSION
field. </item><item><hi rend="underline">Edition Doxographical Category</hi>:
this is the intellectual  category corresponding to a type of movement as
specified in the edition.  We are creating a stable scheme of normalized vocabulary
for this element,  using colons to separate subrubrics from the primary rubrics,
e.g. A ti yo  ga: sems sde. This requires its own separate DISCUSSION field.</item><item><hi
rend="underline">Master Doxographical Category</hi>: this is the intellectual
 category corresponding to a type of movement as specified in the master catalog.</item><item><hi
rend="underline">Literary Genre</hi>: this requires its own separate DISCUSSION
 field.</item><item><hi rend="underline">Subject Keywords</hi>: this will
draw  upon our keyword classification field</item><item><hi rend="underline">Summary
 of Contents, Significance and History</hi>: this is a DISCUSSION FIELD that
 offers a brief and standardized orientation to the object of the catalog,
 whether it be an edition, doxography, volume list, or text. Obviously for
 the volume list it will be very short, and for the edition and text it will
 be a little longer.</item></list></div6><div6><head>Origination</head><list
n="1" rend="i"><item><hi rend="underline">Authorship</hi>: When there is no
 specification of a particular item, enter <ent value="ldquo"></ent>Not specified<ent
value="rdquo"></ent>. We will not normalize anything but rather represent
what  is in the text; the authority file acts as a control. Each of these
categories  has an authority-controlled place-field and date-field associated
with it,  and each could contain more than one name and/or an organization
name. An  organization name would be that of a monastery, a sect, a sub-sect
or a generic  group, such as the 100 Deities.<list n="1" rend="1"><item><hi
rend="underline">Author</hi>:  this set of elements as a whole requires its
own separate DISCUSSION field.  This discussion will invariably begin by specifying
the precise page references  in the texts or other sources on which the data
is based. After this opening  data, it will proceed to discuss broader issues
pertaining to these categories  as well as the source of the information.
Thus the fields below will just  have the basic items entered. <list n="1"
rend="i"><item><hi rend="underline">Author(s)/Speaker(s)</hi>:  the name(s)
of the author is filled in if it is a written text, the speaker  if it is
set forth as a transcript of a Buddha<ent value="rsquo"></ent>s oral lecture.
 In our display, we will preface all of these fields with something like <ent
value="ldquo"></ent>Traditionally Ascribed<ent value="rdquo"></ent> etc. to
prevent  people from simply taking these ascriptions at face value when skimming
through  the database.</item><item><hi rend="underline">Authorship type</hi>:
this  concerns the <ent value="ldquo"></ent>type<ent value="rdquo"></ent>
of author: Buddha,  Mythic figure, Indian, Other Ethnicity, Unknown Ethnicity,
Tibetan, Anonymous.</item><item><hi rend="underline">Affiliation of author</hi>:
this is a somewhat miscellaneous  category to put in simple and relevant identifying
information about the author,  such as geographical base, main monastic base,
sectarian affiliation, intellectual  affiliation and so forth. The authority
records will give detailed information  on the author and all other figures,
but this provides a type of <ent value="ldquo"></ent>keywords<ent value="rdquo"></ent>
file about the author.</item><item><hi rend="underline">Place  of composition/delivery</hi></item><item><hi
rend="underline">Tibetan date  of composition</hi>: in all cases, if a date
is unknown we use <ent value="ldquo"></ent>c.<ent value="rdquo"></ent> to
indicate a general date.</item><item><hi rend="underline">Western  date of
composition</hi></item></list></item><item><hi rend="underline">Others</hi>:
 this requires a DISCUSSION field, which will be structured precisely as detailed
 above.<list><item><hi rend="underline">Audience</hi>: if it is a Buddha<ent
value="rsquo"></ent>s oral lecture, this signifies the retinue who are receiving
 the talk.</item><item><hi rend="underline">Requester(s) of composition</hi>:
 this is the person who requested the text<ent value="rsquo"></ent>s composition
 (zhu pa po). In the case of a Buddha<ent value="rsquo"></ent>s oral lecture,
it  might be the follower who asked him to speak on the subject as specified
in  the text<ent value="rsquo"></ent>s narrative opening frame; in an ordinary
treatise,  it might be a follower who urged the author to write the text (generally
specified  in the colophon).</item><item><hi rend="underline">Redactor(s)
of original  composition</hi>: this applies to people involved in the original
redaction  of the text in its native language. </item></list></item></list></item><item><hi
rend="underline">Translation</hi>: this requires a DISCUSSION field, which
 will be structured precisely as detailed above.<list n="1" rend="1"><item><hi
rend="underline">Language(s) from which translated</hi>:</item><item><hi rend="underline">Translator(s)</hi>:
 those who have translated this text from its native language into Tibetan.
 This will include specification of anyone consulted in the translation process.</item><item><hi
rend="underline">Place(s) of translation</hi></item><item><hi rend="underline">Tibetan
 date(s) of translation</hi></item><item><hi rend="underline">Western date(s)
 of translation</hi></item><item><hi rend="underline">Redactor(s) of translation</hi>:
 this applies to people involved in the redaction of the text in translation.</item><item><hi
rend="underline">Place(s) of redaction of translation</hi></item><item><hi
rend="underline">Tibetan Date(s) of redaction of translation</hi></item><item><hi
rend="underline">Western Date(s) of redaction of translation</hi></item></list></item><item><hi
rend="underline">Transmission</hi><list n="1" rend="1"><item><hi rend="underline">Transmission
 status</hi>: This will include options for <ent value="ldquo"></ent><hi>bka<ent
value="rsquo"></ent> ma</hi><ent value="rdquo"></ent>, <ent value="ldquo"></ent><hi>dgongs
 gter</hi><ent value="rdquo"></ent>, etc. We need to have a pop up window
that shows  the User what the range of possibilities are in a type of tree
of variants.  This requires a DISCUSSION field, which will be structured precisely
as detailed  above.</item><item><hi rend="underline">Concealment</hi>: this
requires a  DISCUSSION field, which will be structured precisely as detailed
above.<list n="1" rend="i"><item><hi rend="underline">Concealer(s)</hi>: individuals
who  conceal the text for future revelation.</item><item><hi rend="underline">Place(s)
 of concealment</hi></item><item><hi rend="underline">Date(s) of concealment</hi></item></list></item><item><hi
rend="underline">Revelation</hi>: this requires a DISCUSSION field, which
 will be structured precisely as detailed above. <list n="1" rend="i"><item><hi
rend="underline">Revealer(s)</hi>: individuals who re-reveal the text  after
concealment (<hi rend="italics">gter ston</hi>).</item><item><hi rend="underline">Place(s)
 of Revelation</hi>:</item><item><hi rend="underline">Date(s) of Revelation</hi>:</item></list></item><item><hi
rend="underline">Lineage history</hi>: this is a DISCUSSION field.</item><item><hi
rend="underline">Editors</hi>: this includes editors involved in the production
 of later editions of the text that entailed alteration of the text<ent value="rsquo"></ent>s
 readings in any way, right into the twentieth century. This requires a DISCUSSION
 field.</item></list></item></list></div6><div6><head>Titles</head><list n="1"
rend="i"><item><hi rend="underline">Titles of the text</hi><list n="1" rend="1"><item><hi
rend="underline">Normalized title</hi>: the normalized title is a title we
 have decided on for ease of reference and will be presented up front as the
 primary title. Our general principle is to use the title given in the title
 page or title line, dropping generic elements; if the text is referred to
 in the secondary literature by a standard title, that title should be used. <list
n="1" rend="i"><item><hi rend="underline">Transliteration of title</hi></item><item><hi
rend="underline">Translation of title</hi>: all translations of text and chapter
 titles will be done by the project<ent value="rsquo"></ent>s staff using
the project<ent value="rsquo"></ent>s evolving lexicon of translation standards.
This enables English  language users to navigate the Tibetan Literature Archive
with reliability.</item><item><hi rend="underline">Source of normalized title</hi>:
this will briefly explain  where the title comes from, while the discussion
field below will be the main  location where the text<ent value="rsquo"></ent>s
various titles are analyzed. </item></list></item><item><hi rend="underline">Titles
from the text itself</hi>: this will be repeated for  each title given in
the text<list n="1" rend="i"><item><hi rend="underline">Title  #1</hi>: all
titles must be included in full, including <ent value="ldquo"></ent><hi>zhes
 bya ba</hi><ent value="rdquo"></ent> unless that phrase occurs only at the
very  end of the title. Usually Tibetan texts reiterate the overall text title
at  the end of each chapter, and these can at times be at variance with the
title  page<ent value="rsquo"></ent>s title, or even with each other.<list
n="1" rend="1"><item><hi rend="underline">Division</hi>: Front, Body, or Back.</item><item><hi
rend="underline">Section  number</hi></item><item><hi rend="underline">Section
type</hi></item><item><hi rend="underline">Pagination of reference to title</hi></item><item><hi
rend="underline">Transliteration  of title</hi></item><item><hi rend="underline">Translation
of title</hi></item></list></item><item><hi rend="underline">Title #2</hi>:
etc.</item></list></item><item><hi rend="underline">Title(s)  from Margins</hi>:
usually only one title on all pages, but could be multiple  entries.<list
n="1" rend="i"><item><hi rend="underline">Title #1</hi>:<list n="1" rend="1"><item><hi
rend="underline">Pagination</hi></item><item><hi rend="underline">Transliteration
of title</hi></item><item><hi rend="underline">Translation  of title</hi></item></list></item><item><hi
rend="underline">Title #2</hi>:  etc.</item></list></item><item><hi rend="underline">Titles
from Oral Traditions</hi>:<list n="1" rend="i"><item><hi rend="underline">Title
#1</hi>:<list n="1" rend="1"><item><hi rend="underline">Transliteration of
title</hi></item><item><hi rend="underline">Translation  of title</hi></item><item><hi
rend="underline">Speaker</hi></item><item><hi rend="underline">Source</hi></item><item><hi
rend="underline">Date</hi></item></list></item><item><hi rend="underline">Title
#2</hi>: Etc.</item></list></item><item><hi rend="underline">Titles  from
Secondary Literature</hi>:<list n="1" rend="i"><item><hi rend="underline">Title
 #1</hi>:<list n="1" rend="1"><item><hi rend="underline">Source title</hi>:</item><item><hi
rend="underline">Source author</hi>:</item><item><hi rend="underline">Source
 ID</hi>: this ID will be our ID for that text that will lead to its full
bibliographical  citation in another part of the Tibetan Literature Archive. </item><item><hi
rend="underline">Source  date</hi>: this is enables users to see the chronology
of title changes at  a glance. </item><item><hi rend="underline">Source Pagination</hi>:</item><item><hi
rend="underline">Transliteration of title</hi>:</item><item><hi rend="underline">Translation
 of title</hi>:</item></list></item><item><hi rend="underline">Title #2</hi>:
 etc. </item></list></item><item><hi rend="underline">Non-Tibetan Titles</hi>: <list
n="1" rend="i"><item><hi rend="underline">Source of title</hi>: this records
 the source for the non-Tibetan titles, which usually come from the text<ent
value="rsquo"></ent>s own opening title line but sometimes come from other
sources.</item><item><hi rend="underline">Language of title</hi></item><item><hi
rend="underline">Transliteration  of title</hi></item><item><hi rend="underline">Translation
of title</hi></item></list></item><item><hi rend="underline">Discussion of
all titles</hi>: this is a discussion field  where all title variants and
their significance can be analyzed.</item></list></item><item><hi rend="underline">Titles
of chapter-level sections</hi><lb></lb><note><hi rend="italics"><hi rend="boldface">NOTE:
each chapter-level section of the text, whether Front,  Body, or Back, is
included herein. The division specifies Front, Body or Back,  and then the
type-attribute specifies which type of chapter-level section  it is within
that division</hi></hi></note><list n="1" rend="1"><item><hi rend="underline">Summary
of text<ent value="rsquo"></ent>s sections</hi>: this  has a single discussion
section for all the subcategories listed below. This  will include a discussion
of all relevant issues pertaining to the structure  of the text, including
whether chapters are grouped together into distinctive  sets and so forth.<list
n="1" rend="i"><item><hi rend="underline">Number of  chapters</hi>: this is
listed first since it is the most important thing for  users. In most cases
it will be the same as the number of sections in the  body, but if the body
has title pages, or inter-chapter notes, it will not  be. Thus it is listed
separately here.</item><item><hi rend="underline">Name  for chapters</hi>:
this specifies the term used to signify chapter-like divisions  of the body,
i.e. <ent value="ldquo"></ent><hi>le<ent value="rsquo"></ent>u</hi><ent value="rdquo"></ent>, <ent
value="ldquo"></ent><hi>rtog pa</hi><ent value="rdquo"></ent>,  etc.</item><item><hi
rend="underline">Front number of sections</hi>: if there  are no internal
sections, then there will just be the number <ent value="ldquo"></ent>1<ent
value="rdquo"></ent>. If there is no <ent value="ldquo"></ent>Front<ent value="rdquo"></ent>,
 or <ent value="ldquo"></ent>Back<ent value="rdquo"></ent> at all, then <ent
value="ldquo"></ent>0<ent value="rdquo"></ent> should be entered. </item><item><hi
rend="underline">Body  number of sections</hi></item><item><hi rend="underline">Back
number of sections</hi></item><item><hi rend="underline">Name for sections</hi></item></list></item><item><hi
rend="underline">Index  of section titles</hi><list n="1" rend="i"><item><hi
rend="underline">Individual  chapter-level section #1</hi><list n="1" rend="1"><item><hi
rend="underline">Division</hi>:  i.e. Front, Body or Back.</item><item><hi
rend="underline">Section type</hi>:  the specifies the type of chapter-level
element it is within the overall division  specified above<ent value="mdash"></ent>i.e. <ent
value="ldquo"></ent>title line<ent value="rdquo"></ent>, <ent value="ldquo"></ent>author<ent
value="rsquo"></ent>s colophon<ent value="rdquo"></ent>, etc.</item><item><hi
rend="underline">Identification Number</hi>:  this identifies the division
(a=Front, b=Body, c=Back), and then the sequence  of the chapter-level element
within the division (#1, #2, etc.). Thus <ent value="ldquo"></ent>a1<ent value="rdquo"></ent>,
or <ent value="ldquo"></ent>b4<ent value="rdquo"></ent>,  and so forth.</item><item><hi
rend="underline">Designation</hi>: sometimes  the Tibetan text assigns letters
to the various sections of the text.</item><item><hi rend="underline">Pagination</hi>:
this provides the full pagination from beginning  page.line to end page.line,
i.e. <ent value="ldquo"></ent>4.3-18.4<ent value="rdquo"></ent>.</item><item><hi
rend="underline">Section name</hi>: this specifies how these types of sections
 are referred to by name in the Tibetan text itself. It thus corresponds precisely
 to our scheme of <ent value="ldquo"></ent>section types<ent value="rdquo"></ent>.
For  example, a Tibetan text may have any one of a number of synonyms for
chapters: <ent value="ldquo"></ent><hi>le<ent value="rsquo"></ent>u</hi><ent
value="rdquo"></ent>, <ent value="ldquo"></ent><hi>rtog pa</hi><ent value="rdquo"></ent>, <ent
value="ldquo"></ent><hi>mchod  brjod</hi><ent value="rdquo"></ent>, etc. If
there are multiple terms, separate  each with a <ent value="ldquo"></ent><hi>shad</hi><ent
value="rdquo"></ent>. If there  is no name specified<ent value="mdash"></ent>which
is frequent in the Front and  Back<ent value="mdash"></ent>one enters <ent
value="ldquo"></ent>unnamed<ent value="rdquo"></ent>  (<hi rend="italics">ming
med</hi>).</item><item><hi rend="underline">Section  title (in Tibetan)</hi>:
this should include <ent value="ldquo"></ent><hi>ste dang  po</hi><ent value="rdquo"></ent>, <ent
value="ldquo"></ent><hi>ste gnyis pa</hi><ent value="rdquo"></ent>, and so
forth. Such information may be relevant, and thus  should not be thought of
as superfluous since we are including the section  number separately above.</item><item><hi
rend="underline">Section title (English  translation)</hi></item><item><hi
rend="underline">Page of section name/title</hi>:  this specifies the page
and line number from which the title is taken, e.g.  58.3.</item><item><hi
rend="underline">Subject keywords</hi></item></list></item><item><hi rend="underline">Individual
chapter-level section #2</hi>: etc.</item></list></item></list></item></list></div6></div5></div4><div4
n="" type=""><head>Text Representation Files</head><div5 n="" type=""><head>Summary</head><p>Each
 text then has a separate <ent value="ldquo"></ent>Text Representation<ent
value="rdquo"></ent>  SGML file. This also contains the Text BIBL<ent value="mdash"></ent>the <ent
value="ldquo"></ent>description<ent value="rdquo"></ent> of the text<ent value="mdash"></ent>which
is pulled in from the <ent value="ldquo"></ent>Text Catalog Entry<ent value="rdquo"></ent>.
However, it primarily  contains the <ent value="ldquo"></ent>representation<ent
value="rdquo"></ent> of the  text in the form of an SGML file containing the
text<ent value="rsquo"></ent>s  transcription. The text is entered using Extended-Wylie
but will be <ent value="ldquo"></ent>published<ent value="rdquo"></ent> on
the Web in Tibetan script.</p></div5><div5 n="" type=""><head>Outline</head><p>The
 following is an outline of the material available for use in reproducing/representing
 a text beyond a mere catalog or description of it. In addition to indicating
 the content of the information, the following also indicates to some degree
 how this information is encoded in the SGML. In particular, we are representing
 the text through a series of <ent value="ldquo"></ent>DIV<ent value="rdquo"></ent>
(=<ent value="ldquo"></ent>division<ent value="rdquo"></ent>) tags which are
numbered to represent  a nested hierarchy of information.</p><head>Header</head><p>The
following  is in the header of the SGML file. </p><div6><head>Acknowledgements</head><p><list
n="1" rend="i"><item><hi rend="underline">Virtual Donors</hi>: this details
 individuals or organizations that paid for the representation of this text. <ent
value="ldquo"></ent>Virtual<ent value="rdquo"></ent> plays off the virtual
nature of  these texts and the traditional association of <ent value="ldquo"></ent>virtue<ent
value="rdquo"></ent> with the act of sponsoring the reproduction of scripture.<list
n="1" rend="1"><item><hi rend="underline">Virtual Donor(s) #1</hi><list n="1"
rend="i"><item><hi rend="underline">Name of Donor</hi></item><item><hi rend="underline">Date
 of Donation</hi></item><item><hi rend="underline">Dedication</hi></item></list></item><item><hi
rend="underline">Virtual Donor(s) #2</hi> etc.</item></list></item><item><hi
rend="underline">Electronic Scribes</hi> : this identifies the individual(s)
 responsible for entering and proofreading the text in question.<list n="1"
rend="1"><item><hi rend="underline">Scribe #1</hi> (or a team of scribes who
 work together)<list n="1" rend="i"><item><hi rend="underline">Role</hi></item><item><hi
rend="underline">Date of work submission</hi>:  this is the date when the
 work is submitted in finished form.</item><item><hi rend="underline">Name
 of scribe (s)</hi></item></list></item><item><hi rend="underline">Scribe
#2</hi>  and so forth ...</item></list></item></list></p></div6><div6><head>Identification</head><p>This
 only requires the text identification number.</p></div6><div6><head>Transliteration</head><list
n="1" rend="i"><item><hi rend="underline">TITLE</hi>: this is our own overarching
 normalized title inserted as a header for the text.</item><item><hi rend="underline">FRONT</hi>:
 the Front, Body, and Back overall are each DIV 1, with the TYPE = Front,
Body,  or Back.<list n="1" rend="1"><item><hi rend="underline">Individual
Chapter-level  section in the Front</hi>(<hi rend="italics">one for each section</hi>):
each  individual section of the Front is a <hi rend="boldface">DIV 2</hi>,
with  the type = one of the possible chapter-level elements/sections for the
Front  section. The different possible chapter-level section types in a FRONT
section  are as follows: title page, title line, homage, statement of intent, <ent
value="ldquo"></ent>ordinary introductory scene<ent value="rdquo"></ent> (<hi
rend="italics">thun  mong gleng gzhi</hi>), <ent value="ldquo"></ent>extraordinary
introductory scene<ent value="rdquo"></ent> (<hi rend="italics">thon mong
ma yin pa<ent value="rsquo"></ent>i  gleng gzhi</hi>), untitled introduction,
outline, and miscellaneous. Each  of the chapter-level elements of the front
is assigned a section number, beginning  with <ent value="ldquo"></ent>1<ent
value="rdquo"></ent> and proceeding sequentially.<lb></lb><hi rend="underline">Transliteration
of Section</hi>: this is the actual Tibetan  text itself, in which the following
are marked via SGML (this applies for  all transliteration mentioned below
as well): <list rend="u"><item>we insert  names before each chapter-level
section. We will use our <ent value="ldquo"></ent>types<ent value="rdquo"></ent>
as the headers, while for chapters we will say <ent value="ldquo"></ent>chapter
 1: [insert title]<ent value="rdquo"></ent>. These headers will thus be displayed
 to users. For chapters in the Body, this is the actual title of the chapter
 usually specified at its end. If there is no title specified, one enters <ent
value="ldquo"></ent>untitled chapter<ent value="rdquo"></ent> (<hi rend="italics">le<ent
value="rsquo"></ent>u mtshan med</hi>). </item><item>milestone tags are used
to  record the pagination and lineation of the original printed document.</item><item>each <hi>shad</hi>-delimited
 line of the text is given a sequential number beginning at <ent value="ldquo"></ent>one<ent
value="rdquo"></ent>, beginning anew for each chapter-level element. </item><item>each <hi>shad</hi>-delimited
 line is given a sequential <ent value="ldquo"></ent>concordance<ent value="rdquo"></ent>
 number that is inserted in all editions and the critical edition.</item><item>each
 sub-chapter level section identified as separate (i.e. <hi>sa bcad</hi>)
is  given a label identifying it.</item><item>prose is identified as prose,
verse  as verse, lists as list.</item><item>we are using <ent value="ldquo"></ent>drama<ent
value="rdquo"></ent> tags to mark speakers and narrative comments in tantras/s<ent
value="umacr"></ent>tras.</item></list></item><item><hi rend="underline">Individual
 chapter-level section in the Front</hi> (#2, and so forth...)</item></list></item><item><hi
rend="underline">BODY</hi>: the Front, Body, and Back overall are each <hi
rend="boldface">DIV 1</hi>, with the TYPE = Front, Body, or Back.<list n="1"
rend="1"><item><hi rend="underline">Individual Chapter-level section in the
 Body</hi> (<hi rend="italics">one for each section</hi>): each individual
 section of the Body is a <hi rend="boldface">DIV 2</hi>, with the type =
one  of the possible chapter-level elements/sections for the Body section.
The  different possible chapter-level section types in a Body section are
as follows:  chapter, interchapter section, chapter-title page, and miscellaneous. <ent
value="ldquo"></ent>Interchapter section<ent value="rdquo"></ent> is when
you have text  located between enumerated chapters, such as a brief specification
of how  chapters are grouped together, or longer discussion of some type. <ent
value="ldquo"></ent>Chapter-title  page<ent value="rdquo"></ent> appears when
each chapter is preceded by a title  page.<lb></lb>Each of the chapter-level
elements of the Body is assigned a section  number, beginning with <ent value="ldquo"></ent>1<ent
value="rdquo"></ent> and proceeding  sequentially. This means in certain instances
our enumeration of chapter-level  elements in the Body  may not correspond
to the text<ent value="rsquo"></ent>s  own enumeration of its chapters. Certain
texts begin, for instance, with a  number other than <ent value="ldquo"></ent>1<ent
value="rdquo"></ent> (e.g., a text  where chapter 56 is the first chapter).
Or, a text may erroneously reduplicate  or skip a number in its overall enumeration
(i.e. chapter 44, chapter 46 without  ever having chapter 45). In these situations,
though our enumeration would  be out of synch with chapter enumeration, it
would indicate the sequential  position of the chapter in the Body of the
text.<lb></lb><hi rend="underline">Transliteration  of Section</hi>: this
is the actual Tibetan text itself. See above for details.</item><item><hi
rend="underline">Individual chapter-level section in the Body</hi>: (#2, and
 so forth...)</item></list></item><item><hi rend="underline">BACK</hi>: the
 Front, Body, and Back overall are each DIV 1, with the TYPE = Front, Body,
 or Back.<list n="1" rend="1"><item><hi rend="underline">Individual Chapter-level
 section in the Back</hi> (<hi rend="italics">one for each section</hi>):
each  individual section of the Back is a DIV 2, with the type = one of the
possible  chapter-level elements/sections for the Back section. The different
possible  chapter-level section types in a Body section are as follows: closing
section  (i.e. that which is between the final chapter and <hi rend="italics">rdzogs
 so</hi>), author<ent value="rsquo"></ent>s colophon, translator<ent value="rsquo"></ent>s
 colophon, lineage transmission, reviser<ent value="rsquo"></ent>s colophon,
editorial  colophon, scribal colophon, printers colophon, concluding prayer,
closing  invocations (i.e.for final mantric particles or invocations like <ent
value="ldquo"></ent><hi rend="italics">rgya rgya rgya</hi><ent value="rdquo"></ent>, <ent
value="ldquo"></ent><hi rend="italics">sarva mangalam</hi><ent value="rdquo"></ent>,
etc.), and miscellaneous. <lb></lb>Each  of the chapter-level elements of
the Back is assigned a section number, beginning  with <ent value="ldquo"></ent>1<ent
value="rdquo"></ent> and proceeding sequentially.<lb></lb><hi rend="underline">Transliteration
of Section</hi>: this is the actual Tibetan  text itself. See above for details.</item><item><hi
rend="underline">Individual  chapter-level section in the Back</hi> (#2, and
so forth...)</item></list></item></list></div6></div5></div4><div4 n="" type=""><head>Text
Translation Files</head><div5 n="" type=""><head>Summary</head><p>These  files
contain translations of individual chapter-level sections by contemporary
 scholars.</p></div5><div5 n="" type=""><head>Outline</head><head>Header</head><p>The
 following is in the header of the SGML file.</p><div6><head>Acknowledgements</head><p><list
n="1" rend="i"><item>i. <hi rend="underline">Virtual Donors</hi>: this details
 individuals or organizations that paid for the translation of this text. <ent
value="ldquo"></ent>Virtual<ent value="rdquo"></ent> plays off the virtual
nature of  these texts and the traditional association of <ent value="ldquo"></ent>virtue<ent
value="rdquo"></ent> with the act of sponsoring the reproduction of scripture.<list
n="1" rend="1"><item><hi rend="underline">Virtual Donor(s) #1</hi><list n="1"
rend="i"><item><hi rend="underline">Name of Donor</hi></item><item><hi rend="underline">Date
 of Donation</hi></item><item><hi rend="underline">Dedication</hi></item></list></item><item><hi
rend="underline">Virtual Donor(s) #2</hi> etc.</item></list></item><item><hi
rend="underline">Electronic Scribes</hi> : this identifies the individual(s)
 responsible for entering and proofreading the text in question.<list n="1"
rend="1"><item><hi rend="underline">Scribe #1</hi> (or a team of scribes who
 work together)<list n="1" rend="i"><item><hi rend="underline">Role</hi></item><item><hi
rend="underline">Date of work submission</hi>:  this is the date when the
 work is submitted in finished form.</item><item><hi rend="underline">Name
 of scribe (s)</hi></item></list></item><item><hi rend="underline">Scribe
#2</hi>  and so forth ...</item></list></item></list></p></div6><div6><head>Identification</head><list
n="1" rend="i"><item><hi rend="underline">Identification Number</hi></item><item><hi
rend="underline">Title of Text in Question</hi></item><item><hi rend="underline">Editions
 this translation is based on</hi></item></list></div6><div6><head>Translation</head><p>This
 mirrors the text representation above.</p><list n="1" rend="i"><item><hi
rend="underline">Translation  of Individual Front Section</hi> (<hi rend="italics">one
for each section</hi>):<list n="1" rend="1"><item><hi rend="underline">Section
Number</hi></item><item><hi rend="underline">Section Type</hi></item><item><hi
rend="underline">Section  Name</hi></item><item><hi rend="underline">Name
of Translator</hi></item><item><hi rend="underline">Date of Translation</hi></item><item><hi
rend="underline">Translation  itself</hi></item></list></item><item><hi rend="underline">Translation
of  Individual Body Section</hi> (<hi rend="italics">one for each section</hi>):<list
n="1" rend="1"><item><hi rend="underline">Section Number</hi></item><item><hi
rend="underline">Section Type</hi></item><item><hi rend="underline">Section
 Name</hi></item><item><hi rend="underline">Section Title</hi></item><item><hi
rend="underline">Name of Translator</hi></item><item><hi rend="underline">Date
 of Translation</hi></item><item><hi rend="underline">Translation itself</hi></item></list></item><item><hi
rend="underline">Translation of Individual Back Section</hi> (<hi rend="italics">one
 for each section</hi>):<list n="1" rend="1"><item><hi rend="underline">Section
 Number</hi></item><item><hi rend="underline">Section Type</hi></item><item><hi
rend="underline">Section Name</hi></item><item><hi rend="underline">Name of
 Translator</hi></item><item><hi rend="underline">Date of Translation</hi></item><item><hi
rend="underline">Translation itself</hi></item></list></item></list></div6></div5></div4><div4
n="" type=""><head>Text Chapter Summaries Files</head><div5 n="" type=""><head>Summary</head><p>These
 files contain summaries of individual chapter-level sections by contemporary
 scholars.</p></div5><div5 n="" type=""><head>Outline</head><head>Header</head><p>The
 following is in the header of the SGML.</p><div6><head>Acknowledgements</head><p><list
n="1" rend="i"><item><hi rend="underline">Virtual Donors</hi>: this details
 individuals or organizations that paid for the summarizing of this chapter
 in particular. <ent value="ldquo"></ent>Virtual<ent value="rdquo"></ent>
plays off the  virtual nature of these texts, and the traditional association
of <ent value="ldquo"></ent>virtue<ent value="rdquo"></ent> with the act of
sponsoring the reproduction of scripture.<list n="1" rend="1"><item><hi rend="underline">Virtual
Donor(s) #1</hi><list n="1" rend="i"><item><hi rend="underline">Name of Donor</hi></item><item><hi
rend="underline">Date  of Donation</hi></item><item><hi rend="underline">Dedication</hi></item></list></item><item><hi
rend="underline">Virtual Donor(s) #2</hi> etc.</item></list></item><item><hi
rend="underline">Electronic Scribes</hi> : this identifies the individual(s)
 responsible for entering and proofreading the text in question.<list n="1"
rend="1"><item><hi rend="underline">Scribe #1</hi> (or a team of scribes who
 work together)<list n="1" rend="i"><item><hi rend="underline">Role</hi></item><item><hi
rend="underline">Date of work submission</hi>:  this is the date when the
 work is submitted in finished form.</item><item><hi rend="underline">Name
 of scribe (s)</hi></item></list></item><item><hi rend="underline">Scribe
#2</hi>  and so forth ...</item></list></item></list></p></div6><div6><head>Identification</head><list
n="1" rend="i"><item><hi rend="underline">Identification Number</hi></item><item><hi
rend="underline">Title of Text in Question</hi></item><item><hi rend="underline">Editions
 this summary is based on</hi></item></list></div6><div6><head>Section data</head><list
n="1" rend="i"><item><hi rend="underline">Division</hi>: Front, Body or Back.</item><item><hi
rend="underline">Chapter-level section number</hi></item><item><hi rend="underline">Chapter-level
 section type</hi></item></list></div6><div6><head>Summary</head><p><hi rend="italics">There
 will be only one <ent value="ldquo"></ent>summary<ent value="rdquo"></ent>,
and it is  intended as a normative summary for reference. We will reserve
the right to  substitute a new summary for an old one.</hi></p><list n="1"
rend="i"><item><hi rend="underline">Author of summary</hi></item><item><hi
rend="underline">Date</hi></item><item><hi rend="underline">Summary of Section</hi></item></list></div6><div6><head>Analysis</head><p><hi
rend="italics">This contains all other analysis of the chapter by itself.</hi></p><list
n="1" rend="i"><item><hi rend="underline">Analysis #1</hi><list n="1" rend="1"><item><hi
rend="underline">Author of analysis</hi></item><item><hi rend="underline">Date</hi></item><item><hi
rend="underline">Analysis of Section</hi></item></list></item><item><hi rend="underline">Analysis
 #2</hi>: etc...</item></list></div6></div5></div4><div4 n="" type=""><head>Folio
 Image Files</head><div5 n="" type=""><head>Summary</head><p>These are actually
 JPEG images of the original documents that are referred to by the <ent value="ldquo"></ent>Text
 Representation<ent value="rdquo"></ent> SGML document but do not themselves
contain  any SGML markup. Of course, there are no particular document images
associated  with the master catalog. </p></div5><div5 n="" type=""><head>Outline</head><p>Since
 this is not a SGML file, there is no outline. It does have a header, however,
 which we need to work out. This must include the following:</p><div6><head>Acknowledgements</head><p><list
n="1" rend="i"><item><hi rend="underline">Virtual Donors</hi>: this details
 individuals or organizations that paid for the scanning of this text. <ent
value="ldquo"></ent>Virtual<ent value="rdquo"></ent> plays off the virtual
nature of  these texts, and the traditional association of <ent value="ldquo"></ent>virtue<ent
value="rdquo"></ent> with the act of sponsoring the reproduction of scripture.<list
n="1" rend="1"><item><hi rend="underline">Virtual Donor(s) #1</hi><list n="1"
rend="i"><item><hi rend="underline">Name of Donor</hi></item><item><hi rend="underline">Date
 of Donation</hi></item><item><hi rend="underline">Dedication</hi></item></list></item><item><hi
rend="underline">Virtual Donor(s) #2</hi> etc.</item></list></item></list></p></div6></div5></div4><div4
n="" type=""><head>Text Scholarship Files</head><div5 n="" type=""><head>Summary</head><p>The
 Scholarship Template is for Tibetan and non-Tibetan scholarship focusing
on  a particular text in <hi>The Collected Tantras of the Ancients</hi>. This
 is a single SGML file that has four components and which is only present
for  the master catalog.</p><list n="1" rend="i"><item><hi rend="underline">Citations
 of the text in Tibetan language texts</hi>: this includes references to the
 text by title and actual citations of the text found in traditional Tibetan
 language commentaries. The references and citations provide information as
 the dating and use of these texts; the citations often include exegesis which
 helps understand how passages were interpreted; and citations provide independent
 witnesses for variant readings (<hi rend="italics">lemma</hi>).</item><item><hi
rend="underline">Commentarial Literature on the text in Tibetan language texts</hi>:
 this is a bibliography of actual commentaries on the text in traditional
Tibetan  language literature. It is distinguished from (i) in that these are
commentaries  explicitly focused on this particular text, rather than just
citations or  references.</item><item><hi rend="underline">References to the
text in non-Tibetan  language texts</hi>: this is an analytical bibliography
of all contemporary  non-Tibetan publications referring to the text, whether
incidentally or centrally.  It includes full translations of the text in question,
as well as book-length  studies of the text. It thus offers a bibliography
of references in contemporary  secondary literature to this text. In addition,
it provides page references;  and when translations or analysis of particular
passages are offered, it also  provides page references back to the original
passages of the Tibetan text.  Finally, it provides summaries of the significance
of each of the secondary  materials in question.</item><item><hi rend="underline">Electronic
essays  on the text in non-Tibetan languages</hi>: these are full essays on
the text  either written for our Tibetan Literature Archive or republished
in our Collections  from previous print publications.</item></list></div5><div5
n="" type=""><head>Outline</head><div6><head>Citations  in Other Tibetan Texts</head><list
n="1" rend="i"><item><hi rend="underline">Exegetical  text #1</hi><list n="1"
rend="1"><item><hi rend="underline">Identification</hi><list n="1" rend="i"><item><hi
rend="underline">Identification number of citation  source</hi>: all bibliographical
information will be obtained from its entry  in the <ent value="ldquo"></ent>Catalog
of Tibetan-language related collections  and commentarial literature<ent value="rdquo"></ent>.</item><item><hi
rend="underline">Identification  number of cited text</hi></item></list></item><item><hi
rend="underline">Title</hi><list n="1" rend="i"><item><hi rend="underline">Title
of Source in Tibetan</hi></item><item><hi rend="underline">Title of Source
in English</hi></item><item><hi rend="underline">Title  by which text is cited</hi></item><item><hi
rend="underline">Normalized title  of text cited</hi></item></list></item><item><hi
rend="underline">Intellectual  description</hi><list n="1" rend="i"><item><hi
rend="underline">Doxographical  category of citation source</hi></item></list></item><item><hi
rend="underline">Origination</hi><list n="1" rend="i"><item><hi rend="underline">Author</hi>
(including Name, Organization,  Date, and Place)</item><item><hi rend="underline">Translator</hi>
(including  Name, Organization, Date, and Place)</item><item><hi rend="underline">Redactor</hi>
 (including Name, Organization, Date, and Place)</item></list></item><item><hi
rend="underline">Transliteration of Citation/Exegesis</hi>: this includes
 not just the citation but also framing exegetical comments, if present. If
 the commentary cites a passage in full, that citation will be given, offering
 an independent reading of that portion of the text. Or the passage may only
 make reference to the text without citation.<list n="1" rend="i"><item><hi
rend="underline">Pagination in Source</hi></item><item><hi rend="underline">Pagination
 of Original Cited</hi></item><item><hi rend="underline">Passage in source</hi></item></list></item><item><hi
rend="underline">Translation of Citation/Exegesis</hi><list n="1" rend="i"><item><hi
rend="underline">Translator<ent value="rsquo"></ent>s Name</hi></item><item><hi
rend="underline">Date</hi></item><item><hi rend="underline">Translation itself</hi></item></list></item><item><hi
rend="underline">Analysis of Citation/Exegesis</hi><list n="1" rend="i"><item><hi
rend="underline">Analysis Author</hi></item><item><hi rend="underline">Date</hi></item><item><hi
rend="underline">Analysis itself</hi></item></list></item></list></item><item><hi
rend="underline">Exegetical text #2</hi>: and so forth.</item></list></div6><div6><head>Tibetan
 Commentarial Literature</head><p>This will only list texts which are in some
 sense substantial commentaries on this text and does not include texts that
 simply cite it in passing.</p><list n="1" rend="1"><item><hi rend="underline">Text
 #1</hi><list n="1" rend="i"><item><hi rend="underline">Title</hi></item><item><hi
rend="underline">Author</hi></item><item><hi rend="underline">ID</hi>: this
 ID will be from our catalog of Tibetan-language related collections and 
commentarial literature. Thus users can access the full bibliographical data
 for this text. </item><item><hi rend="underline">Type of commentary</hi>:
 this will be from our scheme of normalized vocabulary describing different
 types of commentaries.</item><item><hi rend="underline">Discussion of commentarial
 relation</hi>: this will be a discussion field which will analyze the relationship
 of this commentarial text to the text in question. </item></list></item><item><hi
rend="underline">Text #2</hi>: and so forth.</item></list></div6><div6><head>Non-Tibetan
 References</head><p>These are references and translations not included in
 our Collections. This would be a bibliography of modern scholarship that
discusses  or translates <hi>The Collected Tantras of the Ancients</hi> text
in question.</p><list n="1" rend="i"><item><hi rend="underline">Text #1</hi><list
n="1" rend="1"><item><hi rend="underline">Identification</hi><list n="1" rend="i"><item><hi
rend="underline">ID  number of source</hi></item><item><hi rend="underline">ID
number of cited  text</hi></item></list></item><item><hi rend="underline">Bibliographical
information</hi>:  the full bibliographical data will be pulled in from the <ent
value="ldquo"></ent>Collection  of non-Tibetan language commentarial literature<ent
value="rdquo"></ent> via the  ID number.<list n="1" rend="i"><item><hi rend="underline">Title
of Non-Tibetan  Source</hi></item><item><hi rend="underline">Tibetan title
of cited text</hi></item><item><hi rend="underline">English title of cited
text</hi></item></list></item><item><hi rend="underline">Origination</hi><list
n="1" rend="i"><item><hi rend="underline">Author</hi></item><item><hi rend="underline">Affiliation
of author</hi></item><item><hi rend="underline">Date</hi></item></list></item><item><hi
rend="underline">References</hi><list n="1" rend="i"><item><hi rend="underline">Pagination
 in Source</hi></item><item><hi rend="underline">Pagination in Original</hi></item></list></item><item><hi
rend="underline">Passages</hi><list n="1" rend="i"><item><hi rend="underline">Passage
 #1</hi><list n="1" rend="1"><item><hi rend="underline">Title by which text
 is referred</hi></item><item><hi rend="underline">The passage itself</hi></item><item><hi
rend="underline">Author of analysis</hi></item><item><hi rend="underline">Date</hi></item><item><hi
rend="underline">Analysis of passage</hi></item></list></item><item><hi rend="underline">Passage
 #2</hi>: (if there are multiple passages), etc...</item></list></item></list></item><item><hi
rend="underline">Text #2</hi>: etc...</item></list></div6><div6><head>Non-Tibetan
 electronic essays on the Text</head><list n="1" rend="i"><item><hi rend="underline">Essay
 #1<list n="1" rend="1"><item><hi rend="underline">Title of Essay</hi></item><item><hi
rend="underline">Date of Essay</hi></item><item><hi rend="underline">Author
 of Essay </hi></item><item><hi rend="underline">Author<ent value="rsquo"></ent>s
 Organization and Address</hi></item><item><hi rend="underline">Previous publication
 details</hi>: this applies if the article was published previously, either
 in our electronic journal or elsewhere.</item><item><hi rend="underline">Essay
 itself</hi>: these could be lengthy essays and require full formatting with
 footnotes, etc. </item></list></hi></item><item><hi rend="underline">Essay
 #2</hi>: etc...</item></list></div6></div5></div4><div4 n="" type=""><head>Text
 Intertextuality Files</head><p>This template will be for recording unacknowledged
 sharing of passages between this <hi>Collected Tantras of the Ancients</hi>
 text and any other text. </p><list n="1" rend="1"><item><hi rend="underline">Tibetan
 parallel passage source</hi></item><item><hi rend="underline">ID of source</hi></item><item><hi
rend="underline">Sigla/ID</hi> of passage as constituting independent edition:
 we need to identify these with IDs in order to refer to them in the critical
 edition as necessary. These ID<ent value="rsquo"></ent>s will be assigned
to the  texts in accordance with rules governing ID<ent value="rsquo"></ent>s
assigned  to individual texts not located within broader canonical collections. </item><item><hi
rend="underline">Author of Tibetan parallel passage source</hi></item><item><hi
rend="underline">Date of Tibetan parallel passage source</hi></item><item><hi
rend="underline">Publishing details</hi>: this will be a link off to its entry
 elsewhere in the Tibetan Literature Archive.</item><item><hi rend="underline">Pagination
 in Tibetan parallel passage source</hi></item><item><hi rend="underline">Pagination
 in original</hi></item><item><hi rend="underline">Tibetan of parallel passage</hi></item><item><hi
rend="underline">Tibetan of original</hi></item><item><hi rend="underline">Translation
 of Tibetan parallel passage</hi></item><item><hi rend="underline">Translation
 of Tibetan original</hi></item><item><hi rend="underline">Analysis of intertextuality</hi></item></list></div4></div3></div2><div2
id="EWTS" type="section"><head>Extended Wylie Transliteration Scheme</head><div3><head>Introduction</head><q
rend="indent" type="prose quote"> In view of the increasing interest in Tibetan
 studies, it is desirable now more than ever that serious consideration be
 given to the acceptance of a standard system of Tibetan transcription. It
 is time to trade transcriptional independence for uniformity in order to
facilitate  and standardize the advancement of Tibetan studies.<note n="1"
type="footnote" resp="ndg">Turrell Wylie, <ent value="ldquo"></ent>A Standard
System of Tibetan  Transcription,<ent value="rdquo"></ent> <hi rend="italics">Harvard
Journal of Asiatic  Studies</hi>, 22 (1959), 263.</note></q><p>These are the
words of Turrell  Wylie in 1959, proponent and co-author of what is arguably
the most widely  used Tibetan transliteration system to date. With the burgeoning
of digital  technology in humanities and the recent surge of scholarly interest
in Tibetan  studies, the need for a standard transliteration system is even
more compelling  at present. However, Wylie<ent value="rsquo"></ent>s own
scheme is not comprehensive,  and incommensurable extensions of it have proliferated
in the digital world.  While the international standards community (<ent value="ldquo"></ent>ISO<ent
value="rdquo"></ent>)  has established standards for the transliteration of
other languages using  non-Roman alphabets, no such standard has been established
for Tibetan, either  de facto or ISO-approved. We thus feel the time has come
to establish such  an initiative in order to promote a standard format for
information exchange  dealing with Tibetan texts and language. 	</p><p>Wylie<ent
value="rsquo"></ent>s  fundamental argument was that a Tibetan transcription
system <ent value="ldquo"></ent>should  be of minimal complexity and capable
of reproduction on a standard typewriter,  i.e. one lacking special keys for
diacritical marks.<ent value="rdquo"></ent>   His system succeeds in avoiding
the use of diacritical marks and capitalization  employed by other systems,
and has been widely adopted by many scholars since  its introduction. For
this reason, it seems clear that Wylie<ent value="rsquo"></ent>s  scheme should
be the basis for any proposed standard. Some within the PRC  have modified
Wylie<ent value="rsquo"></ent>s system by using v and x respectively  to transcribe
the a-chung (23rd letter) and the a-chen (30th letter). We can  see no compelling
reason for such an alteration. Our system is geared towards  a disambiguated
transliteration required for digital processing and aims to  make as little
change as possible to already extant scholarly practices in  most countries,
especially those where transliteration is most commonly used  to represent
Tibetan. We have not been persuaded by arguments that the a-chung  and a-chen
should be represented by <ent value="rsquo"></ent>v<ent value="rsquo"></ent>
 and <ent value="rsquo"></ent>x<ent value="rsquo"></ent> since they are treated
as consonants  in most Tibetan grammars. Firstly, the apostrophe used to represent
a-chung  in Wylie is not a vowel; secondly, the status of a-chung and a-chen
in Tibetan  grammar are controversial; and thirdly, while Wylie is not a phonetic
system,  it does have the advantage of using reasonable phonetic approximations
of  Tibetan letters, and <ent value="rsquo"></ent>v<ent value="rsquo"></ent>
and <ent value="rsquo"></ent>x<ent value="rsquo"></ent> would completely contradict
that principle.  	</p><p>Though  we have adopted Wylie<ent value="rsquo"></ent>s
basic scheme, there are several  fundamental gaps in the Wylie transliteration
scheme that need to be addressed,  if one is to develop a comprehensive standard.
These are: <list n="1" rend="1"><item>The  ability to represent Tibetan transliterations
of Sanskrit characters not normally  found in the Tibetan language, </item><item>The
ability to represent unusual  stacks of Tibetan characters in the transliteration
of mantras and the like,  and </item><item>The ability to represent various
Tibetan punctuation marks  not covered by Wylie. </item></list></p><p>Various
solutions have been proposed,  though there has been no consensus. The following
presents an unambiguous  solution to these problems, with an eye toward digitally
processing Tibetan  documents as well as orthographic considerations. This
is a provisional proposal,  and we plan to systematically revise it as necessary
in coordination with  critical feedback from Tibetan scholars and Tibetan
computing experts from  around the world. Our goal is establishing an internationally
accepted standard  that is both used by scholars and computing experts, and
approved by the relevant  international standards organization. Wylie<ent
value="rsquo"></ent>s basic scheme,  adopted for transliterating the consonant
and vowels, is outlined in the tables  below. What follows is an explanation
for the proposed additions to Wylie<ent value="rsquo"></ent>s system. </p><p>It
should be noted that a proposed transliteration  scheme must serve the needs
of print publication and electronic publication.  Print publications most
typically use transliteration for references to terms  or names, as well as
for citations of short passages; less frequently, an  entire text may be given
in transliteration rather than Tibetan script. Electronic  publications differ
from print publications in having the added flexibility  of being able to
store material in transliteration, but then deliver it for  display in various
modes, such as in a different transliteration system (such  as one with diacritics),
in Tibetan script, in a phonetic rendering and so  on. For the most part,
the needs of print and digital publication are identical  when it comes to
a transliteration system, with the latter simply adding more  stringent requirements. </p><p>However,
there is one potential problem with  regards to the transliteration system
proposed here, namely the use of capital  letters and <ent value="ldquo"></ent>plus<ent
value="rdquo"></ent> signs as detailed  below, which are vital to articulate
a comprehensive transliteration system  beyond the basic scheme now commonly
used. While not formally part of the  Wylie system, many authors have taken
to using a capital letter to indicate  a place or personal name. In print
publications authors could continue to  use capital letters to mark place
and person names as an <hi rend="italics">informal</hi>  practice<ent value="mdash"></ent>which
is its current status within Wylie<ent value="mdash"></ent>but  this could
not be done in data intended to be used in computer contexts. Additionally,
 in the scheme proposed here, the capital letters and plus signs can look
odd  visually to people accustomed to the basic Wylie scheme. In this regards,
 we should note that the capital letters and plus signs are used to represent
 special words, such as Sanskrit transliterations, and thus will not typically
 appear in standard Tibetan passages.</p></div3><div3 type="section"><head>Tibetan
 Transliteration of Sanskrit</head><p>Because of the great importance of translation
 of Indian Buddhist scripture in Tibetan religious culture from the eighth
 century onwards, Tibetans developed a method for accurately transliterating
 Sanskrit using Tibetan orthography. However, to do so they had to make modifications
 to their glyphs and writing conventions. There were two basic problems. First,
 Sanskrit contains a whole series of characters not found in Tibetan, mainly
 the retroflex characters but also the <hi>anusvara</hi> and <hi>visarga</hi>,
 to name a few. The last two were dealt with by adopting the Sanskrit ligatures<ent
value="mdash"></ent>a circle above the character and a colon-like glyph at
the  end of a word, respectively. For the retroflex characters, Tibetans took
their  closest relative, generally the dentals with one exception, and reversed
them.  The second problem was that Sanskrit contains consonant combinations
that  do not conventionally occur in Tibetan. This was easily solved by stacking
 the consonants with the first on top and the last on the bottom, though such
 combinations often transgressed the traditional grammatical rules for creating 
stacks in Tibetan. While Wylie<ent value="rsquo"></ent>s system can account
for  the abnormal stacks by just stringing the consonants together, it presents
 no way to </p><p>In the proposed system, these retroflex characters are handled
 in a way similar to the Tibetan method, namely by capitalization of the corresponding
 dental. There are five main retroflex characters, represented below with
their  diacritical transliteration: </p><p><xptr n="retros" type="jpg"></xptr></p><p>Even
 with the computer, diacritical marks are hard to type, requiring several
keystrokes  for one character. Yet, if a simple, unambiguous transliteration
is adopted,  the computer <hi rend="italics">can</hi> easily be programmed
to represent  those characters with diacritical marks if so desired, as well
as correspondingly  represent them with the correct Tibetan glyph. Thus, it
is proposed that for  these characters, the capital forms of their corresponding
letters be used,  rendering them: </p><p><table rend="center"><row><cell>Ta</cell><cell>Tha</cell><cell>Da</cell><cell>Na</cell><cell>Sha</cell></row
></table></p><p>This would of course preclude using capitals to distinguish
proper names  or root letters. We believe there is little practical benefit
in distinguishing  root letters that way, as anyone who can read Tibetan can
easily pick out  the root letters. Capitalization of proper names could be
allowed when imbedded  in English for conformity to English practice, but
should not be used for  strict transliteration. For the sake of consistency
it is proposed that the <hi rend="italics">anusvara</hi> and <hi rend="italics">visarga</hi>
be handled  in a similar way through capitalization: </p><p><table><row><cell>M</cell><cell>H</cell></row></table></p><p>
 While the objection could be raised that interspersed capital letters are
 difficult to read, the purpose here is to develop an unambiguous and simple
 transliteration system as Wylie proposed. In print publications, diacritics
 could still be used, as they are today. However, with digital technology,
 it is desirable to have the transliterated text easily transferable between
 machines and platforms. The use of diacritics, which are font specific, hinders
 this, whereas the use of capitals does not. Conversely, conversion from our 
extended Wylie system to a diacritic font would be relatively straightforward,
 as long as the transliteration system is, like this one, disambiguated. 
 </p><p>Non-standard  Tibetan stacks found in Tibetan transliteration of Sanskrit
also present a  problem for transliteration. These stacks can be represented
in Wylie<ent value="rsquo"></ent>s system by merely stringing consonants together.
However,  some of these Sanskrit stacks can be confused for letter combinations
found  in standard Tibetan. For instance, there is a Sanskrit stack consisting
of  an <hi rend="italics">n</hi> over a <hi rend="italics">y</hi>. Unless
provision  is made for distinguishing this combination, it would naturally
be confused  with the eighth Tibetan letter, nya. For this situation, we suggest
the use  of the plus sign (+) between all non-standard stack letters found
in Tibetan  transliteration of Sanskrit. For instance there are aspirated
letters in Sanskrit  not found in Tibetan, such as <hi rend="italics">dha</hi>, <hi
rend="italics">gha</hi>,  and <hi rend="italics">bha</hi>. These would be
transliterated as <hi rend="italics">d+ha</hi>, <hi rend="italics">g+ha</hi>,
and <hi rend="italics">b+ha</hi> respectively. We  have, however, retained
the use of the period between a <hi rend="italics">g</hi>  prefixed to a <hi
rend="italics">y</hi> to distinguish such a combination  from the stack <hi
rend="italics">gy</hi>, because this is an already well-established  practice.
Thus, the Tibetan words for right and left are transliterated <hi rend="italics">g.yon</hi>
and <hi rend="italics">g.yas</hi>, whereas the word  for ornament would be <hi
rend="italics">rgyan</hi>.</p></div3><div3 type="section"><head>Punctuation</head><p>The
 other major omission of the Wylie scheme lies in its limited coverage of
punctuation.  This system adopts the basic policy, as does Wylie, of using
a space for a <hi rend="italics">tsheg</hi>, the intersyllablic dot, with
the special exception  of using an asterisk (*) for a non-breaking <hi>tsheg</hi>,
which will not allow a  line break to occur after it. In other Wylie-based
transliteration systems  that use a space for a <hi rend="italics">tsheg</hi>,
there is no way to represent  the white space that can actually occur in Tibetan
texts. To disambiguate  this situation, the underscore (_) is proposed as
an equivalent to Tibetan  white space.<note n="2" type="footnote" resp="ndg">It
would be up to software  manufactures and encoders to determine whether the
whitespace character had  a fixed or variable size, depending on the degree
of accuracy desired in representing  Tibetan texts digitally.</note>  Besides
the <hi rend="italics">tsheg</hi>,  Wylie presents no policy concerning punctuation
in his article. Yet, Tibetan  has as much punctuation as English, if not more.
It is therefore necessary  to develop a standard for transcribing these unpronounced
marks. The common  convention of using the slash (/) for a <hi rend="italics">shad</hi>,
the  vertical line ending a phrase, has been adopted. The remaining punctuation
 marks are less frequent, though the attempt has been made to represent them
 with standard Latin-I character set glyphs that are, if possible, somewhat
 similar in form to the corresponding Tibetan glyph. Thus, the <hi rend="italics">tsheg
 shad</hi>, a line with a dot above it, is represented by the semicolon (;).
 The <hi rend="italics">rin chen spungs shad</hi>, a <hi rend="italics">shad</hi>
 with three dots above it, is represented by the vertical bar (|). The <hi
rend="italics">sbrul shad</hi> is represented by an exclamation mark (!),
 and the <hi rend="italics">gter shad</hi> represented by the colon (:). Although
 it could be argued that the colon is more appropriate for the <hi rend="italics">visarga</hi>,
 as they are almost identical in shape, the counter argument is that the <hi
rend="italics">gter shad</hi> not only is similar in form to a colon but they
 are both punctuation marks (albeit with different functions), whereas the <hi
rend="italics">visarga</hi> represents an aspiration at the end of the word.
 Given the method of dealing with transliterated Sanskrit above, <hi>visarga</hi>
 is best represented by a capital H. The final punctuation mark to mention
 is the <hi rend="italics">yig mgo</hi> mark, which has two forms <hi rend="italics">mdun
 ma</hi> (front) and <hi rend="italics">sgab ma</hi> (back) respectively.
The  first is represented by a similar looking glyph, the at-sign (@), while
the  second is represented by the next symbol over on the keyboard, the pound
sign  (#) for ease of typing. This covers the major punctuation marks; others
are  listed in the punctuation chart below.  </p><p>	Not all the punctuation
marks  in the chart below have transliteration equivalents. This is because
there  are not enough characters on the standard keyboard to correspond to
every  character that is found in the <ref n="http://www.unicode.org/charts/PDF/U0F00.pdf"
type="hyperlink" resp="ndg">Unicode 3.0 Tibetan character set</ref>. Besides,
 there should be a secondary way of transcribing a character as some keyboards
 may lack certain keys such as the dollar-sign and so forth. To this end,
it  is proposed that the use of an <ent value="ldquo"></ent>escape<ent value="rdquo"></ent>
 character is standardized. The <ent value="ldquo"></ent>escape<ent value="rdquo"></ent>
 character would be inserted before the desired character<ent value="rsquo"></ent>s
 Unicode value, just as one can now use the alt key to type in a decimal code
 for a character. This can be done with any character in the Tibetan character
 set. The proposed escape character is the backslash (\) and the code should
 be the Unicode 3.0 Tibetan character code. Thus, the <hi rend="italics">rgya
 gram shad</hi>, (transliterated by a left bracket: ]) could also be encoded
 by typing a back slash followed by its Unicode hexadecimal equivalent: \0F12.
 The astrological sign <hi>sgra gcan char rtags</hi>, lacking a transliteration
 equivalent, will have to be entered: \0F17. As the Unicode is unambiguous
 and always four characters long, there is no need for a closing marker. Because
 the consonants, vowels, Sanskrit-derived characters, and punctuation marks
 make use of every available key on the standard keyboard, such escape codes
 will have to be used for the other Tibetan characters in Unicode 3.0 (Chart
 VI) and later releases. These escape codes could be used for other character
 sets as well, such as Devan<ent value="amacr"></ent>gari or Chinese, which
may  be interspersed with Tibetan. This accounts for all the Unicode 3.0 character
 set; however, some provision needs to be made to deal with punctuation not
 included in that set. The last chart below (Chart VII) includes some marks
 found in Tibetan Computer Company<ent value="rsquo"></ent>s Tibetan Machine
font  that do not appear to be included in the Unicode standard for Tibetan. </p></div3><div3
type="section"><head>Proposed Solution to be Vetted</head><p>The Extended
 Wylie Transliteration Scheme thus proposed covers all the various possible
 letter combinations found in Tibetan literature in an unambiguous way. For
 most situations, Wylie<ent value="rsquo"></ent>s basic system will suffice.
So,  it has been left in tact here. However, if the more unusual letter combinations
 are found, they can be easily and clearly transliterated using the above
system.  The following charts give the complete proposed Extended Wylie system
of transliteration  as described above. There are seven tables: </p><p><list
n="1" rend="1"><item><ref target="TibCons">Consonants</ref></item><item><ref
target="TibVows">Vowels</ref></item><item><ref target="TibNums">Numerals</ref></item><item><ref
target="TibSkt">Sanskrit  letters</ref></item><item><ref target="TibPunc">Punctuation</ref></item><item><ref
target="TibOther">Other Tibetan characters found in Unicode 3.0</ref></item><item><ref
target="TibNotUni">Characters not found in Unicode 3.0</ref></item></list></p><p>The
 transliteration of all characters, except the vowels, includes the short
a  sound implicit in all Tibetan characters. This sound is only pronounced
when  that character is the root character of the syllable (<hi rend="italics">tsheg
 bar</hi>), but not when it is a prefix, superscript, subscript, or suffix.
 The same is true in transliteration, such letters dropping their implicit
 a. The vowels are written in their common stand-alone form using the a-chen
 as their base, though they of course can be appended to any root letter or
 stack.<note n="4" type="footnote" resp="ndg">Except in the case of the Sanskrit
 semi-vowels, <hi rend="italics">Ri</hi>/<hi rend="italics">RI</hi> and <hi
rend="italics">Li</hi>/<hi rend="italics">LI</hi>. These are given in combination
 with the consonants <hi rend="italics">ha</hi> and <hi rend="italics">ka</hi>
 respectively with which they are in more natural combination.</note>  As
with  other Sanskrit derivatives, the long simple vowels are represented by
capitals,  while complex vowels are represented by the corresponding diphthong. </p><p>
 For each character in each of the charts, the Tibetan glyph is given in the
 middle of the cell, the proposed transliteration is given immediately below
 it in boldface. To the left of that is the main Unicode character code, and
 to the right, if there is a number, is the Unicode character code for the
 alternate subscribed version of that character.</p><div4 id="TibCons"><head>Tibetan
 Consonants</head><p><xptr n="cons" type="jpg" resp="ndg"></xptr><xptr n="notesc"
type="jpg" resp="ndg"></xptr></p></div4><div4 id="TibVows"><head>Tibetan Vowels</head><p><xptr
n="vowls" rend="" type="jpg" resp="ndg"></xptr><xptr n="notesv" type="jpg"
resp="ndg"></xptr></p></div4><div4 id="TibNums"><head>Tibetan Numerals</head><p><xptr
n="nums" type="jpg" resp="ndg"></xptr></p></div4><div4 id="TibSkt"><head>Tibetan
Transliteration of Sanskrit</head><p><xptr n="sansk" type="jpg" resp="ndg"></xptr></p></div4><div4
id="TibPunc"><head>Tibetan Punctuation</head><p><xptr n="punct" type="jpg"
resp="ndg"></xptr><xptr n="notesp" type="jpg" resp="ndg"></xptr></p></div4><div4
id="TibOther"><head>Other Tibetan Unicode 3.0 Characters</head><p><xptr n="other"
type="jpg" resp="ndg"></xptr></p></div4><div4 id="TibNotUni"><head>Tibetan
Characters  Not Found in Unicode 3.0</head><p><xptr n="othnot" type="jpg"
resp="ndg"></xptr></p></div4></div3></div2><div2><head>Fonts</head><p>In 
order to enhance both the project<ent value="rsquo"></ent>s presentation and
accessibility,  there needs to be a standard set of fonts that is used by
the Library and  is available to its users. The issues concerning Unicode
Tibetan fonts have  been discussed <ref target="TibFontsGen">above</ref>.
There is at present  no implemented Unicode Tibetan font available for any
platform. Some recent  Windows fonts contain the Unicode Tibetan characters
in the correct codespace,  but the software support is non-existent. The most
aesthetically pleasing  Tibetan fonts use the ASCII character codes and have
not yet been converted  into Unicode. There are even greater difficulties
encountered in other operating  systems, especially the Mac. </p><p>An open-type
Unicode Tibetan font with  corresponding software support would rectify most
of these problems. This  font would need to be a public-domain font that could
be available to all  of the users of the Library. With the increasing publicity
of Tibet and Tibetan  Studies, and with the consistent advancement of technology,
the prospect of  Unicode Tibetan in the near future seems quite likely.</p><p>A
further need  is to have a Unicode Extended Latin Character set font that
contains all the  necessary diacritic marks for displaying translation, transliteration,
and  articles about Tibet. At present, several of these exist for the Windows
system  as public domain fonts, but none of them have a corresponding Macintosh
version.  The Library is in the process of deciding on a Unicode Extended
Latin font  to use with all of its collections and grappling with the issues
of displaying  both the Tibetan and the Latin fonts over the web within the
various software  we use, such as Flash, Dynaweb, and so forth.</p></div2><div2
type="section"><head>Doxography</head><p>The  doxographical classification
of texts will be the organizing rubric for a  Master Catalog which will contain
a bibliographic reference to every unique  work found in any edition of a
religious canon, such as <hi rend="italics">The  Collected Tantras of the
Ancients</hi>. As catalogs for other collections  of religious works are entered,
other doxographical classification schemes  will have to be standardized.
For now, the Tibetan Literature Archive aspect of the  Library is focusing
on two collections, The Samantabhadra Collection (i.e.,<hi rend="italics">The
Collected Tantras of the Ancients</hi>) and the B<ent value="ouml"></ent>n-po
 Collection. Therefore, the doxographical categories for these two schools
 of thought need to be standardized. This is especially important for The
Samantabhadra  Collection, as this is an explicit collection of Tantric scriptures
with a  definite doxographical hierarchy. </p><div3><head>Doxographical Classification
 Scheme for <hi rend="italics">The Collected Tantras of the Ancients</hi></head><list
n="1" rend="I"><item>Atiyoga<list n="1" rend="1"><item><hi rend="underline">Mind
 Series (<hi rend="italics">sems sde</hi>)</hi><list n="1" rend="i"><item><hi
rend="underline">The Eighteen Mind Series Texts</hi></item><item><hi rend="underline">The
 All Creating Cycle (<hi rend="italics">kun byed skor</hi>)</hi></item><item> <hi
rend="underline">Miscellaneous</hi></item></list></item><item><hi rend="underline">Space
 Series (<hi rend="italics">klong sde</hi>)</hi></item><item><hi rend="underline">Experiential
 Precept Series (<hi rend="italics">man ngag sde</hi>)</hi><list n="1" rend="i"><item><hi
rend="underline">External and internal cycles (<hi rend="italics">phyi nang
 skor</hi>)</hi></item><item><hi rend="underline">Esoteric Cycles (<hi rend="italics">gsang
 skor</hi>)</hi></item><item><hi rend="underline">Seminal Heart (<hi rend="italics">snying
 thig</hi>)</hi></item><item><hi rend="underline">Miscellaneous</hi></item></list></item><item><hi
rend="underline">Crown Pith (<hi rend="italics">spyi ti</hi>)</hi></item><item><hi
rend="underline">Ultra Pith (<hi rend="italics">yang ti</hi>)</hi><list n="1"
rend="i"><item><hi rend="underline">Brahmin cycles (<hi rend="italics">bram
 ze<ent value="rsquo"></ent>i skor</hi>)</hi></item><item><hi rend="underline">Padma-related</hi></item><item><hi
rend="underline">Miscellaneous</hi></item></list></item><item><hi rend="underline">Miscellaneous</hi></item></list></item><item>Anuyoga<list
n="1" rend="1"><item><hi rend="underline">The four root s<ent value="umacr"></ent>tras
 (<hi rend="italics">rtsa ba<ent value="rsquo"></ent>i mdo bzhi</hi>)</hi></item><item><hi
rend="underline">The six tantras clarifying the six limits (<hi rend="italics">mtha<ent
value="rsquo"></ent> drug gsal bar byed pa<ent value="rsquo"></ent>i rgyud
drug</hi>)</hi></item><item><hi rend="underline">The twelve rare tantras (<hi
rend="italics">dkon rgyud</hi>)</hi></item><item><hi rend="underline">Miscellaneous</hi></item></list></item><item>Mah<ent
value="amacr"></ent>yoga<list n="1" rend="1"><item><hi rend="underline">Tantra
Series (<hi rend="italics">rgyud  sde</hi>)</hi><list n="1" rend="i"><item><hi
rend="underline">The eightfold  set of root <hi rend="italics">Magical Emanation
Tantras</hi> (<hi rend="italics">rtsa  bar gyur sgyu <ent value="rsquo"></ent>phrul
sde brgyad</hi>)</hi></item><item><hi rend="underline">The eighteenfold set
of explanatory tantras (<hi rend="italics">bshad  pa dang cha mthun gyi rgyud
sde bco brgyad</hi>)</hi><list n="1" rend="1"><item><hi rend="underline">Enlightened
Body (<hi rend="italics">sku</hi>)</hi></item><item><hi rend="underline">Enlightened
Speech (<hi rend="italics">gsung</hi>)</hi></item><item><hi rend="underline">Enlightened
Mind (<hi rend="italics">thugs</hi>)</hi></item><item><hi rend="underline">Enlightened
Qualities (<hi rend="italics">yon tan</hi>)</hi></item><item><hi rend="underline">Enlightened
Activities (<hi rend="italics"><ent value="rsquo"></ent>phrin  las</hi>)</hi></item></list></item><item><hi
rend="underline">Miscellaneous</hi></item></list></item><item><hi rend="underline">Practice
Series of the Eight Proclamation Deities (<hi rend="italics">sgrub  sde bka<ent
value="rsquo"></ent> brgyad</hi>)</hi><list n="1" rend="i"><item><hi rend="underline">The
Practice Series (<hi rend="italics">sgrub sde</hi>)</hi><list n="1" rend="1"><item><hi
rend="underline">Summary of the Highest Intention  (<hi rend="italics">bla
ma dgongs pa <ent value="rsquo"></ent>dus pa</hi>)</hi></item><item><hi rend="underline">Consortium
of Sugatas (<hi rend="italics">bde gshegs <ent value="rsquo"></ent>dus pa</hi>)</hi></item><item><hi
rend="underline">Miscellaneous</hi></item></list></item><item><hi rend="underline">The
Eight Proclamation Deities (<hi rend="italics">bka<ent value="rsquo"></ent>
brgyad</hi>)</hi><list n="1" rend="1"><item><hi rend="underline">The  Ma<ent
value="ntilde"></ent>ju<ent value="sacute"></ent>ri Cycle on Enlightened Form
(<hi rend="italics"><ent value="rsquo"></ent>jam dpal sku<ent value="rsquo"></ent>i
skor</hi>)</hi></item><item><hi rend="underline">The Lotus Tantras on Enlightened
Communication (<hi rend="italics">pad  ma gsung gi rgyud</hi>)</hi></item><item><hi
rend="underline">The Real Tantras  on Enlightened Mind (<hi rend="italics">yang
dag thugs kyi rgyud</hi>)</hi></item><item><hi rend="underline">The Nectar
Tantras on Enlightened Qualities (<hi rend="italics">bdud  rtsi yon tan gyi
rgyud</hi>)</hi></item><item><hi rend="underline">The Sacred  Dagger Cycle
on Enlightened Activities (<hi rend="italics"><ent value="rsquo"></ent>phrin
 las phur pa<ent value="rsquo"></ent>i skor</hi>)</hi></item><item><hi rend="underline">The
 Cycle on Invoking the Fierce Ma-mo Deities (<hi rend="italics">ma mo rbod
 gtong skor</hi>)</hi></item><item><hi rend="underline">Offerings and Praises
 to Protect the Teachings (<hi rend="italics">bstan srung mchod bstod</hi>)</hi></item><item><hi
rend="underline">The Cycle on Fierce Mantras (<hi rend="italics">drag sngags
 skor</hi>)</hi></item><item><hi rend="underline">Miscellaneous</hi></item></list></item></list></item><item><hi
rend="underline">Miscellaneous</hi></item></list></item></list></div3></div2><div2
type="section"><head>Terminology</head><p>This refers to the varying types
 of standardized terminology employed in the Tibetan Literature Archive. It
is essentially  also a standard but is separated off here as a category because
of its importance  and potential size. There are two basic levels at which
terminology is of  prime importance. These are translation and categorization.
Both of these  are covered by the Encyclopedia of Tibet, which is to be one
part of the Textual  Collections. The Encyclopedia will contain Tibetan terms,
translations, and  descriptive entries as well as references concerning each
item. Thus, it includes  both translation and categorization.</p><div3><head>Translation
Terms</head><p>For  the purpose of translating texts, audio recording, video
transcripts, and  so forth, a standard glossary of terms will be compiled
to aid translators  and to ensure a base-level of consistency among translations
by different  hands. The Library is already working on a terminological database
that will  be web-accessible that will cover the broad scope of definitions
and illustrations  available for much of the Tibetan language. There will
be fields for: </p><p><list n="1" rend="1"><item>Basic term</item><item>Linguistic
etymology</item><item>Glosses</item><item>English  translations</item><item>Sanskrit
equivalents</item><item>Chinese equivalents</item><item>Other  equivalents</item><item>Numerical
index</item><item>Grammatical index</item><item>Related  keywords</item><item>Definitions</item><item>Passages</item><item>Global
comments</item></list></p><p>Readers  with sufficient credentials will be
able to submit comments, signed and dated,  to add further information, draw
attention to errata, or contest an interpretation.  The electronic interface
for this is in the process of being constructed,  and a significant body of
terms is being amassed.</p></div3><div3><head>Categorical  Terminology</head><p>The
other type of terms employed in the Library are of  a categorical nature in
reference to linguistic and bibliographic information.  In the process of
cataloging the mTshams brag edition of <hi rend="italics">The  Collected Tantras
of the Ancients</hi> it became apparent that we needed to  standardize the
use of certain terms and phrases in the descriptions of the  texts and define
these uses to provide the clearest possible description. </p><div4><head>Literary
 Genre</head><p>As with any body of literature, the analysis of Tibetan literary
 genres is complex, and among areas of sophisticated literary criticism, Tibetan
 Studies is still in its infancy. A number of genres have been identified,
 but many remain obscured by unclear or incomplete definitions. This is not
 a task for a single institution or individual, but rather, a typology of
genres  should be agreed upon by a large number of respected scholars. A great
deal  of work thus remains to be done. However, to promote discourse on this
topic,  we have initiated a <hi rend="bold">provisional</hi> list of genre
types.  These are organized under general headings. Our hope is that these
categories  will be further refined with scholarly input to produce a tree-like
hierarchy  of literary genres. Below is a simple list of these categories
and their sub-rubrics.  Ultimately, this will be elaborated with short descriptions
of each genre  including quotations from relevant examples.</p><p><list n="1"
rend="I"><item>Buddha-voiced<list n="1" rend="1"><item> <hi rend="italics"></hi>rgyud </item><item><hi
rend="italics">mdo </hi></item></list></item><item>Literary<list n="1" rend="1"><item>Songs
(<hi rend="italics">mgur </hi>)</item><item>Contemporary  novels</item><item>Poetry</item></list></item><item>Doctrinal<list
n="1" rend="1"><item>Psychology  (	<hi rend="italics">blo rig </hi>)</item><item>Tenets
(<hi rend="italics">grub  mtha<ent value="rsquo"></ent> </hi>)</item><item>Stages
of the Path (<hi rend="italics">lam  rim </hi>)</item><item>Grounds and Paths
(<hi rend="italics">sa lam </hi>)</item><item>The  Nine Vehicles (<hi rend="italics">theg
pa rim pa dgu </hi>)</item></list></item><item>Prayers  and Liturgy<list n="1"
rend="1"><item>Supplication to the Lineage of Teachers  (<hi rend="italics">la
ma brgyud pa<ent value="rsquo"></ent>i gsol <ent value="rsquo"></ent>debs </hi>)</item><item>Dedication
 (<hi rend="italics">bsngo ba </hi>)</item><item> <hi rend="italics">yang
srid  mgyur byon </hi></item><item><hi rend="italics">par byang smon tshig </hi></item></list></item><item>Ritual-yogic <list
n="1" rend="1"><item><hi rend="italics">chab shog </hi></item><item><hi rend="italics">tshe
 sgrub</hi></item><item><hi rend="italics">gtor chog</hi></item><item><hi
rend="italics">bsangs <ent value="rsquo"></ent>pho ba</hi></item><item><hi
rend="italics">g.yang <ent value="rsquo"></ent>gugs</hi></item><item><hi rend="italics">sbyin
sreg </hi></item><item><hi rend="italics">sgrub thabs </hi></item><item><hi
rend="italics">bum sgrub </hi></item><item><hi rend="italics">mandala bshad
 pa </hi></item><item><hi rend="italics">dkyil <ent value="rsquo"></ent>khor
dbang  chog </hi></item><item><hi rend="italics">bla ma<ent value="rsquo"></ent>i
rnal <ent value="rsquo"></ent>byor </hi></item><item><hi rend="italics">rjes
gnang </hi></item></list></item><item>  Places, Institutions, and Catalogs<list
n="1" rend="1"><item><hi rend="italics">gnas  yig</hi></item><item><hi rend="italics">dkar
chag</hi></item></list></item><item>Historical  and Biographical <list n="1"
rend="1"><item><hi rend="italics">lo rgyus</hi></item><item><hi rend="italics">rnam
thar</hi></item><item><hi rend="italics">rang gi rnam  thar</hi></item><item><hi
rend="italics">chos <ent value="rsquo"></ent>byung</hi></item><item><hi rend="italics">gdan
rabs</hi></item></list></item><item>Miscellaneous <list n="1" rend="1"><item><hi
rend="italics">btsod pa gsol <ent value="rsquo"></ent>debs  zhabs brtan sogs</hi></item><item><hi
rend="italics">bca<ent value="rsquo"></ent>  yig</hi></item><item><hi rend="italics">brtan
bzhugs</hi></item><item><hi rend="italics">gser skyems</hi></item><item><hi
rend="italics">mngon rtogs </hi></item><item><hi rend="italics">thang ka<ent
value="rsquo"></ent>i kha byang </hi></item><item><hi rend="italics">dris
lan </hi></item><item><hi rend="italics">bris yig </hi></item></list></item></list></p></div4><div4><head>Literary
 Form</head><p>Related to literary genre, there is the category of literary
 form, that is, the more general classification for the type of literature
 to which a text belongs. At present, there are three broad categories, though
 not much work has been done in regard to this scheme. This also is something
 that needs to be discussed with a wide range of Tibetologists. The categories
 are:</p><p></p><p><list n="1" rend="I"><item>Verse<list n="1" rend="1"><item>Syllabic
 Verse</item></list></item><item>Prose<list n="1" rend="1"><item>Narrative</item><item>Expository</item></list></item><item>Mixed
 Verse and Prose</item></list></p></div4><div4><head>Provenance Personae</head><p>One
 of the most robust mechanisms of the mark-up scheme used in the Tibetan Literature
Archive  is the ability to catalog various people<ent value="rsquo"></ent>s
roles in the  creation of a text. As mentioned above, determining the authorship
of a Tibetan  text, especially a religious one, is complicated by a number
of factors: the  speaker of a tantra is a divine figure, the text has been
written and hidden  by one person but was discovered and translated by another,
or the text was  redacted from an earlier more extensive version. To deal
with this level of  complexity, the Library has developed a special mark-up
scheme particularly  to cover Tibetan literature. Given the nature of SGML
and XML, this scheme  is flexible and can be modified or appended to in the
future. For the present,  we have enumerated the following persons involved
in the creation of a text:</p><p><list rend="u"><item>Author/Speaker</item><item>Audience</item><item>Translators</item><item>Redactor</item><item
>Concealer</item><item>Revealer</item><item>Requester</item><item>Editor</item><item>Scribe</item><item>Printer</item><item>Patron</item></list></p
><p>Along  these lines, there are also the following provisional lists of
related sub-categories:</p><p><list n="1" rend="I"><head>Author Types</head><item>Divine
Speaker<list n="1" rend="1"><head>Being  Types</head><item>Bodhisattva</item><item>Lha/lha
mo</item><item>mKha<ent value="rsquo"></ent> <ent value="rsquo"></ent>gro
ma</item><item>Chos skyong/srung ma</item><item>Adversary</item><item>Consort</item></list></item><item>Human
 Author<list n="" rend="u"><head>Characteristics</head><item>Ethnicity</item><item>Status<list
n="1" rend="1"><item>Rang sangs rgyas</item><item>Nyan thos</item><item>Lay
 Person</item><item>Miscellaneous</item></list></item><item>Reincarnational
 status</item><item>Emanational status</item><item>Affiliation</item></list></item></list></p><p><list
rend="u"><head>Audience Types</head><item>Requester of Text</item><item>Interlocutor</item><item>Simple
 Listener</item></list></p><p><list rend="u"><head>Translator Types</head><item>Indian
 Scholar (including persons from O-rgyan and Za-hor)</item><item>Nepalese
Scholar</item><item>Chinese  Scholar</item><item>Tibetan Translator</item></list></p><p><list
rend="u"><head>Redactor  Types</head><item>Original Redactor</item><item>Translation
Redactor</item><item>Revelation  Redactor</item><item>Modern Redactor</item></list></p><p><list
rend="u"><head>Transmission  Status</head><item><hi rend="italics">bka<ent
value="rsquo"></ent> ma</hi></item><item><hi rend="italics">gter ma</hi></item><item><hi
rend="italics">sa gter</hi></item><item><hi rend="italics">dgongs gter</hi></item><item><hi
rend="italics">yang gter</hi></item></list></p><p><list rend="u"><head>Editor
Types</head><item>Original Editor</item><item>Translation  Editor</item><item>Redaction
Editor</item><item>Revelation Editor</item></list></p><p>Institutional  and
sectarian affiliation is an important part of Tibet<ent value="rsquo"></ent>s
 religious culture. Furthermore, a person is often associated with a general
 movement that might cross sectarian lines. At the more detailed level, a
person  is often associated with a single monastery and a college within that
monastery.  All these affiliations are classified as a person<ent value="rsquo"></ent>s <ent
value="ldquo"></ent>organization<ent value="rdquo"></ent>. We have delineated
the following  sub-categories for this important classification:<list rend="u"><head>Organization
 Types</head><item>Movement<list><item>Therav<ent value="amacr"></ent>da</item><item>Great
 Vehicle</item><item>Tantra</item></list></item><item>Sect (subsects to be
 included but not shown)<list><item>rNying ma</item><item>Sa skya</item><item>bKa<ent
value="rsquo"></ent> brgyud</item><item>dGe lugs</item><item>B<ent value="ouml"></ent>n</item><item>Jo
 nang</item><item>bKa<ent value="rsquo"></ent> gdams</item><item>Ris med</item><item>Indeterminate</item><item>Unknown</item></list></item><item>Monastery</item
><item>Department</item><item>Political</item><item>Hereditary</item></list></p></div4><div4><head>Chapter-level
 Elements</head><p>As related above in the section on cataloging a Tibetan
 text, these works can have a rather elaborate structure. We have adopted
the  basic outline of the TEI (Text Encoding Initiative) scheme by dividing
a text<ent value="rsquo"></ent>s structure into front, body, and back. However,
the internal  sub-sections within these include uniquely Tibetan categories.
On the analogy  of the body of a text being composed of chapters, these sub-sections
for all  three major divisions are called <ent value="ldquo"></ent>Chapter-level
Elements<ent value="rdquo"></ent>, since they are components that occupy the
same level as a  chapter in the hierarchy. Of course, chapters themselves
are also chapter-level  elements. The unique structure of a Tibetan text has
impelled us to classify  the chapter-level elements according to their most
prominent content. The  various types of chapter-level elements for each of
the three major divisions  of a text are as follows:</p><p><list n="1" rend="I"><item>Front<list
n="1" rend="1"><item>Title page (<hi rend="italics">mtshan</hi>)</item><item>Title
 line</item><item>Homage/Invocation/Praise (<hi rend="italics">mchod par brjod
 pa</hi></item><item>Statement of Intent or Promise to Compose (<hi rend="italics">rtsom
 par dam bca<ent value="rsquo"></ent> ba</hi>)</item><item>Untitled introduction</item><item><ent
value="ldquo"></ent>Ordinary introductory scene<ent value="rdquo"></ent> (<hi
rend="italics">thun  mong gleng gzhi</hi>)</item><item><ent value="ldquo"></ent>Extraordinary
introductory  scene<ent value="rdquo"></ent> (<hi rend="italics">thon mong
ma yin pa<ent value="rsquo"></ent>i  gleng gzhi</hi>)</item><item>Outline
(<hi rend="italics">sa bcad</hi>)</item></list></item><item>Body<list n="1"
rend="1"><item>Section divisions</item><item>Chapters (may have the  following
parts:)<list n="1" rend="i"><item>Chapter title</item><item>Chapter  homage</item><item>Chapter
body</item><item>Chapter<ent value="rsquo"></ent>s colophon</item></list></item><item>Interstitial
 chapters (i.e., unnumbered sections found between sequentially numbered chapters)</item></list></item><item>Back <list
n="1" rend="1"><item>Closing section</item><item>Author<ent value="rsquo"></ent>s
 colophon</item><item>Redactor<ent value="rsquo"></ent>s colophon</item><item>Translator<ent
value="rsquo"></ent>s colophon</item><item>Lineage transmission</item><item>Reviser<ent
value="rsquo"></ent>s colophon</item><item>Editorial colophon</item><item>Scribal
 colophon</item><item>Printing colophon</item><item>Concluding prayer</item><item>Closing
 invocation</item><item>Undetermined colophon</item></list></item></list></p></div4><div4><head>Miscellaneous
 Cataloging Terminology</head><p>There are also some miscellaneous English
 terms and phrases that are included in the catalog to describe the state
 of certain categories of information. These are generally used when information
 is not supplied by the text in order to explicitly indicate such. They are
 for the most part self-explanatory but a list of these phrases is included
 here for completeness. They are:</p><p><list><item>Not specified <ent value="mdash"></ent>
 the generic phrase to indicate an absence of information in the text itself.</item><item>No
 title given <ent value="mdash"></ent> used when a chapter does not give a
text  title.</item><item>Untitled <ent value="mdash"></ent> used for chapters
without  titles.</item><item>Not found <ent value="mdash"></ent> when a text
is not found  in another catalog, such as Kaneko<ent value="rsquo"></ent>s
or the Taiwan index.</item><item>Undeciphered  ligature <ent value="mdash"></ent>
used when a title or some portion of the text  cannot be deciphered by the
editor.</item><item>Intra-canonical relation <ent value="mdash"></ent> the
relationship of the present text to other texts in the  same corpus, such
as a series of texts.</item></list></p></div4></div3></div2><div2 type="section"><head>Text
Identification Numbers</head><div3 n="" type=""><head>Overview</head><p>A
 canon consists not only of differing editions, but also of multiple texts
 within each edition. In order to refer to the different editions as well
as  to individual texts in an unambiguous way, we need a canonical scheme
of ID  numbers. Such a scheme must be stable and unchanging. ID numbers are
thus  crucial to the Tibetan Literature Archive in terms of identifying the
editions, texts  within editions, and passages within texts. These ID numbers
are instrumental  for scholarly reference, for navigation, for searches, and
for collation.  In addition to ID schemes for particular published editions,
our ID scheme  also deals with the purely electronic <ent value="ldquo"></ent>editions<ent
value="rdquo"></ent>  of our <ent value="ldquo"></ent>master<ent value="rdquo"></ent>
catalogs, which have  no printed correlate. </p><p>ID numbers will also be
assigned to individual  texts that are not part of larger collections. In
all of these abbreviation  schemes, we will have machine translations to spell
out each abbreviation  so that users are not required to remember the abbreviations.
In general,  sigla allow quick input and require limited space, two considerations
which  are not as relevant for electronic collections as for traditional books.
However,  there remain significant cases when space limitations apply for
an electronic  edition as well, such as in listing variant readings in a critical
edition.</p></div3><div3 n="" type=""><head>ID Numbers for <hi>The Collected
Tantras of the Ancients</hi>  (NGB)</head><p>The NGB ID numbers have three
components: (i) the identification  of a particular edition of the NGB (edition
ID); (ii) the identification of  a particular text within a given NGB edition
(text ID); and (iii) the identification  of a particular line within a given
text (passage ID). If the full ID is given,  it will begin with the edition
ID, continue with the text ID, and terminate  with the passage ID. However,
when working within a particular edition, or  a particular text, it would
be possible to leave out one or two components  of the ID<ent value="mdash"></ent>for
example just citing the passage ID when it  is understood what text is being
referred to.</p><div4 n="" type=""><head>Edition  IDs</head><p>To identify
a Collected Tantras of the Ancients edition, we are  using siglas composed
of a sequence of two letters to identify particular  editions, with a single
number in the third slot to specify a variant form  of an edition which has
only slight modifications. When there is a family  of editions, we use the
two letter sequence to represent the most commonly  available one plus an
added number to signify the others. Thus Tb signifies  the PL480-mTshams brag
reprint, while Tb2 signifies the original printing  that was based on. </p><p><list
n="1" rend="I"><head>Edition Sigla</head><item>Master  Catalog<p>The master
catalog will contain bibliographic records for all  texts found within any
of the various editions of <hi>The Collected Tantras  of the Ancients</hi>.
This will thus be a comprehensive digital catalog  collecting in a single
place all titles associated with <hi>The Collected  Tantras</hi>. There will
be cross-references to actual occurrences of the  texts in particular editions,
and the master catalog will also serve as  the entry point for accessing critical
editions of texts, when those become  available. As this will be a compilation,
the master edition sigla will refer  to doxographical categories rather than
a physical edition. The master catalog  will thus be divided into the three
major traditions included within <hi>The  Collected Tantras</hi> canon: Atiyoga,
Anuyoga, and Mah<ent value="amacr"></ent>yoga.  These will have the following
sigla:</p><list rend="u"><item>Ng1: Atiyoga</item><item>Ng2:  Anuyoga</item><item>Ng3:
Mah<ent value="amacr"></ent>yoga</item><item><p>In addition,  the Master ID
scheme will be further divided into doxographical sub-categories  with the
sub-category number separated from the sigla by a period. This can  proceed
down several levels. Thus, a valid Master ID number would be Ng1.3.2.4.  This
would be parsed:<list rend="u"><item>Ng1 specifies the Atiyoga section</item><item>3
 signifies the third doxographical category within Atiyoga (i.e. the <hi>Man
 ngag sde</hi> grouping), </item><item>2 signifies the second category in
the <hi>Man  ngag sde</hi> (i.e. the Secret Cycles or <hi rend="italics">gsang
skor</hi>)</item><item>4  signifies the fourth text within the Secret Cycles. </item></list></p><p>Thus,
 as new texts are discovered, they can be added into the appropriate category
 of texts at the end of the series. In addition, each of the three main sections
 will have a subgrouping labelled <ent value="ldquo"></ent>Miscellaneous<ent
value="rdquo"></ent>  to cover unclassified texts. </p><p>Following the same
principle, we also  specify an <ent value="ldquo"></ent>Edition Doxographical
ID<ent value="rdquo"></ent>  which would identify a text based on the its
doxographical classification  within that particular edition.</p></item></list></item><item>mTshams
brag<list rend="u"><item>Tb: mTshams brag reproduction by Dingo.</item><item>Tb1:
mTshams  brag reproduction by Dingo in the Taiwan Tripitika.</item><item>Tb2:
mTshams  brag original edition.</item></list></item><item>gTing skyes<list
rend="u"><item>Tk:  gTing skyes edition reproduction by Dingo.</item><item>Tk1:
gTing skyes edition  in Kaneko<ent value="rsquo"></ent>s catalog.</item><item>Tk2:
gTing skyes original  edition ( from <hi>dgon pa byang</hi>). {Was Tk photostatically
reproduced  without alteration?}</item></list></item><item>sDe dge <list rend="u"><item>Dg:
 contemporary sde dge reprint.</item><item>Dg1: sde dge edition in Kathok
Getsay<ent value="rsquo"></ent>s index.</item></list></item><item>sKyid grong <list
rend="u"><item>Kg:  sKyid grong complete edition.</item><item>Kg1: sKyid grong
incomplete edition.</item></list></item><item>rGyud  bcu bdun editions<list
rend="u"><item>Gc: Adzom Drukpa edition.</item><item>Gc1:  other editions.</item></list></item><item>Other
Editions<list rend="u"><item>Po:  pad ma <ent value="rsquo"></ent>od gling
edition in 'Jigs med gling pa's index. </item><item>Wa:  Waddell edition located
at Oxford and the British Museum.</item><item>Bg:  bai ro rgyud <ent value="rsquo"></ent>bum.</item></list></item></list></p></div4><div4
n="" type=""><head>Text IDs</head><p>Within each edition, the most important
 text identification number will be its sequential number within that edition.
 Thus, the three hundred and twelfth text in the mTshams brag edition reproduced
 by Dingo Khyentse Rinpoche (Tb) would have the unique ID number of Tb.312.
 The basic principle is that the component parts of an identification number
 will be separated by a period (.). This rule will be applied at all levels,
 even down to the sub-section or even line number as detailed below. The one 
obvious exception is in the case of any kind of <hi rend="italics">range</hi>.
 For these, the beginning and ending delimiters will be separated by a hyphen,
 so that for instance the range from line 3 to 7 would be represented by <ent
value="ldquo"></ent>3-7<ent value="rdquo"></ent>. (Of course, this would be
proceeded  by the text<ent value="rsquo"></ent>s own ID.)</p><p>However, there
is also a secondary  method for identifying a text in a multi-volume canon,
such as <hi>The Collected  Tantras of the Ancients</hi>. This is connected
with the physical form of  the edition, namely its division into volume. A
text<ent value="rsquo"></ent>s  location can be described according to what
volume it is in <hi rend="bold">and</hi>  its sequential number within that
volume. This is constructed on the same  principle, except that the volume
number is preceded by a lower case <ent value="ldquo"></ent>v<ent value="rdquo"></ent>.
All components of this description are,  as usual, separated by a period.
To take an example, the seventh text in the  fourteenth volume of the same
mTshams brag edition is entitled <hi rend="italics">stong  pa<ent value="rsquo"></ent>i
sgra/</hi>. While it has a main ID number of Tb.335,  meaning it is the 335th
text in that edition, it also has an alternative descriptive  number, describing
where it resides in which volume, namely Tb.v14.7.</p><p>Similarly,  a particular
page within a specific volume can also be referenced by this  means under
the same principle. Just as a <ent value="ldquo"></ent>v<ent value="rdquo"></ent>
 was inserted before a volume number in the previous example, an <ent value="ldquo"></ent>f<ent
value="rdquo"></ent> (for folio) can be inserted before a page number within
a  volume. Thus, the 556th page of volume 32 would be referred to as Tb.v32.f556.
 Folio, here, is counted as one side of the page, the page numbers being printed
 in Arabic numerals in the margins of the mTshams brag edition. This kind
of  reference, however, is not in any way connected to a text ID but is merely
 a description of the physical placement of a page within a volume. Its use
 is therefore discouraged.</p></div4><div4 n="" type="subsection"><head>Passage
 IDs</head><div5><head>Methodology</head><p>The most complex element of the
 ID numbers is the passage ID, which pinpoints a particular passage <hi rend="bold">within</hi>
 a text with a unique identification number. To refer within texts, it is
necessary  to base the ID on internal components of the text itself. Our passage
ID has  two components: the chapter-level ID and the line ID.</p><list n="1"
rend="i"><item><hi rend="underline">The chapter-level ID</hi>: most commonly,
there are three  structural divisions of a Tibetan text: front, body, and
back. Our passage  ID system labels these as <ent value="ldquo"></ent>a<ent
value="rdquo"></ent> (front), <ent value="ldquo"></ent>b<ent value="rdquo"></ent>
(body) and <ent value="ldquo"></ent>c<ent value="rdquo"></ent> (back). We
have then identified internal sections of these  divisions as <ent value="ldquo"></ent>chapter-level
elements<ent value="rdquo"></ent><ent value="mdash"></ent>based upon the body
being divided into a series of chapters.  These chapter level elements are
then assigned a sequential number beginning  from number one within each of
the three structural divisions. Thus, the lower  case letter a, b, or c is
followed directly by a number without an intervening  space or period. Thus
b4.454 indicates the 454th line of the fourth chapter  of the body; a3.34
indicates the thirty fourth line of the 3rd <ent value="ldquo"></ent>chapter<ent
value="rdquo"></ent> (i.e. distinct section) of the <ent value="ldquo"></ent>front<ent
value="rdquo"></ent>.</item><item><hi rend="underline">The line ID</hi>: A <ent
value="ldquo"></ent>line<ent value="rdquo"></ent> in this context is defined
as a <hi>shad</hi>-delimited  series of syllables. By preceding this with
the edition ID, text ID and chapter-level  ID, it constitutes a unique ID
for every line in every edition. We are using  a simple computer routine to
assign these numbers, bypassing the problem of  human error. The incorporation
of chapter numbers not only helps indicate  the structural position of the
passage in the text across editions, but it  also keeps the line numbers from
getting into the tens of thousands, which  would be cumbersome. </item></list></div5><div5><head>Integration
with other  IDs</head><p>Example: Tb1.99.b4.211.</p><list><item>Edition ID:
Tb1 is the  sigla for the edition in question.</item><item>Text ID: the ninety
ninth text  within the edition.</item><item>Passage ID: 4th chapter of the
body, 211th  line within the fourth chapter. If a text has no discernible
chapters, then  the whole body of the text is considered a single chapter
and is identified  as b1.</item></list><p>Periods are thus used to delimit
separate ID items  in general. If one wants to specify a ranges of lines<ent
value="mdash"></ent>say  lines 211 to 228 in chapter four<ent value="mdash"></ent>one
would use a hyphen,  as for instance in b4.211-228. If this range of lines
spanned multiple chapter-level  elements, one would include the chapter-level
ID on both sides of the dash.  Thus, a reference to a textual passage ranging
from line 211 in chapter four  to line 14 in chapter five would be b4.211-b5.14.</p></div5><div5><head>Background</head><p>In
 using print editions, it is standard practice to refer to a line by reference
 to the page and line number within that page<ent value="mdash"></ent>thus
45.4  means the fourth physical line on the 45th page. While this is a simple
and  unambiguous ID, it tells us nothing about the location of the passage
within  the text<ent value="rsquo"></ent>s own structure, and it tells us
nothing about  the location of this line in any other edition. It is simply
a contingent  product of the limitations of the print medium. In determining
an electronic  ID for passages, we want to avoid inordinately lengthy strings
of separate  numbers (like 4.5.6.8.4.9.10.34), large numbers of any types
(1458483, etc.),  complex schemes that encourage mistakes by data entry people
and proofreaders,  and purely arbitrary schemes that have little relation
to the text<ent value="rsquo"></ent>s  own divisions and flow.</p><p>Since
electronic editions are not bound by the  arbitrary page size limitations
of print editions, it is possible to fundamentally  rethink the nature of
passage IDs. We will preserve in the SGML data references  to the original
printed document<ent value="rsquo"></ent>s pagination/lineation  (using MILESTONES),
and the site will allow for any passage ID to be translated  to a pagination/lineation
reference referring back to the printed text. In  coming up with a passage
ID scheme that takes advantage of the new media,  our aim has been, in addition
to simplicity and clarity, to convey something  about the location of the
passage within the text<ent value="rsquo"></ent>s own  structure. This scheme
for passage IDs determines how we divide the seamless  flow of text into textual
units for purposes of collocation of editions, reference,  internal data management,
and so on. While it is impossible to have an identical  passage ID enumeration
across editions because of the variation across editions  (missing lines,
added lines, etc.), it is possible to indicate more about  the passage<ent
value="rsquo"></ent>s structural position within the text, thereby  indicating
roughly where in any given edition it will be.</p><p>Incorporating  the structural
organization of the text into the passage ID<ent value="mdash"></ent>rather
 than merely an arbitrary electronic page of predetermined and regular length<ent
value="mdash"></ent>has the advantage of yielding a certain common reference
to  the PASSAGE ID across editions. For example, within a Shakespeare play,
Hamlet  3.2.4-8 is the fourth through eighth lines of the second scene of
the third  act; or within the Bible, Luke 4:5 is verse 5 of chapter 4. While
the particular  line enumeration may vary from edition to edition within a
given act or verse,  presumably this information of act/scene and chapter/verse
tells us where  we are in general within all editions. Since the normal print
ID of page/line  tells us absolutely nothing about its location within another
edition, this  electronic system has a great benefit.</p><p>When we look to <hi>The
Collected  Tantras of the Ancients</hi> texts for structural divisions that
could be  incorporated within the passage ID, <ent value="ldquo"></ent>chapters<ent
value="rdquo"></ent>  (<hi rend="italics">le<ent value="rsquo"></ent>u</hi>, <hi
rend="italics">skabs</hi>,  etc.) are the most widespread and standard structural
division that would  be useful in this regard. Chapters constitute the <ent
value="ldquo"></ent>body<ent value="rdquo"></ent> of the text, in addition
to which there are the <ent value="ldquo"></ent>front<ent value="rdquo"></ent>
materials (homage, etc.) and <ent value="ldquo"></ent>back<ent value="rdquo"></ent>
materials (colophon, etc.). <ent value="ldquo"></ent>Front<ent value="rdquo"></ent>
and <ent value="ldquo"></ent>back<ent value="rdquo"></ent> may be quite  simple,
or they may also consist of several structural units that in some  sense are
parallel to the divisions of chapters in the body. Unfortunately,  there is
no standard sub-chapter structural division in these types of Tibetan  texts
analogous to an <ent value="ldquo"></ent>act<ent value="rdquo"></ent> or <ent
value="ldquo"></ent>verse<ent value="rdquo"></ent> that might be of use. Outlines <hi
rend="italics">sa bcad</hi>) are not useful since, on the one hand, many texts
 do not use an outline structure, and, on the other hand, they are so complicated
 in their nested hierarchies as to not yield useful information about the
structural  position of a passage within the text. Our scheme allows for a
general correspondence  of passage IDs across editions and also indicates
where one is structurally  within the text. One could add a specification
of the total number of chapters  and lines (b4-11.454.500<ent value="mdash"></ent>the
fourth of 11 chapters, and  the 454th line out of 500 lines), but we have
decided that the value of this  information is outweighed by the increased
bulkiness.</p><p>While we hope  our editing processes eliminate such mistakes,
we may find after we stabilize  the numbering scheme for a given text, that
a line has been left out or an  extra line inserted. If a missing line must
be inserted, we will use the preceding  line<ent value="rsquo"></ent>s number
and add <ent value="ldquo"></ent>a<ent value="rdquo"></ent>, <ent value="ldquo"></ent>b<ent
value="rdquo"></ent>, etc. without any intervening space or  period. If an
extra line must be deleted, then we will simply eliminate that  number from
the scheme, so that the enumeration will go from 56 to 58, for  example, if
57 was the deleted line. </p><p><hi rend="underline">Collation</hi>:  while
the passage ID will be roughly similar across editions, missing/extra  lines
will introduce a gradual discordance in enumeration of lineation. This  is
particularly true for such things as differences in the insertion or absence
 of <ent value="ldquo"></ent><hi rend="italics">shad</hi><ent value="rdquo"></ent>.
It  is thus necessary to have a standard lineation ID which can thus be used
to  coordinate all the separate editions. Usually people choose the lineation
 of the edition thought to be closest to the original, or the longest edition.
 Since we are dealing with an entire canon, we are suggesting we choose the
 lineation given for the edition of texts found in the largest edition of
the  canon<ent value="mdash"></ent>i.e., that which contains the most individual
texts<ent value="mdash"></ent>namely, the mTshams brag (Tb) edition. When
a given text is  absent in Tb, we will choose the standard lineation on a
case by case basis,  following the general principles cited above.</p></div5></div4></div3><div3
n="" type=""><head>Non-<hi>Collected Tantras of the Ancients</hi> Collections</head><p>The
 main standard in the use of Sigla has been recently established in reference
 to editions of the bKa<ent value="rsquo"></ent> <ent value="rsquo"></ent>gyur
and bsTan <ent value="rsquo"></ent>gyur canon, the main two part canon of
Tibetan translations  of Indic texts. These have been given in PIATS 7 vol.
1, p. xvii-xx, and discussed  in the bKa<ent value="rsquo"></ent> <ent value="rsquo"></ent>gyur
volume (IATS 7 vol  III). They basically assign single letter siglum to each
edition. We will  follow this scheme for references to the bKa<ent value="rsquo"></ent> <ent
value="rsquo"></ent>gyur/bsTan <ent value="rsquo"></ent>gyur:</p><list rend="u"><item>A:
Ta p(h)o/rTa po MS fragments</item><item>B:  Berlin MS bKa<ent value="rsquo"></ent> <ent
value="rsquo"></ent>gyur</item><item>C:  Co ne bKa<ent value="rsquo"></ent> <ent
value="rsquo"></ent>gyur and bsTan <ent value="rsquo"></ent>gyur</item><item>D:
 sDe dge bKa<ent value="rsquo"></ent> <ent value="rsquo"></ent>gyur and bsTan <ent
value="rsquo"></ent>gyur</item><item>E:  Newark MS bKa<ent value="rsquo"></ent> <ent
value="rsquo"></ent>gyur</item><item>F:  Phu brag MS bKa<ent value="rsquo"></ent> <ent
value="rsquo"></ent>gyur</item><item>G:  dGa<ent value="rsquo"></ent> ldan
or <ent value="ldquo"></ent>Golden Manuscript<ent value="rdquo"></ent> bsTan <ent
value="rsquo"></ent>gyur</item><item>H: Lha sa bKa<ent value="rsquo"></ent> <ent
value="rsquo"></ent>gyur</item><item>J: Li thang or <ent value="rsquo"></ent>Jang
 sa tham bKa<ent value="rsquo"></ent> <ent value="rsquo"></ent>gyur</item><item>K:
Peking  edition of the bKa<ent value="rsquo"></ent> <ent value="rsquo"></ent>gyur
prepared in  1684/92 under the Kangxi emperor</item><item>L: London MS bKa<ent
value="rsquo"></ent> <ent value="rsquo"></ent>gyur</item><item>M: Peking edition
of the Mongolian bKa<ent value="rsquo"></ent> <ent value="rsquo"></ent>gyur</item><item>N:
sNar thang bKa<ent value="rsquo"></ent> <ent value="rsquo"></ent>gyur and
bsTan <ent value="rsquo"></ent>gyur</item><item>O:  O rgyan gling or Tawang
MS bKa<ent value="rsquo"></ent> <ent value="rsquo"></ent>gyur</item><item>P:
 Petersburg MS of Mongolian bKa<ent value="rsquo"></ent> <ent value="rsquo"></ent>gyur</item><item>Q:
 Peking bKa<ent value="rsquo"></ent> <ent value="rsquo"></ent>gyur/bsTan <ent
value="rsquo"></ent>gyur  prepared in 1737 under the Qianlong emperor; also
the modern photographic  reprint of the Peking edition, which is based on
a combination of the 1717/20  and 1737 issues.</item><item>S: sTog Palace
MS bKa<ent value="rsquo"></ent> <ent value="rsquo"></ent>gyur</item><item>T:
Tokyo or Kawaguchi MS bKa<ent value="rsquo"></ent> <ent value="rsquo"></ent>gyur</item><item>W:
bKa<ent value="rsquo"></ent> <ent value="rsquo"></ent>gyur  and bKa<ent value="rsquo"></ent> <ent
value="rsquo"></ent>gyur supplement prepared under  the Wanli emperor</item><item>Y:
bKa<ent value="rsquo"></ent> <ent value="rsquo"></ent>gyur  prepared under
the Yongle emperor.</item></list><p>These are important to  the extent that
they have <ent value="ldquo"></ent><hi>rnying rgyud</hi><ent value="rdquo"></ent>
 sections, as do the Tawang, sDe dge, and so forth. </p><p>We have decided
 that for all collections of multiple texts other than the bKa<ent value="rsquo"></ent> <ent
value="rsquo"></ent>gyur standards<ent value="mdash"></ent>i.e., various canons,
as  well as various collected works (<hi rend="italics">gsung <ent value="rsquo"></ent>bum</hi>)
 and miscellaneous works (<hi rend="italics">gsung thor bu</hi>)<ent value="mdash"></ent>we
 will follow our principle of assigning sigla of two letters (the first letter
 is upper case, and the second letter is lower case). The text ID will follow
 the standard <hi>Collected Tantras of the Ancients</hi> scheme, while the
 passage ID will include chapter and then the page/line of the manuscript.
 Thus Rc84.6-4.2.3 is the 3rd line of the 2nd page of the 4th chapter of the
 6th text in the 84th volume of the <hi rend="italics">Rin chen gter mdzod</hi>.</p><list><item>Rc:
 Rin chen gter mdzod.</item><item>Km: rNying ma bka<ent value="rsquo"></ent>
ma.</item><item>Km1:  rNying ma bka<ent value="rsquo"></ent> ma rgyas pa.</item><item>Dn:
gdam ngag mdzod.</item><item>Gc:  rgyud bcu bdun. The various editions number
1 onwards. </item><item>Bn: bi  ma snying thig. </item><item>Kn: mkha<ent
value="rsquo"></ent> <ent value="rsquo"></ent>gro  snying thig.</item><item>Ky:
mkha<ent value="rsquo"></ent> <ent value="rsquo"></ent>gro  yang thig</item><item>Ly:
bla ma yang thig.</item><item>Zy: zab mo yang thig.</item><item>Dh:  signifies
Dunhuang materials. Then have a period followed by the traditional <ent value="ldquo"></ent>P<ent
value="rdquo"></ent>, <ent value="ldquo"></ent>IOL<ent value="rdquo"></ent>,
 etc. This is important for the Dunhuang Ma<ent value="ntilde"></ent>ju<ent
value="sacute"></ent>r<ent value="imacr"></ent>n<ent value="amacr"></ent>masamgiti,
the Dunhuang Vajrakil<ent value="amacr"></ent>ya stuff,  the Guhyagarbha stuff,
etc</item><item>Potala: 7 volume collection of <hi>rnying  rgyud</hi>.</item></list></div3><div3
n="" type=""><head>Sigla for early lists  of <hi>The Collected Tantras of
the Ancients</hi> Texts</head><p>We are reserving  the sigla <ent value="ldquo"></ent>Li#<ent
value="rdquo"></ent> for lists of texts found  in other texts. Ng-Li# specifically
indicates lists of <hi rend="italics">rnying  rgyud</hi> prior to the fifteenth
century that may indicate now lost texts.  The use of the standard <ent value="ldquo"></ent>Li<ent
value="rdquo"></ent> allows  the user to immediately identify this as a list
rather than an edition. All  the lists together from a given text will be
entered as a single file, assigned  numbers with the two letter code, and
then cross referenced against <hi>The  Collected Tantras of the Ancients</hi>
indexes. </p><list rend="u"><item>Ng-Li1:  grub mtha<ent value="rsquo"></ent>
mdzod</item><item>Ng-Li2: karma pakshi</item><item>Ng-Li3:  nyang ral chos <ent
value="rsquo"></ent>byung</item><item>Ng-Li4: zang gling ma</item><item>Ng-Li5:
 lde<ent value="rsquo"></ent>u chos <ent value="rsquo"></ent>byung.</item><item>Ng-Li6:
 bu ston<ent value="rsquo"></ent>s lists of texts left out of the bKa<ent
value="rsquo"></ent> <ent value="rsquo"></ent>gyur and bsTan <ent value="rsquo"></ent>gyur.</item><item>Ng-Li7:
 List of <hi>The Collected Tantras of the Ancients</hi> dkar chag from Chos
 rgyal <ent value="rsquo"></ent>phags pa.</item><item>Etc.</item></list></div3><div3
n="" type=""><head>Sigla for Individual Texts</head><p>Works printed individually
 or separately from <hi>The Collected Tantras of the Ancients</hi>, such as
 commentaries, the source of lemma, and individual editions of texts from <hi>The
 Collected Tantras</hi>, are essential to a thorough understanding of <hi>The
 Collected Tantras</hi>. Sigla for individual works consist of three letters,
 the first of which is upper case and the latter two of which are lower case.
 We begin by using the root letters of the first three words, and then use
 the successive words as necessary until we have a unique siglum. In terms
 of variant editions of a given work, we simply use a number in the fourth
 slot to distinguish them (Tdd, Tdd1, Tdd2, etc.). </p><list rend="u"><item>Tdd:
 tshig don mdzod. </item><item>Tcd: theg mchog mdzod</item><item>Yzd: yid
bzhin  mdzod</item><item>Nld: gnas lugs mdzod</item><item>Cbd: chos dbyings
mdzod</item><item>Gtd:  grub mtha<ent value="rsquo"></ent> mdzod</item></list></div3></div2><div2
type="section"><head>File  Types and Naming Conventions</head><p>A number
of different file types are  included in Tibetan Literature Archive<ent value="rsquo"></ent>
digital catalogs. We  have already seen three different levels of objects
in regards to the identification  numbers: the edition, the volume, and the
text. Each of these has a unique  bibliographic record describing it. There
are moreover other files associated  with these, such as scanned images of
the pages, transliterations of texts,  textual summaries and so forth. In
order that these may be properly interconnected,  a file naming convention
has been developed so that these files may be named  in a consistent fashion.
The average user will generally not be concerned  with the file names, but
they will be important when downloadable access to  files becomes available.
Therefore, below is a list of the various file types  and their naming protocol.
More detailed description of what these files are  will be found in the next
section on Mark-up.</p><p><list rend="u"><head>File  Names</head><item><hi
rend="underline">Bibliographic Records</hi>: All bibliographic  record files
will be composed of the ID number plus <ent value="ldquo"></ent>.bib.sgm<ent
value="rdquo"></ent> (until we convert to XML at which point that will be
the extension).  Thus, the mTshams brag edition bibliographic record is in
a file called Tb.bib.sgm;  its second volume<ent value="rsquo"></ent>s bibliographic
record is in Tb.v2.bib.sgm,  and its 762nd text<ent value="rsquo"></ent>s
bibliographic record is in Tb.762.bib.sgm.</item><item><hi rend="underline">Catalog
Files</hi>: An individual item<ent value="rsquo"></ent>s  bibliographic record
is contained in a single file. This produces a great  number of individual
files. These are all brought together in the catalog  files. The catalog files
record the structure of a volume or an edition  by creating an SGML hierarchy
with the volume<ent value="rsquo"></ent>s bibliographic  record at the top
with the bibliographic records of the individual text it  contains under it.
The same principle applies to an edition with its bibliographic  record on
top and the individual volumes listed containing their texts under  that.
This effectively recreates the hierarchical structure of Edition:Volume:Text.
 The way this is done is through catalog records that use Astoria<ent value="rsquo"></ent>s
 sharing capacity to import the latest version of text or volume bibliographic
 records into a predefined SGML hierarchy. These catalog files are named 
on a similar principle as the bibliographic records, except that the suffix <ent
value="ldquo"></ent>.cat.sgm<ent value="rdquo"></ent> is used instead. Thus,
the catalog  record for the mTshams brag is Tb.cat and its 43rd volume is
Tb.v43.cat. These  files serve to organize and hold the data, though they
are only viewed in  the sense that they define the hierarchy which is portrayed
in the TOC.</item><item><hi rend="underline">Doxographical Catalog Files</hi>:
An alternative type of  catalog file is one that organizes the texts within
an edition not by volume  and placement in the volume but by how the edition
categorizes them doxographically.  Each edition is organized along different
doxographical lines. The doxographical  catalog file will use SGML/XML to
recreate the hierarchy of doxographical  categories portrayed in each edition
and import the text bibliographic records  into the appropriate categories.
These files will be named with the edition  sigla followed by the extension <ent
value="ldquo"></ent>.doxcat.sgm<ent value="rdquo"></ent>.  Thus, for the mTshams
brag edition, it would be Tb.doxcat.sgm. These kind  of files will only exist
for editions. The master catalog file represents  an exception in that, though
it is organized doxographically, its name will  be Ng.cat.sgm.</item><item><hi
rend="underline">Text Representation Files</hi>:  There are several types
of files that will contain representations of Tibetan  texts. The Library
intends to provide access to four digital versions of each  text: extended
Wylie transliteration, a digital Tibetan script version (created  from the
Wylie with converters), English or other non-Tibetan language translations,
 and scanned images of the folios of the text. Critical editions of a text
 in any format, except of course the scanned images, will be associated with
 the master catalog and will have identification numbers associated with 
that catalog. Those ID numbers will represent the doxographical category 
of the text and will often have multiple parts. For instance, Ng3.1.2.4 refers
 to the fourth text in the second sub-sub-section of the first sub-section
 of the Mah<ent value="amacr"></ent>yoga section of <hi>The Collected Tantras
of  the Ancients</hi>. These representations of texts will be accessible through
 the catalog. Each will have its own naming scheme:<list rend="u"><item><hi
rend="underline">Transliteration</hi>: Using our extended Wylie transliteration
 scheme, each text will be entered in transliterated form. This text will
be  marked-up in SGML/XML for greatest flexibility. These names for these
files  will be the text<ent value="rsquo"></ent>s ID number with <ent value="ldquo"></ent>.wyl.sgm<ent
value="rdquo"></ent> appended to it. E.g., Tb.516.wyl.sgm. (Note: The final
extension,  .sgm, is interchangeable with .xml in these descriptions. For
the moment,  our data is encoded in SGML but the new extension will be used
when we convert  to XML. This also applies to the following file names.)</item><item><hi
rend="underline">Tibetan  Script</hi>: The Tibetan script version of the text
will make use of a Wylie/Tibetan  converter being developed by the Library.
The converter will be run on the  Wylie file and output an SGML/XML file with
Tibetan Unicode, when such becomes  available. The names for these files will
be the file ID plus <ent value="ldquo"></ent>.tib.sgm<ent value="rdquo"></ent>.
Thus, Tk.109.tib.sgm.</item><item><hi rend="underline">Translations</hi>:
 The Library<ent value="rsquo"></ent>s translation policy has not been fully
worked  out. Our intention is to include well-done scholarly translations
of texts,  whenever suitable and available. Because much of the material in <hi>The
Collected  Tantras of the Ancients</hi> is considered sacred, esoteric literature,
a  translation policy that respects the traditional attitudes must be worked
 out through a dialogue between scholars and traditional practitioners. Whenever
 translations are available, however, they will be named with the text<ent
value="rsquo"></ent>s ID number, followed by a three-letter language code
and the  .sgm. Thus, it would be, for instance, Dg.18.eng.sgm, Dg.18.fre.sgm,
Dg.18.jap.sgm,  etc.</item><item><hi rend="underline">Digital Images</hi>:
The scanned images  of a text<ent value="rsquo"></ent>s actual folio sides
presents a different problem.  The texts of an edition are compiled into volumes
where one text follows directly  after the previous. Texts, therefore, generally
begin and end in the middle  of a folio side. It becomes problematic to name
a digital image of a folio  using a text ID since a single folio may contain
the end of one text and the  beginning of another. To avoid this problem,
digital images are instead named  according to their volume and folio number.
Thus, the three-hundred and sixty-eighth  side of volume seventeen in the
mTshams brag edition would be Tb.v17.368.jpg.  This is the same way that pages
in volumes can be referred to without referring  to a text. The difference
is, however, that the extension describes the image  type in this case it
is a JPEG image with the extension (.jpg).</item></list></item><item><hi rend="underline">Chapter
Summary Files</hi>: Most likely before most translations  come on line, summaries
of a text<ent value="rsquo"></ent>s chapters will be available.  These are
easier to produce as they require less intense time-commitments  though they
would still be reviewed with academic rigor. Each text would have  a single
SGML/XML file that would contain all its chapter summaries. This  file would
be called by the text<ent value="rsquo"></ent>s ID followed by .chps.sgm.
 Thus, an example would be Kg.301.chps.sgm.</item><item><hi rend="underline">Text
 Scholarship Files</hi>: Modern scholarship on a particular text will be recorded
 in two ways. Within the text<ent value="rsquo"></ent>s bibliographic record
itself,  there will be a number of discussion fields for shorter discussions
on various  topics revolving around that text<ent value="mdash"></ent>its
translators, doxographical  category, composition, and so forth. However,
longer articles on a particular  item in the catalog, whether text, volume
or edition, will have a separate  file. All articles pertaining to a single
item will be contained in the same  file. These scholarship files will be
named by the convention of ID number  plus the extension, .schp.sgm. For instance,
Wa.482.schp.sgm.</item></list></p><p>Below  is a table summarizing the file
types and the name formats. Items in braces  provide descriptive information:</p><p><table
rend="b1" rows="20" cols="150"><head>File  Types and Naming Conventions Table</head><row
rend="head"><cell rend="150">File  Type</cell><cell rend="150">Naming Method</cell><cell
rend="150">Example</cell></row><row><cell>Edition  Bibliographic Record</cell><cell>{sigla}.bib.sgm</cell><cell>Tb.bib.sgm</cell></row><row><cell
>Volume  Bibliographic Record</cell><cell>{sigla}.v{vol. #}.bib.sgm</cell><cell>Tk.v3.bib.sgm</cell></row><row><cell>Text
 Bibliographic Record</cell><cell>{sigla}.{text #}.bib.sgm</cell><cell>Dg.591.bib.sgm</cell></row><row><cell>Edition
 Catalog Record</cell><cell>{sigla}.cat.sgm</cell><cell>Kg.cat.sgm</cell></row><row><cell>Volume
 Catalog Record</cell><cell>{sigla}.v{vol. #}.cat.sgm</cell><cell>Bg.v03.cat.sgm</cell></row><row><cell>Doxographical
 Catalog Files</cell><cell>{sigla}.doxcat.sgm</cell><cell>Wa.doxcat.sgm</cell></row><row><cell>Text
 Transliterations</cell><cell>{sigla}.{text #}.wyl.sgm</cell><cell>Ng3.1.2.4.wyl.sgm</cell></row><row><cell>Digital
 Tibetan Script Representations</cell><cell>{sigla}.{text#}.tib.sgm</cell><cell>Bg.45.tib.sgm</cell></row><row><cell>Text
 Translations</cell><cell>{sigla}.{text#}.{language code}.sgm</cell><cell>Tb.417.eng.sgm</cell></row><row><cell>Scanned
 Images of Folios</cell><cell>{sigla}.v{vol. #}.{page #}.{image format}</cell><cell>Tk.v09.471.tif</cell></row><row><cell>Chapter
 Summaries</cell><cell>{sigla}.{text #}.chps.sgm</cell><cell>Dg.108.chps.sgm</cell></row><row><cell>Scholarship
 Files</cell><cell>{sigla}.{text #}.schp.sgm</cell><cell>Tb.45.schp.sgm</cell></row></table></p></div2><div2
type=""><head>HTML Display Conventions</head><p>As the extent of the Tibetan
 and Himalayan Digital Library<ent value="rsquo"></ent>s project has grown,
we have  seen the need to bring some standards into the web-pages for each
of the various  components. Our goal is to use a standard HTML mark-up scheme,
relying on  CSS style-sheets, to provide a consistent look to each of the
many facets  of the project. To this end, we have devised a standard format
for home pages  in the THDL.</p><div3><head>Standard Format for Home Pages</head><p>THDL
asks  that subcomponents follow the standard format for home pages to maintain
consistency  of look and use for projects at their root level as part of an
integrated  approach.  This involves the standard visual appearance and structure
of the  home page.  The basic visual design for these subcomponents follows
a single  library pattern, but the color and font differ depending on which
component  of the library the subcomponent belongs to.</p><p><list rend="u"><item>In
 the main part of the page, a short blurb to the left and an illustrative
image  to the right, and then the standard set of info links on the right
lower part  of the page.</item><item>The left framing banner should have a
list of components  of the project that are hot-linked, and at the bottom
should be the hot-linked  name of the component of the library it belongs
to (The Environmental Archive,  etc.), and below that the hot-linked name
of The Tibetan and Himalayan Library.</item></list></p><p><list n="1" rend="1"><item>Thus
there will be a left bar which will be the locus  for the chief menu options
for the front page. At the bottom of the left bar,  and in this order, it
should read vertically arranged: <ent value="ldquo"></ent>Modes  of Access<ent
value="rdquo"></ent>, The name of this Library component (i.e. which  of the
six components this subcomponent belongs to), and finally a link to  the home
page for the Tibetan and Himalayan Digital Library.</item><item>There  will
be a top bar with the name of the subcomponent as the large header, and  with
the name in Tibetan script below it. To the right of the large header  will
appear either the <ent value="ldquo"></ent>University of Virginia<ent value="rdquo"></ent>,
 or other Universities as appropriate. Both left and top bars are colored
to  match the color coding for that part of the library.</item><item>The standard
 logo of the THDL is in the upper left hand corner<ent value="mdash"></ent>the
sacred  knot. </item><item>As mentioned above, in the body of the page inside
of the  two frames is a paragraph on the left hand side describing the project,
and  to the right hand side a visual image that somehow embodies this subcomponent.
  The image should have a one line caption in cursive.</item><item>Below the
 blurb and image, to the right hand side below the image should be the standard
 set of info links on the subcomponent.  See one of the pages for the structure
 of the infolinks and content.</item></list></p><p>For email contact addresses,
 and update/announcement mailing lists, THDL follows a common format. The
basic  email contact address is the name of the project@.  Thus The Samantabhadra
 Collection<ent value="rsquo"></ent>s email contact is <hi rend="email">samantabhadra@virginia.edu</hi>.
  If the name is too long, then an appropriate abbreviated form is used. 
For  mailing lists used for announcements and updates, we use the form Name
of  Project-update@.  Thus, in the previous case, <hi rend="email">samantabhadra-update@virginia.edu</hi></p></div3></div2></div1>

<!--MARKUP-->

<div1 type="chapter"><head>III. Markup</head><p>This
chapter of the technical notes deals with the markup procedures used  in the
s. It has the following sections: <list n="1" rend="A"><item>Introduction
 to SGML/XML</item><item>The Use of SGML in The Samantabhadra Collection</item><item>Encoding
 the Structure of a Text</item><item>SGML Tagging Issues</item><item>Display
 Issues and Style Sheets</item></list></p><div2><head>Introduction to SGML/XML</head><p>In
 this age of ever-advancing technology, <ent value="ldquo"></ent>markup<ent
value="rdquo"></ent>  languages have become the standard for both structuring
documents and presenting  them over the World Wide Web. The most widely-known
of these languages is  HTML (HyperText Markup Language), which is used for
creating web pages. Now  in its fourth version, HTML is actually a specific
implementation of SGML  (Standard Generalized Markup Language). SGML is described
as a markup meta-language  in that it is used to define the rules of other
markup languages such as HTML. </p><p>SGML  is an international standard for
the description of marked-up electronic text.  It is a meta-language that
formally describes a markup language. In this context, <ent value="ldquo"></ent>markup<ent
value="rdquo"></ent><ent value="mdash"></ent>or more precisely, <ent value="ldquo"></ent>encoding<ent
value="rdquo"></ent><ent value="mdash"></ent>signifies any  means of making
explicit an interpretation of a text in order to direct the  user of the text
in how the content of the text should be interpreted. A <ent value="ldquo"></ent>markup
language<ent value="rdquo"></ent> is thus a set of such conventions  used
for encoding texts that specifies what markup is allowed, what markup  is
required, and how markup is to be distinguished from the actual primary  text.
SGML is the parent language of HTML (Hypertext Markup Language), the  basic
language generally used to encode Web pages. It is a descriptive set  of tags
that is enabled by the tag-set building rules contained within SGML.  EXtensible
Markup Language (XML) is a parent language like SGML and is both  derived
from and compatible with SGML. XML is designed for easier delivery  on the
Internet than is SGML, and for easier implementation by software developers.
 XML is beginning to supplant HTML as the language of choice for the Web,
though  this shift will take several years to complete. </p><p>SGML was developed
 as a legal markup language for IBM by Goldfarb, Mosher, and Lorie in 1969
 and was first standardized by the ISO (International Organization for Standardization)
 in 1986. It was eventually discovered, however, that SGML was in fact <hi
rend="italics">too</hi> general to be easily implemented by the average user.
 Then, in 1989, Tim Berners-Lee and Andrew Burglund developed HTML for CERN.
 Because HTML was a much-simplified and static subset of SGML, it became the <ent
value="ldquo"></ent>standard<ent value="rdquo"></ent> markup language used
on the internet.  While HTML is convenient because of its simplicity, it is
not adaptable to  diverse types of information, but is merely a formatting
language. The flexibility  of SGML, on the other hand, lies in the ability
to create user-defined tags  for structural encoding (delimiting the structure
of a document) and descriptive  encoding (describing the nature of a document<ent
value="rsquo"></ent>s content).  This is done through a Document Type Definition
(DTD) that determines where  tags can be used, what attributes they have,
and what content they can hold. </p><p>HTML  is one language defined using
an SGML DTD. Its simplicity led to it being  quickly adopted by the fledgling
internet community so that now it is the  standard for <hi rend="italics">presentation</hi>
of web documents. However,  structural and descriptive markup is almost completely
absent from HTML, as  it deals only with how the document is to be displayed.
For this reason, SGML  has continued to be used despite its unwieldiness.
In an effort to make SGML  more manageable and accessible, a version of it
known as XML or eXtensible  Markup Language has been developed. Whereas SGML
can only be displayed using  expensive, proprietary software, XML is paired
with a styling language, XSL  (eXtensible Style Language), that allows XML
documents to be transformed in  any number of ways. The theory is that XML
will be transformed using XSL into  formatted objects that someday browsers
will be able to read and display.  However, because XML is still relatively
new, we are just now seeing its initial  implementations in the most common
browsers. Until browsers are fully XML-compatible,  XML must first be transformed
into HTML for it to be presented on the web.</p><p>When  the Samantabhadra
Project first approached IATH for technical assistance in  1997, the W3C consortium,
which developed XML, had yet to release XML 1.0,  which came out in 1998.
Even now XML is still in its infancy. Thus, the information  documented and
stored in our  has been encoded in SGML, in which it will remain  until XML
becomes sufficiently stable and can be implemented easily enough  to make
the conversion practical.</p><p>The general value of SGML can be understood
 via three points.  The first is that SGML allows one to descriptively markup
 the intellectual content and structure of a document. In contrast, HTML is
 chiefly concerned with superficial display issues. SGML uses descriptive
rather  than procedural markup, meaning that it simply identifies portions
of the  document rather than specifying what processing should be carried
out at particular  points in a document. Thus the data structure of a particular
SGML file is  geared towards an intellectual analysis of the document. In
this way, its  tags allow one to create richly structured documents by designating/encoding
 such information as structural divisions (title page, main body of text,
scene,  stanza, section, date, author, etc.), in addition to conveying information
 about renditional and typographical elements (changes in typeface, line breaks,
 etc.). This enables sophisticated searches and other manipulation of the
data.</p><p>The  second advantage is SGML<ent value="rsquo"></ent>s universality<ent
value="mdash"></ent>as  long as the underlying data structure is accurate,
it can be understood and  used by any program or tool that understands SGML.
Instructions are needed  to actually process the document. Since the instructions
for processing an  SGML document for a particular purpose (such as formatting)
are strictly separated  from the descriptive markup within the document, they
are usually collected  outside the document in separate procedures or programs.
This means that the  same SGML document can be readily processed by entirely
different pieces of  software in quite distinct ways. A single part of an
SGML file can thus be  processed simultaneously in a variety of ways. In addition,
an SGML text consists  of plain ASCII text with special tags contained in
angle brackets (e.g., <ent value="lt"></ent>author> Padmasambhava <ent value="lt"></ent>/author>).
 These <ent value="ldquo"></ent>tags<ent value="rdquo"></ent> or <ent value="ldquo"></ent>elements<ent
value="rdquo"></ent> identify the type or format of information contained
between  them. Since the tags are composed of plain text ASCII characters,
no special  software or proprietary binary code is necessary to create an
SGML file.   This ensures long-term viability and easy delivery of files across
networks  and platforms. </p><p>The third point is the <ent value="ldquo"></ent>Document
 Type Definition<ent value="rdquo"></ent> (DTD), which defines the identity
and  functionality of the tagging elements that can be used. For a detailed
discussion  of tagging, see the separate document on <ent value="ldquo"></ent>SGML-tagging
 in <hi rend="italics">The Collected Tantras of the Ancients (rNying ma rgyud<ent
value="rsquo"></ent>bum)</hi>)<ent value="rdquo"></ent>.  Documents are thus
understood  as having <ent value="ldquo"></ent>types<ent value="rdquo"></ent>,
which is formally  defined by its constituent parts and their structure. 
Documents of the same  type can thus be processed in a uniform way. SGML is
used in many fields<ent value="mdash"></ent>the airline industry, defense
industry<ent value="mdash"></ent>and  for each field of use, there tends to
be standard sets of elements/tags (or  DTDs) that apply to the types of concerns
and documents dominating that field.  Most pertinently for our purposes, most
serious digital library initiatives  use the tag set or DTD known as the TEI
(Text Encoding Initiative) Guidelines,  which was developed specifically for
humanities computing projects. A subset  of this is referred to as TEI-Lite.
 TEI is an attempt to set standards for  the use and manipulation of texts.
HTML versions are then created, often on-the-fly,  for web delivery. The downside
of relying on TEI is that it can overly restrict  one<ent value="rsquo"></ent>s
flexibility; the upside is that it conforms to standards  being used in other
humanities projects. Thus TEI will often lack the elements  with the precise
functionality one wants, at which point one uses a program  to create those
specialized elements in a TEI-compliant DTD that defines them.  In addition
to creating new elements, a DTD can also modify attributes of  an existing
element (making them required fields, etc.). </p><p>In order to  facilitate
data entry for large projects which require multiple files of a  similar structure,
it is possible to create SGML-based templates consisting  of SGML tags in
precise hierarchies of arrangement into which the data is  entered. The associated
DTD then defines the nature of the tagging elements  used in that template.</p><p>The
following sections of this chapter deal with  the SGML markup of our information
and its display. Because of the unique  issues involved in cataloging and
representing Tibetan texts electronically,  we developed a specific SGML-defined
language to accommodate these peculiarities.  Working for over a year in weekly
meetings with Daniel Pitti from IATH, we  developed our TIBBIBL (TIBetan BIBLiography)
DTD for the encoding of <hi rend="italics">The  Collected Tantras of the Ancients
(rNying ma rgyud <ent value="rsquo"></ent>bum)</hi>  catalog. The TIBBIBL
DTD is based on the TEI (Text Encoding Initiative)  DTD with significant additions
to account for information specific to Tibetan  literature such as a text<ent
value="rsquo"></ent>s <ent value="ldquo"></ent>revealer<ent value="rdquo"></ent>
and <ent value="ldquo"></ent>concealer<ent value="rdquo"></ent>, multiple
 titles for a single text, and so forth. This will be the topic discussed
in  what follows.</p></div2><div2 n="" type=""><head>The Use of SGML in The
Samantabhadra  Collection</head><p>We have chosen to exclusively base our
project in SGML.   This ensures the data<ent value="rsquo"></ent>s long-term
viability, optimal accessibility  over the Internet, ability to interact with
other sophisticated humanities  projects, and finally that the project<ent
value="rsquo"></ent>s electronic nature  takes full intellectual advantage
of the possibilities of new technology.  We have thus created a series of
separate SGML templates with their own special  TEI-compliant DTDs into which
the <ent value="rsquo"></ent>s underlying data is  being entered. The initial
infrastructure has necessitated an immense commitment  of skilled labor over
a two-year period, but the long-term advantages justify  it. </p><p>We are
working with an augmented TEI which has specialized DTDs  embedded within
it. Thus we have all the TEI general elements available to  us, as well as
the unique elements we have added for use in our project. Our  major extension
to TEI is the TIBBIBL, which is found in our TIBBIBL DTD.  It borrows as many
of the TEI tags as possible but has its own structure and  contains between
twenty and thirty unique elements with functionality specific  to our project.
These elements are encapsulated within, rather than being  scattered throughout,
TEI. Thus, the DTD is <ent value="ldquo"></ent>TEI WITH TIBBIBL  EXTENSION<ent
value="rdquo"></ent>. A <ent value="ldquo"></ent>BIBL<ent value="rdquo"></ent>
 is a TEI grouping element that tells SGML that the enclosed information is
 bibliographic information, thus organizing the information into a hierarchy
 of common groups.</p><p>During the development phase of the project, a great
 deal of time was spent in constructing the TIBBIBL DTD. This involved conceptually
 delineating and constructing the code for tags and attributes not found in
 TEI that were necessary for the adequate cataloging and representation of
 Tibetan texts. Tags such as AUTHOR and EDITOR were already present in TEI,
 but tags for REVEALER and CONCEALER were not. Ways for dealing with a multiplicity
 of end-of-chapter text-titles and so forth were also developed. The general 
organizational principle was the cataloging categories outlined in the cataloging
 standards for Tibetan texts detailed above. Details of the DTD are available
 on request.</p></div2><div2><head>Encoding the Structure of a Text</head><div3
n="" type=""><head>The SGML structure of the data</head><p>Entering the Tibetan
 text into an SGML data structure requires that we take the flow of the Tibetan
 text and divide it up within an SGML-defined set of structural elements.
This  involves striking a balance between recording the text<ent value="rsquo"></ent>s
 own structural divisions and inserting new divisions for our own purposes.
 Another issue is preserving reference to the original manuscript<ent value="rsquo"></ent>s
 pagination and lineation, in contrast to the structure of the text<ent value="rsquo"></ent>s
 thematic divisions (chapters, outlines, etc.). Both sets of information can
 be preserved, but one only can become the basis for the SGML structural division
 of the text. This has consequences for the degree to which searches and display
 can be based on one or the other. </p><p>We can speak of 6 distinct layers
 to the text<ent value="rsquo"></ent>s structure: <list n="1" rend="1"><item><hi
rend="boldface">Section-level elements</hi>: front, body, back</item><item><hi
rend="boldface">Chapter-level elements</hi>: divisions of the preceding, such
 as the chapters of the body</item><item><hi rend="boldface">Subsection-level
 elements</hi>: labeled divisions of the chapters, such as in an <ent value="ldquo"></ent>outline<ent
value="rdquo"></ent> (<hi rend="italics">sa bcad</hi>) structure</item><item><hi
rend="boldface">Paragraph-level elements</hi>: paragraphs for prose or line
 groups for verse</item><item><hi rend="boldface">Sentence-level/sub-paragraph
 level elements</hi>: defined as <ent value="ldquo"></ent><hi rend="italics">shad</hi>-delimited
 lines<ent value="rdquo"></ent> in our scheme</item><item><hi rend="boldface">Individual
 syllables</hi></item></list> </p><p>We have chosen to base our structure
on  the text<ent value="rsquo"></ent>s own thematic divisions rather than
the edition<ent value="rsquo"></ent>s pagination. This allows the text to
be displayed according  to its own internal structure rather than the accidental
structure of what  text fit on what page in this printing. We will preserve
the original artifact<ent value="rsquo"></ent>s pagination and lineation,
but our own IDs will follow our  own numbering scheme that is common across
all editions (see <ent value="ldquo"></ent>ID/SIGLA<ent value="rdquo"></ent>
documentation).</p><p>The key TEI element that defines the  basic structure
of an SGML file is the <ent value="ldquo"></ent>DIV<ent value="rdquo"></ent>
 tag. A DIV element represents a <ent value="ldquo"></ent>DIVISION<ent value="rdquo"></ent>
 within the text. DIV tags can contain subdividing elements, which allows
one  to construct a series of nested hierarchies that mirror the structure
of the  text. Thus, DIV1 elements can contain DIV2 elements, which in turn
can contain  DIV3s, and so on. A document can also contain multiple DIV1 elements,
since  these represent the basic divisions of the text.</p><p>The DIV thus
allows  us to represent the structural elements of the subdivisions of the
FRONT,  BODY and BACK, as well as any nested outlines contained within those
subdivisions.  Every text has three DIV1s, which represent its Front, Body,
and Back sections.  Within each DIV1, DIV2<ent value="rsquo"></ent>s represent
the chapter-level elements  within the Front, Body and Back. When a text has
chapters and no <ent value="ldquo"></ent>outline<ent value="rdquo"></ent>
(<hi rend="italics">sa bcad</hi>), the DIV2s of the Body correspond  to the
chapters and there are no DIV3s. When a text has chapters with nested  hierarchies
of subsections that are contained in a single chapter, the chapter  occupies
the DIV2, the first layer of the outline occupies the DIV3, the second  layer
of the outline occupies the DIV4, and so forth. We need at least thirty  layers
of DIVs, because the nested hierarchies can easily be that complex.  Thus,
there is a fluid number of DIV layers that depend on the complexity  of the
outline. This direct correlation of the nested hierarchy of DIVs and  the
nested hierarchy of the text<ent value="rsquo"></ent>s own outline allows
for  searches on the individual sections of the outline.</p><p>When a text<ent
value="rsquo"></ent>s outline has a nested hierarchy of subsections which
spans  multiple chapters in terms of its enumeration (e.g., topic I.ii.2 begins
chapter  two rather than the outline beginning anew with <ent value="ldquo"></ent>I<ent
value="rdquo"></ent> in each chapter), the hierarchy of DIVs begins at DIV2
with  the first layer of the outline<ent value="rsquo"></ent>s own hierarchy
and then  directly maps onto that hierarchy. Chapters will then be marked
with the <ent value="ldquo"></ent>MILESTONE<ent value="rdquo"></ent> TEI element,
which is simply  an empty marker. This means the chapters won<ent value="rsquo"></ent>t
participate  in any meaningful hierarchy within the SGML data structure<ent
value="mdash"></ent>they  will just be things marked along the way, which
we happen to put information  in. We cannot then say where in the hierarchy
these things belong. Thus when  a text has chapter-spanning outlines, the
chapter sequence will no longer  coincide in any meaningful way with the hierarchical
structure of the SGML.  However, we will artificially link the chapters to
the outlines contained  within them, so that the TOC on the left-hand side
of the screen will go from  Body to Chapters to Outlines. </p><p>In all types
of texts, IDs are assigned  to the chapter-level elements within the Front,
Body and Back; they are also  assigned to all individual lines/sentences within
those chapter-level elements.  For example, the Passage ID b4.45 represents
the forty-fifth line (45) of  the fourth (4) chapter of the body (b).</p><p>Thus,
there are two possible  ways to navigate through the electronic versions of
Tibetan texts that make  use of structural divisions within the texts themselves.
The first uses the  chapter divisions and pagination of the original physical
artifact (that is,  the manuscript). The second makes use of the thematic
outline that organizes  the text<ent value="rsquo"></ent>s contents within
a nested hierarchy of meaning.  Using the former as the structure for electronic
versions of a text allows  one to know the section in which a term occurs
as well as its pagination/lineation,  but not where it lies in the thematic
outline, because one is electronically  preserving the physical layout of
the artifact. However, with critical editions  there is no artifact to preserve;
rather, a new version of the text is being  created. In this case, it will
be possible to organize the text in terms of  the thematic outline. When both
versions of a text are present, searching  will be possible not only by chapter
or page number of the original, but also  by referring to the critical edition
one will be able to search designated  parts of the thematic outline. Another
way of providing access to both structures  electronically has been discussed
by the project<ent value="rsquo"></ent>s editorial  staff. This would entail
initially marking the outline hierarchy with DIV  tags and the chapter/page
divisions with MILESTONE markers. A script would  then be written that could
convert the outline hierarchy DIV tags into MILESTONE  markers and the chapter/page
division MILESTONE markers into DIV tags. Then,  by clicking a single toggle
button the user could see either presentation  of the text. The feasibility
of this method has, however, yet to be determined.</p></div3></div2><div2><head>SGML
 Tagging Issues</head><p>We are first typing texts into our structured templates,
 and then we will mark them with thematic tags (see directly below). In 2001
 and 2002 we plan to hold a series of conferences of Tibetologists to articulate
 a series of classification schemes for thematic tags relevant to Tibetan
texts.</p><div3 n="" type=""><head>Three Categories of Tagging</head><p>In
the TEI community,  standards are emphasized. The opposite of adherence to
standards is the <ent value="ldquo"></ent>tag abuse<ent value="rdquo"></ent>
syndrome<ent value="mdash"></ent>if  you don<ent value="rsquo"></ent>t have
a tag for what you want, you just call it  anything you want and attach a
type attribute to it. Tagging in SGML can be  broken down into three categories:</p><list
n="1" rend="1"><item><hi rend="boldface">Structural  Elements</hi>: elements
which reflect the logical structure of a document<ent value="mdash"></ent>front,
back, heads, chapters, paragraphs, etc. Thus, one is  implicitly talking about
tagging issues when one talks about layout. At some  point, one begins to
identify things that intellectually cohere as units,  and thus the structural
tips over into the intellectual content.</item><item><hi rend="boldface">Nominal/Thematic
Elements</hi>: this mostly signifies name  elements that one generally wants
to search in some particular way<ent value="mdash"></ent>geographical  names,
personal names, etc. We could also tag such attributes as the gender  of speakers
or all death-related terms.</item><item><hi rend="boldface">Referential  Elements</hi>:
in the electronic text this refers to something that is absent  from the display
but can be invoked. This is similar to a footnote or bibliographic  citation
(if the object referred to is online, this becomes a hyperlink).</item></list></div3><div3
n="" type=""><head>Guidelines for Marking Something in Text</head><p>Encoding
 is driven by whether one is going to do anything with the item in question.
 One should only tag items that one is going to do something with<ent value="mdash"></ent>search,
 display, or something else<ent value="mdash"></ent>and no more than that.
Thus,  when considering naming a particular thing in the text, keep the following
 issues in mind:</p><list n="1" rend="1"><item><hi rend="boldface">Search
issues</hi>:  The searchability of the item or text in question is an important
factor in  determining whether it should be distinguished by special markup.
Will it  be searched for? Does it belong to a larger category that will be
searched  for?</item><item><hi rend="boldface">Citation</hi>: Scholars need
a convenient  way to cite particular passages across an electronic text. This
will be achieved  by the electronic pagination mentioned above as well as
through the Milestone  markers recording the page and line numbers of the
physical manuscript.</item><item><hi rend="boldface">Collocation issues</hi>:
The different versions of a single  text not only need to be cross-referenced
but also need to be marked up consistently.</item><item><hi rend="boldface">Thematic
arrangement/intellectual content</hi>: The sections of a text as they are
arranged according to theme or intellectual content<ent value="mdash"></ent>chapters,
subsections, topics, etc.<ent value="mdash"></ent>are marked up so as to replicate
the internal structure of the text. This allows for the automatic generation
of a table of contents, which provides one means of navigating through the
text.</item><item><hi rend="boldface">Navigation issues for users</hi>: Markup
can be inserted to facilitate the user's navigation through the material,
by providing the table of contents as above, marking certain "milestones"
within the text, and linking or cross-referencing relevant portions of the
text either internally to another section or externally to a separate text
or other data file.</item><item><hi rend="boldface">Display  issues</hi>:
In XML, display is primarily the concern of the stylesheet. The XML markup
concerns identifying the structure and content of a text. However, the accuracy
of a stylesheet's display depends heavily on consistency in markup. A stylesheet
can for instance format all chapter titles in 16pt bold italics only if they
are all marked up consistently. This concept must be kept in mind when marking
up a text. If the display of an element's information is intended to differ
from that of the same markup elsewhere in the text, the element's uniqueness
needs to be encoded. This can be done using an attribute such as Type or Rend.</item><item><hi
rend="boldface">Reference</hi>: Markup can be used as a hyperlink to reference
some other work. For instance, it can be used to take the viewer to the identical
passage in another version of the text or to a database entry on a particular
person or place mentioned within a text.</item><item><hi rend="boldface">Descriptive
Identification</hi>: XML elements can be used to identify and classify the
terms used in a text. By naming something, one causes others to identify 
and look for particular types of elements as a way of encouraging accurate
 and full descriptions.</item></list></div3><div3 n="" type=""><head>Thematic
 tagging</head><p>We plan to hold several conferences of Western and Tibetan
 scholars in the next year to create schemes for tagging. Constructing a solid
classification  scheme of the categories of things to tag is an essential
task, and one that  must be done very carefully and in collaboration with
other Tibetologists.  These are scholarly decisions. When we enter the Tibetan
texts in Tibetan,  we will only be encoding them structurally; thematic tagging
will be done  subsequently. Of course to do the tagging at the data entry
level, one must  have advanced knowledge of Tibetan, enabling one to decide
if a given Tibetan  word in context should be tagged in a particular way. </p><p>Thematic
tags  could, for example, mark standard and generic categories of information,
such  as <ent value="ldquo"></ent>dates<ent value="rdquo"></ent>, <ent value="ldquo"></ent>place
 names<ent value="rdquo"></ent>, <ent value="ldquo"></ent>personal names<ent
value="rdquo"></ent>, <ent value="ldquo"></ent>names of Buddhas<ent value="rdquo"></ent>, <ent
value="ldquo"></ent>names  of monasteries<ent value="rdquo"></ent>, and so
forth. Tagging these allows the  user to search for all personal names, or
all names of Buddhas, etc. Another  function of tagging is to eliminate ambiguity,
such as when a word may have  two quite distinct semantic fields. For instance,
in Tibetan the word "<hi rend="italic">rgyal po</hi>" can be part of a personal
name or it can mean <ent value="ldquo"></ent>king<ent value="rdquo"></ent>.
By tagging it differently for each  kind of usage, searching for all occurrences
of it as a personal name will  not yield hundreds of false hits when it occurs
in its meaning as <ent value="ldquo"></ent>king<ent value="rdquo"></ent>.
This enables one to contextually identify important things  which are otherwise
ambiguous, and to do more sophisticated searches than  just searching for
the word-string directly. It also allows one to tag <ent value="ldquo"></ent>types<ent
value="rdquo"></ent> of information which might be called  by different names.</p><p>Marking
text with thematic tags is analogous to  constructing the index for a traditional
bound book. One has to consider what  types of users will use the collections,
and what types of searches of texts  we are trying to facilitate for those
users. In addition, one should try to  use up-to-date terminology, and in
the future do global updates as terminology  evolves. However, updating in
that way can lose points of historical research  for intellectual history,
such as in the change of terminology from <ent value="ldquo"></ent>Mohammedanism<ent
value="rdquo"></ent> to <ent value="ldquo"></ent>Islam<ent value="rdquo"></ent>. </p><p>As
 one proceeds in identifying thematic tag schemes, the process begins to tip
 over into scholarly research. For example, we may want to create the element <ent
value="ldquo"></ent>Death<ent value="rdquo"></ent>, which we use to mark each
reference  to death in a text. Then we can specify what type of death-related
reference  it is: funerary rituals, <hi rend="italics">bardo</hi> (post-death)
theory,  accidental death, etc. Those subdivisions can be further subdivided,
and so  forth. The more granular level is done through attribute specification,
namely  specifying a distinct attribute for the element. Another example is
making  the higher level category <ent value="ldquo"></ent>ritual<ent value="rdquo"></ent>,
 and then having within ritual <ent value="ldquo"></ent>death-related<ent
value="rdquo"></ent>, <ent value="ldquo"></ent>birth-related<ent value="rdquo"></ent>,
and so forth. In this way,  one could articulate a complex nested hierarchy
of tags which at the most  granular level is exceedingly specific. In general,
the finer the level of  specificity in tags, the more pinpointed the searches
that users can perform;  however, the more generic the tags are, the more
items can be retrieved. Also,  the more granular the categories of tags, the
more difficult the decision-making  process in selecting tags becomes. One
must take considerable time and care  in defining the granularity. Thus, a
shallow level of tagging with broader  categories is better than very narrow
categories. In addition, a shallower  scheme allows single things to be tagged
for a variety of different shallow  schemes, rather than a strict hierarchy. </p></div3><div3
n="" type=""><head>Display  issues</head><p><hi rend="boldface">Outlines (<hi
rend="italics">sa bcad)</hi></hi>:  each outlined subsection is prefaced by
an inserted header in boldface underlined  type that gives the name of that
section. These names will be prefaced by  their enumeration within that overall
hierarchy using the repeating sequence <ent value="ldquo"></ent>I.1.i<ent
value="rdquo"></ent> (<ent value="ldquo"></ent>section level  1-1<ent value="rdquo"></ent>, <ent
value="ldquo"></ent>section level 2-1<ent value="rdquo"></ent>, <ent value="ldquo"></ent>section
level 3-1<ent value="rdquo"></ent> and so forth). Since  these can get too
long to display easily, every 8 levels will be represented  by an <ent value="ldquo"></ent>X<ent
value="rdquo"></ent>, with the full string displayable  at a click. </p><p><hi
rend="boldface">Prose</hi>: <ent value="ldquo"></ent>prose<ent value="rdquo"></ent>
refers to non-metrical lines of text, generally within a paragraph.  In TEI
it is marked by the sentence tags <ent value="lt"></ent>S> <ent value="lt"></ent>/S>.
 Each <ent value="ldquo"></ent>paragraph<ent value="rdquo"></ent> of prose
will be a  separate textual unit consisting of a number of sentences delimited
by the  paragraph tags <ent value="lt"></ent>P> <ent value="lt"></ent>/P>.
A paragraph is defined  as a prose section that ends with a terminating particle
(for instance, the <ent value="ldquo"></ent><hi rend="italics">do</hi><ent
value="rdquo"></ent> in <hi rend="italics">bshad  do</hi>) and which appears
to form a thematic unit. Thus, this requires a  scholarly judgement which
is sometimes straightforward, sometimes debatable.  We will be drawing up
guidelines as we encounter difficult cases. In terms  of formatting, prose
lines will wrap within paragraphs, but paragraphs will  be separated by carriage
returns, just as they are in a Western book. </p><p><hi rend="boldface">Verse</hi>: <ent
value="ldquo"></ent>verse<ent value="rdquo"></ent>  is a separate category
from prose. It is composed of metrical lines of text.  In TEI, lines are marked
with the line tags <ent value="lt"></ent>L> <ent value="lt"></ent>/L>,  while
verses are marked with the line-group tags<ent value="lt"></ent>LG> <ent value="lt"></ent>/LG>,
which each contain several lines. In terms of formatting,  verse will have
each <hi rend="italics">shad</hi>-delimited line as a separate  physical line.
In their display, we will use blank lines to separate stanzas,  mark shifts
to a different syllabic verse (e.g., from a six-syllable line  to an eight-syllable
line), and to distinguish couplets, quatrains, terminating  particles and
so forth.</p><p><hi rend="boldface">Lists</hi>: while lists  in Tibetan texts
are seldom numbered, we would like to insert numbering in  some cases.</p><p><hi
rend="boldface">Lineation</hi>: each <hi rend="italics">shad</hi>-delimited
 line will have an ID that begins at <ent value="ldquo"></ent>1<ent value="rdquo"></ent>
 with each new chapter. These numbers will be displayed at the end of the
line  as a smaller font superscript; one can change the VIEW to either display
or  suppress these numbers. </p><p><hi rend="boldface">Citations</hi>: citations
 in general will be block indented, and citations within citations will be
 double-block indented. Otherwise, the components<ent value="mdash"></ent>paragraphs,
 verses, lists, etc.<ent value="mdash"></ent>will be formatted the same way
they  are formatted in the rest of the text. In terms of the formatting of
citations,  we have at least four basic types: (i) verse, (ii) prose, (iii)
cited verse  within a citation, and (iv) cited prose within a citation. </p><p><hi
rend="boldface">Annotations</hi>:  some texts have annotations that are in
smaller type and are connected to  the text by a string of dots. We would
like to make these into hyperlinks  with a distinctive visual marker specific
to annotations in the main text<ent value="mdash"></ent>by clicking on the
marker, the annotation appears in a box  off to the side.</p><p><hi rend="boldface">Pagination/lineation
in original  manuscript</hi>: we will mark the pagination and lineation from
the edition<ent value="rsquo"></ent>s physical manuscript with the TEI MILESTONE
element. These  won<ent value="rsquo"></ent>t allow us to do any searches
in reference to them<ent value="mdash"></ent>for instance, searching for all
occurrences of the Tibetan  word <ent value="ldquo"></ent><hi rend="italics">stong
pa</hi><ent value="rdquo"></ent>  on page 4<ent value="mdash"></ent>but will
enable navigation in reference to them. <ent value="lt"></ent>MILESTONE> is
an empty element that simply indicates a significant  event happens here,
while the tag tells one what the event is. By giving each  milestone a unique
ID number, it can used for location purposes. One can also  use it to indicate
a poem, prose paragraph, and so forth.</p><p>We will make  a style sheet so
these markers can be displayed or suppressed, depending on  the user<ent value="rsquo"></ent>s
needs. When displayed, they will appear as highlighted  small numbers. </p><p><hi
rend="boldface">Root verses and auto-commentary</hi>:  Some texts are structured
as a series of <ent value="ldquo"></ent>root verses<ent value="rdquo"></ent>
(<hi rend="italics">rtsa ba</hi>) followed by an <ent value="ldquo"></ent>auto-commentary<ent
value="rdquo"></ent> (<hi rend="italics">rang <ent value="rsquo"></ent>grel</hi>),
which  repeats the root verses in clumps, followed each time by prose commentary
 and explanatory citations. We want the capability of searching the root verses
 or the auto-commentary, plus the ability to format the root verses so that
 they are distinctive.</p><p><hi rend="boldface">Conversations and debates</hi>:
 We would like to reformat these so they are easier to follow.</p><p><hi rend="bold">Variant
 readings</hi>: There is also the issue of hyperlinks when there are variant
 readings across different editions of a single text. We are not sure how
this  will be handled or where it fits in. However, we would like any word
that  has a variant reading to occur in the text itself in a distinct color,
such  as blue, and then clicking on it should bring you to a list of the variant
 readings, with each indicating their source and location within that source.</p></div3></div2><div2><head>Display
 Issues and Style Sheets</head><p>SGML describes intellectually what the Tibetan
 text is. SGML is not a formatting language<ent value="mdash"></ent>HTML,
however,  is vastly simplified SGML used for formatting purposes. HTML specifies
head  elements, paragraphs, tables, etc.<ent value="mdash"></ent>all of which
are presentation  issues. It doesn<ent value="rsquo"></ent>t tell you anything
about what is in the  document, its structure, its logic, or the logic of
its information. Rather,  HTML is there simply to make the text appear in
a certain way on the screen. </p><p>SGML,  on the other hand, uses style sheets
to determine formatting and display.  At the moment, most style sheets are
proprietary. Using style sheets, one  can identify which elements in the DTD
are to be displayed and how they are  to be formatted. SGML also allows one
to define searches that apply only to  specific elements within the DTD. HTML
and SGML are thus very different tools.</p><p>Both  SGML and XML (a refined
version of SGML) are descriptive. They identify structures, meaning units,
and interrelations within and between texts. They are not concerned with display.
HTML, on the other hand, is solely concerned with display and navigation,
hardly at all with the structure and content of the text. For stylized presentation
in SGML and XML,  one uses a separate style sheet language, such as DSSSL
(Document Style Semantics and Specification Language,  an ISO standard stylesheet
language) for SGML or XSL (eXtensible Stylesheet Language) for XML. However,
browsers and server-side applications that can handle SGML/XML and their stylesheets
are only just beginning to make their appearance, and these lack some important
functionality such as robust searching capabilities. Therefore, at present,
we use Dynatext/Dynaweb (a single  integrated program) to publish the SGML.
Dynaweb uses its own proprietary style language to interpret the SGML and
display it in HTML frames. It is simultaneously  a publishing tool, a navigational
browser, and a search engine. It generates a hypertext HTML front-end that
presents the SGML document with its table of contents on the left and the
content on the right. This resulting HTML frame provides the user with the
ability to explore an SGML document. Similarly, when it searches the document,
it displays the number of hits within each section, allowing the user to see
at a glance the distribution of a particular topic. Dynaweb thus controls
all aspects of display and navigation, defining in its proprietary application
the direction we are heading towards an integrated Internet web page environment
that is based on XML and allows for ease in viewing, navigation, and searching. </p><p>As
mentioned, Dynatext/Dynaweb includes a search engine that queries the SGML
files and returns a summary report of hits. Like a database, it returns only
those records that contain hits, as well as the number of hits within each
section of the table of contents. All hits are clearly marked in a separate
color and font size. The results are returned in standard HTML format. This
functionality allows the user to navigate the SGML files in a standard browser.
Though Dynaweb  provides much of the functionality necessary for searching
the Tibetan Literature Archive,  we are working to provide three categories
of searches:   <list n="1" rend="1"><item><hi rend="boldface">Standard Search
Operations</hi>: These are what is already available in Dynaweb</item><item><hi
rend="boldface">Complex Searches</hi>: More complex searches, such as searching
for a particular translator or fuzzy searches, require minimal programming
within Dynaweb itself</item><item><hi rend="boldface">External Routines for
Searching</hi>: Additional searches that operate outside Dynaweb will be written
in languages such as Perl or Java. These can provide searches integrating
information from other databases, such as variant names, and apply them to
the Dynaweb searches. Perl is particularly appropriate for searching for alternate
spellings.</item></list>Enhancement of the search capabilities provided by
Dynaweb will provide deeper and richer access to the materials contained within
the Tibetan Literature Archive.</p>  <p>One problem we face is that different
centers use different publishing programs  like Open Text (used by the Electronic
Text Center at the University of Virginia), Dynaweb, etc.   Despite the problem
of software maintenance and customer support, these programs present the problem
of differences in code and proprietary style languages. These make it difficult
to interconnect collections displayed by different programs. With the advent
of the suite of eXtensible-languages (XML, XSL, XQL, XPATH, etc.), steps are
being taken to mitigate such problems. When these are fully implemented standards,
different sites can be smoothly interconnected to allow users to navigate,
search, and view in a consistent and coherent format. They will also allow
one site to query another database but format the results according to its
display standards. In short, they will provide a more seamless web-environment,
where the user can focus more exclusively on the content rather than the technological
quirks.</p></div2></div1>

<!--APPLIED TECHNOLOGY -->

<div1><head>IV.
Applied Technology</head><p>The display of catalogs and representations of
Tibetan texts over the internet requires a variety of technological applications,
employed during all the stages of the process of electronic publication. The
following section outlines the most significant technological applications
we presently use to present the Tibetan Literature Archive and others we hope
to use in the future. The topics are:</p><list n="1" rend="A"><item>Networked
management of collaborative projects</item><item>Data Entry and SGML Editing</item><item>File
management</item><item>Displaying SGML over the internet</item><item>Conversion
to Tibetan script</item><item>Scholarly submissions</item><item>Future uses
of technology</item></list><div2><head>Networked Management of a Collaborative
Humanities Project at Multiple Remote Centers</head><p>A key aspect of our
project is the working out of technology and procedures for the networked
management of a humanities project that involves a complex collaboration between
a number of remote centers and various individuals with varying levels of
access to computer technology. This involves development of standards, data
entry procedures, quality-control guidelines, file management processes, and
submission forms.</p><p>Standards are developed within a community to be used
within the defined bounds of that community. In this sense, one can talk about
standards of varying scope. The broadest scope for a standard is the global
community. Standards for this community, or something approximating it, are
set by the ISO, W3C, the Unicode Consortium, and so forth. In general, such
standards are developed through a vetting process of refining standards previously
defined by other smaller groups. In managing a collaborative humanities project
in the field of Tibetan Studies, we have been required to develop a set of
standards for the project as a whole. These standards were developed with
an eye toward broadening the scope of our community to its widest possible
limit. The standards we are developing (discussed in chapter II) are viewed
as proposals to be vetted by the wider community of Tibetan scholars in hopes
of creating a more universal standard for electronic Tibetan humanities projects.</p><p>With
standards for encoding and describing Tibetan material in hand, the next stage
of the process involves the entry of data itself. In order to ensure completeness
of cataloging, the project has developed a standard tabular entry form that
contains all the required fields of a catalog record. This form is electronically
processed to produce a valid SGML file, which is then reviewed by the cataloger
who adds any notes or special features required for that document.</p><p>Once
the SGML catalog file has been created it is then entered into Astoria, our
file-management system, and printed for proofreading. The proofreader then
must check against the original text every entry in every field<ent value="mdash"></ent>all
page/line numbers, Tibetan words, etc.<ent value="mdash"></ent>on the print-out.
The proofreader then enters all corrections, additions, deletions, etc. in
the electronic version of the catalog file, which is stored in the file-management
program. When the entire volume is completed, the proofreader then creates
a bibliographic record and a catalog record for the volume, and these are
also stored in the file-management program. We are in the process of implementing
Astoria<ent value="rsquo"></ent>s web-capability so that collaborators from
remote centers will be able to submit, view and edit records in Astoria over
the internet.</p><p>The final step is to publish the electronic edition of
the catalog from the file-management program. The program we use, Astoria,
concatenates all the bibliographic records for a volume into a single hierarchically-structured
document and exports it. The individual volume catalogs are then concatenated
into a single edition catalog and published on the web through the SGML web-publishing
software, Dynaweb. This publishing process is repeated at regular intervals
to incorporate corrections, additions, and updates. Scholars will also have
the means to submit new information or corrections over the internet that
our editors can then incorporate into the catalogs. This will be discussed
in more detail below.</p><p>In this way, we have developed a stable and detailed
process for managing a collaborative humanities project among geographically-distant
centers so as to maintain order and consistency. Employing a single, centrally-located
file-management program which can be accessed by remote centers over the web
and a consistent entry methodology greatly facilitates communication and problem-solving
between different centers and ensures that agreed-upon standards are consistently
upheld.</p></div2><div2><head>Data Entry and SGML Editing</head><p>The initial
step in the cataloging or text-entry process is entering the data into an
electronic format. Because we are dealing with Tibetan texts, certain unique
guidelines needed to be developed. A single, comprehensive transliteration
scheme was necessary for consistent representation, and a detailed outline
of cataloging categories relevant to the literature also needed to be developed.
These are outlined above in Chapter II: Standards. Using these standards,
the cataloger then enters the bibliographic information individually for each
text. This is initially done using a simple four-column table in a word-processor
document. The table is laid out in the basic format of the bibliographic record.
Based on the identifying label in the first column, the cataloger enters the
relevant information in the second and third columns, with the fourth column
reserved for special notes. This is done for every text in the volume, and
these word-processing files are stored on our FTP site for back-up purposes.</p><p>Once
all the texts in a volume have been cataloged, a second person, the proofreader,
takes the folder of files from the FTP site and runs a Visual Basic routine
on them. This macro takes each file in the folder and converts it into a properly
structured SGML file in which every piece of data the cataloger entered is
placed in the appropriate tag. These SGML files are then entered into Astoria,
our file-management system. They are then printed, and the proofreader thoroughly
checks each entry against the original text, marking on the print-out where
corrections, deletions, or insertions are to be made. When the volume is completely
proofread, the proofreader then enters the changes into the electronic files
stored in Astoria.</p><p>SGML files are simple text files that can be viewed
and manipulated in any text editor. However, with the increasing interest
in SGML and XML, more software specifically designed for editing SGML texts
has become available . These programs format the SGML into an easy-to-read
display that presents tags and text in graphically different ways so they
are immediately identifiable. They also check marked up documents for completeness
and validate them against the DTD. In addition, these programs identify which
tags are available at the point the cursor is placed and can also present
a document map that depicts the document<ent value="rsquo"></ent>s hierarchical
structure and where one is within that structure. The SGML editing program
the project uses is called Epic (developed by ArborText). Once the basic SGML
text file has been created through the Visual Basic conversion routine, the
proofreader opens it in Epic to check for completeness and then, with Epic's
easy-to-use tagging features, enters in additional information such as notes
or unusual fields not covered by the conversion routine. The final version
of the file is then printed and the file is stored in the file-management
program, Astoria. After the print-out of the file has been proofread, the
SGML file is checked out of Astoria, corrections are entered, and the file
is checked back in to Astoria.</p><p>In this way, each text is gone over by
at least two different people (the cataloger and the proofreader). It is a
required policy of the project that the proofreader must be different from
the cataloger. This process is widely acknowledged to significantly improve
the quality of the output, both because a single person is likely to make
the same mistakes when cataloging and proofreading a single text, and because
using two different people increases the breadth of knowledge brought to the
process. Furthermore, at our UVA site we have instituted a policy of weekly
meetings during which the catalogers discuss unique problems that have arisen
over the course of the week. Group decisions are made as to how to handle
these problems and the solutions are recorded for future reference.</p></div2><div2><head>File
Management</head><p>When the bibliographic file has been converted into SGML
it is then imported into our file-management program, Astoria. Astoria is
a server-based software installed on our host server Jefferson at IATH. It
allows authorized individuals to import and export SGML files into hierarchically
organized cabinets containing folders that themselves contain groups of texts.
Once a file is imported into Astoria, the program acts as a kind of SGML library.
Authorized individuals can <ent value="ldquo"></ent>check out<ent value="rdquo"></ent>
a text from licensed work-stations. A text or, in this case, a bibliographic
record, is checked out through a bridge that connects Astoria with the SGML
editor, Epic. Once a user checks out a file, it can be edited only by that
person. Other users can view the most recent version of the file, but only
the user who has checked out a file can make changes to that file. When finished,
the user then checks the document back in to Astoria and the updated version
becomes available to all authorized users.</p><p>Astoria has other capabilities
as well. Being strictly hierarchical, SGML documents have a concretely defined
tree-like structure. By way of its bridge to the SGML editor, Astoria allows
discrete segments of a text<ent value="rsquo"></ent>s structure to be shared
into another tree. Thus, rather than duplicating the same information in multiple
documents, the information can reside in one document and be shared by an
unlimited number of other documents. For instance, a volume<ent value="rsquo"></ent>s
catalog record is made up of a <ent value="ldquo"></ent>node<ent value="rdquo"></ent>
for each text<ent value="rsquo"></ent>s bibliographic record. The header for
this node, which is contained in a TEI DIV3 element, is taken from the text<ent
value="rsquo"></ent>s normalized title, which is embedded in the text<ent
value="rsquo"></ent>s SGML record (it follows the front material). Rather
than having to re-enter the normalized title in the header for the DIV3 element,
Astoria allows one to <ent value="ldquo"></ent>share<ent value="rdquo"></ent>
that title from within the text<ent value="rsquo"></ent>s record. Thus, though
the title is used in two places<ent value="mdash"></ent>as a header for the
whole record and within the record<ent value="mdash"></ent>it actually only
resides in one place: within the record. If it is changed, that is, if the
editors decide on a different normalized title, then the shared version also
is changed. For larger collections, such as the mTshams brag edition of <hi
rend="italics">The Collected Tantras of the Ancients (rNying ma rgyud <ent
value="rsquo"></ent>bum)</hi>, all the records of the individual texts within
a single volume are shared into that volume<ent value="rsquo"></ent>s catalog
record. When the catalog record is published by Astoria, the program inserts
the individual texts<ent value="rsquo"></ent> bibliographic records into the
volume<ent value="rsquo"></ent>s catalog record and publishes it as a single
document. Again, if changes are made to a text<ent value="rsquo"></ent>s record,
these will be incorporated into the volume<ent value="rsquo"></ent>s catalog
record the next time it is published.</p></div2><div2><head>Displaying SGML
over the Internet</head><p>The primary language for displaying information
over the internet is HTML. HTML is a specifically defined language created
from the meta-language of SGML. SGML incorporates a means for defining one<ent
value="rsquo"></ent>s own document type in what is known as a DTD (Document
Type Definition). This flexibility allows one to create tags according to
the information at hand so as to organize and label it in a specific way.
However, such flexibility creates a kind of mark-up tower of babel by allowing
each specialized field to develop its own mark-up language. Because tags or
elements can be defined according to the user's wishes, browsers have no way
of determining how the marked up text should be displayed. A second document,
the stylesheet, needs to be created to tell browsers how to format each element
for display. Such a set-up requires browsers to match data-documents with
their stylesheets. Initially, this was more than the internet could handle,
and HTML<ent value="mdash"></ent>a simpler, more universal language for <hi
rend="italics">presenting</hi> documents over the web<ent value="mdash"></ent>was
developed out of SGML. HTML is itself an instance of SGML, in that it has
a DTD. However, this DTD is static and the primary uses of its tags are formatting
purposes. In essence, it is solely a stylesheet for static tags. This made
its incorporation into the standard browsers easier, and so HTML has become
the mark-up language for the web.</p> <p>Nevertheless, people continued to
use SGML for other purposes because of its greater flexibility, despite the
fact that it could not be deciphered by the standard browser. For this reason,
there has been a continuing demand for ways to present SGML documents over
the web. Individual companies have developed proprietary software to do so;
our present method for displaying our collections employs such proprietary
software. Another tack, however, was to develop a language that was as flexible
as SGML but as portable as HTML; hence, XML (eXtensible Markup Language) and
its formatting language XSL (eXtensible Stylesheet Language) were developed.
These are still in their relatively early stages (the first version) and methods
for handling them have not been fully incorporated into the standard browsers.
So, for the time being, we will continue to use the proprietary software and
SGML, but eventually the collections will be converted to XML.</p><p>The Textual
Collections use a software program called DynaWeb. DynaWeb has its own style-sheet
language which it uses to transform SGML into formatted HTML. It also keeps
track of the SGML document<ent value="rsquo"></ent>s hierarchy, providing
a structure table of contents with which the user can navigate through a particular
document, in this case, a catalog. DynaWeb also has a powerful searching interface
that will isolate and flag all instances of the search string. Once the volume
catalogs have been published through Astoria, they are concatenated into a
single edition catalog file. This file is then made into a <ent value="ldquo"></ent>book<ent
value="rdquo"></ent> by special software so that it can be read by DynaWeb.
It then becomes available to users over the internet.</p></div2> <div2> <head>Searching
the SGML Catalogs</head> <p>XML searche engines are just beginning to become
available and are still in there infancy stage. For this reason, the catalogs
in the collection have, for the time being, remained in SGML. This is because
Dynaweb, the software presently used to display the SGML catalogs, does have
a rather sophisticated search engine that allows one to perform different
kinds of searches. In general, these follow into simple searches on words
or phrases and advanced searches that utilize keywords.</p> <div3> <head>Simple
Searches</head> <p>The simple search feature is accessible directly from the
button-bar at the bottom of the collection browse window. Below the row of
navigational icons, there is a text field followed by a search button. Simple
searches may be entered here. One may search on a single syllable or phrase
or use one of the wildcards listed below. Furthermore, it is possible to enter
advanced searches in this field using, for instance, the boolean and/or/not
keywords. Double quotes indicate an exact phrase; search criteria in parentheses
are evaluated first. </p> <p><hi rend="bold">Wildcards:</hi> Two wildcards
are available for all searches. The question mark "?" represents any single
character; e.g., "sky?b" returns both <hi rend="italics">skyob</hi> and <hi
rend="italics">skyab</hi>. The asterisk (*) represents any set of 0 or more
characters up to the end of the word, e.g., "sky*" would return <hi rend="italics">skye</hi>, <hi
rend="italics">skyon</hi> and the English word, sky, as well as <hi rend="italics">skyob</hi>
and <hi rend="italics">skyab</hi>. (<hi rend="bold">Note:</hi> Due to an apparent
bug in the software, the asterisk does not always seem to function as it should.)</p> <p><hi
rend="bold">Note on the Apostrophe (<hi rend="italics">a chung</hi>):</hi>
When performing searches on a Tibetan phrase containing an <hi rend="italics">a
chung</hi>, represented by an apostrophe ('), although the searching software
recognizes the apostrophe, in some situations the apostrophe is not highlighted
in the search results.</p> <p> <hi rend="bold">Note on Transliteration:</hi>
All Tibetan terms must be entered using THDL's 	<xref n="http://jefferson.village.virginia.edu/tibet/EWTS.html">extended
Wylie transliteration system</xref> and/or English.</p> <div4> <head>Explanation
of Search Results</head> <p>This is a key to understanding the results, when
one performs any search on a collection through the DynaWeb search mechanism.
The resulting screen will still have a left-hand table of contents (TOC),
a right-hand content area and a button bar, identical to the browsing screen.
However, for each level of the TOC -- i.e. edition, volume, or text -- the
search hits will be indicated with bold, red numbers next to that level's
title. Furthermore, the title of the first text with hits will be highlighted
in blue, and its record will appear in the right-hand content area with the
first hit highlighted in large red letters. </p> <p>One can then use the TOC
to navigate to other volumes or texts with hits. By clicking on a item's title,
whether it be an edition, volume or text, its bibliographic record will appear
in the right-hand content area, displaying in bold, red letters the first
hit within that record.</p> <p>Some records contain links to extended discussions
on various topics. Normally, these links appear as underlined blue text. When
performing searches, if there is a hit within such a link, the text of the
link itself will appear in larger blue letters. Within many records, there are also notes, represented
by the blue N icon: <xptr n="note" type="jpg" resp="ndg" targtype="image"></xptr>
 However, unlike the discussions, if a hit occurs
within such a note, the icon <hi>does not</hi> change. The only way to tell
if there is a hit within a note is if the number of hits for an item in the
TOC is greater than the number of highlighted hits in the content area. This
problem will be addressed in future releases.</p> <p>The left-hand area showing
the number of hits for each level of the TOC will continue to be displayed
until the clear search button (binoculars with a strike through) is pressed.</p> <p>To
clear a search, click on: <xptr n="b_clear" type="gif" resp="ndg" targtype="image"></xptr></p> <p>Below
is an image graphically describing the left-hand table of contents, indicating
what each part means:</p> <xptr n="dw_search_left" type="gif" resp="ndg" targtype="image"></xptr> </div4> </div3> <div3> <head>Advanced
Searches</head> <p>The search capability of DynaWeb has made it possible for
the Textual Collections to offer three different advanced search features
to accompany the catalogs. DynaWeb (the software used to display the SGML
catalogs in HTML) has its own search language for specifying contextual as
well as the other searches. However, for the average user to benefit from
this capability they would not only need to know this language but be intimately
familiar with the Tibbibl DTD that defines our SGML/XML tags. Because such
a requirement is unreasonable, advanced search forms have been created to
provide a pre-formatted way to search for specific kinds of information. <xref
n="http://jefferson.village.virginia.edu/cgi-tibet/xsearchForm.pl#cntxt">Click here
to view forms</xref>.</p> <p>The three types of advanced searching that are
presently offered are:</p> <list> <item><hi rend="bold">Contextual Search:</hi>
This option searches for a certain type of information within a specified
field. Thus, for example, one can search for a word or phrase within a text
or chapter title, a person's name, a note, or a number of other categories. </item> <item><hi
rend="bold">Proximity Search:</hi> Searches for two words or phrases that
are within a designated proximity of each other.</item> <item><hi rend="bold">Boolean
Search:</hi> Searches for inclusive or exclusive combinations of keywords,
using standard boolean terms such as "or", "and", "not".</item> </list> <p>All
three types of searches provide one with the option of narrowing the search
to a particular doxographical category. The <hi rend="italics">Collected Tantras
of the Ancients</hi> is generally organized into three large categories: Atiyoga,
Anuyoga, Mahayoga. One can perform any of these searches within a particular
doxographical category or in all of them, through the check boxes underneath
each search form.</p> <div4> <head>Contextual Searches</head> 
<p>A contextual search refers to a search that takes into account the SGML/XML context of the information. Or, more simply, it looks within specific tags for information. Because SGML/XML is descriptive markup (unlike HTML which is solely for formatting), different types of information can be distinguished within a document. Thus, titles can be located if they are marked up with distinctive tag, such as "&lt;title&gt;". The use of tag or element attributes allows for even greater specificity so that one can search for specific kinds of titles by looking for tags such as "&lt;title type='Chapter'&gt;". These are the principles upon which the contextual search option is based.
</p>
<p> Contextual
searches allow the user to specify which category of information in the collection
they would like to search. For example, a user could search for a phrase only
in "Tibetan text titles" or only in "places". The following is the full list
of categories we currently enable users to search within contextually.</p> <list> <item><hi
rend="bold">Tibetan text titles:</hi> searches for a text string within the
text's normalized Tibetan title.</item> <item><hi rend="bold">Sanskrit text
titles:</hi> searches for a text string within the Tibetan transliteration
of the text's Sanskrit title.</item> <item><hi rend="bold">English text titles:</hi>
searches for a text string within the available English text titles.</item> <item><hi
rend="bold">chapter titles:</hi> searches for a text string within any chapter
title of any text.</item> <item><hi rend="bold">people's names:</hi> searches
for a text string within a person's name.</item> <item><hi rend="bold">places:</hi>
searches for a text string within a place name.</item> <item><hi rend="bold">doxographical
category:</hi> searches for a particular doxographical category, such as Atiyoga.</item> <item><hi
rend="bold">notes and discussions:</hi> searches for a text string within
a note or discussion.</item> <item><hi rend="bold">homage deities:</hi> searches
for a text string within the object of homage field.</item> <item><hi rend="bold">pagination:</hi>
searches for a text string within text <hi rend="italics">or</hi> chapter
paginations.</item> <item><hi rend="bold">number of sides:</hi> searches for
texts of a specific length.</item> <item><hi rend="bold">number of body sections:</hi>
searches for texts with a specific number of chapters.</item> <item><hi rend="bold">chapter
designation:</hi> searches for texts with specific chapter labels, such as
le'u or skabs.</item> </list> <p><hi rend="bold">Note:</hi> When searching
on text titles, if the search phrase is contained in the title listed in the
TOC area, this is also counted as a hit but is not highlighted. Therefore,
the number of hits listed in the TOC area may be one greater than the hits
highlighted in the content area to the right.</p> <p>A few sample searches
are offered below for a first time user to try out this facility. For each
search, one can change the "Search within all" option to "Atiyoga", "Anuyoga"
or "Mahayoga" and see the difference in results:</p> <list> <item>Search for <hi
rend="italics">Tibetan text titles</hi> that contain the phrase <hi rend="italics">chos
nyid</hi></item> <item>Search for <hi rend="italics">Sanskrit text titles</hi>
that contain the phrase <hi rend="italics">tsit+ta</hi></item> <item>Search
for <hi rend="italics">English text titles</hi> that contain the phrase <hi
rend="italics">bliss</hi></item> <item>Search for <hi rend="italics">people's
names</hi> that contain the phrase <hi rend="italics">bai ro</hi> ("Bai ro
tsa na" was a famous Tibetan translator of the Old School.</item> <item>Search
for <hi rend="italics">homage deities</hi> that contain the phrase <hi rend="italics">rdo
rje 'chang</hi> (the name of a Buddha.)</item> <item>Search for <hi rend="italics">number
of sides</hi> that contain the phrase <hi rend="italics">57</hi> (this will
return all texts that have 57 folio sides.)</item> <item>Search for <hi rend="italics">number
of body sections</hi> that contain the phrase <hi rend="italics">1??</hi>
(in this search the '?' acts as a wildcard; thus, it will return all texts
that have 100 to 199 chapters.)</item> <item>Search for <hi rend="italics">chapter
designation</hi> that contains the phrase <hi rend="italics">skabs</hi> (this
will return all texts whose 'chapters' are called 'skabs'.)</item> </list> </div4> <div4> <head>Proximity
Search</head> <p>As the name of this search suggests, one can use this form
for searching for words or phrases that are within a certain proximity to
another set of words or phrases. In the Tibetan context, syllables are delimited
by a dot, called a "<hi rend="italics">tsheg</hi>", while phrases are delimited
by a line or "<hi rend="italics">shay</hi>". However, there is no clear demarcation
for words, which can be composed of more than one syllable. In proximity searches,
"word" actually means syllable so that one can search for two syllables or
words within a designated number of syllables each other. This is useful for
Tibetan searches as it is a common convention to abbreviate a word by leaving
out essential syllables. The proximity search can allow one to find all such
variants. Either a single syllable or a word composed of multiple syllables
(separated by spaces) can be entered into any of the search phrase boxes.
There is also an option to choose the order of the words, whether one word
comes before or after the other. Thus, possible searches are:</p> <list> <item>Search
for <hi rend="italics">gsang ba</hi> within <hi rend="italics">1</hi> word
of <hi rend="italics">snying po</hi></item> <item>Search for <hi rend="italics">gsang
ba</hi> within <hi rend="italics">2</hi> words before <hi rend="italics">snying
po</hi></item> <item>Search for <hi rend="italics">gsang</hi> within <hi rend="italics">1</hi>
word after<hi rend="italics">snying</hi></item> <item>Search for <hi rend="italics">ye
shes</hi> within <hi rend="italics">5</hi> words of <hi rend="italics">rang
snang</hi></item> </list> <p>Though in the first three the same terms are
used (<hi rend="italics">gsang ba</hi> or 'secret', and <hi rend="italics">snying
po</hi> or 'matrix'), each yields different results.</p> </div4> <div4> <head>Boolean
Search</head> <p>The boolean search allows one to look for various combinations
of words or phrases. In the advanced search form, one can enter up to three
different search terms connected by any possible combination of boolean terms.
The available boolean terms are:</p> <list> <item>and</item> <item>or</item> <item>not</item> </list> <p>The
basic structure of the boolean search is:</p> <p> <hi rend="italics">phrase
1</hi> and/or [not] <hi rend="italics">phrase 2</hi> [and/or [not] <hi rend="italics">phrase
3</hi>]</p><?Pub Caret1?> <p>One must use either an "and" or an "or" between
phrases. The "not" is optional, as is the entire third phrase. Thus, one can
search for:</p> <list> <item>ye and not shes -- to find "ye nas" and "ye grol",
etc. but not "ye shes".</item> <item>sgrib or mun -- to find records with
either "sgrib" or "mun" (roughly synonymous).</item> <item>kun and rig and
not rgyal -- to find records with "kun ... rig" but not "kun ... rgyal".</item> <item>ye
shes and not rang byung and snang ba -- to find "ye shes snang ba" but not
"rang byung ye shes"</item> <item>'byor and not rnal -- to find occurrences
of "'byor" other than "rnal 'byor".</item> </list> <p>Generally, as shown
above, the "not" keyword is used in conjunction with the "and" keyword. The
software claims to recognize the "or not" combination. However, this logically
makes little sense, and the implementation does not appear to work properly.</p> </div4> </div3> </div2> <div2><head>Conversion
to Tibetan Script</head><p>All catalog entries and textual transcriptions
of Tibetan are initially done in the extended Wylie transliteration system.
Because of the font issues described above, we are at present not displaying
these in Tibetan font. However, the project has begun to develop software
to convert Wylie transliteration into Tibetan fonts. This process is not a
simple one-to-one correspondence, since Tibetan uses stacks of letters which
in computer fonts are generally represented by a single glyph. Thus, the stacked
letters must be identified and the corresponding glyph called. To date, beta
versions of three types of programs have been developed for this kind of conversion:</p><p><list
n="1" rend="1"><item>a Perl script that converts Tibetan embedded in SGML
into Tibetan font</item><item>Jskad (pronounced "jay-gey"), a Java program that displays Wylie transliteration
as Tibetan and allows editing of a simple database file</item><item>Wylie Word 1.0, a Visual
Basic macro in MS Word that converts Wylie transliteration into Tibetan script
as it is typed in</item></list></p><p>The <xref n="http://www.trace.org/">Trace Foundation</xref>
and the <xref n="http://www.tibet.dk/tcc/">Tibetan Computer Company</xref>
have generously collaborated to provide the public-domain with a cross-platform,
web-usable Tibetan font, based on the Tibetan Computer Company<ent value="rsquo"></ent>s
TibetanMachine font. We are in the process of updating our conversion software
and converting our extended Wylie data into this new TibetanMachineWeb font.
This font will be available for download from both our site and the Tibetan
Computer Company<ent value="rsquo"></ent>s site. However, this font is really
a collection, of ten different fonts that use the lower ASCII register to
map the 1,000 or so Tibetan glyphs. Such as situation requires a good deal
of font changing within a document, which is less than efficient. However,
such a problem would be solved by a Unicode Tibetan font, which would be a
single font and thus obviate the need for font changes. When a viable Unicode
font becomes available for Tibetan, we will use that to display our data in
Tibetan script. Such conversions<ent value="mdash"></ent>from Wylie to Tibetan
script and from one Tibetan font to another<ent value="mdash"></ent>are done
through a batch method rather than on the fly in order to increase the speed
of access to the information.</p><p> The latter two programs and the public-domain Tibetan
fonts are now available from the THDL's <xref n="http://jefferson.village.virginia.edu/tibet/tools/">Digital Media Tools for Tibetan</xref> page. </p></div2><div2><head>Scholarly Submissions</head><p>The
project will be able to receive information from the scholarly community in
two ways. Feedback in terms of suggestions, corrections and so forth is received
through our e-mail address at <hi rend="email">samantabhadra@virginia.edu</hi>.
Correspondingly, regular updates are posted on samantabhadra-update, our mailing
list, which can be subscribed to from our web-page. As the project develops,
we will also implement a password-protected submission form for approved scholars.
Through this means, scholars will be able to add data to the collection in
a number of ways: discussions on various topics; essays concerning a text,
person or practice; translations; and catalogs of other collections. </p></div2><div2><head>Future
Uses of Technology</head><p>The above uses of technology directly pertain
to the process of cataloging that the Tibetan Literature Archive projects
are most actively involved in at present. However, along with the cataloging
there are other issues that we are considering pertaining to textual representation,
searching, and interfacing between digital projects. Below are brief descriptions
of the most important topics relating to this issue.</p><div3 n="" type=""><head>IBabble</head><p><xref
n="http://www.iath.virginia.edu/babble/">IBabble</xref> is a synoptic Unicode
SGML-aware browser and publication tool; it is a stand-alone application. <ent
value="ldquo"></ent>Synoptic<ent value="rdquo"></ent> means it runs multiple
texts side by side. The ability to browse and display a Unicode document is
the primary reason IBabble was first developed at IATH by Robert Bingler.
Before it was developed, browsers couldn<ent value="rsquo"></ent>t even look
at Unicode; even now, most won<ent value="rsquo"></ent>t enable the user to
do anything more than display fonts. IBabble allows a user to work directly
with SGML in Unicode, to hide all the tags, to do easy searching of Unicode,
and to use a special virtual keyboard with foreign language glyphs on the
screen, etc.</p><p>We plan to adapt IBabble to work with Tibetan glyphs once
we have developed a Unicode Tibetan font. Its elaborate interface allows the
simultaneous viewing of multiple texts, an important tool when inspecting
multiple editions of a single text for variant readings. It works by tagging
the texts so that they scroll together. IBabble allows blocking of associated
tags, which means that scrolling one window (one text) automatically makes
the other windows (other editions of that text) scroll such that the corresponding
parts of each edition of the text is displayed in each window. IBabble doesn<ent
value="rsquo"></ent>t attempt to interpret the tags, but it is aware of them
and has the ability to link or associate tags together. However, a style sheet
can be used through IBabble to display the tags in formatted ways.</p><p>IBabble
can also be set up as a helper application for a browser.</p></div3><div3
n="" type=""><head>Collation Programs/Processes</head><p>We have been looking
into applications that will facilitate the collation of variant readings of
individual texts from edition to edition in order to aid in the compilation
of critical editions. We are trying to develop two different procedures for
creating electronic editions of individual texts as well as diplomatic editions
and critical editions that note all variants.</p><div4 n="" type=""><head>Procedure
#1</head><list n="1" rend="1"><item>Type in a single edition using specified
conventions.</item><item>Copy that edition to a new file, and then alter it
with a second edition<ent value="rsquo"></ent>s readings to make an electronic
version of the second edition. Follow principle of checking original document
of the first edition if any variants are encountered; thus, this process also
functions to proofread the first electronic edition.</item><item>Repeat the
above to create separate electronic versions for all editions of the text.</item><item>Use
a collation program on these separate documents to generate a diplomatic edition
with all variants, taking the mTshams brag edition as the base reading.</item><item>Convert
this into SGML template for critical editions.</item></list></div4><div4 n=""
type=""><head>Procedure #2</head><list n="1" rend="1"><item>Type in a single
edition using specified conventions. </item><item>Put all variants for other
editions into annotations.</item><item>Use a routine to generate the other
editions out of the annotations.</item></list><p>We have been looking at collation
programs to see how they work in general for these purposes, and how they
work with Tibetan in particular. The two issues that have immediately surfaced
are: (i) programs can only identify variants at the syllable level, since
they have no way of recognizing word- or phrase-boundaries in Tibetan; and
(ii) programs have difficulties dealing with missing/inserted lines in comparing
editions. Neither of these difficulties are insurmountable, since manual rearrangement
of variants can deal with (i), and manual resetting of the program is an easy
way to cope with (ii). </p><p>As a sample case, we took three variant editions
of a single short text, the <hi rend="italics">Khyung chen lding pa</hi>,
from the gTing skyes, Bai ro rgyud <ent value="rsquo"></ent>bum, and from
the 22nd chapter of the <hi rend="italics">Kun byed rgyal po</hi>, and ran
them through two programs designed to compare textual materials.</p><p>The
first program is <xref n="http://www.cta.dmu.ac.uk/projects/collate/">Collate</xref>,
designed by Peter Robinson in England for the specific purpose of creating
critical editions. The advantage of this program is that it can compare multiple
versions of a single text simultaneously. Also, it identifies only those passages
where variant readings occur and displays those in the output. Unfortunately,
as the output from our sample run indicates, it cannot deal with significant
textual anomalies such as the insertion or omission of entire lines of text.
Once a significant variant of this magnitude is introduced, the program is
no longer able to recognize textual congruencies since the line-to-line correspondence
has been lost.</p><p>Another simpler program called <xref n="http://www.english.upenn.edu/~jlynch/Computing/compare.html">Compare</xref>
was used, in some ways to better effect. This program was designed by a graduate
student and represents no more than 200 lines of code. Despite its simplicity,
it is able to effectively compare two versions of a given text. The advantage
of this program is that it produces a very clean and intuitive comparison
of two individual versions of a text, and through the use of <ent value="ldquo"></ent>fuzzy
logic<ent value="rdquo"></ent> it is able to successfully identify omissions
and insertions. Having done that, it proceeds to identify correspondences
between lines, even if they are no longer in exact numerical sequence. The
obvious disadvantage is that the program can compare no more than two versions
of a single text at a time. Generally we are looking at anywhere from two
to six variant versions of a given text, so this may present difficulties.
Also, <ent value="ldquo"></ent>Compare<ent value="rdquo"></ent> does not highlight
only those passages with variations, and thus much of the actual critical
editing work would still need to be done manually.</p></div4></div3><div3
n="" type=""><head>Searching and Sorting</head><div4 n="" type=""><head>Navigation
and searching</head><p><ent value="ldquo"></ent>Navigation<ent value="rdquo"></ent>
refers to how users find their way through a document or web site, while <ent
value="ldquo"></ent>searching<ent value="rdquo"></ent> refers to how users
submit queries to which the program responds. <ent value="ldquo"></ent>Sorting<ent
value="rdquo"></ent> then is how the response to a query gets ordered. </p><p>Any
web browser like Netscape can open HTML documents on your hard disk and run
them locally instead of on the web (using <ent value="ldquo"></ent>open file<ent
value="rdquo"></ent> instead of <ent value="ldquo"></ent>open location<ent
value="rdquo"></ent>). However, this will not enable any searches, or create
any executable materials. For searches, one needs to use separate search software,
or <ent value="ldquo"></ent>search engines<ent value="rdquo"></ent>, of which
there are a variety of proprietary programs (<ent value="ldquo"></ent>PATT<ent
value="rdquo"></ent>, and so forth). Unfortunately, most of these only work
on servers and not on PCs, though it is likely that this will be changing
rapidly so that searches can be done on one<ent value="rsquo"></ent>s PC. </p><p>The
relevant questions in addressing searching are: What kinds of searches will
be done? What kinds of things might people ask? What kinds of searches might
they want to do? For navigation, we need to predict alternative ways users
might want to access the Tibetan Literature Archive, like buttons to skip
a long series of sequential movements. We can divide our searching issues
into three categories:</p><list n="1" rend="1"><item>The structure of our
SGML templates determines the general parameters and scope of searches. Thus
the work done in designing these templates simultaneously delineates search
possibilities. </item><item>The second order of searching involves the design
of advanced additional queries and a mechanism to present search options.
This includes fuzzy searches and so forth.</item><item>The third aspect of
searching concerns the different input modes: English language, Tibetan in
transliteration, and a virtual Tibetan keyboard. This pertains both to how
the user inputs the text and also to what data is queried.</item></list></div4><div4
n="" type=""><head>Overview of searches</head><list n="1" rend="1"><item>Searching
authority files.</item><item>Search window not at level of particular database,
edition or text. </item><item>Search window at the level of a particular text.</item><item>Searching
passages<ent value="rsquo"></ent> correspondences.</item><item>Navigation
tools: how to provide short-cuts.</item><item>Top and bottom toolbars.</item></list></div4><div4
n="" type=""><head>Top <ent value="ldquo"></ent>authority<ent value="rdquo"></ent>
toolbar (accessible at every point in the textual collection)</head><p>There
are six buttons corresponding to the six authority databases. Each one of
these will take a highlighted word and run it through a predetermined search
sequence. At the simplest level, the Tibetan Literature Archive allows word
searches on a primary category of information in that database. For example,
in the geo-referencing database, it would ask <ent value="ldquo"></ent>Place?<ent
value="rdquo"></ent>. The user types in <ent value="ldquo"></ent>Lhasa<ent
value="rdquo"></ent>, and it takes the user to a map, description and pictures
of Lhasa. There would also be more sophisticated search windows for specifying
a complex array of indicators. For example, again in the geo-referencing database,
there would be a variety of fields, and one might specify all places within
100 miles of Lhasa with populations of more than one hundred people. </p></div4><div4
n="" type=""><head>Bottom <ent value="ldquo"></ent>general service<ent value="rdquo"></ent>
toolbar (accessible at every point in the Tibetan Literature Archive)</head><list
n="1" rend="i"><item>Wylie: this takes a highlighted syllable and yields the
Tibetan character it represents; otherwise it requests User to input a Wylie
syllable to get the corresponding Tibetan character back. There is a <ent
value="ldquo"></ent>complete scheme<ent value="rdquo"></ent> button that yields
the full page of the Wylie transliteration scheme along with explanations
of its principles and the function of unusual Tibetan punctuation.</item><item>ID:
this pops up with a menu bar listing all editions of the text. Clicking on
one allows the user to convert the electronic lineation PASSAGE ID into the
page/line reference for any particular edition in terms of its original physical
pagination/lineation, as well as into the standard electronic lineation. <list
n="1" rend="1"><item>Text ID</item><item>Passage ID in the electronic form</item><item>Passage
in original pagination/lineation</item><item>A list of all the corresponding
versions of texts</item></list></item><item>SEARCH: this is the search window
for analyzing entire catalogs. There is an opening <ent value="ldquo"></ent>favorites<ent
value="rdquo"></ent> menu of terms we think people will most frequently want
to search from the catalogs. This will include a simple word or string of
words search capacity. This is the <ent value="ldquo"></ent>basic search form<ent
value="rdquo"></ent>. Then a <ent value="ldquo"></ent>more<ent value="rdquo"></ent>
button leads the user to a version of the SGML catalog template displaying
all elements so they can search on anything. This is an <ent value="ldquo"></ent>advanced
search form<ent value="rdquo"></ent>. <p>The default is to query the master
database, but there will also be a place to specify any combination of editions
or collections to query. One may also specify whether one wants to search
against the transcription, translation, catalogs, and so forth.</p><p>In addition,
there is an expanded set of elements to search particular passages in combination
with criteria specified from the catalog template<ent value="rsquo"></ent>s
elements. With those elements operating as control on what texts are searched
(for example, just the mTshams brag edition, just the Atiyoga category across
all editions, etc.), additional elements are as follows:</p><p><list rend="u"><item>Search
on particular structural divisions of the texts (e.g., search all homage chapter-level
elements, or search just the <ent value="ldquo"></ent>Body<ent value="rdquo"></ent>
sections)</item><item>Search on encoded tagged elements of the text</item><item>Control
on language (Tibetan or English)</item></list></p><p>One of the precautions
to take when developing searching routines is to ensure that a particular
item is only "hit" once. For example, multiple matches for one text due to
the variant titles would be redundant.</p><p>Searches on words will yield
a listed number of occurrences. That will open a box with the ID and six words
before and after the word for each occurrence. Then, one can say <ent value="ldquo"></ent>get
surrounding text<ent value="rdquo"></ent>. The next screen has <ent value="ldquo"></ent>next
occurrence, prior occurrence, full list of occurrences, forward and back one
page<ent value="rdquo"></ent>.</p></item><item>HOME: this goes back to the
entire collection<ent value="rsquo"></ent>s home page.</item><item>INTO COLLECTION:
this would return to the INTO THE COLLECTION opening page.</item></list></div4><div4
n="" type=""><head>Most general search window</head><p>The specification for
the general search window are still under consideration. However, at minimum
the search form to be included here would allow one to search for any combination
of: <list rend="u"><item>Text or chapter title (Tibetan or Sanskrit)</item><item>Tibetan
phrase or keyword</item><item>Subject (doxographical or otherwise)</item><item>Personal
name</item><item>Role (translator, editor, author, etc.)</item><item>Date</item><item>Place</item><item>Genre</item></list></p><p>It
would allow these to be combined with boolean terms, such as AND, OR, and
NOT. Finally, it would also allow a means for fuzzy searches and/or wildcard
searches. </p></div4><div4 n="" type=""><head>General issues</head><list rend="u"><item>All
searches should be Boolean so we can specify AND, OR, etc.</item><item>We
would like to do proximity searches so the user can specify the number of
words within which proximity two specified words are defined.</item><item>Searches
should ignore end of line and end of page types of markers.</item><item>Users
should be able to combine search criteria of different levels, such as searching
for a particular word or phrase within all texts of a particular genre. </item></list></div4><div4
n="" type=""><head>Text analysis tools</head><p>We are interested in <ent
value="ldquo"></ent>fuzzy<ent value="rdquo"></ent> tools that can analyze
two texts of indeterminate relationship and give us useful information about
patterns of intertextuality that obtain.</p></div4><div4 n="" type=""><head>Multi-lingual
and multi-script user modalities</head><p>Whenever a search inquiry is input,
one types using a Wylie system, but Tibetan fonts as well as Wylie should
appear on the screen. The Tibetan is optimal for Tibetans and Tibetologists,
while Wylie is necessary for those who don<ent value="rsquo"></ent>t know
Tibetan. The searches will be done against the Wylie files, which will be
the primary storage form of all data. The Wylie files are also the files that
will be altered as we make corrections, updates, etc. in an ongoing fashion. </p><p>In
addition, we have to keep in mind that searches will be done for English and
phonetic renderings of names used in the English documents. We will thus want
language-limited searches, so users can specify to search only the Tibetan
language entries, or search only the English language entries. Also, the queries
should have options for entering Wylie, Tibetan script, English, or phonetic
renderings. They should also have the option of specifying that the responses
come in Wylie or Tibetan script. When doing a search on Tibetan language items,
we would like the search results to come up sorted according to Tibetan alphabetical
order, not English alphabetical order. </p><p>The four user modalities are:<list
n="1" rend="1"><item>Some (Asianists) want to type Wylie and see Wylie.</item><item>Some
(Tibetologists) want to type Wylie and see Tibetan script.</item><item>Some
(Tibetans) want virtual Tibetan script keyboard, which the program itself
will understand as Wylie.</item><item>Some (the wider public) want phonetics.</item></list></p></div4></div3></div2></div1>

<!-- EDITORS GUIDE -->

<div1 type="chapter"><head>V. Editor<ent value="rsquo"></ent>s Guide</head><p>This
section  is a guide for editors or catalogers of the Tibetan Literature Archive.
It contains  information pertaining to the process of cataloging textual collections
according  to the standards devised by the Tibetan and Himalayan Digital Library<ent
value="rsquo"></ent>s Tibetan Literature Archive projects, primarily the Samantabhadra
 Project and the B<ent value="ouml"></ent>npo Textual Collection. Editors
are encouraged to give feedback  to the project managers to keep the guidelines
complete and up-to-date. This  section contains the following subsections:</p><p><list
n="1" rend="1"><item>Guidelines for entering a translation or transliteration</item><item>How
 to fill out the entry form</item><item>Conversion to SGML</item><item>Using
 Astoria</item><item>Proofreading Guidelines</item><item>FAQs</item><item>SGML
 tag library</item><item>Detailed description of bibliographic records</item><item>Editor<ent
value="rsquo"></ent>s quick reference guide</item><item>Glossary of terms
and proper  names</item><item>Contributor<ent value="rsquo"></ent>s guidelines</item></list></p><div2><head>Guidelines
for Entering a Translation or Transliteration</head><p>One may create a translation
or transliteration of a Tibetan text for submission to the archive within
a standard Word processor by adhering to the following guidelines.</p><p>For
transcription: <list n="1" rend="1"><item>Adhere strictly to the THDL Extended
Wylie scheme for entering Tibetan.</item><item>Mark the beginning of each
physical line in the manuscript with a \ (forward-slash).  This should not
have underscores or spaces before or after it. It will generally be preceded
by a space representing the final <hi>tsheg</hi> of the line but will have
no space after it. </item>  <item>Mark the beginning of each physical page
with the bracketed page number.  In other words, [33] would mean the beginning
of page 33.  This should not have underscores before or after it. It will
often be preceded by a space, representing the <hi>tsheg</hi> at the end of
the page but will not have one after it. In general, the <hi>mgo rgyan</hi>
or ornamental marks at the beginning or end of a page are <hi rend="bold">not</hi>
transliterated.</item>   <item>There are conventions for indicating how the
entered text should be displayed on the computer, i.e. marking such issues
as display verse on separate lines, or displaying text together in paragraphs,
etc; in addition, we have issues of bold face, and so forth.  We have not
resolved this yet, but we hope to create a RTF conversion so that bold, italic
and so forth can be marked in the word processor, and when the final document
is saved in RTF, it can be automatically converted into XML.</item><item>Annotations
on the text should be inserted between braces ({}) wherever in the text one
wants the annotation to be linked.</item></list></p>  <p>For translations,
the procedures will be different depending on whether the translation is being
keyed to a digital master edition of the Tibetan text, or to an edition-specific
edition of the Tibetan text.</p><p>In the former case, <list n="1" rend="1"><item>Mark
the beginning of each shad-delimited line with a \ (forward-slash). This should
not have underscores or spaces before or after it.</item><item>The master
edition will mark every 20 shad-delimited lines with a electronic page. This
should be marked with  the bracketed page number.  In other words, [33] would
mean the beginning of page 33.  This should not have underscores or spaces
before or after it.</item><item>The formatting conventions are identical to
the transcription work, as are annotations (see above).</item><item>Diacritics
should use the THDL diacritic input method and one of the several free diacritic
fonts.  See (hot link to THDL diacritic font tool page we are about to create).
In the latter case, everything is identical, except that lines and pages will
be marked identically to how they were marked in the original transcription
of the Tibetan.</item></list></p></div2>   <div2><head>How  to Fill Out the
Entry Form</head><div3><head>Introducing the Entry Form</head><p>There are
two ways to enter cataloging  information into the system. Initially, editors
were entering information  directly into SGML documents. Standard templates
for bibliographic records  have been created and are stored at the top level
of Astoria within a folder  called templates. These documents contain the
basic structure of a bibliographic  level and there are distinct documents
for edition, volume, and text bibliographic  records. However, each template
must be specifically tailored for the particular  record at hand. Text bibliographic
records will have to add or delete chapter TIBBIBLs to match their specifications.
If other persons, such as revisers,  are listed in the Provenance the SGML
for these will need to be added, and  so forth.</p><p>To minimize the amount
of SGML editing and knowledge necessary  to enter a catalog record, and therefore
to quicken the process, a form  has been devised to allow editors to quickly
and easily enter cataloging  information. This form is subsequently processed
through a MS Word Visual  Basic macro that transforms the simple table into
a complete SGML document.  This has two benefits: increased speed of cataloging
and decreased need for  editors to have in-depth knowledge of SGML. Editors
can still enter information  directly into SGML document, but it is recommended
for larger collections,  such as <hi>The Collected Tantras of the Ancients
(rNying ma rgyud <ent value="rsquo"></ent>bum)</hi>  that editors use the
form-entry method. The standard form contains a discrete  number of fields
that can be added to or subtracted from in order to mirror  the structure
of the text. The basic form looks like this:</p><p><xptr n="entryForm" type="gif"
resp="ndg" targtype="image"></xptr></p><p>This form only represents five 
chapters, which is a relative low number on average. Often, editors will pre-format
 an entry form to represent as many as 100 chapters, since deleting unused
 lines is easier than adding new ones. Also, such information as edition sigla,
 editor<ent value="rsquo"></ent>s initials, date and so forth can be entered
into  the template one uses as these remain the same over the course of several
 documents. Such decisions are, of course, left up to the editor.</p><p>The
 form documents the basic information that every text regularly contains.
Fields  are to be deleted only when they are duplicates that are not necessary
for  a particular text, for instance, lines for recording chapters numbered
beyond  what a text actually contains. In general, if information required
by the  form is not available, this is noted by one of a series of standard
English  phrases such as <ent value="ldquo"></ent>Not specified.<ent value="rdquo"></ent>
or <ent value="ldquo"></ent>No title given.<ent value="rdquo"></ent> This
is to indicate that  it is not an oversight by the cataloger but is in fact
absent in the text  itself.</p><p>The entry form has four columns. The first
column contains the  name for the information contained in that row or in
that section of the form.  Generally, the information is entered in the second
column in the same row  as the description or name. Thus, the normalized title
is entered in the second  column of the same row that says <ent value="ldquo"></ent>normalized
title<ent value="rdquo"></ent> in the first column. Exceptions are with chapters
where the  number of the chapter is indicated in the first column and with
EOC titles  (see below for description of <ent value="ldquo"></ent>EOC<ent
value="rdquo"></ent>),  where the EOC text-title is listed in the first column,
as described below.  The third column contains paginations for certain items,
namely the chapters  and EOC titles. The fourth column is for notes. These
are not directly inserted  into the SGML file by the conversion program but
are extracted and placed  in a separate note file (a text file). Notes are
put in the fourth column  1) to be inserted as note fields in the SGML document
by the person who converts  the file, 2) to alert the person who converts
the file to some anomaly in  the record, or 3) for information that does not
have a designated row in the  table but should be entered in the file, such
as redactors and others in the  provenance other than the translators. </p><p>All
entries are to be made in  extended Wylie transliteration or English descriptions.
For English descriptions,  in general, only the first word is capitalized.
This applies to section names  such as <ent value="ldquo"></ent>Closing section<ent
value="rdquo"></ent> and <ent value="ldquo"></ent>Title  line<ent value="rdquo"></ent>,
as well as indications of lack of information such  as <ent value="ldquo"></ent>No
title given.<ent value="rdquo"></ent> and <ent value="ldquo"></ent>Not  specified.<ent
value="rdquo"></ent>. The latter, however, should also be followed  by a period,
whereas the former will generally be followed by a comma and  a pagination
(no period in any case with section names). <hi rend="bold">All  entries of
transliterated Tibetan should be followed by a <hi>shad</hi>, i.e,  the backslash
(/), unless the entry ends in the letter <ent value="lsquo"></ent><hi>ga</hi><ent
value="rsquo"></ent> or <ent value="lsquo"></ent><hi>ka</hi><ent value="rsquo"></ent>,
as  per Tibetan grammatical rules. If the entry ends in a <ent value="lsquo"></ent><hi>nga</hi><ent
value="rsquo"></ent> the <hi>shad</hi> should be preceded by a <hi>tsheg</hi>,
 which in transliteration is a space, also according to Tibetan rules.</hi></p><p>For an example of a completed entry form see the next section.</p></div3><div3><head>Entry
Form Example</head><p>This
is an actual entry form for the mTshams brag edition, filled in by Steve Weinberger.
It demonstrates  some, but not all, of the special characteristics that can
be recorded in  the form. For instance, the fourth column is reserved for
notes. Notes are  either to alert the person who converts the table into SGML
of some peculiarity  or to be inserted into the SGML as a note. Also, the
Kaneko number is listed  as <ent value="ldquo"></ent>Not found.<ent value="rdquo"></ent><ent
value="mdash"></ent>one  of the special phrases used to indicate a lack of
information. Similarly,  the Indian scholar is <ent value="ldquo"></ent>Not
specified.<ent value="rdquo"></ent>  as is the title line (a generally uncommon
occurrence). Most of the chapters  do not list a text title, but one does.
Also, there is no non-Tibetan title.  Of course, such things vary from text
to text.</p><p><xptr n="entryForm2" type="gif" resp="ndg" targtype="image"></xptr></p></div3><div3><head>Entry Form Explanation</head><p>The following
list details  the process of filling out the entry form. The steps go through
each of the  lines in the entry form, describing what kind of information
goes there and  how they are to be filled in. An editor is not required to
follow the steps  in this particular order, as long as each piece of information
is entered  and all the categories are filled. This list below deals with
the general  form (the first one above); exceptions and peculiarities will
be dealt with  subsequently:</p><p><table><row><cell>Date</cell><cell>Enter
the date the  table-file was created.</cell></row><row><cell>Creator</cell><cell>Enter
your  standardized editing initials, e.g. <ent value="ldquo"></ent>ndg<ent
value="rdquo"></ent>.  Your initials should be submitted to the project manager
for approval so that  no conflicts occur.</cell></row><row><cell>Normalized
title</cell><cell>The  normalized title is entered here. Generally, the normalized
title is taken  from the title line, omitting the <hi>zhes bya ba</hi>. If
the title line  is absent, the normalized title is taken from the closing
section. If that  is absent, then the most common end of chapter text-title
is used. If you  know that another separate title is commonly used for the
text, this may be  used, but you must enter yourself as the source in the
field below.</cell></row><row><cell>Source  of normalized title</cell><cell>Enter
the source for the above entry. If it  comes from the text, enter the section
name, comma, pagination, as in <ent value="ldquo"></ent>Title line, 419.6<ent
value="rdquo"></ent>. The first letter of  the section name only is capitalized.
If you are the source, enter your editing  initials.</cell></row><row><cell>English
normalized</cell><cell>This is for  the English translation of the normalized
title. Unless otherwise instructed,  this field should be left blank.</cell></row><row><cell>Edition
name</cell><cell>Enter  the name, <hi>not the sigla</hi>, for the edition.
Thus, <ent value="ldquo"></ent>mTshams  brag<ent value="rdquo"></ent>. This
is for display, so the root letter of the first  syllable should be capitalized.</cell></row><row><cell>Edition
sigla</cell><cell>Enter  the standardized edition or collection sigla for
the collection that the text  belongs to. Examples are <ent value="ldquo"></ent>Tb<ent
value="rdquo"></ent>, <ent value="ldquo"></ent>Bg<ent value="rdquo"></ent>, <ent
value="ldquo"></ent>Tk1<ent value="rdquo"></ent>  and so forth. Sigla are
standardized and always have the first letter capitalized.  See the section
below with a list of sigla.</cell></row><row><cell>Text number</cell><cell>This
 is the sequential number of the text within its collection. These numbers
 continue sequentially over volumes so that each text has a unique ID number.
 Only collections organized doxographically (the electronic master editions)
 have multipart numbers describing the text<ent value="rsquo"></ent>s position
in  the doxographical hierarchy. In these situations, the numbers would look
like <ent value="ldquo"></ent>3.1.5.8<ent value="rdquo"></ent>.</cell></row><row><cell>Volume
 number</cell><cell>For multi-volume collections, this is the Arabic number
 of the volume which contains the text being recorded, e.g., 10, 5, 37.</cell></row><row><cell>Volume
 letter</cell><cell>This is the transliteration of the Tibetan letter or syllable
 used to identify the volume, e.g., cha, <ent value="rsquo"></ent>a, ki, etc.</cell></row><row><cell>Number
 in volume</cell><cell>This is the sequential number of the text within its
 volume, e.g., 5 for the fifth text in that volume.</cell></row><row><cell>TAI
 number</cell><cell>This is only for <hi>The Collected Tantras of the Ancients</hi>.
 It refers to the number of the listing for a text in the Taiwan catalog 
of the mTshams brag edition. If it is not found in the catalog, enter <ent
value="ldquo"></ent>Not found.<ent value="rdquo"></ent> (Always capitalize
the first  letter and end with a period.)</cell></row><row><cell>Kaneko number</cell><cell>Again,
 only for <hi>The Collected Tantras of the Ancients</hi>. This is the number
 of the listing for the text in Kaneko<ent value="rsquo"></ent>s catalog for
the  gTing skyes edition. <ent value="ldquo"></ent>Not found.<ent value="rdquo"></ent>
should  be entered when appropriate.</cell></row><row><cell>Pagination</cell><cell>This
 lists the range from beginning page and line to ending page and line, separated
 by a dash. Page and line numbers are separated by a period. Thus, <ent value="ldquo"></ent>795.3<ent
value="mdash"></ent>846.6<ent value="rdquo"></ent>.</cell></row><row><cell>Total
sides</cell><cell>Here  one enters the total number of sides the text covers.
In the example above  it would be <ent value="ldquo"></ent>52<ent value="rdquo"></ent>.</cell></row><row><cell>Total
 CLEs</cell><cell>CLE refers to Chapter Level Element. The structure of a
text  is broken into front, body, and back. The body of the text has sections
called  chapters, but the front and back also have constituent elements, such
as title  line, homage, closing section, and translator<ent value="rsquo"></ent>s
colophon,  to name a few. Chapter level element refers to any of these sections
that  falls at the same level as a chapter. Total CLEs refers to the total
number  of these. For instance, if a text had a title line, an homage, 10
chapters,  a closing section, and a translator<ent value="rsquo"></ent>s colophon,
it would  have 14 CLEs and one would enter <ent value="ldquo"></ent>14<ent
value="rdquo"></ent>  here. See the list of chapter-level elements in the
Editor<ent value="rsquo"></ent>s  Quick Reference Guide.</cell></row><row><cell>Front
CLEs</cell><cell>The number  of front chapter-level elements as described
above. If there are no front  sections, a zero should be entered here.</cell></row><row><cell>Body
CLEs</cell><cell>The  number of chapter-level elements in the body. Generally,
this is identical  to the number of chapters. However, this does not have
to be the case if the  chapters are misnumbered.</cell></row><row><cell>Name
of chapters</cell><cell>Here  one lists all the designations given for chapters
in the text. Often, it is <ent value="ldquo"></ent><hi>le<ent value="rsquo"></ent>u</hi><ent
value="rdquo"></ent> but can  also be <ent value="ldquo"></ent><hi>rtog pa</hi><ent
value="rdquo"></ent>, <ent value="ldquo"></ent><hi>skabs</hi><ent value="rdquo"></ent>,
or some other designation. All such designations should be  listed here separated
by a backslash and underscore combination. A note should  be written in the
fourth column to alert the converter that there are multiple  chapter names,
as this requires some special mark-up in the SGML. If chapters  are not given
a generic name such as <ent value="ldquo"></ent><hi>le<ent value="rsquo"></ent>u</hi><ent
value="rdquo"></ent> or <ent value="ldquo"></ent><hi>skabs</hi><ent value="rdquo"></ent>,
 enter <ent value="ldquo"></ent>Not specified.<ent value="rdquo"></ent></cell></row><row><cell>Back
 CLEs</cell><cell>The number of chapter-level elements contained in the back
 section, as described above. If there are no back sections, a zero should
 be entered.</cell></row><row><cell>Doxographical category</cell><cell>This
 is for the doxographical category of the text. For <hi>The Collected Tantras
 of the Ancients</hi>, we are entering only the text<ent value="rsquo"></ent>s
most  general category<ent value="mdash"></ent>Atiyoga, Anuyoga, or Mah<ent
value="amacr"></ent>yoga.  These will be refined in the future. For the standard
in other projects, consult  your project manager.</cell></row><row><cell>Object
of homage</cell><cell>Enter  the full title and name for the deity referred
to in the homage. Include all  epithets and so forth. The basic rule of thumb
is everything up to but not  including the <ent value="ldquo"></ent><hi>la
phyag tshal lo/</hi><ent value="rdquo"></ent>.  If there is no homage, enter <ent
value="ldquo"></ent>Not specified.<ent value="rdquo"></ent></cell></row><row><cell>Homage
 source</cell><cell>Refers to object of homage source. The word <ent value="ldquo"></ent>Homage<ent
value="rdquo"></ent> does not need to be included in ones entry here, since
it  is assumed that the source is the homage CLE. Instead, merely enter the
pagination  for the immediately preceding entry. If there is no homage, leave
this blank.</cell></row><row><cell>Indian  scholar</cell><cell>These next
couple of slots are for the translator<ent value="rsquo"></ent>s of the text.
The conversion program deals only with translators  and so any other persons
involved in the provenance of the text, such as redactors  or editors, should
be listed in the note column. The name of the Indian scholar  is put in the
second column, exactly as given in the text, including that  person<ent value="rsquo"></ent>s
title, if it is given, e.g. <ent value="ldquo"></ent><hi>rgya  gar gyi mkhan
po bi ma la mit+ra/</hi><ent value="rdquo"></ent>. If the scholar  is of some
other nationality, do not change the label for the column, as this  is used
by the converting software, but make a note of this in the fourth  column
so that the converter can change the label in the SGML document from <ent
value="ldquo"></ent>Indian scholar<ent value="rdquo"></ent> to, for example, <ent
value="ldquo"></ent>Nepalese  scholar<ent value="rdquo"></ent> or <ent value="ldquo"></ent>Chinese
scholar<ent value="rdquo"></ent>.  If there is no translator<ent value="rsquo"></ent>s
colophon, enter <ent value="ldquo"></ent>Not  specified.<ent value="rdquo"></ent></cell></row><row><cell>Tibetan
translator</cell><cell>Here  enter the full title and name of the Tibetan
translator, such as <ent value="ldquo"></ent><hi>ba  gor lo tsa ba bai ro
tsa na</hi><ent value="rdquo"></ent>. If there is no translator<ent value="rsquo"></ent>s
colophon, enter <ent value="ldquo"></ent>Not specified.<ent value="rdquo"></ent></cell></row><row><cell>Date
 translated</cell><cell>Enter the Tibetan date as given in the text. If no
 date is given, enter <ent value="ldquo"></ent>Not specified.<ent value="rdquo"></ent></cell></row><row><cell>Place
 translated</cell><cell>Enter the place translated if mentioned. Include the
 full place name. If not given, enter <ent value="ldquo"></ent>Not specified.<ent
value="rdquo"></ent></cell></row><row><cell>Trans source</cell><cell>Enter
the  source for any and all of the above information concerning translation
activity.  The source should reflect the section name and pagination. Thus, <ent
value="ldquo"></ent>Translator<ent value="rsquo"></ent>s colophon, 345.6-345.7<ent
value="rdquo"></ent>. The first word  of the section name only is capitalized.
If all the fields above say <ent value="ldquo"></ent>Not specified.<ent value="rdquo"></ent>,
but the title line does  say <ent value="ldquo"></ent><hi>rgya gar skad du</hi><ent
value="rdquo"></ent>, enter <hi rend="bold">just</hi> the pagination for that
phrase, i.e., the beginning  of the title line, to indicate that the text
does represent a translation.  If even this is lacking and there is no non-Tibetan
title, leave this blank.</cell></row><row><cell>Original  language</cell><cell>Enter
the Tibetan for the original language of the text.  Generally but not always,
this is <ent value="ldquo"></ent><hi>rgya gar skad</hi><ent value="rdquo"></ent>.
Exclude the <ent value="ldquo"></ent><hi>du</hi><ent value="rdquo"></ent>,
 but follow it with a <hi>shad</hi>. If this is the only information given
 concerning translation, the source field above should reflect this section
 of the title line.</cell></row><row><cell>Discussion</cell><cell>This is
for  any extended, scholarly discussions concerning the translation of the
text.  This should be left blank unless one is a scholar qualified to comment
on  this topic.</cell></row><row><cell>Normalized title</cell><cell>This repeats
 the information entered above in exactly the same form.</cell></row><row><cell>Normalized
 source</cell><cell>This also repeats the source for the normalized title
in  exactly the same form.</cell></row><row><cell>Discussion</cell><cell>This
 is for discussion concerning the origin of the normalized title. If the normalized
 title is simply the title line, nothing should be entered here. If for some
 reason, another version of the title is used, the reason for choosing that
 normalized title can be entered here as a discussion. All discussions should
 be followed by the editor<ent value="rsquo"></ent>s standardized initials
in parentheses.</cell></row><row><cell>No.  of front titles</cell><cell>The
next several rows deal with titles for the  text as a whole or <ent value="ldquo"></ent>text-titles<ent
value="rdquo"></ent>, beginning  from the front and progressing to the back.
This particular row records the  number of titles <hi rend="bold">for the
text</hi>, found in front sections.  For instance, if there is a title line
and a title page, one would enter <ent value="lsquo"></ent>2<ent value="rsquo"></ent>.
If there was also a title in the homage,  one would enter <ent value="lsquo"></ent>3<ent
value="rsquo"></ent>. However, whatever  number is entered must match the
number of front text-titles that follow immediately  after it, as described
next. A title line title should always be listed even  if there is no title
line! In such cases, enter <ent value="lsquo"></ent>1<ent value="rsquo"></ent>
here, and in the title line slot enter <ent value="ldquo"></ent>Not  specified.<ent
value="rdquo"></ent></cell></row><row><cell>Title line</cell><cell>There 
are only rows for the title line<ent value="rsquo"></ent>s Tibetan text title
included  in the entry form, because in general that is the only front text-title
given.  However, in cases where there are 2 or 3 front text-titles (that is,
titles  for the text included in front sections), rows should be inserted
and filled  out in the same manner. The first row contains the name of the
section<ent value="mdash"></ent>in this case, <ent value="ldquo"></ent>title
line<ent value="rdquo"></ent><ent value="mdash"></ent>in the first column
and the text-title given in the second  column. The second row says <ent value="ldquo"></ent>pagination<ent
value="rdquo"></ent>  in the first column and lists the pagination itself
in the second. These refer  only to Tibetan text title and not to the non-Tibetan
text title, which is  also given in the title line. This has a separate row
for entry below. If  there is no title line, enter <ent value="ldquo"></ent>Not
specified.<ent value="rdquo"></ent></cell></row><row><cell>Pagination</cell><cell>In
 the second column of this row, enter the pagination for the text-title given
 above. If there is no title line, this can be left blank. The above two rows
 can be repeated <hi rend="italics">mutatis mutandis</hi> for all text-titles
 given in front sections, though only the title line would be listed as <ent
value="ldquo"></ent>Not specified.<ent value="rdquo"></ent> if it is not included
in  the text.</cell></row><row><cell>No. of EOC titles</cell><cell>This refers
 to the number of End Of Chapter titles found in the text. Generally, a chapter
 will end with <ent value="ldquo"></ent>{text-title} las {chapter-title} kyi
le<ent value="rsquo"></ent>u ste {chapter-number}<ent value="rdquo"></ent>.
EOC titles refers  to the <hi rend="bold">distinct or unique</hi> text-titles
given at the ends  of chapters. For each unique title, a list will be created
in the section  below, listing all the chapters that use that text-title,
so that the title  does not need to be entered numerous times. In the example
given above, most  of the chapters did not include a text-title, so they were
entered in the  list under <ent value="ldquo"></ent>No title given.<ent value="rdquo"></ent>
One chapter,  however, chapter 7, did include a text-title, so it is the sole
entry in the  second list. Thus, in the second column of this row, the number
2 was entered.</cell></row><row><cell>{blank  cell}</cell><cell>The next row
is completely blank. This row and the next  five are the model for an EOC
title list. In this row, one enters the EOC  text-title <hi rend="bold">in
the first column</hi>, and the other columns  are left blank, unless for some
reason there is need to include a note in  the fourth column. A list should
be made, as in the example, for chapters  that do not list a text-title. This
list should be under <ent value="ldquo"></ent>No  title given.<ent value="rdquo"></ent></cell></row><row><cell>{blank
cell followed  by number in second column}</cell><cell>The rows under the
EOC text-title  row are for all the chapters that use this text-title at their
end. The number  of the chapter is listed <hi rend="bold">in the second column</hi>
and the  pagination for the EOC text-title is listed <hi rend="bold">in the
third column</hi>.  The numbers 1 through 5 included in the generic entry-form
are just place  holders. Lists of chapters under EOC text-titles often will
not be sequential  but will skip numbers because intermediate chapters will
have different text-titles.  Nonetheless, these lists should be ordered from
lowest chapter number to highest.  An editor will generally have to add and
delete rows here to match the specifics  of the text.</cell></row><row><cell>No.
of back titles</cell><cell>As with  front titles, this refers to the number
of text-titles included in the back  sections. Enter <ent value="lsquo"></ent>0<ent
value="rsquo"></ent> if there are no  text-titles in the back sections and
delete the following two rows.</cell></row><row><cell>Closing  section</cell><cell>This
row is for the text-title found in the closing section.  If there is none,
this row should be deleted. If there is one, the title should  be entered
in the second column of this row and the pagination in the next  row. Titles
found in other back sections, such as translator<ent value="rsquo"></ent>s
 colophon, should follow the same pattern with the section name in the first
 column and the text-title in the second.</cell></row><row><cell>Pagination</cell><cell>The
 pagination for the text-title listed above this line should be entered in
 the second column here. If there is no text-title for any back section, this
 row should be deleted.</cell></row><row><cell>Non-Tibetan title</cell><cell>In
 the second column enter the non-Tibetan title as is most commonly found between
 the <ent value="ldquo"></ent><hi>rgya gar skad du</hi><ent value="rdquo"></ent>
and  the <ent value="ldquo"></ent><hi>bod skad du</hi><ent value="rdquo"></ent>
using the  extended Wylie transliteration system. All such text titles that
are non-Tibetan  should be entered in this method; if needed, repeat this
and the next two  rows. </cell></row><row><cell>Original Language</cell><cell>As
with the same  field above, the Tibetan for the name of the original language
of the text  is put here.</cell></row><row><cell>Pagination</cell><cell>The
pagination  for the Non-Tibetan title is put here.</cell></row><row><cell>{FRONT
SECTIONS)</cell><cell>This  a simple header line. The second column should
be left blank.</cell></row><row><cell>Title  line</cell><cell>The remaining
rows are for describing the chapter-level elements  in the text. With a few
minor differences they all follow the same pattern.  The names for the elements
listed are only the most commonly occurring one<ent value="rsquo"></ent>s;
rows should be deleted or added depending on which elements  appear in a given
text. For instance, if there is no title line, that row  is taken out, as
in the example above. For the Front Sections, the name of  the chapter-level
element is in the first column. All words in these names  should be capitalized.
The second column contains the number of front section,  in ascending, sequential
order. The third column should contain the pagination  for that section. There
should be one row for each Front Section and no more.</cell></row><row><cell>{BODY
 SECTIONS}</cell><cell>This too is a header line. Nothing needs to be entered
 in this row.</cell></row><row><cell>1</cell><cell>Body sections are dealt
 with in the following manner. The first column contains the number of the
 section <hi rend="bold">as listed in the Tibetan</hi>. This means that if
 the chapter is misnumbered, the misnumbering should be reflected in the first
 column. If the first chapter is numbered <ent value="ldquo"></ent>le<ent
value="rsquo"></ent>u  bcu dgu pa<ent value="rdquo"></ent>, then <ent value="ldquo"></ent>19<ent
value="rdquo"></ent>  is entered in the first column of the first row. The
conversion program will  automatically insert the sequential identification
numbers <ent value="ldquo"></ent>b1<ent value="rdquo"></ent>, <ent value="ldquo"></ent>b2<ent
value="rdquo"></ent> and so forth.  The number in the first column should
correspond to whatever the Tibetan says.  If no numbers are given, then the
sequential number should be entered in brackets  to indicate it is an insertion
and a note should be included for the converter,  who will have to change
the header to read <ent value="ldquo"></ent>[Chapter 1]<ent value="rdquo"></ent>
and so forth. In the second column, enter the exact Tibetan  title for the
chapter (not including the text-title), followed by a <hi>shad</hi>.  The
source for the title is included in the same column, after the title,  within
parentheses. In the third column put in the pagination for the chapter.  If
the chapter is not titled, enter <ent value="ldquo"></ent>Untitled.<ent value="rdquo"></ent>
 (first cap, ending in period).</cell></row><row><cell>{BACK SECTIONS}</cell><cell>This
 also is a header line.</cell></row><row><cell>Closing Section</cell><cell>Back
 sections are dealt with in an identical manner to front sections. The name
 of the section goes in the first column (all words capitalized). The number
 of back section it is goes in the middle column and the third column is for
 the section<ent value="rsquo"></ent>s pagination. The sections included in
the  entry form are the most common ones. If any of them do not exist for
a certain  text, that row should be deleted from the table.</cell></row></table></p></div3><div3><head>Standard
English Phrases Used in Cataloging</head><p>The data entered into the cataloging
forms is generally extended Wylie transliteration or numeric data for volume
number, pagination and the like. However, there are several instances where
the use of English is require, such as when we want to explicitly indicate
that no information was supplied by the text and to note anomalies and so
forth within the text. While catalogers have the flexibility to indicate their
precise meaning, the use of standard English phrases is recommended in the
more common situations. Included here is a table of those standard English
phrase and the situation in which they are used.</p><p><head>Points to Note</head><list
n="1" rend="1"><item>Some instances have a period at the end; others do not.
 Please be mindful of this.</item><item>If you create a Physfacet or a Discussion,
always include your initials in parentheses at the end of the content.</item></list><table><row
rend="header"><cell>English Phrase</cell><cell>Occasions When Phrase is Used</cell></row><row><cell>No
title given.</cell><cell>No text title at the end of a chapter (occasionally
these are at the beginning).</cell></row><row><cell>Not found.</cell><cell>No
TAIWAN catalogue number; no KANEKO catalogue number</cell></row><row><cell>Not
specified.</cell><cell>If there is no:<list n="1" rend="1"><item>Object of
Homage</item><item>non-Tibetan title</item><item>title line title</item><item>chapter
designation</item><item>Indian scholar</item><item>Tibetan translator</item><item>date
translated</item><item>place translated</item><item>original language</item></list></cell></row><row><cell>No
colophon given.</cell><cell>If there is no colophon (Used in the abbreviated
entry form).</cell></row></table></p><div4><head>Common Phrasing For Notes
and Discussions</head><table><row rend="header"><cell>Situation</cell><cell>Action</cell><cell>Phrasing</cell></row><row><cell>For
chapter misnumbering,</cell><cell>create both a Physfacet type=Chapter numbering
&amp; a Note for the misnumbered chapter, type=Chapter numbering. Content
of both the Physfacet and the Note reads:</cell><cell>The chapter numbering
skips from X to Z, and all subsequent chapters are likewise misnumbered.</cell></row><row><cell>If
two chapters have the same number,</cell><cell>insert 1) a Physfacet and 2)
a Note after the chapter that is misnumbered:</cell><cell>There are two chapters
numbered xx.  Chapter xy is misnumbered xx, and all subsequent chapters are
likewise misnumbered</cell></row><row><cell>If the chapter numbering of a
text does not start with chapter 1,</cell><cell>insert 1) a Physfacet and
2) a Note after the first chapter:</cell><cell>The chapter numbering begins
with B.</cell></row><row><cell>If this continues the chapter numbering from
the previous text,</cell><cell>the content of both the Physfacet and the Note
is:</cell><cell>The chapter numbering begins with B, continuing that of the
previous text, AB.xy.</cell></row><row><cell>If there is no chapter title:</cell><cell></cell><cell>Untitled.</cell></row><row><cell>If
there is no end of chapter text title:</cell><cell></cell><cell>No title given.</cell></row><row><cell>If
the only back section of a text is rdzogs so//, and if this is set off from
the end of the last chapter,</cell><cell>add a Note type=Closing section,
the content of which reads:</cell><cell>Consists solely of rdzogs so/.</cell></row><row><cell>If
there is an inter-chapter closing section-a closing section within a text
(that is, between chapters rather than after the final chapter),</cell><cell>insert
1) Physfacet 2) a Note for the chapter the closing section follows:</cell><cell><hi
rend="bold">Physfacet type=Distinctive features</hi>: There is a closing section
between chapters X &amp; Y. (If this is appropriate, add this sentence:) This
might indicate that chapters a-y were once an independent text. <lb></lb><hi
rend="bold">Note</hi>: 	A closing section follows chapter X.  It reads: {enter
transliteration of the entire closing section. This is usually just a closing
section title}</cell></row><row><cell>If there are no front sections:</cell><cell>put
"0" in "front cles" field in entry table.</cell><cell>(The conversion program
should then insert) No front sections. (at the appropriate place in the SGML
file.  If it doesn't, you need to do it manually.)</cell></row><row><cell>If
there are no back sections:</cell><cell>put "0" in "back cles" field in entry
table.</cell><cell>(The same is true if there are no back sections, which
then reads:) No back sections.</cell></row></table></div4></div3></div2><div2><head>Conversion
 to SGML</head><p>The conversion from the word-processing entry-form table
 to SGML is handled in batch for large collections such as <hi rend="italics">The
 Collected Tantras of the Ancients</hi>. Such conversions can only be made
 in those cases where the documents have a fairly high degree of conformity
 and therefore can make use of an SGML or XML template. The conversion routine
 works on the algorithm of reading the information from the table, creating
 a separate SGML document based on the specifications outlined in the table<ent
value="rsquo"></ent>s information, and searching for flags embedded in that
SGML  document and replacing them with the actual data. Thus, for the table
there  is a need to maintain the integrity of the labels in the first column
because  these are identical to the flags embedded in the SGML document. Other
collections  are more idiosyncratic and cannot be reduced to a simple template
so that  entry of their information must be done individually for each text.</p><p>Once
 a volume is completed, the entry forms for all its texts are stored in a
single  folder on the FTP site and the proofreader is notified. S/he then
copies the  whole folder containing the text records on to a local machine.
The folder  should contain nothing but text entry-forms and all other documents
such as  FTP logs and so forth should be expunged from that directory. The
MSWord document  that contains the conversion macros is entitled: SAM Convert
Folder. This  is then copied into the folder. It uses a data file called <ent
value="ldquo"></ent>sgmlnew.dat<ent value="rdquo"></ent>. This file must reside
at the root level of the drive one  is using for the conversion routine to
work. The SAM Convert Folder file is  opened and the macro is initiated by
pressing CTRL-ALT-F12. A small text box  will appear asking for the converter<ent
value="rsquo"></ent>s initials. You must  register as an editor to use the
macro. Once the initials are entered, the  program will proceed to open and
convert each file in the folder, place the  SGML files in a subdirectory called
sgmlfile. If there are notes in the fourth  column of an entry form, these
will be extracted and saved in the same subdirectory  under the name <ent
value="ldquo"></ent>{name of text}.notes.txt<ent value="rdquo"></ent>.  If
no such file is created, there were no notes for that text.</p><p>The proofreader
 then must open each SGML file in an SGML editor that can parse and check
for  completeness. The conversion program is in a beta form and so mistakes
do  occur. The most common one, which is being addressed now, is to have random
 text from the clipboard inserted at the beginning of the SGML document. In
 this case, the document will only open as a text file. Once the garbage is
 deleted from the beginning of the file and its is saved, it can be reopened
 as an SGML document. Other errors may occur but these are generally pointed 
out by the parser/editor. Provided that the SGML document opens as a valid
 one, the proofreader then must go through the document and adjust it according
 to the notes. Generally, the notes document will contain information to be
 inserted in the SGML document. These are generally entered within NOTE tags.
 All note tags should be given an ID attribute that is the name of the text
 (without the .sgm extension) plus <ent value="ldquo"></ent>n<ent value="rdquo"></ent>
 and the sequential number of the note. Thus, the 3rd note in the bibliographic
 record for Tb.415 would have and ID of <ent value="ldquo"></ent>Tb.415.bib.n3<ent
value="rdquo"></ent>. The TYPE attribute for the note should contain the topic
 of the note. This is used as a header. The convention is that only the first
 word of the TYPE attribute is capitalized. An instance of a TYPE attribute
 would be <ent value="ldquo"></ent>Closing section contents<ent value="rdquo"></ent>.
 Do not include the word <ent value="ldquo"></ent>Note<ent value="rdquo"></ent>
in the  TYPE attribute as this is added automatically by style sheets. The
RESP attribute  of the tag is set to the note-writer<ent value="rsquo"></ent>s
standard initials.</p><p>A  good SGML editing program will inform the user
where any tag can be used and  will not allow one to insert tags where they
are not valid. The proofreader  will need to place the NOTE tag so that it
falls at the end of the display  line, since the NOTE will be converted to
the blue <ent value="ldquo"></ent>N<ent value="rdquo"></ent> note icon. This
means that a proofreader must look at the  display of the catalog on the web
to get a feel for where notes belong.  In some instances, the NOTE tag is
not allowed to be inserted but an ADD tag  can be inserted. The NOTE tag is
then inserted within the ADD tag. This is  used only when necessary. </p><p>Other
information a proofreader/converter  might need to add falls in the physical
description. Here, a different element  is used: the PHYSFACET. In this case,
the PHYSFACET<ent value="rsquo"></ent>s TYPE  attribute is set to <ent value="ldquo"></ent>Distinctive
features<ent value="rdquo"></ent>  and no ID is given. The content of the
tag is the substance of the note. Examples  are <ent value="ldquo"></ent>Chapter
15 is misnumbered as chapter 14. All subsequent  chapters are similarly misnumbered.
(ndg)<ent value="rdquo"></ent> PHYSFACET tags  do not have a RESP attribute
for the editor<ent value="rsquo"></ent>s initials  so these must be included
in the body of the <ent value="rsquo"></ent>note<ent value="rsquo"></ent>.
Finally, there are DISCUSSION tags which can be used in most  places. These
are for longer, scholarly discussions on any issue pertaining  to that entry.
They are to be handled in the same way as PHYSFACET tags. There  are two ways
to use DISCUSSION tags. If the REND attribute is set to <ent value="ldquo"></ent>info<ent
value="rdquo"></ent>, in the HTML document displayed on-line  the TYPE attribute
is converted to a label and the content is listed in-line.  If no REND attribute
is given, the TYPE is used as a title for a link included  in the HTML document.
Clicking on that link produces a separate window with  the discussion in it.
This latter use is not heavily relied on in the catalog  at present but is
generally reserved for higher-level scholarly discussion  that will be added
in the future.</p><p>Finally, the finished SGML documents  are stored in the
file management program, Astoria. From there, they are printed  out and proofed.
The SGML documents are then corrected by the proofreader,  who is a different
person from the editor, and finally a volume bibliographic  record and catalog
record are created. With this, the cataloging process  is complete and the
volume is ready for publication. These last steps will  be described in the
following sections.</p></div2><div2><head>Using  Astoria: The File-Management
Program</head><div3><head>Introduction</head><p>The  catalogs of the Samantabhadra
Collection are composed of numerous and various  SGML documents. Not only
does each text in The Collected Tantras of the Ancients  have an SGML document
as its separate bibliographic record, but the volumes  and editions themselves
also have bibliographic records. On top of this, there  are cataloging records
that serve to concatenate the information into a single  SGML file, publishable
through Dynaweb. Given that there are in the neighborhood  of 1,000 texts
in The Collected Tantras of the Ancients, the task of managing  and maintaining
definitive versions of these files while multiple editors  work on them is
daunting. Fortunately, the Astoria SGML file-management software  assists
us in this task.</p></div3><div3><head>Files Involved</head><p>As  Astoria
is used to keep track of the plethora of SGML files we continue to  create,
it would serve us well to review the different files involved in an  edition
catalog. Only the name and a brief description of each kind of file  will
be given here for reference in the following discussion. A full list  of files
and more detailed descriptions can be found in the section on SGML  files.</p><p><hi
rend="center"><table rend="border=1"><row><cell role="head">File  Name</cell><cell
role="head">Description</cell></row><row><cell>Tb.###</cell><cell>The  folder
containing files on text ###.</cell></row><row><cell>Tb.###.bib.sgm</cell><cell>The
 bibliographic record for a single text.</cell></row><row><cell>Tb.v##</cell><cell>The
 folder containing files for volume ##.</cell></row><row><cell> Tb.v##.bib.sgm</cell><cell>The
 bibliographic record for a volume.</cell></row><row><cell> Tb.v##.cat.sgm</cell><cell>The
 volume catalog file concatenates its texts<ent value="rsquo"></ent> bib.
records.</cell></row><row><cell>  Tb.bib.sgm</cell><cell>The bibliographic
record for the mTshams brag edition.</cell></row><row><cell>  Tb.cat.sgm</cell><cell>The
catalog file that concatenates all the volume  catalogs.</cell></row><row><cell>Tb.ed.sgm</cell><cell>The
final published  catalog of an edition.</cell></row></table></hi></p><p>For
illustration  purposes, the following discussion will cite particular texts
and volumes,  such as Tb.412.bib.sgm and Tb.v07.cat.sgm, rather than the formulas
above.  This will hopefully provide concrete illustrations of general procedures.</p></div3><div3><head>File
 Management in Astoria</head><p>Being a type of electronic library, Astoria
 utilizes a system for organizing its files, in the same way that a physical
 library needs a system to organize its books. Astoria does so in a hierarchical
 way, that is, folders nested within folders any of which can contain SGML
 documents. It uses the genealogical metaphor to describe the various levels.
 The higher level is the parent or ancestor of the lower level, which is called
 the child (or, less-frequently, descendent). The highest level or root directory 
of any collection is a filing <ent value="ldquo"></ent>cabinet<ent value="rdquo"></ent>.
 In our case, the filing cabinet for storing the data is called <ent value="ldquo"></ent>Sam<ent
value="rdquo"></ent>. Within Sam, at present, there are two children/folders:
one  for templates and the other for the mTshams brag edition. As other editions
 are cataloged, they will have their own folder and also become the children
 of Sam. Within the mTshams brag folder there are 49 children. These are the
 Tb.ed.sgm, Tb.cat.sgm, Tb.bib.sgm, and 46 folders for the 46 volumes of the
 edition. Within a volume folder, say Tb.v04, there would be its catalog 
record (Tb.v04.cat), bibliographic information (Tb.v04.bib) and a series of 
folders, one for each text found in volume 4. These text folders at present
 contain only a single document--the bibliographic record for that text--but
 in the future they will contain representational files and scholarship files
 for that text. The structure of the collection can be pictured thus:</p><p><xptr
n="astChart" type="gif" resp="ndg" targtype="image"></xptr></p></div3><div3><head>Entering
 SGML Documents into Astoria</head><p>The first thing an editor should do
once  the table of bibliographic information on a text has been converted
into an  SGML document is to enter it into the Astoria database. This can
be done directly  from Epic. However, the folder in which it is to be placed
needs to be set  up already. To do this, one goes into Astoria itself, navigates
to the place  where you want the folder, chooses <ent value="ldquo"></ent>New
Folder<ent value="rdquo"></ent>  from the file menu, and creates a folder
with the appropriate name. Volume  folders should be named: Tb.v## with no
extension (such as .sgm). Text folders  are named: Tb.### using the sequential
text number and again no extension.  The create folder window gives one the
option to place the folder <ent value="ldquo"></ent>before<ent value="rdquo"></ent>
or <ent value="ldquo"></ent>after<ent value="rdquo"></ent>, or as <ent value="ldquo"></ent>first
child<ent value="rdquo"></ent> or <ent value="ldquo"></ent>last  child.<ent
value="rdquo"></ent> <ent value="ldquo"></ent>Before<ent value="rdquo"></ent>
 will place the new folder before the object highlighted when one chose new
 folder. <ent value="ldquo"></ent>After<ent value="rdquo"></ent> will place
the new folder  after the highlighted object. <ent value="ldquo"></ent>First
child<ent value="rdquo"></ent>  places the folder as the first item within
the highlighted folder. <ent value="ldquo"></ent>Last  child<ent value="rdquo"></ent>
places it at the end of all other contents of the  highlighted folder. Once
the folder is created, an SGML file can then be entered  into it. This is
done through Epic as follows. The second to last menu on  the menu bar of
Epic (i.e., the one to the left of the help menu) is called  Astoria. The
third option under the Astoria menu is <ent value="ldquo"></ent>Save  As Document
Object<ent value="rdquo"></ent>. When this option is chosen, if one  has not
already opened the Astoria database, one will be asked for their handle  and
password. Once in, a window pops up. One needs then to enter the name  Astoria
is to give the file, such as Tb.141.bib. Since all files in Astoria  are SGML
files, the extension .sgm has been left off of files saved there,  but when
exported out of Astoria, the extension .sgm should be applied. Next,  one
needs to choose a destination for the file, that is, where the file is  to
be saved in Astoria. The bottom input field of the window is for this.  Click
on the browse button. There will be two sub-windows in this <ent value="ldquo"></ent>Browse<ent
value="rdquo"></ent> window. The one to the right displays the hierarchy of
folders  and files. The one to the left shows the possible destinations available
within  the folder highlighted on the right. Navigate down the hierarchy until
one  sees the desired destination folder appear in the left-hand window. Choose
 this and its name will appear in the bottom input field of the original window.
 The middle input field of that window is for any kind of notes, descriptions,
 and so forth concerning the file. This information is displayed in Astoria 
but not used in display or publishing. Make sure the <ent value="ldquo"></ent>Revoke
 Save on Errors<ent value="rdquo"></ent> option is checked so that Astoria
will  validate the document before saving it and alert one if there are any
errors.  Finally, click save. After some wait, the document will reappear
with a split  screen, showing the documents structure and the SGML tags. One
needs to save  it again and <ent value="ldquo"></ent>check in<ent value="rdquo"></ent>
the document  as described below.</p></div3><div3><head>Checking Documents
In and Out of  Astoria</head><p>When a document has been entered into Astoria,
it may be  checked out by those authorized with a valid password. Assuming
one is in  Astoria, one can navigate down the graphic hierarchy by double
clicking on  cabinets, folders, or files that one wishes to open. When the
folder where  the desired document is reached, the document<ent value="rsquo"></ent>s
name will  appear with a graphic of a piece of paper to its left. One needs
to double  click on this, which will open up access to the SGML document itself.
There  will be a yellow or red arrow pointing to the name of the document
with the  .sgm extension. This name comes from the SYSID field in the SGML
template.  For this reason, one must always use the .sgm extension in the
SYSID field.  Next, highlight this arrow and file name by clicking on it once
and then right  click it. A pop up menu appears with two or three options.
Choosing <ent value="ldquo"></ent>Edit<ent value="rdquo"></ent> will check
the document out of Astoria and open it up in Epic.  One can then make changes
to the document and have sole control over it. All  other clients can access
the document for viewing but will not be allowed  to make changes, because
the document is checked out to you. When finished  with editing the document,
choose save from the file menu or press <ent value="ldquo"></ent>Ctrl  S<ent
value="rdquo"></ent>. A save window will pop-up. If one is done editing, 
click on <ent value="ldquo"></ent>check in<ent value="rdquo"></ent> and then <ent
value="ldquo"></ent>OK<ent value="rdquo"></ent>. This will check the document
back into Astoria and any authorized  client will then be able to check it
out. If one wants to save changes into  Astoria but still continue to edit
the document, click on <ent value="rdquo"></ent>check  out again<ent value="ldquo"></ent>.
This option becomes available only when one  has first chosen <ent value="ldquo"></ent>check
in<ent value="rdquo"></ent>. Doing this  will save one<ent value="rsquo"></ent>s
changes into the Astoria database. If one  merely saves the document without
choosing <ent value="ldquo"></ent>check in<ent value="rdquo"></ent>, this
will save one<ent value="rsquo"></ent>s changes to the temporary  file stored
on the computer being working on but will not save them to Astoria  until
the <ent value="ldquo"></ent>check in<ent value="rdquo"></ent> option is chosen.
 Note: In all situations the <ent value="ldquo"></ent>Revoke Save on Errors<ent
value="rdquo"></ent> option should be chosen so that Astoria validates the
in-coming  SGML and will not allow any context errors to be saved.</p></div3><div3><head>Changing
 File Names and Moving Files</head><p>Another feature of Astoria is that file
 names can be changed without checking files out of Astoria and files can
be  moved within the hierarchy without having to export and re-import them.
To  rename or change the description of a file, highlight it and right click.
 From the pop-up menu, chose <ent value="ldquo"></ent>properties<ent value="rdquo"></ent>.
 This will present a window similar to the <ent value="ldquo"></ent>Save As
Document  Object<ent value="rdquo"></ent> widow, with fields containing the
file name and  description. These can then be edited and the changes will
be recorded upon  clicking <ent value="ldquo"></ent>OK<ent value="rdquo"></ent>.
These changes will not  affect the SGML output but merely the labels used
in Astoria itself. To move  a file around Astoria, the Object menu is used.
The process is similar to  cutting and pasting with a slight terminological
difference. First, highlight  the object one wishes to move. Then choose <ent
value="ldquo"></ent>move<ent value="rdquo"></ent>  under the Object menu.
The object will be grayed out. Then, move to the place  where you want to
put the object and under the same menu, choose <ent value="ldquo"></ent>paste<ent
value="rdquo"></ent>. Astoria will give you the options of <ent value="ldquo"></ent>before<ent
value="rdquo"></ent>, <ent value="ldquo"></ent>after<ent value="rdquo"></ent>,
and so forth.  Choose one and click <ent value="ldquo"></ent>OK<ent value="rdquo"></ent>.
The file  will then be placed in the desired location.</p></div3><div3><head>Sharing
 SGML Chunks through Astoria</head><p>One of the most versatile features of
 Astoria is the ability to include pieces of an SGML document within another
 document. Such sharing cannot violate the rules of the DTD, so the destination
 of the shared <ent value="ldquo"></ent>chunk<ent value="rdquo"></ent> must
be a place  where it is <ent value="ldquo"></ent>in context<ent value="rdquo"></ent>
or permissible.  Otherwise, Astoria will reject the attempt to share the SGML.
One does not  have to share a whole document but can share any autonomous
section of any  particular document. These autonomous sections are known as <ent
value="ldquo"></ent>Objects<ent value="rdquo"></ent>, so Astoria calls this
sharing of SGML <ent value="ldquo"></ent>Reusing  an Object<ent value="rdquo"></ent>.
The catalog files for the Collection are  built through reusing objects in
this way. The SGML hierarchy for an edition  catalog file, which is mirrored
in Astoria<ent value="rsquo"></ent>s file hierarchy,  is displayed below.
The overarching tag set is a modified TEI called TIB;  the particular tag
set for Tibetan bibliographic information is TIBBIBL. DIV1  through DIV3 are
hierarchical division markers:</p><p><list rend="u"><item>Tb.cat.sgm  (TIB
document) includes ...<list rend="u"><item>Tb.bib.sgm (TIBBIBL)*</item><item>DIV1
 Type=Atiyoga<list rend="u"><item>DIV2 ID=Tb.v01 Type=Volume N=1*<list rend="u"><item>Tb.v01.bib.sgm
 (TIBBIBL)*</item><item>DIV3 ID=Tb.1.bib Type=Text N=1<list rend="u"><item>Tb.1.bib.sgm
 (TIBIBL)*</item></list></item><item>DIV3ID=Tb.2.bib Type=Text N=2 <list rend="u"><item>Tb.2.bib.sgm
 (TIBIBL)* and so forth.</item></list></item></list></item></list></item></list></item></list></p><p>The
 asterisks (*) show which files are being shared or reused through Astoria.
 Notice that there is sharing within sharing. In other words, text files are
 first shared into volume catalogs and then the volume catalogs (DIV2 above)
 are shared into the edition catalog. The actual unshared SGML for the edition
 catalog is quite insubstantial. It merely consists of the Control information,
 a Front Section and a Body Section, which includes three DIV1s for Atiyoga,
 Anuyoga, and Mah<ent value="amacr"></ent>yoga, each of which begins with
a brief  description of that doxographical category. (At present we have no
use for  the Back Section, but this could be added if needed.) All other information
 in the edition catalog file is reused from other files. Below is an image
 of the Epic window displaying the mTshams brag edition catalog file. (The
 plus-indicator means portions of the SGML are not being shown. This is because
 they are irrelevant here.)</p><p><xptr n="astoria1" type="gif" resp="ndg"
targtype="image"></xptr></p><p>In the front section the catalog file reuses
the  whole Tb.bib.sgm file. Then within each of the DIV1s of the body it reuses
 the DIV2s extracted from the volume catalogs of those volumes in that particular
 category. The Atiyoga DIV1 includes DIV2s reused from the catalog files 
of volumes 1 through 14, and so forth. The volume catalog files have a complete
 structure-a BODY, a DIV1, a DIV2 containing DIV3s for each text-but only
the  DIV2 and its contents (the DIV3s) are reused in the edition catalog.
This  is so that the volume catalog can stand on its own, yet when it is reused
 in the edition catalog the SGML will still be in context. The volume catalog
 in its own turn reuses the full TIBBIBL of each text. Its single DIV2 contains
 a DIV3 for each text. The ID attribute for each DIV3 is the name of the record,
 such as Tb.23.bib (without extension); its N attribute is set to the sequential
 text number, here it would be 23; and its TYPE attribute is set to <ent value="ldquo"></ent>text<ent
value="rdquo"></ent>. Each text<ent value="rsquo"></ent>s DIV3 contains a
HEAD element,  which contains the following:</p><q>Tb.##: Normalized Title</q><p>Here,
even  the normalized title is reused from the TIBBIBL for that text. Following
the  header the complete TIBBIBL is reused or inserted. The header is used
for  the TOC window in Dynaweb.</p></div3><div3><head>Procedure for Reusing
an  Object</head><p>Reusing an object is actually done through the SGML editor,
 providing it has a bridge to Astoria. In our case, the editor is Epic. The
 document in which the reusing will occur must be one checked out of Astoria,
 however. First, one places the cursor where one wants the reused SGML to
be  placed. Then, choose the eighth option under the Astoria menu in Epic.
This  option is called <ent value="ldquo"></ent>Reuse Object<ent value="rdquo"></ent>.
A window  will appear with the Astoria hierarchy on the right and a blank
window on  the left. Navigate through the hierarchy to the document to be
reused. When  this document is opened (double-clicked), a red arrow will appear
below it  in the hierarchy as well as on the left in the <ent value="ldquo"></ent>Object<ent
value="rdquo"></ent> window. Only items that appear in the left <ent value="ldquo"></ent>Object<ent
value="rdquo"></ent> window can be shared. The first object to appear will
be a  complete TIBBIBL. However, one can navigate further into the guts of
that  TIBBIBL and take out just a piece, such as the Normalized title mentioned
 above. Double click on the arrow in the right window and all that element<ent
value="rsquo"></ent>s immediate children will appear. One can reuse one of
the  children or click on it and see its children. In this way, one can navigate
 to a single element and share just that. Whatever is to be shared must be
 highlighted in the left-hand window before one presses <ent value="ldquo"></ent>OK<ent
value="rdquo"></ent>. Astoria/Epic will not allow one to place SGML out-of-context
 and will reject such a request. To illustrate this with an example, when
creating  the edition catalog, I place my cursor in the DIV1 type=Atiyoga
and then  do Reuse Object. I navigate through Astoria with the pop-up window:
first  opening the edition folder, then opening the Volume 1 folder, and then
opening  the Tb.v1.cat file. This is the catalog file for volume 1. I see
two options  within it: CONTROLINFO and TEXT. Under TEXT there is a DIV1 type=Atiyoga. 
As this is already in the edition catalog, I do not want to share it but 
open it and go down a level further. There I find a DIV2 Type=Volume, N=1.
 When I open the parent element, the DIV1 in the right-hand window, The DIV2
 appears both below it in the right window and also in the left window. I
then  highlight the one in the left window and click OK. The SGML of the DIV2
is  then inserted as a file entity into the edition catalog. Note that the
DIV2  itself reuses the TIBBIBLs for each text in volume 1, so that these
text records  are first shared into the volume catalog and then from there
into the edition  catalog. This allows a user to navigate in the same hierarchical
fashion  when browsing the collection through Dynaweb.</p></div3><div3><head>Making
 an Edition through Astoria</head><p>This kind of SGML sharing not only saves
 one from extensive cutting and pasting but also assures that any changes
made  in one document will be reflected in the places where it is shared.
Thus,  for each file there is one definitive version to which changes can
be made,  and these changes will be reflected in the documents that share
said file.  However, the final product we want here is a single SGML file
that concatenates  all the information in the other files. How is this done?
In Astoria it is  done through making an edition. Making an edition in Astoria
is like taking  a snap shot of the whole SGML file including all shared data
or reused objects.  In this case, we will make an edition of the Tb.cat.sgm
document, the catalog  file for the mTshams brag edition. In doing so, Astoria
will place all reused  SGML in the appropriate place and create a static,
unchangeable version of  the complete file so that it can be published on
the internet. Any changes  subsequently made to any of the files will not
be reflected in the edition  until the edition is remade and reposted. One
makes an edition by first highlighting  the edition catalog file or Tb.cat.
Then, under the Object menu, the last  option is called <ent value="ldquo"></ent>Make
Edition<ent value="rdquo"></ent>. Choose  this, give the edition a name (Tb.ed
in this case) and a description (date  or some distinguishing piece of information
about this edition), choose where  to place it (generally before the catalog
file), and click <ent value="ldquo"></ent>OK<ent value="rdquo"></ent>. After
some time, a book icon will appear above the catalog  file. This is the <ent
value="ldquo"></ent>published<ent value="rdquo"></ent> edition.  It cannot
be changed or modified in Astoria but is a snap shot or a version  of the
edition frozen in time. One can then export this edition and post it  to Dynaweb,
making the information available on the web.</p></div3><div3><head>Exporting
 Documents from Astoria</head><p>If, for whatever reason, one would like to
 have a copy of any particular file in Astoria, the program does allow one
 to export information to a disk. The resulting SGML document will have no
 ties to Astoria and can be edited and used as desired. Changes made to it
 will not be reflected in Astoria, nor will changes made to the Astoria file
 be reflected in it. In general, it should only be used to get material for
 prototyping other stages of the project or for publishing an edition to Dynaweb,
 as mentioned above. To export a file from Astoria, highlight that file and 
then choose Export from the file menu, or press F7. Use the resulting pop-up
 window to tell Astoria where to place the exported file and click <ent value="ldquo"></ent>OK<ent
value="rdquo"></ent>. A clean copy of the file will be deposited on your disk
in  the specified location. (Note: When exporting text bibliographic records
there  are two ways to do it: by highlighting the document-that is, the image
of  a piece of paper followed by the SYSID-or by highlighting the TIBBIBL
contained  in that document. The former is the desirable option here. That
way the exported  document will have a public doctype declaration, which will
be recognized  by Epic. The other way, by opening that page icon and highlighting
the yellow  or red arrow and exporting, will give you a document with a system
doctype  declaration. This means Epic will look for the TIB or TIBBIBL DTD
on your  machine. Since it will not find the DTD there, the document will
not parse  and will instead open as a text file.)</p></div3><div3><head>Publishing
from  Astoria</head><p>Once the catalog record is complete, it can then be
published.  Because of the amount of sharing that is done in the Samantabhadra
catalog,  it is more time efficient to publish each volume individually and
then cut  and paste these into the edition catalog record, also published
separately.  This is done in the following way:</p><list n="1" rend="1"><item>Highlight
 the catalog record and choose the option <ent value="lsquo"></ent>Make Edition<ent
value="rsquo"></ent> from the Action menu or Ctrl-Shift-E.</item><item>In
the resulting  dialogue box, put the name of the file followed by <ent value="lsquo"></ent>.ed<ent
value="rsquo"></ent>. For instance, Tb.v06.ed. In the description box, put
today<ent value="rsquo"></ent>s date so as not to confuse previous and subsequent
editions.  Finally choose the <ent value="lsquo"></ent>Place Before<ent value="rsquo"></ent>
option  and click OK. The new edition will appear above the catalog record.</item><item>Delete
 the old edition, if there is one, by highlighting it and choosing <ent value="lsquo"></ent>Delete<ent
value="rsquo"></ent> from the Action menu or simply press the delete key.
This  may take a while depending on the size of the file.</item><item>Finally,
the  edition needs to be exported from Astoria. To do this, highlight the
new .ed  file and choose <ent value="lsquo"></ent>Export<ent value="rsquo"></ent>
from the Object  menu, or press F7. Give the exported file the same name as
the Astoria file  but add <ent value="lsquo"></ent>.sgm<ent value="rsquo"></ent>
to it. Thus, it would  be <ent value="lsquo"></ent>Tb.v06.ed.sgm<ent value="rsquo"></ent>.
Choose a folder  to save all the editions in so they can be easily concatenated.</item><item>You
 now have a folder with the published version of the edition and volumes.
These  need to be cut and pasted together to form a single document that is
the final  catalog. This is a very simple process with a little understanding
of SGML.  First open the Edition file in a text editor like NoteTab. The body
of the  Edition file (for the mTshams brag) is composed of the large divisions
or  DIV1 type=Atiyoga, Anuyoga, and Mah<ent value="amacr"></ent>yoga. For
each volume  that falls in that category, you must open its file and copy
the DIV2 it contains  and paste it in the appropriate DIV1 (e.g., the DIV2s
found in vols 1-14 go  in the editions DIV1 Type=Atiyoga).</item></list></div3><div3><head>Making
 Book for Dynaweb</head><p>When the full catalog edition has been published,
 you need to run a Dynatext routine called Mkbook (i.e., make book) on the
 SGML file, which processes SGML and indexes it. If this is a final proofed
version  to be put up on the web, you do nothing but give it, along with the
style  sheets, to the network systems manager, who will run the Dynaweb's
Mkbook on the  Unix server and make the catalog from the link on our web-page.
However,  it is also possible to do this locally on a machine with DynaText
and Dynaweb installed on it in order  to locally view the catalog and proof
it for style-sheet or SGML mistakes  that proofreaders will not catch. But,
before the new catalog can be view locally  you have to locally make book
on it. This is done as follows:</p><list n="1" rend="1"><item>The new published
SGML file needs to be renamed by its sigla followed by .ed.sgm<ent value="mdash"></ent>for
example, <ent value="ldquo"></ent>Tb.ed.sgm<ent value="rdquo"></ent> and put
in the following folder: C:\dtext43\ebtdocs\imcdoc\books\tb.ed</item><item>Make
sure the book is listed in the booklist within imcdoc. The entry should look
something like: <q>&lt;BOOK NAME="tb.ed" ALIAS="mTshams brag Edition" VERSION=""
EDITION="20010910" TYPE="Adi3xDocument"> &lt;/BOOK></q></item><item>Then 
you need to open a MSDOS window from the START menu.</item><item>Go to the
 correct directory by typing: cd ..\dtext43\bin </item><item>Then invoke the
 make book routine by typing: <q>mkbook -fa tb.ed</q></item></list><p>Once
the book is "made", it can then be copied over into the corresponding folder
in the Dynaweb application. If you are working on styles either manually or
using InSted, these styles are kept on the Dynaweb. So, make sure you always
copy the latest styles from Dynaweb to DText before making book. Then, after
book is made, the whole folder can be copied back into Dynaweb for viewing.
On the Dynaweb option on the Start Menu>Programs, there is a choice for "Dynaweb
Internet Server http-6336 Controller" which will be the local server. (The
number may be different.) Use this to start, stop, and restart the local Dynaweb
server. This needs to be done every time you change the style sheets or re-make
book.</p></div3></div2><div2><head>Proofreading Guidelines</head><p>All  catalog
entries as well as entries of electronic transliteration are proofed  at least
once. It is a fundamental principle of this project that the proofreader 
must be a different person from the original data-entry person, as this is
 more likely to ensure that individual idiosyncratic mistakes are caught and
 corrected. It is also insisted that the proofreading be done against a hard
 copy as opposed to a screen version. Once a volume has been completely cataloged
 and the files safely deposited on the FTP server, the cataloger notifies 
the project manager who assigns a proofreader to it. The proofreader converts
 the entry-forms into SGML and prints out the results for each file. The SGML
 files are also stored in Astoria for version control. Comparing the printout
 directly with the woodblock print, the proofreader checks all aspects<ent
value="mdash"></ent>pagination, Tibetan, number of sections, etc. Any mistakes
 are marked in red on the printout. When the volume is completely checked,
 the proofreader then checks out the SGML files from Astoria, one by one,
and  enters the corrections, adjusting the meta-data of each file to reflect
that  s/he has proofed it. The print-out is filed away for a certain amount
of time  for future reference, if necessary.</p><p>The basic steps of proofing
are  in list form:</p><p><list n="1" rend="1"><item>Convert entry-form to
SGML  (when necessary)</item><item>Insert notes and make changes from instructions
 on entry-form</item><item>Check SGML file into Astoria in a folder of the
 same name</item><item>Print out SGML file</item><item>Check print-out against
 original, marking mistakes in red. (Do whole volume)</item><item>Check out
 SGML file from Astoria</item><item>Make corrections/changes.</item><item>Enter
 initials and date of proofing to text's metadata.</item><item>Check SGML
back  into Astoria.</item></list></p></div2><div2><head>Unique Elements of
TIBBIBL</head><p>The  TIBBIBL DTD, which was designed for the purposes of
cataloging Tibetan collections,  was based on the DTD developed by TEI (Text-Encoding
Initiative). The bulk  of the elements defined in TIBBIBL are identical to
the light version of the  TEI DTD. However, it quickly became apparent that
even the extensive TEI DTD  could not account for some of the peculiarities
of the Tibetan cultural tradition.  To this end, we had to augment TEI with
a number of additional elements aimed  precisely at cataloging Tibetan texts.
The following lists describe these  additions. When TEI elements are used
extensively in our records, these are  recorded. The first table presents
an alphabetical list of elements found  in the TIBBIBL records and a brief
description of how each is used:</p><p><table><row rend="head"><cell>Element
Name</cell><cell>Element Description</cell></row><row><cell>ADDRESS</cell><cell>A
 grouping element for a multi-lined address. Each line of the address goes
 in an ADDRLINE element.</cell></row><row><cell>ADDRLINE</cell><cell>An address-line
 data element, contains a line of an address.</cell></row><row><cell>ALTID</cell><cell>Information
 element which contains the Tibetan letter ID for volumes. Its Lang attribute
 should be TIB. Generally used with lang=Tib and system=letter to record the
 Tibetan letter assigned to a volume or a text.</cell></row><row><cell>CHANGE</cell><cell>A
 grouping element within REVISIONDESC within CONTROLINFO. Used as metadata
 to describe changes made to an electronic file.</cell></row><row><cell>CONTROLINFO</cell><cell>Grouping
 element containing general information pertaining to the document. RESPSTMT
 are found here.</cell></row><row><cell>DATE</cell><cell>Contains a date.
When  entering Tibetan dates, change the Calendar attribute of this element
to Tibetan  and place the Gregorian date in the Value attribute in the form,
yyyy.mm.dd,  if that much information is available. Otherwise the year or
range of years.  Negative years indicate BCE. Between the tags themselves
enter the Wylie transliteration  of the Tibetan name for the year, such as <hi>lcags
'brug</hi>. For Gregorian  dates, Calendar=Gregorian; Value=yyyy.mm.dd, and
within the tags enter the  full written form of the date, e.g., January 4,
1954. For mythological dates,  Calendar=Mythological; Value=Beginning Time,
End Time, Indeterminate Time;  between tags enter the transliteration of the
Tibetan phrase indicating this  time, such as <hi rend="italic">de'i dus su</hi>.</cell></row><row><cell>DESIGNATION</cell><cell>Data
 element used within TITLEITEM for the designation or chapter number of the
 chapter that uses a certain end-of-chapter text-title. The chapter number
 alone is entered (Arabic numeral), according to what the Tibetan lists. If
 it is a misnumbered chapter, the correct section number without the 'b' is
 placed in parentheses next to it. Thus, the entry of "<hi>le'u ste brgyad
 pa</hi>" that is actually the sixth chapter because it was mistakenly numbered
 would be "8 (6)"</cell></row><row><cell>DIMENSIONS</cell><cell>Data element
 for the physical dimensions of an item.</cell></row><row><cell>DIVCOUNT</cell><cell>Data
 element for the actual number of divisions of a certain type within an item.
 Contained within DIVDECL and differentiated by class into total, front, body,
 and back or by doxographical genre.</cell></row><row><cell>DIVDECL</cell><cell>Grouping
 element for information on the divisions of an item, i.e., volumes in an
edition,  texts in a volume, chapters in a text, etc. The divisions are contained
in  internal DIVCOUNT tags differentiated as total, front, body, and back.</cell></row><row><cell>DOXOGRAPHY</cell><cell>Data
 element within INTELLDECL used to describe various intellectual classifications
 of an item. Not strictly limited to "doxography" but distinguished by type
 into 1) relation: refers to the item<ent value="rsquo"></ent>s relation to
a larger  collection, 2) category: refers to the actual doxographical category
(distinguished  into two sub-types: n=0 refers to the master doxographical
category assigned  by a present-day group of scholars and n=1 refers to the
traditional doxographical  category), 3) genre: refers to the literary genre
of an item: s<ent value="umacr"></ent>tra,  tantra, canon, poem, 4) keywords:
as a grouping element it contains subject  keywords within the element TERM.</cell></row><row><cell>EDITION</cell><cell>Data
 element for the name of an edition. Used in MSSDESC for traditional publication
 (Edition would be named <hi>sde sge</hi>, etc.) and manuscript information.</cell></row><row><cell>EDITOR</cell><cell>Data
 element for the name of an editor used in MSSDESC after EDITION. Use NAME
 and ADDRESS elements for specific information.</cell></row><row><cell>EXTENT</cell><cell>Data
 element for the number of sides. Distinguished by type.</cell></row><row><cell>EXTENTDECL</cell><cell>Grouping
 element for the extent of an item. Extent here refers to the total number
 of folio sides. Whereas pagination is a range, extent is a count of sides.</cell></row><row><cell>FOREIGN</cell><cell>Can
 be used in situations where Tibetan and English are side-by-side to distinguish
 them from each other. An older, deprecated method of dealing with Tibetan
 in HEADers. The lang attribute would be set to TIB</cell></row><row><cell>GENREFORM</cell><cell>Data
 element for the form of a text, edition and so forth. Distinguished by type.
 This refers to physical mediums such as xylograph, photocopy, photostat,
microfilm,  etc.</cell></row><row><cell>HEAD</cell><cell>This stands for header
and is  used to label sections and information elements for display purposes.
To ensure  professional display, the use of HEAD elements should always be
consistent.</cell></row><row><cell>IDNO</cell><cell>Information  element for
other identification numbers for an edition. Found within a TIBIDDECL,  its
various forms include LOC number, OCLC number, ISBN number, call numbers,
 or even other catalog numbers, such as Kaneko. The different IDNOs are distinguished
 through their Type attribute.</cell></row><row><cell>IMPRINT</cell><cell>Grouping
 tag contains publisher information. Used within MSSDESC for traditional and
 manuscript publishing information.</cell></row><row><cell>INTELLDECL</cell><cell>Grouping
 element for the Intellectual Description of an item. Refers to such things
 as literary genre, doxography, relation to collection, and subject keywords.</cell></row><row><cell>ITEM</cell><cell>A
 data element contained within CHANGE in the REVISIONDESC of the CONTROLINFO
 metadata. Used to describe the kind of change made on the electronic version
 of the text.</cell></row><row><cell>MSSDECL</cell><cell>Manuscript declaration.
 For 'publishing' information concerning a manuscript. This is also used for
 publishing information of wood-block prints with type=Traditional publication.</cell></row><row><cell>NAME</cell><cell>Contains
 a name. The Type attribute can be used to distinguish what kind of name it
 is, i.e., Translator, Audience, etc., and the Lang attribute can be set to:
 unspecified, Tibetan, Sanskrit, or English. If a name is entered in modified
 Wylie transliteration, the Lang element should always be changed to TIB.
Often  follows RESP and contains the name of the person or organization who
handled  that "RESPonsibility". Also used next to a DIVCOUNT for the chapter
designations,  i.e., the generic name(s) given to chapters, such as <hi>le'u</hi>, <hi>skabs</hi>
 and so forth.</cell></row><row><cell>ORGNAME</cell><cell>Data element for
 an organization<ent value="rsquo"></ent>s name. The type attribute is used
to distinguish  the type of organization, e.g. Monastery. The typology for
this has not yet  been clearly defined but a provisional list is available.</cell></row><row><cell>ORIGINATION</cell><cell>Grouping
 element for the various figures involved in the provenance or creation of
 an item. Most of the information is contained in RESPDECLs within this grouping
 element.</cell></row><row><cell>PAGINATION</cell><cell>Data element for the
 pagination of an item. Contains beginning and ending pagination separated
 by a dash. No line numbers are needed for volumes but they are included for
 texts separated from the page by a period (e.g., 583.4-629.1).</cell></row><row><cell>PERSNAME</cell><cell>Data
 element for a personal name. The type attribute is used to distinguish the
 type of person, e.g., Buddha, Tibetan translator, etc. The full typology
for  this has not been finalized. Found most often in RESPDECL.</cell></row><row><cell>PHYSDECL</cell><cell>Grouping
 element for the physical description of an edition, volume, text, or manuscript.
 Contains pagination, extent (i.e., # of pages), divisions, genre form, dimensions,
 and other physical facets. Found most often in RESPDECL.</cell></row><row><cell>PHYSFACET</cell><cell>Data
 element for a physical facet of the text. Distinguished by type, these include
 such things as Paper and ink, Standard lines per page, Page formatting, Illustrations
 (yes/no), and Script.</cell></row><row><cell>PLACNAME</cell><cell>Data element
 for a place name. Distinguished by type and lang elements. Tibetan place
names  are entered in extended Wylie and Lang is changed to <ent value="ldquo"></ent>tib<ent
value="rdquo"></ent>.</cell></row><row><cell>PUBDECL</cell><cell>A grouping
element  to contain all the publishing information concerning an edition,
text, or  volume. In a volume or multi-volume work, the publication information
will  always go in the record of the highest level. Thus, publishing information
 goes in the Edition bibliographic record if there is one. If not, then it
 goes in the volume bibliographic record. It is only included with a text
if  it is a single text published alone. This is divided into three parts:
Contemporary  Publishing Information (pubdecl), Traditional Publishing Information
(stemmadecl  1), and Manuscript Information (stemmadecl 2).</cell></row><row><cell>PUBLICATIONSTMT</cell><cell>A
 grouping tag that contains the statement of publication, namely publisher,
 place, date, and a list of other figures responsible. This is found in the
 rubric of Contemporary Publishing Information.</cell></row><row><cell>PUBLISHER</cell><cell>Data
 element for publisher<ent value="rsquo"></ent>s full name.</cell></row><row><cell>PUBPLACE</cell><cell>Data
 element for the place of publication</cell></row><row><cell>REPSISTORY</cell><cell>Grouping
 tag contains information on who has a manuscript/wood-block print. Contains
 either ORGNAME or PERSNAME of organization or person who has the manuscript
 in question.</cell></row><row><cell>RESP</cell><cell>Information element
within  a RESPSTMT describing the role or "RESPonsibility" that those names
grouped  with it had in producing the electronic document, i.e., creator,
proofreader,  sponsor.</cell></row><row><cell>RESPDECL</cell><cell>Grouping
element, a responsibility  declaration, much like a RESPSTMT. Distinguished
by type into author, audience,  requester, redactor, translator, concealer,
revealer, patron and editor. Further  distinctions between types of redactors
into translation, revelation and so  forth are distinguished using the n attribute.</cell></row><row><cell>RESPSTMT</cell><cell>Grouping
 element found in CONTROLINFO and elsewhere for acknowledging those responsible
 for the creation of the document. Contains a RESP and one or more NAME elements.</cell></row><row><cell>REVISIONDESC</cell><cell>Revision
 description. Part of the metadata in CONTROLINFO. A grouping element to contain
 all the changes and additions to an electronic file. Contains the grouping
 element, CHANGE</cell></row><row><cell>SOURCE</cell><cell>Data element for
 the source of the information of the immediately preceding element. Distinguished
 by REND attribute into two display types: 'Rend=own line' will list the source
 on its own line with a label "Source:". If Rend is left blank, the source
 will be listed without a label and within parentheses next to the preceding
 element<ent value="rsquo"></ent>s listing.</cell></row><row><cell>SYSID</cell><cell>A
 data element for the System ID for the record. The file-name for the record
 is put here. It is contained in the CONTROLINFO element.</cell></row><row><cell>TIBANAL</cell><cell>Grouping
 element for Tibetan Analysis. This is used to nest TIBBIBLs within TIBBIBLs.
 Generally used for one of two purposes: 1) to list the sub-components of
an  item, mainly chapters within a text, and 2) to encode the outline (<hi>sa
 bcad</hi>) of a text. In the first use, there are three TIBANALS: n=1 contains
 the front sections; n=2 contains the body sections; n=3 contains the back
 sections.</cell></row><row><cell>TIBID</cell><cell>Information and grouping
 element containing the unique ID for the Tibetan document. It is found within
 TIBIDDECL and can contain other TIBIDs and ALTIDs. Its important attributes
 are: Type, a required attribute which can be either Edition, Volume, Text,
 Chapter, or Class; and System, which can be Sigla, Number, Letter, or unspecified.
 The general combinations are [Type=Edition, System=Letter]->Edition name,
 such as "mTshams brag"; [Edition/Sigla]->Edition Sigla; [Text/Number]->ID
 number of text, its sequential number within the edition; [Volume/Number]->the 
volume<ent value="rsquo"></ent>s number; [Text/Number]->number of text within
volume.  The two Text/Number TIBIDs are distinguished by their position. The
TIBIDs  are nested within each other so the two Text/Number TIBIDs are associated
 with their immediate parent: the edition sigla and volume number, respectively.</cell></row><row><cell>TIBIDDECL</cell><cell>Grouping
 element that contains the information for identifying the document being
described  within TIBIDs and IDNOs.</cell></row><row><cell>TITLE</cell><cell>Information
 element found within TITLEDECL containing Tibetan titles for editions, volumes,
 texts, and chapters, or their English translations. A Tibetan title and its
 translation would be in two separate TITLE elements within a TITLEDECL and
 distinguished by the Lang attribute. Different TITLE elements within the
same  record are labeled with their Type attribute unless they are distinguished
 by their grouping tag. Titles within TITLEDECLs within TITLEDIVs do not need
 a type because the TITLEDIV is sufficiently typed to describe the title it 
contains.</cell></row><row><cell>TITLEDECL</cell><cell>Grouping element for
 title information. It contains one or more TITLE elements, SOURCE elements,
 and DISCUSSION elements. This is used to isolate the normalized title for
 an item from the other variant titles which are contained in the TITLEINFO
 grouping element. Thus, the TITLEs within this element will generally have
 the type=Normalized title. It is also used within TITLEDIVs for the variant
 titles. These titles do not use their type attribute.</cell></row><row><cell>TITLEDIV</cell><cell>Grouping
 element for individual variant titles. Each variant title should have its
 own TITLEDIV. Distinguished by type and subtype. Type describes general location
 of variant title: front, body, back, margin, oral, non-Tibetan, and secondary
 (literature). These are fixed categories. Subtype is an open field for a
more  specific description of the exact source, i.e., Title page, Closing
section,  etc.</cell></row><row><cell>TITLEINFO</cell><cell>Grouping element
containing  all the variant titles for an item. Thus, first element is a HEAD
containing  Title Variants. Contains a series of TITLEDIV grouping elements
that contain  the information concerning each variant title.</cell></row><row><cell>TITLEITEM</cell><cell>Grouping
 element for reference to an instance of an end-of-chapter text-title. Contains
 a DESIGNATION for the chapter number and PAGINATION for the pagination of
 that title. Contained within a TITLELIST after its TITLEDECL a series of
TITLEITEMS  will record each chapter that uses that end-of-text title.</cell></row><row><cell>TITLELIST</cell><cell>Grouping
 element for variant text-titles that come from chapters of a text. This was
 devised because several chapters will often share a text-title but there
will  be several such end-of-chapter text-titles in the work as a whole. The
title  list accounts for this by containing a single TITLEDECL at its opening.
This  contains the shared text-title found at the end of certain chapters.
It is  followed by a series of TITLEITEMs which contain the chapter number
and pagination  for each instance of that title.</cell></row></table></p><p>The
second table  lists the elements in alphabetical order along with their primary
characteristics.  The first column is the tag name. Though these are case-independent,
the primary  letters are capitalized for ease of reading. The second column
describes the  type of tag. There are two basic types of tags: g for grouping
tags whose  primary or sole purpose is to contain and group other tags and
d for data  tags whose primary purpose is to hold data. The third column contains
attributes  of that tag that are commonly used. All tags/elements come with
a standard  set of global attributes in TEI. These are not listed here. Only
the particular  attributes that are used in particular for that element are
listed along with  some of their more common values. Type is the most often
used. As the typology  for many of the categories is not finalized, the user
should not assume that  all the types for a tag are listed. Only the most
common ones are. The last  two columns are <ent value="ldquo"></ent>Contained
in<ent value="rdquo"></ent> and <ent value="ldquo"></ent>Contains<ent value="rdquo"></ent>.
Each is for the tags or elements  that the present element can be <ent value="ldquo"></ent>contained
in<ent value="rdquo"></ent>  or <ent value="ldquo"></ent>contain<ent value="rdquo"></ent>.
Only the unusual elements  are listed in these columns. Data is listed as
a content in the special situations  where that is the only thing an element
can contain (CDATA only). Other common  elements (P for paragraph and Q for
quote) are often found within other elements  and are for this reason not
listed here. Therefore, an empty cell with a <ent value="ldquo"></ent>-<ent
value="rdquo"></ent> in it only means that there are no special  elements
to list there.</p><p><table rend="1"><row rend="head"><cell>Tag Name</cell><cell>Type</cell><cell>Attributes</cell><cell>Contained
 in</cell><cell>Contains</cell></row><row><cell>Address</cell><cell>g</cell><cell></cell><cell>PublicationStmt</cell><cell>Addrline</cell></row><row
><cell>AddrLine</cell><cell>d</cell><cell></cell><cell>Address</cell><cell></cell></row><row><cell>Altid</cell><cell>d</cell><cell>Lang=tib,
 System=letter</cell><cell>Tibid</cell><cell>CDATA only</cell></row><row><cell>Analytic</cell><cell>g</cell><cell></cell><cell>MssDesc</cell><cell
>Editor, RespStmt, Author, Title</cell></row><row><cell>Change</cell><cell>g</cell><cell></cell><cell>RevisionDesc </cell><cell>Date,
 Respstmt, Item</cell></row><row><cell>Controlinfo</cell><cell>g</cell><cell></cell><cell>Tibid</cell><cell>Sysid,
 Date, Respstmt, Revisiondesc</cell></row><row><cell>Date</cell><cell>d</cell><cell>Calendar=Gregorian,
 Tibetan, Mythological; Value=yyyy.mm.dd, Beginning time, End time, Indeterminate
 time</cell><cell>Controlinfo, PublicationStmt, Imprint, RespDecl</cell><cell></cell></row><row><cell>Designation</cell><cell>d</cell><cell></cell
><cell></cell><cell></cell></row><row><cell>Designation</cell><cell>d</cell><cell></cell><cell>TitleItem</cell><cell></cell></row><row><cell>Dimensions</cell
><cell>d</cell><cell>Type=Page size</cell><cell>PhysDecl</cell><cell></cell></row><row><cell>Discussion</cell><cell>d</cell><cell>Type=Doxography,
 Author source, Author affiliations, Audience, Original requestor, Redaction,
 Original language, Translators, Transmission status, Lineage history, Patronage
 relationship, Translation editor, Common title, Object of Homage, Publication,
 Editing, Traditional publication, Distinctive features, Reproduction of manuscript,
 Manuscript, Chapter-level elements, Master doxographical category, Edition
 doxographical category, Keywords, Author source, Author affiliation, Audience,
 etc.</cell><cell>Pubdecl, StemmaDecl, PhysDecl, IntellDecl, RespDecl, TitleDecl</cell><cell></cell></row><row><cell>DivCount</cell><cell>d</cell
><cell>Class=Total, Front, Body, Back, Atiyoga, Anuyoga, Mah<ent value="amacr"></ent>yoga</cell><cell>DivDecl</cell><cell></cell></row><row><cell
>DivDecl</cell><cell>g</cell><cell>Type=Volumes, Texts, Chapter-level elements</cell><cell>PhysDecl</cell><cell>DivCount</cell></row><row><cell>Doxography</cell
><cell>d</cell><cell>Type=Relation, Category (n=0 or 1), genre, Keywords</cell><cell>IntellDecl</cell><cell>Term</cell></row><row><cell>Edition</cell
><cell>d</cell><cell>lang=tib</cell><cell>MssDecl</cell><cell></cell></row><row><cell>Editor</cell><cell>d</cell><cell>lang=tib</cell><cell>MssDecl</cell
><cell>Name</cell></row><row><cell>Extent</cell><cell>d</cell><cell>Class=Total,
 Atiyoga, Anuyoga, Mah<ent value="amacr"></ent>yoga</cell><cell></cell><cell></cell></row><row><cell>Extent</cell><cell>d</cell><cell>type=Total,
 Atiyoga, Anuyoga, Mah<ent value="amacr"></ent>yoga</cell><cell>ExtentDecl</cell><cell></cell></row><row><cell>ExtentDecl</cell><cell>g</cell><cell
>Type=Sides</cell><cell>PhysDecl</cell><cell>Extent</cell></row><row><cell>Foreign</cell><cell>d</cell><cell>Lang=TIB</cell><cell></cell><cell></cell
></row><row><cell>GenreForm</cell><cell>d</cell><cell>Type=Original, Reproduction</cell><cell>PhysDecl</cell><cell></cell></row><row><cell>Head</cell
><cell>d</cell><cell></cell><cell>Tibbibl, Pubdecl, Stemmadecl, PhysDecl,
IntellDecl, Origination,  TitleDecl, TitleInfo, TibAnal</cell><cell>Foreign</cell></row><row><cell>Idno</cell><cell>d</cell><cell>Type=LOC
 number (new), LOC number (old), OCLC number, ISBN number, Library call-number,
 etc., Arcive ID</cell><cell>Tibiddecl, MssDesc</cell><cell>CDATA only</cell></row><row><cell>Imprint</cell><cell>g</cell><cell></cell><cell>MssDecl</cell
><cell>Publisher, PubPlace, Date</cell></row><row><cell>IntellDecl</cell><cell>g</cell><cell></cell><cell>Tibbibl</cell><cell>Head,
 Doxography, Discussion</cell></row><row><cell>Item</cell><cell>d</cell><cell></cell><cell>Change</cell><cell></cell></row><row><cell>MssDesc</cell
><cell>g</cell><cell>type=Traditional publication, Manuscript</cell><cell>StemmaDecl</cell><cell>Edition,
 Editor, RespStmt, Repository, IdNo, Imprint, Analytic</cell></row><row><cell>Name</cell><cell>d</cell><cell>Type=chapter
 designation</cell><cell>RespStmt, Editor, DivDecl</cell><cell></cell></row><row><cell>Orgname</cell><cell>d</cell><cell>Type=Monastery,
 Movement, Department, Institution, University, Library</cell><cell>RespDecl,
 Repository</cell><cell></cell></row><row><cell>Origination</cell><cell>g</cell><cell></cell><cell>Tibbibl</cell><cell>Head,
 RespDecl, Discussion</cell></row><row><cell>Pagination</cell><cell>d</cell><cell>Type</cell><cell>PhysDecl,
 TitleDiv, TitleItem</cell><cell></cell></row><row><cell>Persname</cell><cell>d</cell><cell>Type=Speaker,
 Interlocutor, Buddha, Consort, Bodhisattva, Guardian, Original requestor,
 Redactor, Indian scholar, Tibetan translator, Concealer, Revealer, Patron,
 Recipient, Translation Editor (This typology is not final.)</cell><cell>RespDecl,
 Repository</cell><cell></cell></row><row><cell>Physdecl</cell><cell>g</cell><cell></cell><cell>Tibbibl</cell><cell>Head,
 ExtentDecl, DivDecl, Discussion, GenreForm, PhysFacet, Dimensions</cell></row><row><cell>Physfacet</cell><cell>d</cell><cell>Type=Illustrations,
 Script, Materials, Standard lines per page, Page formatting, Paper and Ink</cell><cell>PhysDecl</cell><cell></cell></row><row><cell>Placename</cell
><cell>d</cell><cell>Type=Composition, Request, Redaction, Translation, Concealment,
 Revelation, Patronage, Editing;<lb></lb>Lang=Tib, Eng</cell><cell>RespDecl</cell><cell></cell></row><row><cell>Pubdecl</cell><cell>g</cell><cell
></cell><cell>Tibbibl</cell><cell>Head, Publicationstmt, Discussion</cell></row><row><cell>PublicationStmt</cell><cell>g</cell><cell></cell><cell
>Pubdecl</cell><cell>Publisher, Pubplace, Date, Respstmt, Address</cell></row><row><cell>Publisher</cell><cell>d</cell><cell></cell><cell>PublicationStmt,
 Imprint</cell><cell></cell></row><row><cell>PubPlace</cell><cell>d</cell><cell></cell><cell>PublicationStmt,
 Imprint</cell><cell></cell></row><row><cell>Repository</cell><cell>g</cell><cell>type=Original
 location, Current location</cell><cell>MssDecl</cell><cell>Orgname, Persname</cell></row><row><cell>Resp</cell><cell>d</cell><cell></cell><cell>RespStmt</cell
><cell></cell></row><row><cell>Respdecl</cell><cell>g</cell><cell>Type=Author,
 Audience, Requestor, Redactor (n=Original, Translation, Revelation, Modern),
 Translator, Concealer, Revealer, Patron, Scribe, Editor</cell><cell></cell><cell></cell></row><row><cell>Respstmt</cell><cell>g</cell><cell></cell
><cell>RevisionDesc, Pubdecl, MssDecl, Analytic</cell><cell>Resp, Name</cell></row><row><cell>RevisionDesc</cell><cell>g</cell><cell></cell><cell
>ControlInfo</cell><cell>Change</cell></row><row><cell>Source</cell><cell>d</cell><cell>Rend=own
 line</cell><cell>RespDecl, TitleDecl</cell><cell></cell></row><row><cell>StemmaDecl</cell><cell>g</cell><cell>n=History,
 Traditional, Manuscript</cell><cell>Tibbibl</cell><cell>Head, MssDesc, P,
 Discussion</cell></row><row><cell>Sysid</cell><cell>d</cell><cell></cell><cell>Control
 Info</cell><cell></cell></row><row><cell>Tibanal</cell><cell>g</cell><cell></cell><cell></cell><cell></cell></row><row><cell>TibAnal</cell><cell
>g</cell><cell>n=1 (front), n=2 (body), n=3 (back)</cell><cell>Tibbibl</cell><cell>Head,
 Tibbibl</cell></row><row><cell>Tibid</cell><cell>d</cell><cell>Type=Chapter,
 Class Text, Volume, Edition;<lb></lb>System=Number, Sigla, Letter</cell><cell>Tibiddecl,
 Tibid</cell><cell>Tibid, AltId</cell></row><row><cell>Tibiddecl</cell><cell>g</cell><cell></cell><cell>Tibbibl</cell><cell>Tibid,
 IdNo</cell></row><row><cell>Title</cell><cell>d</cell><cell>Type=Normalized
 title, Chapter, Non-Tibetan, Edition</cell><cell>Head, TitleDecl</cell><cell></cell></row><row><cell>Titledecl</cell><cell>g</cell><cell></cell><cell
>Tibbibl, TitleDiv</cell><cell>Head, Title, Source, Discussion</cell></row><row><cell>Titlediv</cell><cell>g</cell><cell>Type=Front,
 Body, Back, Nontibet</cell><cell>TitleInfo</cell><cell>TitleDecl, Pagination</cell></row><row><cell>Titleinfo</cell><cell>g</cell><cell></cell><cell
>Tibbibl</cell><cell>TitleDiv</cell></row><row><cell>TitleItem</cell><cell>g</cell><cell></cell><cell>TitleList</cell><cell>Designation,
 Pagination</cell></row><row><cell>TitleList</cell><cell>g</cell><cell></cell><cell>TitleDiv</cell><cell>TitleDecl,
 TitleItem</cell></row></table></p></div2></div1>
</body></text></tei.2>
