Sophie

Sophie

distrib > Mandriva > 2010.2 > x86_64 > by-pkgid > a58d85af59edf7fb701c73ea3ce1c5aa > files > 72

hevea-1.10-4mdv2010.0.x86_64.rpm

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN"
            "http://www.w3.org/TR/REC-html40/loose.dtd">
<HTML>
<HEAD>
<TITLE>How to Use HEVEA with the Thai Character Set
</TITLE>

<META http-equiv="Content-Type" content="text/html; charset=US-ASCII">
<META name="GENERATOR" content="hevea 1.10">
<STYLE type="text/css">
.caption{padding-left:2ex; padding-right:2ex; margin-left:auto; margin-right:auto}
.title{margin:2ex auto;text-align:center}
.center{text-align:center;margin-left:auto;margin-right:auto;}
DIV TABLE{margin-left:inherit;margin-right:inherit;}
PRE{text-align:left;margin-left:0ex;margin-right:auto;}
BLOCKQUOTE{margin-left:4ex;margin-right:4ex;text-align:left;}
TD P{margin:0px;}
.hbar{border:none;height:2px;width:100%;background-color:black;}
</STYLE>
</HEAD>
<BODY >
<!--HEVEA command line is: ../hevea.opt -fix -exec xxdate.exe -O -o doc/thaihevea.html thai/thaihevea.ttex -->
<!--CUT DEF section 1 --><TABLE CLASS="title"><TR><TD><H1 CLASS="titlemain">How to Use H<FONT SIZE=2><sup>E</sup></FONT>V<FONT SIZE=2><sup>E</sup></FONT>A with the Thai Character Set</H1><H3 CLASS="titlerest">Andrew Seagar and &#XE19;&#XE34;&#XE15;&#XE22;&#XE32; &#XE0B;&#XE35;&#XE01;&#XE32;&#XE23;&#XE4C;<BR>
email: dr_andrew_seagar@ieee.org</H3></TD></TR>
</TABLE><!--TOC section Latin/Thai Character Set-->
<H2 CLASS="section"><!--SEC ANCHOR --><A NAME="htoc1">1</A>&#XA0;&#XA0;Latin/Thai Character Set</H2><!--SEC END --><P>Thai L<sup>A</sup>T<sub>E</sub>X is written in the TIS-620 character encoding. Some
people call this ISO-8859-11, but that name was (for a long time)
never officially recognised.</P><P>The TIS-620 character encoding is an 8-bit single byte character set.
It encodes both the ASCII Latin characters (0-127) and the Thai
characters (128-255). See, for the official Thai definition, the
docuemnt:<BR>
&#X201C;ISO 8859-11 Latin/Thai Character Set standard&#X201D;<BR>
at the
website:<BR>
<B>www.nectec.or.th/it-standards/iso8859-11/</B></P><P>Non-Thai variations to the official Thai character set were introduced
by some vendors. The <SPAN STYLE="font-variant:small-caps">Windows</SPAN> Thai character set (874) places
an unofficial &#X2018;smart quote&#X2019; character into one of the empty (illegal)
slots in the official Thai set. The DEC (Digital Equipement
Coorporation) character set places an unofficial &#X2018;no-break space&#X2019;
character into another of the empty (illegal) slots in the
<EM>original</EM> official Thai set. It is not too clear what is now
&#X201C;official&#X201D; and what is not. It is necessary to be a little bit
careful. Importing &#X201C;Thai&#X201D; docuemnts from <SPAN STYLE="font-variant:small-caps">Windows</SPAN> into a
<SPAN STYLE="font-variant:small-caps">Linux</SPAN> environment via (for example) Openoffice doesn&#X2019;t always
produce a faithful copy of the original text.</P><P>Figure&#XA0;<A HREF="#thaichar">1</A> shows the Thai characters according to the
Unicode Standard (version 3.0).</P><!--TOC section Thai in L<sup>A</sup>T<sub>E</sub>X-->
<H2 CLASS="section"><!--SEC ANCHOR --><A NAME="htoc2">2</A>&#XA0;&#XA0;Thai in L<sup>A</sup>T<sub>E</sub>X</H2><!--SEC END --><P>For Thai in L<sup>A</sup>T<sub>E</sub>X the package &#X2018;thai&#X2019; (file: thai.sty) is used,
<EM>i.e.</EM> <CODE>\usepackage{thai}</CODE>.</P><P>The source is run through a preprocessor (cttex) to encapsulate all
Thai text within bracketted pairs <CODE>{\thai ....}</CODE> and to insert
the thai-break &#X2018;<CODE>\tb</CODE>&#X2019; separator.</P><P>Normally Thai text is written in a continuous stream with few (if any)
blank (space) characters. The preprocessor inserts the &#X2018;<CODE>\tb</CODE>&#X2019;
command to indicate places where the text may be broken if near the
end of a line. If these separators are not inserted L<sup>A</sup>T<sub>E</sub>X has a
great deal of trouble in getting a flush right margin without leaving
huge gaps in the text.</P><P>The style file &#X2018;thai.sty&#X2019; contains the definitions for <CODE>{\thai ....}</CODE> 
and <CODE>\tb</CODE>. The <CODE>{\thai ....}</CODE> command is used to switch the
L<sup>A</sup>T<sub>E</sub>X font.</P><P>After passing through the preprocessor, the file is compiled by
L<sup>A</sup>T<sub>E</sub>X in the normal fashion.</P><!--TOC section Thai in H<FONT SIZE=2><sup>E</sup></FONT>V<FONT SIZE=2><sup>E</sup></FONT>A-->
<H2 CLASS="section"><!--SEC ANCHOR --><A NAME="htoc3">3</A>&#XA0;&#XA0;Thai in H<FONT SIZE=2><sup>E</sup></FONT>V<FONT SIZE=2><sup>E</sup></FONT>A</H2><!--SEC END --><P>For H<FONT SIZE=2><sup>E</sup></FONT>V<FONT SIZE=2><sup>E</sup></FONT>A the style (package) file &#X2018;thai.sty&#X2019; is not used. H<FONT SIZE=2><sup>E</sup></FONT>V<FONT SIZE=2><sup>E</sup></FONT>A
does not recognise the <CODE>{\thai ....}</CODE> or <CODE>\tb</CODE> constructs.
If these constructs are encountered, warnings will be issued and the
constructs will be ignored.</P><P>In order to use the Thai language with H<FONT SIZE=2><sup>E</sup></FONT>V<FONT SIZE=2><sup>E</sup></FONT>A, the preprocessor which
is normally used before invoking L<sup>A</sup>T<sub>E</sub>X should <EM>not</EM> be used.
The original (as typed) Thai L<sup>A</sup>T<sub>E</sub>X file should be passed directly to
H<FONT SIZE=2><sup>E</sup></FONT>V<FONT SIZE=2><sup>E</sup></FONT>A. The command <CODE>\usepackage{thai}</CODE> in the file is detected
by H<FONT SIZE=2><sup>E</sup></FONT>V<FONT SIZE=2><sup>E</sup></FONT>A and is used to establish a Thai character encoding. (It is
no longer necessary to use the command line flag &#X2013;charset=TIS-620.
This flag is no longer operational).</P><P>The commands required to process this file for both Thai L<sup>A</sup>T<sub>E</sub>X and
Thai H<FONT SIZE=2><sup>E</sup></FONT>V<FONT SIZE=2><sup>E</sup></FONT>A are listed in table&#XA0;<A HREF="#commands">1</A>. The original L<sup>A</sup>T<sub>E</sub>X
filename is assumed to be &#X2018;thaihevea.ttex&#X2019; (ttex = Thai tex).</P><BLOCKQUOTE CLASS="table"><DIV CLASS="center"><DIV CLASS="center"><HR WIDTH="80%" SIZE=2></DIV>
<TABLE CELLSPACING=6 CELLPADDING=0><TR><TD CLASS="hbar" COLSPAN=2></TD></TR>
<TR><TD ALIGN=left NOWRAP>for L<sup>A</sup>T<sub>E</sub>X</TD><TD ALIGN=left NOWRAP>&nbsp;</TD></TR>
<TR><TD CLASS="hbar" COLSPAN=2></TD></TR>
<TR><TD ALIGN=left NOWRAP><CODE>cttex  &lt;  thaihevea.ttex  &gt;  thaihevea.tex</CODE></TD><TD ALIGN=left NOWRAP>run preprocessor</TD></TR>
<TR><TD ALIGN=left NOWRAP><CODE>latex  thaihevea.tex</CODE></TD><TD ALIGN=left NOWRAP>compile using L<sup>A</sup>T<sub>E</sub>X</TD></TR>
<TR><TD ALIGN=left NOWRAP><CODE>dvips  thaihevea.dvi  -o</CODE></TD><TD ALIGN=left NOWRAP>convert using dvips</TD></TR>
<TR><TD ALIGN=left NOWRAP><CODE>gv  thaihevea.ps</CODE></TD><TD ALIGN=left NOWRAP>view using ghostview</TD></TR>
<TR><TD CLASS="hbar" COLSPAN=2></TD></TR>
<TR><TD ALIGN=left NOWRAP>for H<FONT SIZE=2><sup>E</sup></FONT>V<FONT SIZE=2><sup>E</sup></FONT>A</TD><TD ALIGN=left NOWRAP>&nbsp;</TD></TR>
<TR><TD CLASS="hbar" COLSPAN=2></TD></TR>
<TR><TD ALIGN=left NOWRAP><CODE>cp  thaihevea.ttex  thaihevea.tex</CODE></TD><TD ALIGN=left NOWRAP>&#X2018;rename&#X2019; file for benefit of H<FONT SIZE=2><sup>E</sup></FONT>V<FONT SIZE=2><sup>E</sup></FONT>A</TD></TR>
<TR><TD ALIGN=left NOWRAP><CODE>hevea  thaihevea.tex</CODE></TD><TD ALIGN=left NOWRAP>compile using H<FONT SIZE=2><sup>E</sup></FONT>V<FONT SIZE=2><sup>E</sup></FONT>A</TD></TR>
<TR><TD ALIGN=left NOWRAP><CODE>imagen thaihevea</CODE></TD><TD ALIGN=left NOWRAP>convert image to bitmap</TD></TR>
<TR><TD ALIGN=left NOWRAP><CODE>firefox  thaihevea.html</CODE></TD><TD ALIGN=left NOWRAP>view using web browser</TD></TR>
<TR><TD CLASS="hbar" COLSPAN=2></TD></TR>
</TABLE>
<DIV CLASS="caption"><TABLE CELLSPACING=6 CELLPADDING=0><TR><TD VALIGN=top ALIGN=left>Table 1: Processing Thai text with L<sup>A</sup>T<sub>E</sub>X and H<FONT SIZE=2><sup>E</sup></FONT>V<FONT SIZE=2><sup>E</sup></FONT>A.</TD></TR>
</TABLE></DIV> <A NAME="commands"></A>
<DIV CLASS="center"><HR WIDTH="80%" SIZE=2></DIV></DIV></BLOCKQUOTE><P>Since the Thai text is not processed to indicate where the text may be
broken, the decision is left to the application displaying the html
code. The browser I currently use (Firefox) doesn&#X2019;t know how to break
continuous Thai text in suitable places without external help.
However the screen width is larger than a page width, which means that
on average there are more natural breaks in any line, and the browser
is left justifying the text so it doesn&#X2019;t make large ugly gaps. The
right margin is ragged, not flush, but that looks acceptable (to me).</P><P>Following is a paragraph of Thai text. It doesn&#X2019;t say anything
important, it is simply here to serve as a basic test. Even if you
can&#X2019;t compile this with L<sup>A</sup>T<sub>E</sub>X (<EM>e.g.</EM> you don&#X2019;t have the
file thai.sty or a Thai character set for printing), you can still
compile it with H<FONT SIZE=2><sup>E</sup></FONT>V<FONT SIZE=2><sup>E</sup></FONT>A and make an English/Thai web page.</P><P>If you want to eliminate the Thai so you can compile an English-only
version of this document, simply insert a comment % character before
the <CODE>\thaistuff</CODE> command at the top of the file and uncomment the
second version of the command (which eliminates the Thai) on the
adjacent line.</P><P>
&#XE28;&#XE36;&#XE01;&#XE29;&#XE32;&#XE04;&#XE27;&#XE32;&#XE21;&#XE2B;&#XE21;&#XE32;&#XE22; &#XE04;&#XE27;&#XE32;&#XE21;&#XE2A;&#XE33;&#XE04;&#XE31;&#XE0D;&#XE02;&#XE2D;&#XE07;&#XE2A;&#XE34;&#XE48;&#XE07;&#XE41;&#XE27;&#XE14;&#XE25;&#XE49;&#XE2D;&#XE21;&#XE28;&#XE36;&#XE01;&#XE29;&#XE32; &#XE27;&#XE34;&#XE18;&#XE35;&#XE01;&#XE32;&#XE23;&#XE40;&#XE1C;&#XE22;&#XE41;&#XE1E;&#XE23;&#XE48;&#XE1B;&#XE23;&#XE30;&#XE0A;&#XE32;&#XE2A;&#XE31;&#XE21;&#XE1E;&#XE31;&#XE19;&#XE18;&#XE4C;
&#XE04;&#XE27;&#XE32;&#XE21;&#XE23;&#XE39;&#XE49;&#XE17;&#XE32;&#XE07;&#XE2A;&#XE34;&#XE48;&#XE07;&#XE41;&#XE27;&#XE14;&#XE25;&#XE49;&#XE2D;&#XE21; &#XE27;&#XE34;&#XE18;&#XE35;&#XE01;&#XE32;&#XE23;&#XE40;&#XE02;&#XE35;&#XE22;&#XE19;&#XE41;&#XE1C;&#XE19;&#XE07;&#XE32;&#XE19;&#XE40;&#XE1E;&#XE37;&#XE48;&#XE2D;&#XE40;&#XE1C;&#XE22;&#XE41;&#XE1E;&#XE23;&#XE48;&#XE04;&#XE27;&#XE32;&#XE21;&#XE23;&#XE39;&#XE49;&#XE17;&#XE32;&#XE07;&#XE2A;&#XE34;&#XE48;&#XE07;&#XE41;&#XE27;&#XE14;&#XE25;&#XE49;&#XE2D;&#XE21; &#XE19;&#XE33;
&#XE2A;&#XE34;&#XE48;&#XE07;&#XE41;&#XE27;&#XE14;&#XE25;&#XE49;&#XE2D;&#XE21;&#XE28;&#XE36;&#XE01;&#XE29;&#XE32;&#XE44;&#XE1B;&#XE1B;&#XE23;&#XE30;&#XE22;&#XE38;&#XE01;&#XE15;&#XE4C;&#XE43;&#XE0A;&#XE49;&#XE43;&#XE19;&#XE01;&#XE32;&#XE23;&#XE1E;&#XE31;&#XE12;&#XE19;&#XE32;&#XE41;&#XE25;&#XE30;&#XE40;&#XE1C;&#XE22;&#XE41;&#XE1E;&#XE23;&#XE48;&#XE04;&#XE27;&#XE32;&#XE21;&#XE23;&#XE39;&#XE49;&#XE02;&#XE49;&#XE2D;&#XE21;&#XE39;&#XE25; &#XE02;&#XE48;&#XE32;&#XE27;&#XE2A;&#XE32;&#XE23;&#XE15;&#XE48;&#XE32;&#XE07;&#XE46;
&#XE43;&#XE19;&#XE42;&#XE04;&#XE23;&#XE07;&#XE01;&#XE32;&#XE23;&#XE2D;&#XE37;&#XE48;&#XE19;&#XE46; &#XE17;&#XE35;&#XE48;&#XE21;&#XE35;&#XE04;&#XE27;&#XE32;&#XE21;&#XE2A;&#XE31;&#XE21;&#XE1E;&#XE31;&#XE19;&#XE18;&#XE4C;&#XE40;&#XE01;&#XE35;&#XE48;&#XE22;&#XE27;&#XE02;&#XE49;&#XE2D;&#XE07;
</P><BLOCKQUOTE CLASS="figure"><DIV CLASS="center"><DIV CLASS="center"><HR WIDTH="80%" SIZE=2></DIV>
<IMG SRC="thaihevea001.gif"><DIV CLASS="caption"><TABLE CELLSPACING=6 CELLPADDING=0><TR><TD VALIGN=top ALIGN=left>Figure 1: Thai Character Set</TD></TR>
</TABLE></DIV> <A NAME="thaichar"></A>
<DIV CLASS="center"><HR WIDTH="80%" SIZE=2></DIV></DIV></BLOCKQUOTE><!--CUT END -->
<!--HTMLFOOT-->
<!--ENDHTML-->
<!--FOOTER-->
<HR SIZE=2><BLOCKQUOTE CLASS="quote"><EM>This document was translated from L<sup>A</sup>T<sub>E</sub>X by
<A HREF="http://hevea.inria.fr/index.html">H<FONT SIZE=2><sup>E</sup></FONT>V<FONT SIZE=2><sup>E</sup></FONT>A</A>.</EM></BLOCKQUOTE></BODY>
</HTML>