No subject
Wed Oct 19 14:55:25 CEST 2011
Bohori=C4=8Dica *orthography*, since the script is Latin (as is clear in =
the Wikipedia page, which lists a, b, d, e, f, g, h, etc.), being used =
for Slovenian (ISO language name).=20
=20
The way to proceed is to propose a variant subtag (via =
ietf-languages at iana.org), particularly if you want to have your data be =
available for general use.
=20
See RFC 5646, especially sections 2.2.5, 3.5, and 3.6: =
http://www.inter-locale.com/ID/rfc5646.html.
=20
(Useful background reading: =
http://www.w3.org/International/articles/language-tags/Overview.en.php =
and http://www.w3.org/International/questions/qa-choosing-language-tags =
)
=20
I wrote to Doug Ewell, who is directly involved with IANA subtag =
registry, to verify this. He recommended you propose a variant subtag =
but noted:
> Until something is registered, a private-use tag like "sl-x-bohoric" =
(or "sl-x-boh" if brevity [is=20
> preferred] over readability) should work.=20
> Remember that if a variant is later registered, he will want to use =
the "official" tag instead of the=20
> private one, and changing tags on existing data can be a headache.=20
=20
I hope this helps. (Doug can assist you in making a request, if you =
decide to go that route.)
=20
With best wishes,
Deborah Anderson
Researcher, Dept. of Linguistics
UC Berkeley
=20
=20
From: TEI (Text Encoding Initiative) public discussion list =
[mailto:TEI-L at LISTSERV.BROWN.EDU] On Behalf Of Toma Tasovac
Sent: Friday, April 13, 2012 5:21 PM
To: TEI-L at LISTSERV.BROWN.EDU
Subject: Re: Code for Bohori=C4=8D alphabet?
=20
Dear Toma=C5=BE,
=20
So far I've just been using @xml:lang=3D"sl-boh" but I know this is =
sinful - but I'm not sure how it should be encoded.
=20
Wouldn't this actually be a good candidate for the x-subtag? Since ISO =
doesn't really recognize Bohori=C4=8Dica, using xml:lang=3D"sl-x-boh" =
would stress that fact without sacrificing the readability of the =
attribute value. And with private use subtags you are pretty much free =
to do whatever you want ("Private use subtags are used to indicate =
distinctions in language that are important in a given context by =
private agreement.")
=20
Then to be perfectly safe you could use <langUsage> and <language> in =
the header:=20
=20
<langUsage>
<language ident=3D"sl-x-boh">Slovenian written using the Bohori=C4=8D =
alphabet</language>
</langUsage>
=20
All best,
Toma
=E2=80=94=E2=80=94=E2=80=94=E2=80=94=E2=80=94=E2=80=94=E2=80=94=E2=80=94=E2=
=80=94=E2=80=94=E2=80=94=E2=80=94=E2=80=94=E2=80=94=E2=80=94=E2=80=94=E2=80=
=94=E2=80=94=E2=80=94=E2=80=94=E2=80=94
Toma Tasovac
Center for Digital Humanities (Belgrade, Serbia)=20
http://humanistika.org =E2=80=A2 http://transpoetika.org
=20
13.04.2012, =D0=B2 22:42, Tomaz Erjavec =
=D0=BD=D0=B0=D0=BF=D0=B8=D1=81=D0=B0=D0=BB(=D0=B0):
=20
Dear all,
in the context of a historical corpus of Slovene I'd want to mark texts =
that are written in the Bohori=C4=8D alphabet =
(http://en.wikipedia.org/wiki/Bohori%C4%8D_alphabet).
So far I've just been using @xml:lang=3D"sl-boh" but I know this is =
sinful - but I'm not sure how it should be encoded.
First, I'm not sure if it even qualifies as a "script", as e.g. I can't =
find a script for old English which used the long s, but maybe because =
this only substitutes one character for another - with Bohori=C4=8D =
it's more complicated.=20
Even taking it as a script (so I could write sl-Boho), =
http://www.tei-c.org/release/doc/tei-p5-doc/en/html/CH.html#CHSH does =
say that they should be taken from ISO 15924, =
http://unicode.org/iso15924/iso15924-codes.html and there is no Boho =
there; I also can't find an extension mechanism as there is with =
languages.
Any tips gratefully received.
Best,
Toma=C5=BE
--=20
Toma=C5=BE Erjavec, http://nl.ijs.si/et/
Dept. of Knowledge Technologies, Jo=C5=BEef Stefan Institute, Ljubljana
=20
------=_NextPart_000_01F8_01CD1B4B.D15A36E0
Content-Type: text/html;
charset="utf-8"
Content-Transfer-Encoding: quoted-printable
<html xmlns:v=3D"urn:schemas-microsoft-com:vml" =
xmlns:o=3D"urn:schemas-microsoft-com:office:office" =
xmlns:w=3D"urn:schemas-microsoft-com:office:word" =
xmlns:m=3D"http://schemas.microsoft.com/office/2004/12/omml" =
xmlns=3D"http://www.w3.org/TR/REC-html40"><head><meta =
http-equiv=3DContent-Type content=3D"text/html; charset=3Dutf-8"><meta =
name=3DGenerator content=3D"Microsoft Word 14 (filtered =
medium)"><style><!--
/* Font Definitions */
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
{font-family:Tahoma;
panose-1:2 11 6 4 3 5 4 4 2 4;}
@font-face
{font-family:Consolas;
panose-1:2 11 6 9 2 2 4 3 2 4;}
@font-face
{font-family:"Myriad Pro";
panose-1:0 0 0 0 0 0 0 0 0 0;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0cm;
margin-bottom:.0001pt;
font-size:12.0pt;
font-family:"Times New Roman","serif";}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:purple;
text-decoration:underline;}
p.MsoPlainText, li.MsoPlainText, div.MsoPlainText
{mso-style-priority:99;
mso-style-link:"Plain Text Char";
margin:0cm;
margin-bottom:.0001pt;
font-size:10.5pt;
font-family:Consolas;}
p.MsoAcetate, li.MsoAcetate, div.MsoAcetate
{mso-style-priority:99;
mso-style-link:"Balloon Text Char";
margin:0cm;
margin-bottom:.0001pt;
font-size:8.0pt;
font-family:"Tahoma","sans-serif";}
span.PlainTextChar
{mso-style-name:"Plain Text Char";
mso-style-priority:99;
mso-style-link:"Plain Text";
font-family:Consolas;}
span.BalloonTextChar
{mso-style-name:"Balloon Text Char";
mso-style-priority:99;
mso-style-link:"Balloon Text";
font-family:"Tahoma","sans-serif";}
span.apple-style-span
{mso-style-name:apple-style-span;}
span.EmailStyle22
{mso-style-type:personal;
font-family:"Calibri","sans-serif";
color:#1F497D;}
span.EmailStyle23
{mso-style-type:personal-reply;
font-family:"Calibri","sans-serif";
color:#1F497D;}
.MsoChpDefault
{mso-style-type:export-only;
font-size:10.0pt;}
@page WordSection1
{size:612.0pt 792.0pt;
margin:72.0pt 72.0pt 72.0pt 72.0pt;}
div.WordSection1
{page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext=3D"edit" spidmax=3D"1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext=3D"edit">
<o:idmap v:ext=3D"edit" data=3D"1" />
</o:shapelayout></xml><![endif]--></head><body lang=3DSL link=3Dblue =
vlink=3Dpurple><div class=3DWordSection1><p class=3DMsoNormal><font =
size=3D2 color=3D"#1f497d" face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'>Dear Deborah,<o:p></o:p></span></font></p><p class=3DMsoNormal><font =
size=3D2 color=3D"#1f497d" face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'>thanks a lot for your informative mail =E2=80=93 and, of course, to =
Toma, who steered me in the right direction; we've had some exchange off =
the list, and settled on sl-x-Boho.<o:p></o:p></span></font></p><p =
class=3DMsoNormal><font size=3D2 color=3D"#1f497d" face=3DCalibri><span =
lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'><o:p> </o:p></span></font></p><p class=3DMsoNormal><font =
size=3D2 color=3D"#1f497d" face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'>But even now, reading rfc 5646, I still think it is a script, rather =
than variant:<o:p></o:p></span></font></p><p class=3DMsoNormal><font =
size=3D2 color=3D"#1f497d" face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'>2.2.3.=C2=A0 Script Subtag are used to indicate the script or =
_<i><span style=3D'font-style:italic'>writing system =
variations</span></i>_ that distinguish the written forms of a =
language<o:p></o:p></span></font></p><p class=3DMsoNormal><font size=3D2 =
color=3D"#1f497d" face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'>2.2.5.=C2=A0 Variant Subtags are used to indicate additional, =
well-recognized variations that define _<i><span =
style=3D'font-style:italic'>a language or its =
dialects_</span></i><o:p></o:p></span></font></p><p =
class=3DMsoNormal><font size=3D2 color=3D"#1f497d" face=3DCalibri><span =
lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'><o:p> </o:p></span></font></p><p class=3DMsoNormal><font =
size=3D2 color=3D"#1f497d" face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'>I'd say Bohori=C4=8D is clearly a writing system variation, rather =
than a language =E2=80=93 that didn=E2=80=99t change suddenly with the =
switch to Gajica (~1850), which is what we use today (a-z + =
=C4=8D=C5=A1=C5=BE). <o:p></o:p></span></font></p><p =
class=3DMsoNormal><font size=3D2 color=3D"#1f497d" face=3DCalibri><span =
lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'><o:p> </o:p></span></font></p><p class=3DMsoNormal><font =
size=3D2 color=3D"#1f497d" face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'>If scripts, not only variants can be registered with IANA, =
I=E2=80=99d certainly like to do it =E2=80=93 except, while I=E2=80=99m =
at it, I=E2=80=99d also propose two others, which were briefly in vogue =
in Slovenia the mid-19<sup>th</sup> =
century.<o:p></o:p></span></font></p><p class=3DMsoNormal><font size=3D2 =
color=3D"#1f497d" face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'><o:p> </o:p></span></font></p><p class=3DMsoNormal><font =
size=3D2 color=3D"#1f497d" face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'>I agree that back-changing sl-x-Boho to sl-Boho is a pain, and time =
is actually tight, as I=E2=80=99m presenting the corpus in about a month =
=E2=80=93 is an IANA that fast?<o:p></o:p></span></font></p><p =
class=3DMsoNormal><font size=3D2 color=3D"#1f497d" face=3DCalibri><span =
lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'><o:p> </o:p></span></font></p><p class=3DMsoNormal><font =
size=3D2 color=3D"#1f497d" face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'>My question was also CCed to the Linux localisation user group of =
Slovenia, where I got links to ISO-15924, in =
particular:<o:p></o:p></span></font></p><p class=3DMsoPlainText><font =
size=3D2 color=3D"#1f497d" face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'>Notice of changes from ISO of that standard: <a =
href=3D"http://unicode.org/iso15924/codechanges.html"><font =
color=3D"#1f497d"><span =
style=3D'color:#1F497D;text-decoration:none'>http://unicode.org/iso15924/=
codechanges.html</span></font></a> <o:p></o:p></span></font></p><p =
class=3DMsoPlainText><font size=3D2 color=3D"#1f497d" =
face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'>Rules about adding new scripts (see article A.3.3): <a =
href=3D"http://www.unicode.org/iso15924/standard/index.html#annex"><font =
size=3D2 color=3D"#1f497d" face=3DConsolas><span =
style=3D'font-size:10.5pt;font-family:Consolas;color:#1F497D;text-decorat=
ion:none'>http://www.unicode.org/iso15924/standard/index.html#annex</span=
></font></a> =C2=A0<o:p></o:p></span></font></p><p =
class=3DMsoNormal><font size=3D2 color=3D"#1f497d" face=3DCalibri><span =
lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'>This does look more complicated =
though.<o:p></o:p></span></font></p><p class=3DMsoNormal><font size=3D2 =
color=3D"#1f497d" face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'><o:p> </o:p></span></font></p><p class=3DMsoNormal><font =
size=3D2 color=3D"#1f497d" face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'>All the best,<o:p></o:p></span></font></p><p class=3DMsoNormal><font =
size=3D2 color=3D"#1f497d" face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'>Toma=C5=BE<o:p></o:p></span></font></p><p class=3DMsoNormal><font =
size=3D2 color=3D"#1f497d" face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'><o:p> </o:p></span></font></p><p class=3DMsoNormal><font =
size=3D2 color=3D"#1f497d" face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'><o:p> </o:p></span></font></p><div><div =
style=3D'border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0cm =
0cm 0cm'><p class=3DMsoNormal style=3D'margin-left:35.4pt'><b><font =
size=3D2 face=3DTahoma><span lang=3DEN-GB =
style=3D'font-size:10.0pt;font-family:"Tahoma","sans-serif";font-weight:b=
old'>From:</span></font></b><font size=3D2 face=3DTahoma><span =
lang=3DEN-GB =
style=3D'font-size:10.0pt;font-family:"Tahoma","sans-serif"'> Deborah W. =
Anderson [mailto:dwanders at sonic.net] <br><b><span =
style=3D'font-weight:bold'>Sent:</span></b> Sunday, April 15, 2012 8:14 =
PM<br><b><span style=3D'font-weight:bold'>To:</span></b> =
ttasovac at TRANSPOETIKA.ORG; TEI-L at LISTSERV.BROWN.EDU<br><b><span =
style=3D'font-weight:bold'>Cc:</span></b> =
tomaz.erjavec at IJS.SI<br><b><span =
style=3D'font-weight:bold'>Subject:</span></b> RE: Code for Bohori=C4=8D =
alphabet?<o:p></o:p></span></font></p></div></div><p class=3DMsoNormal =
style=3D'margin-left:35.4pt'><font size=3D3 face=3D"Times New =
Roman"><span lang=3DEN-GB =
style=3D'font-size:12.0pt'><o:p> </o:p></span></font></p><p =
class=3DMsoPlainText style=3D'margin-left:35.4pt'><font size=3D2 =
face=3DConsolas><span lang=3DEN-GB style=3D'font-size:10.5pt'>Dear =
Toma=C5=BE (and Toma),<o:p></o:p></span></font></p><p =
class=3DMsoPlainText style=3D'margin-left:35.4pt'><font size=3D2 =
face=3DConsolas><span lang=3DEN-GB style=3D'font-size:10.5pt'>To add a =
bit to what Toma has written=E2=80=A6<o:p></o:p></span></font></p><p =
class=3DMsoPlainText style=3D'margin-left:35.4pt'><font size=3D2 =
face=3DConsolas><span lang=3DEN-GB =
style=3D'font-size:10.5pt'><o:p> </o:p></span></font></p><p =
class=3DMsoPlainText style=3D'margin-left:35.4pt'><font size=3D2 =
face=3DConsolas><span lang=3DEN-GB style=3D'font-size:10.5pt'>From my =
reading of your message, you want to identify the Bohori=C4=8Dica =
*<b><span style=3D'font-weight:bold'>orthography</span></b>*, since the =
script is Latin (as is clear in the Wikipedia page, which lists a, b, d, =
e, f, g, h, etc.), being used for Slovenian (ISO language name). =
<o:p></o:p></span></font></p><p class=3DMsoPlainText =
style=3D'margin-left:35.4pt'><font size=3D2 face=3DConsolas><span =
lang=3DEN-GB =
style=3D'font-size:10.5pt'><o:p> </o:p></span></font></p><p =
class=3DMsoPlainText style=3D'margin-left:35.4pt'><font size=3D2 =
face=3DConsolas><span lang=3DEN-GB style=3D'font-size:10.5pt'>The way to =
proceed is to propose a variant subtag (via <a =
href=3D"mailto:ietf-languages at iana.org">ietf-languages at iana.org</a>), =
particularly if you want to have your data be available for general =
use.<o:p></o:p></span></font></p><p class=3DMsoPlainText =
style=3D'margin-left:35.4pt'><font size=3D2 face=3DConsolas><span =
lang=3DEN-GB =
style=3D'font-size:10.5pt'><o:p> </o:p></span></font></p><p =
class=3DMsoPlainText style=3D'margin-left:35.4pt'><font size=3D2 =
face=3DConsolas><span lang=3DEN-GB style=3D'font-size:10.5pt'>See RFC =
5646, especially sections 2.2.5, 3.5, and 3.6: <a =
href=3D"http://www.inter-locale.com/ID/rfc5646.html">http://www.inter-loc=
ale.com/ID/rfc5646.html</a>.<o:p></o:p></span></font></p><p =
class=3DMsoPlainText style=3D'margin-left:35.4pt'><font size=3D2 =
face=3DConsolas><span lang=3DEN-GB =
style=3D'font-size:10.5pt'><o:p> </o:p></span></font></p><p =
class=3DMsoPlainText style=3D'margin-left:35.4pt'><font size=3D2 =
face=3DConsolas><span lang=3DEN-GB style=3D'font-size:10.5pt'>(Useful =
background reading: <a =
href=3D"http://www.w3.org/International/articles/language-tags/Overview.e=
n.php">http://www.w3.org/International/articles/language-tags/Overview.en=
.php</a> and <a =
href=3D"http://www.w3.org/International/questions/qa-choosing-language-ta=
gs">http://www.w3.org/International/questions/qa-choosing-language-tags</=
a> )<o:p></o:p></span></font></p><p class=3DMsoPlainText =
style=3D'margin-left:35.4pt'><font size=3D2 face=3DConsolas><span =
lang=3DEN-GB =
style=3D'font-size:10.5pt'><o:p> </o:p></span></font></p><p =
class=3DMsoPlainText style=3D'margin-left:35.4pt'><font size=3D2 =
face=3DConsolas><span lang=3DEN-GB style=3D'font-size:10.5pt'>I wrote to =
Doug Ewell, who is directly involved with IANA subtag registry, to =
verify this. He recommended you propose a variant subtag but =
noted:<o:p></o:p></span></font></p><p class=3DMsoPlainText =
style=3D'margin-left:35.4pt'><font size=3D2 face=3DConsolas><span =
lang=3DEN-GB style=3D'font-size:10.5pt'><br>> Until something is =
registered, a private-use tag like "sl-x-bohoric" (or =
"sl-x-boh" if brevity [is <o:p></o:p></span></font></p><p =
class=3DMsoPlainText style=3D'margin-left:35.4pt'><font size=3D2 =
face=3DConsolas><span lang=3DEN-GB style=3D'font-size:10.5pt'>> =
preferred] over readability) should work. =
<o:p></o:p></span></font></p><p class=3DMsoPlainText =
style=3D'margin-left:35.4pt'><font size=3D2 face=3DConsolas><span =
lang=3DEN-GB style=3D'font-size:10.5pt'>> Remember that if a variant =
is later registered, he will want to use the "official" tag =
instead of the <o:p></o:p></span></font></p><p class=3DMsoPlainText =
style=3D'margin-left:35.4pt'><font size=3D2 face=3DConsolas><span =
lang=3DEN-GB style=3D'font-size:10.5pt'>> private one, and changing =
tags on existing data can be a headache. <o:p></o:p></span></font></p><p =
class=3DMsoNormal style=3D'margin-left:35.4pt'><font size=3D3 =
color=3Dblack face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:12.0pt;font-family:"Calibri","sans-serif";color:black'=
><o:p> </o:p></span></font></p><p class=3DMsoPlainText =
style=3D'margin-left:35.4pt'><font size=3D2 face=3DConsolas><span =
lang=3DEN-GB style=3D'font-size:10.5pt'>I hope this helps. (Doug =
can assist you in making a request, if you decide to go that =
route.)<o:p></o:p></span></font></p><p class=3DMsoPlainText =
style=3D'margin-left:35.4pt'><font size=3D2 face=3DConsolas><span =
lang=3DEN-GB =
style=3D'font-size:10.5pt'><o:p> </o:p></span></font></p><p =
class=3DMsoPlainText style=3D'margin-left:35.4pt'><font size=3D2 =
face=3DConsolas><span lang=3DEN-GB style=3D'font-size:10.5pt'>With best =
wishes,<o:p></o:p></span></font></p><p class=3DMsoPlainText =
style=3D'margin-left:35.4pt'><font size=3D2 face=3DConsolas><span =
lang=3DEN-GB style=3D'font-size:10.5pt'>Deborah =
Anderson<o:p></o:p></span></font></p><p class=3DMsoPlainText =
style=3D'margin-left:35.4pt'><font size=3D2 face=3DConsolas><span =
lang=3DEN-GB style=3D'font-size:10.5pt'>Researcher, Dept. of =
Linguistics<o:p></o:p></span></font></p><p class=3DMsoPlainText =
style=3D'margin-left:35.4pt'><font size=3D2 face=3DConsolas><span =
lang=3DEN-GB style=3D'font-size:10.5pt'>UC =
Berkeley<o:p></o:p></span></font></p><p class=3DMsoNormal =
style=3D'margin-left:35.4pt'><font size=3D2 color=3D"#1f497d" =
face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'><o:p> </o:p></span></font></p><p class=3DMsoNormal =
style=3D'margin-left:35.4pt'><font size=3D2 color=3D"#1f497d" =
face=3DCalibri><span lang=3DEN-GB =
style=3D'font-size:11.0pt;font-family:"Calibri","sans-serif";color:#1F497=
D'><o:p> </o:p></span></font></p><div><div =
style=3D'border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0cm =
0cm 0cm'><p class=3DMsoNormal style=3D'margin-left:35.4pt'><b><font =
size=3D2 face=3DTahoma><span lang=3DEN-GB =
style=3D'font-size:10.0pt;font-family:"Tahoma","sans-serif";font-weight:b=
old'>From:</span></font></b><font size=3D2 face=3DTahoma><span =
lang=3DEN-GB =
style=3D'font-size:10.0pt;font-family:"Tahoma","sans-serif"'> TEI (Text =
Encoding Initiative) public discussion list <a =
href=3D"mailto:[mailto:TEI-L at LISTSERV.BROWN.EDU]">[mailto:TEI-L at LISTSERV.=
BROWN.EDU]</a> <b><span style=3D'font-weight:bold'>On Behalf Of =
</span></b>Toma Tasovac<br><b><span =
style=3D'font-weight:bold'>Sent:</span></b> Friday, April 13, 2012 5:21 =
PM<br><b><span style=3D'font-weight:bold'>To:</span></b> <a =
href=3D"mailto:TEI-L at LISTSERV.BROWN.EDU">TEI-L at LISTSERV.BROWN.EDU</a><br>=
<b><span style=3D'font-weight:bold'>Subject:</span></b> Re: Code for =
Bohori=C4=8D alphabet?<o:p></o:p></span></font></p></div></div><p =
class=3DMsoNormal style=3D'margin-left:35.4pt'><font size=3D3 =
face=3D"Times New Roman"><span lang=3DEN-GB =
style=3D'font-size:12.0pt'><o:p> </o:p></span></font></p><div><p =
class=3DMsoNormal style=3D'margin-left:35.4pt'><font size=3D3 =
face=3D"Times New Roman"><span lang=3DEN-GB =
style=3D'font-size:12.0pt'>Dear =
Toma=C5=BE,<o:p></o:p></span></font></p></div><div><p class=3DMsoNormal =
style=3D'margin-left:35.4pt'><font size=3D3 face=3D"Times New =
Roman"><span lang=3DEN-GB =
style=3D'font-size:12.0pt'><o:p> </o:p></span></font></p></div><div>=
<blockquote style=3D'margin-top:5.0pt;margin-bottom:5.0pt'><div><p =
class=3DMsoNormal style=3D'margin-left:35.4pt'><font size=3D3 =
face=3D"Times New Roman"><span lang=3DEN-GB =
style=3D'font-size:12.0pt'>So far I've just been using =
@xml:lang=3D"sl-boh" but I know this is sinful - but I'm not =
sure how it should be =
encoded.<o:p></o:p></span></font></p></div></blockquote><div><p =
class=3DMsoNormal style=3D'margin-left:35.4pt'><font size=3D3 =
face=3D"Times New Roman"><span lang=3DEN-GB =
style=3D'font-size:12.0pt'><o:p> </o:p></span></font></p></div><div>=
<div><div><p class=3DMsoNormal style=3D'margin-left:35.4pt'><font =
size=3D4 color=3Dblack face=3D"Myriad Pro"><span lang=3DEN-GB =
style=3D'font-size:13.5pt;font-family:"Myriad =
Pro","serif";color:black'>Wouldn't this actually be a good candidate for =
the x-subtag? Since ISO doesn't really recognize Bohori=C4=8Dica, =
using xml:lang=3D"sl-x-boh" would stress that fact =
without sacrificing the readability of the attribute value. And =
with private use subtags you are pretty much free to do whatever =
you want ("Private use subtags are used to indicate distinctions in =
language that are important in a given context by private =
agreement.")<o:p></o:p></span></font></p></div><div><p =
class=3DMsoNormal style=3D'margin-left:35.4pt'><font size=3D4 =
color=3Dblack face=3D"Myriad Pro"><span lang=3DEN-GB =
style=3D'font-size:13.5pt;font-family:"Myriad =
Pro","serif";color:black'><o:p> </o:p></span></font></p></div><div><=
p class=3DMsoNormal style=3D'margin-left:35.4pt'><font size=3D4 =
color=3Dblack face=3D"Myriad Pro"><span lang=3DEN-GB =
style=3D'font-size:13.5pt;font-family:"Myriad =
Pro","serif";color:black'>Then to be perfectly safe you could use =
<langUsage> and <language> in the =
header: <o:p></o:p></span></font></p></div><div><p =
class=3DMsoNormal style=3D'margin-left:35.4pt'><font size=3D4 =
color=3Dblack face=3D"Myriad Pro"><span lang=3DEN-GB =
style=3D'font-size:13.5pt;font-family:"Myriad =
Pro","serif";color:black'><o:p> </o:p></span></font></p></div><div><=
p class=3DMsoNormal style=3D'margin-left:35.4pt'><font size=3D4 =
color=3Dblack face=3D"Myriad Pro"><span lang=3DEN-GB =
style=3D'font-size:13.5pt;font-family:"Myriad =
Pro","serif";color:black'><langUsage><o:p></o:p></span></font></p><=
/div><div><p class=3DMsoNormal style=3D'margin-left:35.4pt'><font =
size=3D4 color=3Dblack face=3D"Myriad Pro"><span lang=3DEN-GB =
style=3D'font-size:13.5pt;font-family:"Myriad =
Pro","serif";color:black'><language =
ident=3D"sl-x-boh">Slovenian written using the Bohori=C4=8D =
alphabet</language><o:p></o:p></span></font></p></div><div><p =
class=3DMsoNormal style=3D'margin-left:35.4pt'><font size=3D4 =
color=3Dblack face=3D"Myriad Pro"><span lang=3DEN-GB =
style=3D'font-size:13.5pt;font-family:"Myriad =
Pro","serif";color:black'></langUsage><o:p></o:p></span></font></p>=
</div><div><p class=3DMsoNormal style=3D'margin-left:35.4pt'><font =
size=3D4 color=3Dblack face=3D"Myriad Pro"><span lang=3DEN-GB =
style=3D'font-size:13.5pt;font-family:"Myriad =
Pro","serif";color:black'><o:p> </o:p></span></font></p></div><div><=
p class=3DMsoNormal style=3D'margin-left:35.4pt'><font size=3D4 =
color=3Dblack face=3D"Myriad Pro"><span lang=3DEN-GB =
style=3D'font-size:13.5pt;font-family:"Myriad =
Pro","serif";color:black'>All =
best,<o:p></o:p></span></font></p></div><div><p class=3DMsoNormal =
style=3D'margin-left:35.4pt'><font size=3D4 color=3Dblack face=3D"Myriad =
Pro"><span lang=3DEN-GB style=3D'font-size:13.5pt;font-family:"Myriad =
Pro","serif";color:black'>Toma<o:p></o:p></span></font></p></div><div><p =
class=3DMsoNormal style=3D'margin-left:35.4pt'><font size=3D4 =
color=3Dblack face=3D"Myriad Pro"><span lang=3DEN-GB =
style=3D'font-size:13.5pt;font-family:"Myriad =
Pro","serif";color:black'>=E2=80=94=E2=80=94=E2=80=94=E2=80=94=E2=80=94=E2=
=80=94=E2=80=94=E2=80=94=E2=80=94=E2=80=94=E2=80=94=E2=80=94=E2=80=94=E2=80=
=94=E2=80=94=E2=80=94=E2=80=94=E2=80=94=E2=80=94=E2=80=94=E2=80=94<br>Tom=
a Tasovac<br>Center for Digital Humanities (Belgrade, =
Serbia) <br><a =
href=3D"http://humanistika.org">http://humanistika.org</a> =E2=80=A2=
<a =
href=3D"http://transpoetika.org">http://transpoetika.org</a><o:p></o:p></=
span></font></p></div></div></div><p class=3DMsoNormal =
style=3D'margin-left:35.4pt'><font size=3D3 face=3D"Times New =
Roman"><span lang=3DEN-GB =
style=3D'font-size:12.0pt'><o:p> </o:p></span></font></p><div><div><=
p class=3DMsoNormal style=3D'margin-left:35.4pt'><font size=3D3 =
face=3D"Times New Roman"><span lang=3DEN-GB =
style=3D'font-size:12.0pt'>13.04.2012, =D0=B2 22:42, Tomaz Erjavec =
=D0=BD=D0=B0=D0=BF=D0=B8=D1=81=D0=B0=D0=BB(=D0=B0):<o:p></o:p></span></fo=
nt></p></div><p class=3DMsoNormal =
style=3D'mso-margin-top-alt:0cm;margin-right:0cm;margin-bottom:12.0pt;mar=
gin-left:35.4pt'><font size=3D3 face=3D"Times New Roman"><span =
lang=3DEN-GB =
style=3D'font-size:12.0pt'><o:p> </o:p></span></font></p><div><p =
class=3DMsoNormal style=3D'margin-left:35.4pt'><font size=3D3 =
face=3D"Times New Roman"><span lang=3DEN-GB =
style=3D'font-size:12.0pt'>Dear all,<br>in the context of a historical =
corpus of Slovene I'd want to mark texts that are written in the =
Bohori=C4=8D alphabet (<a =
href=3D"http://en.wikipedia.org/wiki/Bohori%C4%8D_alphabet">http://en.wik=
ipedia.org/wiki/Bohori%C4%8D_alphabet</a>).<br>So far I've just been =
using @xml:lang=3D"sl-boh" but I know this is sinful - but I'm =
not sure how it should be encoded.<br>First, I'm not sure if it even =
qualifies as a "script", as e.g. I can't find a script for old =
English which used the long s, but maybe because this only substitutes =
one character for another - with Bohori=C4=8D it's more =
complicated. <br>Even taking it as a script (so I could write sl-Boho), =
<a =
href=3D"http://www.tei-c.org/release/doc/tei-p5-doc/en/html/CH.html#CHSH"=
>http://www.tei-c.org/release/doc/tei-p5-doc/en/html/CH.html#CHSH</a> =
does say that they should be taken from ISO 15924, <a =
href=3D"http://unicode.org/iso15924/iso15924-codes.html">http://unicode.o=
rg/iso15924/iso15924-codes.html</a> and there is no Boho there; I also =
can't find an extension mechanism as there is with languages.<br>Any =
tips gratefully received.<br>Best,<br>Toma=C5=BE<br>-- <br>Toma=C5=BE =
Erjavec, <a =
href=3D"http://nl.ijs.si/et/">http://nl.ijs.si/et/</a><br>Dept. of =
Knowledge Technologies, Jo=C5=BEef Stefan Institute, =
Ljubljana<o:p></o:p></span></font></p></div></div><p class=3DMsoNormal =
style=3D'margin-left:35.4pt'><font size=3D3 face=3D"Times New =
Roman"><span lang=3DEN-GB =
style=3D'font-size:12.0pt'><o:p> </o:p></span></font></p></div></div=
></body></html>
------=_NextPart_000_01F8_01CD1B4B.D15A36E0--
More information about the lugos-slo
mailing list