Fixes to the transliteration table in ext/misc/spellfix.c

Previous Topic Next Topic
 
classic Classic list List threaded Threaded
3 messages Options
Reply | Threaded
Open this post in threaded view
|

Fixes to the transliteration table in ext/misc/spellfix.c

Marcin Ciura
In http://www.sqlite.org/src/artifact/b3a644285cb008f3:

  { 0x0392,  0x42, 0x00, 0x00, 0x00 },  /* Β to B */
  { 0x03B2,  0x62, 0x00, 0x00, 0x00 },  /* β to b */

Although Ancient Greek beta is rendered as B, Modern Greek beta is
transliterated as V:
https://en.wikipedia.org/wiki/Romanization_of_Greek#Modern_Greek

  { 0x040B,  0x44, 0x00, 0x00, 0x00 },  /* Ћ to D */
  { 0x045B,  0x64, 0x00, 0x00, 0x00 },  /* ћ to d */

This Serbian letter corresponds to Croatian Ć, so it should be
transliterated as C: https://en.wikipedia.org/wiki/Tshe

  { 0x0409,  0x49, 0x00, 0x00, 0x00 },  /* Љ to I */
  { 0x0459,  0x69, 0x00, 0x00, 0x00 },  /* љ to i */

This Serbian letter corresponds to the Croatian digraph Lj, so it should be
transliterated as L (probably not Lj for consistency with Њ transliterated
as N): https://en.wikipedia.org/wiki/Lje

  { 0x0426,  0x54, 0x63, 0x00, 0x00 },  /* Ц to Tc */
  { 0x0446,  0x74, 0x63, 0x00, 0x00 },  /* ц to tc */

This Cyrillic letter is usually transliterated as Ts.  Unless "Tc" is a
hack meant to cover both "Ts" and the rarer transliteration "C", it looks
like a typo: https://en.wikipedia.org/wiki/Tse_(Cyrillic)

Best regards,
Marcin Ciura
_______________________________________________
sqlite-users mailing list
[hidden email]
http://mailinglists.sqlite.org/cgi-bin/mailman/listinfo/sqlite-users
Reply | Threaded
Open this post in threaded view
|

Re: Fixes to the transliteration table in ext/misc/spellfix.c

Ian Zimmerman-2
On 2018-03-30 11:28, Marcin Ciura wrote:

>   { 0x0426,  0x54, 0x63, 0x00, 0x00 },  /* Ц to Tc */
>   { 0x0446,  0x74, 0x63, 0x00, 0x00 },  /* ц to tc */
>
> This Cyrillic letter is usually transliterated as Ts.  Unless "Tc" is
> a hack meant to cover both "Ts" and the rarer transliteration "C", it
> looks like a typo: https://en.wikipedia.org/wiki/Tse_(Cyrillic)

Also often as "Cz".  Unfortunately :-P

--
Please don't Cc: me privately on mailing lists and Usenet,
if you also post the followup to the list or newsgroup.
To reply privately _only_ on Usenet and on broken lists
which rewrite From, fetch the TXT record for no-use.mooo.com.
_______________________________________________
sqlite-users mailing list
[hidden email]
http://mailinglists.sqlite.org/cgi-bin/mailman/listinfo/sqlite-users
Reply | Threaded
Open this post in threaded view
|

Re: Fixes to the transliteration table in ext/misc/spellfix.c

Will Parsons
On Friday, 30 Mar 2018 11:38 AM -0400, Ian Zimmerman wrote:

> On 2018-03-30 11:28, Marcin Ciura wrote:
>
>>   { 0x0426,  0x54, 0x63, 0x00, 0x00 },  /* Ц to Tc */
>>   { 0x0446,  0x74, 0x63, 0x00, 0x00 },  /* ц to tc */
>>
>> This Cyrillic letter is usually transliterated as Ts.  Unless "Tc" is
>> a hack meant to cover both "Ts" and the rarer transliteration "C", it
>> looks like a typo: https://en.wikipedia.org/wiki/Tse_(Cyrillic)
>
> Also often as "Cz".  Unfortunately :-P

"Often"?  I don't think so.  "Ts" is normal for most uses; "c" is
typically used in scientific/linguistic works.

--
Will

_______________________________________________
sqlite-users mailing list
[hidden email]
http://mailinglists.sqlite.org/cgi-bin/mailman/listinfo/sqlite-users