Re: Malayalam character classification



On Sat, 2004-07-31 at 11:12, Mahesh T. Pai wrote:
> Owen Taylor said on Fri, Jul 30, 2004 at 10:33:32AM -0400,:
> 
>  > As I read Peter Constable's proposal, I think you would need:
>  > 
>  >  ka + u + <space> + zwj + h + ya
>  > 
>  > That is, the syllable 'ku' followed by an isolated post-base form
>  > of ya. 
>  > 
>  >  ka + u + h + ya 
> 
> I am not very sure that  this proposal applies to Malayalam, and if at
> all, how  it is relevant.  I had  forwarded this to some  of the Indic
> language related lists, but nobody has responded so far.

The proposal perhaps isn't as *interesting* for Malayalam as for
some other languages since 'va' and 'ya' don't typically form ligatures
with the base consonant. (As far as I know.) But I don't see why
it wouldn't be relevant.

> As of now, h + ya and h+va (0D35) show the correct substitution in the
> font I used  in the (attached) png, but not for  Akruti.  Ummm ... the
> ligature ya  exists in Akruti,  as char 163;  and ligature va  is 123,
> which  too does  not render  as  expected. In  malotf, the  respective
> glyphs are 208  and 210. In the non-otf  Malayalam.ttf, the glyphs are
> ed2f, and 3d35 (va) resepctively.
> 
> I  think I have  just stumbled  on some  bug in  Akruti here.   The ya
> ligature is  rendered only with the h+ya+h+ya  sequence, which renders
> as  halant+ligature+standalone   ya.   Hmmm...   Akruti   is  behaving
> strangely here. 

An isolated 'h + ya' is not something that the Unicode standard 
envisions. I'm not sure we should worry too much about how it renders.
While eventually I'd like to have well-defined rendering output
for any arbitrary sequence of characters, the ones that matter most
are:

 - Correctly encoded according to Unicode
 - Represent sequences actually found in texts

Regards,
						Owen

Attachment: signature.asc
Description: This is a digitally signed message part



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]