[dasher: 29/38] Change LM used for Mandarin(trad)/bopomofo, works better w/ unannotated texts



commit b4d1050454d8ec80d336675c39185c662f763d6c
Author: Alan Lawrence <acl33 inf phy cam ac uk>
Date:   Wed Dec 21 10:20:51 2011 +0000

    Change LM used for Mandarin(trad)/bopomofo, works better w/ unannotated texts
    
    (We don't supply a text, so hope the user will - it may be unannotated)

 Data/alphabets/alphabet.bopoTrad.xml |    3 ++-
 1 files changed, 2 insertions(+), 1 deletions(-)
---
diff --git a/Data/alphabets/alphabet.bopoTrad.xml b/Data/alphabets/alphabet.bopoTrad.xml
index f664c11..6ed45ca 100644
--- a/Data/alphabets/alphabet.bopoTrad.xml
+++ b/Data/alphabets/alphabet.bopoTrad.xml
@@ -4,7 +4,8 @@
 <alphabets>
 <alphabet name="Mandarin (trad) via &#x3105;&#x3106;&#x3107;&#x3108; (Bopomofo)">
 <!--The nature of this alphabet is to represent every possible (syllable+tone) combination in a list, ordered and grouped by the representation of that sound in Bopomofo. The sounds are only indirectly represented, as groups containing (often-)multiple chinese symbols (which may even appear in more than one place); the complete sound/symbol dataset comes from the file bpmf-trad-mand.cin, converted into Dasher's group tree by bpmf-trad-mand.py-->
-<conversionmode id="2" start="&lt;" stop="&gt;"/>
+<!--Note use of the route-learning model: since we don't supply a training text, we are hoping the user will have one of their own; but this may well be unannotated, and this model works better on unannotated texts than PPMPY-->
+<conversionmode id="4" start="&lt;" stop="&gt;"/>
 <orientation type="LR"/>
 <encoding type="Western"/>
 <palette>European/Asian</palette>



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]