[orca-list] Speech in general (Re: Capital, Capital, Capital)



Hi All:

I think it's important to take a step back and look at the overall problem we're facing. We have the desire for:

* Verbalized punctuation and capitalization
* Verbalized characters (e.g., 'double you' for 'w')
* Verbalized key names (e.g., 'Left Shift')
* Phrase spelling (letter-by-letter and 'military style')
* Customized pronunciation
* Abbreviation expansion
* Homograph disambiguation (the 'live' in 'Where do you live?'
  vs. 'I live in a cave')
* Natural F0 contour and prosody handling
* Audio cues
* Progress callbacks (e.g., 'this word was just spoken')
* Voice/pitch changes
* Multilingual support
* Etc.

A lot of these can be done at the speech layer without the need for additional knowledge. Things like voice/pitch changes based upon context (e.g., it's a link, it's a pushbutton, etc.) may still need to live at the screen reader level. Locale knowledge may also need to live a little higher in the stack.

We have a variety of speech synthesis engines, each of which does all or a subset of the above in different ways, with little standardization among them. SSML is an interesting option, but only a handful of engines really support it, and it doesn't provide support for things like 'say-as=military_spelling'.

We also have at least two kinds of users: 1) those who don't necessarily care about the details of what the speech synthesis engine supports and just want all of the above to work, and 2) those who are more aware of the TTS engine's capabilities and are willing to work within its limitations. Many of these same users are willing to pay for a high-quality commercial engine and expect Orca to work perfectly with it.

This is a pretty complex problem. Orca's current solution is to handle a lot of the above itself. Based upon the discussion, I think we're agreeing there should be some delegation to the lower layers: if Orca can learn that a lower layer supports a feature, it can delegate responsibility for that feature to that layer. If a lower layer doesn't support it, then Orca needs to provide it.
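A rough sketch of what that delegation could look like in Python. Everything here is hypothetical, not an existing Orca or engine API: the `Feature` names, the `supports()` query, and the fallback handler are all made up for illustration.

```python
from enum import Enum, auto

class Feature(Enum):
    # Hypothetical feature identifiers an engine driver might advertise.
    VERBALIZED_PUNCTUATION = auto()
    CHARACTER_SPELLING = auto()

class Backend:
    """Stand-in for an engine driver; a real driver would report what
    the underlying engine actually supports."""
    def __init__(self, supported=frozenset()):
        self.supported = set(supported)

    def supports(self, feature):
        return feature in self.supported

def verbalize_punctuation(text):
    # Minimal screen-reader-side fallback: spell out a few marks.
    names = {'.': ' period ', ',': ' comma ', '!': ' exclaim '}
    return ''.join(names.get(ch, ch) for ch in text)

def prepare_utterance(text, backend):
    """Delegate to the engine when it claims support; otherwise the
    screen reader does the work itself before handing the text over."""
    if backend.supports(Feature.VERBALIZED_PUNCTUATION):
        return text  # engine will speak the punctuation natively
    return verbalize_punctuation(text)

# Engine without support: the screen reader pre-processes the string.
print(prepare_utterance("Hi, all.", Backend()))
# Engine with support: the text passes through untouched.
print(prepare_utterance("Hi, all.", Backend({Feature.VERBALIZED_PUNCTUATION})))
```

The point is only the shape of the decision: one capability query per feature, with the screen-reader-side implementation kept around as the fallback.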

The first difficult task is figuring out how to obtain the information needed to do the appropriate delegation. There's no standardization across the engines. Take, for example, obtaining locale information: the programmatic representation of a locale differs greatly from engine to engine.
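To make the locale problem concrete, here is a sketch of the kind of normalization shim that ends up being necessary. The three input shapes below ('en-US' style strings, descriptive names, and (language, country) tuples) are invented examples, not taken from any real engine's API:

```python
def normalize_locale(raw):
    """Coerce an engine-specific locale value into 'll-CC' form.
    The alias table would grow one entry per engine quirk."""
    aliases = {"english_american": "en-US", "english_british": "en-GB"}
    if isinstance(raw, tuple):                     # e.g. ("en", "US")
        return f"{raw[0].lower()}-{raw[1].upper()}"
    if raw in aliases:                             # descriptive name
        return aliases[raw]
    lang, _, country = raw.replace("_", "-").partition("-")
    return f"{lang.lower()}-{country.upper()}" if country else lang.lower()

print(normalize_locale(("en", "US")))        # en-US
print(normalize_locale("english_american"))  # en-US
print(normalize_locale("en_us"))             # en-US
```

Each engine driver would feed its own representation through something like this so the layers above only ever see one form.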

Or take verbalized punctuation. The engines that support it each have their own idea of 'none', 'some', and 'all'. I'll guarantee you that delegating verbalized punctuation to the engine will result in at least one member of this list shouting angrily that some punctuation mark was spoken at the 'some' level with one engine but not another.
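Even the simple mapping of our three levels onto each engine's own vocabulary needs a per-engine table. The engine names and their accepted values here are illustrative only; no real engine is being quoted:

```python
# Screen-reader-side punctuation levels mapped onto each engine's
# own notion. Hypothetical engines and values, for illustration.
LEVEL_MAP = {
    "engine_a": {"none": 0, "some": 1, "all": 2},
    "engine_b": {"none": "off", "some": "most", "all": "everything"},
}

def engine_punctuation_level(engine, level):
    """Translate our level to whatever the engine expects; returns
    None for an unknown engine, meaning 'do not delegate at all'."""
    table = LEVEL_MAP.get(engine, {})
    return table.get(level, table.get("all"))

print(engine_punctuation_level("engine_a", "some"))   # 1
print(engine_punctuation_level("engine_b", "none"))   # off
```

And of course the table only maps the names; it says nothing about *which* marks each engine actually speaks at its 'some' level, which is exactly where the disagreements will come from.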

As a way to move forward, I think we need to flesh out the desires above and see what can be done to address them. The TTSAPI work does some of this, but I'll admit it was done before I had a better understanding of the screen reader problem: http://www.freebsoft.org/tts-api.

Will


