Orca Introduction, comments and a question or two

From: Garry Turkington <garrys lists gmail com>
To: orca-list gnome org
Subject: Orca Introduction, comments and a question or two
Date: Mon, 11 Sep 2006 00:15:46 -0400 (EDT)

Hi all,

I've just started playing with Orca on a shiny new Edgy install and I mustsay that speech engine aside (more below) it's been a surprisinglypleasant experience. Given the relatively young status of Orca I thinkit's in great shape and shows tremendous potential. Hats off to allinvolved in the effort.

My road to getting Orca was a slightly convoluted one though much of thatcan be categorised as self-inflicted wounds. After years of relying onconsole access to Linux via either a DOS terminal session or Speakupdirectly I've wanted to experiment with the Gnome tools for some time.Due to other activities and the general home IT set-up my strongpreference was to do so under VMware Workstation with my main Windows boxas the host.

When I first tried this several months ago I discovered that most softwarespeech engines turn into random noise engines under VMware. I stronglysuspect this is due to some channel timing issues but can't prove that. Inever got a software speech engine working well enough under VMware to beusable so gave up and only returned to the topic this weekend. I knowthat some other people were hitting their heads against the same wall socan report that the Swift voices from Cepstral now work 'out of the box'under VMware; previously they required to be set in mono mode whichcouldn't easily be done from a higher-level speech layer.

So my set-up now is using a Cepstral Swift voice via Festival, using the"festivalify-cepstral-voice" Perl script provided by Cepstral. HackFestival and you can make the Swift voice the default and this provides aworking TTS engine under VMware. Hopefully that info will be of use tosomeone else.

That leads nicely into the first of my three questions, I find theperformance of Orca under this set-up to be pretty sluggish. I suspectthe problem isn't really Orca itself, more a combination of a beta OS, theuse of Festival as an intermediary layer and the VMware overhead. Playingwith some C test apps on the VM I am suspecting the Festival machinery asthe native Swift code seems more responsive than via Festival. Anyonegot experience with using Festival in this way or care to point the fingerelsewhere?

My next point may have me stirring up a hornet's nest but I'll take therisk. There seems to be an embarrassment of riches when it comes tointermediary TTS layers. There's Gnome-speech, there's Speech Dispatcherand as I've found even good old Festival can act in this role. I knowthat all these tools evolved from different backgrounds and had distinctinitial needs they were trying to meet but the situation now seemssomewhat duplicative to me. I may be betraying my background indistributed systems where abstraction layers get slapped atop anythingthat moves but is this situation as confused as I find it and if so isthere any sort of convergence going forward? Alternatively am I missingthe point somewhere?

Final question is less controversial. I find that setting the voice ratein the Orca control panel has no effect on the actual spoken speed.Insert and left/right arrows tell me the rate is being increased ordecreased but I find it either has no effect or actually does the reverse.I'm suspecting this a consequence of my funky Festival/Swift set-up as Idon't see any such bugs on Bugzilla and I guess this would be an obviousone if it was universal.

Many thanks for any input and I look forward to exploring Orca more overthe next days.


Regards,
Garry

--
Garry Turkington
garry turkington gmail com

Follow-Ups:
- Re: Orca Introduction, comments and a question or two
  - From: Cody Hurst
- Re: Orca Introduction, comments and a question or two
  - From: Kenny Hitt
- Re: Orca Introduction, comments and a question or two
  - From: Willie Walker

[Date Prev][Date Next] [Thread Prev][Thread Next] [Thread Index] [Date Index] [Author Index]