Re: [orca-list] gnome-speech, and audio output, moving forward.

From: "Michael Whapples" <mwhapples aim com>
To: <orca-list gnome org>
Subject: Re: [orca-list] gnome-speech, and audio output, moving forward.
Date: Tue, 18 Sep 2007 15:44:02 +0100

Hello,

Here are my thoughts on this. Yes, it would be good to have the speech API (gnome-speech) handling the audio. I say this having now seen speech-dispatcher 0.6.4, which works well with its espeak driver (not the espeak-generic one, but the specific espeak driver). I know that some people have had issues with the stability of speech-dispatcher in the past, and I don't know if those still exist now for those users, but it works well for me. So if it was stable for all, then I would say that speech-dispatcher is what is needed. Adding to what you said about commercial synths, doesn't speech-dispatcher allow use of ibmtts through ALSA, as speech-dispatcher handles the audio, like it does for espeak now, so getting round the issue of the commercial synths supporting only OSS?

Also in favour of something like speech-dispatcher is that some of us don't always want GNOME, and may have some systems with only speakup and command line stuff; gnome-speech depends on some GNOME components and so would require GNOME to be installed unnecessarily. There is also the issue of conflicts between speech APIs: where I use speakup for the command line but may also have GNOME installed (and Orca in use), speech-dispatcher handles these two systems trying to use the same synth.
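
To illustrate the idea of one service arbitrating between several speech clients, here is a minimal sketch in Python. It is not how speech-dispatcher is actually implemented; the `Dispatcher` class and the `synth` callable are hypothetical names, and the only point is that a single worker owns the synth while clients such as speakup and Orca merely queue requests.

```python
import queue
import threading


class Dispatcher:
    """Hypothetical arbiter: one worker thread is the only user of the synth."""

    def __init__(self, synth):
        self._synth = synth              # callable(client, text); assumed, not a real API
        self._requests = queue.Queue()   # requests from all clients, in arrival order
        self._worker = threading.Thread(target=self._run, daemon=True)
        self._worker.start()

    def say(self, client, text):
        # Clients never touch the synth directly; they only enqueue a request.
        self._requests.put((client, text))

    def _run(self):
        while True:
            item = self._requests.get()
            if item is None:             # sentinel posted by close()
                break
            client, text = item
            self._synth(client, text)    # one utterance at a time

    def close(self):
        self._requests.put(None)
        self._worker.join()


spoken = []
dispatcher = Dispatcher(lambda client, text: spoken.append((client, text)))
dispatcher.say("speakup", "login:")
dispatcher.say("orca", "Terminal window")
dispatcher.close()
```

Because both clients go through the same queue, their utterances reach the synth one after another rather than competing for the device.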

So what I am saying is, is it time to drop gnome-speech, and try and make another system such as speech-dispatcher more robust? I think TTSAPI may be trying to be the replacement for speech-dispatcher, but I don't know how it is performing.


From
Michael Whapples

----- Original Message -----
From: "Luke Yelavich" <themuso themuso com>
To: "Orca screen reader developers" <orca-list gnome org>; "Ubuntu Accessibility Development Mailing List" <ubuntu-accessibility-devel lists ubuntu com>; "Gnome Accessibility List" <gnome-accessibility-list gnome org>; "GNOME Accessibility Developers" <gnome-accessibility-devel gnome org>
Sent: Tuesday, September 18, 2007 1:22 PM
Subject: [orca-list] gnome-speech, and audio output, moving forward.

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Greetings all.
For a while now, it has been possible to have multiple audio streams playing at the same time, using ALSA's dmix plugin under Linux. This also has meant the ability to have speech audible at the same time as other audio. Users have desired the ability to do this for a while now, particularly since it has been possible in other operating systems for a long time.
Since eSpeak has been developed, we have had a very usable synthesizer for speech output, which supports a growing number of languages. Since this synthesizer is cross-platform, the choice was made by the author to use PortAudio, thereby supporting all platforms where PortAudio is available. Since PortAudio v19, it has been possible to use ALSA for audio output via PortAudio. In theory, this is good news, however in practice, this has created more problems than it should solve, for the following reasons, as far as I see things:
* PortAudio v19 has had no official release, and so seems to be in a rather constant state of flux, making it difficult for distros to reliably support a working version.
* PortAudio's ALSA implementation seems to currently be broken, which is evident while using eSpeak, and attempting to speak multiple strings of text rapidly over a short period of time.
* As far as I've seen, there is no easy way for the user to select which output device PortAudio should use. Added to that, if more than one app is using PortAudio, this will affect that application as well as eSpeak, which may not be what the user desires.
* All proprietary synths only support OSS output, which makes simultaneous audio and speech currently impossible.
What I would like to propose is the following. Since a large portion of GNOME's multimedia framework is now using GStreamer, I would like to suggest that we make all gnome-speech drivers use GStreamer, and if possible, add another option to the sound preferences, to allow the user to select which soundcard they wish to use for speech output. This would result in GStreamer being used via ALSA on Linux, thereby allowing simultaneous audio and speech, which would likely happen at the GStreamer level before it even reaches ALSA. (I don't really know how GStreamer works, so this is a guess on my part.)
From what I have seen, just about all proprietary synth APIs support sending audio data from the synth back to the calling application, thereby allowing the audio to be sent wherever the application wishes. I am well aware that gnome-speech was initially designed to not care about how the audio was played, but since its initial inclusion in GNOME, GStreamer has become the standard multimedia framework for GNOME, and at least in Ubuntu's implementation, allows the user to set different devices for several different uses, such as sound events, music and movies, and audio/video conferencing.
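
The callback arrangement described above can be sketched roughly as follows, in Python. Everything here is an assumption for illustration: `fake_synth_speak` stands in for a synth API that hands audio back to the caller, and the "devices" are plain in-memory buffers rather than real GStreamer or ALSA sinks.

```python
import io
from typing import Callable


def fake_synth_speak(text: str,
                     on_audio: Callable[[bytes], object],
                     chunk_size: int = 4) -> None:
    """Hypothetical synth: produces 16-bit PCM and returns it via a callback."""
    # One fake 16-bit sample per character; real synths produce actual speech audio.
    pcm = b"".join(
        (ord(ch) * 64 % 32768).to_bytes(2, "little", signed=True) for ch in text
    )
    for start in range(0, len(pcm), chunk_size):
        on_audio(pcm[start:start + chunk_size])


def speak_to(sink, text: str) -> None:
    # The application, not the synth, decides where the audio ends up.
    fake_synth_speak(text, sink.write)


# Two stand-in output devices; the user would choose one for speech.
music_device = io.BytesIO()
speech_device = io.BytesIO()

speak_to(speech_device, "hello")
```

Since the synth only ever sees the callback, changing the output device means passing a different sink and leaves the driver itself untouched.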
I think we owe users the ability to use speech alongside audio, and offer it in an easy to use way, thereby putting full control in their hands. Now that we are at the beginning of a new GNOME release, I personally think it's time to get serious about offering users a decent screen reader and speech experience, the same, if not better than what other operating systems offer.
I have sent this post to these lists, to try and get as wide a viewpoint, and discussion as possible. I would appreciate any replies to be sent to all lists, to ensure everybody can participate in the discussion.
I would like to invite both users and developers to express their views on a matter which I believe needs resolving. Input from GNOME devs, particularly those for gnome-speech, is very much welcome.
So, let's sort something out.
--
Luke Yelavich
GPG key: 0xD06320CE
(http://www.themuso.com/themuso-gpg-key.txt)
Email & MSN: themuso themuso com
Jabber: themuso jabber org au
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (GNU/Linux)

iD8DBQFG78MHjVefwtBjIM4RAmvvAKCHJH5ZlcpwSwweLV9a/1mMJMXQHQCfTdtH
WXhAp+9KaQv85VOYyGKmtYw=
=46d4
-----END PGP SIGNATURE-----

