Archive for the 'Speech Technology' Category

Speech Recognition removed from Android

Friday, March 7th, 2008

Google have quietly removed the speech.recognition package from the Android API. I say quietly: the removal is noted in the API Diff specification for M3-RC37a to M5-RC14, released 15th Feb, but I haven’t been able to find any more public announcements - for example, it wasn’t mentioned in the m5-rc-14 release announcement. Google […]

Trefnydd-0.0.3

Thursday, January 10th, 2008

Trefnydd is an open-source toolkit for building speech recognition engines. Tools support the compilation of corpora and pronunciation dictionaries, creation of acoustic and language models, and generation of speech recognition engines compatible with the Media Resource Control Protocol (MRCP). Trefnydd can be used either via its GUI, or by incorporating the tools directly into your […]

“It seems like Google called for a party and forgot to order beer”

Tuesday, November 13th, 2007

The Android SDK and documentation came up late yesterday and already this morning there were a thousand messages on the Android Developers discussion group. I’ve had a quick look and here are my first impressions:

Android and the Open Handset Alliance

Wednesday, November 7th, 2007

This story is all over the net and the papers. Most of the stories just regurgitate the official press releases, which are here. By now everyone will know what it is, so I won’t bore you with that here.
The OHA developers’ page says that they “will make available an early look at the […]

OpenMRCP (2)

Tuesday, October 16th, 2007

I finally managed to have a look at OpenMRCP.
Installation
Compilation and installation go smoothly enough - as long as you have the right versions of everything, in the right places (e.g., sofia-sip has to be in /usr/local/include/).
Fake TTS/ASR demos
The OpenMRCP webpage shows how to use OpenMRCP client and server for TTS and ASR. No real […]

OpenMRCP

Wednesday, August 22nd, 2007

Just read this story at earthtimes.org. I haven’t had a chance to look at it yet, but OpenMRCP could be very useful indeed.

Compiling HTK 3.4 on Windows XP

Monday, July 30th, 2007

Contents

Introduction
Preparation
    Getting the htk source code
    ‘Mac’ format
Using cygwin
    Requirements
    Prepare HTK source code
        1. Remove the reference to "-lX11" in the instructions for HSLab:
        2. Overwrite HGraf.c
    Compilation
    To make HTK available from outside cygwin
Using visual studio express
    Set up
    Make clean!
Conclusion

Introduction
Here are some notes on my experiences of compiling HTK 3.4 on Windows XP. I compiled HTK successfully, but some research and […]

New Speech Recognition blog

Friday, July 27th, 2007

Google Alerts just told me about this new(-ish: it started up on the 11th) blog, called just ‘Speech Recognition’. It’s a non-technical blog covering ASR in healthcare, written by Claire Betis, a Marketing Manager for Crescendo Systems Corp.

Speech tech journal and link watch

Wednesday, July 4th, 2007

Find a good home for all these links:

IEEE Transactions on Audio, Speech & Language Processing
NIST Publications
Speech Communication
ACM Transactions on Speech and Language Processing

[update 071207: uploaded all these to my del.icio.us homepage.]

Speech Processing for IP Networks: Media Resource Control Protocol (MRCP)

Monday, June 25th, 2007

I recommend this book highly to anyone working on speech processing over IP, or indeed to anyone thinking of writing a book on a new technology. See my review for the ACCU magazine CVu, here.

Author’s web page.
Publisher’s web page.