[Freeswitch-users] pocketsphinx

Brian West brian at freeswitch.org
Fri Jul 31 06:37:30 PDT 2009


On Jul 31, 2009, at 8:18 AM, Helmut Kuper wrote:

> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> Hello,
>
> I spent a few days working on my problem. I played around with
> voxforge's data, read some of the train docs of CMU, and debugged the
> sphinxbase, pocketsphinx and mod_pocketsphinx.
>
> Results:
> - -I found a way to use the voxforge data as training data for
> creating a German language corpus.

Care to document the process?

>
> - -I enabled the logging of pocketsphinx to stderr (a dirty but easy
> way to see what went wrong when FS loads the grammar, mdef, etc. and
> simply stops). Very helpful!


I can't recall if there is a logger callback we can register for
this... Last I checked, you couldn't. This is something we should make a
config option for up in the mod, if possible.

>
> - -I had to add a "dictcase" parameter to pocketsphinx.conf.xml and
> mod_pocketsphinx.c to allow case-sensitive dictionaries (like the
> German dictionary from voxforge).

What do you mean?  Can you put this on Jira, please?

>
>
> FS starts up with the German language model and detects the words as
> expected. But it's not as reliable as I want ... I guess this is
> caused by the very small amount of training audio data. I used 4000 of
> the 19000 audio files provided by voxforge because voxforge's
> training fileid-list contains only 4000 files ... I have to create a
> new fileid-list and transcription-lists containing all the audio I have
> downloaded from voxforge.
>
>
> Quite complex the whole thing ...
>
> regards
> helmut
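
For anyone following along, here is a rough sketch of how the fileid-list
and transcription-list mentioned above could be regenerated to cover all
downloaded audio. It is not Helmut's actual procedure. It assumes a prompts
file whose lines look like "<utterance-id> <text>", audio stored as
"<utterance-id>.wav" under one root directory, and the transcription layout
used in the common SphinxTrain tutorials ("<s> text </s> (id)"). The
function name build_training_lists is hypothetical. Case is left untouched
because the dictionary is case-sensitive.

```python
from pathlib import Path


def build_training_lists(prompts, wav_root):
    """Return (fileids, transcriptions) for utterances whose audio exists.

    prompts:  iterable of lines "<utterance-id> <text>" (assumed layout)
    wav_root: directory expected to hold "<utterance-id>.wav" files
    """
    wav_root = Path(wav_root)
    fileids = []
    transcriptions = []
    for line in prompts:
        parts = line.split(maxsplit=1)
        if len(parts) != 2:
            continue  # skip blank lines and ids without any text
        utt_id, text = parts[0], parts[1].strip()
        if not (wav_root / (utt_id + ".wav")).is_file():
            continue  # no audio downloaded for this prompt
        # Assumption: the id in parentheses is the last path component.
        short_id = utt_id.rsplit("/", 1)[-1]
        fileids.append(utt_id)
        # Keep the original case: the German dictionary is case-sensitive.
        transcriptions.append("<s> {} </s> ({})".format(text, short_id))
    return fileids, transcriptions
```

Writing each returned list to a file, one entry per line, would give the
two inputs the trainer expects. The file names to use depend on the
training setup and are not shown here.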




