I'm still interested in getting pocketsphinx to attempt speech recognition on an audio file.<br><br>To be honest, most of the problem is that at 8Khz (mobile phone call rate), speech detection is NOT very accurate, at 16Khz it IS significantly better.<br>
<br>I'm planning to have a play with the speechtools module and mod_pocketsphinx etc to try and get an audio file parsed, spare time permitting.<br><br>Will let the list know if I get anywhere.<br><br>Regards<br><br>Kirk Bateman<br>
<br><br><div class="gmail_quote">2009/8/11 David Knell <span dir="ltr"><<a href="mailto:dave@3c.co.uk">dave@3c.co.uk</a>></span><br><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
Hi Pete,<br>
<br>
I'm afraid that the answer's still the same: use a human. Here's an<br>
article describing the state of the art:<br>
<a href="http://www.theregister.co.uk/2009/08/05/spinvox_demo_day/" target="_blank">http://www.theregister.co.uk/2009/08/05/spinvox_demo_day/</a><br>
- the links to previous stories at the bottom provide good background.<br>
<br>
--Dave<br>
<div><div></div><div class="h5"><br>
> I apologize, I should have been more clear. We will be using humans<br>
> to scan the translated results. But we are looking for a system to<br>
> perform the "first pass" on the audio to hopefully help the human type<br>
> less.<br>
><br>
> Although the question has been raised if it's faster to have a human<br>
> just transcribe the whole thing, or fix up what the computer spit out.<br>
> If you have any insights on this, that would be great.<br>
><br>
> -pete<br>
><br>
> -------- Original Message --------<br>
> Subject: Re: [Freeswitch-users] VoiceMail transcription<br>
> From: David Knell <<a href="mailto:dave@3c.co.uk">dave@3c.co.uk</a>><br>
> Date: Mon, August 10, 2009 11:51 am<br>
> To: <a href="mailto:freeswitch-users@lists.freeswitch.org">freeswitch-users@lists.freeswitch.org</a><br>
><br>
> Good evening Pete,<br>
><br>
> The only way to do this is, I'm afraid, to use a human. We use<br>
> Amazon's<br>
> Mechanical Turk to good effect.<br>
><br>
> Cheers --<br>
><br>
> Dave<br>
><br>
> > Good morning all,<br>
> ><br>
> > I realize this is slightly off the FS topic, but I am<br>
> wondering if<br>
> > anyone out there has experience with software packages<br>
> designed for<br>
> > the transcription of voicemails to text. I've used<br>
> pocketsphinx with<br>
> > FS to handle IVR menus, but now have the task of figuring<br>
> out how to<br>
> > convert recorded phone conversations (voicemails mostly) to<br>
> text.<br>
> ><br>
> > This does not have to be a real-time process, I can store<br>
> the audio<br>
> > files and process them over time. This would need to be a<br>
> software<br>
> > (preferable open source) solution. ASPs like VoiceCloud<br>
> would not<br>
> > work for this application.<br>
> ><br>
> > Thanks for any help<br>
> > -pete<br>
> > _______________________________________________<br>
> > FreeSWITCH-users mailing list<br>
> > <a href="mailto:FreeSWITCH-users@lists.freeswitch.org">FreeSWITCH-users@lists.freeswitch.org</a><br>
> ><br>
> <a href="http://lists.freeswitch.org/mailman/listinfo/freeswitch-users" target="_blank">http://lists.freeswitch.org/mailman/listinfo/freeswitch-users</a><br>
> ><br>
> UNSUBSCRIBE:<a href="http://lists.freeswitch.org/mailman/options/freeswitch-users" target="_blank">http://lists.freeswitch.org/mailman/options/freeswitch-users</a><br>
> > <a href="http://www.freeswitch.org" target="_blank">http://www.freeswitch.org</a><br>
> --<br>
> David Knell, Director, 3C Limited<br>
> T: +44 20 3298 2000<br>
> E: <a href="mailto:dave@3c.co.uk">dave@3c.co.uk</a><br>
> W: <a href="http://www.3c.co.uk" target="_blank">http://www.3c.co.uk</a><br>
><br>
><br>
> _______________________________________________<br>
> FreeSWITCH-users mailing list<br>
> <a href="mailto:FreeSWITCH-users@lists.freeswitch.org">FreeSWITCH-users@lists.freeswitch.org</a><br>
> <a href="http://lists.freeswitch.org/mailman/listinfo/freeswitch-users" target="_blank">http://lists.freeswitch.org/mailman/listinfo/freeswitch-users</a><br>
> UNSUBSCRIBE:<a href="http://lists.freeswitch.org/mailman/options/freeswitch-users" target="_blank">http://lists.freeswitch.org/mailman/options/freeswitch-users</a><br>
> <a href="http://www.freeswitch.org" target="_blank">http://www.freeswitch.org</a><br>
><br>
> _______________________________________________<br>
> FreeSWITCH-users mailing list<br>
> <a href="mailto:FreeSWITCH-users@lists.freeswitch.org">FreeSWITCH-users@lists.freeswitch.org</a><br>
> <a href="http://lists.freeswitch.org/mailman/listinfo/freeswitch-users" target="_blank">http://lists.freeswitch.org/mailman/listinfo/freeswitch-users</a><br>
> UNSUBSCRIBE:<a href="http://lists.freeswitch.org/mailman/options/freeswitch-users" target="_blank">http://lists.freeswitch.org/mailman/options/freeswitch-users</a><br>
> <a href="http://www.freeswitch.org" target="_blank">http://www.freeswitch.org</a><br>
--<br>
David Knell, Director, 3C Limited<br>
T: +44 20 3298 2000<br>
E: <a href="mailto:dave@3c.co.uk">dave@3c.co.uk</a><br>
W: <a href="http://www.3c.co.uk" target="_blank">http://www.3c.co.uk</a><br>
<br>
<br>
_______________________________________________<br>
FreeSWITCH-users mailing list<br>
<a href="mailto:FreeSWITCH-users@lists.freeswitch.org">FreeSWITCH-users@lists.freeswitch.org</a><br>
<a href="http://lists.freeswitch.org/mailman/listinfo/freeswitch-users" target="_blank">http://lists.freeswitch.org/mailman/listinfo/freeswitch-users</a><br>
UNSUBSCRIBE:<a href="http://lists.freeswitch.org/mailman/options/freeswitch-users" target="_blank">http://lists.freeswitch.org/mailman/options/freeswitch-users</a><br>
<a href="http://www.freeswitch.org" target="_blank">http://www.freeswitch.org</a><br>
</div></div></blockquote></div><br>