[Freeswitch-users] Achieving streaming ASR

Stanislav Sinyagin ssinyagin at gmail.com
Tue Oct 11 00:34:08 MSD 2016


Brian, Mike, as far as I understand, they talk about speech to text, not
the other way round. TTS is something completely different )


On 10 Oct 2016 18:37, "Brian West" <brian at freeswitch.org> wrote:

> It can return as many as it wants, as it gets them from the speech server,
> there really is no limit in the ASR/TTS interface, We reuse handles in the
> MRCP interface module.
>
> /b
>
>
> On Mon, Oct 10, 2016 at 11:29 AM, Dave Horton <daveh at beachdognet.com>
> wrote:
>
>> Just an FYI on a couple of points:
>>
>> 1) I’ve tried to implment support for Nuance NCS API as an asr_tts
>> module.  tts works fine, but asr / transcription does not seem possible
>> because the NCS API is not really suited for a streaming application.
>> Based on my experience (and I opened tickets with them to try to resolve,
>> to no avail), their server does not handle streaming because when I start
>> sending small-ish chunks of audio using chunked-encoding, they immediately
>> return me a “no speech” result.  They seem to expect that you have a
>> recorded file on disk that you can ship over in much larger chunks.
>>
>> 2) I’m not completely sure I agree that transcription fits into the asr
>> part of the asr_tts modules.  Some of these cloud-based services support
>> providing a stream of interim results while the speaker is talking, and
>> this would be very useful to propogate up to the freeswitch application (or
>> esl) but the asr_tts pattern seems to be to return a single speech result
>> at the end of a segment of speech.
>>
>> Dave
>>
>>
>> On Oct 10, 2016, at 11:54 AM, Michael Jerris <mike at jerris.com> wrote:
>>
>> This should be implemented the way we implement other speech interfaces.
>>
>>
>> > On Oct 8, 2016, at 4:53 PM, Luca Pradovera <luca.pradovera at gmail.com>
>> wrote:
>> >
>> > Hello,
>> > following a discussion on IRC, I thought I would move this here so it
>> is more async.
>> >
>> > The need is to be able to use Nuance/IBM Watson/Google Speech APIs in
>> streaming mode for real-time transcription (essentially, captioning a video
>> conference and getting a transcript of the proceedings).
>> >
>> > My first idea was to simply get the frames from a recording as it is
>> being written, though there might be some obstacles there.
>> >
>> > It was suggested the media bugs infrastructure would work, at the C
>> level, though I need to check which formats it supports.
>> >
>> > Do you have any other suggestions?
>> >
>> > Thank you!
>> >
>> > Luca
>>
>> _________________________________________________________________________
>> Professional FreeSWITCH Consulting Services:
>> consulting at freeswitch.org
>> http://www.freeswitchsolutions.com
>>
>> Official FreeSWITCH Sites
>> http://www.freeswitch.org
>> http://confluence.freeswitch.org
>> http://www.cluecon.com
>>
>> FreeSWITCH-users mailing list
>> FreeSWITCH-users at lists.freeswitch.org
>> http://lists.freeswitch.org/mailman/listinfo/freeswitch-users
>> UNSUBSCRIBE:http://lists.freeswitch.org/mailman/options/freeswitch-users
>> http://www.freeswitch.org
>>
>>
>> _________________________________________________________________________
>> Professional FreeSWITCH Consulting Services:
>> consulting at freeswitch.org
>> http://www.freeswitchsolutions.com
>>
>> Official FreeSWITCH Sites
>> http://www.freeswitch.org
>> http://confluence.freeswitch.org
>> http://www.cluecon.com
>>
>> FreeSWITCH-users mailing list
>> FreeSWITCH-users at lists.freeswitch.org
>> http://lists.freeswitch.org/mailman/listinfo/freeswitch-users
>> UNSUBSCRIBE:http://lists.freeswitch.org/mailman/options/freeswitch-users
>> http://www.freeswitch.org
>>
>
>
>
> --
>
> *Brian West*
> brian at freeswitch.org
>
>
> *Twitter: @FreeSWITCH , @briankwest*
> http://www.freeswitchbook.com (50% Discount using code FreeSwitch50)
> http://www.freeswitchcookbook.com (50% Discount using code FreeSwitch50)
> https://www.gofundme.com/freeswitch_ubuntu
>
> Got Bugs? Report them here <https://freeswitch.org/jira>! | Reddit:
> /r/freeswitch <https://www.reddit.com/r/freeswitch>
>
> *T:*+19184209001 | *F:*+19184209002 | *M:*+1918424WEST (9378)
> *iNUM:*+883 5100 1420 9001 | *ISN:*410*543 | *Skype:*briankwest
>
> _________________________________________________________________________
> Professional FreeSWITCH Consulting Services:
> consulting at freeswitch.org
> http://www.freeswitchsolutions.com
>
> Official FreeSWITCH Sites
> http://www.freeswitch.org
> http://confluence.freeswitch.org
> http://www.cluecon.com
>
> FreeSWITCH-users mailing list
> FreeSWITCH-users at lists.freeswitch.org
> http://lists.freeswitch.org/mailman/listinfo/freeswitch-users
> UNSUBSCRIBE:http://lists.freeswitch.org/mailman/options/freeswitch-users
> http://www.freeswitch.org
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.freeswitch.org/pipermail/freeswitch-users/attachments/20161010/210549f3/attachment-0001.html 


Join us at ClueCon 2016 Aug 8-12, 2016
More information about the FreeSWITCH-users mailing list