[Freeswitch-users] Robust Affordable Speech Recognition
Kashif Kahn
info at evestech.com
Thu May 13 11:21:59 PDT 2010
Hi Jan,
Our answers to your questions are as
follows. Please note that there is a wide variety of information
available on our website under "Speech Engine" pull-down menu. You
should also review Freeswitch section under "Soft-PBX" for free connector and engine pricing information. Our website is: http://www.vestec.ca/
1) What languages
do you
support
We currently support American English and
working on major European and Asian language acoustic models as well.
2) What are your
recognition
stats per language?
Three points are worthy of note here:
(a) we have rigorously benchmarked our recognition
accuracy against leading commercial engines in the market and can safely say that we deliver among the highest recognition accuracy in the
industry;
(b) generally speaking, a native speaker can expect a
recognition accuracy - without "tuning" - in the 90% range while a
non-native speaker can expect a recognition accuracy - again, without
"tuning" - in the 80% range. Grammar "tuning" - for example, via
addition of custom pronunciations for difficult to recognize words -
generally improves recognition accuracy by an additional 5-10%;
(c) in comparative testing against some of the leading commercial speech
engines, we scored an accuracy improvement of over 3%. See: http://www.vestec.ca/recog_acc
3) What are the benefits
of
using this compared to Sphinx/Pocketsphinx?
There
are two fundamental advantages of using Vestec over Sphinx:
(a)
Vestec speech engine comes with a robust acoustic model while for Sphinx you need to develop your own acoustic model. (Acoustic model is the
"ear drum" if you will that does the recognition). Now, development of a high quality acoustic model is no mean feat, even if you have the
necessary tools. For starters you need high quality training data that
can cost tens of thousands of dollars. Next, you need highly specialized knowledge in order to properly manipulate the various parameters of
training tools for optimal recognition quality. Finally, you need to be
fully prepared to undertake a trial-and-error process - sometimes
spanning 6-8 months - to achieve the desired recognition accuracy
results, with the right data and training tools.
(b) Unlike
Sphinx, Vestec speech engine supports industry standard grammar writing format. This allows you to port your existing standard
grammars to the engine as well as make sure that your work is reusable.
Similarly, we provide a variety of useful grammar writing tools and
functions that are not available with Sphinx to make the developer's job a lot easier.
4) How does the (1)
installation of a license happen, (2) license check happen.
You need to put the license files that you purchase from Vestec webstore
under a specific directory. The engine then takes care of the rest. The
license check is host based. In other words, every license is
bounded to one physical/virtual machine. You cannot buy one port license and
expect to run the speech engine on several machines.
5) Does the licensing server
support redundancy schemes?
We used to have a
redundancy scheme but recently disabled it in preparation for the
release of a new architecture. The redundancy scheme will be integrated
in the new architecture that will support MRCP. Please note that
software maintenance - covering patches and major upgrades - is free for the first 12 months since date of license purchase. So, you will be
able to transition to the new architecture over next several months if
you were to purchase a license now.
Hope this helps.
Best
regards,
-Kashif
________________________________
From: Jan Berger <jan.berger at video24.no>
To: freeswitch-users at lists.freeswitch.org
Sent: Thu, May 13, 2010 11:36:54 AM
Subject: Re: [Freeswitch-users] Robust Affordable Speech Recognition
Hi,
Hope you don’t mind
a few nosy questions.
What languages do you
support (1) national characters, (2) pre-build databases.
What are your recognition
stats per language?
What are the benefits of
using this compared to Sphinx/Pocketsphinx?
How does the (1)
installation of a license happen, (2) license check happen.
Does the licensing server
support redundancy schemes?
Jan
________________________________
From:freeswitch-users-bounces at lists.freeswitch.org
[mailto:freeswitch-users-bounces at lists.freeswitch.org] On Behalf Of Kashif Kahn
Sent: 13. mai 2010 16:04
To: freeswitch-users at lists.freeswitch.org
Subject: [Freeswitch-users] Robust
Affordable Speech Recognition
Dear All,
All those who have wanted a speech recognition solution for Freeswitch but
found the software cost too expensive or the recognition accuracy
unsatisfactory, please consider Vestec Speech Engine for Freeswitch at: http://www.vestec.ca/products A starter kit - which is a specially priced one port (ie. one channel) license
for the standard engine - is available for only $25. Additional ports
(channels) licenses can be purchased for $99/port. The engine comes with a
free-of-charge Freeswitch connector, thereby allowing direct interaction via
Dialplan.
Best regards,
-Kashif
Kashif Kahn
VP, Business Development
Vestec, Inc.
Waterloo, ON Canada
phone: (519) 885-7615
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://lists.freeswitch.org/pipermail/freeswitch-users/attachments/20100513/f0c9a2b9/attachment.html
More information about the FreeSWITCH-users
mailing list