<div dir="ltr">Hello,<div>Following a discussion on IRC, I thought I would move this here so it can be more async.</div><div><br></div><div>I need to use the Nuance, IBM Watson, or Google Speech APIs in streaming mode for real-time transcription (essentially, captioning a video conference and getting a transcript of the proceedings).</div><div><br></div><div>My first idea was simply to read the frames from a recording as it is being written, though there might be some obstacles to that.</div><div><br></div><div>It was suggested that the media bugs infrastructure would work at the C level, though I still need to check which formats it supports.</div><div><br></div><div>Do you have any other suggestions?</div><div><br></div><div>Thank you!</div><div><br></div><div>Luca</div></div>