cm0002@piefed.world to Linux@programming.devEnglish · 20 天前FFmpeg 8.0 merges OpenAI "Whisper Filter" for automatic speech recognition, Vulkan AV1 encoding, & VP9 decodingwww.phoronix.comexternal-linkmessage-square13linkfedilinkarrow-up163arrow-down15file-textcross-posted to: linux@sh.itjust.workslinux@lemmy.worldlinux@lemmy.mlopensource@programming.dev
arrow-up158arrow-down1external-linkFFmpeg 8.0 merges OpenAI "Whisper Filter" for automatic speech recognition, Vulkan AV1 encoding, & VP9 decodingwww.phoronix.comcm0002@piefed.world to Linux@programming.devEnglish · 20 天前message-square13linkfedilinkfile-textcross-posted to: linux@sh.itjust.workslinux@lemmy.worldlinux@lemmy.mlopensource@programming.dev
https://www.phoronix.com/news/FFmpeg-Vulkan-AV1-Encoding https://www.phoronix.com/news/FFmpeg-Lands-Whisper
minus-squarechrisbtoo@lemmy.calinkfedilinkarrow-up17·20 天前Hopefully the speech recognition is better than whatever the fuck most online video platforms use for automatic subtitles at the moment.
minus-squarepirateKaiser@sh.itjust.workslinkfedilinkarrow-up8·20 天前I’ve built an app with Whisper, the level of ‘hit or miss’ entirely depends on the size of the model and language. Even audio quality is a lesser factor in my experience. So, it depends…
minus-squaredata1701d (He/Him)@startrek.websitelinkfedilinkEnglisharrow-up1·20 天前I was messing around with HomeAssistant the other day, which uses the same speech recognition engine, and I found it to be decent.
Hopefully the speech recognition is better than whatever the fuck most online video platforms use for automatic subtitles at the moment.
I’ve built an app with Whisper, the level of ‘hit or miss’ entirely depends on the size of the model and language. Even audio quality is a lesser factor in my experience. So, it depends…
I was messing around with HomeAssistant the other day, which uses the same speech recognition engine, and I found it to be decent.