Apple's New Speech APIs Outpace Whisper for Fast Transcription
(macstories.net)
36 points by epaga 2 days ago
>> By harnessing SpeechAnalyzer and SpeechTranscriber on-device, the command line tool tore through the 7GB video file a full 55% faster than MacWhisper’s Large V3 Turbo model, with no noticeable difference in transcription quality.
App                            Transcription Time
-------------------------------------------------
Yap (uses Apple APIs)          0:45
MacWhisper (Large V3 Turbo)    1:41
VidCap                         1:55
MacWhisper (Large V2)          3:55
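The "55% faster" figure matches the table if it is read as a reduction in wall-clock time rather than a throughput ratio. A quick sketch of that arithmetic, using only the times listed above (the helper names `to_seconds` and `compare` are my own, not from the article):

```python
def to_seconds(mmss: str) -> int:
    """Convert an 'M:SS' string from the table into whole seconds."""
    minutes, seconds = mmss.split(":")
    return int(minutes) * 60 + int(seconds)


# Times copied from the table in the article.
times = {
    "Yap (uses Apple APIs)": "0:45",
    "MacWhisper (Large V3 Turbo)": "1:41",
    "VidCap": "1:55",
    "MacWhisper (Large V2)": "3:55",
}

baseline = to_seconds(times["Yap (uses Apple APIs)"])


def compare(app: str) -> tuple[float, float]:
    """Return (fraction of time saved by Yap, speedup factor of Yap) vs. app."""
    other = to_seconds(times[app])
    reduction = 1 - baseline / other  # share of wall-clock time saved
    speedup = other / baseline        # how many times faster Yap finished
    return reduction, speedup


for app in times:
    if app.startswith("Yap"):
        continue
    reduction, speedup = compare(app)
    print(f"{app}: {reduction:.0%} less time, {speedup:.2f}x speedup")
```

By that measure Yap needs about 55% less time than Large V3 Turbo, which works out to roughly a 2.2x speedup. The two numbers describe the same gap, so it is worth saying which one is meant when quoting the result.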
>> All three transcription workflows had similar trouble with last names and words like “AppStories,” which LLMs tend to separate into two words instead of camel casing.
It sounds like the quality is better than YouTube's.
> a game changer for anyone who uses voice transcription to create text from lectures, podcasts, YouTube videos, and more... generating transcripts that I upload to YouTube because the site’s built-in transcription isn’t very good.
and on par with Whisper.
> SpeechAnalyzer and SpeechTranscriber – available across the iPhone, iPad, Mac, and Vision Pro – mark a significant leap forward in transcription speed without compromising on quality
Not open source/weights, FU apple