{"id":857235,"date":"2025-06-22T01:13:14","date_gmt":"2025-06-22T06:13:14","guid":{"rendered":"https:\/\/newsycanuse.com\/index.php\/2025\/06\/22\/apple-devices-offer-amazing-speech-to-text-transcription-in-developer-betas-shows-test\/"},"modified":"2025-06-22T01:13:14","modified_gmt":"2025-06-22T06:13:14","slug":"apple-devices-offer-amazing-speech-to-text-transcription-in-developer-betas-shows-test","status":"publish","type":"post","link":"https:\/\/newsycanuse.com\/index.php\/2025\/06\/22\/apple-devices-offer-amazing-speech-to-text-transcription-in-developer-betas-shows-test\/","title":{"rendered":"Apple devices offer amazing speech to text transcription in developer betas, shows test"},"content":{"rendered":"<div>\n<figure>\n\t<img width=\"1600\" height=\"800\" src=\"https:\/\/9to5mac.com\/wp-content\/uploads\/sites\/6\/2025\/06\/Apple-devices-offer-amazing-speech-to-text-transcription-in-developer-betas-shows-test.jpg?quality=82&#038;strip=all&#038;w=1600\" alt=\"Apple devices offer amazing speech to text transcription in developer betas, shows test | Screengrab of transcription file for a YouTube video\"  decoding=\"async\" fetchpriority=\"high\"><\/figure>\n<p>If you ever need to transcribe audio or video to text, most current apps are powered by OpenAI\u2019s Whisper model. You\u2019re probably using this model if you use apps like <a href=\"https:\/\/9to5mac.com\/2023\/08\/25\/macwhisper-audio-transcription\/\" target=\"_blank\" rel=\"noreferrer noopener\">MacWhisper<\/a> to transcribe meetings or lectures, or to generate subtitles for YouTube videos.<\/p>\n<p>But iOS 26 and Apple\u2019s other developer betas include the company\u2019s own transcription frameworks \u2013 and a test suggests that they match Whisper\u2019s accuracy while running at more than twice the speed \u2026 <\/p>\n<p>If you\u2019ve ever used the built-in dictation capabilities of any of your Apple devices, this is handled by <a href=\"https:\/\/developer.apple.com\/documentation\/speech\" target=\"_blank\" rel=\"noreferrer noopener\">Apple\u2019s own speech framework<\/a>. In the new betas, there are beta versions of <a href=\"https:\/\/developer.apple.com\/documentation\/speech\/speechanalyzer\" target=\"_blank\" rel=\"noreferrer noopener\">SpeechAnalyzer<\/a> and <a href=\"https:\/\/developer.apple.com\/documentation\/speech\/speechtranscriber\" target=\"_blank\" rel=\"noreferrer noopener\">SpeechTranscriber<\/a> which developers can use in their own apps.<\/p>\n<blockquote>\n<p>Use the Speech framework to recognize spoken words in recorded or live audio. The keyboard\u2019s dictation support uses speech recognition to translate audio content into text. This framework provides a similar behavior, except that you can use it without the presence of the keyboard.<\/p>\n<p>For example, you might use speech recognition to recognize verbal commands or to handle text dictation in other parts of your app. The framework provides a class, SpeechAnalyzer, and a number of modules that can be added to the analyzer to provide specific types of analysis and transcription. Many use cases only need a SpeechTranscriber module, which provides speech-to-text transcriptions.<\/p>\n<\/blockquote>\n<p><em><a href=\"https:\/\/www.macstories.net\/stories\/hands-on-how-apples-new-speech-apis-outpace-whisper-for-lightning-fast-transcription\/\" target=\"_blank\" rel=\"noreferrer noopener\">MacStories<\/a>\u2018<\/em> John Voorhees asked his son to create a command-line tool to test this new capability, and was incredibly impressed by the results.<\/p>\n<blockquote>\n<p>I asked Finn what it would take to build a command line tool to transcribe video and audio files with SpeechAnalyzer and SpeechTranscriber. He figured it would only take about 10 minutes, and he wasn\u2019t far off. In the end, it took me longer to get around to installing macOS Tahoe after WWDC than it took Finn to build\u00a0<a href=\"https:\/\/github.com\/finnvoor\/yap\">Yap<\/a>, a simple command line utility that takes audio and video files as input and outputs SRT- and TXT-formatted transcripts.<\/p>\n<\/blockquote>\n<p>He used a 34-minute video to test it against both MacWhisper and VidCap, two of the most popular transcription apps. He found the Apple\u2019s modules matched the accuracy of these, but was more than twice as fast as the most efficient existing app, MacWhisper running the Large V3 Turbo model:<\/p>\n<figure>\n<table readabilityDataTable=\"1\">\n<thead>\n<tr>\n<th>App<\/th>\n<th>Transcription Time<\/th>\n<\/tr>\n<\/thead>\n<tbody>\n<tr>\n<td>Yap (using Apple\u2019s framework)<\/td>\n<td>0:45<\/td>\n<\/tr>\n<tr>\n<td>MacWhisper (Large V3 Turbo)<\/td>\n<td>1:41<\/td>\n<\/tr>\n<tr>\n<td>VidCap<\/td>\n<td>1:55<\/td>\n<\/tr>\n<tr>\n<td>MacWhisper (Large V2)<\/td>\n<td>3:55<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<\/figure>\n<p id=\"p14\">He argues that while this might seem a relatively trivial improvement for one-off tasks, the differences will quickly add up when performing either batch transcriptions or needing to transcribe files very regularly, like students with lecture notes.<\/p>\n<p>If you\u2019re running the macOS Tahoe developer beta, you can <a href=\"https:\/\/github.com\/finnvoor\/yap\">install Yap from GitHub<\/a> to test it for yourself.<\/p>\n<h4 id=\"h-highlighted-accessories\">Highlighted accessories<\/h4>\n<ul>\n<li><a href=\"https:\/\/www.amazon.com\/Anker-Charger-Compact-Technology-Included\/dp\/B0CP7NWH6L?tag=blovejoy-20\" target=\"_blank\" rel=\"noreferrer noopener\">Anker 511 Nano Pro ultra-compact iPhone charger<\/a><\/li>\n<li><a href=\"https:\/\/www.amazon.com\/Spigen-Compatible-Accessories-Anti-Yellowing-Military-Grade\/dp\/B0DKGBTVHW?tag=blovejoy-20\" target=\"_blank\" rel=\"noreferrer noopener\">Spigen MagFit case for iPhone 16e \u2013 adds MagSafe support<\/a><\/li>\n<li><a href=\"https:\/\/www.amazon.com\/Apple-MagSafe-Charger-Capability-Compatible\/dp\/B0DGJ4QQ5W?tag=blovejoy-20\" target=\"_blank\" rel=\"noreferrer noopener\">Apple MagSafe Charger with 25w power for iPhone 16 models<\/a><\/li>\n<li><a href=\"https:\/\/www.amazon.com\/Apple-30W-USB-C-Power-Adapter\/dp\/B0CX23PHFD?tag=blovejoy-20\" target=\"_blank\" rel=\"noreferrer noopener\">Apple 30W charger for above<\/a><\/li>\n<li><a href=\"https:\/\/www.amazon.co.uk\/dp\/B0C4FDJ8F7?tag=blovejoy-20\" target=\"_blank\" rel=\"noreferrer noopener\">Anker 240W braided USB-C to USB-C cable<\/a><\/li>\n<\/ul>\n<p><em>Image: 9to5Mac screengrab of a YouTube video subtitle file<\/em><\/p>\n<p>\n\t\t<a target=\"_blank\" rel=\"nofollow\" href=\"https:\/\/news.google.com\/publications\/CAAqBggKMLOFATDAGg?hl=en-US&#038;gl=US&#038;ceid=US:en\"><br \/>\n\t\t\t<em>Add 9to5Mac to your Google News feed.<\/em>\u00a0<br \/>\n\t\t\t\t\t<\/a>\n\t<\/p>\n<div>\n<p><em>FTC: We use income earning auto affiliate links.<\/em> <a href=\"https:\/\/9to5mac.com\/about\/#affiliate\">More.<\/a><\/p>\n<\/div><\/div>\n<p><a href=\"https:\/\/9to5mac.com\/2025\/06\/18\/apple-devices-offer-amazing-speech-to-text-transcription-in-developer-betas-shows-test\/\" class=\"button purchase\" rel=\"nofollow noopener\" target=\"_blank\">Read More<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>If you ever need to transcribe audio or video to text, most current apps are powered by OpenAI\u2019s Whisper model. You\u2019re probably using this model if you use apps like MacWhisper to transcribe meetings or lectures, or to generate subtitles for YouTube videos. But iOS 26 and Apple\u2019s other developer betas include the company\u2019s own [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":857236,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[376,2540,104640],"tags":[],"class_list":{"0":"post-857235","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-apple","8":"category-devices","9":"category-youtube-videos"},"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts\/857235","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/comments?post=857235"}],"version-history":[{"count":0,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts\/857235\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/media\/857236"}],"wp:attachment":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/media?parent=857235"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/categories?post=857235"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/tags?post=857235"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}