MacWhisper, native macOS app for Whisper #420

jordibruin · 2023-01-17T15:33:58Z

jordibruin
Jan 17, 2023

First of all, a massive thanks to @ggerganov for making all this! Most of the low level stuff is voodoo to me, but I was able to get a native macOS app up and running thanks to all your hard work!

MacWhisper lets you run Whisper locally on your Mac without having to install anything else.

Features

Easily record and transcribe audio files
Just drag and drop audio files to get a transcription
Get accurate text transcriptions in seconds (up to 15x realtime)
Search the entire transcript and highlight words
Supports multiple languages (fastest model is English only)
Copy the entire transcript or individual sections
Reader Mode

MacWhisper is very basic right now, so please let me know if you run into anything. You can download it for free here:
http://goodsnooze.gumroad.com/l/macwhisper

jordibruin · 2023-01-19T10:08:56Z

jordibruin
Jan 19, 2023
Author

Added a bunch more features such as editing and deleting segments, as well as language selection. Love this framework, thanks again!

5 replies

minfuon96 Oct 17, 2023

I've been using the free version of the MacWhisper app for about a month, and I must say, I'm a huge fan of the "Speaker Paragraph" export function. It's been incredibly helpful in helping me grasp the overall message conveyed by the speakers. I've been contemplating getting the full license, especially because I have a hefty library of over 500 GB of videos that need transcribing.

However, there's one aspect that's been bothering me. When it comes to bulk transcribing videos, I can't seem to find the "Speaker Paragraph" export option anywhere. This restriction has left me wondering why such a limitation exists. Wouldn't it make sense to unlock this export option for paying users who've already invested in the app?

Now, I find myself in a dilemma. I can either manually transcribe videos one by one, ensuring I get the "Speaker Paragraph" export, or I can opt for the PRO license to bulk transcribe my videos, but they'll come out as fragmented sentences.

For me, the absence of the "Speaker Paragraph" export option for bulk-transcribed videos is a significant drawback, preventing me from making the purchase I've been considering for over two weeks now. I kindly request adding the "Speaker Paragraph" export option for both single video transcriptions and bulk transcriptions. Your consideration would be greatly appreciated. Thank you.

Duseylicious Dec 11, 2023

I've already purchased and love it, but I'd 2nd this request. 👍

mrienstra Jan 9, 2024

@jordibruin, ping, just in case you missed this thread.

kjkunkle Feb 8, 2024

@jordibruin Can you add a functionality to hide the MacWhisper dock icon ?

Would be nice since I always have it open for the menubar shortcut, but don't like the app cluttering up my CMD+Tab area !

imthemastah Feb 15, 2024

I admit I'm still confused by this. I have the pro version and I all see that the useful option is unavailable?

pbassham · 2023-01-19T19:59:02Z

pbassham
Jan 19, 2023

Love it! Working well.

Here are a few thoughts from some initial use:

Would be good to have a way to stop a transcription in progress. Like if you chose a large file and it is going to take too long. I just dropped a 2nd file on top during it working and it crashed it, so I guess that works, lol.
Would be awesome to see a list of transcription history in the sidebar, for instance.
Some view preferences would be nice.
- show/hide timestamps (maybe just show them on the right to save space, or just on hover)
- reader view by default? (the default view is quite inefficient on space)
- An editable reader mode seems more useful than the very short segments in the default view right now. That would allow you to group it into paragraphs, separate by person, etc, more easily.
Showing the local audio file that is transcribed with a player would be awesome. And clicking on a sentence could jump to that timestamp in the audio file.

Definitely a good start!

Any way to see when updates are released?

0 replies

matsumurae · 2023-02-16T17:26:15Z

matsumurae
Feb 16, 2023

This is a really good app! I remember last time I tried something similar and the AI with Japanese was a bit shitty… I need to review the extracted text but it looks really good (at least what I've saw on the first mins).

I have two ideas that could be good to have:

Would be possible to differentiate between speakers? Well, if not, it's still possible to add it by hand but that's hard work too.
I would like to have the option to re-open files. This could be with a proprietary file format or using the exported ones (like .csv or .srt).

About the first point
This could be really useful to transcribe podcasts with multiple speakers.

For the 2nd point
I'm not sure if it's possible to re-open a file without a proprietary format so it stores both audio + text. I'm thinking on something like oTranscribe does with it's own files so you can review as many times as you want.

Also, I would like to ask why MacWhisper isn't on Github or Gitlab so all of us can collaborate too.

3 replies

taivlam Mar 1, 2023

For your first point, Discussion #450 involves what's called "diarization" in whisper.cpp.

Issue #489 (for 3/more speakers) is closed, though really issue #64 is the issue to keep tabs on with respect to diarization progress.

kai-shimada Mar 6, 2023

For your second point, we have added support for saving and reopening from SRT files in WhisperScript Pro. We were actually motivated to implement this feature by your suggestion :)

You can see other features of WhisperScript Pro here: https://getwavery.gumroad.com/l/whisperscript

matsumurae Mar 8, 2023

For your second point, we have added support for saving and reopening from SRT files in WhisperScript. We were actually motivated to implement this feature by your suggestion :)

Wow, happy to see it was a useful feature.

You can try WhisperScript right here: https://getwavery.gumroad.com/l/whisperscript

I couldn't resist to test it. I've sent to the mail all the feedback that I've found :)

stephando · 2023-03-01T14:42:03Z

stephando
Mar 1, 2023

Love it! And works really well.
However the export transcript button does not work on my machine (MacOS Monterey on MacBook Pro with M1 Max) - any thoughts?

2 replies

LtPierrot Mar 1, 2023

Same issue... cannot export

ReeceKidd Apr 8, 2023

Same

SWM89 · 2023-03-11T19:40:23Z

SWM89
Mar 11, 2023

Nice job ! Looking further for live transcript…

0 replies

dmuiX · 2023-03-18T01:28:46Z

dmuiX
Mar 18, 2023

would be nice if i could choose the language

0 replies

Sogl · 2023-03-20T23:20:39Z

Sogl
Mar 20, 2023

Nice app. Please add support for *.ogg files (WhatsApp, Telegram voice).

0 replies

Jeff9387346 · 2023-03-30T19:33:52Z

Jeff9387346
Mar 30, 2023

Fantastic work.

Question: I initially downloaded the non-pro version and chose only the english dataset. After testing I want to try the multi-language dataset, but I see no way in the app or website to go back and get that file. Pointer would be appreciated.

0 replies

jmantn · 2023-03-31T18:15:08Z

jmantn
Mar 31, 2023

I purchased MacWhisper Pro and love the application.

I primarily use this for podcasts and hour long recordings. I'd like to request a way to export a transcript and its accompanying audio or video and have it be interactive like click on three paragraphs down and there's where the media will jump to. Or A program that could do that as I'd happily pay for something that like that.

Would also love to see macwhisper come to iOS as well. Gladly pay for this again just to have it on mobile as well.

0 replies

Jeff9387346 · 2023-04-08T16:42:31Z

Jeff9387346
Apr 8, 2023

Also, once I identify a speaker, it would be fantastic for the AI to then label all instances of that voice appropriately!!

1 reply

jmantn Apr 12, 2023

Also, once I identify a speaker, it would be fantastic for the AI to then label all instances of that voice appropriately!!

That's honestly what I'm hoping for because the alternative would be tedious for longer transcripts.

canoben · 2023-04-11T20:48:34Z

canoben
Apr 11, 2023

Batch mode does not work in the Pro version (only reason why I bought it).

2 replies

jmantn Apr 12, 2023

Batch mode does not work in the Pro version (only reason why I bought it).

I'm using hour long .mp3's and I've done up to 5 at a time with the Large model selected and they all worked for me. One nice thing about batch mode that's missing from doing individuals for me is when I do batch mode I can select ahead of time multiple formats for output whereas when I do it individually I have to manually export with each format I want one at a time.

But at least for me and my use case batch mode is working. Might be helpful to the dev to know which macOS version you're running and what file types your trying to convert?

canoben Apr 12, 2023

I'm using the latest versions of everything (MacWhisper 2.14 and macOS 13.3.1) and tried with both mp3 and wav files. I've tried on both Intel and Apple Silicon machines, but the batch always stops after the first file (and nothing is written to the disk). Changing language models or output options does not change this. Starting a batch with 5 files or less doesn't help, either.
But I've found a solution for me: I compiled Whisper.cpp myself and use it with the command line. Works perfectly, although strangely much slower than MacWhisper. Maybe I missed some optimisation flags for Apple Silicon. MacWhisper runs much faster on AS compared to the Intel versions.

dchad3 · 2023-04-15T11:33:42Z

dchad3
Apr 15, 2023

I used Mac Whisper Pro Medium to transcribe an interview - stereo file. I wish I could get MW to identify each track with a name. It's too much work to do that manually. I see I can add people but I didn't figure out how that worked. Would be nice if there were YouTube videos showing what it can do and how to do it. Maybe there are but I haven't found them yet.

1 reply

dchad3 Apr 17, 2023

Turns out that either I did it wrong or that it doesn't do stereo files. It only transcribed the first track. I made it mono and transcribed it that way and now have to separate the host and guest comments. I don't see how to have host and guest each with separate paragraphs and identified as in Host: blah bla, Guest: blah blah - except manually. Looking forward to Mac Whisper being able to do that.

dchad3 · 2023-04-15T11:36:27Z

dchad3
Apr 15, 2023

I got Mac Whisper for my Macbook Air M1 Ventura and got the pro. It then gave me size options so I chose medium. Can I have Pro large? There was no option to do that with the Pro. I do podcasts with a different person each time and am in no hurry so the best I can get is what I want.

2 replies

j-f1 Apr 15, 2023

You should be able to go to the dropdown in the top right corner labelled “Medium” and then choose “Manage Models…” to download additional models.

dchad3 Apr 17, 2023

Thanks. I did look into that and see for English only, Pro medium seems to be the best. Large has multiple languages.

wanderingstan · 2023-04-20T20:59:34Z

wanderingstan
Apr 20, 2023

Just found a little bug: Looks like it only listens to the left channel of a stereo file.

I kept getting only "[BLANK_AUDIO]" for an mp3 file that clearly had voices in it. Mystery was solved when I opened it in audacity and saw that the speaking was all in the right channel.

1 reply

iandundas Nov 10, 2023

Thanks for the feedback - this is fixed in 5.7, which will be available soon, likely within the next week.

klh · 2023-04-28T07:33:31Z

klh
Apr 28, 2023

you should add a settings section to tweak num of cpus / threads weights etc.
should be easy to add.

0 replies

cgallerhh · 2023-11-28T22:11:27Z

cgallerhh
Nov 28, 2023

Hi, I have the MacWhisper Pro and work on a MavcBook Air (M1) with Airpods Pro.
When I record calls, only my part is recorded, but not what my conversation partner says.
The medium model is activated and I have never managed to record both my voice and that of my partner via the Airpods.

"New recording" -> only my voice via Airpods
"Record App Audio" -> listbox "airpods", Record System Audio -> only the other person's voice.

What do I have to do to record both my voice and that of my conversation partner via AirPods Pro?

1 reply

iandundas Dec 8, 2023

Hi @cgallerhh, one of the devs of MacWhisper here - I'm guessing this was FaceTime? As far as I know it's not possible to record a FaceTime call due to privacy constraints in Apple's API.

detchells · 2023-12-08T03:11:39Z

detchells
Dec 8, 2023

How to handle mixed-language audio? (Can it?)
I have the paid version of MacWhisper and a valid DeepL key, but when I process meeting audio with both English and Japanese spoken, only the English part is transcribed, it just says [Japanese Spoken] for the Japanese parts. When I select "Translate to ... English", it says the data can't be read because it's missing. Am I making some stupid mistake somewhere?

AFAIK Whisper itself can handle mixed conversations and I"m using the Large model that includes all the languages, is there a limit of some kind with how MacWhisper is set up, or am I perhaps just missing something in the interface?

2 replies

iandundas Dec 8, 2023

Hi @detchells, one of the devs of MacWhisper here, are you able to share such an audio file with me (even just an extract) so that I can do some experimentation? Thanks

detchells Dec 8, 2023

Wow, thanks for the super-fast reply, and absolutely, I've uploaded an MP3 file with a representative extract of the conversation.

The linked MP3 has a bit over 4 minutes of conversatoin, saved at 170-210 Kbps, forced to mono. There are 3 speakers, myself (loud English), a Japanese interviewee (medium-volume Japanese) and the interpreter (soft-spoken, both English and Japanese).

For me, if I select auto-detect, it will grab whatever language appears first in the stream and then just show the others as [speaking Japanese] in English text or the Kanji equivalent in the other direction for Japanese text. If I manually select English or Japanese, it does the same.

Even if I have it transcribe in Japanese, so I know there's Japanese text there, selecting Translate says the data is missing.

Thanks so much for looking into this for me!

(If it could really handle mixed dialogue, this would be incredibly useful to me. I'm also interested to see how it does with broken Jinglish, although I'm not expecting much on that front :-/ - That would be Truly helpful though...)

Hmm, it won't let me attach the file here, so here's a Dropbox link that should give you access, thanks again:
https://www.dropbox.com/scl/fi/iybdicxdkb84647cyf7gn/Canon_clip_for_MacWhisperDevs.mp3?rlkey=xktmfi0z91fk0bx7tixbbjfo4&dl=0

cgallerhh · 2023-12-08T10:08:43Z

cgallerhh
Dec 8, 2023

Hi Ian, no, it wasn’t Facetime, it was MS Teams. *Von: *Ian Dundas ***@***.***> *Datum: *Freitag, 8. Dezember 2023 um 09:26 *An: *ggerganov/whisper.cpp ***@***.***> *Cc: *cgallerhh ***@***.***>, Mention < ***@***.***> *Betreff: *Re: [ggerganov/whisper.cpp] MacWhisper, native macOS app for Whisper (Discussion #420) Hi @cgallerhh <https://github.com/cgallerhh>, one of the devs of MacWhisper here - I'm guessing this was FaceTime? As far as I know it's not possible to record a FaceTime call due to privacy constraints in Apple's API. — Reply to this email directly, view it on GitHub <#420 (reply in thread)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/A573WDR2A3AWGYFONFVAVKTYILFMJAVCNFSM6AAAAAAT6AEGNOVHI2DSMVQWIX3LMV43SRDJONRXK43TNFXW4Q3PNVWWK3TUHM3TOOJXGQ2DK> . You are receiving this because you were mentioned.Message ID: ***@***.***>

0 replies

cgallerhh · 2023-12-08T10:12:12Z

cgallerhh
Dec 8, 2023

Hi Ian, please find attached [image: Bildschirmfoto 2023-12-08 um 11.11.20.png] an audio file with my voice. It was recording “system audio” -> Airpods and I can only listen to my own voice. Viele liebe Grüße, Christian Galler Nymphenweg 8 21077 Hamburg Web www.christian-galler.de Mobil 0176 63107173 7. Dec at 09_59_55 Microphone.whisper <https://drive.google.com/file/d/1Q48QEesAZA2Wg-ivldwvTgn1tO4NP3u2/view?usp=drive_web>

…

On Fri, 8 Dec 2023 at 09:25, Ian Dundas ***@***.***> wrote: Hi @detchells <https://github.com/detchells>, one of the devs of MacWhisper here, are you able to share such an audio file with me (even just an extract) so that I can do some experimentation? Thanks — Reply to this email directly, view it on GitHub <#420 (reply in thread)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/A573WDUU63DF3ZNXXFYWAYTYILFHNAVCNFSM6AAAAAAT6AEGNOVHI2DSMVQWIX3LMV43SRDJONRXK43TNFXW4Q3PNVWWK3TUHM3TOOJXGQZTK> . You are receiving this because you commented.Message ID: ***@***.***>

0 replies

cgallerhh · 2023-12-08T21:10:28Z

cgallerhh
Dec 8, 2023

Hi my friend, I‘m not that japanese guy - I am the Airpods guy with only one speaker in a conversation. Seems you‘ve written the wrong user. Gesendet von Outlook für iOS<https://aka.ms/o0ukef>

…

________________________________ Von: Dave Etchells ***@***.***> Gesendet: Friday, December 8, 2023 8:52:03 PM An: ggerganov/whisper.cpp ***@***.***> Cc: cgallerhh ***@***.***>; Mention ***@***.***> Betreff: Re: [ggerganov/whisper.cpp] MacWhisper, native macOS app for Whisper (Discussion #420) Wow, thanks for the super-fast reply, and absolutely, I've uploaded an MP3 file with a representative extract of the conversation. The linked MP3 has a bit over 4 minutes of conversatoin, saved at 170-210 Kbps, forced to mono. There are 3 speakers, myself (loud English), a Japanese interviewee (medium-volume Japanese) and the interpreter (soft-spoken, both English and Japanese). For me, if I select auto-detect, it will grab whatever language appears first in the stream and then just show the others as [speaking Japanese] in English text or the Kanji equivalent in the other direction for Japanese text. If I manually select English or Japanese, it does the same. Even if I have it transcribe in Japanese, so I know there's Japanese text there, selecting Translate says the data is missing. Thanks so much for looking into this for me! (If it could really handle mixed dialogue, this would be incredibly useful to me. I'm also interested to see how it does with broken Jinglish, although I'm not expecting much on that front :-/ - That would be Truly helpful though...) Hmm, it won't let me attach the file here, so here's a Dropbox link that should give you access, thanks again: https://www.dropbox.com/scl/fi/iybdicxdkb84647cyf7gn/Canon_clip_for_MacWhisperDevs.mp3?rlkey=xktmfi0z91fk0bx7tixbbjfo4&dl=0 — Reply to this email directly, view it on GitHub<#420 (reply in thread)>, or unsubscribe<https://github.com/notifications/unsubscribe-auth/A573WDSA4PR4CP2OSI2V5STYINVWHAVCNFSM6AAAAAAT6AEGNOVHI2DSMVQWIX3LMV43SRDJONRXK43TNFXW4Q3PNVWWK3TUHM3TQMBTGA4DM>. You are receiving this because you were mentioned.Message ID: ***@***.***>

0 replies

Nnnay07 · 2023-12-09T14:00:35Z

Nnnay07
Dec 9, 2023

Found an issue with transcription of speeches when interrupted by long pauses where music is played. Using Spanish language and small model on mp3 file. In that case the onlie training lasted 4h and had several breaks for coffee and lunch where music would be played. Transcription goes well for the fisrt part, then detects the music (and informs about [MUSIC], but never detects that the spoken speech starts again. It transcribes [MUSIC] until the end of the file. Result is 2h of correct transcription followed by 2h of [MUSIC], while it should have been 2h of transcription + 15min of [MUSIC] + 1h45 of transcription.
Systematic problem faced on several files.

1 reply

evenkeelhuang Dec 20, 2023

I encountered a similar situation in my experience. When the speaker in the recording has a long period of silence, the transcription that follows often becomes inaccurate. This seems to be a common issue when dealing with long pauses in speech.

Shmoopy2024 · 2024-01-14T19:23:08Z

Shmoopy2024
Jan 14, 2024

Macwhisper version 6.11 - no longer lets me edit the lines in the transcript to assign speakers! I'm not a comp sci guy, I am just a simple user. If anyone can provide help with this, or can suggest something like a new procedure - please - let me know, ok?

Without the ability to assign speakers - it's essentially lost 50% of its utility. yikes!

1 reply

Shmoopy2024 Jan 14, 2024

never mind - I just had a senior's moment . It's fine, the app is ok

gnewtonn · 2024-01-24T23:57:06Z

gnewtonn
Jan 24, 2024

Anyone knows something similar for Windows?

3 replies

mrienstra Jan 25, 2024

Some suggestions here: https://www.reddit.com/r/OpenAI/comments/163hzhe/recommended_whisper_ai_with_gui_and_gpu_support/

Sing303 Feb 14, 2024

Yeah, it's very similar to this https://easywhisper.io/

thewh1teagle May 22, 2024

Vibe works on macOS / Linux / Windows
https://github.com/thewh1teagle/vibe

auroraflux · 2024-06-17T23:02:40Z

auroraflux
Jun 17, 2024

This app is absolutely phenomenal, I've bought it 3 times now for myself and two friends.

Is there any chance there's an equivalent for Android? I'm looking for something that allows at the very least a long, uninterrupted mic recording where I can just put the phone down and let it pick everything up and transcribe it later.

I realize I could just do an audio recording and move the file and transcribe it via the MacWhisperer app later, but if I could do it all on-device that would be incredible.

Thanks!

0 replies

ef-is · 2024-06-25T08:24:20Z

ef-is
Jun 25, 2024

7 replies

auroraflux Jun 28, 2024

Or, he could do as the MIT license for OpenAI's Whisper specifically allows him to do:

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so

This is how the MIT open source license works. He is not only well within his rights to charge for it, he should be free to. This is what the spirit of open source licenses are, and this is what software freedom looks like.

If you don't understand how licenses like these work, maybe refrain from from making soapbox-style commentary about it.

readtedium Jun 28, 2024

Putting a nice interface on a script and charging for more advanced features is additional work. You are paying for the ease of use—and the dev is nice enough to make most of it available to people for free. If you don’t like this approach, use one of the other tools out there that won’t charge you at all.

sandipb Jun 29, 2024

I rarely pay for opensource wrappers unless they provide a significant value add. This gives me a native gui with additional support for managing models based on my requirements, keeps the model list updated, handles different kinds of inputs, allows me to edit transcriptions, etc all value adds to the option of doing all this manually even if I have the tech knowhow to do it myself.
I am completely ok with paying for this.

Now the way you are looking at pricing is backward - he could have charged for every aspect of the app, but customer acquisition strategy is to provide the lower quality fast models for free. It is pretty common paradigm in AI apps right now dont you think? perplexity is doing just the same.

ef-is Jul 1, 2024

Or, he could do as the MIT license for OpenAI's Whisper specifically allows him to do:

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so

This is how the MIT open source license works. He is not only well within his rights to charge for it, he should be free to. This is what the spirit of open source licenses are, and this is what software freedom looks like.

If you don't understand how licenses like these work, maybe refrain from from making soapbox-style commentary about it.

you're absolutely right. my bad.

auroraflux Jul 3, 2024

Hey, it's always a fair point to bring up, and I can understand feeling a bit indignant about something like this. But it's okay! We learn about these licenses and then we re-evaluate our stance when we get new information.

Don't beat yourself up.

sandipb · 2024-06-29T16:52:50Z

sandipb
Jun 29, 2024

I am using a paid Pro version of the app. Is this thread the only community for the app? I see a feedback email address for the app but no community links. I really think the app would benefit from either a forum/discord. Even a github discussion is fine.

0 replies

sandipb · 2024-06-29T16:54:20Z

sandipb
Jun 29, 2024

Now that I have marked speakers for the transcription, I see now output format which preserves this information. Exporting segments to pdf or html seems to just export the original timestamp based segments. Is there a way I can add this speaker information to audio segments anywhere?

0 replies

bwoodcock · 2024-07-03T20:19:58Z

bwoodcock
Jul 3, 2024

Hi. I've got a 34-second-long wav file, single speaker, in French, but only the first second gets transcribed... Anyone have any suggestions of what I might do to get this working? There's no apparently difference in the audio or file at the one-second mark, the transcription literally stops in the middle of a sentence, while the audio continues uninterrupted.

Thanks for any ideas!

0 replies

paulf12345 · 2024-07-22T06:43:43Z

paulf12345
Jul 22, 2024

Any suggestions for optimizing speed on an M2 Max MBP 96 GB? It'd be sweet if theres any benchmarks or you have any advice on models (distilled, turbo, normal?), audio pipeline and encoding/decoding format to use (assuming whisperkit is recommended), and effects of pipeline and encoding/decoding compute units on flash attention, greedy vs beam search, etc...

I saw some notes on CoreML + GPU processing in discussions back in march but have not been following repo closely enough to know whether this has been implemented (at which point I assume whisperkit is no longer best option? although tbh idk what the difference between whisperkit and .cpp models are other than swift support).

1 reply

johnmarshall4 Jul 25, 2024

For what it's worth, I have a similar machine and I have found the Small English C++ model to be the best speed vs. accuracy. I use it for nearly everything I do. I use flash attention. If you want / need a large model, the 1.5G distilled whisper kit models are faster than Large English C++, but I don't find them as accurate. The non-distilled large whisper kit models have better accuracy but are much slower than C++. At least that is what I've found on an m3 Max. YMMV.

realAbitbol · 2024-08-30T22:10:06Z

realAbitbol
Aug 30, 2024

Will ollama be supported as an alternative to openai and anthropic at one point ?

0 replies

MacWhisper, native macOS app for Whisper #420

Replies: 45 comments · 43 replies

jordibruin Jan 19, 2023 Author

Replies: 45 comments 43 replies

jordibruin
Jan 19, 2023
Author