andai 3 days ago

Interesting, I haven't used Whisper, is it cost effective? Seems to be about 36 cents per (hour long) video? How long does processing take?

  • kajecounterhack 3 days ago

    You can run it locally, and it's really fast. But since YouTube transcription is really good, I don't see why you'd use Whisper and get a worse transcription (unless maybe it's on videos that Google did not transcribe for whatever reason).

    • gs17 3 days ago

      > But since YouTube transcription is really good

      Are you sure you're looking at automatic transcripts? YouTube transcripts are bizarrely low quality if they're not provided by the creators (I've actually used my Google Pixel's live transcription to make better captions occasionally).

      I just checked a video my girlfriend uploaded a week ago and the auto-transcript was still pretty messy. I've used Whisper for the same task and it's significantly better.

      • ofou 2 days ago

        Agreed. However, you can get great YT transcriptions using GPT-4o mini to clean them up.

  • HPsquared 2 days ago

    36 cents an hour is how much it costs to hire an entire GPU like an A4000. I can assure you Whisper runs much, much faster than 1x!

davidzweig 3 days ago

The security against downloading audio from YouTube has been upped recently with 'PO tokens'.

Whisper is only a few tenths of a cent per hour transcribed if transcribing on your gpu though, at about 30x real-time on a 3080 etc. with batching.

  • swyx 2 days ago

    > The security against downloading audio from YouTube has been upped recently with 'PO tokens'.

    do you have a source? more generally is there a community or news source for youtube "api" news like this?

  • HPsquared 2 days ago

    Tbh I've not had trouble with this for personal use.