Whisper transcription app big performance regression #712

thundergolfer · 2024-04-20T17:14:58Z

https://modal-com.slack.com/archives/C069RAH7X4M/p1713624663717089

thundergolfer · 2024-04-20T17:18:30Z

A one hour podcast used to take ~1 minute, so big drop in performance.

thundergolfer · 2024-05-01T03:14:01Z

I think first thing to do is to replace the use of NFS

ahxxm · 2024-05-18T01:23:39Z

would be great if the official example uses WhisperX, it can transcribe one hour podcast in 1 minute using only 1 container(or more specific, 1 graphi card with 16G vram, using large-v3), instead of spins up 100-300 containers for a single transcription

ahxxm · 2024-05-18T12:53:49Z

made a poc repo here https://github.com/ahxxm/serverless-audio-transcriber

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Whisper transcription app big performance regression #712

Whisper transcription app big performance regression #712

thundergolfer commented Apr 20, 2024

thundergolfer commented Apr 20, 2024

thundergolfer commented May 1, 2024

ahxxm commented May 18, 2024 •

edited

Loading

ahxxm commented May 18, 2024

Whisper transcription app big performance regression #712

Whisper transcription app big performance regression #712

Comments

thundergolfer commented Apr 20, 2024

thundergolfer commented Apr 20, 2024

thundergolfer commented May 1, 2024

ahxxm commented May 18, 2024 • edited Loading

ahxxm commented May 18, 2024

ahxxm commented May 18, 2024 •

edited

Loading