agent-image-lead-qualification-assistant
Transcribe Long Audio Files Beyond 25MB Limit with FileFlows and OpenAI Whisper
Who's it for This template is designed for content creators, podcasters, businesses, and researchers who need to transcribe long audio recordings that...
screenshot-daily-task-reminder-bot
Created by:
Julien DEL RIO
Last Updated:

June 15th 2025

Agent Details:
n8n
OpenAI
general
HTTP Request
If
No Operation
do nothing
Description
Who's it for This template is designed for content creators, podcasters, businesses, and researchers who need to transcribe long audio recordings that exceed OpenAI Whisper's 25 MB file size limit (~20 minutes of audio). How it works This workflow combines n8n, FileFlows, and OpenAI Whisper API to transcribe audio files of any length: User uploads an MP3 file through a web form and provides an email address n8n splits the file into 4 MiB chunks and uploads them to FileFlows FileFlows uses FFmpeg to segment the audio into 15-minute chunks (safely under the 25 MB API limit) Each segment is transcribed using OpenAI's Whisper API (configured for French by default) All transcriptions are merged into a single text file The complete transcription is automatically emailed to the user Processing time: Typically 10-15 minutes for a 1-hour audio file. Requirements n8n instance (self-hosted or cloud) FileFlows with Docker and FFmpeg installed OpenAI API key (Whisper API access) Gmail account for email delivery Network access between n8n and FileFlows Setup Complete setup instructions, including FileFlows workflow import, credentials configuration, and storage setup, are provided in the workflow's sticky notes. Cost OpenAI Whisper API: $0.006 per minute. A 1-hour recording costs approximately $0.36.

Build Dynamic AI Agents

Join a community of growing Agentic AI Developers.