Automatically transcribe Telegram voice messages with OpenAI Whisper & Google Workspace

Created by

Last update

Last update 8 days ago

🧑‍💼 Who’s it for

Journalists, content creators, or busy professionals who often record voice memos or short interviews on the go.
Anyone who wants to turn voice recordings into searchable, structured notes.

User sends a voice message to a Telegram bot.
n8n checks if the message is an audio voice note.
If valid, it downloads the audio file and:
- Transcribes it using OpenAI Whisper (or your LLM of choice).
- Uploads the original audio to Google Drive for safekeeping.
The transcript and audio metadata are merged.
The workflow:
- Logs the data into a Google Sheet.
- Sends a formatted confirmation message to the user via Telegram.

If the input is not audio, the bot politely informs the user that only voice messages are accepted.

Telegram Trigger
Start the flow when a new message is sent to your bot.
Check Message Type
Use a conditional node to confirm it's a voice message.
Download Voice Message
Download the .oga file from Telegram.
Transcribe Audio
Send the binary audio to OpenAI Whisper or your transcription service.
Upload to Google Drive
Backup the original audio file.
Merge Outputs
Combine transcription with Drive metadata.
Transform to Row Format
Prepare structured JSON for Google Sheets.
Append to Google Sheet
Store the transcript log (DateTime, Duration, Transcript, AudioURL).
Send Confirmation to User
Inform the user via Telegram with their transcript and download link.
Unsupported Message Handler
Reply to users who send non-audio messages.

DateTime	Duration	Transcript	AudioURL
2025-08-07T13:12:19Z	27	Dự án Outlet Activation là...	https://drive.google.com/uc?id=xxxx&export=download

Created with ❤️ using n8n