Audio Handling for User Input is now available on WhatsApp 🔉

🔑 Key Benefits

If you're using custom solutions to handle audio files, you can now apply the same approach to the WhatsApp channel—making your automation even more versatile.

🚀 How to Use It

In Flow Builder, after a Default Reply or Data Collection action:

Add a Make External Request action.
Use the 'Last Text Input' field to send the audio file URL—just like you do for other channels.

🔮 What’s Next

Soon, we’ll be introducing a new condition: “Last Reply Type: text/audio”. This will let you route users based on whether they responded with text or audio, enabling more personalized and efficient experiences.

Start using it now and let us know what you think in the comments! 💬

Gustavo Boregio
Community Moderator & Expert
Forum|Forum|10 months ago
April 23, 2025

Thank you @Raquel C @Marina and the entire Manychat team!

This is awesome, and a very very requested feature!! 💪

🧑‍💻 https://engimarketers.com | 📲 https://superarmeonline.com

Marina
Head of Community
Forum|Forum|10 months ago
April 23, 2025

We are soooo excited to bring you the top-voted community request! 💥

~Marina~

Heric
Channel Explorer
Forum|Forum|10 months ago
April 23, 2025

Hello, @Raquel C , @Marina

This was very useful and valuable. You don't know how much trouble this caused users.

There are many details that still need to be added to the WhatsApp service, simple and basic things that should already be there. I believe it is simple enough to be able to add them.

I will create a topic for each of the ideas and mention you so that I can help.

The WhatsApp tool is very useful here in Brazil, and some features that are available on WhatsApp Web are not yet available in the tool.

I hope you continue to meet our requests and continue to improve the tool, especially WhatsApp. You will gain many customers by making the main features available.

This audio request took almost a year to be completed. I hope the others can be implemented quickly.

Thank you very much!

Sebastian Riehle
Rising Conversationalist
Forum|Forum|10 months ago
April 27, 2025

Awesome, thank you very much 💪😎

I already implemented the feature: OpenAI now transcribes the audio file and creates a text reply that is shown within WhatsApp 🚀

Next step would be creating an AI voice reply that can be sent dynamically via Manychat/ WhatsApp.
I hope Manychat will enable this feature as well in the future 🙏

Raquel C
Author
Manychat Community Manager
Forum|Forum|10 months ago
April 28, 2025

@Sebastian Riehle @Heric thank you for your honest feedback and for trusting us!

CBA
Channel Explorer
Forum|Forum|3 months ago
November 12, 2025

🔮 What’s Next

Hi it seems this has been implimented (correct?) however it does not work for “data collection” actions in a sequence. When collecting data, the “last reply type: audio” fails to detect audio each time.

Is tis known to the team or is it a bug?

Can it be fixed?? It sort of makes the feature redundant if it can only detect audio outside of expected responses.

Gustavo Boregio
Community Moderator & Expert
Forum|Forum|3 months ago
November 13, 2025

@CBA last reply type will only be set with the default reply, as far as I know.

If you’re using Data Collection, you already know it’s a file/audio. Simply check for the extension (starts with https, ends with .ogg) and you can confirm it’s an audio file for processing.

🧑‍💻 https://engimarketers.com | 📲 https://superarmeonline.com

CBA
Channel Explorer
Forum|Forum|3 months ago
November 13, 2025

@Gustavo Boregio The problem is that i dont know 😢

Example i have a flow that requests input from a user, the user can respond either by text image or by audio, the flow then feeds the text into a google sheet if it is audio and should reference the chat by WhatsappID if it is audio. Because audio is not recognised this step fails.

Is there a difference in extentions for audio and picture?

Is it possible to set a condition, “ends with .ogg” to be used as reliable conditional to differentiate media formats?

Or maybe i am asking the wrong questions, how can i create flows that differentiate between text and image/audio responses?

Gustavo Boregio
Community Moderator & Expert
Forum|Forum|3 months ago
November 13, 2025

Test it out ;)

But yes, audios end with .ogg, images with .jpeg, .jpg, .png or .webp ;)

🧑‍💻 https://engimarketers.com | 📲 https://superarmeonline.com

CBA
Channel Explorer
Forum|Forum|3 months ago
November 13, 2025

@Gustavo Boregio yes, I did hours of tests before my first post. I just seems like somthing the feature “is audio, is text” feature should already cover with little complication.

Not much use in applicable, but only semi-operational conditions. 🙃

🔑 Key Benefits

🚀 How to Use It

🔮 What’s Next

🔮 What’s Next

Sign up

Welcome to the Manychat Community!

Scanning file for viruses.

This file cannot be downloaded