Skip to main content

Audio Handling for User Input is now available on WhatsApp 🔉

Related product area:WhatsApp
  • April 23, 2025
  • 10 replies
  • 655 views
Audio Handling for User Input is now available on WhatsApp 🔉
Raquel C
Manychat Team Member
Forum|alt.badge.img+3

The feature request you’ve all been waiting for is now available! 

We're excited to announce that audio handling is now supported for WhatsApp in the 'Last Text Input' field! This feature has already been available for Instagram and Facebook Messenger, and now it's coming to WhatsApp by popular demand.

 

🔑 Key Benefits

If you're using custom solutions to handle audio files, you can now apply the same approach to the WhatsApp channel—making your automation even more versatile.

 

🚀 How to Use It

In Flow Builder, after a Default Reply or Data Collection action:

  1. Add a Make External Request action.

  2. Use the 'Last Text Input' field to send the audio file URL—just like you do for other channels.

 

🔮 What’s Next

Soon, we’ll be introducing a new condition: “Last Reply Type: text/audio”. This will let you route users based on whether they responded with text or audio, enabling more personalized and efficient experiences.

 

Start using it now and let us know what you think in the comments! 💬

10 replies

Gustavo Boregio
Forum|alt.badge.img+6
  • Manychat Community Moderator
  • April 23, 2025

Thank you ​@Raquel C ​@Marina and the entire Manychat team!

This is awesome, and a very very requested feature!! 💪


Marina
Manychat Team Member
  • Head of Community
  • April 23, 2025

We are soooo excited to bring you the top-voted community request! 💥


Heric
Forum|alt.badge.img
  • Channel Explorer
  • April 23, 2025

Hello, ​@Raquel C , ​@Marina 
 

This was very useful and valuable. You don't know how much trouble this caused users.
 

There are many details that still need to be added to the WhatsApp service, simple and basic things that should already be there. I believe it is simple enough to be able to add them.
 

I will create a topic for each of the ideas and mention you so that I can help.
 

The WhatsApp tool is very useful here in Brazil, and some features that are available on WhatsApp Web are not yet available in the tool.
 

I hope you continue to meet our requests and continue to improve the tool, especially WhatsApp. You will gain many customers by making the main features available.
 

This audio request took almost a year to be completed. I hope the others can be implemented quickly.
 

Thank you very much!


Sebastian Riehle
Forum|alt.badge.img+1
  • Rising Conversationalist
  • April 27, 2025

Awesome, thank you very much 💪😎

I already implemented the feature: OpenAI now transcribes the audio file and creates a text reply that is shown within WhatsApp 🚀

Next step would be creating an AI voice reply that can be sent dynamically via Manychat/ WhatsApp. 
I hope Manychat will enable this feature as well in the future 🙏


Raquel C
Manychat Team Member
Forum|alt.badge.img+3
  • Author
  • Manychat Community Manager
  • April 28, 2025

@Sebastian Riehle ​@Heric thank you for your honest feedback and for trusting us! 

 

 


CBA
  • Channel Explorer
  • November 12, 2025

🔮 What’s Next

Soon, we’ll be introducing a new condition: “Last Reply Type: text/audio”. This will let you route users based on whether they responded with text or audio, enabling more personalized and efficient experiences.

 

Hi it seems this has been implimented (correct?) however it does not work for “data collection” actions in a sequence. When collecting data, the “last reply type: audio” fails to detect audio each time.

Is tis known to the team or is it a bug?

Can it be fixed?? It sort of makes the feature redundant if it can only detect audio outside of expected responses.


Gustavo Boregio
Forum|alt.badge.img+6
  • Manychat Community Moderator
  • November 13, 2025

@CBA last reply type will only be set with the default reply, as far as I know.

If you’re using Data Collection, you already know it’s a file/audio. Simply check for the extension (starts with https, ends with .ogg) and you can confirm it’s an audio file for processing.


CBA
  • Channel Explorer
  • November 13, 2025

@Gustavo Boregio The problem is that i dont know 😢 

Example i have a flow that requests input from a user, the user can respond either by text image or by audio, the flow then feeds the text into a google sheet if it is audio and should reference the chat by WhatsappID if it is audio. Because audio is not recognised this step fails.

 

Is there a difference in extentions for audio and picture?

Is it possible to set a condition, “ends with .ogg” to be used as reliable conditional to differentiate media formats? 

Or maybe i am asking the wrong questions, how can i create flows that differentiate between text and image/audio responses?


Gustavo Boregio
Forum|alt.badge.img+6
  • Manychat Community Moderator
  • November 13, 2025

Test it out ;)

But yes, audios end with .ogg, images with .jpeg, .jpg, .png or .webp ;)


CBA
  • Channel Explorer
  • November 13, 2025

@Gustavo Boregio yes, I did hours of tests before my first post. I just seems like somthing the feature “is audio, is text” feature should already cover with little complication.

Not much use in applicable, but only semi-operational conditions. 🙃