Chrome Built-in AI: Simple Multimodal Sound

This example demonstrates the ability of the built-in AI to handle multimodal input, specifically converting sound to text. You can either upload a sound file or record a message directly using your microphone. The AI will then transcribe the audio content.

Audio Input

Upload an audio file:


Record with your microphone:

Idle.

Status and Output

Ready. Choose an audio input method.


A Note on Chrome Flags

To use this feature, you must have the "Enable built-in AI" flag enabled in Chrome. Open a new tab, paste the link below, press Enter, enable the flag, and then restart Chrome.



My GitHub

You can find more of my work on my hpssjellis GitHub page:

By Jeremy Ellis LinkedIn