Monday, July 14, 2025

Gemini 2.5’s native audio capabilities

Share


Security and duty

We’ve proactively assessed potential dangers all through each stage of the event course of for these native audio options, utilizing what we’ve discovered to tell our mitigation methods. We validate these measures by means of rigorous inner and exterior security evaluations, together with complete red teaming for accountable deployment. Moreover, all audio outputs from our fashions are embedded with SynthID, our watermarking know-how, to make sure transparency by making AI-generated audio identifiable.

Native audio capabilities for builders

We’re bringing native audio outputs to Gemini 2.5 fashions, giving builders new capabilities to construct richer, extra interactive purposes by way of the Gemini API in Google AI Studio or Vertex AI.

To start exploring, builders can attempt native audio dialog with Gemini 2.5 Flash preview in Google AI Studio’s stream tab. Controllable speech technology (TTS) is offered in preview for each Gemini 2.5 Professional and Flash by deciding on speech technology within the generate media tab inside Google AI Studio.



Source link

Read more

Read More