5 results
JULY 17, 2025 / Gemini
Veo 3, Google’s latest AI video generation model, is now available in paid preview via the Gemini API and Google AI Studio. Unveiled at Google I/O 2025, Veo 3 can generate both video and synchronized audio, including dialogue, background sounds, and even animal noises. This model delivers realistic visuals, natural lighting, and physics, with accurate lip syncing and sound that matches on-screen action.
JUNE 24, 2025 / Gemini
Imagen 4, Google's advanced text-to-image model, is now available in paid preview via the Gemini API and Google AI Studio, offering significant quality improvements, especially for text generation within images. The Imagen 4 family includes Imagen 4 for general tasks and Imagen 4 Ultra for high-precision prompt adherence, with all generated images featuring a non-visible SynthID watermark.
MAY 23, 2025 / Gemini
Announcing new features and models for the Gemini API, with the introduction of Gemini 2.5 Flash Preview with improved reasoning and efficiency, Gemini 2.5 Pro and Flash text-to-speech supporting multiple languages and speakers, and Gemini 2.5 Flash native audio dialog for conversational AI.
APRIL 15, 2025 / Gemini
Generate high-quality videos from text and image prompts with Veo 2, a video generation model, now generally available in the Gemini API and Google AI Studio to enhance your content creation and marketing efforts.
APRIL 9, 2025 / Gemini
Updates to the Gemini API, including the production readiness of Veo 2 for video generation, the preview of the Live API for real-time interactions, and the upcoming Gemini 2.5 Flash model, alongside the existing Gemini 2.5 Pro aim to enhance developer capabilities in building AI applications with improved thinking models, dynamic interactions, and high-quality video generation.