#video-generation

Decart's Oasis 3 world model brings photorealistic driving simulation to API, priced at $0.02 per second

Tools Jun 11, 2026

Decart launches Oasis 3, an interactive world model for autonomous vehicle simulation, as the startup positions itself as the 'OpenAI of world models' with a $4B valuation.

Google Unveils Gemini Omni Video Generation and Gemini 3.5 Flash for Agentic AI

LLMs May 30, 2026

Google announced Gemini Omni, a multimodal model that generates and edits video through natural language, and Gemini 3.5 Flash, optimized for complex agent workflows.

Google I/O 2026: Gemini Omni, Multimodal Search, and AI Agents debut

LLMs May 29, 2026

Google unveiled Gemini Omni for video generation, Gemini 3.5 Flash for agents and coding, and autonomous Search agents that monitor the web 24/7.

Google's Gemini Omni Raises Questions About Video Generation Quality and Consistency

LLMs May 25, 2026

Google released Omni Flash, the first model in its anything-to-anything Gemini family, but early tests reveal significant flaws in character consistency and object rendering.

Google's Gemini Avatar Tool Generates Photorealistic Video Clones—With a Catch

Tools May 23, 2026

Google's new Gemini avatar feature lets users create AI videos of themselves, but usage limits and setup quirks raise questions about accessibility and deepfake safeguards.

Google's Gemini Omni blurs the line between text prompt and video simulation

LLMs May 21, 2026

At I/O 2026, Google DeepMind unveiled Gemini Omni, a multimodal family that generates video from combined image, audio, and text inputs, signaling a shift from generative to simulational AI.

Google's Flow Avatars Bring Self-Deepfaking to Mainstream Creators

Tools May 20, 2026

Google's Flow platform now lets users generate AI videos featuring digital clones of themselves, powered by the new Omni Flash model—a capability that mirrors OpenAI's defunct Sora app.

Google launches Gemini 3.5 models and Omni multimodal family at I/O 2026

LLMs May 20, 2026

Google unveiled Gemini 3.5 Flash as the new default model, introduced Gemini Omni for text-to-video generation, and previewed always-on agents powered by Gemini Spark.

Google DeepMind Launches Gemini Omni Flash for AI-Powered Video Generation and Editing

LLMs May 20, 2026

Gemini Omni Flash enables users to generate and edit videos through natural language prompts, combining multimodal inputs with real-world knowledge.

Google I/O 2026: Gemini Omni and Agent-First Development Mark Shift Toward Agentic AI

LLMs May 20, 2026

Google unveiled Gemini Omni, a multimodal model capable of video creation, alongside Gemini 3.5 Flash and expanded agent capabilities across Search, Gmail, and shopping.

NVIDIA Cosmos Predict 2.5 Fine-Tuning with LoRA/DoRA Cuts Robot Video Model Training to Single GPU

Tools May 18, 2026

Hugging Face publishes parameter-efficient fine-tuning guide for NVIDIA's 2B-parameter world model, enabling domain adaptation for robotic manipulation on consumer hardware.

Runway Pivots From Video Generation to World Models, Betting Against Language-First AI

Startups May 16, 2026

The video-generation startup is expanding into physics-aware world models, positioning itself as an alternative to Google's language-dominated AI strategy.

Chinese Short-Drama Studios Deploy AI to Mass-Produce Content at Industrial Scale

Industry May 16, 2026

As Chinese short-drama platforms dominate global streaming, generative AI is collapsing production timelines from months to weeks while displacing traditional crew roles.