Decart's Oasis 3 world model brings photorealistic driving simulation to API, priced at $0.02 per second
Decart launches Oasis 3, an interactive world model for autonomous vehicle simulation, as the startup positions itself as the 'OpenAI of world models' with a $4B valuation.
Google Unveils Gemini Omni Video Generation and Gemini 3.5 Flash for Agentic AI
Google announced Gemini Omni, a multimodal model that generates and edits video through natural language, and Gemini 3.5 Flash, optimized for complex agent workflows.
Google I/O 2026: Gemini Omni, Multimodal Search, and AI Agents debut
Google unveiled Gemini Omni for video generation, Gemini 3.5 Flash for agents and coding, and autonomous Search agents that monitor the web 24/7.
Google's Gemini Omni Raises Questions About Video Generation Quality and Consistency
Google released Omni Flash, the first model in its anything-to-anything Gemini family, but early tests reveal significant flaws in character consistency and object rendering.
Google's Gemini Avatar Tool Generates Photorealistic Video Clones—With a Catch
Google's new Gemini avatar feature lets users create AI videos of themselves, but usage limits and setup quirks raise questions about accessibility and deepfake safeguards.
Google's Gemini Omni blurs the line between text prompt and video simulation
At I/O 2026, Google DeepMind unveiled Gemini Omni, a multimodal family that generates video from combined image, audio, and text inputs, signaling a shift from generative to simulational AI.
Google's Flow Avatars Bring Self-Deepfaking to Mainstream Creators
Google's Flow platform now lets users generate AI videos featuring digital clones of themselves, powered by the new Omni Flash model—a capability that mirrors OpenAI's defunct Sora app.
Google launches Gemini 3.5 models and Omni multimodal family at I/O 2026
Google unveiled Gemini 3.5 Flash as the new default model, introduced Gemini Omni for text-to-video generation, and previewed always-on agents powered by Gemini Spark.
Google DeepMind Launches Gemini Omni Flash for AI-Powered Video Generation and Editing
Gemini Omni Flash enables users to generate and edit videos through natural language prompts, combining multimodal inputs with real-world knowledge.
Google I/O 2026: Gemini Omni and Agent-First Development Mark Shift Toward Agentic AI
Google unveiled Gemini Omni, a multimodal model capable of video creation, alongside Gemini 3.5 Flash and expanded agent capabilities across Search, Gmail, and shopping.
NVIDIA Cosmos Predict 2.5 Fine-Tuning with LoRA/DoRA Cuts Robot Video Model Training to Single GPU
Hugging Face publishes parameter-efficient fine-tuning guide for NVIDIA's 2B-parameter world model, enabling domain adaptation for robotic manipulation on consumer hardware.
Runway Pivots From Video Generation to World Models, Betting Against Language-First AI
The video-generation startup is expanding into physics-aware world models, positioning itself as an alternative to Google's language-dominated AI strategy.
Chinese Short-Drama Studios Deploy AI to Mass-Produce Content at Industrial Scale
As Chinese short-drama platforms dominate global streaming, generative AI is collapsing production timelines from months to weeks while displacing traditional crew roles.