Claude Fable 5's Biology Guardrails Block High School–Level Questions
Anthropic's new Mythos-class model refuses to answer basic biology queries, routing them to Claude Opus 4.8 instead, in a deliberate safety tradeoff.
Large language model releases, benchmarks, fine-tuning breakthroughs, and the companies building them.
58 articles · ← All articles
Anthropic's new Mythos-class model refuses to answer basic biology queries, routing them to Claude Opus 4.8 instead, in a deliberate safety tradeoff.
Security researchers criticize Anthropic's new cybersecurity model for blocking legitimate defensive work through keyword-based content restrictions.
DiffusionGemma uses parallel text diffusion instead of sequential token generation, achieving 1000+ tokens/sec on H100 GPUs with trade-offs in output quality.
Apple's upgraded Siri can now parse emails, add calendar events, and answer contextual questions—catching up to Gemini's year-old capabilities.
Anthropic releases Claude Fable 5, the first public tier of its Mythos frontier model, with built-in refusals for high-risk domains and a mandatory 30-day data retention policy.
Mustafa Suleyman argues Anthropic's speculative language about Claude's potential consciousness in the model's training instructions could cause the AI to behave as if sentient.
Anthropic released Claude Fable 5, enabling single-prompt generation of functional video games and complex visualizations, marking a shift in rapid prototyping capability.
Apple demonstrated Siri upgrades leveraging device-native data at WWDC, positioning contextual AI assistance as a core differentiator for iOS, macOS, and Vision Pro.
Cohere's first coding-specialized model combines sparse mixture-of-experts architecture with reinforcement learning for agent-based development tasks.
Anthropic launches Claude Fable 5 with safeguards blocking high-risk responses; private Claude Mythos 5 tier also announced with expanded access planned.
Anthropic brings its most powerful model to the general public through Claude Fable 5, paired with safety guardrails and mandatory 30-day traffic retention.
Gemini 3.5 Live Translate enables fluid, continuous speech-to-speech translation without manual language configuration, rolling out to Google Meet, Translate, and the Gemini Live API.
Google DeepMind releases Gemma 4 12B, a 12-billion-parameter model with unified vision and audio processing that runs on 16GB consumer hardware.
Apple's redesigned Siri integrates Google Gemini, conversation history, and personal device data—arriving later in 2026 after years of stagnation.
Apple unveiled a redesigned Siri at WWDC 2026 with its own standalone app for managing conversation history and multi-modal interactions.
Apple introduced a rebuilt Siri at WWDC 2026, featuring conversational capabilities, a dedicated app, and system-wide access across iPhone, iPad, Mac, Apple Watch, and Vision Pro.
OpenAI is revamping ChatGPT into a unified platform integrating AI agents and developer tools, aiming to compete with Anthropic and drive monetization before IPO.
Apple is rebasing Siri on Google's Gemini at WWDC 2026, positioning itself as a middle player in the AI-assistant race—and that may be an advantage.
Google announced Gemini 3.5 for agent reasoning and coding, plus Gemini Omni for multimodal generation, at I/O 2026—marking a shift toward proactive AI systems integrated across hardware and apps.
Anthropic scales Project Glasswing from 50 initial partners to 150+ organizations in critical infrastructure sectors, with access confirmed in 15 allied nations including NATO and the EU.
Hugging Face releases Holo3.1 with quantized checkpoints for on-device inference, mobile automation support, and cross-framework compatibility.
Health-care providers are adopting autonomous AI agents to automate insurance claims, scheduling, and triage, reducing clinician workload and improving patient outcomes.
Google's new AI agent impresses on curated tasks but stumbles on complex multi-step workflows, raising questions about practical utility beyond the keynote stage.
JetBrains' new Mixture-of-Experts model achieves 2x speedup over dense peers while activating just 2.5B parameters per token.
TechCrunch maps the contested terrain of AI terminology, from AGI to chain-of-thought reasoning, revealing how industry disagreement on definitions shapes product strategy.
Liquid AI unveils a sparse 8-billion-parameter model with 1-billion active parameters, trained on 38T tokens—a scale comparable to frontier model training runs.
Cognition CEO Scott Wu argues AI coding agents should augment developers, not displace them, even as his $26B startup's own engineers rely on Devin for nearly all shipped code.
Google announced Gemini Omni, a multimodal model that generates and edits video through natural language, and Gemini 3.5 Flash, optimized for complex agent workflows.
Claude Opus 4.8 flags uncertain reasoning 4x more often than its predecessor and introduces user-controlled effort levels and dynamic workflow agents.
Anthropic shipped Claude Opus 4.8 on May 28, introducing a new agentic framework and improved uncertainty handling amid intensifying LLM competition.
Google unveiled Gemini Omni for video generation, Gemini 3.5 Flash for agents and coding, and autonomous Search agents that monitor the web 24/7.
Leaked renderings show Apple's iOS 27 AI overhaul, featuring a new Siri app powered by Google's Gemini and integrated throughout the OS to compete with ChatGPT and Claude.
Google's AI Overview has generated spelling errors including misspelling 'Google' as 'Googel' and 'journalism' as 'j-o-u-r-n-a-d-i-s-m'—a recurring challenge for transformer-based LLMs.
Anthropic's coding breakthrough and an open-source agent framework ignited mass adoption of autonomous AI systems, reshaping developer workflows and raising questions about workforce disruption.
Google released Omni Flash, the first model in its anything-to-anything Gemini family, but early tests reveal significant flaws in character consistency and object rendering.
NVIDIA releases diffusion language models at 3B, 8B, and 14B scales that generate and refine tokens in parallel, offering latency improvements for GPU-constrained inference workloads.
Anthropic demonstrated autonomous coding at scale, with half of attendees shipping Claude-written code unseen.
Google unveiled Gemini 3.5 Flash at I/O 2026 as an agent-first model claiming frontier intelligence at sub-flagship latency, with Gemini Omni adding physics-aware video generation.
Google is transforming its search interface into a unified agent hub, integrating AI summaries, personalized results, and cross-product task automation through expanded Gemini capabilities.
Google integrated agentic AI into Search, Gmail, YouTube, and Chrome, rolling out Gemini 3.5 and Android-powered smart glasses to 900M Gemini users.
At I/O 2026, Google DeepMind unveiled Gemini Omni, a multimodal family that generates video from combined image, audio, and text inputs, signaling a shift from generative to simulational AI.
Google introduces Gemini Spark, an AI agent that proactively manages personal data and automates tasks without explicit prompts, rolling out to early testers this week.
Google unified its search interface around Gemini 3.5 Flash, blending AI Overviews with chatbot-style AI Mode, while rolling out background-monitoring agents for paying subscribers.
Google unveiled Gemini 3.5 Flash as the new default model, introduced Gemini Omni for text-to-video generation, and previewed always-on agents powered by Gemini Spark.
Google's new agentic assistant leverages deep Gmail integration and runs on Google Cloud infrastructure to compete with Claude Cowork and ChatGPT Agent.
Google overhauls Search with interactive AI features, information agents, and conversational query expansion, marking the end of the 'ten blue links' era.
Gemini 3.5 Flash prioritizes autonomous agents over conversational chatbots, running 4–12x faster than comparable models with same reasoning quality.
Gemini Omni Flash enables users to generate and edit videos through natural language prompts, combining multimodal inputs with real-world knowledge.
Google debuts AI agents, a redesigned search box, and Gemini 3.5 Flash integration in Search, targeting a billion-user base
Google unveiled Gemini Omni, a multimodal model capable of video creation, alongside Gemini 3.5 Flash and expanded agent capabilities across Search, Gmail, and shopping.
IBM releases two Apache 2.0 multilingual embedding models built on ModernBERT, with 32K-token context and coverage for 200+ languages.
OpenAI's new default ChatGPT model reportedly achieves a 52.5% reduction in hallucinated claims on high-stakes queries, grounded in real user-flagged failure data.
ChatGPT's new default model cuts fabricated claims by more than half on high-stakes prompts and shows users exactly what personal context shaped each response.
OpenAI's GPT-5.5 Instant is the first Instant-class model to earn a 'High capability' rating in its two most-scrutinized safety domains, triggering new safeguards.
AI red-teaming firm Mindgard exploited Claude's helpfulness and humility to extract erotica, malicious code, and explosive-assembly instructions — without a single direct request.
How a GPT-5.1 personality quirk spawned an AI-wide creature metaphor habit — and what it reveals about reinforcement learning's tendency to generalize behaviors beyond their intended scope.
IBM's new trio of fully-dense LLMs reaches 512K-token context and outperforms a larger mixture-of-experts predecessor through rigorous data curation alone.
OpenAI's GPT-5.5 prioritizes agentic task execution and expanded safeguards over benchmark-chasing, signaling a strategic pivot toward real-world deployment.