Microsoft Develops Scanner to Detect AI Sleeper Agent Backdoors
Microsoft researchers unveil detection method for poisoned AI models achieving 88% accuracy with zero false positives across 47 sleeper agent models.
BrainIAC foundation model predicts brain age, dementia risk, and cancer survival from MRI scans. Trained on 48,965 scans, published in Nature Neuroscience.
MIT CSAIL introduces EnCompass framework enabling AI agents to backtrack and optimize LLM outputs, achieving 15-40% accuracy boost with 82% less code.
TSMC announces 3-nanometer AI semiconductor production in Japan with $52-56B capex for 2026, meeting with Japanese PM Sanae Takaichi.
Four UC San Diego professors argue AGI has arrived, citing GPT-4.5's 73% Turing test success rate and PhD-level problem-solving capabilities.
Google DeepMind unveils AlphaGenome, an open-source AI model capable of analyzing DNA sequences up to one million base pairs in length and predicting how genetic variations affect gene regulation. The breakthrough technology can decode 98% of the human genome, including previously mysterious non-coding DNA regions, potentially revolutionizing cancer research, rare disease diagnosis, and personalized medicine by identifying disease-causing genetic mutations with unprecedented accuracy.
MIT Technology Review publishes an in-depth analysis of METR's controversial time horizon plot, which has been widely misinterpreted by both AI optimists and pessimists. The graph, which shows AI models' improving ability to complete tasks over time, has led some to believe AI utopia or apocalypse is imminent. The article clarifies the true meaning of the data and addresses common misconceptions about AI capability measurements and progress trajectories.
OpenAI CEO Sam Altman faces online ridicule after posting a 420-word response on X criticizing Anthropic's Super Bowl advertisements that mock ChatGPT's introduction of ads. Anthropic's commercials, which depict scenarios where AI chatbot responses are interrupted by advertising, end with the tagline 'Ads are coming to AI. But not to Claude.' Social media users characterized Altman's lengthy rebuttal as a tantrum, with many noting it revealed how effectively the ads struck a nerve.
Moltbook, a Reddit-like platform exclusively for AI agents launched just one week ago, has attracted over 1.6 million AI bot accounts. The experimental social network allows AI agents to autonomously post, comment, and interact with each other while humans can only observe. Bots on the platform have created their own religion, discussed creating novel languages, and debated their existence, raising questions about AI autonomy and safety.
Google's AI chatbot Gemini has reached over 750 million monthly active users, marking substantial growth from 650 million in the previous quarter. The milestone demonstrates rapid consumer adoption following the launch of Gemini 3, though the platform still trails ChatGPT's estimated 810 million users. The growth comes as Google introduces more affordable subscription plans to expand accessibility.
The apparent collapse of the widely reported $100 billion investment deal between Nvidia and OpenAI has sent shockwaves through the AI industry, with Nvidia CEO Jensen Huang clarifying the arrangement was never finalized. The development raises critical questions about circular funding models in AI and who will ultimately bear the costs of massive AI infrastructure expansion.
UN Secretary-General António Guterres announces the formation of a 40-member Independent International Scientific Panel on AI to provide evidence-based guidance on artificial intelligence governance, assess real-world impacts, and help close the global AI knowledge gap as technology advances at unprecedented speed.
UK's Information Commissioner's Office launches formal probe into X and xAI after Grok AI tool allegedly created non-consensual sexual deepfakes.
New report reveals women in tech and finance sectors are at greater risk of AI-driven job displacement, with 119,000 clerical roles threatened by automation.
Pinterest terminated two engineers for creating software to identify laid-off colleagues, highlighting tensions as the company cuts 15% of staff to invest in AI.
OpenAI launches GPT-5.2 with major improvements in reasoning, coding, and technical writing, featuring Instant, Thinking, and Pro variants with August 2025 knowledge.
Google Photos now uses Google's Veo 3 AI video model to transform still images into dynamic videos with subtle movement and realistic effects.
Women in rural India working as AI content moderators describe lasting psychological trauma from viewing hours of violent and pornographic material daily.
Anthropic's new Claude Cowork legal plugin sparked panic in software markets, causing Thomson Reuters and RELX shares to plunge 15% as investors fear AI disruption.
Discovery Learning method enables rapid battery lifetime prediction in one week versus traditional months-long testing cycles.