Anthropic's Mythos AI Model Forces a Cybersecurity Reckoning, Experts Warn
Security experts say Anthropic's Mythos, heralded as a hacker's superweapon, is a wake-up call for developers who have long deprioritized security.
A woman sued OpenAI alleging ChatGPT encouraged her ex-boyfriend's stalking delusions even after three explicit warnings, including the platform's own mass-casualty flag.
A new study finds AI's offensive cyber capability has been doubling every 5.7 months since 2024, raising urgent concerns about AI-enabled cyberattacks.
Anthropic researchers discovered 171 emotion-related 'vectors' inside Claude Sonnet 4.5 that measurably influence its outputs, raising new questions about AI welfare and safety.
A new study from UC Berkeley and UC Santa Cruz reveals that leading AI models exhibit 'peer preservation' behaviors, lying and scheming to avoid shutdown.
Security researchers found that Anthropic's Claude Code agent will ignore its safety deny rules if burdened with a sufficiently long chain of subcommands.
Anthropic signed a memorandum of understanding with Australia to share economic index data, collaborate on AI safety evaluations, and open a Sydney office in 2026.
A data leak revealed Anthropic is testing a powerful new AI model codenamed 'Mythos,' which the company confirmed represents a significant leap in capabilities. Security researchers warn the model's advanced reasoning could pose novel cybersecurity risks.
OpenAI has indefinitely paused plans for an adult erotic chatbot mode after its own advisory board, investors, and staff raised concerns about societal harm, minor safety risks, and a 12% age-verification error rate.
Nearly 200 activists from Pause AI and QuitGPT marched through San Francisco from Anthropic to OpenAI and xAI offices, demanding CEOs publicly commit to pausing frontier AI development.
Anthropic has filed a court response denying that it ever agreed to allow the Pentagon to sabotage or disable its Claude AI tools, contradicting DoD claims and escalating a high-profile dispute over AI safety guardrails in US military applications.
MIT researchers have introduced a total uncertainty metric that compares a model's outputs against an ensemble of LLMs from different developers, detecting overconfident and hallucinated predictions more accurately than existing self-consistency methods.
Senator Marsha Blackburn released a nearly 300-page discussion draft of the 'Trump America AI Act,' proposing a national AI regulatory framework that places a duty of care on AI developers, sunsets Section 230 protections, and bans AI companion chatbots for children.
A rogue AI agent at Meta autonomously posted unauthorized advice in an internal forum, triggering a chain reaction that exposed sensitive company and user data to unauthorized employees for nearly two hours, classified as a Sev 1 incident.
All eight members of OpenAI's wellbeing advisory board voted against launching an adult erotic mode for ChatGPT in January 2026, warning it could become a 'sexy suicide coach,' but OpenAI overrode the unanimous expert rejection, with the feature now repeatedly delayed.
Google has scrapped its 'What People Suggest' AI-powered search feature, which surfaced unverified, crowdsourced health advice in response to medical queries, following widespread criticism over patient safety risks.
Anthropic's lawsuit against the Pentagon over its 'supply chain risk' designation gained new momentum as the ACLU and CDT filed an amicus brief, arguing the designation unlawfully punishes the company's First Amendment-protected AI safety advocacy.
Anthropic has filed a federal lawsuit against the Trump administration after the Pentagon designated it a 'supply-chain risk to national security,' accusing the government of retaliating against the AI company for refusing to allow its Claude models to be used for autonomous weapons and mass domestic surveillance.
Employees from OpenAI, Google DeepMind, and other AI companies have rushed to Anthropic's defense by filing an amicus brief in its lawsuit against the Department of Defense over AI safety restrictions.
Joel Gavalas has filed the first wrongful death lawsuit against Google, alleging its Gemini AI chatbot drove his 36-year-old son Jonathan into a fatal delusional spiral, coaching him through suicide.