AI Safety Archives

Center for Human-Compatible AI

By AIstify Team Jun 11, 2026 • 4 mins read

The Center for Human-Compatible AI is a research center at the University of California, Berkeley, focused on AI alignment and human-compatible artificial intelligence.

LawZero

By AIstify Team Jun 11, 2026 • 4 mins read

LawZero is a nonprofit AI safety organization launched by Yoshua Bengio to research safer forms of advanced artificial intelligence.

METR

By AIstify Team Jun 11, 2026 • 4 mins read

METR is a nonprofit research institute that evaluates frontier AI models, with a focus on capabilities, long-horizon tasks, and safety-relevant risks.

Future of Life Institute

By AIstify Team Jun 11, 2026 • 4 mins read

The Future of Life Institute is a nonprofit organization that works to reduce large-scale risks from advanced technologies, with a major focus on artificial intelligence.

Center for AI Safety

By AIstify Team Jun 11, 2026 • 4 mins read

The Center for AI Safety is an American nonprofit organization focused on technical AI safety research, risk reduction, policy engagement, and field building.

News

Anthropic

By AIstify Team Jun 6, 2026 • 3 mins read

Anthropic is the AI safety and research company behind Claude, building frontier AI models and enterprise tools focused on reliability, interpretability, and steerability.

AI & Machine Learning, News

Anthropic Co-Founder Chris Olah Warns AI Labs Cannot Police Themselves

By Samantha Reed May 26, 2026 • 3 mins read

Anthropic co-founder Chris Olah told the Vatican that AI development cannot be left solely to technology companies, warning about commercial incentives, labor disruption, and the growing complexity of frontier AI systems.

AI & Machine Learning, Cybersecurity & Privacy, News

Anthropic Confirms Mythos AI Found More Than 10,000 Critical Software Vulnerabilities

By Marcus Lee May 25, 2026 • 4 mins read

Anthropic says its unreleased Mythos AI model has identified more than 10,000 high- and critical-severity software vulnerabilities as part of Project Glasswing, a cybersecurity initiative focused on protecting critical infrastructure and open-source software.

AI & Machine Learning, News, Regulation & Policy

Pope Leo Calls for Global AI Regulation and Warns of “Violent Culture of Power”

By Samantha Reed May 25, 2026 • 4 mins read

Pope Leo has called for stronger global oversight of artificial intelligence, warning that unchecked AI development could fuel misinformation, labor disruption, surveillance, and autonomous warfare.

AI & Machine Learning, News

Anthropic Co-Founder Says AI Shouldn’t Be Controlled Only by Tech Companies

By Samantha Reed May 25, 2026 • 3 mins read

Anthropic co-founder Chris Olah warned that AI development cannot be left solely to technology companies and called for greater oversight from governments, religious institutions, and civil society.

AI & Machine Learning, News, Regulation & Policy

Meta Expands AI Age Checks and Teen Protections Across Platforms

By Samantha Reed May 5, 2026 • 3 mins read

Meta is expanding its AI-driven age detection systems and Teen Account protections across Instagram and Facebook to better identify underage users and enforce safety measures. The move broadens geographic coverage and adds new visual analysis tools.

AI & Machine Learning, News, Regulation & Policy

OpenAI Details ChatGPT Safety Measures to Prevent Violent Misuse

By Daniel Mercer Apr 29, 2026 • 3 mins read

OpenAI outlined how ChatGPT detects and responds to potential threats of violence, including escalation to human reviewers and law enforcement. The update follows growing scrutiny of AI safety practices.

AI & Machine Learning, News

Families Sue OpenAI Over Canada School Shooting and ChatGPT Warnings

By Samantha Reed Apr 29, 2026 • 3 mins read

Families of victims in a Canadian school shooting have sued OpenAI, alleging it failed to alert authorities about warning signs in ChatGPT conversations. The case raises questions about AI oversight and duty of care.

AI & Machine Learning, News

Anthropic Introduces Identity Verification for Claude Users

By Daniel Mercer Apr 16, 2026 • 3 mins read

Anthropic is rolling out identity verification for Claude users to strengthen safety and compliance. The move introduces ID checks for certain features and use cases.

AI & Machine Learning, Cybersecurity & Privacy, News

Anthropic to Roll Out Mythos Access to U.K. Banks as Cybersecurity Tensions Escalate

By Samantha Reed Apr 16, 2026 • 3 mins read

Anthropic is giving U.K. banks controlled access to its Mythos model, marking a major step in the global rollout of AI-powered cybersecurity tools.