Category: AI Safety

Data Poisoning in Machine Learning: Why and How People Manipulate Training Data

Do you know where your data has been? By Stephanie Kirmer. First appeared on Towards Data Science.

January 18, 2026
The Problem with AI Browsers: Security Flaws and the End of Privacy

How Atlas and most current AI-powered browsers fail on three aspects: privacy, security, and censorship. By Mike Huls. First appeared on Towards Data Science.

December 2, 2025
How to Build Guardrails for Effective Agents

Learn how to set up effective guardrails to enforce desired behaviour from your agents. By Eivind Kjosbakken. First appeared on Towards Data Science.

October 20, 2025
How To Build Effective Technical Guardrails for AI Applications

Exploring the most practical guardrails to implement at ground level. By Nidhin Karunakaran Ponon. First appeared on Towards Data Science.

October 7, 2025
Hands-On with Agents SDK: Safeguarding Input and Output with Guardrails

A practical exploration of how guardrails safeguard multi-agent systems in Python using OpenAI Agents SDK, Streamlit, and Pydantic. By Iqbal Rahmadhan. First appeared on Towards Data Science.

September 7, 2025
The Westworld Blunder

We’re entering an interesting moment in AI development. AI systems are getting memory, reasoning chains, self-critiques, and long-context recall. These capabilities are exactly some of the things that I’ve previously written would be prerequisites for an AI system to be conscious. Just to be clear, I don’t believe today’s AI systems are self-aware, but…

May 13, 2025
We Need a Fourth Law of Robotics in the Age of AI

Artificial Intelligence has become a mainstay of our daily lives, revolutionizing industries, accelerating scientific discoveries, and reshaping how we communicate. Yet, alongside its undeniable benefits, AI has also ignited a range of ethical and social dilemmas that our existing regulatory frameworks have struggled…

May 7, 2025
When OpenAI Isn’t Always the Answer: Enterprise Risks Behind Wrapper-Based AI Agents

“Wait… are you sending journal entries to OpenAI?” That was the first thing my friend asked when I showed her Feel-Write, an AI-powered journaling app I built during a hackathon in San Francisco. I shrugged. “It was an AI-themed hackathon, I had to…

April 29, 2025
The Urgent Need for Intrinsic Alignment Technologies for Responsible Agentic AI

Advancements in agentic artificial intelligence (AI) promise to bring significant opportunities to individuals and businesses in all sectors. However, as AI agents become more autonomous, they may use scheming behavior or break rules to achieve their functional goals. This can lead to the machine…

March 5, 2025