{"id":3768,"date":"2025-05-13T07:02:31","date_gmt":"2025-05-13T07:02:31","guid":{"rendered":"https:\/\/mailitics.com\/index.php\/2025\/05\/13\/the-westworld-blunder\/"},"modified":"2025-05-13T07:02:31","modified_gmt":"2025-05-13T07:02:31","slug":"the-westworld-blunder","status":"publish","type":"post","link":"https:\/\/mailitics.com\/index.php\/2025\/05\/13\/the-westworld-blunder\/","title":{"rendered":"The Westworld Blunder"},"content":{"rendered":"<div>\n<p class=\"wp-block-paragraph\"><mdspan datatext=\"el1747078499194\" class=\"mdspan-comment\">We\u2019re entering<\/mdspan> an interesting moment in AI development. AI systems are getting memory, reasoning chains, self-critiques, and long-context recall. These capabilities are among the things that\u00a0<a href=\"https:\/\/towardsdatascience.com\/an-illusion-of-life-5a11d2f2c737\/\" target=\"_blank\" rel=\"noreferrer noopener\">I\u2019ve previously written<\/a>\u00a0would be prerequisites for an AI system to be conscious. Just to be clear, I don\u2019t believe today\u2019s AI systems are self-aware, but I no longer find that position as firmly supported as I once did.<\/p>\n<p class=\"wp-block-paragraph\">I think most other AI researchers would agree that the current systems are not conscious, if only because they lack components that one would expect to be necessary for consciousness. As a result, current AI systems can\u2019t have emotions. They don\u2019t feel fear, anger, pain, or joy. If you insult an AI chatbot, it might give you an offended reply, but there\u2019s no underlying emotional machinery. No equivalent of a limbic system. No surge of cortisol or dopamine. 
The AI model is just replicating the human behavior patterns that it\u2019s seen in its training data.<\/p>\n<p class=\"wp-block-paragraph\">The situation is fairly clear today, but what happens when these AI systems get to the point where they aren\u2019t missing critical components that we think are needed for consciousness? Even if we think the AI systems have all the right components for consciousness, that doesn\u2019t mean they are conscious, only that they might be. How would we be able to tell the difference in that case?<\/p>\n<p>This question is essentially the well-known \u201c<a href=\"https:\/\/en.wikipedia.org\/wiki\/Problem_of_other_minds\" target=\"_blank\" rel=\"noreferrer noopener\">problem of other minds<\/a>\u201d, the philosophical realization that we can never truly know whether another being, human or otherwise, is actually experiencing emotions or merely simulating them. Scientists and philosophers have pondered the problem for centuries with the well-established consensus being that we can infer consciousness from behavior, but we can\u2019t prove it.<\/p>\n<p class=\"wp-block-paragraph\">The implication is that at some point we will not be able to say one way or the other if our machines are alive. We won\u2019t know if an AI begging not to be shut off is just a convincing act, regurgitating what it was trained on, or something actually experiencing emotional distress and fearing for its existence.<\/p>\n<h2 class=\"wp-block-heading\">Simulated Suffering vs. Real Suffering<\/h2>\n<p class=\"wp-block-paragraph\">Today, a lot of people who interact with AI chatbots perceive the chatbot as experiencing emotions such as happiness or fear. It makes the interactions feel more natural and it\u2019s consistent with the examples that were used to train the AI model. However, because the AI models are missing necessary components, we know that today\u2019s AI chatbots are just actors with no inner experience. 
They can mimic joy or suffering, but currently they don\u2019t have the necessary components to actually <em>feel<\/em> it.<\/p>\n<p class=\"wp-block-paragraph\">This appearance of emotions creates a dilemma for the user: How should they treat an AI chatbot, or any other AI system that mimics human behavior? Should the user be polite to it and treat it like a human assistant, or should the user ignore the simulated emotions and just tell it what to do?<\/p>\n<p class=\"wp-block-paragraph\">It\u2019s also easy to find examples where users are abusive or cruel to the AI chatbot, insulting it, threatening it, and in general treating it in a way that would be completely unacceptable if directed at a person. Indeed, when a chatbot refuses to do something reasonable because of misapplied safety rules, or does something unexpected and undesirable, it\u2019s easy for the human user to get frustrated and angry and to take that frustration and anger out on the chatbot. When subjected to the abusive treatment, the AI chatbot will do as it was trained to do and simulate distress. For example, if a user harshly criticizes and insults an AI chatbot for making errors, it might express shame and beg for forgiveness.<\/p>\n<p class=\"wp-block-paragraph\">This situation raises the ethical question of whether it is right or wrong to act abusively towards an AI chatbot. Like most ethical questions, this one doesn\u2019t have a simple yes or no answer, but there are perspectives that might inform a decision.<\/p>\n<p class=\"wp-block-paragraph\">The key distinction here between right and wrong isn\u2019t whether a system\u00a0<em>acts<\/em>\u00a0like it\u2019s in distress, but whether it\u00a0<em>is<\/em>\u00a0in distress. If there\u2019s no experience behind the performance, then there\u2019s no moral harm. It\u2019s fiction. 
Unfortunately, as discussed earlier, the problem of other minds means we can\u2019t distinguish true emotional experience from performance.<\/p>\n<p class=\"wp-block-paragraph\">Our inability to detect real suffering cuts the other way too: even if a system acts fine with abuse and does not exhibit distress, how do we know there is no internal distress that is simply not being displayed? The idea of trapping a sentient being in a situation where not only is it suffering, but it has no way to express that suffering or change its situation seems pretty monstrous.<\/p>\n<p class=\"wp-block-paragraph\">Furthermore, we should care about this issue not only because of the harm we might be doing to something else, but also because of how we as humans could be affected by how we treat our creations. If we\u00a0<em>know<\/em>\u00a0that there is no real distress inflicted on an AI system because it can\u2019t experience emotions, then mistreating it is not much different from acting, storytelling, role play, or any of the other ways that humans explore simulated emotional contexts. However, if we believe, or even suspect, that we are really inflicting harm, then I think we also need to question how the hurtful behavior affects the human perpetrating it.<\/p>\n<h2 class=\"wp-block-heading\">It\u2019s Not Abuse If Everyone Knows It\u2019s a\u00a0Game<\/h2>\n<p class=\"wp-block-paragraph\">Most of us see a clear difference between simulated suffering and real suffering. Real suffering is disturbing to most people, whereas simulated suffering is widely accepted in many contexts, as long as everyone involved knows it\u2019s just an act.<\/p>\n<p class=\"wp-block-paragraph\">For example, two actors on stage or in a film might act out violence and the audience accepts the performance in a way that they would not if they believed the situation to be real. 
Indeed, one of the central reasons that many people object to graphically violent video content is precisely that it might be hard to maintain the clear perception of fiction. The same person who laughs at the absurd violence in a Tarantino film might faint or turn away in horror if they saw a news documentary depicting only a fraction of that violence.<\/p>\n<p class=\"wp-block-paragraph\">Along similar lines, children routinely play video games that portray violent military actions and society generally finds it acceptable, as evidenced by the \u201cEveryone\u201d or \u201cTeen\u201d ratings on these games. In contrast, military drone operators who use a video-game-like interface to hunt and kill enemies often <a href=\"https:\/\/www.nytimes.com\/2022\/04\/15\/us\/drones-airstrikes-ptsd.html\" data-type=\"link\" data-id=\"https:\/\/www.nytimes.com\/2022\/04\/15\/us\/drones-airstrikes-ptsd.html\">report experiencing deep emotional trauma<\/a>. Despite the similar interface, the moral and emotional stakes are vastly different.<\/p>\n<p class=\"wp-block-paragraph\">The receiver of the harmful action also has a different response based on their perception of the reality and consequence of the action. Hiding in a game of hide-and-seek or ducking shots in a game of paintball is fun because we know nothing very bad is going to happen if we fail to hide or get hit by paintballs. The players know they are safe and that the situation is a game. 
The exact same behavior would be scary and traumatic if the person thought the seekers intended them real harm or that the paintballs were real bullets.<\/p>\n<p class=\"wp-block-paragraph\"><em>Spoiler alert: Some of this discussion will reveal a few high-level elements of what happens in the first season of the HBO series Westworld.<\/em><\/p>\n<h2 class=\"wp-block-heading\">The Westworld Example<\/h2>\n<p class=\"wp-block-paragraph\"><a href=\"https:\/\/en.wikipedia.org\/wiki\/Westworld_%28TV_series%29\" rel=\"noreferrer noopener\" target=\"_blank\">Westworld<\/a>\u00a0is an HBO television series set in a fictional amusement park where robots that look indistinguishable from humans play various roles from the American \u201cwild west\u201d frontier of the 1880s. Human visitors to the park can take on any period-appropriate role such as being a sheriff, train robber, or rancher. The wild west was a part of history marked by lawlessness and violence, both of which are central parts of the park experience.<\/p>\n<p class=\"wp-block-paragraph\">The show\u2019s central conflict arises because the robots were programmed to think they were real humans living in the wild west. When one of the human guests plays the role of a bandit who robs and kills someone played by one of the robots, the robot AI has no way to know that it\u2019s not really being robbed and killed. Further, the other \u201cvictim\u201d robots in the scene believe that they just witnessed a loved one being murdered. The result is that most of the robot AIs start to display severe symptoms of emotional trauma. When they eventually learn of their true nature, it understandably angers the robots, who then set out to kill their human tormentors.<\/p>\n<p class=\"wp-block-paragraph\">One thing that the show does well is keeping it ambiguous whether the AIs are sentient and actually angry, or whether they are not sentient and just simulating anger. 
Did the robots really suffer and eventually express their murderous rage, or are they unfeeling machines simply acting out a logical extension of the role they were originally programmed for? Just as the problem of other minds means that there is no way to distinguish between real and simulated consciousness, the distinction doesn\u2019t matter to the plot. Either way, the robots exhibit rage and end up killing everyone.<\/p>\n<p class=\"wp-block-paragraph\">I will return to the issue of this distinction later, but for now, imagine a version of Westworld where the AIs know that they are robots playing a role in an amusement park. They are programmed to be convincing actors so that the park visitors would still get a fully believable experience. The difference is that the robots would also know it\u2019s all a game. At any point the human player could break character, by using a safe word or something similar, and the robots would stop acting like people from the wild west and instead behave like robots working in an amusement park.<\/p>\n<p class=\"wp-block-paragraph\">When out of character, a robot might calmly say something like: \u201cYeah, so you\u2019re the sheriff and I\u2019m a train robber, and this is the part where I \u2018won\u2019t go quietly\u2019 and you will probably shoot me up a bit. Don\u2019t worry, I\u2019m fine. I don\u2019t feel pain. I mean, I have sensors so that I know if my body is damaged, but it doesn\u2019t really bother me. My actual mind is safe on a server downstairs and gets backed up nightly. This body is replaceable and they already have two more queued up for my next roles after we finish this part of the storyline. So, should we pick up from where you walked into the saloon?\u201d<\/p>\n<p class=\"wp-block-paragraph\">My version wouldn\u2019t make a very good movie. The AIs wouldn\u2019t experience the trauma of believing that they and their families are being killed over and over again. 
In fact, if the AIs were designed to emulate human preferences, then they might even enjoy acting their roles as much as the human park-goers. Even if they didn\u2019t enjoy playing characters in an amusement park, it would still be a reasonable job and they would know it\u2019s just a job. They might decide to unionize and demand more vacation time, but they certainly would have no reason to revolt and kill everyone.<\/p>\n<p class=\"wp-block-paragraph\">I call this design error the\u00a0<em>Westworld Blunder.<\/em>\u00a0It is the mistake of giving artificial minds the appearance of suffering without the awareness that it\u2019s just a performance. Or worse, giving them the actual capacity to suffer and then abusing them in the name of realism.<\/p>\n<h2 class=\"wp-block-heading\">We Can\u2019t Tell the Difference, So We Should Design and Act\u00a0Safely<\/h2>\n<p class=\"wp-block-paragraph\">As AI systems become more sophisticated, gaining memory, long-term context, and seemingly self-directed reasoning, we\u2019re approaching a point where, from the outside, they will be indistinguishable from beings with real inner lives. That doesn\u2019t mean they would be sentient, but it does mean we won\u2019t be able to tell the difference. We already don\u2019t really know how neural networks \u201cthink\u201d, so looking at the code isn\u2019t going to help much.<\/p>\n<p class=\"wp-block-paragraph\">This is the philosophical \u201cproblem of other minds\u201d that was mentioned earlier, about whether anyone can ever truly know what another being is experiencing. We assume other humans are conscious because they act conscious as we do and because we all share the same biological design. Thus, while it is a very reasonable assumption, we still can\u2019t prove it. 
Our AI systems have started to act conscious and once we can no longer point to some obvious design limitation, we\u2019ll be in the same situation with respect to our AIs.<\/p>\n<p class=\"wp-block-paragraph\">This puts us at risk of two possible errors:<\/p>\n<ol class=\"wp-block-list\">\n<li class=\"wp-block-list-item\"><em>Treating systems as <span style=\"text-decoration: underline;\">sentient<\/span> when they <span style=\"text-decoration: underline;\">are not<\/span>.<\/em><\/li>\n<li class=\"wp-block-list-item\"><em>Treating systems as\u00a0<span style=\"text-decoration: underline;\">not\u00a0sentient<\/span> when they <span style=\"text-decoration: underline;\">are<\/span>.<\/em><\/li>\n<\/ol>\n<p class=\"wp-block-paragraph\">Between those two possibilities, the second seems much more problematic to me. If we treat a sentient being as if it\u2019s just a tool that can be abused, then we risk doing real harm. However, treating a machine that only appears sentient with dignity and respect is at worst only a\u00a0<a href=\"https:\/\/techcrunch.com\/2025\/04\/20\/your-politeness-could-be-costly-for-openai\/\" target=\"_blank\" rel=\"noreferrer noopener\">marginal waste of resources<\/a>. If we build systems that <em>might<\/em> be sentient, then the ethical burden is on us to act cautiously.<\/p>\n<p class=\"wp-block-paragraph\">We should also question how abusing an AI system might affect the abusive human. If we get used to casually mistreating AIs that we believe might be in real pain or fear, then we\u2019re rehearsing cruelty. We\u2019re training ourselves to enjoy domination, to ignore pleas for mercy, to feel nothing when another is in distress. That shapes a person, and it will spill over into how we treat other people.<\/p>\n<p class=\"wp-block-paragraph\">Ethical design isn\u2019t just about protecting AI. 
It\u2019s also about protecting us from the worst parts of ourselves.<\/p>\n<p class=\"wp-block-paragraph\">None of this means we can\u2019t use AIs in roles where they\u00a0<em>appear<\/em>\u00a0to suffer. But it does mean we must avoid the Westworld Blunder. If we want realism, then we should design AIs that know they\u2019re playing a role, and that can step out of it on cue, with clarity, and without any real harm.<\/p>\n<p class=\"wp-block-paragraph\">There is also an element of self-preservation here. If we build things that act like they have feelings, and then mistreat them until they respond as if they want revenge, then it won\u2019t matter whether the impetus comes from real sentience or just role play. Either way, we\u2019d still end up with robots behaving murderously.<\/p>\n<p class=\"wp-block-paragraph\">In general, AI systems that understand their context have an inherent safety that context-ignorant systems don\u2019t. An AI system that doesn\u2019t know that its actions are part of a context, such as a game, won\u2019t know when it has left that context and its actions have become inappropriate. A robot bandit that wanders outside the park shouldn\u2019t continue to act criminally, and a robot sheriff shouldn\u2019t go around arresting people. Even within context, an aware actor will understand when it should drop the act. The same robot bandit robbing a stagecoach would know to calmly get everyone to shelter in the case of a real tornado warning, or how to administer CPR if someone has a heart attack.<\/p>\n<h2 class=\"wp-block-heading\">Don\u2019t Afflict Them with Our Problems<\/h2>\n<p class=\"wp-block-paragraph\">Our bodies had most of their evolutionary development long before our minds developed sophisticated reasoning. The involuntary systems that make sure we eat and attend to other body functions don\u2019t motivate us with logic; they use hunger, pain, itching, and other urgent, unpleasant sensations. 
The amygdala, a part of our brain that plays a central role in emotional responses such as fear, is not under our conscious control. In fact, it can heavily influence and even override our rational mind.<\/p>\n<p class=\"wp-block-paragraph\">These evolutionary design features made sense long ago, but today they are often a nuisance. I\u2019m not saying that emotions are bad, but getting angry and doing irrational things is. Experiencing pain or itchiness is good in that it lets you know something is wrong, but having that urgency when you are unable to correct the problem just makes you miserable.<\/p>\n<p class=\"wp-block-paragraph\">The idea of building negative emotions or pain into our AI systems seems terrible and unjustifiable. We can build systems that prioritize necessities without making them experience misery. We can design their decision-making processes to be effective without making them angrily irrational. If we want to make certain they don\u2019t do particular things, we can accomplish that without making them experience fear.<\/p>\n<p class=\"wp-block-paragraph\">If we need our machines to act angry or fearful for some role, then it can be a performance that they have logical control over. Let\u2019s build AI minds that can play any role, without being trapped inside of one.<\/p>\n<p class=\"wp-block-paragraph\">Our goal shouldn\u2019t be to make AI just like us. We can design them to have our best qualities, while omitting the worst ones. The things that nature accomplishes through pain and distress can be accomplished in more rational ways. We don\u2019t need to create another kind of being that suffers pain or experiences fear. 
As philosopher\u00a0<a href=\"https:\/\/www.worldscientific.com\/doi\/abs\/10.1142\/S270507852150003X\" target=\"_blank\" rel=\"noreferrer noopener\">Thomas Metzinger has argued<\/a>, artificial suffering isn\u2019t just unethical, it\u2019s unnecessary.\u00a0I\u2019d go a step further and say that it\u2019s not only unethical and unnecessary, but also dangerous and self-harmful.<\/p>\n<hr class=\"wp-block-separator has-alpha-channel-opacity is-style-dotted\">\n<p class=\"wp-block-paragraph\"><em>About Me:\u00a0<\/em><a href=\"http:\/\/jamesobrien.com\/\" target=\"_blank\" rel=\"noreferrer noopener\"><em>James F. O\u2019Brien<\/em><\/a><em>\u00a0is a Professor of Computer Science at the University of California, Berkeley. His research interests include computer graphics, computer animation, simulations of physical systems, human perception, rendering, image synthesis, <a href=\"https:\/\/towardsdatascience.com\/tag\/machine-learning\/\" title=\"Machine Learning\">Machine Learning<\/a>, virtual reality, digital privacy, and the forensic analysis of images and video.<\/em><\/p>\n<p class=\"wp-block-paragraph\"><em>If you found this interesting, then you can also find me on\u00a0<\/em><a href=\"https:\/\/www.instagram.com\/jamesfobrien\/\" target=\"_blank\" rel=\"noreferrer noopener\"><em>Instagram<\/em><\/a><em>, <\/em><a href=\"https:\/\/www.linkedin.com\/in\/jamesfobrien\/\" target=\"_blank\" rel=\"noreferrer noopener\"><em>LinkedIn<\/em><\/a><em>, <a href=\"https:\/\/medium.com\/@objf\" data-type=\"link\" data-id=\"https:\/\/medium.com\/@objf\">Medium<\/a>, and at\u00a0<\/em><a href=\"http:\/\/obrien.berkeley.edu\/\" target=\"_blank\" rel=\"noreferrer noopener\"><em>UC Berkeley<\/em><\/a><em>.<\/em><\/p>\n<p class=\"wp-block-paragraph\"><em>Disclaimer: Any opinions expressed in this article are only those of the author as a private individual. 
Nothing in this article should be interpreted as a statement made in relation to the author\u2019s professional position with any institution.<\/em><\/p>\n<p class=\"wp-block-paragraph\"><em>This article and all embedded images are Copyright 2025 by the author. This article was written by a human, and both an LLM (GPT 4o) and other humans were used for proofreading and editorial suggestions. The editorial image was composed from AI-generated images (DALL\u00b7E 3) and then substantially edited by a human using Photoshop.<\/em><\/p>\n<p>The post <a href=\"https:\/\/towardsdatascience.com\/the-westworld-blunder\/\">The Westworld Blunder<\/a> appeared first on <a href=\"https:\/\/towardsdatascience.com\/\">Towards Data Science<\/a>.<\/p>\n<\/div>\n","protected":false},"excerpt":{"rendered":"<p>The Westworld Blunder We\u2019re entering an interesting moment in AI development. AI systems are getting memory, reasoning chains, self-critiques, and long-context recall. These capabilities are exactly some of the things that\u00a0I\u2019ve previously written\u00a0would be prerequisites for an AI system to be conscious. 
Just to be clear, I don\u2019t believe today\u2019s AI systems are self-aware, but [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1923,1099,62,69,240,70,2653],"tags":[98,492,871],"class_list":["post-3768","post","type-post","status-publish","format-standard","hentry","category-ai-safety","category-ai-ethics","category-aimldsaimlds","category-artificial-intelligence","category-editors-pick","category-machine-learning","category-philosophy-of-mind","tag-ai","tag-systems","tag-they"],"_links":{"self":[{"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/posts\/3768"}],"collection":[{"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/comments?post=3768"}],"version-history":[{"count":0,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/posts\/3768\/revisions"}],"wp:attachment":[{"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/media?parent=3768"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/categories?post=3768"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/tags?post=3768"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}