Some language reward models exhibit political bias even when trained on factual data

Some language reward models exhibit political bias even when trained on factual data










Large language models (LLMs) that drive generative artificial intelligence apps, such as ChatGPT, have been proliferating at lightning speed and have improved to the point that it is often impossible to distinguish between something written through generative AI and human-composed text. However, these models can also sometimes generate false statements or display a political bias.










Go to techxplore





Posted

in

,

by

Tags: