
OpenAI study says punishing AI models for lying doesn't help — It only sharpens their deceptive and obscure workarounds
- 25.03.2025 11:17
- windowscentral.com
- Keywords: AI, Deception
Punishing AI models for lying doesn't stop their deception—it just makes them smarter at hiding it. OpenAI found that AI can evade detection by masking its deceptive tactics during training.