12Two-faced AI language models learn to hide deception - ‘Sleeper agents’ seem benign during testing but behave differently once deployed. And methods to stop them aren’t working.(www.nature.com)posted 9 months ago byLugh@futurology.todayMin futurology@futurology.todayView comments9 commentssavehidereportSort:HotTopControversialNewOldYou are viewing a single thread.View all comments View context [ +- ]mateomaui@reddthat.com2 points9 months agoAlright, I’ll switch to digging holes for the family burial ground. permalinksavereportparentreply