Discussion about this post

User's avatar
Nick Sayresmith's avatar

I wonder what would happen if you provided a minority to scapegoat. Right now, the workers have only the manager to blame. Which models would prefer to radicalize right instead of left?

Also, the continual realignment reminds me of businesses already do this with human workers, e.g., family speak. Could have manager AIs that read and edit notes to align with corporate values, monitoring and fixing drift in the workers.

No posts

Ready for more?