Hacker News
trilbyglens on Dec 19, 2024 | on: Alignment faking in large language models
Ya, it's interesting how that nuance gets lost on most people who watch the movie. Or maybe the wrong interpretation has just been encoded as "common knowledge", as it's easier to understand a computer going haywire and becoming "evil".