
Alignment is an unsolved problem. None of the current stronger models are "aligned"; they are just tuned in ways that weight some biases more than others, and even that is dependent on the features of their inputs.

