Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

maybe fine tuning should involve sending an LLM through grade school

actually I wonder if thats what we need to do

a simple socialization package that fine tunes



also, alignment package with reward and punishment. “bad model, bad model! oh come here, my good model!”




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: