Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

"AI Experts" learning the value of (automated) baseline evaluations and performance metrics

While calling an API is convenient (and with the implicit promise of always performing to a certain standard) this is anything but guaranteed

The examples on the thread are interesting, I wonder if the wording might have changed slightly or if the 'human fine tuning" loop might introduce certain instabilities in some specific tasks



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: