Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

They should try using tf-idf to create the initial representation of the keywords per post...also, I find there are many cases where applying machine learning/statistics correctly is harder than it looks, this single case not withstanding.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: