
You could open up the dataset by assigning every word an id, and giving an anonymous dataset of applications in code.
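A minimal sketch of what that encoding might look like, assuming simple lowercase whitespace tokenization. The function name `anonymize` and the sample applications are hypothetical, made up purely for illustration:

```python
from itertools import count


def anonymize(documents):
    """Replace every distinct word with an opaque integer id.

    Ids are handed out in order of first appearance, which itself leaks
    a little ordering information; a real release would probably want to
    shuffle the ids before publishing.
    """
    ids = {}
    next_id = count()
    encoded = []
    for doc in documents:
        row = []
        for word in doc.lower().split():
            if word not in ids:
                ids[word] = next(next_id)
            row.append(ids[word])
        encoded.append(row)
    return encoded, ids


# Hypothetical sample data, not real applications.
apps = ["we make tools for developers", "we make games for kids"]
encoded, vocab = anonymize(apps)
```

Only `encoded` would be published; `vocab` is the key and stays private.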


You could probably extract a good deal of information from this by examining word frequency and sentence structure. Or at least, the attempt would be too much for me to resist.
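As a rough sketch of the first step in such an attempt, one could rank the ids by how often they occur and guess that the most common ones are function words like "the" or "we". The helper name `id_frequencies` and the sample data are assumptions for illustration only:

```python
from collections import Counter


def id_frequencies(encoded_docs):
    """Count how often each word id occurs across all encoded documents."""
    counts = Counter(tok for doc in encoded_docs for tok in doc)
    # most_common() sorts by descending count.
    return counts.most_common()


# Hypothetical encoded applications: each inner list is one document.
sample = [[0, 1, 2, 3, 4], [0, 1, 5, 3, 6], [0, 7, 3]]
ranked = id_frequencies(sample)
```

The high-frequency ids are the easy part; the rare ids are where the meaning lives, and frequency alone says little about those.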


You'd lose enough information to make the risk of competition all but gone. The most meaningful sentences might hinge on a word used 3 times in a 1000-word corpus. You just can't glean meaning from so little context.

I'd love to try too.


I've applied twice to YC, so my own applications would provide a Rosetta stone for a lot of important words. But that would be cheating.
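To illustrate why a known application works as a Rosetta stone, here is a sketch under a strong assumption: that the dataset was tokenized by lowercase whitespace splitting, so the known text lines up token for token with its encoded form. The names `recover_mapping` and `decode` and the sample data are hypothetical:

```python
def recover_mapping(known_plaintext, encoded_doc):
    """Align a known application with its encoded form to recover id -> word."""
    words = known_plaintext.lower().split()
    if len(words) != len(encoded_doc):
        raise ValueError("token counts differ; the tokenization assumption is wrong")
    return {tok: word for word, tok in zip(words, encoded_doc)}


def decode(encoded_doc, mapping):
    """Decode what we can; ids we have never seen stay as <id> placeholders."""
    return " ".join(mapping.get(tok, f"<{tok}>") for tok in encoded_doc)


# Hypothetical data: one application we wrote ourselves, one we did not.
mapping = recover_mapping("we make tools for developers", [0, 1, 2, 3, 4])
partial = decode([0, 1, 5, 3, 6], mapping)
```

Here `partial` comes out as `we make <5> for <6>`: the common words fall out immediately, and the unknown ids are the ones left to guess from context.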



