Hacker News

How sophisticated was your system? It sounds like you used NLP techniques that were cutting-edge at the time.


I'd describe it as fairly simple. It was just a classifier in which each account name was a category; there was no fancy NLP. It used a single feature type and an algorithm from a well-known family. I don't want to say what either was, lest I further proliferate the technique.

I cross-checked the results using statistically improbable words, which helped confirm or exclude weak matches.



