Hacker News new | past | comments | ask | show | jobs | submit login

The: "tf-idf + cosine similarity + LSA metrics" bit from Pattern is what you are looking for.



In other words, the vector module: http://www.clips.ua.ac.be/pages/pattern-vector




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: