Hacker News new | past | comments | ask | show | jobs | submit login

So with tools like Sonarqube we get static code analysis on our code base which shows code duplication within itself. I wonder if there is some way to look for duplication from stack overflow? Not sure that is feasible, since no one is going to want to post all of their proprietary code to some API, but wonder if there is some system for local indexing SO that would also for local duplication detection.



StackExchange data dumps are (at least, were[1]) freely downloadable: https://meta.stackexchange.com/questions/2677/database-schem...

[1] not sure what happens since they "illegally" retrospectively relicensed all the content a few months ago.


Yeah I thought they had something like that. Might be interesting to play with. I meant more of a question as to if anyone had actually done something like that with it. Basically a Turn It In kind of tool that universities use for plagiarism detection, but for code.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: