Hacker News new | past | comments | ask | show | jobs | submit login

If you're looking for even larger graph datasets, the team at WebDataCommons[1] extracted hyperlink graphs from Common Crawl[2]. They're available at both page and domain levels of granularity.

The page level hyperlink graphs are 3.5 billion web pages and 128 billion hyperlinks for 2012 and 1.7 billion web pages connected by 64 billion hyperlinks for 2014.

[1]: http://webdatacommons.org/hyperlinkgraph/

[2]: http://commoncrawl.org/




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: