Hacker News new | past | comments | ask | show | jobs | submit login

> Another was for storing ~100M rows of data in a table with ~70 columns or so of largely text based data. Workload was predominantly random reads of subsets of 1M rows and ~20 columns at a time.

Kimball's dimensional modelling helps a lot in cases like this, since probably there is a lot of repeated data in these columns.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: