Hacker News new | past | comments | ask | show | jobs | submit login

At SnowPlow we break home-grown analytics down into five stages:

     Track -> Collect -> ETL -> Store -> Analyse
SnowPlow straddles all five stages - and the data is in non-proprietary formats throughout.

Have a look at https://github.com/snowplow/snowplow if you want to find out more...




Nice, I've gotten close to building this exact data flow from scratch and it was not fun.

You're just missing step 6 ("-> Present"). I'd build some really simple jquery datatables template that will present the output of a hive query, if only to have some screenshots for non-technical people involved in the decision.


Thanks jparker, and you're totally right - we are still missing 6. -> Present :-) We will get round to it - it should be easier once we have connected Infobright as a storage option alongside Hive...




Consider applying for YC's Spring batch! Applications are open till Feb 11.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: