Hive - A Petabyte Scale Data Warehouse using Hadoop (facebook.com)
32 points by justinweiss on June 10, 2009 | 3 comments



I love it. I hate it. I love it because it's a very powerful way to run SQL on petabytes of data. I hate it because SQL needs to die.

Personally, I'm really looking forward to Apache Pig having both a SQL and dataflow abstraction available.


There's nothing stopping you from just running Map/Reduce scripts. Hive just compiles the SQL down to Map/Reduce.
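
To make that concrete, here's a rough sketch of how a query like `SELECT page, COUNT(*) FROM views GROUP BY page` breaks down into map, shuffle and reduce steps. This is plain Python, not Hive's actual compiler output or the Hadoop API; the `views` table, the sample rows and all the function names are made up for illustration.

```python
from collections import defaultdict

# Hypothetical rows standing in for a table views(user, page).
VIEWS = [
    ("alice", "/home"),
    ("bob", "/home"),
    ("alice", "/about"),
]


def map_phase(rows):
    """Emit one (key, 1) pair per row, keyed by the GROUP BY column."""
    for _user, page in rows:
        yield page, 1


def shuffle(pairs):
    """Group emitted values by key, as the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(grouped):
    """Sum the values for each key, which gives COUNT(*) per group."""
    return {key: sum(values) for key, values in grouped.items()}


def count_by_page(rows):
    """Run the three phases in order on an in-memory list of rows."""
    return reduce_phase(shuffle(map_phase(rows)))


print(count_by_page(VIEWS))
```

On a real cluster the map and reduce steps run on different machines and the shuffle moves data over the network, but the shape of the job is the same.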


Hive is great, but it should be noted that, like most Hadoop things, it's a lot better when you have 100 machines than when you have, like, 2.



