Hacker News new | past | comments | ask | show | jobs | submit login

Pandas is certainly excellent -- be aware of it's NA type promotion behavior before you start designing data analysis programs, however. I learned this the hard way:

http://pandas.pydata.org/pandas-docs/stable/gotchas.html#nan...




Another gotcha is variable type inference. Reading csv files can often produce varying column types. This can be a pain for any consistent data pipeline.


That's a good point, I've ran into problems with that before too. Thanks.




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: