
When learning ML at university, you assume that the data you have represents the environment well. You do the famous train/validation/test split and train your model.

However, in practice it is very hard to collect a good dataset. There is a great Twitter thread from Abubakar (CEO of Gradio) about this topic: https://twitter.com/abidlabs/status/1423067498862219267




Thanks for your answer. I'm seeing a tweet, but not a thread. Is that expected?


Yes, sorry. I meant he started a good conversation with his tweet.



