Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Did you use any regularization? Did you train until the training accuracy converged?

I feel like the network has enough capacity to overfit 180 training samples



I experimented with a few regularization factors and ultimately settled on a lambda of 0.1. Rather than using a stopping criteria, I ran a fixed number of training iterations (~100) and just eyeballed the cost function results. Since my total training time was fairly brief (~2 minutes, tops), I had the luxury of designing the ANN somewhat heuristically.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: