I liked the OpenCourseWare lectures with John Tsitsiklis and ended up with a couple of books by Bertsekas: Neuro-Dynamic Programming and Introduction to Probability. Both Bertsekas and Tsitsiklis recommended the Sutton and Barto intro book for an intuitive overview. I liked it too.