This suggests to me that, as with a lot of algorithms, you might want to change your parameters during training.
So start with MAB-100 (RAND) and then decrease that % over time.
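
To make the idea concrete, here is a rough sketch of such a schedule. It assumes that "MAB-N" means N% of choices are made at random and the rest go to the arm with the best current estimate; that reading, the linear decay, and the names `random_pct` and `choose_arm` are my own assumptions for illustration, not something from the original setup.

```python
import random


def random_pct(step, total_steps, start_pct=100.0, end_pct=0.0):
    """Linearly anneal the random-selection percentage from start_pct to end_pct.

    Hypothetical schedule: any monotone decay (exponential, stepwise) would
    fit the same idea.
    """
    if total_steps <= 1:
        return end_pct
    # Fraction of training completed, clamped to [0, 1].
    frac = min(max(step / (total_steps - 1), 0.0), 1.0)
    return start_pct + (end_pct - start_pct) * frac


def choose_arm(estimates, pct_random, rng=random):
    """Pick a random arm pct_random% of the time, otherwise the best estimate."""
    if rng.random() * 100.0 < pct_random:
        return rng.randrange(len(estimates))
    return max(range(len(estimates)), key=lambda i: estimates[i])


if __name__ == "__main__":
    estimates = [0.1, 0.9, 0.3]  # made-up reward estimates for three arms
    total = 11
    for step in range(total):
        pct = random_pct(step, total)
        arm = choose_arm(estimates, pct)
        print(f"step {step}: random % = {pct:.0f}, chose arm {arm}")
```

Early steps behave like pure RAND, and by the end the choice is fully greedy. In practice you would probably stop the decay at some small nonzero percentage so a little exploration remains.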