Author: Tingting Zhao
Language: English
File size: 1.2 MB

Similar: Efficient Sample Reuse in Policy Gradients with Parameter-based Exploration