Q-learning literature CHRISTOPHER J.C.H. WATKINS 25b Framfield Road, Highbury, London N5 1UU, England PETER DAYAN Centre for Cognitive Science, University of Edinburgh, 2 Buccleuch Place, Edinburgh EH8 9EH, Scotland
The popular Q-learning algorithm is known to overestimate action values under certain conditions. It was not previously known whether, in practice, such overestimations are common, whether they harm performance, and whether they can generally be prev
Femtocells are introduced to improve indoor coverage, which benefits both users and operators. However, they also inevitably create interference management issues in heterogeneous networks that consist of femtocells and macrocells. In this