A Q-learning Based Approach to Spectral Efficiency Maximization in Multiservice Wireless Systems
Juno Saraiva, Victor Farias Monteiro, Francisco Rafael Marques Lima, Tarcisio F. Maciel, Francisco R. P. Cavalcanti

DOI: 10.14209/sbrt.2019.1570557029
Evento: XXXVII Simpósio Brasileiro de Telecomunicações e Processamento de Sinais (SBrT2019)
Keywords:
Abstract
In this article, we study Radio Resource Allocation (RRA) as a non-convex optimization problem, in which the aim is to maximize the spectral efficiency subject to satisfaction guarantees in multiservice wireless systems. This problem has already been previously investigated in the literature and efficient heuristics have been proposed. However, in order to assess the performance of Machine Learning (ML) algorithms when solving optimization problems in the context of RRA, we revisit that problem and propose a solution based on a Reinforcement Learning (RL) framework. Specifically, our proposal is based on the Q-learning technique where an agent gradually learns a policy by interacting with its local environment, until reaching convergence. Thus, in this article, the task of searching for an optimal solution in a combinatorial optimization problem is transformed into finding an optimal policy in Q-learning. Lastly, through computational simulations we compare the state-of-art proposals of the literature with our approach and we show a near optimal performance of the latter for a well-trained agent.

Download