Q-Learning-Driven BP decoding for Polar Codes

Lucas M. Oliveira; Robert Mota Oliveira; Rodrigo C. de Lamare

doi:10.14209/sbrt.2021.1570727178

Q-Learning-Driven BP decoding for Polar Codes

Lucas M. Oliveira, Robert Mota Oliveira, Rodrigo C. de Lamare

DOI: 10.14209/sbrt.2021.1570727178

Evento: XXXIX Simpósio Brasileiro de Telecomunicações e Processamento de Sinais (SBrT2021)

Keywords: Q-Learning Reinforcement Learning Belief Propagation Polar Codes

Abstract

This paper presents an enhanced belief propagation (BP) decoding algorithm and a reinforcement learning-based BP decoding algorithm for polar codes. The enhanced BP algorithm weighs each Processing Element (PE) input based on their signals and Euclidean distances using a heuristic metric. The proposed reinforcement learning-based BP decoding strategy relies on reweighting the messages and consists of two steps: we first weight each PE input based on their signals and Euclidean distances using a heuristic metric, then a Q-learning algorithm (QLBP) is employed to figure out the best correction factor for successful decoding. Simulations show that the proposed enhanced BP and QLBP decoders outperform the successive cancellation (SC) and belief propagation (BP) decoders, and approach the SCL decoders.

Download