Vision-Aided Deep Learning Solutions for Hybrid Beamforming and Beam Tracking

Claudio Alkazzi

Mémoire de maîtrise (2022)

Document en libre accès dans PolyPublie

Affichage préliminaire

Libre accès au plein texte de ce document
Conditions d'utilisation: Tous droits réservés
Télécharger (3MB)

Afficher le résumé

Cacher le résumé

Résumé

RÉSUMÉ Dans les systèmes MIMO (Multiple Input Multiple Output) massifs, les méthodes tradi-tionnelles d’estimation du canal deviennent très complexes, notablement dans les architec-tures hybrides. Pour améliorer la performance de HBF (Hybrid Beamforming), des solutions utilisant l’intelligence artificiel ont été introduits. Cependant, ces solutions nécessitent la connaissance du CSI (Channel State Information) et entraînent les modèles d’une manière supervisée. Ainsi, les solutions optimales doivent être calculées préalablement. Dans ce mémoire, deux solutions utilisant des données visuelles pour réduire la fréquence nécessaire d’estimation du canal sont introduites : HBF basé sur le LiDAR (Light Detection and Rang-ing) avec entraînement non supervisé, et un suiveur des faisceaux basé sur des séquences d’images. Concernant la première solution, un LiDAR peut détecter des usagers et des obstacles en 3D dans la zone de couverture de la station. Il est donc utilisé comme entrée pour notre réseau de neurones. En adoptant l’apprentissage profond, un modèle non supervisé peut être créer afin d’effectuer la formation des faisceaux. Ainsi, la transmission du CSI n’est plus nécessaire durant le fonctionnement en ligne. Une fonction de perte pour maximiser le débit total est définie. Pour la deuxième solution, l’objectif est de suivre les usagers quand ils se déplacent dans la zone de couverture. Un modèle LSTM (Long Short-Term Memory) supervisé est alors utilisée, prenant une séquence d’images et prévoyant le futur faisceau idéal. Le but est de réduire la fréquence nécessaire d’estimation du canal. De plus, il est nécessaire de réduire cette fréquence en utilisant des sources de l’information alternatifs au lieu du CSI pour ne pas augmenter l’utilisation des ressources fréquentielles. Deux modèles sont utilisés, dépen-damment du type d’entrée considéré : ConvLSTM pour les images, et SimpleLSTM pour les faisceaux idéaux antérieures. Il est vérifié que l’information visuelle est utile pour suivre les usagers et prévoir les futurs faisceaux.

Abstract

ABSTRACT Traditional channel sensing techniques for hybrid beamforming in wireless communications become complex when applied to massive Multiple Input Multiple Output (MIMO) systems due to the increased number of transmitting and receiving antennas. Several deep learning solutions have been proposed to solve this issue. These novel approaches, leveraging artifi-cial intelligence, have the potential to replace the traditional techniques due to the intrinsic statistical nature of wireless channels. However, many of the proposed techniques take the Channel State Information (CSI) as input, and many others use supervised learning to train their models, requiring that optimal solutions that are dependent on the CSI be calculated. Therefore, these solutions do not manage to reduce the overhead introduced with the trans-mission of the CSI. As a result, we present in this paper two vision-aided solutions to reduce the required frequency of transmission of the CSI: Light Detection and Ranging (LiDAR) enabled unsupervised learning for hybrid beamforming, and time-series beam tracking using images. LiDAR can detect users and obstacles in the coverage zone of a base station, making it a noteworthy candidate for an input to a deep learning model. We use this fact as the basis for the first part of our work: the implementation of an unsupervised model for hybrid beamforming. The use of unsupervised learning eliminates the need for channel estimation in the online mode, which vastly reduces the communication overhead. Therefore, we implement a LiDAR-based unsupervised deep learning model for hybrid beamforming, using a custom loss function to maximize the sum rate. For the second solution, we aim to track users as they move in the environment. We use a supervised Long Short-Term Memory (LSTM) model to take a sequence of images as input and output the best future beam. The goal is to reduce the frequency of performing channel estimation by predicting the future beams based on the motion of the user instead of the CSI. We use the ConvLSTM model to allow the processing of images in a neural network, and we build a SimpleLSTM model that takes in the previous beams as input instead of images. The goal is to verify that visual data can assist in beam tracking applications and to provide a simple yet efficient model to perform this task.

Département:	Département de génie électrique
Programme:	Génie électrique
Directeurs ou directrices:	Jean-François Frigon
URL de PolyPublie:	https://publications.polymtl.ca/10697/
Université/École:	Polytechnique Montréal
Date du dépôt:	24 mars 2023 11:34
Dernière modification:	08 avr. 2024 10:15

Citer en APA 7:	Alkazzi, C. (2022). Vision-Aided Deep Learning Solutions for Hybrid Beamforming and Beam Tracking [Mémoire de maîtrise, Polytechnique Montréal]. PolyPublie. https://publications.polymtl.ca/10697/

Statistiques

Total des téléchargements à partir de PolyPublie

Téléchargements par année

Provenance des téléchargements

Actions réservées au personnel

Afficher document