Full article: A review on deep learning aided pilot decontamination in massive MIMO

Formulae display: $MathJax Logo$ ?Mathematical formulae have been encoded as MathML and are displayed in this HTML version using MathJax in order to improve their display. Uncheck the box to turn MathJax off. This feature requires Javascript. Click on a formula to zoom.

Abstract

In multi-antenna systems, advanced techniques such as massive multiple-input multiple-output (MIMO), beamforming, and beam selection depend heavily on the accurate acquisition of the channel state. However, pilot contamination (PC) can be a major source of interference which degrades they are performance. Moreover, the severity of PC increases as more pilots are reused between users in the wireless systems. Researchers have shown that PC can be mitigated by using deep learning (DL) approaches. Nevertheless, when minimizing PC, the examination that identifies the applications and factors that distinguish these DL approaches is still limited. This paper reviews these DL approaches and the improvements needed to enhance their performance. Simulation results confirm that DL networks that learn to predict the channels directly have superior performance under PC.

Keywords:

REVIEWING EDITOR:

Qingsong Ai, Wuhan University of Technology, Wuhan, China

Subjects:

1. Introduction

Massive MIMO is the technology that equips base stations (BS) with several antennas much larger than the number of active users (Ferrante et al., Citation2016). To boost the network capacity, it increases Spectral Efficiency (SE) by serving multiple single-antenna terminals over the same time-frequency resource (Boccardi et al., Citation2014). However, to make the users’ transmission orthogonal, the BS must assign orthogonal pilot sequences to users (Marzetta, Citation2010). Pilot sequences are utilized by BSs during the channel estimation phase which is followed by uplink data detection and downlink precoding respectively. Due to the lack of enough orthogonal pilots, the reuse of pilots in multicell systems is inevitable which causes PC. PC causes the BS to receive the pilot from an intended user terminal (UT) and other UTs who are reusing the same pilot. Thus, PC is one of the major sources of errors during CE and signal detection (Bjornson et al., Citation2016).

PC can be minimized via allocating pilots based on the angle of arrivals (AoAs) (Almamori & Mohan, Citation2017), pre-multiplying each UT pilot signal with a corresponding BS code (Alwakeel & Mehana, Citation2017), or prediction of covariance matrices (Almamori & Mohan, Citation2018; Cheng et al., Citation2016). Temporal correlation between channel vectors must be high to predict covariance matrices successfully. Another way of reducing PC is to carry out joint pilot allocation (Neumann et al., Citation2018). Approaches in (Almamori & Mohan, Citation2017; Citation2018; Cheng et al., Citation2016) and (Neumann et al., Citation2018) use pilots only for decreasing PC. In addition, a comparative examination of different pilot assignment schemes for PC mitigation is conducted in (Misso et al., Citation2020). There are also semi-blind (SB) and blind CE methods that can minimize the impact of PC. SB CE methods employ pilots and data for this. Blind CE methods make use of data only (Nayebi & Rao, Citation2018). SB proposals reduced PC by performing CE using the steepest descent (Alnajjar & Abdallah, Citation2016), robust independent component analysis (ICA) (Fatema et al., Citation2017), and the spatial repetition scheme (Alwakeel & Mehana, Citation2017). However, the complexity of the steepest descent was higher than that of the pilot-based method, robust ICA suffered from error accumulation and careful design for the block structure was vital for these proposals respectively (Alnajjar & Abdallah, Citation2016; Alwakeel & Mehana, Citation2017; Fatema et al., Citation2017).

Under blind CE methods (Chen et al., Citation2016), performed alternating projection and singular value decomposition (SVD) on the sample covariance matrix for estimating channels of all UTs. However, the BS consumed additional power for estimating out-of-cell channels. In (Mezghani & Swindlehurst, Citation2017), channel sparsity in the angular frequency domain allowed CE of all UTs by gradient descent. Nevertheless, the procedure was undermined ambiguities in terms of phase, time shifts, and user assignments (Mezghani & Swindlehurst, Citation2017). PC can also be decreased by blind signal detection (BSD) methods. BSD methods avoid PC effects by detecting the uplink data first followed by data-aided CE. This differentiates them from blind CE approaches which lower PC by performing CE first followed by data detection.

A BSD scheme that projected the received signal onto the signal subspace using SVD and ICA was proposed for data detection and least square (LS) based CE in (Amiri et al., Citation2017). PC decayed as the number of receiving antennas and the data length increased. However, it happened when the power of the UT of interest exceeded the power of the strongest interfering UT (Amiri et al., Citation2017). A group-blind receiver that doesn’t require BS cooperation and performs better under PC was developed by (Ferrante et al., Citation2017). It required second-order statistics which were estimated during black subframes. Blank subframes are a subset of symbols without transmissions from UTs of interest. Higher throughput was possible when SVD and BIG-AMP were deployed for BSD on the sparse channel. However, for all SNR values, the achievable rate was inferior to the case with perfect channel knowledge (Zhang et al., Citation2018). A detailed review of other state-of-the-art pilot decontamination techniques can be found in (Elijah et al., Citation2016). Performance of various uplink (UL) sounding references (SRSs) allocation strategies in practical 5 G deployments, to relieve neighboring BSs from PC is presented in (Giordano, Citation2018).

Previous works have provided various ways of reducing PC in massive MIMO. However, the review of approaches based on DL is still limited. This paper aims to provide a detailed review of the DL methods that have considered PC mitigation. Emphasis is on how DL is applied to tackle PC. Inputs and outputs processed by DL networks when lessening PCs are discussed. Moreover, unsupervised and supervised learning for PC alleviation with DL is covered. The contributions of this article are as follows.

The work demonstrates the use of 5G sounding reference signals (SRS) parameters for conducting channel estimation via pilots organized in a two-dimensional resource grid (Mathworks Inc, Citation2022, Citation2023). It establishes that the synchronization of pilot signals does not affect the quality of channel estimation under PC. In addition, it states that the desired user channel can be retrieved effectively by cascading two deniers in spatial-frequency and angle-delay domains (Hirose et al., Citation2021; Jiang et al., Citation2021).
The understanding of the pilot design networks which utilize two-layer neural networks (TNNs) or DNNs to minimize PC by searching for optimal non-orthogonal pilot signals is extended. These networks learn pilots as weights or labels however it is proposed to treat them as pilot allocation schemes because they provide another way of assigning pilots optimally (Chun et al., Citation2019; Lim et al., Citation2021).
Computer simulations indicate that denoising approaches have the best channel estimation performance under PC. However, the DNN-based denoiser has a superior performance compared to the CNN-based denoiser (Hirose et al., Citation2021). This was because the channel gains at other symbol locations in the two-dimensional orthogonal frequency modulation (OFDM) grid were replicas of those at the first symbol. Therefore, the CNN-based denoiser did not benefit from learning the correlations of channel gains between OFDM symbols. Nevertheless, due to the abilities of CNNs to extract meaningful features from images denoisers based on CNNs can estimate the channel more accurately under PC effects well than the DNNs if pilots are transmitted by multiple OFDM carriers and symbols (Jiang et al., Citation2021; Lim et al., Citation2021).
It is shown that deep learning frameworks that perform power allocation in the uplink of massive MIMO have quite worse performance. The main reasons for this are shadowing effects and similar propagation conditions experienced by the pilot signals transmitted from different users because they force the algorithm to allocate comparable power levels to users which increases the severity of the PC. However, if distinct propagation conditions exist in the system power allocation can be a viable alternative (D’Andrea et al., Citation2019).
Although the labels for training the pilot and power allocation scheme were derived from solutions that maximize the signal-to-interference ratio (SINR) the performance of pilot allocation was way better than that of power allocation. This was caused by the fact that the considered 5G massive MIMO cellular system did not provide a substantial difference in propagation losses of users which allowed most of them to transmit pilots with the same powers. Without learning the channel of the user directly pilot allocation schemes showed a performance that is close to that of the CNN-based denoiser. Furthermore, pilot decontamination methods that use DNNs are faster in terms of training speed (D’Andrea et al., Citation2019; Kim et al., Citation2018).

The remainder of the paper is organized as follows. Section 2 establishes the system model which serves as the benchmark for discussing the DL approaches used to confront PC. Section 2 reviews different DL-based PC reduction techniques. Section 3 concludes the paper.

2. Pilot decontamination with deep learning

PC is an undesirable effect caused by inter-cell interference (ICI). ICI occurs when the UTs from unlike cells send the same pilot sequence to the corresponding BS on the uplink. ICI causes BSs to rely on an interference-limited received signal in the uplink which unavoidably infects the resulting channel estimate (Zhao et al., Citation2018). Due to PC, the BS in the $jth$ cell receives a signal which is the superposition of the pilot signals from the UTs in all the cells, and it is expressed as in EquationEquation (1)(1) $Y_{j} = \sum_{k = 1}^{K} h_{jk} S_{jk}^{H} + \sum_{l \neq j}^{L} \sum_{k = 1}^{K} h_{lk} S_{lk}^{H} + N_{j}$ (1) . The first term represents the signals of users in the $jth$ cell. The second term provides signals transmitted from other cells. The third term is the noise. (1) $Y_{j} = \sum_{k = 1}^{K} h_{jk} S_{jk}^{H} + \sum_{l \neq j}^{L} \sum_{k = 1}^{K} h_{lk} S_{lk}^{H} + N_{j}$ (1)

Transmitted pilots from jth and lth cells are given as $S_{jk} = \sqrt{p_{jk}} x_{jk}^{}$ and $S_{lk} = \sqrt{p_{lk}} x_{lk}$ respectively. Jth is a desired cell whilst all lth cells are causing inter-cell interference from their same pilot’s transmissions. $p_{jk}$ and $p_{lk}$ are desired and interfering UT pilot transmit powers respectively. $x_{jk}$ and $x_{lk}$ are actual pilots assigned to the UT in the jth and lth cells. If $τ$ is the pilot length, it leads to $Y_{j} \in ∁^{M \times τ}$ as the received signal. $N_{j} \in ∁^{M \times τ}$ is the additive white Gaussian noise (AWGN) with zero mean and element-wise variance of $σ_{n}^{2} .$ $M$ is the number of receiving antennas. To perform channel estimation the jth BS uses least square (LS) estimation expressed by EquationEquation (2)(2) ${\tilde{h}}_{jk}^{LS} = \frac{Y_{j} S_{jk}^{*}}{\sqrt{p_{jk}}}$ (2) (Björnson et al., Citation2017). LS projects the received signal $Y_{j}$ on the transmitted pilot sequence $S_{jk}$ leading to kth UT channel estimate ${\tilde{h}}_{jk}^{LS}$ given as in EquationEquation (3)(3) ${\tilde{h}}_{jk}^{LS} = h_{jk} + \sum_{l \neq j}^{L} \sqrt{\frac{p_{lk}}{p_{jk}}} h_{lk} +_{}^{} \frac{1}{τ} \frac{N_{j} S_{jk}}{\sqrt{p_{jk}}}$ (3) . Also, LS channel estimation can be regarded as the correlation between the received signal $Y_{j},$ and the pilot of the kth user. (2) ${\tilde{h}}_{jk}^{LS} = \frac{Y_{j} S_{jk}^{*}}{\sqrt{p_{jk}}}$ (2) (3) ${\tilde{h}}_{jk}^{LS} = h_{jk} + \sum_{l \neq j}^{L} \sqrt{\frac{p_{lk}}{p_{jk}}} h_{lk} +_{}^{} \frac{1}{τ} \frac{N_{j} S_{jk}}{\sqrt{p_{jk}}}$ (3)

In Equationequation (3)(3) ${\tilde{h}}_{jk}^{LS} = h_{jk} + \sum_{l \neq j}^{L} \sqrt{\frac{p_{lk}}{p_{jk}}} h_{lk} +_{}^{} \frac{1}{τ} \frac{N_{j} S_{jk}}{\sqrt{p_{jk}}}$ (3) , the first, second, and third terms represent the channel of UT of interest, PC from out-of-cell UTs, and noise respectively. $h_{jk}$ and $h_{lk}$ are channels for the desired and interfering UT respectively. The remaining part for the lth cells represents cases with $S_{lk} = S_{jk}$ for UTs in lth cells with the same pilot as a UT in the jth cell. Further, with a simple path loss model, the kth channel for each UT in the lth cell in can be modeled as in EquationEquation (4)(4) $h_{lk} = {\sqrt{β_{lk}^{}} a}_{lk}, l = 1, \dots \dots j \dots \dots ., L,$ (4) . (4) $h_{lk} = {\sqrt{β_{lk}^{}} a}_{lk}, l = 1, \dots \dots j \dots \dots ., L,$ (4) $β_{lk}^{} = σ_{lk}^{} d_{lk}^{- α}$ is the large-scale fading coefficient that includes the path loss and the shadowing effect (Neumann et al., Citation2014). $a_{lk}^{}$ comprises the fast fading variables which are i.i.d. complex Gaussian random variables with mean 0 and variance 1., $σ_{lk}^{}$ is the log-normal shadowing and $d_{lk}^{- α}$ is the attenuation due to kth UT distance from the lth BS, and path loss exponent (Neumann et al., Citation2014).

If ${\tilde{h}}_{jk}^{LS}$ is employed for detecting uplink data and precoding of the downlink data. The precoding matrix used by the jth BS in a jth cell will be ruined (Jose et al., Citation2011). Because, according to EquationEquation (3)(3) ${\tilde{h}}_{jk}^{LS} = h_{jk} + \sum_{l \neq j}^{L} \sqrt{\frac{p_{lk}}{p_{jk}}} h_{lk} +_{}^{} \frac{1}{τ} \frac{N_{j} S_{jk}}{\sqrt{p_{jk}}}$ (3) , ${\tilde{h}}_{jk}^{LS}$ contains PC caused by the sum of the channels from other UTs using the same pilot signal. Also, signal detection performance will be undermined especially for signal detectors such as zero-forcing (ZF) and minimum mean square error (MMSE). Because ZF and MMSE assume perfect channel estimation (Biglieri et al., Citation2007). Moreover, both the uplink and downlink sum rates will be limited by the ICI imposed by pilot reuse (Zhao et al., Citation2018).

As shown by EquationEquation (3)(3) ${\tilde{h}}_{jk}^{LS} = h_{jk} + \sum_{l \neq j}^{L} \sqrt{\frac{p_{lk}}{p_{jk}}} h_{lk} +_{}^{} \frac{1}{τ} \frac{N_{j} S_{jk}}{\sqrt{p_{jk}}}$ (3) , PC can be controlled by: power allocation through $p_{lk}$ and $p_{jk};$ assignment of actual pilots $x_{jk}$ and $x_{lk};$ Denoising ${\tilde{h}}_{jk}^{LS}$ to get the desired channel $h_{jk}$ and designing the pilots $x_{jk}$ and $x_{lk}$ such that multicell interference is minimized. Concentrating on the kth UT in the jth cell. DL-based proposals that deploy deep neural networks (DNNs), convolutional neural networks (CNNs), recurrent neural networks (RNNs), and reinforcement learning networks (RLN) for remedying PC are reviewed (Goodfellow et al., Citation2016). All UTs from other lth cells that transmit the same pilot signal are considered to impair the reception of the target pilot signal.

2.1. Power allocation

Power allocation can either increase or decrease the transmission power of UT for interference management. Optimizing the transmission powers in the uplink is more difficult than in the downlink because the uplink powers affect the uplink data transmission and the quality of the channel estimates. In addition, uplink power control indirectly affects the precoding vectors (Björnson et al., Citation2017). Power allocation deep networks are those designed to learn the mapping between $β_{lk}$ and $p_{lk} .$ That is, the power of UTs with the same pilot sequence can be controlled based on large-scale coefficients. According to EquationEquation (3)(3) ${\tilde{h}}_{jk}^{LS} = h_{jk} + \sum_{l \neq j}^{L} \sqrt{\frac{p_{lk}}{p_{jk}}} h_{lk} +_{}^{} \frac{1}{τ} \frac{N_{j} S_{jk}}{\sqrt{p_{jk}}}$ (3) and EquationEquation (4)(4) $h_{lk} = {\sqrt{β_{lk}^{}} a}_{lk}, l = 1, \dots \dots j \dots \dots ., L,$ (4) , if $β_{lk}$ is large decrease $p_{lk}$ to decrease $\sum_{l \neq j}^{L} \sqrt{\frac{p_{lk}}{p_{jk}}} h_{lk}$ which contaminates the LS estimation ${\tilde{h}}_{jk}^{LS} .$ Note that, $h_{lk} = {\sqrt{β_{lk}^{}} a}_{lk},$ so $β_{lk}$ and $p_{lk}$ can work in conjunction oppositely to control the level of PC interference. However, decreasing the power of other UTs for the sake of PC-affected UT will indirectly increase PC on them. Thus, power allocation must be conducted jointly based on a criterion such as uplink spectral efficiency (SE). Maximizing SE will be equivalent to PC minimization. If the SE expression is a function of the large-scale fading coefficient and power for each UT (Sanguinetti et al., Citation2018). Many criteria can be formulated. Nevertheless, these are the keys to designing supervised or unsupervised DL schemes.

An unsupervised learning scheme was proposed for power allocation among UTs based on the minimum sum of mean square error (MSE) of channel estimation as a criterion. Here, a multi-layer fully connected DNN was designed to optimize the power allocated to each sample in a pilot sequence. The sum MSE of channel estimation was derived based on the MMSE channel estimator (Xu et al., Citation2019). Subject to the total available power of each kth UT, the DNN was un-supervised and trained with $β_{lk}^{}$ and $p_{lk}^{}$ as inputs and outputs. Sum MSE which had $β_{lk}^{}$ and $p_{lk}^{}$ as parameters was the cost function. To facilitate power allocation for each cell under PC by the DNN. The input was formed as follows. Large-scale coefficients of users were column-aligned. Each column is a vector created by a large-scale coefficient of every receiving antenna. The output was also grabbed as follows. The powers of users were also column-aligned. Finally, each column consisted of a vector representing the power of every sample in a pilot sequence (Xu et al., Citation2019).

A supervised learning approach was developed that learns a mapping between either the users’ positions or the large-scale coefficients and the power allocation policy. A training sample was generated by either maximizing the system sum rate or maximizing the minimum of the users’ rate (D’Andrea et al., Citation2019). For the first case and second case, the inputs were (x, y) positions and the outputs were power allocated to each user. For the third case, the inputs were large-scale coefficients for all receiving antennas and the outputs were power allocated to each user. In all these cases the solution of two objectives was found by optimization methods found in (Alonzo et al., Citation2019; Buzzi & Zappone, Citation2017). The two objectives’ expressions had power and large coefficient variables. So, solving or learning them provided a DNN solution for PC mitigation (D’Andrea et al., Citation2019). Note that, as per EquationEquation (4)(4) $h_{lk} = {\sqrt{β_{lk}^{}} a}_{lk}, l = 1, \dots \dots j \dots \dots ., L,$ (4) (x, y) positions for UTs can be used in place of large-scale coefficients.

DL-based power allocation schemes showed the best sum Minimum Square Error (MSE) performance with low complexity compared to other methods (Xu et al., Citation2019). In addition, the presence of a PC does not significantly influence the learning capabilities of the DNN. Instead, shadowing effects are the source of worse performance (D’Andrea et al., Citation2019). Finally, joint optimization of channel estimation (CE) and bit error (BER) rate ascended as a must to overcome the discrepancy between MSE and BER results (D’Andrea et al., Citation2019).

2.2. Pilot assignment

The pilot assignment task is the problem of optimally allocating pilots to UTs to minimize PC. The target is to allocate the pilot $x_{lk}^{}$ for each kth UT in every lth cell to minimize PC interference on every LS channel estimate ${\tilde{h}}_{lk}^{LS} .$ Whether supervised or unsupervised learning method is adopted. The pilot assignment task is carried out based on optimizing a criterion similar to power allocation. A supervised learning approach that generated training data by joint maximization of the uplink net capacity was proposed. The net capacity encompassed the entire massive MIMO system with all cells. BSs were assumed to know UT information for all UTs via Coordinated Multipoint (CoMP) transmission and reception (Kim et al., Citation2018). The inputs were (x, y) locations of all UTs from all cells. The labels were groups of matrices. In each matrix one row vector represented the pilot assignment for UTs in that cell. However, using the SoftMax output layer converted each label into stacked vectors. In each vector, the largest value was selected when the uplink net capacity reached maximum. Generation of the training set proceeded as follows. UTs were randomly dropped in a cell. Location information was sent to the BS. Finally, each UT was assigned a pilot to maximize the uplink net capacity to generate a training sample (Kim et al., Citation2018). The approach in (Kim et al., Citation2018), was constructed based on DNN by exploiting dropout to prevent overfitting. The DNN gave pilot assignments as outputs for all users in all cells with 99.38% normalized capacity gain and it only required 0.92 ms for computations.

Deep reinforcement learning (DRL) has also been employed for dealing with PC based on a cost function defined by AoAs of UTs. The cost function depicting the reward included PC interference emanating from UTs with AoA within that of the target UT. These were UTs that had the same pilot sequence as the target user in the massive MIMO system. Appropriate sets of states, sets of actions, and reward functions allowed the agent to learn the policy to minimize the cost function while allocating pilots. The deep residual network (ResNet) implemented the Q-neural network (QNN) which realized the aforementioned DRL. However, it was assumed that, the target user has an AoA within the pre-known AoA range and location (Omid et al., Citation2021).

The STATE SET was defined by the binary pilot assignment pattern for all UTs, pilot index and the cell index of the UTs with the maximum cost function in each cell, pilot index of the UT that action takes place on, cell index of the UT that action takes place on and the value of the maximum cost function in each cell. The ACTION SET selected a random UT in the adjacent cell and assigned the pilot of the target UT to that user. If the UT in the adjacent cell has the same pilot no action is pursued. Two thresholds were considered for defining the REWARD SET. As time progressed, the DRL algorithm was able to track the channel while learning the pilot assignment policy (Omid et al., Citation2021).

2.3. Denoisers

Denoisers are implemented to consume the PC-corrupted channel ${\tilde{h}}_{jk}^{LS}$ and produce the PC-free channel response $h_{jk}$ considering the kth UT in the jth cell. By mapping ${\tilde{h}}_{jk}^{LS} \to h_{jk},$ PC effects are minimized. However, the mapper must be designed to deal with PC interference from other cells $\sum_{l \neq j}^{L} \sqrt{\frac{p_{lk}}{p_{jk}}} h_{lk}$ and noise $\frac{1}{τ} \frac{N_{j} S_{jk}}{\sqrt{p_{jk}}}$ which is still AWGN. Alternatively, ${\tilde{h}}_{jk}^{LS}$ can be replaced by $Y_{j}$ because they both contain PC interference. Cascading the DNN as a denoiser and LS estimation block allowed the retrieval of the desired channel from the PC-impaired channel in a study by (Balevi et al., Citation2020). Assuming the kth UT in the jth cell for the DNN, the input was $Y_{j}$ and the output was $Y_{jk} {= \sqrt{p_{jk}} h}_{jk} S_{jk}^{H} .$ So, training the DNN denoiser with $Y_{j}$ and $Y_{jk}$ as inputs and labels was equivalent to removing multi-cell interference caused by PC. The DNN parameters were fit online on the fly. Deploying the untrained DNN overcomes training overheads that hinder the implementation of DNNs for high dimensional channel estimation (Balevi et al., Citation2020). Finally, the LS estimation block projected the denoised signal $Y_{jk}$ on the transmitted pilot sequence $S_{jk}$ leading to kth UT channel estimate $h_{jk}$ (Balevi et al., Citation2020). This was possible because the deep image prior does not require training empowering it to evade the need for the training dataset (Ulyanov et al., Citation2018).

The low-complexity DNN-based denoiser proposed by (Balevi et al., Citation2020) outperformed the more complex MMSE estimator. It even came close to the Genie-Aided MMSE with a perfectly known channel statistic. Deep residual learning (DRL) that consisted of the neural network with noise estimator and denoiser was adopted to suppress PC. Considering the kth UT in a jth cell the adopted DRL structure was as in . $h_{jk} = {\tilde{h}}_{jk}^{LS} - Z_{jk}$ and $h_{jk} = α_{jk} \circ {\tilde{h}}_{jk}^{LS} + γ_{jk} .$ $Z_{jk}$ computed the distortion noise due to PC interference from other cells and the AWGN (Lim et al., Citation2021). The deep residual network was trained offline for channel estimation in the presence of a PC. The input was the LS estimate ${\tilde{h}}_{jk}^{LS}$ and large-scale coefficient $β_{lk} .$ The output label was the channel estimate $h_{jk} .$ Together these provided the training data set for kth UT in the jth cell. $α_{jk}$ and $γ_{jk}$ were also learnable parameters. The advantage of this method was the ability of each BS to independently perform channel estimation based on its data (Lim et al., Citation2021).

Figure 1. DRL based channel denoiser (Lim et al., Citation2021).

As in EquationEquation (3)(3) ${\tilde{h}}_{jk}^{LS} = h_{jk} + \sum_{l \neq j}^{L} \sqrt{\frac{p_{lk}}{p_{jk}}} h_{lk} +_{}^{} \frac{1}{τ} \frac{N_{j} S_{jk}}{\sqrt{p_{jk}}}$ (3) , the LS estimated channel comprises the sum of the channels for all UTs using the same pilot signal. This motivated strategy is expressed as $h_{jk} = f ({\tilde{h}}_{jk}^{LS}),$ where $f$ denoted a mapping from the LS estimated channel to the desired channel (Hirose et al., Citation2021). The strategy employed two DL structures. First, the DNN was considered. However, the DNN could not exploit spatial local correlation. Therefore, CNN was brought on board because it eradicates the problem using sliding convolutional filters. The CNN performance was better compared to the DNN. Because CNN filters had the power to learn the information related to the channel covariance matrix such as AoAs and magnitude of signals. LS estimated channel was a perfect candidate for the input as it contains information about AoAs and the magnitude of signals (Hirose et al., Citation2021).

Three cases were considered: perfect timing, imperfect timing, and channel aging. Investigation under perfect and imperfect timing synchronization was motivated by the difficulties of achieving timing synchronization throughout the multi-cellular network (Pitarokoilis et al., Citation2017). Channel aging was for channel changes over time because of UT movements. However, the strategy assumed that the large-scale coefficient of the target UT is often larger than the UTs of the other cells (Hirose et al., Citation2021).

Two CNN denoisers in spatial-frequency (SF) and angle-delay (AD) were cascaded to retrieve the desired channel $h_{jk}$ from the LS estimated channel ${\tilde{h}}_{jk}^{LS}$ under the massive MIMO-OFDM system. Discrete Fourier Transform (DFT) transformed the output of the SF-CNN into the AD domain which served as the input to the AD-CNN. In the end, the output of the AD-CNN was transformed back by Inverse DFT (Jiang et al., Citation2021). The denoiser was called a dual CNN. It enabled the CNNs to ease interference in the SF domain. Simultaneously, it handled most of the white noise due to channel sparsity found in the AD domain. Moreover, correlation in the SF domain decreased the noise power in the AD domain so that the CNN is less confused when differentiating the channel and noise.

The dual CNN had better performance and robustness than estimation in a single domain under PC. However, the supremacy of CNN in the SF domain was key because PC interference spreads over the SF domain well. Therefore, CNN in the SF domain could cover these corrupted areas due to the correlation of adjacent areas (Jiang et al., Citation2021). This design was an attempt to combine CNN and expert knowledge in wireless communications. It is exemplified by the introduction of the DFT and IDFT to make CNN learn from different domains.

2.4. Pilot design

Pilot design approaches attempt to minimize PC by searching for optimal non-orthogonal pilot sequences. Pilot designers aim to jointly optimize the needed pilots ${x_{jk}}_{k = 1}^{K}$ of the K users and the channel estimator of the jth BS to minimize the discrepancy of the estimated channel ${\tilde{h}}_{jk}$ from the desired channel $h_{jk} .$ Two techniques are available for designing pilots. Pilots can be collected at the output of the DNN (Lim et al., Citation2021) or pilots can be learned as weights of the Two-layer Neural Network (TNN) (Chun et al., Citation2019). For the TNN approach, separate TNNs each learning the mapping $h_{jk} \to Y_{jk}$ for each kth UT in a jth cell are trained jointly and each weight stands for the pilot sample from the pilot sequence $x_{jk} .$ The design of non-orthogonal pilots can proceed by jointly optimizing TNNs for all UTs in all cells to minimize PC. Zero bias vectors and unit activation functions must be used in TNNS for the weights to be direct representatives of pilot samples (Chun et al., Citation2019).

To accurately model the computation $h_{jk} x_{jk} \to Y_{jk}$ with a TNN. Weights that cannot be mapped to pilot samples must be set to zero because all weights do not contribute to the received pilot signal in a TNN. It is known that optimal pilot sequences are influenced by changes in the fading characteristics of large-scale coefficients of UTs. Therefore, the second technique uses a DNN to allow cells to generate pilot sequences using the large-scale coefficient $β_{lk}^{}$ of UTs. By using a DNN, large-scale coefficients are fed as inputs to get optimal pilots for UTs at the output. If the orthogonality of pilot sequences for UTs with close large-scale coefficients is maintained by the DNN. PC can be reduced at the stage of channel estimation (Lim et al., Citation2021). However, usefulness would require knowing the large-scale coefficients for all UTs in all cells at the BSs.

2.5. Analysis

First, the use of assumed channel models may compromise the performance of DL networks on the real massive MIMO system. The reason is that the assumed channel may inaccurately reflect the actual channel. In addition, assumptions about a channel bias the learned weights, widening the discrepancy between the assumed and the actual channel (Ye et al., Citation2018). Online training can rescue the situation by fine-tuning the networks at the BS after offline training. However, networks can be trained from pure observations alone without any model of the massive MIMO channel via policy learning techniques to evade the drawback (Aoudia & Hoydis, Citation2018). Policy learning can be implemented by deep reinforcement networks (DRN) to enable the participation of the transmitter and receiver when decreasing PC. Using DRNs is also known as end-to-end learning (O’Shea & Hoydis, Citation2017).

Second, PC can be removed if the power received from UT within the cell of interest is larger than the power received from UTs in other cells. However, this is subject to irregularities caused by multicell systems and propagation environments. For instance, the target UT inside the building may require more power which can reach maximum without avoiding PC. If the interfering UT has a line of sight to the target BS (Müller et al., Citation2014). Power control may not be feasible in dense small-cell networks and for a fraction of UTs that experience similar channel conditions to more than a single BS (Ferrante et al., Citation2017; Müller et al., Citation2014). Moreover, its more complicated for hyperdense small‐cells installed in homes and are user controlled (Rodriguez, Citation2015). Power attenuation on interfering UTs is created for free by shadowing and path loss for the UT of interest. Power attenuation on these UTs reduces PC. However, shadowing and path loss might not help for interfering UTs found at the cell edges. Semi-blind and blind techniques have been developed to counteract this effect. Remedy of PC via DL designs centered on these techniques can be handy (Chen et al., Citation2016; Fatema et al., Citation2017).

Third, variability found in the data dimensions is a challenge for realizability. As an example, for five UTs, the DNN input dimension varied from 10 to 150 when the input labels changed from (x, y) positions to large-scale coefficients for 30 BS antennas (D’Andrea et al., Citation2019). Additionally, the performance of the dual CNN was evaluated for 32, 48, and 64 BS antennas. Implying that, the input dimension was changing in the spatial domain (Jiang et al., Citation2021). Variables such as these have an impact on the size of data processed by DL networks. This can be solved by designing adaptive schemes. For instance, instead of training different networks for different input sizes the same network can be trained to process data of different sizes (Orhan & Bastanlar, Citation2018; Sekou et al., Citation2019).

3. Performance evaluation

In simulation experiments, the 5 G massive MIMO system operating at a carrier frequency of $f_{c} = 4$ GHz is used to compare the deep learning based pilot decontamination techniques. The pilots are derived from uplink-sounding reference signals (SRSs) for uplink channel estimation. The SRS parameters and the transmission process are shown in . As shown in , a frame is made up of 10 subframes and each subframe has 10 slots. Within one slot 14 OFDM symbols arranged in a two-dimensional resource grid of size $N_{sc} \times N_{symbols}^{Slot}$ are communicated from a user to the BS. $N_{sc}$ and $N_{symbols}^{Slot}$ represent the number of subcarriers and symbols per slot in frequency and time dimensions respectively. Every resource grid has $N_{sc} N_{symbols}^{Slot}$ resource elements. shows 4 resource elements whereby 2 are designated for carrying the SRS samples. However, for the evaluations, a 5 G massive MIMO system with $N_{sc} = 52 \times 12 = 624$ carriers and $N_{symbols}^{Slot} = 14$ symbols giving $624 \times 14 = 8736$ resource elements has been considered (Mathworks Inc, Citation2022). In the experiments, slots and resource elements containing SRS pilots only are considered during channel estimation. An example of the SRS resource grids achieved by the parameters in is shown in . It can be seen that out of 14 OFDM symbols in the slot the SRS pilots can be transmitted using one or multiple symbols.

Figure 2. Sounding reference signal parameters and transmission process.

Figure 3. Sounding reference signal resource grid generated by parameters (symbol start, number of SRS symbols): (a) (0, 1) (b) (0, 4) (c) (7, 1) (d) (7, 4).

The impact of PC is measured by considering users dropped randomly in 7 hexagonal cells located in the urban microenvironment (UMi) (ETSI, Citation2017). A drop is an instant where each user sends the SRS signal in the allocated slot to its BS. Each user terminal has one transmitting antenna. The BSses are equipped with 128 receiving antennas. To realize the received SRS signal, an SRS signal is filtered by a tapped delay line (TDL) fading channel. Then, the AWGN is added at various signal-to-noise ratios (SNRs) (3GPP, Citation2020). Given a received SRS OFDM grid, PC happens when non-orthogonal SRSs that occupy the same time and frequency elements in an OFDM resource grid are transmitted from different users (Mathworks Inc, Citation2023). This is because multiplexed SRSses interfere with each other for identical resource elements with non-orthogonal SRS samples. When channel estimation is conducted by correlating the received SRS signal with a pre-known SRS signal PC affects the quality of the estimated channel gains for the user under consideration. It has already been pointed out that PC can be mitigated by power control, pilot (or SRS) allocation, pilot design, and denoising method respectively.

For the pilot allocation method, SRS pilots were generated by using different cyclic shifts (Mathworks Inc, Citation2022). For power control, each user selected an optimum power level for its transmission. Sgnal-to-interference plus noise (SINR) was an optimization criterion for power control and pilot allocation (Kim et al., Citation2018). However, the SINR of the entire 5 G massive MIMO cellular system expressed by EquationEquation (5)(5) ${SINR}_{net} = \sum_{l = 1}^{L} \sum_{k = 1}^{K_{l}} \frac{p_{k} β_{k}^{2} x_{k}^{} x_{k}^{*}}{\sum_{m = 1}^{K_{l}} \sum_{\begin{matrix} n = 1 \\ n \neq k \end{matrix}}^{K_{l}} {\sqrt{p_{m} p_{n}} β}_{m} β_{n} x_{m}^{} x_{n}^{*}}$ (5) was considered. $K_{l}$ is the number of users in the lth cell. $β, p,$ and $x$ give the propagation loss, power, and SRS pilot for each user respectively. (5) ${SINR}_{net} = \sum_{l = 1}^{L} \sum_{k = 1}^{K_{l}} \frac{p_{k} β_{k}^{2} x_{k}^{} x_{k}^{*}}{\sum_{m = 1}^{K_{l}} \sum_{\begin{matrix} n = 1 \\ n \neq k \end{matrix}}^{K_{l}} {\sqrt{p_{m} p_{n}} β}_{m} β_{n} x_{m}^{} x_{n}^{*}}$ (5)

Propagation losses $β_{}$ required for SINR computations were gathered using EquationEquations (6(6) $d_{3 D} = \sqrt{{(x_{UT} - x_{BS})}^{2} + {(y_{UT} - y_{BS})}^{2} + {(h_{UT} - h_{BS})}^{2}}$ (6) ,Equation7)(7) $β_{} = 35.3 {log}_{10} {(d}_{3 D}) + 22.4 + 21.3 {log}_{10} {(f}_{c}) + {0.3 (h_{UT} - 1.5) + σ}_{SF}$ (7) . $d_{3 D}$ is the three-dimensional distance between the user and the BS. $σ_{SF} = 4$ dB is the shadow fading standard deviation for non-line of sight (NLOS) propagation. The antenna height $h_{UT}$ and $h_{BS},$ at the user and BS, are 1.5 m and 35 m respectively. User and BS two-dimensional positions are given as $(x_{UT}, y_{UT})$ and $(x_{BS}, y_{BS})$ respectively. Note that positions are determined by randomly dropping users in the cell while for each BS a cell center determines its position (ETSI, Citation2017). (6) $d_{3 D} = \sqrt{{(x_{UT} - x_{BS})}^{2} + {(y_{UT} - y_{BS})}^{2} + {(h_{UT} - h_{BS})}^{2}}$ (6) (7) $β_{} = 35.3 {log}_{10} {(d}_{3 D}) + 22.4 + 21.3 {log}_{10} {(f}_{c}) + {0.3 (h_{UT} - 1.5) + σ}_{SF}$ (7)

In reality, pilot design by learning non-orthogonal pilots as weights from a TNN (Chun et al., Citation2019) or collecting them at the output of the DNN (Lim et al., Citation2021) is another way of allocating pilots to users. Therefore, pilot design has been regarded as pilot allocation in the evaluations. For power control, the neural network was trained by using the positions of the user and optimal power allocations (D’Andrea et al., Citation2019). A pilot allocation neural network adopted positions and pilot identifiers as inputs and labels during training (Kim et al., Citation2018). Denoising neural networks were structured to learn the actual channel $H$ given the LS channel estimate ${\tilde{H}}_{}^{LS} .$ The semi-blind channel estimator was utilized to compute the actual channel $H$ (Alnajjar & Abdallah, Citation2016). Neural networks were built in MATLAB based on a deep learning toolbox. A computer with GeForce 930MX NVIDIA GPU and intel core i5-7200U [email protected] GHz was used for simulation experiments. Adam optimizer was selected as a learning algorithm. For denoising a DNN that applied fully connected layers for mapping LS estimates, ${\tilde{H}}_{}^{LS}$ to actual channel $H$ (Wang & Xu, Citation2020). A CNN used deep residual learning to perform denoising of LS estimates, ${\tilde{H}}_{}^{LS}$ to give the actual channel $H$ (Lim et al., Citation2021). Average and normalized mean square error (MSE) were exploited as indicators of performance. EquationEquation (8)(8) $Average MSE = \frac{1}{S} \sum_{i = 1}^{S} {| | H - \hat{H} | |}_{}^{2}$ (8) and EquationEquation (9)(9) $NMSE = \frac{1}{S} \sum_{i = 1}^{S} \frac{{| | H - \hat{H} | |}_{}^{2}}{{| | H | |}_{}^{2}}$ (9) provide the average MSE and normalized MSE (NMSE) with $H$ and $\hat{H}$ as the actual and estimated channel. (8) $Average MSE = \frac{1}{S} \sum_{i = 1}^{S} {| | H - \hat{H} | |}_{}^{2}$ (8) (9) $NMSE = \frac{1}{S} \sum_{i = 1}^{S} \frac{{| | H - \hat{H} | |}_{}^{2}}{{| | H | |}_{}^{2}}$ (9)

In EquationEquation (8)(8) $Average MSE = \frac{1}{S} \sum_{i = 1}^{S} {| | H - \hat{H} | |}_{}^{2}$ (8) s stands for all samples while s is the total samples per SNR in EquationEquation (9)(9) $NMSE = \frac{1}{S} \sum_{i = 1}^{S} \frac{{| | H - \hat{H} | |}_{}^{2}}{{| | H | |}_{}^{2}}$ (9) . For power control users sent the same SRS using power levels assigned by a neural network (D’Andrea et al., Citation2019). Under pilot allocation users sent SRSs assigned by a neural network at the same power level. Then, for the user under consideration, channel estimation was conducted by correlating the received SRS signal with the pre-known SRS signal. For denoising users sent the same SRS using equal power level then the DNN and CNN denoisers predicted the actual channel given the LS channel estimate (Lim et al., Citation2021; Wang & Xu, Citation2020). Finally, the channel estimate of each method was compared with the actual channel to get the average MSE and NMSE (Alnajjar & Abdallah, Citation2016).

shows the average mean square error (MSE) on channel estimates for each pilot decontamination method. It can be observed that by benchmarking on the power control method the denoising schemes have the lowest average MSE. Specifically, the DNN denoiser shows superior performance improvement by a factor of 3.5949 compared to the CNN denoiser which has a factor of 2.3886. This was caused by the fact that the channel gains for the remaining symbol were assumed to be the same as the channel gains of the first symbol when training the CNN denoiser. It was done this way because the SRS was transmitted in the first OFDM symbol only. Therefore, the CNN architecture did not benefit from learning correlations of channel gains between symbols. If SRS parameters are changed to place SRS samples in multiple symbols as in , channel estimation would become an image-to-image regression task. In this case, the CNN denoiser would perform better than the DNN denoiser because CNNs are better when learning features from images. Although the labels of power control and pilot allocation methods were derived from the solutions that maximize the SINR in EquationEquation (5)(5) ${SINR}_{net} = \sum_{l = 1}^{L} \sum_{k = 1}^{K_{l}} \frac{p_{k} β_{k}^{2} x_{k}^{} x_{k}^{*}}{\sum_{m = 1}^{K_{l}} \sum_{\begin{matrix} n = 1 \\ n \neq k \end{matrix}}^{K_{l}} {\sqrt{p_{m} p_{n}} β}_{m} β_{n} x_{m}^{} x_{n}^{*}}$ (5) , the average MSE of the pilot allocation method is much better by a factor of 2.0455 than that of power control. Moreover, without learning the channel gains directly pilot allocation performance improvement is close to that of CNN denoising.

Figure 4. Average mean square error on channel estimates for each pilot decontamination method.

The performance of pilot decontamination versus SNR is provided in by using the NMSE as a metric for each pilot decontamination method. As the SNR increases the NMSE decreases for each method. The NMSE for the DNN denoising scheme is the lowest. It is revealed that the NMSE of the pilot allocation method follows closely that of the CNN denoising scheme. For pilot allocation, DNN and CNN denoising the difference in NMSE diminishes as the SNR tends to the highest value. The NMSE difference between power control and pilot allocation stays almost the same as the SNR increases. Power control has the highest NMSE because the 5 G massive MIMO cellular system provided similar propagation conditions to SRSes transmitted from different users. This forced the power control scheme to assign comparable power levels to users. In turn, it led to large deviations when channel estimation was performed for the intended user. However, in real propagation environments, distinguishable propagation conditions exist allowing power-controlled SRSs to give channel estimates with less error. Repeated experiments at SNR above 10 did not show a clear trend on NMSE versus SNR because for high SNR the SRS are PC-limited. The study proposes to investigate NMSE against SINR under high SNR conditions.

Figure 5. Normalized mean square error versus signal-to-noise ratio performance.

4. Conclusion

In this paper, deep learning-based schemes that lessen pilot contamination in massive MIMO have been discussed. It has been shown that pilot contamination can be decreased by pilot assignment, power allocation, denoising the least square channel estimation and design of non-orthogonal pilots. Pilot design and pilot assignment schemes do not differ because they attempt to search for optimal pilots that can be used by users. Performance evaluation with average MSE and NMSE as indicators showed that denoisers are superior because the channel of users can be learned directly. Power allocation schemes have a performance that depends on propagation conditions experienced by each user. If it is not possible to train the neural network to predict the channels, pilot assignment schemes promise optimal performance at less complexity and computational time. This is because DNNs that have fast learning speed can be exploited for pilot allocation. In the future, the study of the deep learning-based pilot decontamination under a massive MIMO system based on reflective surfaces is essential. Research on the development of deep learning frameworks for channel estimation with real 5 G measurement data is also very important. Extension of the deep learning techniques for end-to-end joint pilot decontamination and beamforming using reinforcement learning or conditional generative adversarial net is another area for future investigations.

Disclosure statement

The authors report there are no competing interests to declare.

Additional information

Funding

This work did not receive any funding.

Notes on contributors

Crallet M. Victor

Crallet M. Victor received the B.Sc. and M.Sc. degrees in Telecommunications engineering from the University of Dar es Salaam, Tanzania in 2009 and the University of Dodoma, Tanzania in 2012 respectively. Currently, he is a PhD student in Telecommunications Engineering at the Department of Electronics and Telecommunications Engineering, College of Informatics and Virtual Education, University of Dodoma, Dodoma, Tanzania. He is also a Lecturer in the Department of Electronics and Telecommunications Engineering, College of Informatics and Virtual Education, University of Dodoma, Dodoma, Tanzania. His research interest includes wireless communication, deep learning for communication systems, multimedia systems, and electronics engineering. Email: [email protected], [email protected]

Alloys N. Mvuma

Aloys N. Mvuma received a BSc in Electrical Engineering degree from the University of Dar es Salaam in 1994, an MSc in Information Science from Shimane University, Japan, and a Doctor of Engineering (Systems Engineering) from Hiroshima University, Japan in 2003. He is currently an Associate Professor at Mbeya University of Science and Technology. His research interests include adaptive signal processing, digital communication systems, and information communication technologies for development. He has published over 20 IEEE conference papers and various journal papers. He is a registered member of the Engineers Registration Board (ERB) and the Institute of Electrical and Electronic Engineers (IEEE). Email: [email protected]

Salehe I. Mrutu

Salehe I. Mrutu received his B.Sc. and M.Sc. degrees in Computer Science from the International University of Africa (IUA) in 2003 and the University of Gezira in 2006 respectively. In 2014, He obtained his PhD in Information and Communication Science and Engineering from the Nelson Mandela African Institution of Science and Technology (NM-AIST) in Arusha, Tanzania. He is currently serving as a lecturer at The University of Dodoma (UDOM) under the College of Informatics and Virtual Education. His research interests include signal processing, artificial intelligence, forward error correction (FEC) codes, quality-of-service provisioning, and resource management for multimedia communications networks. Email: [email protected]

References

Almamori, A., & Mohan, S. (2017). A spectrally efficient algorithm for massive MIMO for mitigating pilot contamination [Paper presentation]. https://doi.org/10.1109/SARNOF.2017.8080384
Google Scholar
Almamori, A., & Mohan, S. (2018). Improved MMSE channel estimation in massive MIMO system with a method for the prediction of channel correlation matrix [Paper presentation]. IEEE 8th Annual Computing and Communication Workshop and Conference, CCWC 2018, 2018, vol. 2018-Janua, No. 1, pp. 670–672. https://doi.org/10.1109/CCWC.2018.8301699
Google Scholar
Alnajjar, K. A., & Abdallah, S. (2016). Performance of low complexity receivers for massive MIMO with channel estimation and correlation [Paper presentation].2016 IEEE 3rd International Symposium on Telecommunication Technologies (ISTT), pp. 1–5. https://doi.org/10.1109/ISTT.2016.7918074
Google Scholar
Alonzo, M., Buzzi, S., Zappone, A., & D'Elia, C. (2019). Energy-efficient power control in cell-free and user-centric massive MIMO at millimeter wave. IEEE Transactions on Green Communications and Networking, 3(3), 651–663. https://doi.org/10.1109/TGCN.2019.2908228
Google Scholar
Alwakeel, A. S., & Mehana, A. H. (2017). Semi-blind channel estimation and interference alignment for massive MIMO Network [Paper presentation].in GLOBECOM 2017 - 2017 IEEE Global Communications Conference, vol. 2018-Janua. https://doi.org/10.1109/ICC.2018.8422351
Google Scholar
Amiri, E., Mueller, R., & Gerstacker, W. (2017). Blind pilot decontamination in massive MIMO by independent component analysis [Paper presentation].2017 IEEE Globecom Workshops (GC Wkshps), in vol. 2018-Janua, pp. 1–5. https://doi.org/10.1109/GLOCOMW.2017.8269156
Google Scholar
Aoudia, F. A., & Hoydis, J. (2018). End-to-end learning of communications systems without a channel model [Paper presentation]. 52nd Asilomar Conf. Signals, Syst. Comput, pp. 298–303.
Google Scholar
Balevi, E., Doshi, A., & Andrews, J. G. (2020). Massive MIMO channel estimation with an untrained deep neural network. IEEE Transactions on Wireless Communications, 19(3), 2079–2090. https://doi.org/10.1109/TWC.2019.2962474
Web of Science ®Google Scholar
Biglieri, E., Calderbank, R., Constantinides, A., Goldsmith, A., Paulraj, A., & Poor, H. V. (2007). MIMO wireless communications. Cambridge University Press.
Google Scholar
Björnson, E., Hoydis, J., & Sanguinetti, L. (2017). Massive MIMO networks: Spectral, energy, and hardware efficiency. Foundations and Trends® in Signal Processing, 11(3-4), 154–655. https://doi.org/10.1561/2000000093
Google Scholar
Bjornson, E., Larsson, E. G., & Debbah, M. (2016). Massive MIMO for maximal spectral efficiency: How many users and pilots should be allocated? IEEE Transactions on Wireless Communications, 15(2), 1293–1308. https://doi.org/10.1109/TWC.2015.2488634
Google Scholar
Boccardi, F., Heath, R., Lozano, A., Marzetta, T. L., & Popovski, P. (2014). Five disruptive technology directions for 5G. IEEE Communications Magazine, 52(2), 74–80. https://doi.org/10.1109/MCOM.2014.6736746
Web of Science ®Google Scholar
Buzzi, S., & Zappone, A. (2017). Downlink power control in user-centric and cell-free massive MIMO wireless networks [Paper presentation].in IEEE International Symposium on Personal, Indoor, and Mobile Radio Communications, PIMRC, 2017, volOctob, no. 1, pp. 1–6. https://doi.org/10.1109/PIMRC.2017.8292293
Google Scholar
Chen, Y., Ran, R., & Oh, S. (2016). Performance analysis for pilot decontamination of Massive MIMO using alternating projection [Paper presentation].in International Conference on Ubiquitous and Future Networks, ICUFN, vol. 2016Augus, pp. 297–300. https://doi.org/10.1109/ICUFN.2016.7537036
Google Scholar
Cheng, Q., Fang, G., Nguyen, D., & Dutkiewicz, E. (2016). Novel pilot decontamination methods for Massive MIMO systems under practical scenarios [Paper presentation]. 2016 16th Int. Symp. Commun. Inf. Technol. Isc, 2016, pp. 77–81. https://doi.org/10.1109/ISCIT.2016.7751595
Google Scholar
Chun, C. J., Kang, J. M., & Kim, I. M. (2019). Deep learning-based joint pilot design and channel estimation for multiuser MIMO channels. IEEE Communications Letters, 23(11), 1999–2003. https://doi.org/10.1109/LCOMM.2019.2937488
Google Scholar
D’Andrea, C., Zappone, A., Buzzi, S., & Debbah, M. (2019). Uplink power control in cell-free massive MIMO via deep learning [Paper presentation]. 2019 IEEE 8th international workshop on computational advances in multi-sensor adaptive processing, CAMSAP 2019 - proceedings, pp. 554–558. https://doi.org/10.1109/CAMSAP45676.2019.9022520
Google Scholar
Elijah, O., Leow, C. Y., Rahman, T. A., Nunoo, S., & Iliya, S. Z. (2016). A comprehensive survey of pilot contamination in massive MIMO - 5G system. IEEE Communications Surveys & Tutorials, 18(2), 905–923. https://doi.org/10.1109/COMST.2015.2504379
Google Scholar
ETSI. (2017). Study on 5G channel model for frequencies from 0.5 to 100 GHz [Online]. http://www.etsi.org/standards-search
Google Scholar
Fatema, N., Xiang, Y., & Natgunanathan, I. (2017). Analysis of a semi-blind pilot decontamination method in massive MIMO [Paper presentation].27th Int. Telecommun. Networks Appl. Conf. ITNAC, 2017 2017, vol. 2017-Janua, pp. 1–6. https://doi.org/10.1109/ATNAC.2017.8215368
Google Scholar
Ferrante, G. C., Geraci, G., & Quek, T. Q. S. (2017). A group-blind detection scheme for uplink multi-cell massive MIMO [Paper presentation]. IEEE Veh. Technol. Conf, vol. 2017-June. https://doi.org/10.1109/VTCSpring.2017.8108326
Google Scholar
Ferrante, G. C., Geraci, G., Quek, T. Q. S., & Win, M. Z. (2016). Group-blind detection with very large antenna arrays in the presence of pilot contamination [Paper presentation]. ICASSP, IEEE Int. Conf. Acoust. Speech Signal Process. - Proc., vol. 2016-May, no. 1, pp. 3701–3705. https://doi.org/10.1109/ICASSP.2016.7472368
Google Scholar
Giordano, L. G. (2018). Uplink sounding reference signal coordination to combat pilot contamination in 5G massive MIMO [Paper presentation]. IEEE Wireless Communications and Networking Conference, WCNC, vol. 2018-April, 1–6. https://doi.org/10.1109/WCNC.2018.8377316
Google Scholar
Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep learning. MIT Press.
Google Scholar
GPP. (2020). Physical channels and modulation (3GPP TS 38.211 version 15.3.0 Release 15) [Online]. https://portal.etsi.org/TB/ETSIDeliverableStatus.aspx
Google Scholar
Hirose, H., Tomoaki, O., & Guan, G. (2021). Deep learning-based channel estimation for massive MIMO systems with pilot contamination. IEEE Open Journal of Vehicular Technology, 2(4), 67–77. https://doi.org/10.1109/OJVT.2020.3045470
Google Scholar
Jiang, P., Wen, C., Jin, S., & Li, G. Y. (2021). Dual CNN based channel estimation for MIMO-OFDM systems. IEEE Transactions on Communications, 69(9), 5859–5872. https://doi.org/10.1109/TCOMM.2021.3085895
Google Scholar
Jose, J., Ashikhmin, A., Marzetta, T. L., & Vishwanath, S. (2011). Pilot contamination and precoding in multi-cell TDD systems. IEEE Transactions on Wireless Communications, 10(8), 2640–2651. https://doi.org/10.1109/TWC.2011.060711.101155
Web of Science ®Google Scholar
Kim, K., Lee, J., & Choi, J. (2018). Deep learning based pilot allocation scheme (DL-PAS) for 5G massive MIMO system. IEEE Communications Letters, 22(4), 828–831. https://doi.org/10.1109/LCOMM.2018.2803054
Web of Science ®Google Scholar
Lim, B., Yun, W. J., Kim, J., & Ko, Y. (2021). Joint pilot design and channel estimation using deep residual learning for multi-cell massive MIMO under hardware impairments. arXiv:2108.04485v1, pp. 1–12. [Online]. http://arxiv.org/abs/2108.04485
Google Scholar
Marzetta, T. L. (2010). Noncooperative cellular wireless with unlimited numbers of base station antennas. IEEE Transactions on Wireless Communications, 9(11), 3590–3600. https://doi.org/10.1109/TWC.2010.092810.091092
Web of Science ®Google Scholar
Mathworks Inc. (2022). New radio sounding reference signals configuration. https://www.mathworks.com/help/5g/ug/nr-sounding-reference-signals.html
Google Scholar
Mathworks Inc. (2023). NR uplink channel state information estimation using SRS. https://www.mathworks.com/help/5g/ug/nr-uplink-channel-state-information-estimation-using-srs.html
Google Scholar
Mezghani, A., & Swindlehurst, A. L. (2017). “Blind estimation of sparse multi-user massive MIMO channels,” in 21st International ITG Workshop on Smart Antennas, WSA 2017 pp. 265–269.
Google Scholar
Misso, A., Kissaka, M., & Maiseli, B. (2020). Exploring pilot assignment methods for pilot contamination mitigation in massive MIMO systems. Cogent Eng, 7(1), 1831126. https://doi.org/10.1080/23311916.2020
Google Scholar
Müller, R. R., Vehkaperä, M., & Cottatellucci, L. (2014). Blind pilot decontamination. IEEE Journal of Selected Topics in Signal Processing, 8(5), 773–786. https://doi.org/10.1109/JSTSP.2014.2310053
Google Scholar
Nayebi, E., & Rao, B. D. (2018). Semi-blind channel estimation for multiuser massive MIMO systems. IEEE Transactions on Signal Processing, 66(2), 540–553. https://doi.org/10.1109/TSP.2017.2771725
Google Scholar
Neumann, D., Joham, M., & Utschick, W. (2014). Suppression of pilot-contamination in massive MIMO Systems. IEEE 15th Int. Work. Signal Process. Adv. Wirel. Commun., no. 4, pp. 11–15.
Google Scholar
Neumann, D., Joham, M., & Utschick, W. (2018). Covariance matrix estimation in massive MIMO. IEEE Signal Processing Letters, 25(6), 863–867. https://doi.org/10.1109/LSP.2018.2827323
Google Scholar
O’Shea, T. O., & Hoydis, J. (2017). An introduction to deep learning for the physical layer. IEEE Transactions on Cognitive Communications and Networking, 3(4), 563–575. https://doi.org/10.1109/TCCN.2017.2758370
Google Scholar
Omid, Y., Hosseini, S. M., Shahabi, S. M., Shikh-Bahaei, M., & Nallanathan, A. (2021). AoA-based pilot assignment in massive MIMO systems using deep reinforcement learning. IEEE Communications Letters, 25(9), 2948–2952. https://doi.org/10.1109/LCOMM.2021.3089234
Google Scholar
Orhan, S., & Bastanlar, Y. (2018). Training CNNs with image patches for object localization. Electronics Letters, 54(7), 424–426. https://doi.org/10.1049/el.2017.4725
Google Scholar
Pitarokoilis, A., Bjomson, E., & Larsson, E. G. (2017). On the effect of imperfect timing synchronization on pilot contamination, [Paper presentation]. https://doi.org/10.1109/ICC.2017.7996671
Google Scholar
Rodriguez, J. (2015). Fundamentals of 5G mobile networks. John Wiley & Sons.
Google Scholar
Sanguinetti, L., Zappone, A., & Debbah, M. (2018). Deep learning power allocation in massive MIMO. arXiv:1812.03640v2 [Online]. https://doi.org/10.48550/arXiv.1812.03640
Google Scholar
Sekou, T. B., Hidane, M., Olivier, J., & Cardot, H. (2019). From patch to image segmentation using fully convolutional networks – application to retinal images,” arXiv:190403892v2 [Online]. http://arxiv.org/abs/1904.03892
Google Scholar
Ulyanov, D., Vedaldi, A., & Lempitsky, V. (2018). Deep image prior [Paper presentation]. 2018 IEEE/CVF Conf. Comput. Vis. Pattern Recognit,., pp. 9446–9454. https://doi.org/10.1109/CVPR.2018.00984
Google Scholar
Wang, X., & Xu, Y. (2020). Pilot-assisted channel estimation and signal detection in uplink multi-user MIMO systems with deep learning. IEEE Access, 8, 44936–44946. https://doi.org/10.1109/ACCESS.2020.2978253
Google Scholar
Xu, J., Zhu, P., Li, J., & You, X. (2019). Deep learning based pilot design for multi-user distributed massive MIMO systems. IEEE Wireless Communications, 8(4), 1016–1019. https://doi.org/10.1109/lwc.2019.2904229
Google Scholar
Ye, H., Li, G. Y., Juang, B. H. F., & Sivanesan, K. (2018). Channel agnostic end-to-end learning based communication systems with conditional GAN [Paper presentation]. 2018 IEEE Globecom Workshops (GC Wkshps), pp. 1–5. https://doi.org/10.1109/GLOCOMW.2018.8644250
Google Scholar
Zhang, J., Yuan, X., & Zhang, Y. J. A. (2018). Blind signal detection in massive MIMO: Exploiting the channel sparsity. IEEE Transactions on Communications, 66(2), 700–712. https://doi.org/10.1109/TCOMM.2017.2761384
Google Scholar
Zhao, L., Zhao, H., Zheng, K., & Xiang, W. (2018). Massive MIMO in 5G networks : Selected applications. Springer.
Google Scholar

A review on deep learning aided pilot decontamination in massive MIMO

Abstract

1. Introduction