Semi-Blind Strategies for MMSE Channel Estimation Utilizing Generative Priors
Franz Weißer, Nurettin Turan,
Dominik Semmler, Fares Ben Jazia, and Wolfgang Utschick
This work was supported by the Federal Ministry of Education and Research of Germany in the programme of “Souverän. Digital. Vernetzt.”. Joint project 6G-life, project identification number: 16KISK002.
An earlier version of this work was presented at ICASSP’24[1].
The authors are with the TUM School of Computation, Information and Technology, Technical University of Munich, 80333 Munich, Germany (e-mail: [email protected]).
Abstract
This paper investigates semi-blind channel estimation for massive multiple-input multiple-output (MIMO) systems.
To this end, we first estimate a subspace based on all received symbols (pilot and payload) to provide additional information for subsequent channel estimation.
We show how this additional information enhances minimum mean square error (MMSE) channel estimation.
Two variants of the linear MMSE (LMMSE) estimator are formulated, where the first one solves the estimation within the subspace, and the second one uses a subspace projection as a preprocessing step.
Theoretical derivations show the superior estimation performance of the latter method in terms of mean square error for uncorrelated Rayleigh fading.
Subsequently, we introduce parameterizations of this semi-blind LMMSE estimator based on two different conditional Gaussian latent models, i.e., the Gaussian mixture model and the variational autoencoder.
Both models learn the underlying channel distribution of the propagation environment based on training data
and serve as generative priors for semi-blind channel estimation.
Extensive simulations for real-world measurement data and spatial channel models show the superior performance of the proposed methods compared to state-of-the-art semi-blind channel estimators with respect to the MSE.
Accurate channel state information (CSI) is crucial for achieving the expected high data rates promised by multiple-input-multiple-output (MIMO) systems [2, 3, 4].
The CSI describes the communication link between transmitter and receiver, characterized by its time-varying and frequency-selective nature, which is prone to rapid changes
making the task of channel estimation complex [5].
As accurate channel estimates are essential for the successful transmission of data, it is placed at the center of several research efforts [6, 7].
The most widely adopted methods in wireless communication utilize known training or pilot symbols
transmitted across the channel using some of the radio resource blocks [8].
Afterward, the receiver uses the observed signals to determine a reliable CSI estimate.
As the number of pilots scales with the number of users, the spectral efficiency decreases for higher number of users as less symbols are available for transmitting data.
To enhance channel estimation without increasing the number of pilot symbols, various methods have been developed that leverage the information embedded in the observed data symbols at the receiver to infer channel characteristics [9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25].
These methods exploit structure and redundancy within the transmitted data and yield more accurate CSI estimates.
The benefit of semi-blind channel estimation was first studied in [9], where Cramer-Rao bounds (CRBs) for blind, semi-blind, and training-based channel estimation were investigated in the context of single-input-multiple-output (SIMO) systems.
In [10, 11], semi-blind channel estimation schemes based on maximum likelihood (ML) estimation were introduced.
The asymptotic performance of the respective estimators was also studied in [10] for infinitely long data sequences.
Another asymptotic behavior, where the number of antennas grows to infinity, was studied in [12]. Here, the authors identified two interference components in semi-blind channel estimation, which do not vanish even for large numbers of antennas.
Early work on improving the least squares (LS) estimator using a semi-blind algorithm was conducted in [13].
The authors in [14] propose to use partially decoded data for channel estimation.
Furthermore, in [15], semi-blind and blind channel estimation was studied to enhance the maximum a-posteriori (MAP) channel estimates in massive MIMO systems.
These findings are based on favorable propagation, which only holds for large antenna arrays deployed at the base station (BS).
In [16], two semi-blind channel estimators based on the expectation maximization (EM) algorithm are studied.
The assumption of a Gaussian distribution for the data was verified, leading to a closed-form solution for the E-step.
Another iterative framework optimizing the likelihood based on message passing (MP) is used in [17, 18] and references therein.
In [19], a data-aided iterative scheme is proposed for orthogonal time frequency space (OTFS) systems by employing affine-precoded superimposed pilots.
The performance improvement achieved with these iterative approaches generally requires computational costly updates.
A low complexity iterative LS channel estimation algorithm is proposed in [20] for a massive MIMO turbo-receiver.
In [21], the concept of semi-blind channel estimation was adapted for time domain synchronous-orthogonal frequency division multiplexing (OFDM) systems, where in addition to the pseudo noise sequence in the guard interval, the sent OFDM data symbols are exploited for the channel estimation.
In [22], a framework was introduced for choosing reliable decoded data symbols, which can be interpreted as additional pilots.
Similarly, reliably detected symbols are used for a semi-data-aided channel estimation in [23].
In [24], peak-power carriers in an OFDM system are selected to eliminate the need to determine reliable data symbols at the receiver.
Recently, a diffusion model-based approach for joint channel estimation and detection was proposed in [25], where a diffusion process is constructed that models the joint distribution of the channels and symbols given noisy observations.
In this article, we focus on pilot-based estimators which minimize the MSE and investigate how these estimators can be extended for the semi-blind case.
Notably, in [26] a subspace formulation of the MMSE estimator was used to mitigate pilot contamination in massive MIMO systems.
The MMSE estimator is known to be the conditional mean estimator (CME) [27, Ch. 10], which, in general, is intractable and can not be computed in closed form.
Recently, powerful approximations based on machine learning were presented in [28, 29, 30, 31, 32].
The benefit of machine learning is to enhance the task at hand by using prior information captured during the learning stage.
For a given BS cell environment, the probability density function (PDF) representing potential user channels can be considered valuable prior information.
Since this true underlying distribution is unknown, machine learning methods rely on a representative data set, which is assumed to be available at the BS.
Based on this data set the PDF of the user channels can be learned.
The first proposal of using a Gaussian mixture model (GMM) to formulate an estimator was done in [28] for the case of image processing.
The approaches in [30, 31, 32] build on that by constructing a conditionally Gaussian latent model (CGLM) for the PDF of a BS cell environment.
The learned CGLM not only enables MMSE channel estimation in [30, 31, 32] but can also be used for e.g., a limited feedback scheme as in [33].
In this work, we propose to utilize CGLMs to parameterize the CME in the semi-blind setting.
The contributions of this work are summarized as follows:
•
We introduce two variants of the linear MMSE (LMMSE) estimator incorporating subspace knowledge provided by the payload data symbols.
First, we depict how the LMMSE channel estimator can solve a subspace estimation problem [26].
As an alternative, we propose a projection method that is computationally more efficient since it allows for the pre-calculation of LMMSE filters.
•
With theoretical derivations we show the superior MSE performance of the proposed projection method in the case of uncorrelated Rayleigh fading and perfect subspace knowledge.
•
We show how the GMM [30] and variational autoencoder (VAE) [32], instances of the class of CGLMs, can be used to parameterize the semi-blind LMMSE estimator.
•
Extensive simulations on different datasets, consisting of typical massive MIMO systems with multiple users and including real-world measurement data, show the superior performance of our proposed methods compared to state-of-the-art semi-blind channel estimation algorithms with respect to the MSE.
Preliminary results were presented in [1] and extended to the multi-user MIMO case in [34], which we extend further in the following aspects.
The theoretical analyses in SectionIII enhance the foundation of the proposed semi-blind channel estimation strategies and provide analytic insights into the superior performance of the proposed projection method.
We extend our concept of semi-blind MMSE channel estimation to the whole class of CGLMs, providing a more general framework to parameterize the semi-blind LMMSE estimator.
Finally, we provide more comprehensive simulation results to show the strengths of our proposed strategies.
Notations: Matrices and vectors are denoted with boldface symbols. and denote the zero vector of appropriate size and the identity matrix of size , respectively. , , , and denote the expectation, trace, range, and rank operators, respectively. We use , , to denote the transpose, conjugate transpose, and inverse. denotes the -norm of a vector. denotes the circularly symmetric complex Gaussian distribution with mean and covariance matrix .
II System and Channel Model
We consider a multi-user uplink system with single-antenna users and a BS equipped with receive antennas.
The received signal vector at time instance is then
(1)
where and denote the signal sent by each of the users and the additive noise, respectively, whereas contains the individual channels of the users .
The case of multiple antennas at the users can be transformed into (1) by viewing each active stream as a different user.
The corresponding channel can then be seen as the effective channel.
For further details on semi-blind channel estimation in multi-user MIMO, we refer the reader to [34].
For the task of channel estimation, we consider a channel coherence interval larger than the number of snapshots , i.e., the channels are constant over all snapshots.
We assume that the noise is Gaussian with .
In conventional channel estimation schemes, each user’s signals include uplink pilots. These pilots are known to the BS.
Hence, the received observations at the BS side are
(2)
where , , , , and denote all received observations, received pilot observations, received payload data observations, sent pilots, and sent payload data symbols, respectively.
In order to fully illuminate the channels, the number of pilots is, at minimum, the number of users , and orthogonal pilots are used.
We set , and utilize discrete Fourier transform (DFT) pilot sequences.
After decorrelating the orthogonal pilot sequences
the received pilot observations simplify to
(3)
where has the same statistics as and, hence, we can omit the subscript.
This enables to consider channel estimation from a per user perspective in the subsequent discussions.
For reasons of simpler readability, the index for the respective user is, therefore, no longer given in the following.
Consequently, we denote the pilot observation of a user as
(4)
with .
II-ASpatial Channel Model
We consider a spatial channel model based on [35], where the channel vectors are considered as conditionally Gaussian distributed [29]
(5)
based on a set of parameters , which describe the directions and properties of the multi-path propagation clusters.
The main angles are drawn independently from a uniform distribution in and the path gains are independent zero-mean Gaussians.
The spatial covariance matrix is given by
(6)
where is the power density consisting of a weighted sum of Laplace densities, which have standard deviations corresponding to the angular spread of the propagation clusters.
The BS employs a uniform linear array (ULA) with antennas and spacing.
The steering vector is then given as
(7)
For every channel sample, we consider a new and draw the sample according to .
II-BMeasurement Campaign
Since synthetic data capture real-world CSI characteristics only up to some extent, we utilize real-world data from a measurement campaign conducted at the Nokia campus in Stuttgart, Germany, in October/November 2017,
cf. [36].
The BS antenna with a uniform rectangular array (URA) comprises vertical ( spacing) and horizontal ( spacing) single polarized patch antennas.
The operating carrier frequency is GHz and the antenna was mounted on a rooftop approximately meters above the ground.
For further details, we refer the reader to [36].
III Semi-Blind Channel Estimation using Perfect Statistical Knowledge
In this section, we introduce two variants of the LMMSE estimator incorporating information provided by the payload data symbols.
To do so, we first restrict ourselves to the case where perfect statistical knowledge is available at the receiver.
In channel estimation, commonly, only the pilot observation is considered for channel estimation.
The MSE optimal estimator given the pilot observation is the CME
(8)
If the channel sample is drawn from a Gaussian distribution according to
(9)
and if further this statistic is known at the receiver,
the genie-aided CME can be formulated as [27, Ch. 10]
(10)
This LMMSE estimator
achieves the MSE of
(11)
For , the MSE can be expressed using the Woodbury identity as
(12)
where are the eigenvalues of .
For our considerations concerning semi-blind channel estimation, we assume knowledge about the subspace defined by , where we denote with the left singular vectors of , which span the same subspace as the columns of .
Generally, we can formulate the ML estimate of in view of (2) as [16]
(13)
where the first term belongs to the pilot observations and the second part refers to the observation obtained from the payload symbols.
In [16] the EM algorithm is introduced to solve this channel estimation problem in terms of maximum likelihood, whereas, in [11] a semi-blind method is derived based on utilizing the subspace .
Now, given the subspace , we can reformulate (13) as [11]
(14)
with and .
The optimal solution for is given as
(15)
(16)
Reinserting this solution into (13), the right part of the objective simplifies to
(17)
where only the left part depends on . Thus, the ML problem for the user of interest results in [11]
(18)
with the closed form solution .
The MSE of this estimator is
(19)
In the following, we introduce two channel estimation strategies combining the information provided by and .
III-ASubspace Channel Estimator
Using the information in , we can solve the estimation within the subspace as previously proposed in [26].
For this, the pilot system model in (4) is projected into the -dimensional subspace as
(20)
Under the assumption that is chosen independently of , the distribution can be used to formulate the estimate [26]
(21)
One should note that by design actually depends on and, hence, (21) is a suboptimal but feasible estimate for .
After solving the estimation in the subspace for , the solution can be transformed back using [26]
(22)
III-BProjected Channel Estimator
As an alternative approach, we propose the use of the orthogonal subspace projection as a preprocessing filter.
Since the projector does not affect , the resulting projected observation is given by
(23)
To formulate the CME , we need to calculate the statistic of the noise with
(24)
To get an intuitive understanding of (24), let us consider a scenario involving spatially uncorrelated channels, meaning that path gains and channel directions are uncorrelated.
This is the case when users are uniformly distributed over the directions, e.g., the spatial channel model in Section II-A, resulting in a channel covariance matrix of the scenario that is a scaled identity [37, Def. 2.3].
In such a case, the matrices with the eigenvectors of the sample covariance matrix of such channels are distributed with Haar measure [38, Chap. 1], i.e., uniformly distributed on the manifold of unitary matrices.
Assuming spatially uncorrelated channels (24) results in
(25)
which we assume to hold for the remainder of this section.
We can then formulate the projected LMMSE estimator as
(26)
III-CPerformance Analysis
If (25) is true, the MSE of the proposed projected channel estimator can directly be written as (cf. Section-A)
(27)
(28)
Comparing the performance to the plain LMMSE we see that
(29)
holds for every , resulting in , cf. (12).
The inequality in (29) only holds with equality if .
Additionally, we can compare the MSE of the projected LMMSE to (19) by reformulating (28) as
(30)
To compare the projected variant to the subspace LMMSE, let us consider uncorrelated Rayleigh fading such that the channel covariance matrix is given as .
The MSE of the projected variant results in
(31)
For the case of , the subspace LMMSE estimator boils down to
Fig.1 shows the performance of the individual channel estimators based on perfect genie knowledge for the case of uncorrelated Rayleigh fading.
As one can see, the projected channel estimator outperforms all other estimators across the whole signal-to-noise ratio (SNR) range.
Further, we realize that the subspace LMMSE converges to the ML method from above.
Figure 1:
MSE over the SNR for given channel estimations based on perfect subspace and perfect statistical knowledge in a user and antennas scenario with uncorrelated Rayleigh fading.
IV Proposed Semi-Blind Channel Estimation - Utilizing Generative Prior
In practice, (10) can not be utilized directly as the channels have to be Gaussian distributed, and this distribution needs to be known.
In general, the CME given the pilot observation is formulated as
(34)
As can be seen in (34), the CME generally can not be computed analytically.
First, the CME needs access to , which is generally unavailable in practice.
Additionally, no closed-form solution exists to the integral in (34).
In order to reformulate the CME, we first use the property that for any arbitrarily distributed random variable , we can always find a condition which makes the conditional distribution Gaussian.
Secondly, it has been shown in [39] that for wireless communication channels, this conditional Gaussian distribution preserves the zero-mean property as
(35)
Thus, we can reformulate the CME as
(36)
(37)
(38)
where
(39)
denotes the LMMSE estimate given .
However, finding a suitable condition can be challenging, in particular as the true distribution of is unknown.
To this end, CGLMs were proposed in [30, 31, 32], which approximate the CME based on a GMM, mixtures of factor analyzers (MFA), and VAE, respectively.
All three methods learn a model that provides the conditional Gaussian distribution based on a discrete (GMM, MFA) or continuous (VAE) latent variable .
In this work, we focus on the GMM and VAE, which we adapt to semi-blind channel estimation in the following.
IV-AGMM-based Semi-blind Channel Estimation
Based on the universal approximation property of GMMs [40],
the PDF of is approximated by
(40)
where , and are the mixing coefficients, means and covariance matrices of the -th GMM component, respectively.
As we are considering wireless channels, the mean of each component is set to , cf. [39].
The fitting of the components in (40) is accomplished with the well-known EM algorithm [41] based on a set of channel
samples as training data.
Based on the formulation in (40) the conditional PDF given is
(41)
Thus, in the case of a GMM we have a discrete latent variable, which helps us parameterize the CME.
The resulting semi-blind subspace GMM can be formulated as
Figure 2: Structure of a semi-blind VAE. The matrix contains the dominant eigenvectors of (57).
The encoder and decoder represent DNNs.
IV-BVAE-based Semi-blind Channel Estimation
To learn the unknown distribution using a VAE, we lower bound the parameterized likelihood using the evidence-lower bound (ELBO).
To formulate the ELBO, the variational distributions and are introduced, which approximate and , respectively.
In contrast to the GMM, the used subspace is unknown to the encoder of the VAE making difficult to learn.
Additionally, the dimension of the encoder input would depend on the number of users in the system.
Thus, we propose to approximate both posteriors and with .
A version of the ELBO for this case, which is accessible, can be written as [42]
(48)
where is the expectation over the variational distribution .
The second term in (48) is the Kullback-Leibler (KL) divergence
(49)
In the VAE framework, the ELBO is optimized using deep neural networks (DNNs) and the reparameterization trick [42].
In order to do so, the involved distributions are defined as
(50)
The resulting semi-blind VAE structure is shown in Fig.2.
In the case of a ULA or URA at the BS the channel covariance matrix is either Toeplitz or block-Toeplitz, respectively.
As shown in [39], the conditional covariance matrix at the output of the VAE preserves this structure.
Thus, we parameterize the output covariance matrix as
(51)
where or , respectively, where is a DFT matrix of size resulting in a circulant approximation, cf. [32], and contains the first columns of the DFT matrix resulting in a block-Toeplitz parameterization, cf. [43].
Further, for the (block-)Toeplitz parameterization we can set , cf. [39].
After successfully training the VAE,
the output is a local parameterization of as conditionally Gaussian
(52)
As analyzed in [32] it is a reasonable approximation to set
(53)
Based on this parameterization we can formulate the semi-blind VAE-based estimators as
(54)
and
(55)
The respective estimators are summarized in Algorithm3 and Algorithm4.
For a more detailed introduction into the VAE framework and its usage for parameterization of the CME we refer the reader to [32].
After introducing the methods utilizing the additional subspace information provided by to enhance the CSI estimation quality, let us consider the estimation of such a subspace.
As the received data symbols are also transmitted over the same channel, they can be used to estimate the subspace containing all user channels.
To this end, let us reconsider the ML estimate of in (13).
Instead of directly optimizing on this ML formulation as done in [16],
which generally does not result in the MMSE, we only take this log-likelihood formulation as an intermediate step to estimate the subspace .
First, let us consider the right term of the objective function in (13).
We can then reformulate the problem by again solving for first and reinserting the solution resulting in [11]
(56)
where and , with from (2).
The maximization in (56) is solved
by setting equal to
with holding the dominant eigenvectors of the receive sample covariance matrix .
This result has also been used in [26].
Additionally, it is trivial to see that the first term in (13) is minimized by .
The subspace spanned by the solution is the same as the subspace spanned by the eigenvectors of the sample covariance matrix , which ignores the additional phase information contained in the pilot observation.
Thus,
the overall subspace estimate is found by
taking the dominant eigenvectors of the sample covariance matrix defined as
(57)
To utilize information from the previous coherence intervals, one can adaptively update the subspace using efficient tracking algorithms as proposed in, e.g., [44, 45].
IV-DComplexity Analysis
The standalone GMM estimator proposed by [30] precomputes the filters used for the individual components, resulting in a complexity of .
For the standalone VAE the complexity is given as [32], where denotes the number of layers in the forward pass of the VAE.
For our semi-blind methods, the calculation of the subspace requires .
This results from calculating the sample covariance matrix with and taking the eigenvectors of the largest eigenvalues for the solution of (56).
Using the projection approximation subspace tracking (PAST) algorithm [45], the computational complexity of calculating the subspace reduces to for every update.
In the case of the subspace GMM the LMMSE estimates can not be precomputed, which results in a complexity of .
Similarily, the subspace VAE exhibits a complexity of .
For the projected versions of the GMM and VAE the complexity becomes and , respectively.
One should note that the calculation for each of the components in the GMM can be parallelized.
Similarly, the computations in the convolutional layers of the VAE can be parallelized, mitigating the complexity.
V Baseline estimators
To compare our methods, the following baseline channel estimators are considered.
Based on the found subspace we can formulate the pilot-based ML estimator as ,
which is the closed-form solution to (18).
This can be interpreted as the subspace-adjusted version of the conventional least squares (LS) channel estimator given as .
Another estimator is based on the sample covariance matrix, which we can compute from the
training data set to infer the global statistics of the channels as
(58)
We can use the matrix as statistical prior to parameterize the semi-blind channel estimators outlined in Section III-A and Section III-B as
(59)
and
(60)
Lastly, we compare our proposed methods to two iterative algorithms optimizing the ML formulation in (13), namely the EM from [16] and a MP variant similar to [17], which we run both until convergence or iteration, whatever comes first.
Figure 3:
NMSE over the SNR for given channel estimations based on observations including one pilot per user in a user scenario using (a) the spatial channel model and (b) measurement data.
To evaluate our proposed methods, we use channel realizations, which are normalized with .
Thus, we can define the SNR .
Further, the normalized MSE (NMSE) defined as
(61)
is used to characterize the performance of the estimators based on
unseen channel samples stemming from the channel models detailed in SectionsII-A and II-B.
The assumption of spatial uncorrelated channels, as used in (25), only holds for the spatial channel model of SectionII-A.
For the case of the measurement campaign described in SectionII-B, we approximate the noise covariance matrix (24)
as
(62)
We use with training samples from the respective channel model to train the GMM and VAE, where we set the number of components to and the latent dimension to , respectively.
Further, in the case of the VAE we allow non-zero values for , as we use the circulant approximation for the spatial channel model and the block-Toeplitz property is not perfectly fulfilled for the measurement data, due to hardware imperfections.
The “s-cov” variants (“sub. s-cov” and “proj. s-cov”) utilize the same training samples.
The number of BS antenna is set to , cf. SectionsII-A and II-B, serving number of users, a representative operating point [37, Chap. 1.3.3].
Further, the number of snapshots is set to , if not stated otherwise, corresponding to a scenario that allows high channel dispersion and high mobility, e.g., up to kph, c.f. [37, Chap. 2.1].
The sent symbols during data transmission generally stem from a discrete constellation, e.g., QPSK, -QAM.
For this work, we utilize Gaussian symbols with such that .
Using a continuous symbol constellation has a negligible effect on the results of the simulations, as also previously observed in [16].
Fig.3a and Fig.3b show the performance of the different channel estimation methods with respect to the SNR for the spatial channel model (cf. SectionII-A) and measurement data (cf. SectionII-B), respectively.
One can see that the semi-blind methods utilizing the CGLMs perform the best across the whole SNR.
The projected variant slightly outperforms its subspace counterpart for most SNR values, which follows the derivations in SectionIII.
Interestingly, the order of the semi-blind GMM and semi-blind VAE depends on the utilized channel model.
In Fig.3a, the projected GMM and VAE show both the best overall result, whereas in Fig.3b, the projected GMM outperforms all other estimators.
Additionally, we see that in Fig.3a, the subspace VAE outperforms the subspace GMM and in Fig.3b, the results are vice versa.
This ordering follows the ordering of the standalone version, where the plain GMM is better than the plain VAE in the case of the measurement data and worse for the spatial channel model.
For high SNR values, all semi-blind variants approach each other except the EM and MP methods.
In Fig.3a, the EM and MP drastically improve from dB to dB showing similar performance at dB as the other semi-blind methods,
whereas, in Fig.3b, they show inferior results also for high SNR.
The CGLM-based approaches keep a slight advantage even for high SNR values, which can be attributed to the fact that prior information is beneficial even for high SNR.
A notable observation is that, in the mid-SNR range, the semi-blind CGLM variants outperform all related estimators by roughly dB.
Figure 4:
NMSE over the number of observations for given channel estimations based on observations including one pilot per user at SNR dB in a user scenario using (a) the spatial channel model and (b) measurement data.
For our proposed strategies, the accuracy of the estimated subspace influences the performance and, hence, the NMSE depends on the number of snapshots as shown in Fig.4.
We see that for an increasing number of snapshots, the NMSE of our proposed methods decreases.
In the case of the spatial channel model (Fig.4a) the projected GMM and VAE perform best for low numbers of snapshots, where the standalone VAE surpasses all other methods for .
Additionally, we observe in Fig.4a that for high , the subspace VAE becomes the best of all considered methods.
In Fig.4b, we observe again that for the measurement data, the semi-blind GMM variants perform the best, where for high the subspace GMM and low the projected GMM outperforms all other methods.
Again, for less than snapshots, the semi-blind methods are outperformed by the standalone GMM and VAE due to inaccuracies in estimating the subspace with a low number of payload data symbols.
For both utilized channel models, the subspace variant of the superior CGLM surpasses the projected variant for high numbers of snapshots converging to a lower error level.
Thus, in practice, where, in general, uncorrelated Rayleigh fading is not the case, there are cases where the subspace CGLM outperforms its projected counterpart.
A critical decrease in performance can be observed for the EM and MP algorithms.
Here, the NMSE increases after a certain point when increasing the number of snapshots.
Even though the minimum appears at different , the overall behavior exhibits similarities.
This is because both methods optimize the joint ML formulation in (13), where the optimization of the second term becomes dominant for a high number of snapshots.
Hence, the impact of the pilot observations relevant to estimating the phase of the channel vanishes.
Figure 5:
NMSE over the number of users for given channel estimations based on observations including one pilot per user at SNR dB using (a) the spatial channel model and (b) measurement data.
The dimension of the subspace directly influences the estimation quality of the proposed methods as shown in Fig.5a and Fig.5b.
For example, in the extreme case where the number of users in the system is equal to the number of BS antennas (), the solution to (56) becomes and, hence, as the number of users in the system increases all semi-blind estimators approach their respective purely pilot based version.
We restrict our simulations within the interval of , which is said to be the preferred operating regime in massive MIMO [37, Chap. 1.3.3], and set the number of snapshots to .
In the case of a single user, all semi-blind variants exhibit similar performance, except for the subspace sample covariance estimator, the subspace VAE, and in the case of the spatial channel model (Fig.5a) the subspace GMM.
For all other considered numbers of users, the proposed projected CGLM methods outperform all other channel estimators.
Additionally, for the spatial channel model in Fig.5a, the subspace GMM also shows inferior results to the other CGLM-based methods for all numbers of users.
Overall, we can conclude that the proposed semi-blind CGLMs show superior channel estimation performance across all different setups.
Depending on the used channel model, either the semi-blind GMMs or the semi-blind VAEs result in slightly better NMSE, where only in the case of the spatial channel model, the subspace GMM shows slightly worse performance compared to the other proposed methods.
Moreover, the projected CGLMs outperform their respective subspace counterpart for most simulated operating points, showing the superiority of the proposed projection method.
VII Conclusion
This work presented a novel semi-blind channel estimation technique based on the class of CGLMs.
To this end, two methods are discussed that incorporate subspace knowledge about the channel into the well-known LMMSE estimator.
Both methods exploit the estimated subspace derived from the dominant eigenvectors of sample covariance matrices constructed using the received symbols.
A theoretical analysis of the methods showed the superior estimation quality of the proposed projection-based estimator for uncorrelated Rayleigh fading channels.
Secondly, we showed how two examples from the class of CGLMs, i.e., the GMM and VAE, can be used to parameterize these estimators.
Extensive simulations based on real-world measurement and spatial channel model data demonstrated the superior estimation performance of the proposed methods compared to standard semi-blind channel estimators.
-AMSE of Projected LMMSE
For any linear estimator , the MSE is given as
(63)
(64)
For the case of the projected LMMSE the second term in (64) can be rewritten as
The overall MSE of the subspace variant for is then
(84)
(85)
References
[1]
F. Weißer, N. Turan, D. Semmler, and W. Utschick, “Data-aided channel
estimation utilizing Gaussian mixture models,” in ICASSP 2024 - 2024
IEEE International Conference on Acoustics, Speech and Signal Processing
(ICASSP), 2024, pp. 8886–8890.
[2]
F. Rusek, D. Persson, B. K. Lau, E. G. Larsson, T. L. Marzetta, O. Edfors, and
F. Tufvesson, “Scaling up MIMO: Opportunities and challenges with very
large arrays,” IEEE Signal Processing Magazine, vol. 30, no. 1, pp.
40–60, 2013.
[3]
Y. Kabalci, 5G Mobile Communication Systems: Fundamentals, Challenges,
and Key Technologies. Singapore:
Springer, 2019, pp. 329–359. [Online]. Available:
https://doi.org/10.1007/978-981-13-1768-2_10
[4]
H. Ye, G. Y. Li, and B.-H. Juang, “Power of deep learning for channel
estimation and signal detection in OFDM systems,” IEEE Wireless
Communications Letters, vol. 7, no. 1, pp. 114–117, 2018.
[5]
Z. Shang, T. Zhang, G. Hu, Y. Cai, and W. Yang, “Secure transmission for
NOMA-based cognitive radio networks with imperfect CSI,” IEEE
Communications Letters, vol. 25, no. 8, pp. 2517–2521, 2021.
[6]
H. Harkat, P. Monteiro, A. Gameiro, F. Guiomar, and H. Farhana Thariq Ahmed,
“A survey on MIMO-OFDM systems: Review of recent trends,” Signals,
vol. 3, no. 2, pp. 359–395, 2022.
[7]
S. C and J. Sandeep, “A review of channel estimation mechanisms in wireless
communication networks,” in 2021 5th International Conference on
Electronics, Communication and Aerospace Technology (ICECA), 2021, pp.
603–608.
[8]
T. L. Marzetta, “How much training is required for multiuser MIMO?” in
2006 Fortieth Asilomar Conference on Signals, Systems and Computers,
2006, pp. 359–363.
[9]
E. De Carvalho and D. Slock, “Cramer-Rao bounds for semi-blind, blind and
training sequence based channel estimation,” in First IEEE Signal
Processing Workshop on Signal Processing Advances in Wireless
Communications, 1997, pp. 129–132.
[10]
——, “Asymptotic performance of ML methods for semi-blind channel
estimation,” in Conference Record of the Thirty-First Asilomar
Conference on Signals, Systems and Computers (Cat. No.97CB36136), vol. 2,
1997, pp. 1624–1628 vol.2.
[11]
A. Medles and D. Slock, “Augmenting the training sequence part in semiblind
estimation for MIMO channels,” in The Thirty-Seventh Asilomar
Conference on Signals, Systems & Computers, 2003, vol. 2, 2003, pp.
1825–1829 Vol.2.
[12]
J. Ma and L. Ping, “Data-aided channel estimation in large antenna systems,”
IEEE Transactions on Signal Processing, vol. 62, no. 12, pp.
3111–3124, 2014.
[13]
M. Joham, W. Utschick, J. A. Nossek, and M. D. Zoltowski, “Semi-blind channel
estimation: a new least-squares approach,” in International Conference
on Telecommunications, Cheju Island, Korea, 1999, pp. 416–420.
[14]
S. Park, B. Shim, and J. W. Choi, “Iterative channel estimation using virtual
pilot signals for MIMO-OFDM systems,” IEEE Transactions on Signal
Processing, vol. 63, no. 12, pp. 3032–3045, 2015.
[15]
D. Neumann, M. Joham, and W. Utschick, “Channel estimation in massive MIMO
systems,” 2015. [Online]. Available: https://arxiv.org/abs/1503.08691
[16]
E. Nayebi and B. D. Rao, “Semi-blind channel estimation for multiuser massive
MIMO systems,” IEEE Transactions on Signal Processing, vol. 66,
no. 2, pp. 540–553, 2018.
[17]
Y. Liu, L. Brunel, and J. J. Boutros, “Joint channel estimation and decoding
using Gaussian approximation in a factor graph over multipath channel,” in
2009 IEEE 20th International Symposium on Personal, Indoor and Mobile
Radio Communications, 2009, pp. 3164–3168.
[18]
S. Wu, L. Kuang, Z. Ni, D. Huang, Q. Guo, and J. Lu, “Message-passing receiver
for joint channel estimation and decoding in 3D massive MIMO-OFDM
systems,” IEEE Transactions on Wireless Communications, vol. 15,
no. 12, pp. 8122–8138, 2016.
[19]
A. Mehrotra, S. Srivastava, A. K. Jagannatham, and L. Hanzo, “Data-aided CSI
estimation using affine-precoded superimposed pilots in orthogonal time
frequency space modulated MIMO systems,” IEEE Transactions on
Communications, vol. 71, no. 8, pp. 4482–4498, 2023.
[20]
A. Osinsky, A. Ivanov, D. Lakontsev, R. Bychkov, and D. Yarotsky, “Data-aided
LS channel estimation in massive MIMO turbo-receiver,” in 2020
IEEE 91st Vehicular Technology Conference (VTC2020-Spring), 2020, pp. 1–5.
[21]
M. Liu, M. Crussiere, and J.-F. Helard, “A novel data-aided channel estimation
with reduced complexity for TDS-OFDM systems,” IEEE Transactions
on Broadcasting, vol. 58, no. 2, pp. 247–260, 2012.
[22]
I. Khan, M. Cheffena, and M. M. Hasan, “Data aided channel estimation for
MIMO-OFDM wireless systems using reliable carriers,” IEEE Access,
vol. 11, pp. 47 836–47 847, 2023.
[23]
T.-K. Kim, Y.-S. Jeon, J. Li, N. Tavangaran, and H. V. Poor, “Semi-data-aided
channel estimation for MIMO systems via reinforcement learning,”
IEEE Transactions on Wireless Communications, vol. 22, no. 7, pp.
4565–4579, 2023.
[24]
I. Khan, M. M. Hasan, and M. Cheffena, “A novel low-complexity
peak-power-assisted data-aided channel estimation scheme for MIMO-OFDM
wireless systems,” 2024. [Online]. Available:
https://arxiv.org/abs/2410.05722
[25]
N. Zilberstein, A. Swami, and S. Segarra, “Joint channel estimation and data
detection in massive MIMO systems based on diffusion models,” in
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech
and Signal Processing (ICASSP), 2024, pp. 13 291–13 295.
[26]
Y. Deng and T. Ohtsuki, “Low-complexity subspace MMSE channel estimation in
massive MU-MIMO system,” IEEE Access, vol. 8, pp.
124 371–124 381, 2020.
[27]
S. M. Kay, Fundamentals of Statistical Signal Processing: Estimation
Theory. Englewood Cliffs, NJ:
Prentice-Hall, Inc., 1993.
[28]
J. Yang, X. Liao, X. Yuan, P. Llull, D. J. Brady, G. Sapiro, and L. Carin,
“Compressive sensing by learning a Gaussian mixture model from
measurements,” IEEE Transactions on Image Processing, vol. 24, no. 1,
pp. 106–119, 2015.
[29]
D. Neumann, T. Wiese, and W. Utschick, “Learning the MMSE channel
estimator,” IEEE Transactions on Signal Processing, vol. 66, no. 11,
pp. 2905–2917, 2018.
[30]
M. Koller, B. Fesl, N. Turan, and W. Utschick, “An asymptotically
MSE-optimal estimator based on Gaussian mixture models,” IEEE
Transactions on Signal Processing, vol. 70, pp. 4109–4123, 2022.
[31]
B. Fesl, N. Turan, and W. Utschick, “Low-rank structured MMSE channel
estimation with mixtures of factor analyzers,” in 2023 57th Asilomar
Conference on Signals, Systems, and Computers, 2023, pp. 375–380.
[32]
M. Baur, B. Fesl, and W. Utschick, “Leveraging variational autoencoders for
parameterized MMSE estimation,” IEEE Transactions on Signal
Processing, vol. 72, pp. 3731–3744, 2024.
[33]
N. Turan, B. Fesl, M. Koller, M. Joham, and W. Utschick, “A versatile
low-complexity feedback scheme for FDD systems via generative modeling,”
IEEE Transactions on Wireless Communications, vol. 23, no. 6, pp.
6251–6265, 2024.
[34]
F. Weißer, D. Semmler, N. Turan, and W. Utschick, “Data-aided MU-MIMO
channel estimation utilizing Gaussian mixture models,” in ICC 2024 -
IEEE International Conference on Communications, 2024, pp. 6684–6689.
[36]
N. Turan, B. Fesl, M. Grundei, M. Koller, and W. Utschick, “Evaluation of a
Gaussian mixture model-based channel estimator using measurement data,” in
2022 International Symposium on Wireless Communication Systems
(ISWCS), 2022, pp. 1–6.
[37]
E. Björnson, J. Hoydis, and L. Sanguinetti, “Massive MIMO networks:
Spectral, energy, and hardware efficiency,” Foundations and Trends®
in Signal Processing, vol. 11, no. 3-4, pp. 154–655, 2017.
[38]
V. Milman and G. Schechtman, Asymptotic Theory Of Finite Dimensional
Normed Spaces, ser. Lecture Notes in Mathematics. Springer, 1986, vol. 1200.
[39]
B. Böck, M. Baur, N. Turan, D. Semmler, and W. Utschick, “A statistical
characterization of wireless channels conditioned on side information,”
IEEE Wireless Communications Letters, pp. 1–1, 2024.
[40]
T. T. Nguyen, H. D. Nguyen, F. Chamroukhi, and G. J. McLachlan, “Approximation
by finite mixtures of continuous density functions that vanish at infinity,”
Cogent Mathematics & Statistics, vol. 7, no. 1, p. 1750861, 2020.
[41]
C. M. Bishop, Pattern Recognition and Machine Learning (Information
Science and Statistics). Berlin,
Heidelberg: Springer-Verlag, 2006.
[42]
D. P. Kingma and M. Welling, “An introduction to variational autoencoders,”
Foundations and Trends® in Machine Learning, vol. 12,
no. 4, pp. 307–392, 2019.
[43]
M. Baur, B. Böck, N. Turan, and W. Utschick, “Variational autoencoder for
channel estimation: Real-world measurement insights,” in 2024 27th
International Workshop on Smart Antennas (WSA), 2024, pp. 117–122.
[44]
W. Utschick, “Tracking of signal subspace projectors,” IEEE
Transactions on Signal Processing, vol. 50, no. 4, pp. 769–778, 2002.
[45]
B. Yang, “Projection approximation subspace tracking,” IEEE
Transactions on Signal Processing, vol. 43, no. 1, pp. 95–107, 1995.