Diffusion magnetic resonance imaging (dMRI) allows to estimate brain tissue microstructure as well as the connectivity of the white matter (known as tractography). Accurate estimation of the model parameters (by solving the inverse problem) is thus very important to infer the underlying biophysical tissue properties and fiber orientations. Although there has been extensive research on this topic with a myriad of dMRI models, most models use standard nonlinear optimization techniques and only provide an estimate of the model parameters without any information (quantification) about uncertainty in their estimation. Further, the effect of this uncertainty on the estimation of the derived dMRI microstructural measures downstream (e.g., fractional anisotropy) is often unknown and is rarely estimated. To address this issue, we first design a new deep-learning algorithm to identify the number of crossing fibers in each voxel. Then, at each voxel, we propose a robust likelihood-free deep learning method to estimate not only the mean estimate of the parameters of a multi-fiber dMRI model (e.g., the biexponential model), but also its full posterior distribution. The posterior distribution is then used to estimate the uncertainty in the model parameters as well as the derived measures. We perform several synthetic and in-vivo quantitative experiments to demonstrate the robustness of our approach for different noise levels and out-of-distribution test samples. Besides, our approach is computationally fast and requires an order of magnitude less time than standard nonlinear fitting techniques. The proposed method demonstrates much lower error (compared to existing methods) in estimating several metrics, including number of fibers in a voxel, fiber orientation, and tensor eigenvalues. The proposed methodology is quite general and can be used for the estimation of the parameters from any other dMRI model.

Unlike histological techniques, diffusion magnetic resonance imaging (dMRI) enables researchers to study the microstructural properties of brain tissues in a noninvasive fashion. Furthermore, the brain’s structural connectivity can be efficiently mapped via dMRI-based tractography strategies (Malcolm et al., 2010; Tournier et al., 2019; F. Zhang et al., 2022). The diffusion signal can be described as a function of its model parameters (e.g., number of fibers, orientation, and diffusion coefficient). Accurate estimation of these parameters is paramount for any neuroimaging study, especially when it involves calculating statistics of dMRI-derived measures for a fiber bundle or region-of-interest (ROI) in the brain.

1.1 Literature review

In recent decades, there have been several models and tools used to estimate the fiber orientation and the underlying microstructural measures of brain tissue. Generally, the proposed models are categorized into two groups: 1) signal representation and 2) biophysical models. In the signal representation models, diffusion tensor imaging (DTI) is the simplest and the most popular one, where diffusion is modeled assuming a single fiber orientation with a Gaussian distribution (Basser et al., 1994). While it is an effective approach in some cases, DTI fails to adequately describe the underlying structure if a voxel contains multiple fiber populations or complex architectures (D. Alexander et al., 2002). Therefore, subsequent works (e.g., Dell’Acqua et al., 2010; Kreher et al., 2005; Peled et al., 2006; Rathi et al., 2009; Sid et al., 2017; Tuch et al., 2002) focused on addressing these challenges by introducing more complex diffusion models. As an expansion of DTI, Diffusion Kurtosis Imaging (DKI) quantifies the non-Gaussian diffusion in biological tissues (Jensen et al., 2005) that is compatible with crossing fibers (Henriques et al., 2015), and results in a better characterization of tissue microstructural heterogeneity (Jensen & Helpern, 2010). Instead of directly estimating the number/direction of the fibers, an orientation distribution function (ODF) is estimated that provides the probability of diffusion in various directions (Tuch, 2004). The most pupolar among these methods uses the spherical harmonic basis (Descoteaux et al., 2007; Hess et al., 2006) to represent the ODF. Performing spherical deconvolution and modeling the signal response from a single-fiber allows one to compute the fiber-ODF (or fODF) (Aganj et al., 2010; Dell’Acqua & Tournier, 2019; Jian & Vemuri, 2007; Kaden et al., 2007). Finally, a 3-dimensional ensemble average diffusion propagator was proposed (Merlet & Deriche, 2013; Ning, Westin, et al., 2015; Özarslan et al., 2013) to compute the 3-dimensional probability distribution of the displacement of water molecules.

Another category of diffusion models aim to quantify the biophysical structure of the tissue in the brain. In H. Zhang et al. (2012), the authors introduce a new diffusion model known as neurite orientation dispersion and density imaging (NODDI) that can portray the geometrical angular variation of the fiber populations via a fiber orientation dispersion index. Other recent models include the standard model of diffusion and models that use specialized b-tensor encoding sequences (Coelho et al., 2022; Jelescu et al., 2016).

A comprehensive review of different diffusion models, analytical comparisons of the techniques, and existing challenges are provided in D.C. Alexander (2005), Assemlal et al. (2011), Descoteaux et al. (2008), Jelescu and Budde (2017), Novikov et al. (2019), and Panagiotaki et al. (2012). Histological validation and evaluation of various dMRI techniques can be found in Novikov et al. (2018) and Schilling et al. (2018). The sensitivity of diffusion MRI to microstructural properties and experimental factors is reviewed in Afzali et al. (2021).

Most existing methods use nonlinear least squares (or similar) optimization techniques to estimate the model parameters. However, given that the dMRI signal has a very low signal-to-noise ratio (SNR), the estimation of these parameters can vary significantly depending on the initialization of the optimization algorithm or the minimization method used (Jelescu et al., 2016; Jones & Basser, 2004; Koay & Basser, 2006). Some recent works have tried to address this issue using machine-learning techniques (Gibbons et al., 2019; D. Karimi et al., 2021; Tian et al., 2020), however, none of these methods address the problem of uncertainty quantification. Further, these methods do not consider the problem of crossing fibers in the estimation of the model parameters, which dramatically increases the complexity of the inverse problem.

Given the low SNR, characterization of the uncertainty in estimation becomes paramount for robust post-processing and analysis. For example, the mean FA can be highly biased if several voxel-wise estimates of eigenvalues are inaccurate, thereby reducing statistical power and increasing the possibility of false positive results when performing group analysis. To cope with the Rician noise in dMRI (Gudbjartsson & Patz, 1995), a method using feature fusion and attention network (FFA-DMRI) was proposed to account for high noise in the measured dMRI data (Hong et al., 2020). Some methods have been proposed to estimate the cone of uncertainty (Koay et al., 2008; Malcolm et al., 2010; Reddy & Rathi, 2016) using error propagation methods or within an unscented Kalman filter based tractography method. However, they only estimate the covariance between the relevant parameters and do not provide access to the full posterior distribution, that is, the joint distribution between the parameters of the model (which could be high dimensional). Uncertainty calculation is particularly relevant for dMRI tractography, which is associated with the problem of tracing false positive tracts (Maier-Hein et al., 2017).

In dMRI, the forward model is well-defined; however, the inverse problem is plagued with ambiguity as several likely solutions might exist (Ardizzone et al., 2018; Jelescu et al., 2016; Novikov et al., 2018; Scherrer & Warfield, 2010). Knowledge of the full posterior distribution of the model parameters can help resolve this ambiguity and enable a better understanding of the model uncertainties. Quantification of this uncertainty and its use in calculating robust statistics in neuroimaging studies will help reduce false positive results and could be of great use when performing tractography. Machine learning has become an effective approach in calculating uncertainty and its application. For example, Reinhold et al. (2020) designed a Bayesian deep learning method that learns to estimate voxel-wise uncertainty for pathological purposes. In a recent work, Jallais et al. (2021) employ a normalized flow technique to solve the inverse problem of grey matter modeling. In Tanno et al. (2021), the authors presents a strategy for uncertainty modeling in deep learning via two different signal representations of dMRI. To the best of our knowledge, characterizing the full posterior distribution of the diffusion model parameters has not been investigated yet. Therefore, this paper aims to bridge this gap.

1.2 Contributions

We propose several novel contributions in this work, namely: (i) a deep learning method to accurately estimate the number of principal fiber orientations per voxel, (ii) a likelihood-free inference method to estimate the full posterior distribution of the underlying model parameters (despite the large dimensionality of problem), and (iii) a method to estimate the uncertainty in the model parameters as well as the derived measures, which has not been done before beyond the DTI model.

To fully characterize the uncertainty associated with the estimated parameters of a dMRI model, estimation of the full posterior distribution (conditioned on the observed measurement) is required. However, the number of parameters to estimate depends on the number of fibers within each voxel (assuming a discrete number of fiber orientations per voxel). Thus, one has to first solve the model selection problem before addressing the problem of estimating the full posterior distribution of the model parameters. To do so, we develop a voxel-wise classifier mechanism that categorizes the dMRI signal based on the possible number of principal fiber orientations. This classifier takes the ODF of each voxel and passes it to a convolutional neural network (CNN), which determines the number of crossing fibers at that voxel. Next, we employ a likelihood-free inference approach, which allows for estimating the full posterior distribution of the underlying model parameters at that voxel using a neural network (NN) setup. Note that, the number of model parameters to estimate can be large and increases linearly with the number of fibers present in each voxel. Training can be easily done in this method using simulated samples that are obtained via the forward model. Third, we use an unscented transform to derive analytical expressions for uncertainty (quantified using the variance) in some of the derived dMRI measures (e.g., FA, RTOP, MSD). The proposed approach was evaluated using both synthetic data and in-vivo human brain datasets. Thus, a key novel aspect of our work includes development of a fast algorithm to estimate the full posterior distribution of the model parameters and subsequently of the derived measures (which are nonlinear functions of the model parameters). The proposed work will be very useful in calculating robust statistics in dMRI studies. While we demonstrate the power of this method using a biexponential multi-fiber model, the methodology is quite general and can be used for any dMRI model (H.S. Karimi et al., 2023).

1.3 Paper organization

This paper is arranged in the following manner: Section 2 is dedicated to discuss our method where subsection 2.1 describes the diffusion model and the transformation of the ODF into 2D space for efficient use with an NN formulation. In subsection 2.2, we explain our approach and discuss how we configure the NN for classification and likelihood-free posterior estimation. Section 3 evaluates performance and demonstrates the results on both synthetic and in-vivo data. Finally, section 4 provides a summary of this paper.

2.1 Preliminaries and problem statement

2.1.1 Diffusion model

To characterize the posterior distribution, we need access to an accurate propagator that models the fibers’ desired parameters (e.g., orientation). Several models have been proposed in the literature (Panagiotaki et al., 2012) with varied number of model parameters. In general, the signal at a voxel can be defined as S(b,u):×S2+ that measures diffusion along a set of distinct gradient directions u. Here, b is an acquisition-specific constant (known as b-value) and normally varies between 1000 and 3000s/mm2. Based on the diffusivity, the diffusion signal decays as a function of b-values where the signal decay can be approximately modeled by a mono-exponential (Gaussian) at low b-values. However, such a model cannot accurately predict signals at higher b-values, as the signal decay demonstrates a bi-exponential behavior (Assaf et al., 2004; Clark & Le Bihan, 2000; Maier et al., 2004; Mulkern et al., 1999; Novikov et al., 2018). Note that several other models, such as NODDI (H. Zhang et al., 2012) and DKI (Jensen et al., 2005), impose strong conditions within their models to allow for proper estimation of parameters. However, this limits their ability to generalize to all possible scenarios. For example, different values of intra-cellular and extra-cellular terms or the interchangeability of the tensors may result in similar signals, which are hard to interpret. On the other hand, estimation of the parameters of a more flexible biexponential model is considered challenging due to the possibility of a non-unique solution set. Thus, we choose this model to demonstrate the power of the proposed technique in characterizing the full posterior and how it can be used for choosing the right solution from different possibilities.

It should be noted that the dMRI signal contains multiple diffusing compartments (e.g., fast/slow diffusion fractions). Furthermore, most voxels contain a mixture of multiple fiber orientations crossing at different angles. Since our goal is estimating the uncertainty in the microstructural parameters in a fiber-specific manner, we assume a discrete number of fiber population in a voxel. Therefore, we use the following weighted Gaussian formulation proposed in Rathi et al. (2013) to model the dMRI signal:

(1)

where n indicates the number of fibers, and ωi is the coefficient of the fast diffusing component. Furthermore, Di and D¯i denote the fast and slow diffusion tensors sharing the same eigenvectors but different eigenvalues with Di=λ1imimiT+λ2i(pipiT+viviT) and D¯i=λ3imimiT+λ4i(pipiT+viviT). In this formulation, mi,pi,viS2 are three orthonormal bases of the ith tensor where mi represents the principal diffusion direction. Here, we assume that the tensors have cylindrical symmetry, that is, the tensors have a single dominant eigenvalue with the remaining two eigenvalues being equal. Accordingly, the model consists of the following parameters: xi=[mi,λ1i,λ2i,λ3i,λ4i]. Furthermore, since mi is a unit vector, it can be represented by two angles in spherical coordinates [θ,ϕ]. Due to antipodal symmetry of the ODF, the range of θ,ϕ can be restricted to the right hemisphere, that is, θ,ϕ[0,π]. In summary, the model parameters can be reduced to xi=[θi,ϕi,λ1i,λ2i,λ3i,λ4i]i{1,...,n}. Therefore, the complete parameter set for the multi-fiber case is α=[x1,x2,...,xn], where n is the number of crossing fibers and should be determined for each voxel.

In this work, we use HCP style acquisition parameters with 90 gradient directions at b-values of {1000,2000,3000}. The same set of parameters were used in our simulation experiments. Note that since our main focus in this paper is dedicated to white matter, our model does not include water exchange, which is an essential factor in modeling gray matter (Jelescu et al., 2022; Olesen et al., 2022).

2.1.2 ODF representation

Typically, dMRI studies (across centers) do not acquire the same number (or set) of gradient directions as it is dependent on the scan time budget and analysis methods used. Therefore, using a fixed gradient direction set will restrict the use of a trained NN to that particular set and hence cannot be generalized to the typical scenarios of neuroimaging studies. To address this issue and to ensure generalization of our framework for any number of gradient directions, we use the diffusion oriented distribution function (ODF) representation of the signal S. In this paper, we compute the ODF using spherical harmonics (SH) (Hess et al., 2006). Spherical harmonic transform can be interpreted as a generalized form of the Fourier expansion to the domain of the sphere (Groemer, 1996). Calculating accurate signal representation demands that the harmonic series order L be sufficiently large. On the other hand, the order L is limited by the number of measurements. For example, using spherical harmonics up to order L=8 requires at least 45 distinct diffusion measurements. However, we note that the connection between the accuracy of a SH fit and its relation to the number of measurements available for estimation is a research topic on its own and has been explored in the following works (Consagra et al., 2023; Descoteaux et al., 2007; Deriche et al., 2009). In this work, we assume that an appropriate SH fitting algorithm is available to estimate the ODF at each voxel. Notably, newer methods of ODF or fODF estimation can be easily incorporated into our work.

Once the ODF is computed, a hill-climbing algorithm is typically used to calculate the ODF peaks, which represent the principal diffusion directions of the underlying fibers. However, the number of fibers and the orientations are often inaccurately estimated due to several factors such as measurement noise, close crossing angles, and different levels of diffusivity (Ning, Laun, et al., 2015). Furthermore, estimation only based on raw ODFs is extremely unreliable in the case of fibers crossing at low to moderate angles. Therefore, it is very critical to develop a robust computational technique that can efficiently classify the voxels based on the number of fibers they contain and estimate the fiber orientations.

In diffusion MRI, the measurement noise associated with the diffusion signal is a potential challenge that can impair many of the existing reconstruction methods. Noise distribution depends on many factors, including number of receiver coils, spatial resolution, and gradient strength among others (Gudbjartsson & Patz, 1995; St-Jean et al., 2020). Throughout this paper, SH coefficients corresponding to synthetic data are computed using S+%, where % is zero-mean Gaussian noise with a certain standard deviation. Figure 1 shows a synthetic example of an ODF without and with noise (SNR = 3) which notably distorts the original ODF (top left) causing several false peaks (bottom left) and significant deviation from the ground truth. This demonstrates the need for a robust technique for accurate fiber orientation as well as parameter estimation from highly noisy dMRI data (Jones & Basser, 2004).

Fig. 1.

ODF representation (Right: 2D spherical coordinate ODF map, Left: three-dimensional ODF) of a voxel containing two fiber crossing, when the underlying signal is noiseless (top), and noisy (bottom) with SNR = 3.

Fig. 1.

ODF representation (Right: 2D spherical coordinate ODF map, Left: three-dimensional ODF) of a voxel containing two fiber crossing, when the underlying signal is noiseless (top), and noisy (bottom) with SNR = 3.

Close modal

By converting the 3D representation of the ODF in spherical coordinates, we can create a 2D map of the ODF. Right pictures in Figure 1 show the transformed ODF map in 2D in the spherical θ,ϕ coordinate system of the same underlying ODF on the left side of Figure 1. For the rest of this paper, we will use this 2-dimensional ODF map as input to the designed DNNs. This strategy has two main advantages: 1) The neural networks dealing with these 2-D maps can extract more informative features and 2) Our neural network formulation will be relatively independent of the number of gradient directions.

2.2 Approach

In addition to the parameter estimation, quantifying the uncertainty of the estimated parameters can give us valuable insights into the reliability of the estimation in different regions of the brain. To do so, we aim to quantify the full posterior distribution of the parameters, that is, the probability of the parameters α given the observed measurements y, that is, p(α|y). Furthermore, since the configuration of the parameter vector and the underlying diffusion model changes based on the number of fibers in a voxel, we dedicate subsection 2.2.2 on finding the number of possible fibers in a voxel. In subsection 2.2.4, we introduce some well-known measures that can be derived from the biexponential model.

Estimating the parameters of a model is typically done via optimizing a loss function (e.g., an L2 loss function) to estimate the maximum a-posteriori estimate. Another work such as Koay et al. (2008) derived an analytical cone of uncertainty while estimating the posterior of the DTI model parameters. On the other hand, the proposed work estimates the full posterior distribution of the model parameters (which could be multimodal and of high dimensionality) which implies an analytical and closed form joint distribution of all the model parameters. This extends the uncertainity quantification to arbitrary forward models of dMRI, without the need for complicated derivations of the cone of uncertainty as in Koay et al. (2008).

2.2.1 Posterior estimation

Suppose we can correctly characterize the posterior probability of the parameters. In that case, we will be able to observe the behavior of the parameters based on a given set of measurements for a particular voxel. To find an analytical form of the posterior distribution, we need to calculate the likelihood p(y|α) implied by the Bayes theorem (i.e., p(α|y)=p(y|α)p(α)p(y)). However, computing the likelihood is not feasible since it requires calculating closed form solution of many intractable integrals (Papamakarios et al., 2019). To tackle this issue, Approximate Bayesian Computation (ABC) techniques were proposed as likelihood-free statistical inference approaches (Sisson et al., 2018). However, these approaches are not typically applicable for high dimensional cases. In addition, the required summary statistics and distance functions are customized based on ad-hoc options (Greenberg et al., 2019). On the other hand, the methods based on likelihood-free inference (Greenberg et al., 2019) amortize the posterior without any additional inference steps. These approaches assume that the forward model that maps the parameters α to the output data y is known, and that a sufficiently large training dataset can be simulated. Then, the posterior is directly extracted by a density-estimation NN trained using the simulated dataset.

Likelihood-free inference approaches approximate the true posterior density using a family of densities qψ, where ψ denotes distribution parameters. Let p˜(α|y) indicate the estimated posterior, then,

(2)

An NN configuration must be designed to estimate the density parameters ψ correctly. In the literature, several families qψ have been proposed, including Mixture density networks (MDN) (Bishop, 1994), Masked Autoregressive Flow (MAF) (Papamakarios et al., 2017), Masked Autoencoder for Distribution Estimation (MADE) (Germain et al., 2015), and Neural Spline Flow (NSF) (Durkan et al., 2019). For our diffusion MRI model, we use the MDN as a representative of the true posterior. The MDN can be represented as a linear combination of m kernel functions Gi(α|y) as follows,

(3)

where wi(y) are known as mixing coefficients and control the contribution of the corresponding component Gi(α|y). Although numerous choices are available for the kernel functions, it has been shown that the kernels of Gaussian form result in a better approximation of the posterior as it can approximate any density function if the Gaussian parameters are correctly estimated (McLachlan & Basford, 1988). Therefore, we use the following form for the kernel function:

where μi is the vector containing the center of ith kernel (Gaussian component). Here, Σi represents the full covariance matrix of ith component. In Bishop (1994), a common variance σi is introduced to describe the distribution of a component. However, we consider a more general case where the elements of a component can adopt different distributions while the covariance between the elements can be extracted from Σi. It should be noted that the number of kernels must be adequate to fully characterize the actual posterior density. After determining m, we employ a deep Neural Network (DNN) structure to approximate the parameters of the proposed density. Based on the MDN model described by (3) and (4), the required parameters are: ψ={wi,μi,Σi}i{1,...,m}.

In the training stage, we first simulate a dataset containing N pairs of (αj,yj). Next, we train the DNN that approximates the posterior using the following loss function proposed in Greenberg et al. (2019):

(4)

where, q˜ψ(αj)=qψ(αj)p˜(α)p(α)1Z(ψ). Here, p(α) and p˜(α) refer to the prior probability of the parameters and its estimation. In (4), Z(ψ) is a normalization constant and it equals to Z(ψ)=αqψ(αj)p˜(α)p(α).

2.2.2 Voxel classification

The underlying diffusion model (1) (that we employed for simulating the training dataset) directly depends on the number of fibers in a given voxel (although methods exist to represent them in a continuous fashion). Although determining the number of fibers seems like a straightforward task, it has received significant attention from the research community as it fails in many cases due to measurement noise, close crossing angles, and different levels of diffusivity along different fiber bundles (Ning, Laun, et al., 2015). To overcome this issue, we design a DNN (Albawi et al., 2017) that classifies the voxels into three classes with 1, 2, or 3 fibers (from the 2D ODF map images). To this end, we implement a DNN, as shown in Figure 2, composed of a convolutional neural network (CNN) followed by four fully connected layers. In this paper, we assume a voxel has at most three fiber crossings, a reasonable assumption as more than three fiber crossings have not been validated from histology work. The input are 32×32×3 ODF maps, with the third dimension corresponding to b-values of 1000,2000,3000s/mm2. In addition to the fact that our experimental data has these three b-values (Scherrer & Warfield, 2010) show that multiple b-values are necessary for proper estimation of multi-fiber models. Next, a two-dimensional convolution layer extracts six feature maps using a convolution kernel size of 5×5. In order to not miss the information embedded near the image borders, we add a padding of size 4. Then, a max-pooling layer downsamples the resulting feature maps with kernel = 2 and stride = 2. A similar structure constitutes the two subsequent layers. The output of the last max-pooling layer is flattened and fed into a linear layer. The number of hidden features in the consecutive linear layers are set to be 1000, 500, and 84, respectively. The final section of the classifier structure consists of four fully connected layers that generate the labels, indicating the number of fibers. For the training stage, we employ a stochastic gradient descent optimization to minimize a cross-entropy loss function. Here, we choose the momentum to be 0.95, and the learning rate of 104.

Fig. 2.

Classifier network architecture for estimating the number of fibers in each voxel. It includes two CNN downsampling layers followed by four fully connected layers.

Fig. 2.

Classifier network architecture for estimating the number of fibers in each voxel. It includes two CNN downsampling layers followed by four fully connected layers.

Close modal

2.2.3 Configuration

We assume the parameters α follow a uniform prior distribution within a certain plausible range. That is, we only know the range of the parameters, but we do not assign any higher probability to a particular region between the minimum and maximum points. As described in section 2.1, direction of fibers can be fully expressed by θ,ϕ[0,π]. In addition, the fast eigenvalues {λ1,λ2} fall in the range of [1,3]μm2/ms and the slow eigenvalues {λ3,λ4} take a value in the range of [0.1,0.6]μm2/ms. We also assume that the fast and slow components have 0.7 and 0.3 as their weights (Rathi et al., 2013).

The number of Gaussian components in the MDN model plays a significant role in acquiring a meticulous posterior estimation. Ideally, the existence of at least n! components is necessary for describing the desired posterior in a voxel with n fibers. To clarify this fact, consider a voxel with two fibers crossing each other. Let the parameter vector be T1=[θ1,ϕ1,λ11,λ21,λ31,λ41,θ2,ϕ2,λ12,λ22,λ32,λ42] and s1 is the expected noiseless signal. Now, if we swap the parameters corresponding to each fiber, (i.e., T2=[θ2,ϕ2,λ12,λ22,λ32,λ42,θ1,ϕ1,λ11,λ21,λ31,λ41]), the expected signal will still be s1. This is due to the interchangeability property of the model (1). Accordingly, if the underlying voxel contains two fibers, one can expect two scenarios for the output vector. Similarly, six scenarios are possible for the case of three fiber crossings. In addition, extra numbers of components will be helpful to cope with the effect of measurement noise and outliers. As a rule of thumb, we consider five components for the one and two fiber voxels, while we consider 10 components for the voxels with three fiber crossings. Since we use a discrete representation for the number of fibers, the posterior distribution is conditioned on the number of fibers per voxel, which we estimate using our classification algorithm.

Similar to the classifier network, the DNN dedicated to posterior estimation consists of a convolutional network and a chain of seven fully connected linear layers. Determining the structure of the fully connected layers is very important and it influences the posterior estimator’s performance. Our brute force experiments to determine the best network architecture showed that a small number of layers (less than five layers) results in poor performance, while a large numbers of layers makes the network more complex and leads to over-fitting. We found that seven layers produced the most accurate results. Another factor affecting the performance is the number of hidden features in each layer. Here, we set the hidden features to be 1000 (after experimenting with different numbers). To avoid overfitting, we also embed dropout layers into the DNN.

We use 2 million simulated samples for training, and 10% of this data is used for validation. The training epochs continue until the validation loss decreases, and we stop the training when the validation loss starts to increase. In order to avoid being stuck in a local minimum, we consider 30 more epochs after the last increments. If we reach a new global minimum, we continue the training; otherwise, we select the computed parameters in the minimum validation loss stage. Here, we set the training batch size to be 100. Setting the learning rate to 5×105 was found to be most optimal. The entire code was implemented in python. Specifically for the training of the posterior, we used the python package provided by Tejero-Cantero et al. (2020).

2.2.4 dMRI measures

Several measures have been proposed in the literature to better understand the underlying tissue microstructure (Assaf et al., 2004; Jensen et al., 2005; Ning, Westin, et al., 2015; Ning et al., 2017). For the case of multi-fiber models (like the model used in this work), these measures can be calculated for each fiber separately. Let λi=[λ1i,λ2i,λ3i,λ4i] denote the vector of eigenvalues for fiber i, and λ=[λ1,...,λn] denote the vector of eigenvalues for a multi-fiber voxel. To evaluate the diffusivity of tensors, we compute Fractional anisotropy (FA) as defined in Basser and Pierpaoli (2011). Specifically, the FA of fast diffusing component corresponding to the ith fiber can be calculated using

(5)

Similar to FA, Tuch (2004) defines Generalized Fractional Anisotropy (GFA), a scalar measure for determining the level of isotropy in a multi-fiber voxel. GFA can be numerically computed as follows:

(6)

where std(S) and rms(S) are the underlying signal’s standard deviation and root mean square.

Another metric is Mean-Squared-Displacement (MSD), which expresses the average squared displacement during the diffusion experiment (Ning, Westin, et al., 2015). For the dMRI model (1), the MSD can be analytically computed using:

(7)

Return to origin probability (RTOP) (Assaf et al., 2004; Baxi et al., 2020; Özarslan et al., 2013) serves as another measure that gives us an insight into restricted diffusion in different regions of the brain. RTOP can vary depending on changes in intracellular and intra-axonal or restricted spaces of white and gray matter tissue (Ning, Westin, et al., 2015). For our model (1), the RTOP can be computed analytically using:

(8)

An important contribution of this work is that we not only estimate the “mean” but also the uncertainty measured using the standard deviation (or variance) of these measures. Although the output of the DNN provides the posterior distribution for the model parameters, we approximate the posterior distribution of the derived measures up to second-order statistics (i.e., variance) using the unscented transformation as described in the Appendix.

This section presents a comprehensive evaluation of our proposed method from different perspectives based on in-vivo and synthetic data. We examine our techniques using different synthetic datasets to validate our work and compare it with the ground truth data. To this end, we first generate the simulation data along 90 gradient directions at three different b-values {1000,2000,3000}s/mm2. We added Gaussian noise to the data in all our simulation experiments with a standard deviation of noise being 0.04 (which provides an average SNR of 3 in the white matter). Note that, MRI noise follows a Rician distribution, but for SNR>3 the noise can be approximated with a Gaussian distribution as shown in Gudbjartsson and Patz (1995). For the simulations, we used the acquisition parameters from the human connectome project (HCP) dataset. At the training stage, we use a machine equipped with two GPU cores (CUDA = 11.5 with 22 GB total memory), which results in a big improvement in reducing the training time. With this machine configuration, the training times are 3.8, 11.3, 7.6, and 4.1 hours for one, two, three fiber crossing samples and the classifier, respectively.

3.1 The classifier performance

Table 1 summarizes the classifier’s performance, where our method could estimate the number of fiber crossings at any given voxel with a low percentage error even when the measured signal is very noisy and the crossing angles are small. The test dataset (20,000 test samples) was generated to mimic realistic scenarios seen in the human brain. It should be noted that the test dataset follows uniform distributions for all the parameters, and they have not biased toward any particular configuration of the parameter distribution. We used FA>0.2 for the case of two or three fibers crossings in the white matter. For the case of one fiber, the tensor was allowed to be completely isotropic, which means FA[0,1]. The crossing angle for the two fiber samples was chosen randomly between 10 and 90 degrees (note that very small crossing angle between the fibers is allowed). For the case of three fiber crossings, the minimum angle between the fibers was set to 45 degrees (based on realistic values reported in the literature). For comparison with existing techniques, we calculate the number of peaks using the state-of-the-art method called Constrained Spherical Deconvolution (CSD) (Tournier et al., 2007) as implemented in the DIPY package (Garyfallidis et al., 2014). As seen in Table 1, our proposed classifier dramatically outperforms the CSD technique in terms of identification of the true number of fibers. The much higher percentage of error in the CSD method can be attributed to the high noise levels (SNR = 3) as well as small crossing angles used in our simulations. We also note that we classified a voxel as “mis-classified” if the exact number of fibers were not estimated by the method. Thus, spurious peaks (even if they are small) provided by CSD contributed to the high mis-classification percentage for this method.

Table 1.

Performance of the classifier for biexponential model.

Ground truth1 Fiber2 Fibers3 Fibers
Misclassification (SNR = 3) (Our Approach) 2.7% 2.3% 4.7% 
Misclassification (SNR = 3) (CSD) 45.8% 68.3% 91.6% 
Misclassification (noiseless) (Our Approach) 0.18% 0.11% 0.34% 
Misclassification (noiseless) (CSD) 26.6% 24.2% 50.2% 
Ground truth1 Fiber2 Fibers3 Fibers
Misclassification (SNR = 3) (Our Approach) 2.7% 2.3% 4.7% 
Misclassification (SNR = 3) (CSD) 45.8% 68.3% 91.6% 
Misclassification (noiseless) (Our Approach) 0.18% 0.11% 0.34% 
Misclassification (noiseless) (CSD) 26.6% 24.2% 50.2% 

To demonstrate generalizability of the classifier to data from other dMRI models, we also generated data from the standard model of diffusion (Jelescu et al., 2016) and tested the classifier performance without any retraining. Note that the classifier was trained using the biexponential model. Table 2 shows the classifier performance using this dataset. Clearly, the performance is quite similar to Table 1, albeit with very minor increase in the error for 1 and 2 fiber configurations.

Table 2.

Performance of the classifier for standard model.

Ground truth (Standard Model)1 Fiber2 Fibers3 Fibers
Misclassification (SNR = 3) (Our Approach) 4.2% 4.8% 5.1% 
Ground truth (Standard Model)1 Fiber2 Fibers3 Fibers
Misclassification (SNR = 3) (Our Approach) 4.2% 4.8% 5.1% 

We also tested our algorithm on in-vivo human data from the Human Connectome Project (HCP) (Van Essen et al., 2013) as shown in Figure 3. As expected, most of the white matter had two fibers, with certain regions in the centrum semiovale having 3-fiber crossings, which is known from anatomical validation studies. Also note that, the cerebrospinal fluid (CSF) and ventricles show the existence of only one “fiber” population. We also note that, the central portion of the corpus callosum shows multiple fibers using our method. This is because we do not model fiber dispersion explicitly but instead rely on very small crossing angles (10 deg) to implicitly model fiber dispersion.

Fig. 3.

Classification of voxels in three classes (1, 2, or 3 fibers) in in-vivo data from the HCP dataset. Each slice is 10 voxels away from the surrounding slices in a specific column.

Fig. 3.

Classification of voxels in three classes (1, 2, or 3 fibers) in in-vivo data from the HCP dataset. Each slice is 10 voxels away from the surrounding slices in a specific column.

Close modal

3.2 The posterior estimator performance

We train three different posterior networks based on the underlying number of fibers (one, two or three). In this subsection, we evaluate the performance of our posterior estimator on 20,000 test samples (noisy with SNR = 3). The output of each network will be ψ={wi,μi,Σi}. Figure 4 demonstrates the estimated posterior (red curve) for the fiber orientation corresponding to angle θ1. The true value (θ1=116o) is marked with a black line. As can be seen, the posterior has multiple modes indicating several plausible solutions. Due to the multi-modal nature of the posterior distribution, naive calculation of the mean of the distribution would give erroneous results. Further, in addition to the posterior estimation, we also need to find the final (mean) estimate of the model parameters (the fiber orientation in this case). Therefore, we choose the dominant Gaussian component (green curve) of the posterior as our “true component”. To select the most appropriate component, we first calculate δi=witrace(Σi0.5). Then, the component corresponding to the largest δi represents our final estimate or the most dominant component to use. This method to calculate the dominant component ensures that a component with very large variance representing a fit to the noise in the data is discarded. For the particular case at hand, we see that the peak of the dominant component is very close to the ground truth. Further note that, the variance of this parameter also provides the uncertainty in the estimation of this parameter. This unique information is not provided by any other estimation technique (unless one runs a monte-carlo simulation by initializing a non-linear estimator with a large number of starting points to obtain all possible values of the estimated parameter). On the other hand, the blue histogram in Figure 4 corresponds to the possible outcomes of the Levenberg–Marquardt (LM) nonlinear estimation algorithm that provides a solution to equation (1) for different starting points. In other words, every time we run the nonlinear approach, a different value (based on the frequency given by the histogram) in the region marked by the histogram will be given as the estimate of θ. This histogram plot implies a lack of robustness and repeatability due to noise in the data.

Fig. 4.

Estimation of θ. Black line indicate: true value, red curve: estimated posterior, green curve: selected component, blue histogram: possible solution given by a nonlinear estimator.

Fig. 4.

Estimation of θ. Black line indicate: true value, red curve: estimated posterior, green curve: selected component, blue histogram: possible solution given by a nonlinear estimator.

Close modal

Alternatively, one could use a brute force approach and use large number of initializations to arrive at a solution that gives the lowest error. However, this method is fraught with issues when the solution is highly multi-modal in nature, as explored in Jelescu et al. (2016). Further, the amount of computations required (up to several weeks for an HCP style acquisition) using such an approach would make it almost impossible to use it in more practical scenarios covering the entire brain. Additionally, we would like to point to the work in Sjölund et al. (2018), where the authors study the effect of noise on estimation of dMRI-derived parameters. We also note that the nonlinear approaches only provide a point estimate, and not uncertainty quantification.

As another example, Figure 5 demonstrates the estimated posterior for the eigenvalue λ11. In this example, the voxel contains two fibers x=[θ1=π/6,ϕ1=π/3,λ11=1.9,λ21=1.35,λ31=0.5,λ41=0.25,θ2=pi/5,ϕ2=7π/4,λ21=1.7,λ22=1.06,λ32=0.5,λ42=0.25]. In Figure 5, the red curve is the full posterior while the green curve represents the dominant component in the posterior. It is clear that the true value is very close to the mode of the selected component, while the possible values given by the LM method (histogram) give several solutions throughout the range of λ1.

Fig. 5.

Posterior estimation of the eigenvalue λ11. Black line indicates true value, red curve: estimated full posterior, green curve: selected component, blue histogram: possible solution range given by a nonlinear estimator.

Fig. 5.

Posterior estimation of the eigenvalue λ11. Black line indicates true value, red curve: estimated full posterior, green curve: selected component, blue histogram: possible solution range given by a nonlinear estimator.

Close modal

To evaluate the accuracy across all the 20,000 samples, Figures 6 and 7 depict the angular errors as a function of the ground truth FA and GFA respectively. In this plot, the error bars indicate 25 and 75 percentile, and the midpoint is the median of the errors. We compare our algorithm with two nonlinear approaches: 1) Levenberg-Marquardt (LM) and 2) Trust Region Reflective (TRF) (with bounds on the range of values the parameters can take). An important point to note for these nonlinear methods is that we fix the number of fibers a-priori (and hence the number of parameters to estimate is known) assuming this information is accurately known from other techniques (e.g., CSD or our classifier approach presented earlier). For all the FA values, our method outperforms both the LM and TRF nonlinear estimators in terms of the median. Note however, that both the non-linear estimators have a much larger variance indicating a large number of outlier estimations using these methods. On the other hand, the proposed method has very small error bars (and low median error) for all the simulated samples. As expected, the error has an inverse relationship with FA, that is, all methods perform better if the tensor becomes more anisotropic. It should be noted that the fiber orientation is estimated more precisely by all methods when the underlying voxel contains only one fiber configuration (see Fig. 8).

Fig. 6.

Angular error as a function of minimum FA (of the fast component) among the multiple fibers.

Fig. 6.

Angular error as a function of minimum FA (of the fast component) among the multiple fibers.

Close modal
Fig. 7.

Angular error as a function of GFA of the signal at b = 1000s/mm2.

Fig. 7.

Angular error as a function of GFA of the signal at b = 1000s/mm2.

Close modal
Fig. 8.

Angular error as a function of FA for the case of a single fiber.

Fig. 8.

Angular error as a function of FA for the case of a single fiber.

Close modal

For the multi-fiber voxels, the angle between the fibers (known as the crossing angle) may affect reconstruction accuracy. For example, Malcolm et al. (2010) show that some classical techniques (e.g., sharpened spherical harmonics) fail to resolve two fiber crossings when the crossing angle is below a certain value (e.g., 30 degrees). To show the robustness of our method in recovering small crossing angles, we demonstrate the angular error as a function of the crossing angle between two fibers in Figure 9. As can be seen, the proposed method accurately recovers fiber orientations even at small crossing angles (less than 30 degrees). This is also a significant advantage of our method, which can be very useful for tractography algorithms. An important factor to note here is that for the nonlinear methods, we fixed the number of estimated parameters as the number of fibers in the signal was assumed to be known. However, as shown earlier in this work (Table 1), standard techniques like CSD often fail to estimate the accurate number of fiber crossings at such low angles, which can significantly affect the results of the nonlinear techniques. Thus, the results for nonlinear techniques are true only if the estimate of number of fibers is correct in all test samples, which is unrealistic using existing methods.

Fig. 9.

Angular error as a function of the crossing angle between fibers.

Fig. 9.

Angular error as a function of the crossing angle between fibers.

Close modal

Figure 10 shows the relationship between angular error and the magnitude of the largest eigenvalue. While the proposed method is robust to changes in the eigenvalue, the nonlinear estimator fails in many more cases leading to larger number of voxels being estimated incorrectly. Figure 11 show average angular errors for different levels of measurement noise. The figure confirms the efficacy of our approach for different noise levels. In Figure 12, we show the average absolute error in the estimation of the largest eigenvalue as a function of the ground-truth FA of the fast component (top), and the largest eigenvalue λ1 (bottom). Clearly, our method outperforms the other two nonlinear techniques by a large margin. Similarly, Figure 13 shows the error in the other three eigenvalues. Figure 14 shows the error in the estimated FA and mean diffusivity (MD) respectively as a function of FA. In all the cases, our approach shows lower error compared to the nonlinear fitting method and results in more robust results in the presence of high noise.

Fig. 10.

Angular error as a function of the largest eigenvalue.

Fig. 10.

Angular error as a function of the largest eigenvalue.

Close modal
Fig. 11.

Average angular error for different measurement noise levels.

Fig. 11.

Average angular error for different measurement noise levels.

Close modal
Fig. 12.

Error in estimation of the largest eigenvalue as a function of the ground truth FA of the fast component (top) and the largest (major) eigenvalue (bottom).

Fig. 12.

Error in estimation of the largest eigenvalue as a function of the ground truth FA of the fast component (top) and the largest (major) eigenvalue (bottom).

Close modal
Fig. 13.

Error in estimation of the eigenvalues λ2,λ3 and λ4.

Fig. 13.

Error in estimation of the eigenvalues λ2,λ3 and λ4.

Close modal
Fig. 14.

Top: Error in estimated fractional anisotropy compared to ground truth FA. Bottom: Error in estimated mean diffusivity as a function of ground truth FA.

Fig. 14.

Top: Error in estimated fractional anisotropy compared to ground truth FA. Bottom: Error in estimated mean diffusivity as a function of ground truth FA.

Close modal

Estimation of three fiber crossings is always challenging. Figure 15 validates the efficacy of our approach of posterior estimation in a voxel containing three fiber crossings. The x-axis is the GFA of the voxel for the three fiber configuration. Although we assumed a fixed value for the weights of the fast and slow compartments at the training stage, the weights may vary in reality. To capture the effect of this difference in the model, Table 3 shows the angular error when the underlying weights of the fast and slow components are different from the weights we used in the simulated training dataset (0.7 and 0.3 respectively). Nevertheless, our method shows excellent robustness and generalization ability to such out-of-distribution test samples. As can be seen, while this difference does not affect the angular error, we see slightly higher error in the estimation of the eigenvalues (total error calculated as the sum of the errors in estimation of all the eigenvalues) which is still comparable to the errors obtained using other nonlinear methods.

Fig. 15.

Angular error for three fiber cases with respect to GFA.

Fig. 15.

Angular error for three fiber cases with respect to GFA.

Close modal
Table 3.

Performance with different model parameters.

Weights (fast - slow)Angular error (degree)Absolute total Eigenvalue error (μm2/ms)
0.5-0.5 3.96 0.22 
0.6-0.4 3.31 0.14 
0.7-0.3 2.93 0.07 
0.8-0.2 3.15 0.12 
0.9-0.1 3.91 0.19 
Weights (fast - slow)Angular error (degree)Absolute total Eigenvalue error (μm2/ms)
0.5-0.5 3.96 0.22 
0.6-0.4 3.31 0.14 
0.7-0.3 2.93 0.07 
0.8-0.2 3.15 0.12 
0.9-0.1 3.91 0.19 

We would like to note that the machine-learning algorithm was trained on the entire feasible range of values for each of the model parameters. For example, the range of values for the fast diffusivities ranges from [1,3]μm2/ms whereas the slow diffusivities range from [0.1,0.6]μm2/ms. Similarly, the orientations are spread across the sphere while the volume fractions for each fiber varies from 0.1 to 0.9. While these values are within the possible set of values observed in human data even in pathological cases, the test data set was not limited to this range. Yet, as shown in Figures 7, 8, 10, and 11, our method was able to provide low error rates for these settings. To further demonstrate our work’s generalization ability, we also used the proposed framework with the standard model of diffusion (instead of the biexponential model), details of which can be found in H.S. Karimi et al. (2023).

Figure 16 shows the normalized mean squared error (NMSE) in two slices in a human in-vivo dataset from the HCP database. We used all the 257 gradient directions at b-values of {1000,2000,3000}s/mm2 and estimated the model parameters voxel-wise across the whole brain. The NMSE was then calculated at each voxel using the estimated parameters to generate the signal using the forward model. A similar approach was used for the LM method (right column) as well, with the number of fibers assumed to be correctly known from our classifier. Note that, this model selection problem is already known to be error prone; however, we used the voxel classifier developed in this work for the nonlinear method as well to show comparative results. For the proposed method, the NMSE is well below 0.05 (or less than 5% error in fitting) whereas it is much higher for the LM nonlinear estimator.

Fig. 16.

NMSE in signal reconstruction in a subject from HCP database. Left: our approach, Right: LM nonlinear fitting. Clearly, our method shows low fitting error (less than 5%).

Fig. 16.

NMSE in signal reconstruction in a subject from HCP database. Left: our approach, Right: LM nonlinear fitting. Clearly, our method shows low fitting error (less than 5%).

Close modal

In Figures 17 and 18, we compute three dMRI measures, namely FA (of the fast diffusing component), MSD, and RTOP as well as their uncertainty in estimation (quantified using the standard deviation). We estimate the uncertainty in the derived measures from the uncertainty in the estimation of the eigenvalues using the unscented transform (Julier & Uhlmann, 1997) (see derivation details in Appendix). We see that the FA estimate is highly uncertain in the gray matter regions, whereas MSD is more uncertain in the CSF areas. Also note that, several boundary voxels (ventricles and surrounding tissue) show high uncertainty (variance) in all the measures indicating that those values cannot be trusted in statistical analyses involving group studies. To demonstrate how uncertainty can be used in neuroimaging studies, we first choose a region of interest (ROI) in the brain. Figure 19 shows two distinct ROIs enclosed by a red curve. Next, we calculate the mean of the dMRI measures via two methods: 1) standard arithmetic average where all data equally contribute to the final mean, and 2) variance-weighted average where the weights of each data point in calculating the final mean are computed by the inverse of their variance. We summarize these average values in Table 4 for FA, MSD, and RTOP. Clearly, there is a considerable difference between these values which highlights the importance of uncertainty quantification.

Fig. 17.

Axial slice shows the dMRI-derived measures (FA, MSD, RTOP) and the corresponding voxel-wise uncertainty in these measures quantified using standard deviation (std).

Fig. 17.

Axial slice shows the dMRI-derived measures (FA, MSD, RTOP) and the corresponding voxel-wise uncertainty in these measures quantified using standard deviation (std).

Close modal
Fig. 18.

Coronal slice shows the dMRI-derived measures (FA, MSD, RTOP) and the corresponding voxel-wise uncertainty in these measures quantified using standard deviation (std).

Fig. 18.

Coronal slice shows the dMRI-derived measures (FA, MSD, RTOP) and the corresponding voxel-wise uncertainty in these measures quantified using standard deviation (std).

Close modal
Fig. 19.

Two regions of interest (ROIs) for comparing arithmetic mean and variance-weighted mean.

Fig. 19.

Two regions of interest (ROIs) for comparing arithmetic mean and variance-weighted mean.

Close modal
Table 4.

Variance-weighted average vs. arithmetic average.

dMRI MeasureFAMSDRTOP
Arithmetic Average ROI-1 0.829 2.97 329 
Variance-weighted Average ROI-1 0.886 3.01 263 
Arithmetic Average ROI-2 0.702 3.06 435 
Variance-weighted Average ROI-2 0.916 3.01 262 
dMRI MeasureFAMSDRTOP
Arithmetic Average ROI-1 0.829 2.97 329 
Variance-weighted Average ROI-1 0.886 3.01 263 
Arithmetic Average ROI-2 0.702 3.06 435 
Variance-weighted Average ROI-2 0.916 3.01 262 

One of the other motivations of this work is to find a parameter estimation technique that is more time-efficient. Table 5 indicates the required computational times for executing the introduced methods for the whole brain (HCP dataset at 1.253mm2 spatial resolution). The results confirm a huge improvement (a factor of 400-fold reduction in computation time) using our method that avoids the long computational time requirements for the nonlinear techniques. For all the methods in Table 5, we use 20 CPU (2.2 GHz) cores. Training of the NN was however done on machine with 2 GPU cores with 22 GB memory.

Table 5.

The required time.

ApproachNonlinear LMNonlinear TRFOur Method
Consumed Time 967 hours 1105 hours 2.41 hours 
ApproachNonlinear LMNonlinear TRFOur Method
Consumed Time 967 hours 1105 hours 2.41 hours 

Accurate parameter estimation and uncertainty quantification in dMRI models is necessary for robust statistical analyses in neuroimaging studies. In this paper, for the first time, we estimate the full posterior distribution of the underlying parameters without any likelihood computation. Unlike most existing methods that give only point estimates, our approach can estimate the uncertainty of parameters and the derived measures. Furthermore, we developed a DNN technique to accurately classify the number of fibers in the brain. In terms of point estimation, our method is more robust against noise compared to the nonlinear approaches. Additionally, the proposed approach is extremely fast compared to standard nonlinear techniques, making it ideal for various large-scale applications.

The imaging data used in work are available from the Human Connectome Project (HCP) website (Van Essen et al., 2013). The codes for the presented techniques are available on the following repository: https://github.com/hazhars/posterior_dMRI.

Hazhar Sufi Karimi: Conceptualization, Methodology, Software, Visualization, and Writing Original Draft. Arghya Pal: Software. Lipeng Ning: Methodology, Writing Review & Editing. Yogesh Rathi: Conceptualization, Methodology, Writing Review & Editing, Funding acquisition, and Project administration.

The authors do not have any competing interests or conflicts of interest to declare.

This work was funded by NIH grants: R01MH119222, R01MH125860, and R01NS125781.

As this work does not employ novel data, there is no requirement for ethical approval. All the figures can be regenerated from open-access datasets that were cited.

Afzali
,
M.
,
Pieciak
,
T.
,
Newman
,
S.
,
Garyfallidis
,
E.
,
Özarslan
,
E.
,
Cheng
,
H.
, &
Jones
,
D. K.
(
2021
).
The sensitivity of diffusion MRI to microstructural properties and experimental factors
.
Journal of Neuroscience Methods
,
347
,
108951
. https://doi.org/10.1016/j.jneumeth.2020.108951
Aganj
,
I.
,
Lenglet
,
C.
,
Sapiro
,
G.
,
Yacoub
,
E.
,
Ugurbil
,
K.
, &
Harel
,
N.
(
2010
).
Reconstruction of the orientation distribution function in single-and multiple-shell q-ball imaging within constant solid angle
.
Magnetic Resonance in Medicine
,
64
(
2
),
554
566
. https://doi.org/10.1002/mrm.22365
Albawi
,
S.
,
Mohammed
,
T. A.
, &
Al-Zawi
,
S.
(
2017
).
Understanding of a convolutional neural network
. In
2017 International Conference on Engineering and Technology (ICET)
(pp.
1
6
).
IEEE
. https://doi.org/10.1109/icengtechnol.2017.8308186
Alexander
,
D.
,
Barker
,
G.
, &
Arridge
,
S.
(
2002
).
Detection and modeling of non-Gaussian apparent diffusion coefficient profiles in human brain data
.
Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine
,
48
(
2
),
331
340
. https://doi.org/10.1002/mrm.10209
Alexander
,
D. C.
(
2005
).
Multiple-fiber reconstruction algorithms for diffusion MRI
.
Annals of the New York Academy of Sciences
,
1064
(
1
),
113
133
. https://doi.org/10.1196/annals.1340.018
Ardizzone
,
L.
,
Kruse
,
J.
,
Wirkert
,
S.
,
Rahner
,
D.
,
Pellegrini
,
E. W.
,
Klessen
,
R. S.
,
Maier-Hein
,
L.
,
Rother
,
C.
, &
Köthe
,
U.
(
2018
).
Analyzing inverse problems with invertible neural networks
.
arXiv
, arXiv:1808.04730. https://doi.org/10.1007/s11548-019-01939-9
Assaf
,
Y.
,
Freidlin
,
R. Z.
,
Rohde
,
G. K.
, &
Basser
,
P. J.
(
2004
).
New modeling and experimental framework to characterize hindered and restricted water diffusion in brain white matter
.
Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine
,
52
(
5
),
965
978
. https://doi.org/10.1002/mrm.20274
Assemlal
,
H.-E.
,
Tschumperlé
,
D.
,
Brun
,
L.
, &
Siddiqi
,
K.
(
2011
).
Recent advances in diffusion MRI modeling: Angular and radial reconstruction
.
Medical Image Analysis
,
15
(
4
),
369
396
. https://doi.org/10.1016/j.media.2011.02.002
Basser
,
P. J.
,
Mattiello
,
J.
, &
LeBihan
,
D.
(
1994
).
MR diffusion tensor spectroscopy and imaging
.
Biophysical Journal
,
66
(
1
),
259
267
. https://doi.org/10.1016/s0006-3495(94)80775-1
Basser
,
P. J.
, &
Pierpaoli
,
C.
(
2011
).
Microstructural and physiological features of tissues elucidated by quantitative-diffusion-tensor MRI
.
Journal of Magnetic Resonance
,
213
(
2
),
560
570
. https://doi.org/10.1016/j.jmr.2011.09.022
Baxi
,
M.
,
Di Biase
,
M. A.
,
Lyall
,
A. E.
,
Cetin-Karayumak
,
S.
,
Seitz
,
J.
,
Ning
,
L.
,
Makris
,
N.
,
Rosene
,
D.
,
Kubicki
,
M.
, &
Rathi
,
Y.
(
2020
).
Quantifying genetic and environmental influence on gray matter microstructure using diffusion MRI
.
Cerebral Cortex
,
30
(
12
),
6191
6205
. https://doi.org/10.1093/cercor/bhaa174
Bishop
,
C. M.
(
1994
).
Mixture Density Networks. Technical Report NCRG/94/004.
Neural Computing Research Group, Aston University
,
Birmingham, UK
. https://publications.aston.ac.uk/373/1/NCRG_94_004.pdf.
Clark
,
C. A.
, &
Le Bihan
,
D.
(
2000
).
Water diffusion compartmentation and anisotropy at high b values in the human brain
.
Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine
,
44
(
6
),
852
859
. https://doi.org/10.1002/1522-2594(200012)44:6<852::aid-mrm5>3.0.co;2-a
Coelho
,
S.
,
Baete
,
S. H.
,
Lemberskiy
,
G.
,
Ades-Aron
,
B.
,
Barrol
,
G.
,
Veraart
,
J.
,
Novikov
,
D. S.
, &
Fieremans
,
E.
(
2022
).
Reproducibility of the standard model of diffusion in white matter on clinical MRI systems
.
NeuroImage
,
257
,
119290
. https://doi.org/10.1016/j.neuroimage.2022.119290
Consagra
,
W.
,
Ning
,
L.
, &
Rathi
,
Y.
(
2023
).
Neural orientation distribution fields for estimation and uncertainty quantification in diffusion MRI
.
arXiv
, 2307.08138. https://doi.org/10.48550/arXiv.2307.08138
Dell’Acqua
,
F.
,
Scifo
,
P.
,
Rizzo
,
G.
,
Catani
,
M.
,
Simmons
,
A.
,
Scotti
,
G.
, &
Fazio
,
F.
(
2010
).
A modified damped richardson–lucy algorithm to reduce isotropic background effects in spherical deconvolution
.
Neuroimage
,
49
(
2
),
1446
1458
. https://doi.org/10.1016/j.neuroimage.2009.09.033
Dell’Acqua
,
F.
, &
Tournier
,
J.-D.
(
2019
).
Modelling white matter with spherical deconvolution: How and why
?
NMR in Biomedicine
,
32
(
4
),
e3945
. https://doi.org/10.1002/nbm.3945
Deriche
,
R.
,
Calder
,
J.
, &
Descoteaux
,
M.
(
2009
).
Optimal real-time Q-ball imaging using regularized kalman filtering with incremental orientation sets
.
Medical Image Analysis
,
13
(
4
),
564
579
. https://doi.org/10.1016/j.media.2009.05.008
Descoteaux
,
M.
,
Angelino
,
E.
,
Fitzgibbons
,
S.
, &
Deriche
,
R.
(
2007
).
Regularized, fast, and robust analytical Q-ball imaging
.
Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine
,
58
(
3
),
497
510
. https://doi.org/10.1002/mrm.21277
Descoteaux
,
M.
,
Deriche
,
R.
,
Knosche
,
T. R.
, &
Anwander
,
A.
(
2008
).
Deterministic and probabilistic tractography based on complex fibre orientation distributions
.
IEEE Transactions on Medical Imaging
,
28
(
2
),
269
286
. https://doi.org/10.1109/tmi.2008.2004424
Durkan
,
C.
,
Bekasov
,
A.
,
Murray
,
I.
, &
Papamakarios
,
G.
(
2019
).
Neural spline flows
.
Advances in Neural Information Processing Systems
,
32
,
12
. https://doi.org/10.52591/lxai201912080
Garyfallidis
,
E.
,
Brett
,
M.
,
Amirbekian
,
B.
,
Rokem
,
A.
,
Van Der Walt
,
S.
,
Descoteaux
,
M.
,
Nimmo-Smith
,
I.
, &
Contributors
,
D.
(
2014
).
Dipy, a library for the analysis of diffusion MRI data
.
Frontiers in Neuroinformatics
,
8
,
8
. https://doi.org/10.3389/fninf.2014.00008
Germain
,
M.
,
Gregor
,
K.
,
Murray
,
I.
, &
Larochelle
,
H.
(
2015
).
Made: Masked autoencoder for distribution estimation
. In
International Conference on Machine Learning
(pp.
881
889
).
PMLR
. https://proceedings.mlr.press/v37/germain15.html
Gibbons
,
E. K.
,
Hodgson
,
K. K.
,
Chaudhari
,
A. S.
,
Richards
,
L. G.
,
Majersik
,
J. J.
,
Adluru
,
G.
, &
DiBella
,
E. V.
(
2019
).
Simultaneous NODDI and GFA parameter map generation from subsampled q-space imaging using deep learning
.
Magnetic Resonance in Medicine
,
81
(
4
),
2399
2411
. https://doi.org/10.1002/mrm.27568
Greenberg
,
D.
,
Nonnenmacher
,
M.
, &
Macke
,
J.
(
2019
).
Automatic posterior transformation for likelihood-free inference
. In
International Conference on Machine Learning
(pp.
2404
2414
).
PMLR
. https://proceedings.mlr.press/v97/greenberg19a/greenberg19a.pdf
Groemer
,
H.
(
1996
).
Geometric applications of Fourier series and spherical harmonics
(vol.
61
).
Cambridge University Press
. https://doi.org/10.1017/cbo9780511530005
Gudbjartsson
,
H.
, &
Patz
,
S.
(
1995
).
The Rician distribution of noisy MRI data
.
Magnetic Resonance in Medicine
,
34
(
6
),
910
914
. https://doi.org/10.1002/mrm.1910340618
Henriques
,
R. N.
,
Correia
,
M. M.
,
Nunes
,
R. G.
, &
Ferreira
,
H. A.
(
2015
).
Exploring the 3D geometry of the diffusion kurtosis tensor—Impact on the development of robust tractography procedures and novel biomarkers
.
NeuroImage
,
111
,
85
99
. https://doi.org/10.1016/j.neuroimage.2015.02.004
Hess
,
C. P.
,
Mukherjee
,
P.
,
Han
,
E. T.
,
Xu
,
D.
, &
Vigneron
,
D. B.
(
2006
).
Q-ball reconstruction of multimodal fiber orientations using the spherical harmonic basis
.
Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine
,
56
(
1
),
104
117
. https://doi.org/10.1002/mrm.20931
Hong
,
D.
,
Huang
,
C.
,
Yang
,
C.
,
Li
,
J.
,
Qian
,
Y.
, &
Cai
,
C.
(
2020
).
FFA-dMRI: A network based on feature fusion and attention mechanism for brain MRI denoising
.
Frontiers in Neuroscience
,
14
,
577937
. https://doi.org/10.3389/fnins.2020.577937
Jallais
,
M.
,
Rodrigues
,
P. L. C.
,
Gramfort
,
A.
, &
Wassermann
,
D.
(
2021
).
Inverting brain grey matter models with likelihood-free inference: A tool for trustable cytoarchitecture measurements
.
arXiv
, arXiv:2111.08693. https://doi.org/10.59275/j.melba.2022-a964
Jelescu
,
I. O.
, &
Budde
,
M. D.
(
2017
).
Design and validation of diffusion MRI models of white matter
.
Frontiers in Physics
,
5
,
61
. https://doi.org/10.3389/fphy.2017.00061
Jelescu
,
I. O.
,
de Skowronski
,
A.
,
Geffroy
,
F.
,
Palombo
,
M.
, &
Novikov
,
D. S.
(
2022
).
Neurite exchange imaging (NEXI): A minimal model of diffusion in gray matter with inter-compartment water exchange
.
NeuroImage
,
256
,
119277
. https://doi.org/10.1016/j.neuroimage.2022.119277
Jelescu
,
I. O.
,
Veraart
,
J.
,
Fieremans
,
E.
, &
Novikov
,
D. S.
(
2016
).
Degeneracy in model parameter estimation for multi-compartmental diffusion in neuronal tissue
.
NMR in Biomedicine
,
29
(
1
),
33
47
. https://doi.org/10.1002/nbm.3450
Jensen
,
J. H.
, &
Helpern
,
J. A.
(
2010
).
MRI quantification of non-Gaussian water diffusion by kurtosis analysis
.
NMR in Biomedicine
,
23
(
7
),
698
710
. https://doi.org/10.1002/nbm.1518
Jensen
,
J. H.
,
Helpern
,
J. A.
,
Ramani
,
A.
,
Lu
,
H.
, &
Kaczynski
,
K.
(
2005
).
Diffusional kurtosis imaging: The quantification of non-Gaussian water diffusion by means of magnetic resonance imaging
.
Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine
,
53
(
6
),
1432
1440
. https://doi.org/10.1002/mrm.20508
Jian
,
B.
, &
Vemuri
,
B. C.
(
2007
).
A unified computational framework for deconvolution to reconstruct multiple fibers from diffusion weighted MRI
.
IEEE Transactions on Medical Imaging
,
26
(
11
),
1464
1471
. https://doi.org/10.1109/tmi.2007.907552
Jones
,
D. K.
, &
Basser
,
P. J.
(
2004
).
“Squashing peanuts and smashing pumpkins”: How noise distorts diffusion-weighted MR data
.
Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine
,
52
(
5
),
979
993
. https://doi.org/10.1002/mrm.20283
Julier
,
S. J.
, &
Uhlmann
,
J. K.
(
1997
).
New extension of the kalman filter to nonlinear systems
. In
Signal processing, sensor fusion, and target recognition VI
(vol
3068
, pp.
182
193
).
SPIE
. https://doi.org/10.1117/12.280797
Kaden
,
E.
,
Knösche
,
T. R.
, &
Anwander
,
A.
(
2007
).
Parametric spherical deconvolution: Inferring anatomical connectivity using diffusion MR imaging
.
NeuroImage
,
37
(
2
),
474
488
. https://doi.org/10.1016/j.neuroimage.2007.05.012
Karimi
,
D.
,
Jaimes
,
C.
,
Machado-Rivas
,
F.
,
Vasung
,
L.
,
Khan
,
S.
,
Warfield
,
S. K.
, &
Gholipour
,
A.
(
2021
).
Deep learning-based parameter estimation in fetal diffusion-weighted MRI
.
NeuroImage
,
243
,
118482
. https://doi.org/10.1016/j.neuroimage.2021.118482
Karimi
,
H. S.
,
Ning
,
L.
, &
Rathi
,
Y.
(
2023
).
A deep-learning framework for estimating the posterior distribution of the standard model of diffusion MRI
. In
2023 IEEE 20th International Symposium on Biomedical Imaging (ISBI)
(pp.
1
4
).
IEEE
. https://doi.org/10.1109/isbi53787.2023.10230446
Koay
,
C. G.
, &
Basser
,
P. J.
(
2006
).
Analytically exact correction scheme for signal extraction from noisy magnitude mr signals
.
Journal of Magnetic Resonance
,
179
(
2
),
317
322
. https://doi.org/10.1016/j.jmr.2006.01.016
Koay
,
C. G.
,
Nevo
,
U.
,
Chang
,
L.-C.
,
Pierpaoli
,
C.
, &
Basser
,
P. J.
(
2008
).
The elliptical cone of uncertainty and its normalized measures in diffusion tensor imaging
.
IEEE Transactions on Medical Imaging
,
27
(
6
),
834
846
. https://doi.org/10.1109/tmi.2008.915663
Kreher
,
B.
,
Schneider
,
J.
,
Mader
,
I.
,
Martin
,
E.
,
Hennig
,
J.
, &
Il’Yasov
,
K.
(
2005
).
Multitensor approach for analysis and tracking of complex fiber configurations
.
Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine
,
54
(
5
),
1216
1225
. https://doi.org/10.1002/mrm.20670
Maier
,
S. E.
,
Vajapeyam
,
S.
,
Mamata
,
H.
,
Westin
,
C.-F.
,
Jolesz
,
F. A.
, &
Mulkern
,
R. V.
(
2004
).
Biexponential diffusion tensor analysis of human brain diffusion data
.
Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine
,
51
(
2
),
321
330
. https://doi.org/10.1002/mrm.10685
Maier-Hein
,
K. H.
,
Neher
,
P. F.
,
Houde
,
J.-C.
,
Côté
,
M.-A.
,
Garyfallidis
,
E.
,
Zhong
,
J.
,
Chamberland
,
M.
,
Yeh
,
F.-C.
,
Lin
,
Y.-C.
,
Ji
,
Q.
,
Reddick
,
W. E.
,
Glass
,
J. O.
,
Chen
,
D. Q.
,
Feng
,
Y.
,
Gao
,
C.
,
Wu
,
Y.
,
Ma
,
J.
,
He
,
R.
,
Li
,
Q.
,…
Descoteaux
,
M.
(
2017
).
The challenge of mapping the human connectome based on diffusion tractography
.
Nature Communications
,
8
(
1
),
1349
. https://doi.org/10.1038/s41467-017-01285-x
Malcolm
,
J. G.
,
Shenton
,
M. E.
, &
Rathi
,
Y.
(
2010
).
Filtered multitensor tractography
.
IEEE Transactions on Medical Imaging
,
29
(
9
),
1664
1675
. https://doi.org/10.1109/tmi.2010.2048121
McLachlan
,
G. J.
, &
Basford
,
K. E.
(
1988
).
Mixture models: Inference and applications to clustering
(vol.
38
).
M. Dekker
New York
. https://www.academia.edu/47662470/Mixture_Models_Inference_and_Applications_to_Clustering
Merlet
,
S. L.
, &
Deriche
,
R.
(
2013
).
Continuous diffusion signal, EAP and ODF estimation via compressive sensing in diffusion MRI
.
Medical Image Analysis
,
17
(
5
),
556
572
. https://doi.org/10.1016/j.media.2013.02.010
Mulkern
,
R. V.
,
Gudbjartsson
,
H.
,
Westin
,
C.-F.
,
Zengingonul
,
H. P.
,
Gartner
,
W.
,
Guttmann
,
C. R.
,
Robertson
,
R. L.
,
Kyriakos
,
W.
,
Schwartz
,
R.
,
Holtzman
,
D.
,
Jolesz
,
F. A.
, &
Maier
,
S. E.
(
1999
).
Multi-component apparent diffusion coefficients in human brain
.
NMR in Biomedicine: An International Journal Devoted to the Development and Application of Magnetic Resonance In Vivo
,
12
(
1
),
51
62
. https://doi.org/10.1002/(sici)1099-1492(199902)12:1<51::aid-nbm546>3.0.co;2-e
Ning
,
L.
,
Laun
,
F.
,
Gur
,
Y.
,
DiBella
,
E. V.
,
Deslauriers-Gauthier
,
S.
,
Megherbi
,
T.
,
Ghosh
,
A.
,
Zucchelli
,
M.
,
Menegaz
,
G.
,
Fick
,
R.
,
St-Jean
,
S.
,
Paquette
,
M.
,
Aranda
,
R.
,
Descoteaux
,
M.
,
Deriche
,
R.
,
O’Donnell
,
L.
, &
Rathi
,
Y.
(
2015
).
Sparse reconstruction challenge for diffusion MRI: Validation on a physical phantom to determine which acquisition scheme and analysis method to use
?
Medical Image Analysis
,
26
(
1
),
316
331
. https://doi.org/10.1016/j.media.2015.10.012
Ning
,
L.
,
Özarslan
,
E.
,
Westin
,
C.-F.
, &
Rathi
,
Y.
(
2017
).
Precise inference and characterization of structural organization (PICASO) of tissue from molecular diffusion
.
NeuroImage
,
146
,
452
473
. https://doi.org/10.1016/j.neuroimage.2016.09.057
Ning
,
L.
,
Westin
,
C.-F.
, &
Rathi
,
Y.
(
2015
).
Estimating diffusion propagator and its moments using directional radial basis functions
.
IEEE Transactions on Medical Imaging
,
34
(
10
),
2058
2078
. https://doi.org/10.1109/tmi.2015.2418674
Novikov
,
D. S.
,
Fieremans
,
E.
,
Jespersen
,
S. N.
, &
Kiselev
,
V. G.
(
2019
).
Quantifying brain microstructure with diffusion MRI: Theory and parameter estimation
.
NMR in Biomedicine
,
32
(
4
),
e3998
. https://doi.org/10.1002/nbm.3998
Novikov
,
D. S.
,
Kiselev
,
V. G.
, &
Jespersen
,
S. N.
(
2018
).
On modeling
.
Magnetic Resonance in Medicine
,
79
(
6
),
3172
3193
. https://doi.org/10.1002/mrm.27101
Olesen
,
J. L.
,
Østergaard
,
L.
,
Shemesh
,
N.
, &
Jespersen
,
S. N.
(
2022
).
Diffusion time dependence, power-law scaling, and exchange in gray matter
.
NeuroImage
,
251
,
118976
. https://doi.org/10.1016/j.neuroimage.2022.118976
Özarslan
,
E.
,
Koay
,
C. G.
,
Shepherd
,
T. M.
,
Komlosh
,
M. E.
,
İrfano ğlu
,
M. O.
,
Pierpaoli
,
C.
, &
Basser
,
P. J.
(
2013
).
Mean apparent propagator (MAP) MRI: A novel diffusion imaging method for mapping tissue microstructure
.
NeuroImage
,
78
,
16
32
. https://doi.org/10.1016/j.neuroimage.2013.04.016
Panagiotaki
,
E.
,
Schneider
,
T.
,
Siow
,
B.
,
Hall
,
M. G.
,
Lythgoe
,
M. F.
, &
Alexander
,
D. C.
(
2012
).
Compartment models of the diffusion mr signal in brain white matter: A taxonomy and comparison
.
NeuroImage
,
59
(
3
),
2241
2254
. https://doi.org/10.1016/j.neuroimage.2011.09.081
Papamakarios
,
G.
,
Pavlakou
,
T.
, &
Murray
,
I.
(
2017
).
Masked autoregressive flow for density estimation
.
Advances in Neural Information Processing Systems
,
30
. https://papers.nips.cc/paper_files/paper/2017/hash/6c1da886822c67822bcf3679d04369fa-Abstract.html
Papamakarios
,
G.
,
Sterratt
,
D.
, &
Murray
,
I.
(
2019
).
Sequential neural likelihood: Fast likelihood-free inference with autoregressive flows
. In
The 22nd International Conference on Artificial Intelligence and Statistics
(pp.
837
848
).
PMLR
. https://proceedings.mlr.press/v89/papamakarios19a/papamakarios19a.pdf
Peled
,
S.
,
Friman
,
O.
,
Jolesz
,
F.
, &
Westin
,
C.-F.
(
2006
).
Geometrically constrained two-tensor model for crossing tracts in dwi
.
Magnetic Resonance Imaging
,
24
(
9
),
1263
1270
. https://doi.org/10.1016/j.mri.2006.07.009
Rathi
,
Y.
,
Gagoski
,
B.
,
Setsompop
,
K.
,
Michailovich
,
O.
,
Grant
,
P. E.
, &
Westin
,
C.-F.
(
2013
).
Diffusion propagator estimation from sparse measurements in a tractography framework
. In
International Conference on Medical Image Computing and Computer-Assisted Intervention
(pp.
510
517
).
Springer
. https://doi.org/10.1007/978-3-642-40760-4_64
Rathi
,
Y.
,
Michailovich
,
O.
,
Shenton
,
M. E.
, &
Bouix
,
S.
(
2009
).
Directional functions for orientation distribution estimation
.
Medical Image Analysis
,
13
(
3
),
432
444
. https://doi.org/10.1016/j.media.2009.01.004
Reddy
,
C. P.
, &
Rathi
,
Y.
(
2016
).
Joint multi-fiber noddi parameter estimation and tractography using the unscented information filter
.
Frontiers in Neuroscience
,
10
,
166
. https://doi.org/10.3389/fnins.2016.00166
Reinhold
,
J. C.
,
He
,
Y.
,
Han
,
S.
,
Chen
,
Y.
,
Gao
,
D.
,
Lee
,
J.
,
Prince
,
J. L.
, &
Carass
,
A.
(
2020
).
Finding novelty with uncertainty
. In
Medical Imaging 2020: Image Processing
(vol.
11313
, pp.
82
87
).
SPIE
. https://doi.org/10.1117/12.2549341
Scherrer
,
B.
, &
Warfield
,
S. K.
(
2010
).
Why multiple b-values are required for multi-tensor models. evaluation with a constrained log-euclidean model
. In
2010 IEEE International Symposium on Biomedical Imaging: From Nano to Macro
(pp.
1389
1392
).
IEEE
. https://doi.org/10.1109/isbi.2010.5490257
Schilling
,
K. G.
,
Janve
,
V.
,
Gao
,
Y.
,
Stepniewska
,
I.
,
Landman
,
B. A.
, &
Anderson
,
A. W.
(
2018
).
Histological validation of diffusion MRI fiber orientation distributions and dispersion
.
NeuroImage
,
165
,
200
221
. https://doi.org/10.1016/j.neuroimage.2017.10.046
Sid
,
F. A.
,
Abed-Meraim
,
K.
,
Harba
,
R.
, &
Oulebsir-Boumghar
,
F.
(
2017
).
Analytical performance bounds for multi-tensor diffusion-MRI
.
Magnetic Resonance Imaging
,
36
,
146
158
. https://doi.org/10.1016/j.mri.2016.10.014
Sisson
,
S. A.
,
Fan
,
Y.
, &
Beaumont
,
M.
(
2018
).
Handbook of approximate Bayesian computation
.
CRC Press
. https://doi.org/10.1201/9781315117195
Sjölund
,
J.
,
Eklund
,
A.
,
Özarslan
,
E.
,
Herberthson
,
M.
,
Bånkestad
,
M.
, &
Knutsson
,
H.
(
2018
).
Bayesian uncertainty quantification in linear models for diffusion MRI
.
NeuroImage
,
175
,
272
285
. https://doi.org/10.1016/j.neuroimage.2018.03.059
St-Jean
,
S.
,
De Luca
,
A.
,
Tax
,
C. M.
,
Viergever
,
M. A.
, &
Leemans
,
A.
(
2020
).
Automated characterization of noise distributions in diffusion MRI data
.
Medical Image Analysis
,
65
,
101758
. https://doi.org/10.1016/j.media.2020.101758
Tanno
,
R.
,
Worrall
,
D. E.
,
Kaden
,
E.
,
Ghosh
,
A.
,
Grussu
,
F.
,
Bizzi
,
A.
,
Sotiropoulos
,
S. N.
,
Criminisi
,
A.
, &
Alexander
,
D. C.
(
2021
).
Uncertainty modelling in deep learning for safer neuroimage enhancement: Demonstration in diffusion MRI
.
NeuroImage
,
225
,
117366
. https://doi.org/10.1016/j.neuroimage.2020.117366
Tejero-Cantero
,
A.
,
Boelts
,
J.
,
Deistler
,
M.
,
Lueckmann
,
J.-M.
,
Durkan
,
C.
,
Gonçalves
,
P. J.
,
Greenberg
,
D. S.
, &
Macke
,
J. H.
(
2020
).
SBI: A toolkit for simulation-based inference
.
Journal of Open Source Software
,
5
(
52
),
2505
. https://doi.org/10.21105/joss.02505
Tian
,
Q.
,
Bilgic
,
B.
,
Fan
,
Q.
,
Liao
,
C.
,
Ngamsombat
,
C.
,
Hu
,
Y.
,
Witzel
,
T.
,
Setsompop
,
K.
,
Polimeni
,
J. R.
, &
Huang
,
S. Y.
(
2020
).
Deepdti: High-fidelity six-direction diffusion tensor imaging using deep learning
.
NeuroImage
,
219
,
117017
. https://doi.org/10.1016/j.neuroimage.2020.117017
Tournier
,
J.-D.
,
Calamante
,
F.
, &
Connelly
,
A.
(
2007
).
Robust determination of the fibre orientation distribution in diffusion MRI: Non-negativity constrained super-resolved spherical deconvolution
.
NeuroImage
,
35
(
4
),
1459
1472
. https://doi.org/10.1016/j.neuroimage.2007.02.016
Tournier
,
J.-D.
,
Smith
,
R.
,
Raffelt
,
D.
,
Tabbara
,
R.
,
Dhollander
,
T.
,
Pietsch
,
M.
,
Christiaens
,
D.
,
Jeurissen
,
B.
,
Yeh
,
C.-H.
, &
Connelly
,
A.
(
2019
).
Mrtrix3: A fast, flexible and open software framework for medical image processing and visualisation
.
NeuroImage
,
202
,
116137
. https://doi.org/10.1016/j.neuroimage.2019.116137
Tuch
,
D. S.
(
2004
).
Q-ball imaging
.
Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine
,
52
(
6
),
1358
1372
. https://doi.org/10.1002/mrm.20279
Tuch
,
D. S.
,
Reese
,
T. G.
,
Wiegell
,
M. R.
,
Makris
,
N.
,
Belliveau
,
J. W.
, &
Wedeen
,
V. J.
(
2002
).
High angular resolution diffusion imaging reveals intravoxel white matter fiber heterogeneity
.
Magnetic Resonance in Medicine: An Official Journal of the International Society for Magnetic Resonance in Medicine
,
48
(
4
),
577
582
. https://doi.org/10.1002/mrm.10268
Van Essen
,
D. C.
,
Smith
,
S. M.
,
Barch
,
D. M.
,
Behrens
,
T. E.
,
Yacoub
,
E.
, &
Ugurbil
,
K.
;
WU-Minn HCP Consortium
. (
2013
).
The WU-Minn human connectome project: An overview
.
NeuroImage
,
80
,
62
79
. https://doi.org/10.1016/j.neuroimage.2013.05.041
Zhang
,
F.
,
Daducci
,
A.
,
He
,
Y.
,
Schiavi
,
S.
,
Seguin
,
C.
,
Smith
,
R.
,
Yeh
,
C.-H.
,
Zhao
,
T.
, &
O’Donnell
,
L. J.
(
2022
).
Quantitative mapping of the brain’s structural connectivity using diffusion MRI tractography: A review
.
NeuroImage
,
249
,
118870
. https://doi.org/10.1016/j.neuroimage.2021.118870
Zhang
,
H.
,
Schneider
,
T.
,
Wheeler-Kingshott
,
C. A.
, &
Alexander
,
D. C.
(
2012
).
Noddi: Practical in vivo neurite orientation dispersion and density imaging of the human brain
.
NeuroImage
,
61
(
4
),
1000
1016
. https://doi.org/10.1016/j.neuroimage.2012.03.072

Originally proposed by Julier and Uhlmann (1997), the unscented transform aims at finding the first- and second-order statistics of a function undergoing a nonlinear transformation. In this paper, we use this transform to estimate the statistics of dMRI measures, which are nonlinear functions (denoted by f()) of eigenvalues. The estimated posterior distribution of the eigenvalues is a Gaussian distribution λ~N(μλ,Σλ). Our goal is to find the mean and covariance of the derived measures (μM,ΣM) using the unscented transform. To this end, we first choose a set of sigma points χk,k{0,...,2D+1}, where D is the dimension of λ. To compute the sigma points, we first find the scaled Cholesky decomposition L of the parameters covariance matrix, L=cholesky(DΣλ). Subsequently, the sigma points, and their associated convex weights are,

(9)
(10)

where L:j is the jth column of matrix L. In (10), κ serves as an adjustable scaling parameter. Then, we calculate the nonlinear transform of each sigma point:

(11)

where f() represents any of the derived measures given in equations (7) or (8). Finally, the transformed distribution can be calculated using the following equations:

(12)
(13)

Remark.The sigma points for the eigenvalues may contain some negative values especially if the elements in λ are close to zero. However, since a negative eigenvalue is meaningless in our case, we enforce a constraint to ensure that all the sigma points fall in the positive region. We apply this constraint by projecting the unconstrained sigma points into positive values using the following quadratic programming problem:

(14)

Afterwards, χ^k serve as new sigma points that do not contain any negative value.

This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International (CC BY 4.0) license, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. For a full description of the license, please visit https://creativecommons.org/licenses/by/4.0/legalcode.