Group-level comparison of brain connectivity networks

Pourmotahari, Fatemeh; Doosti, Hassan; Borumandnia, Nasrin; Tabatabaei, Seyyed Mohammad; Alavi Majd, Hamid

doi:10.1186/s12874-022-01712-8

Research
Open access
Published: 17 October 2022

Group-level comparison of brain connectivity networks

Fatemeh Pourmotahari¹,
Hassan Doosti²,
Nasrin Borumandnia³,
Seyyed Mohammad Tabatabaei⁴ &
…
Hamid Alavi Majd¹

BMC Medical Research Methodology volume 22, Article number: 273 (2022) Cite this article

2277 Accesses
1 Citations
6 Altmetric
Metrics details

Abstract

Background

Functional connectivity (FC) studies are often performed to discern different patterns of brain connectivity networks between healthy and patient groups. Since many neuropsychiatric disorders are related to the change in these patterns, accurate modelling of FC data can provide useful information about disease pathologies. However, analysing functional connectivity data faces several challenges, including the correlations of the connectivity edges associated with network topological characteristics, the large number of parameters in the covariance matrix, and taking into account the heterogeneity across subjects.

Methods

This study provides a new statistical approach to compare the FC networks between subgroups that consider the network topological structure of brain regions and subject heterogeneity.

Results

The power based on the heterogeneity structure of identity scaled in a sample size of 25 exhibited values greater than 0.90 without influencing the degree of correlation, heterogeneity, and the number of regions. This index had values above 0.80 in the small sample size and high correlation. In most scenarios, the type I error was close to 0.05. Moreover, the application of this model on real data related to autism was also investigated, which indicated no significant difference in FC networks between healthy and patient individuals.

Conclusions

The results from simulation data indicated that the proposed model has high power and near-nominal type I error rates in most scenarios.

Peer Review reports

Background

The non-invasive method of functional magnetic resonance imaging (fMRI) identifies the changes of functional connectivity (FC) patterns between healthy and patient individuals by measuring blood-oxygen-level-dependent (BOLD) [1]. Studies indicated that altered topological patterns of brain connectivity are related to many neurological disorders, including autism, Parkinson’s disease, and Alzheimer’s disease. Therefore, functional connectivity analysis is performed to investigate the connectivity patterns among the brain regions of patients to determine the biomarkers of neurodegenerative disorders [2,3,4].

Group comparison of brain connectivity patterns is usually performed using two graph theory-based techniques. In these methods, the network is represented by a graph so that the brain regions are defined as nodes, and the correlation between them is defined as the edge [5,6,7]. The first method is based on comparing the edges individually. In this approach, the correlation coefficient across time series of all-region pairs is calculated to estimate the edges, then the hypothesis of equal FC patterns between healthy and patient groups is assessed using a group-level statistic. Since there are a large number of pairwise edge comparisons, multiple testing procedures such as false discovery rate (FDR) are required to control false positive noise [8, 9]. In the second method, the comparison of brain connectivity is based on the network of edges. Some models investigate the changes in the FC patterns as a subnetwork of the edges, and others investigate them as an entire network of the edges. However, fitting a proper model to determine differentially expressed FC characteristics between subgroups faces some problems due to the high dimension of connectivity data and their complex correlation structure [10,11,12].

Generally, the fit of the FC model based on the edge requires estimating the covariance matrix among the edges. Since the structure of dependency between the edges is related to the network topological structure of brain regions, considering this feature can accurately estimate the model parameters. In conventional statistical models, the structure of parameters dependency is often determined by the distance between brain regions. For example, spatial dependencies are defined based on the distance of voxels that belong to the same region of interest (ROI) [13,14,15,16].

However, because the dependency among the edges is defined based on the network topological structure and is not necessarily limited to spatial adjacency, the results of these models may not be appropriate. On the other hand, because of the high-dimensional FC data, it is difficult to estimate a large number of parameters in the sample covariance matrix between edges, especially if the number of regions in the brain is large. In this regard, there are several optimization techniques for estimating parameters when the number of variables is more than the sample size [17,18,19,20,21]; however, these methods may overlook the network topological features to control false positive and false negative noises [22, 23].

Another feature of FC data is the heterogeneity, which indicates the existence of differences in the connectivity structure of brain regions across subjects within the same group. By considering the heterogeneity in the model, the power of tests can be increased to diagnose neurodegenerative disorders associated with brain connectivity correctly [24]. The heterogeneity across subjects can indicate the changes in the performance of cognitive and behavioural domains in healthy individuals, as well as the severity of symptoms and different responses to clinical interventions in patients [25,26,27,28]. In this regard, using the penalized model-based clustering, DiLernia et al. (2020) reported distinct FC patterns for each individual in the healthy and patient [29].

Many studies have been conducted on FC modelling, which has considered some of the FC data properties. For example, Fiecas et al. (2017) accounted the heterogeneity across subjects by introducing a variance component model. However, in addition to ignoring the spatial features of the brain network, their proposed model cannot estimate the parameters of the edges when the number of ROIs is large. Furthermore, given the large number of ROIs in many brain atlases, applying this model for more than 20 ROIs faces computational challenges [24]. On the other hand, Chen et al. (2020) proposed a nonparametric Bayesian model to estimate the massive parameters of the covariance matrix among edges, which takes the spatial network feature in terms of its topological structure. However, the effect of between-subject heterogeneity is not considered in this model, and equalization of the whole FC network is not examined [23]. Given these challenges, the present study proposes a more comprehensive model for examining differentially expressed FC characteristics between subgroups by considering the network topological structure and the heterogeneity across subjects.

Materials and methods

Estimating covariance matrix between edges

Connectivity data for each subject (n = 1, …, N) is formulated by a matrix ${\boldsymbol{M}}_{\mathrm{v}\times \mathrm{v}}^n=\left\{{M}_{i,j}^n\right\}$ so that the elements of ${M}_{i,j}^n$ represents the Fisher’s Z-transformed correlations between brain regions i and j. The entire brain connectivity network can be displayed by a graph Mⁿ = {V, E}, where V is the set of nodes and E = V(1 − V)/2 is the set of edges. In this setting, the degree is determined by the number of edges, and the correlation coefficient is determined by the edges. The corrections have been made. The matrix Mⁿ is transformed into a vector ${\boldsymbol{Y}}_{\mathbf{1}\times \boldsymbol{E}}^{\boldsymbol{n}}$. Suppose that the vector Y follows a normal multivariate distribution ${\boldsymbol{Y}}_{\mathbf{1}\times \boldsymbol{E}}^{\boldsymbol{n}}\sim MVN\left({\boldsymbol{X}}_{\boldsymbol{n}}^{\boldsymbol{T}}{\boldsymbol{\beta}}_{\boldsymbol{p}\times \boldsymbol{E}},{\mathbf{\sum}}_{\boldsymbol{E}\times \boldsymbol{E}}\right)$, which ${\boldsymbol{X}}_{\boldsymbol{n}}^{\boldsymbol{T}}$ indicates the design matrix with p covariates, β_p × E is the effect of covariates on the vector Y, and ∑_E × E is the covariance matrix. The model has the following form:

$${\boldsymbol{Y}}_{\boldsymbol{N}\times \boldsymbol{E}}={\boldsymbol{X}}_{\boldsymbol{N}\times \boldsymbol{p}}^{\boldsymbol{T}}{\hat{\boldsymbol{\beta}}}_{\boldsymbol{p}\times \boldsymbol{E}}+{{\boldsymbol{R}}^{\boldsymbol{o}}}_{\boldsymbol{N}\times \boldsymbol{E}}$$

(1)

where R^o_N × E is the residual matrix. In the nonparametric Bayesian model, R^o_N × E is used as input data to estimate the covariance matrix by considering the characteristics of the network topological structure.

Let Rⁿ_1 × E ∣ Λ_E × E~MVN(0, Λ_E × E). The edges correlation matrix Λ_E × E = f(G, ρ) is a function of the unknown network structure G and the correlation factors ρ = (ρ₀, ρ₁, …, ρ_k). G is a probability measurement of the latent K networks that follows a Dirichlet process, with parameters G_o and α:

$$G\sim DP\left(\alpha, {G}_o\right)$$

(2)

The correlation matrix based on the network topological structure is given by the equation:

$$\Lambda_{e_{i,j}\times e_{i',j'}}=\left\{\begin{array}{llc} \rho_k && if \;\;\; \omega_i=\omega_j=\omega_i'=\omega_j'=C_k \cr \rho_0 && \ otherwise \end{array}\right.$$

(3)

${\varLambda}_{e_{i,j}\times {e}_{i^{\prime},{j}^{\prime}}}$ is indeed an element of the matrix Λ_E × E which is determined by the correlation between the edges e_{i, j} and ${e}_{i^{\prime },{j}^{\prime }}$. Let e_{i, j} denote connectivity between regions i and j, i ≠ j. Based on whether region i belongs to the cluster k or not, the term ω_i = C_k is used as an indicator variable. If two edges are in the same cluster, it can be supposed that:

$${e}_{i,j}\cong {e}_{i^{\prime },{j}^{\prime }}\in {C}_k\left({e}_{i,j}\in {C}_k,{e}_{i^{\prime },{j}^{\prime }}\in {C}_k\right)\,\,\,\ if\ and\ only\ if\,\,\,\ {\omega}_i={\omega}_j={\omega}_{i^{\prime }}={\omega}_{j^{\prime }}={C}_k$$

The nonparametric Bayesian model is used to detect neighbourhood networks based on the input data and correlation structure in Eq. (3).

A discrete distribution is employed to assign each region in the K networks with probability π = (π₁, …, π_k).

The Dirichlet process in Eq. (2) is therefore equivalent to the following equations:

$${\omega}_i={C}_k \mid \boldsymbol{\pi} \sim Discrete\left(\boldsymbol{\pi} \right),i=1,\dots, V$$

$$\boldsymbol{\pi} \mid \alpha \sim Dirichlet\left({}^{\alpha }\!\!\left/ \!{}_{K}\right.,\dots, {}^{\alpha }\!\!\left/ \!{}_{K}\right.\right),k\to \infty$$

The distributions of ρ_o and ρ_k are assumed to be normal with parameters μ_o ، μ_k ، ${\tau}_o^2$ and ${\tau}_k^2$:

$${\rho}_o \mid {\mu}_o,{\tau}_o^2\sim N\left(\ {\mu}_o,{\tau}_o^2\right)$$

$${\rho}_k \mid {\mu}_k,{\tau}_k^2\sim N\left({\mu}_k,{\tau}_k^2\right),k=1,\dots, K$$

The posterior probability of R_N × E is calculated by the prior distribution ρ, ω and function f:

$$p\left({\boldsymbol{R}}_{\boldsymbol{N}\times \boldsymbol{E}}|{\boldsymbol{\varLambda}}_{\boldsymbol{E}\times \boldsymbol{E}},G,\boldsymbol{\rho} \right)p(G)p\left(\boldsymbol{\rho} \right)\propto {\left|2\pi {\boldsymbol{\varLambda}}_{\boldsymbol{E}\times \boldsymbol{E}}\right|}^{-\frac{N}{2}}\exp \left(-\frac{1}{2}\sum_s{\boldsymbol{R}}_{\boldsymbol{n}}^{\boldsymbol{T}}{\left({\boldsymbol{\varLambda}}_{\boldsymbol{E}\times \boldsymbol{E}}\right)}^{-1}{\boldsymbol{R}}_{\boldsymbol{n}}\right)p(G)p\left(\boldsymbol{\rho} \right)=\exp \left\{-{}^{N}\!\!\left/ \!{}_{2}\right.\left(\mathit{\log}{\left|{\boldsymbol{\varLambda}}_{\boldsymbol{E}\times \boldsymbol{E}}\right|}^{-1}+ tr\left(\boldsymbol{H}{\left({\boldsymbol{\varLambda}}_{\boldsymbol{E}\times \boldsymbol{E}}\right)}^{-1}\right)\right)\right\}p(G)p\left(\boldsymbol{\rho} \right)$$

Where H^o = (R^o)^TR^o/(N − p) and H = (Diag(H^o))^−1/2H^o(Diag(H^o))^−1/2.

Then the sampling of the conditional posterior probabilities ω and ρ is obtained by the Markov chain Monte Carlo (MCMC) algorithm:

$$p\left({\omega}_i={C}_k|{\boldsymbol{\omega}}_{-\boldsymbol{i}},\boldsymbol{\rho}, {\boldsymbol{R}}_{\boldsymbol{N}\times \boldsymbol{E}}\right)$$

$$\propto \exp \left\{-N/2\right(\log \left(\mathit{\det}\left(f\left(\left({\boldsymbol{\omega}}_{-\mathbf{1}},{\omega}_i={C}_k\right),\boldsymbol{\rho} \Big)\right)\right)+ tr\left(\boldsymbol{H}f{\left(\left({\boldsymbol{\omega}}_{-\mathbf{1}},{\omega}_i={C}_k\right),\boldsymbol{\rho} \right)}^{-\mathbf{1}}\right)\right)\Big\}\times \frac{m_{- ik}}{\mathrm{v}-1+\alpha }$$

The number of nodes within the network k is showed by m_−ik = ∑_j ≠ iI(ω_j = C_k).

$$p\left({\omega}_i\ne {\omega}_j\ for\ all\ j\ne i|{\boldsymbol{\omega}}_{-\boldsymbol{i}},\boldsymbol{\rho}, {\boldsymbol{R}}_{\boldsymbol{N}\times \boldsymbol{E}}\right)$$

$$\propto \exp \left\{-N/2\right(\mathit{\log}\left(\mathit{\det}\left(f\left(\left({\boldsymbol{\omega}}_{-\mathbf{1}},{\omega}_i={C}_{k+1}\right),\boldsymbol{\rho} \right)\right)\right)$$

$$+ tr\left(\boldsymbol{H}f{\left(\left({\boldsymbol{\omega}}_{-\mathbf{1}},{\omega}_i={C}_{k+1}\right),\boldsymbol{\rho} \right)}^{-\mathbf{1}}\right)\left)\right\}\times \frac{\alpha }{\mathrm{v}-1+\alpha }$$

The Sherman–Morrison formula is used in the computation of (Λ_E × E)⁻¹ [30]:

$$f{\left(\boldsymbol{\omega}, \boldsymbol{\rho} \right)}^{-1}={\left({\boldsymbol{\varLambda}}_{\boldsymbol{E}\times \boldsymbol{E}}\right)}^{-\mathbf{1}}={\left(A+\sqrt{\rho_o}{\mathbf{1}}_{E\times 1}{\left(\sqrt{\rho_o}{\mathbf{1}}_{E\times 1}\right)}^T\right)}^{-\mathbf{1}}$$

$$={A}^{-1}-\frac{\rho_o{A}^{-1}{\mathbf{1}}_{E\times E}{A}^{-1}}{1+{\rho}_o{\mathbf{1}}_{E\times 1}^T{A}^{-1}{\mathbf{1}}_{E\times 1}}$$

$$p\left({\rho}_o|\boldsymbol{\omega}, \boldsymbol{H},{\boldsymbol{\rho}}_{-\mathbf{0}}\right)$$

$$\propto \exp \left\{-N/2\left(\log \left(\det \left(f\left(\boldsymbol{\omega}, {\rho}_o,{\boldsymbol{\rho}}_{-\mathbf{0}}\right)\right)\right)+ tr\left(\boldsymbol{H}f{\left(\boldsymbol{\omega}, {\rho}_o,{\boldsymbol{\rho}}_{-\mathbf{0}}\right)}^{-1}\right)\right)\right\}\times p\left({\rho}_o|{\mu}_o,{\tau}_o^2\right)$$

$$p\left({\rho}_k|\boldsymbol{\omega}, \boldsymbol{H},{\boldsymbol{\rho}}_{-\boldsymbol{k}}\right)$$

$$\propto \exp \left\{-N/2\left(\log \left(\det \left(f\left(\boldsymbol{\omega}, {\rho}_k,{\boldsymbol{\rho}}_{-\boldsymbol{k}}\right)\right)\right)+ tr\left(\boldsymbol{H}f{\left(\boldsymbol{\omega}, {\rho}_o,{\boldsymbol{\rho}}_{-\boldsymbol{k}}\right)}^{-1}\right)\right)\right\}\times p\left({\rho}_k|{\mu}_k,{\tau}_k^2\right)$$

Finally, given the posterior distributions of ω and ρ, the covariance matrix between the edges is estimated. A detailed description of the theoretical methods of the Bayesian nonparametric model is available in the original paper [23].

Estimating between-subject variability

This section describes the estimation of the covariance matrix Ψ to control subject heterogeneity and the estimation of edges parameters β by considering the dependence structure between edges ∑_E × E. The terms ψ₁, …,ψ_N are considered to apply between-subject variability. It is assumed that ψ_N(1, …, N) is the scaled diagonal identity matrix with E × E dimensions. The Ψ covariance is a block-diagonal matrix with ψ₁, …,ψ_N matrices along its diagonal. Each entry of ψ_N indicates the variability that can be assigned to subject sampling.

Estimation of both β and Ψ parameters are obtained using the iterative method. In this process, an initial value of $\hat{\boldsymbol{\Psi}}$ is replaced in Eq. (4), and then the residuals R = Y − Xβ are calculated.

$$\hat{\boldsymbol{\beta}}={\left({\boldsymbol{X}}^{\prime }{\left({\hat{\boldsymbol{\Sigma}}}_{\boldsymbol{E}\times \boldsymbol{E}}+{\hat{\boldsymbol{\Psi}}}_{\boldsymbol{E}\times \boldsymbol{E}}\right)}^{-1}\boldsymbol{X}\right)}^{-1}{\boldsymbol{X}}^{\prime }{\left({\hat{\boldsymbol{\Sigma}}}_{\boldsymbol{E}\times \boldsymbol{E}}+{\hat{\boldsymbol{\Psi}}}_{\boldsymbol{E}\times \boldsymbol{E}}\right)}^{-1}\boldsymbol{Y}$$

(4)

The estimate $\hat{\boldsymbol{\varPsi}}$ to be updated using R. The process is repeated until convergence is achieved. To estimate the parameter Ψ, the outer product of the E × 1 residual vector is obtained for each subject, then averaged over these vectors to create a residual covariance matrix ${\hat{\boldsymbol{\Omega}}}_{\boldsymbol{E}\times \boldsymbol{E}}$. For ψ_N, the structures of a scaled identity matrix and compound symmetry with the forms ψ_N = σ²I and ψ_N = σ² + b are assumed. Where σ² is set as the average of all diagonal elements of ${\hat{\boldsymbol{\Omega}}}_{\boldsymbol{E}\times \boldsymbol{E}}-{\hat{\boldsymbol{\Sigma}}}_{\boldsymbol{E}\times \boldsymbol{E}}$, I is the identity matrix with dimensions of E × E and b is considered as the average of the off-diagonal elements of ${\hat{\boldsymbol{\Omega}}}_{\boldsymbol{E}\times \boldsymbol{E}}-{\hat{\boldsymbol{\Sigma}}}_{\boldsymbol{E}\times \boldsymbol{E}}$.

Group-level statistic to detect differentially expressed FC patterns.

The basic hypotheses in this study are examined using the following test statistics.

Hypothesis 1: The entire functional connectivity networks are significantly different between case and control groups. The test statistic is:

$${\left(\boldsymbol{C}\left(\hat{{\boldsymbol{\beta}}_{\mathbf{1}}}-\hat{{\boldsymbol{\beta}}_{\mathbf{2}}}\right)\right)}^{\prime }{\left(\boldsymbol{C}\left(\hat{\operatorname{var}}\left(\hat{{\boldsymbol{\beta}}_{\mathbf{1}}}\ \right)+\hat{\operatorname{var}}\left(\hat{{\boldsymbol{\beta}}_{\mathbf{2}}}\right)\right){\boldsymbol{C}}^{\prime}\right)}^{-1}\left(\boldsymbol{C}\left(\hat{{\boldsymbol{\beta}}_{\mathbf{1}}}-\hat{{\boldsymbol{\beta}}_{\mathbf{2}}}\right)\right)$$

(5)

Where the contrast matrix of C is the E × E identity matrix. The parameters β₁ and β₂ are estimated according to Eq. (4). The covariance matrix for each group is estimated $\hat{\operatorname{var}}\left(\hat{\boldsymbol{\beta}}\right)={\left({\boldsymbol{X}}^{\prime }{\left({\hat{\boldsymbol{\Sigma}}}_{\boldsymbol{E}\times \boldsymbol{E}}+{\hat{\boldsymbol{\Psi}}}_{\boldsymbol{E}\times \boldsymbol{E}}\right)}^{-1}\boldsymbol{X}\right)}^{-1}.$

Hypothesis 2: The functional connectivity of the ROI pairwise is significantly different between the case and the control groups. The following test statistic is:

$${\left(\left(\hat{\beta_1}(e)-\hat{\beta_2}(e)\right)\right)}^{\prime }{\left(\left(\hat{\operatorname{var}}\left(\hat{\beta_1}(e)\ \right)+\hat{\operatorname{var}}\left(\hat{\beta_2}(e)\right)\right)\right)}^{-1}\left(\left(\hat{\beta_1}(e)-\hat{\beta_2}(e)\right)\right),e=1,..,E$$

(6)

A permutation test has been used to check the hypotheses while controlling type I error. In this regard, with each resampling of subjects between two groups, the interest parameters are estimated using the iterative algorithm of section. The values of the statistics for each permutation are then derived by replacing them with Eq. (5) and (6). In addition, the FDR method was used to adjust the p-value of the permutation test since there was the high number of multiple comparisons in the second hypothesis. In both the simulation and real data assessments, the nominal level of statistical significance was set at α = 0.05. Figure 1 shows a process of the proposed method to determine differentially expressed FC patterns between two groups.

Simulation settings

In this section, simulation data were applied to assess the performance of the proposed method in terms of statistical power and type I error. The number of nodes V= 20, 25, and 30 with two latent clusters were considered to simulate the dependence structure between the edges. The correlation of the edges inside the cluster was ρ₁ = ρ₂ equal to the values 0.3, 0.5, and 0.7 as well as the correlation of the edges outside the cluster was ρ₀ = 0. In the next step, the Uniform(−δ, δ) distribution was employed to create subject-specific effects. The values for each subject were separately generated from this distribution and were added to the diagonal entries of the edges covariance matrix. Given that the δ parameter controls the degree of heterogeneity between subjects, δ= 0.15 and 0.3 were considered as low and high heterogeneity, respectively.

In the following, connectivity data of each group is generated using a multivariate normal distribution according to the features of the covariance matrix. For example, Fisher’s Z-transformed correlation is considered a connectivity metric.

$$Y_{N\times E}\thicksim \left\{\begin{array}{llc} MVN\left(0,{\sum}_{E\times E}+\kern0.5em \Psi \kern0.5em \right)+ d \;\;\;\;\; \mbox{Control group} \\\cr MVN\left(0,{\sum}_{E\times E}+\kern0.5em \Psi \right) \;\;\;\;\;\;\;\;\;\;\;\;\; \mbox{Case group}\end{array}\right.$$

The d vector reflects the differentially expressed FC network between groups so that an E × 1 zero vector is considered, and then d = 0.8 is randomly replaced for about 5% of zero values.

The results of simulation data were based on 100 iterations for each setting to estimate power and type I errors under the null hypothesis. For all tests, the permutation number was 500. The model performance was evaluated based on the amount of correlation between edges, heterogeneity, and the dimensions of communication data in the sample size of 10 and 25. The proposed methods were implemented with R software version 4.0.5 and MATLAB R2021b. The codes of the method are available upon request.

Application to autism data

Subjects

This study examined resting-state fMRI data of 25 men with autism)7.29–15.66 years old) and 25 healthy men (7.26–14.66 years old) with a full intelligence quotient (FIQ) score above 87. The data were matched on age (p = 0.272) and FIQ score (p = 0.659).

Data acquisition

The resting-state fMRI data of subjects were obtained from the ABIDE database gathered by NYU Langone Medical Center [31]. fMRI images were collected using a 3 T Siemens Allegra scanner for 6 minutes. Participants were asked to relax and stare at the white cross displayed in the middle of a screen with a black background. The following is the scan acquiring procedure: TR = 2000 ms, TE = 15 ms, 33 slices with thickness = 4.0 mm, FOV = 240 mm, flip angle = 90.

Data processing

The pre-processed data were selected from the ABIDE dataset at http://fcon_1000.projects.nitrc.org/indi/abide/ [32]. Pre-processing of fMRI scans was performed using SPM8 software. The first ten values of each time series were eliminated due to the correction of the initial scan heterogeneity and the adaptability of individuals to the surrounding states; hence, a total number of 170 values per individual was considered. Normalization of scans was performed to the MNI (Montreal Neurological Institute) space with a resolution of 3 × 3 × 3 mm³ and correction of head movement according to Friston 24-parameter model [33] .According to the AAl (automated anatomical labelling) atlas, the pre-processed data were partitioned into 116 regions. [34]. The number of regions V = 30 affected by autism, including regions in the default mode network (DMN), were selected to compare data characteristics with simulation scenarios [35, 36].

Results

Simulation data

Table 1 displays the results of the simulation scenarios with ρ = 0.3. Based on the identity scaled structure, the power was above 0.90 in sample size N= 25, regardless of the heterogeneity amount and the number of regions. In the number of regions above 20 and the sample size of 10 and 25, the type I error rates formed on identities scaled and compound symmetry structures were close to 0.05. Based on the compound symmetry structure, the model had acceptable power values without affecting the degree of heterogeneity in the sample size N=25 and V > 20.

Table 1 Type I error and power of the model to check differentially expressed FC patterns between two groups in term of simulation setting: sample size N, structure of between-subject variability, number of regions V, degree of heterogeneity and dependence among the edges ρ = 0.3

Full size table

Table 2 shows the results of the simulation scenarios with ρ = 0.5. In a sample size N=25, the power based on both structures of between-subject variability was greater than 0.8, regardless of the number of regions and heterogeneity. The model demonstrated acceptable values of power in the identity scaled structure when N=10 and V = 30. The type I error was close to 0.05 in most scenarios for the region numbers of 20 and 25.

Table 2 Type I error and power of the model to check differentially expressed FC patterns between two groups in term of simulation setting: sample size N, structure of between-subject variability, number of regions V, degree of heterogeneity and dependence among the edges ρ = 0.5

Full size table

Table 3 shows the results of the simulation scenarios with ρ = 0.7. Based on the identity scaled structure, the power was above 0.80, regardless of the number of regions, heterogeneity, and sample size. However, according to the compound symmetry structure, the model had lower power in most scenarios. In more than half of the simulation items, the type I error was less than or equal to 0.05.

Table 3 Type I error and power of the model to check differentially expressed FC patterns between two groups in term of simulation setting: sample size N, structure of between-subject variability, number of regions V, degree of heterogeneity and dependence among the edges ρ = 0.7

Full size table

In summary, the power based on identity scaled structure in a sample size of 25 exhibited values greater than 0.90 without influencing the degree of correlation, heterogeneity, and the number of regions. This index had values above 0.80 in the small sample size and ρ = 0.7. In most scenarios, the type I error was near to the nominal level. With increasing the number of regions in the sample N = 25 and ρ= 0.3,0.5, the power based on compound symmetry structure was greater than 0.80. When there was a high correlation and a small sample size, however, the model yielded lower power. In most simulation items with high heterogeneity, the type I error based on compound symmetry structure was near to the nominal level.

Autism data

The proposed model in the current study was used to compare brain connectivity patterns between healthy individuals and patients with autism. Firstly, the estimation correlation coefficient and clustering of brain regions were calculated using the nonparametric Bayesian method to consider the network topological features. In this method, the brain network regions were divided into four clusters, K = 4, which the largest cluster consisted of 14 regions. Then the covariance matrix with dimensions of 435 × 435 was obtained using the dependency structure of clusters. Following that, the FC patterns between the patient and healthy groups were compared by considering the effect of heterogeneity. Based on compound symmetry and identity scaled structures, the test statistic values were 20.28(p = 0.517) and 21.68(p = 0.457), respectively. These results indicate no significant difference in entire FC networks between patients and healthy individuals. Additionally, the Pard algorithm proposed by Chen et al. (2015) and the aSPU method proposed by Pan et al. (2014) were used to examine the differences in FC patterns [10, 37]. These models also displayed no significant FC networks between the two groups.

The FC comparisons of the pair ROIs were performed using test statistics of Eq. (6). Figure 2 depicts the different edges between the healthy and patient groups in three forms: axial, coronal, and sagittal views. In this chart, the increase of functional relationship of the healthy individuals compared to autism patients is manifested with a green edge, and the decrease of functional relationship is manifested with a yellow edge. The results showed out of 435, 18 edges were significant using both structures of variability. However, due to the high number of multiple comparisons, the p-value was corrected using the FDR approach. Therefore, after p-values adjustment, there was no substantial difference in connectivity between the two groups. Table 4 provides more details of connectivity changes between brain regions.

Table 4 Differentially expressed edges by proposed method; the symbol shows a connectivity increase between brain regions in healthy individuals compared to the patients group. The symbol shows a connectivity decrease.

Discussion

Simulation data

To perform statistical inference on FC data, it seems necessary to consider the properties of heterogeneity between subjects and the dependency structure among edges. In previous studies, the spatial closeness between the regions defines the spatial structure [13,14,15,16, 38]. Accordingly, given that four regions define a pair of edges, it is difficult to consider the distance between the edges in statistical models. On the other hand, the dependency between the edges is not necessarily limited to the spatial adjacency of the four regions. It can be related to the network topological structure. Recently, Chen et al. (2020) proposed a nonparametric Bayesian model based on network topological features to model the dependency structure of edges. However, this model does not include the heterogeneity effect between subjects which leads to accurate estimation of the parameters. The statistical model proposed in the present study investigates the FC network equality between the patient and healthy groups by combining the nonparametric Bayesian model for estimating the dependency structure between the edges and the algorithm proposed by Fiecas et al. (2017) for considering the heterogeneity of subjects [23, 24].

Simulation data showed that the power improves by increasing the correlation between the edges, based on identity scaled matrix structure. Accordingly, the Bayesian model proposed by Chen et al. (2020) had high accuracy when the correlation was above 0.3 [23]. These results indicate that model performance increases when detecting the network topological structure at higher correlations.

In the compound symmetry structure, the power was affected by the number of regions, sample size, and the edges correlation. Based on compound symmetry and identity scaled structures, the type I error was near to the nominal level when data dimensions and heterogeneity were high. In this regard, Fiecas et al. (2017) presented a variance component model by applying heterogeneity between subjects and temporal dependency, whose performance was similar to the proposed model based on compound symmetry structure in terms of statistical power and type I error. However, in the proposed model, the power based on identity scaled structure yielded a higher value than the variance component model study without limiting the amount of heterogeneity and the number of regions in the sample size of 25 [24]. Accordingly, correct modelling of the dependency structure between the edges can improve the accuracy of estimating the model parameters and the power of the statistical test.

In FC studies, the scale of inference (i.e., at the edge, cluster or network scale) can significantly affect the results of statistical tests. It was recently shown that network-based analysis improves the power to capture average-size effects [39]. Consequently, it is important that the proposed model includes levels of inference from both the network and individual edges.

Another advantage of the proposed model compared to Fiecas et al.’s (2017) model is the improvement of computational power from 190 to 435 edges. However, since the interactions between the brain regions in high dimensions often show network topological features, applying more than 30 regions is one of the model’s limitations due to the high computational cost and time. Figure 3 shows the computational time of the permutation test in different dimensions and the sample size of 10 and 25 on a computer with an i8 CPU and 16G RAM. In order to avoid computing difficulties, structures with a small number of parameters, including the identity scaled and compound symmetry, were used in this study to estimate the between-subject covariance. Therefore, any covariates were not considered in the model. On the other hand, Pard and aSPU models are some of the existing methods for evaluating FC networks, with no limitations in the dimensions of connectivity data [10, 37]. Since there are different patterns of functional connectivity for each subject [29], the heterogeneity effect between subjects is not included in these models.

Autism data

Estimating the FC patterns can provide a better understanding of the pathologies related to neurodegenerative diseases such as autism, and thus is essential in clinical research. The proposed statistical model in the current study was applied to resting-state fMRI data obtained on 25 patients with autism and 25 healthy individuals. There was no substantial difference in entire FC networks between the patient and healthy groups, which was consistent with the findings of the models proposed by Pan et al. (2014) and Chen et al. (2015) [10, 37]. These results contradict the results of previous studies examining the functional connectivity patterns of autistic patients compared to healthy individuals, which could be due to a large number of regions and the high sample size investigated in these studies [35, 40, 41]. The proposed model is applicable to the analysis of whole-brain connectivity data in practice. However, when there were more than 30 regions, the model’s only shortcoming was the high computation time (Fig. 3). The model can be used with large sample sizes as well. Nevertheless, in order to compare real data characteristics with the simulated scenarios, a sample size of 25 was considered for each patient and healthy group.

Another hypothesis in the proposed statistical model is the FC comparisons of pair regions. In this regard, the permutation test showed the difference in the functional connectivity of several regions between patients and healthy groups regardless of p-value adjustment. Most of these changes were related to DMN network regions, including frontal superior medial, cingulum (anterior and posterior), praecuneus, and angular. For example, in the patient group, there was a connectivity decrease of the right posterior cingulate region with the left superior frontal (dorsolateral), left middle frontal regions, and the right anterior cingulate region with the left middle frontal (orbital part). On the other hand, there was an increase in the dependency amount of the right frontal superior medial with the left superior frontal gyrus (medial orbital), and the left angular with the left rectus. Accordingly, several studies have reported functional connectivity changes of DMN regions in autism patients. [42,43,44]. DMN is one of the most important brain networks whose function changes under the influence of neurological disorders such as autism. These functional changes have a substantial impact on cognitive functions. [45].

Conclusion

This study presents a new approach for determining differentially expressed FC patterns between two groups, in which heterogeneity between subjects and the structure of dependency among edges are simultaneously considered. Simulation data indicated the high power and near-nominal type I error rates in the proposed model. Additionally, the application of the model on real data related to autism was also evaluated, and there was no significant difference in the functional connectivity network between the patient and healthy groups.

Availability of data and materials

The corresponding author can provide the codes of the method upon reasonable request. The data presented in this study are available from the ABIDE database at http://preprocessed-connectomes-project.org/abide/.

Abbreviations

fMRI:: Functional magnetic resonance imaging
FC:: Functional connectivity
BOLD:: Blood-oxygen-level-dependent
FDR:: False discovery rate
ROI:: Region of interest
MNI:: Montreal Neurological Institute
AAl:: Automated anatomical labelling
MCMC:: Markov chain Monte Carlo
FIQ:: Full intelligence quotient
DMN:: Default mode network

References

Greicius M. Resting-state functional connectivity in neuropsychiatric disorders. Curr Opin Neurol. 2008;21(4):424–30.
Article PubMed Google Scholar
Weng S-J, Wiggins JL, Peltier SJ, Carrasco M, Risi S, Lord C, et al. Alterations of resting state functional connectivity in the default network in adolescents with autism spectrum disorders. Brain Res. 2010;1313:202–14.
Article CAS PubMed Google Scholar
Supekar K, Menon V, Rubin D, Musen M, Greicius MD. Network analysis of intrinsic functional brain connectivity in Alzheimer’s disease. PLoS Comput Biol. 2008;4(6):e1000100.
Article PubMed PubMed Central Google Scholar
Wu T, Wang L, Chen Y, Zhao C, Li K, Chan P. Changes of functional connectivity of the motor network in the resting state in Parkinson’s disease. Neurosci Lett. 2009;460(1):6–10.
Article CAS PubMed Google Scholar
He Y, Evans A. Graph theoretical modeling of brain connectivity. Curr Opin Neurol. 2010;23(4):341–50.
Article PubMed Google Scholar
Fornito A, Zalesky A, Breakspear M. Graph analysis of the human connectome: promise, progress, and pitfalls. Neuroimage. 2013;80:426–44.
Article PubMed Google Scholar
Simpson SL, Moussa MN, Laurienti PJ. An exponential random graph modeling approach to creating group-based representative whole-brain connectivity networks. Neuroimage. 2012;60(2):1117–26.
Article PubMed Google Scholar
Bullmore E, Sporns O. Complex brain networks: graph theoretical analysis of structural and functional systems. Nat Rev Neurosci. 2009;10(3):186–98.
Article CAS PubMed Google Scholar
Simpson SL, Lyday RG, Hayasaka S, Marsh AP, Laurienti PJ. A permutation testing framework to compare groups of brain networks. Front Comput Neurosci. 2013;7:171.
Article PubMed PubMed Central Google Scholar
Chen S, Kang J, Xing Y, Wang G. A parsimonious statistical method to detect groupwise differentially expressed functional connectivity networks. Hum Brain Mapp. 2015;36(12):5196–206.
Article PubMed PubMed Central Google Scholar
Kim J, Wozniak JR, Mueller BA, Pan W. Testing group differences in brain functional connectivity: using correlations or partial correlations? Brain Connect. 2015;5(4):214–31.
Article PubMed PubMed Central Google Scholar
Zalesky A, Fornito A, Bullmore ET. Network-based statistic: identifying differences in brain networks. Neuroimage. 2010;53(4):1197–207.
Article PubMed Google Scholar
Castruccio S, Ombao H, Genton MG. A scalable multi-resolution spatio-temporal model for brain activation and connectivity in fMRI data. Biometrics. 2018;74(3):823–33.
Article PubMed Google Scholar
Kang H, Ombao H, Fonnesbeck C, Ding Z, Morgan VL. A Bayesian double fusion model for resting-state brain connectivity using joint functional and structural data. Brain Connect. 2017;7(4):219–27.
Article PubMed PubMed Central Google Scholar
Wang R, Albert KM, Taylor WD, Boyd BD, Blaber J, Lyu I, et al. A bayesian approach to examining default mode network functional connectivity and cognitive performance in major depressive disorder. Psychiatry Res Neuroimaging. 2020;301:111102.
Article PubMed PubMed Central Google Scholar
Li F, Zhang T, Wang Q, Gonzalez MZ, Maresh EL, Coan JA. Spatial Bayesian variable selection and grouping for high-dimensional scalar-on-image regression. Ann Appl Stat. 2015;9(2):687–713.
Article CAS Google Scholar
Rothman AJ, Levina E, Zhu J. Generalized thresholding of large covariance matrices. J Am Stat Assoc. 2009;104(485):177–86.
Article CAS Google Scholar
Shen X, Pan W, Zhu Y. Likelihood-based selection and sharp parameter estimation. J Am Stat Assoc. 2012;107(497):223–32.
Article PubMed PubMed Central Google Scholar
Cui Y, Leng C, Sun D. Sparse estimation of high-dimensional correlation matrices. Comput Stat Data Anal. 2016;93:390–403.
Article Google Scholar
Bickel PJ, Levina E. Regularized estimation of large covariance matrices. Ann Stat. 2008;36(1):199–227.
Article Google Scholar
Ravikumar P, Wainwright MJ, Raskutti G, Yu B. High-dimensional covariance estimation by minimizing ℓ1-penalized log-determinant divergence. Electron J Stat. 2011;5:935–80.
Article Google Scholar
Chen S, Kang J, Xing Y, Zhao Y, Milton DK. Estimating large covariance matrix with network topology for high-dimensional biomedical data. Comput Stat Data Anal. 2018;127:82–95.
Article Google Scholar
Chen S, Xing Y, Kang J, Kochunov P, Hong LE. Bayesian modeling of dependence in brain connectivity data. Biostatistics. 2020;21(2):269–86.
Article CAS PubMed Google Scholar
Fiecas M, Cribben I, Bahktiari R, Cummine J. A variance components model for statistical inference on functional connectivity networks. Neuroimage. 2017;149:256–66.
Article PubMed Google Scholar
Mueller S, Wang D, Fox MD, Yeo BTT, Sepulcre J, Sabuncu MR, et al. Individual variability in functional connectivity architecture of the human brain. Neuron. 2013;77(3):586–95.
Article CAS PubMed PubMed Central Google Scholar
Wang D, Liu H. Functional connectivity architecture of the human brain: not all the same. Neurosci. 2014;20(5):432–8.
Google Scholar
Nawaz U, Lee I, Beermann A, Eack S, Keshavan M, Brady R Jr. Individual variation in functional brain network topography is linked to schizophrenia symptomatology. Schizophr Bull. 2021;47(1):180–8.
Article PubMed Google Scholar
Price RB, Lane S, Gates K, Kraynak TE, Horner MS, Thase ME, et al. Parsing heterogeneity in the brain connectivity of depressed and healthy adults during positive mood. Biol Psychiatry. 2017;81(4):347–57.
Article PubMed Google Scholar
DiLernia A, Quevedo K, Camchong J, Lim K, Pan W, Zhang L. Penalized model-based clustering of fMRI data. arXiv Prepr arXiv201006408. 2020.
Press WH, Teukolsky SA, Vetterling WT, Flannery BP. Numerical recipes with source code CD-ROM 3rd edition: the art of scientific computing: Cambridge University Press; 2007.
Google Scholar
Di Martino A, Yan C-G, Li Q, Denio E, Castellanos FX, Alaerts K, et al. The autism brain imaging data exchange: towards a large-scale evaluation of the intrinsic brain architecture in autism. Mol Psychiatry. 2014;19(6):659–67.
Article PubMed Google Scholar
Craddock C, Benhajali Y, Chu C, Chouinard F, Evans A, Jakab A, et al. The neuro bureau preprocessing initiative: open sharing of preprocessed neuroimaging data and derivatives. Front Neuroinform. 2013;7.
Friston KJ, Williams S, Howard R, Frackowiak RSJ, Turner R. Movement-related effects in fMRI time-series. Magn Reson Med. 1996;35(3):346–55.
Article CAS PubMed Google Scholar
Tzourio-Mazoyer N, Landeau B, Papathanassiou D, Crivello F, Etard O, Delcroix N, et al. Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI single-subject brain. Neuroimage. 2002;15(1):273–89.
Article CAS PubMed Google Scholar
Chen S, Xing Y, Kang J. Latent and abnormal functional connectivity circuits in autism spectrum disorder. Front Neurosci. 2017;11:125.
Article PubMed PubMed Central Google Scholar
Xu S, Li M, Yang C, Fang X, Ye M, Wei L, et al. Altered functional connectivity in children with low-function autism spectrum disorders. Front Neurosci. 2019;13:806.
Article PubMed PubMed Central Google Scholar
Pan W, Kim J, Zhang Y, Shen X, Wei P. A powerful and adaptive association test for rare variants. Genetics. 2014;197(4):1081–95.
Article PubMed PubMed Central Google Scholar
Al-Momani M, Hussein AA, Ahmed SE. Penalty and related estimation strategies in the spatial error model. Statistica Neerlandica. 2017;71(1):4–30.
Article Google Scholar
Noble S, Mejia M, Zalesky A, Scheinost D. Leveling up: improving power in fMRI by moving beyond cluster-level inference. BioRxiv. 2021.
Chen H, Nomi JS, Uddin LQ, Duan X, Chen H. Intrinsic functional connectivity variance and state-specific under-connectivity in autism. Hum Brain Mapp. 2017;38(11):5740–55.
Article PubMed PubMed Central Google Scholar
Chen S, Kang J, Wang G. An empirical Bayes normalization method for connectivity metrics in resting state fMRI. Front Neurosci. 2015;9:316.
Article PubMed PubMed Central Google Scholar
Jung M, Kosaka H, Saito DN, Ishitobi M, Morita T, Inohara K, et al. Default mode network in young male adults with autism spectrum disorder: relationship with autism spectrum traits. Mol Autism. 2014;5(1):1–11.
Article CAS Google Scholar
Monk CS, Peltier SJ, Wiggins JL, Weng S-J, Carrasco M, Risi S, et al. Abnormalities of intrinsic functional connectivity in autism spectrum disorders. Neuroimage. 2009;47(2):764–72.
Article PubMed Google Scholar
Murdaugh DL, Shinkareva SV, Deshpande HR, Wang J, Pennick MR, Kana RK. Differential deactivation during mentalizing and classification of autism based on default mode network connectivity. PLoS One. 2012;7(11):e50064.
Article CAS PubMed PubMed Central Google Scholar
Padmanabhan A, Lynch CJ, Schaer M, Menon V. The default mode network in autism. Biol Psychiatry Cogn Neurosci Neuroimaging. 2017;2(6):476–86.
PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We would thank the ABIDE team for sharing the fMRI data.

Funding

None.

Author information

Authors and Affiliations

Department of Biostatistics, Faculty of Paramedical Sciences, Shahid Beheshti University of Medical Sciences, Tehran, Iran
Fatemeh Pourmotahari & Hamid Alavi Majd
Department of Mathematics and Statistics, Macquarie University, Macquarie, Australia
Hassan Doosti
Urology and Nephrology Research Centre, Shahid Beheshti University of Medical Sciences, Tehran, Iran
Nasrin Borumandnia
Department of Medical Informatics, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
Seyyed Mohammad Tabatabaei

Authors

Fatemeh Pourmotahari
View author publications
You can also search for this author in PubMed Google Scholar
Hassan Doosti
View author publications
You can also search for this author in PubMed Google Scholar
Nasrin Borumandnia
View author publications
You can also search for this author in PubMed Google Scholar
Seyyed Mohammad Tabatabaei
View author publications
You can also search for this author in PubMed Google Scholar
Hamid Alavi Majd
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

FP, HA developed the statistical method. FP and SMT wrote the simulation computer code and performed the analysis of autism data. FP and NB interpreted the results. FP wrote the draft of the manuscript. HA, HD and NB reviewed and edited the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Hamid Alavi Majd.

Ethics declarations

Ethics approval and consent to participate

Written informed consent was obtained for all participants, and experimental protocols were approved by the Institutional Review Board at the NYU Langone Medical Center. The autism dataset was de-identified in accordance with HIPAA (Health Insurance Portability and Accountability) guidelines, and no protected health information included. All methods were carried out in accordance with relevant guidelines and regulations. The Ethics Committee of Shahid Beheshti University of Medical Sciences accepted this research (IR.SBMU.RETECH.REC.1399.1140).

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Pourmotahari, F., Doosti, H., Borumandnia, N. et al. Group-level comparison of brain connectivity networks. BMC Med Res Methodol 22, 273 (2022). https://doi.org/10.1186/s12874-022-01712-8

Download citation

Received: 24 May 2022
Accepted: 16 August 2022
Published: 17 October 2022
DOI: https://doi.org/10.1186/s12874-022-01712-8

Group-level comparison of brain connectivity networks

Abstract

Background

Methods

Results

Conclusions

Background

Materials and methods

Estimating covariance matrix between edges

Estimating between-subject variability

Group-level statistic to detect differentially expressed FC patterns.

Simulation settings

Application to autism data

Subjects

Data acquisition

Data processing

Results

Simulation data

Autism data

Discussion

Simulation data

Autism data

Conclusion

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Supplementary Information

Additional file 1.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Medical Research Methodology

Contact us