 Research
 Open Access
 Published:
Grouplevel comparison of brain connectivity networks
BMC Medical Research Methodology volume 22, Article number: 273 (2022)
Abstract
Background
Functional connectivity (FC) studies are often performed to discern different patterns of brain connectivity networks between healthy and patient groups. Since many neuropsychiatric disorders are related to the change in these patterns, accurate modelling of FC data can provide useful information about disease pathologies. However, analysing functional connectivity data faces several challenges, including the correlations of the connectivity edges associated with network topological characteristics, the large number of parameters in the covariance matrix, and taking into account the heterogeneity across subjects.
Methods
This study provides a new statistical approach to compare the FC networks between subgroups that consider the network topological structure of brain regions and subject heterogeneity.
Results
The power based on the heterogeneity structure of identity scaled in a sample size of 25 exhibited values greater than 0.90 without influencing the degree of correlation, heterogeneity, and the number of regions. This index had values above 0.80 in the small sample size and high correlation. In most scenarios, the type I error was close to 0.05. Moreover, the application of this model on real data related to autism was also investigated, which indicated no significant difference in FC networks between healthy and patient individuals.
Conclusions
The results from simulation data indicated that the proposed model has high power and nearnominal type I error rates in most scenarios.
Background
The noninvasive method of functional magnetic resonance imaging (fMRI) identifies the changes of functional connectivity (FC) patterns between healthy and patient individuals by measuring bloodoxygenleveldependent (BOLD) [1]. Studies indicated that altered topological patterns of brain connectivity are related to many neurological disorders, including autism, Parkinson’s disease, and Alzheimer’s disease. Therefore, functional connectivity analysis is performed to investigate the connectivity patterns among the brain regions of patients to determine the biomarkers of neurodegenerative disorders [2,3,4].
Group comparison of brain connectivity patterns is usually performed using two graph theorybased techniques. In these methods, the network is represented by a graph so that the brain regions are defined as nodes, and the correlation between them is defined as the edge [5,6,7]. The first method is based on comparing the edges individually. In this approach, the correlation coefficient across time series of allregion pairs is calculated to estimate the edges, then the hypothesis of equal FC patterns between healthy and patient groups is assessed using a grouplevel statistic. Since there are a large number of pairwise edge comparisons, multiple testing procedures such as false discovery rate (FDR) are required to control false positive noise [8, 9]. In the second method, the comparison of brain connectivity is based on the network of edges. Some models investigate the changes in the FC patterns as a subnetwork of the edges, and others investigate them as an entire network of the edges. However, fitting a proper model to determine differentially expressed FC characteristics between subgroups faces some problems due to the high dimension of connectivity data and their complex correlation structure [10,11,12].
Generally, the fit of the FC model based on the edge requires estimating the covariance matrix among the edges. Since the structure of dependency between the edges is related to the network topological structure of brain regions, considering this feature can accurately estimate the model parameters. In conventional statistical models, the structure of parameters dependency is often determined by the distance between brain regions. For example, spatial dependencies are defined based on the distance of voxels that belong to the same region of interest (ROI) [13,14,15,16].
However, because the dependency among the edges is defined based on the network topological structure and is not necessarily limited to spatial adjacency, the results of these models may not be appropriate. On the other hand, because of the highdimensional FC data, it is difficult to estimate a large number of parameters in the sample covariance matrix between edges, especially if the number of regions in the brain is large. In this regard, there are several optimization techniques for estimating parameters when the number of variables is more than the sample size [17,18,19,20,21]; however, these methods may overlook the network topological features to control false positive and false negative noises [22, 23].
Another feature of FC data is the heterogeneity, which indicates the existence of differences in the connectivity structure of brain regions across subjects within the same group. By considering the heterogeneity in the model, the power of tests can be increased to diagnose neurodegenerative disorders associated with brain connectivity correctly [24]. The heterogeneity across subjects can indicate the changes in the performance of cognitive and behavioural domains in healthy individuals, as well as the severity of symptoms and different responses to clinical interventions in patients [25,26,27,28]. In this regard, using the penalized modelbased clustering, DiLernia et al. (2020) reported distinct FC patterns for each individual in the healthy and patient [29].
Many studies have been conducted on FC modelling, which has considered some of the FC data properties. For example, Fiecas et al. (2017) accounted the heterogeneity across subjects by introducing a variance component model. However, in addition to ignoring the spatial features of the brain network, their proposed model cannot estimate the parameters of the edges when the number of ROIs is large. Furthermore, given the large number of ROIs in many brain atlases, applying this model for more than 20 ROIs faces computational challenges [24]. On the other hand, Chen et al. (2020) proposed a nonparametric Bayesian model to estimate the massive parameters of the covariance matrix among edges, which takes the spatial network feature in terms of its topological structure. However, the effect of betweensubject heterogeneity is not considered in this model, and equalization of the whole FC network is not examined [23]. Given these challenges, the present study proposes a more comprehensive model for examining differentially expressed FC characteristics between subgroups by considering the network topological structure and the heterogeneity across subjects.
Materials and methods
Estimating covariance matrix between edges
Connectivity data for each subject (n = 1, …, N) is formulated by a matrix \({\boldsymbol{M}}_{\mathrm{v}\times \mathrm{v}}^n=\left\{{M}_{i,j}^n\right\}\) so that the elements of \({M}_{i,j}^n\) represents the Fisher’s Ztransformed correlations between brain regions i and j. The entire brain connectivity network can be displayed by a graph M^{n} = {V, E}, where V is the set of nodes and E = V(1 − V)/2 is the set of edges. In this setting, the degree is determined by the number of edges, and the correlation coefficient is determined by the edges. The corrections have been made. The matrix M^{n} is transformed into a vector \({\boldsymbol{Y}}_{\mathbf{1}\times \boldsymbol{E}}^{\boldsymbol{n}}\). Suppose that the vector Y follows a normal multivariate distribution \({\boldsymbol{Y}}_{\mathbf{1}\times \boldsymbol{E}}^{\boldsymbol{n}}\sim MVN\left({\boldsymbol{X}}_{\boldsymbol{n}}^{\boldsymbol{T}}{\boldsymbol{\beta}}_{\boldsymbol{p}\times \boldsymbol{E}},{\mathbf{\sum}}_{\boldsymbol{E}\times \boldsymbol{E}}\right)\), which \({\boldsymbol{X}}_{\boldsymbol{n}}^{\boldsymbol{T}}\) indicates the design matrix with p covariates, β_{p × E} is the effect of covariates on the vector Y, and ∑_{E × E} is the covariance matrix. The model has the following form:
where R^{o}_{N × E} is the residual matrix. In the nonparametric Bayesian model, R^{o}_{N × E} is used as input data to estimate the covariance matrix by considering the characteristics of the network topological structure.
Let R^{n}_{1 × E} ∣ Λ_{E × E}~MVN(0, Λ_{E × E}). The edges correlation matrix Λ_{E × E} = f(G, ρ) is a function of the unknown network structure G and the correlation factors ρ = (ρ_{0}, ρ_{1}, …, ρ_{k}). G is a probability measurement of the latent K networks that follows a Dirichlet process, with parameters G_{o} and α:
The correlation matrix based on the network topological structure is given by the equation:
\({\varLambda}_{e_{i,j}\times {e}_{i^{\prime},{j}^{\prime}}}\) is indeed an element of the matrix Λ_{E × E} which is determined by the correlation between the edges e_{i, j} and \({e}_{i^{\prime },{j}^{\prime }}\). Let e_{i, j} denote connectivity between regions i and j, i ≠ j. Based on whether region i belongs to the cluster k or not, the term ω_{i} = C_{k} is used as an indicator variable. If two edges are in the same cluster, it can be supposed that:
The nonparametric Bayesian model is used to detect neighbourhood networks based on the input data and correlation structure in Eq. (3).
A discrete distribution is employed to assign each region in the K networks with probability π = (π_{1}, …, π_{k}).
The Dirichlet process in Eq. (2) is therefore equivalent to the following equations:
The distributions of ρ_{o} and ρ_{k} are assumed to be normal with parameters μ_{o} ، μ_{k} ، \({\tau}_o^2\) and \({\tau}_k^2\):
The posterior probability of R_{N × E} is calculated by the prior distribution ρ, ω and function f:
Where H^{o} = (R^{o})^{T}R^{o}/(N − p) and H = (Diag(H^{o}))^{−1/2}H^{o}(Diag(H^{o}))^{−1/2}.
Then the sampling of the conditional posterior probabilities ω and ρ is obtained by the Markov chain Monte Carlo (MCMC) algorithm:
The number of nodes within the network k is showed by m_{−ik} = ∑_{j ≠ i}I(ω_{j} = C_{k}).
The Sherman–Morrison formula is used in the computation of (Λ_{E × E})^{−1} [30]:
Finally, given the posterior distributions of ω and ρ, the covariance matrix between the edges is estimated. A detailed description of the theoretical methods of the Bayesian nonparametric model is available in the original paper [23].
Estimating betweensubject variability
This section describes the estimation of the covariance matrix Ψ to control subject heterogeneity and the estimation of edges parameters β by considering the dependence structure between edges ∑_{E × E}. The terms ψ_{1}, …,ψ_{N} are considered to apply betweensubject variability. It is assumed that ψ_{N}(1, …, N) is the scaled diagonal identity matrix with E × E dimensions. The Ψ covariance is a blockdiagonal matrix with ψ_{1}, …,ψ_{N} matrices along its diagonal. Each entry of ψ_{N} indicates the variability that can be assigned to subject sampling.
Estimation of both β and Ψ parameters are obtained using the iterative method. In this process, an initial value of \(\hat{\boldsymbol{\Psi}}\) is replaced in Eq. (4), and then the residuals R = Y − Xβ are calculated.
The estimate \(\hat{\boldsymbol{\varPsi}}\) to be updated using R. The process is repeated until convergence is achieved. To estimate the parameter Ψ, the outer product of the E × 1 residual vector is obtained for each subject, then averaged over these vectors to create a residual covariance matrix \({\hat{\boldsymbol{\Omega}}}_{\boldsymbol{E}\times \boldsymbol{E}}\). For ψ_{N}, the structures of a scaled identity matrix and compound symmetry with the forms ψ_{N} = σ^{2}I and ψ_{N} = σ^{2} + b are assumed. Where σ^{2} is set as the average of all diagonal elements of \({\hat{\boldsymbol{\Omega}}}_{\boldsymbol{E}\times \boldsymbol{E}}{\hat{\boldsymbol{\Sigma}}}_{\boldsymbol{E}\times \boldsymbol{E}}\), I is the identity matrix with dimensions of E × E and b is considered as the average of the offdiagonal elements of \({\hat{\boldsymbol{\Omega}}}_{\boldsymbol{E}\times \boldsymbol{E}}{\hat{\boldsymbol{\Sigma}}}_{\boldsymbol{E}\times \boldsymbol{E}}\).
Grouplevel statistic to detect differentially expressed FC patterns.
The basic hypotheses in this study are examined using the following test statistics.
Hypothesis 1: The entire functional connectivity networks are significantly different between case and control groups. The test statistic is:
Where the contrast matrix of C is the E × E identity matrix. The parameters β_{1} and β_{2} are estimated according to Eq. (4). The covariance matrix for each group is estimated \(\hat{\operatorname{var}}\left(\hat{\boldsymbol{\beta}}\right)={\left({\boldsymbol{X}}^{\prime }{\left({\hat{\boldsymbol{\Sigma}}}_{\boldsymbol{E}\times \boldsymbol{E}}+{\hat{\boldsymbol{\Psi}}}_{\boldsymbol{E}\times \boldsymbol{E}}\right)}^{1}\boldsymbol{X}\right)}^{1}.\)
Hypothesis 2: The functional connectivity of the ROI pairwise is significantly different between the case and the control groups. The following test statistic is:
A permutation test has been used to check the hypotheses while controlling type I error. In this regard, with each resampling of subjects between two groups, the interest parameters are estimated using the iterative algorithm of section. The values of the statistics for each permutation are then derived by replacing them with Eq. (5) and (6). In addition, the FDR method was used to adjust the pvalue of the permutation test since there was the high number of multiple comparisons in the second hypothesis. In both the simulation and real data assessments, the nominal level of statistical significance was set at α = 0.05. Figure 1 shows a process of the proposed method to determine differentially expressed FC patterns between two groups.
Simulation settings
In this section, simulation data were applied to assess the performance of the proposed method in terms of statistical power and type I error. The number of nodes V= 20, 25, and 30 with two latent clusters were considered to simulate the dependence structure between the edges. The correlation of the edges inside the cluster was ρ_{1} = ρ_{2} equal to the values 0.3, 0.5, and 0.7 as well as the correlation of the edges outside the cluster was ρ_{0} = 0. In the next step, the Uniform(−δ, δ) distribution was employed to create subjectspecific effects. The values for each subject were separately generated from this distribution and were added to the diagonal entries of the edges covariance matrix. Given that the δ parameter controls the degree of heterogeneity between subjects, δ= 0.15 and 0.3 were considered as low and high heterogeneity, respectively.
In the following, connectivity data of each group is generated using a multivariate normal distribution according to the features of the covariance matrix. For example, Fisher’s Ztransformed correlation is considered a connectivity metric.
The d vector reflects the differentially expressed FC network between groups so that an E × 1 zero vector is considered, and then d = 0.8 is randomly replaced for about 5% of zero values.
The results of simulation data were based on 100 iterations for each setting to estimate power and type I errors under the null hypothesis. For all tests, the permutation number was 500. The model performance was evaluated based on the amount of correlation between edges, heterogeneity, and the dimensions of communication data in the sample size of 10 and 25. The proposed methods were implemented with R software version 4.0.5 and MATLAB R2021b. The codes of the method are available upon request.
Application to autism data
Subjects
This study examined restingstate fMRI data of 25 men with autism)7.29–15.66 years old) and 25 healthy men (7.26–14.66 years old) with a full intelligence quotient (FIQ) score above 87. The data were matched on age (p = 0.272) and FIQ score (p = 0.659).
Data acquisition
The restingstate fMRI data of subjects were obtained from the ABIDE database gathered by NYU Langone Medical Center [31]. fMRI images were collected using a 3 T Siemens Allegra scanner for 6 minutes. Participants were asked to relax and stare at the white cross displayed in the middle of a screen with a black background. The following is the scan acquiring procedure: TR = 2000 ms, TE = 15 ms, 33 slices with thickness = 4.0 mm, FOV = 240 mm, flip angle = 90.
Data processing
The preprocessed data were selected from the ABIDE dataset at http://fcon_1000.projects.nitrc.org/indi/abide/ [32]. Preprocessing of fMRI scans was performed using SPM8 software. The first ten values of each time series were eliminated due to the correction of the initial scan heterogeneity and the adaptability of individuals to the surrounding states; hence, a total number of 170 values per individual was considered. Normalization of scans was performed to the MNI (Montreal Neurological Institute) space with a resolution of 3 × 3 × 3 mm^{3} and correction of head movement according to Friston 24parameter model [33] .According to the AAl (automated anatomical labelling) atlas, the preprocessed data were partitioned into 116 regions. [34]. The number of regions V = 30 affected by autism, including regions in the default mode network (DMN), were selected to compare data characteristics with simulation scenarios [35, 36].
Results
Simulation data
Table 1 displays the results of the simulation scenarios with ρ = 0.3. Based on the identity scaled structure, the power was above 0.90 in sample size N= 25, regardless of the heterogeneity amount and the number of regions. In the number of regions above 20 and the sample size of 10 and 25, the type I error rates formed on identities scaled and compound symmetry structures were close to 0.05. Based on the compound symmetry structure, the model had acceptable power values without affecting the degree of heterogeneity in the sample size N=25 and V > 20.
Table 2 shows the results of the simulation scenarios with ρ = 0.5. In a sample size N=25, the power based on both structures of betweensubject variability was greater than 0.8, regardless of the number of regions and heterogeneity. The model demonstrated acceptable values of power in the identity scaled structure when N=10 and V = 30. The type I error was close to 0.05 in most scenarios for the region numbers of 20 and 25.
Table 3 shows the results of the simulation scenarios with ρ = 0.7. Based on the identity scaled structure, the power was above 0.80, regardless of the number of regions, heterogeneity, and sample size. However, according to the compound symmetry structure, the model had lower power in most scenarios. In more than half of the simulation items, the type I error was less than or equal to 0.05.
In summary, the power based on identity scaled structure in a sample size of 25 exhibited values greater than 0.90 without influencing the degree of correlation, heterogeneity, and the number of regions. This index had values above 0.80 in the small sample size and ρ = 0.7. In most scenarios, the type I error was near to the nominal level. With increasing the number of regions in the sample N = 25 and ρ= 0.3,0.5, the power based on compound symmetry structure was greater than 0.80. When there was a high correlation and a small sample size, however, the model yielded lower power. In most simulation items with high heterogeneity, the type I error based on compound symmetry structure was near to the nominal level.
Autism data
The proposed model in the current study was used to compare brain connectivity patterns between healthy individuals and patients with autism. Firstly, the estimation correlation coefficient and clustering of brain regions were calculated using the nonparametric Bayesian method to consider the network topological features. In this method, the brain network regions were divided into four clusters, K = 4, which the largest cluster consisted of 14 regions. Then the covariance matrix with dimensions of 435 × 435 was obtained using the dependency structure of clusters. Following that, the FC patterns between the patient and healthy groups were compared by considering the effect of heterogeneity. Based on compound symmetry and identity scaled structures, the test statistic values were 20.28(p = 0.517) and 21.68(p = 0.457), respectively. These results indicate no significant difference in entire FC networks between patients and healthy individuals. Additionally, the Pard algorithm proposed by Chen et al. (2015) and the aSPU method proposed by Pan et al. (2014) were used to examine the differences in FC patterns [10, 37]. These models also displayed no significant FC networks between the two groups.
The FC comparisons of the pair ROIs were performed using test statistics of Eq. (6). Figure 2 depicts the different edges between the healthy and patient groups in three forms: axial, coronal, and sagittal views. In this chart, the increase of functional relationship of the healthy individuals compared to autism patients is manifested with a green edge, and the decrease of functional relationship is manifested with a yellow edge. The results showed out of 435, 18 edges were significant using both structures of variability. However, due to the high number of multiple comparisons, the pvalue was corrected using the FDR approach. Therefore, after pvalues adjustment, there was no substantial difference in connectivity between the two groups. Table 4 provides more details of connectivity changes between brain regions.
Discussion
Simulation data
To perform statistical inference on FC data, it seems necessary to consider the properties of heterogeneity between subjects and the dependency structure among edges. In previous studies, the spatial closeness between the regions defines the spatial structure [13,14,15,16, 38]. Accordingly, given that four regions define a pair of edges, it is difficult to consider the distance between the edges in statistical models. On the other hand, the dependency between the edges is not necessarily limited to the spatial adjacency of the four regions. It can be related to the network topological structure. Recently, Chen et al. (2020) proposed a nonparametric Bayesian model based on network topological features to model the dependency structure of edges. However, this model does not include the heterogeneity effect between subjects which leads to accurate estimation of the parameters. The statistical model proposed in the present study investigates the FC network equality between the patient and healthy groups by combining the nonparametric Bayesian model for estimating the dependency structure between the edges and the algorithm proposed by Fiecas et al. (2017) for considering the heterogeneity of subjects [23, 24].
Simulation data showed that the power improves by increasing the correlation between the edges, based on identity scaled matrix structure. Accordingly, the Bayesian model proposed by Chen et al. (2020) had high accuracy when the correlation was above 0.3 [23]. These results indicate that model performance increases when detecting the network topological structure at higher correlations.
In the compound symmetry structure, the power was affected by the number of regions, sample size, and the edges correlation. Based on compound symmetry and identity scaled structures, the type I error was near to the nominal level when data dimensions and heterogeneity were high. In this regard, Fiecas et al. (2017) presented a variance component model by applying heterogeneity between subjects and temporal dependency, whose performance was similar to the proposed model based on compound symmetry structure in terms of statistical power and type I error. However, in the proposed model, the power based on identity scaled structure yielded a higher value than the variance component model study without limiting the amount of heterogeneity and the number of regions in the sample size of 25 [24]. Accordingly, correct modelling of the dependency structure between the edges can improve the accuracy of estimating the model parameters and the power of the statistical test.
In FC studies, the scale of inference (i.e., at the edge, cluster or network scale) can significantly affect the results of statistical tests. It was recently shown that networkbased analysis improves the power to capture averagesize effects [39]. Consequently, it is important that the proposed model includes levels of inference from both the network and individual edges.
Another advantage of the proposed model compared to Fiecas et al.’s (2017) model is the improvement of computational power from 190 to 435 edges. However, since the interactions between the brain regions in high dimensions often show network topological features, applying more than 30 regions is one of the model’s limitations due to the high computational cost and time. Figure 3 shows the computational time of the permutation test in different dimensions and the sample size of 10 and 25 on a computer with an i8 CPU and 16G RAM. In order to avoid computing difficulties, structures with a small number of parameters, including the identity scaled and compound symmetry, were used in this study to estimate the betweensubject covariance. Therefore, any covariates were not considered in the model. On the other hand, Pard and aSPU models are some of the existing methods for evaluating FC networks, with no limitations in the dimensions of connectivity data [10, 37]. Since there are different patterns of functional connectivity for each subject [29], the heterogeneity effect between subjects is not included in these models.
Autism data
Estimating the FC patterns can provide a better understanding of the pathologies related to neurodegenerative diseases such as autism, and thus is essential in clinical research. The proposed statistical model in the current study was applied to restingstate fMRI data obtained on 25 patients with autism and 25 healthy individuals. There was no substantial difference in entire FC networks between the patient and healthy groups, which was consistent with the findings of the models proposed by Pan et al. (2014) and Chen et al. (2015) [10, 37]. These results contradict the results of previous studies examining the functional connectivity patterns of autistic patients compared to healthy individuals, which could be due to a large number of regions and the high sample size investigated in these studies [35, 40, 41]. The proposed model is applicable to the analysis of wholebrain connectivity data in practice. However, when there were more than 30 regions, the model’s only shortcoming was the high computation time (Fig. 3). The model can be used with large sample sizes as well. Nevertheless, in order to compare real data characteristics with the simulated scenarios, a sample size of 25 was considered for each patient and healthy group.
Another hypothesis in the proposed statistical model is the FC comparisons of pair regions. In this regard, the permutation test showed the difference in the functional connectivity of several regions between patients and healthy groups regardless of pvalue adjustment. Most of these changes were related to DMN network regions, including frontal superior medial, cingulum (anterior and posterior), praecuneus, and angular. For example, in the patient group, there was a connectivity decrease of the right posterior cingulate region with the left superior frontal (dorsolateral), left middle frontal regions, and the right anterior cingulate region with the left middle frontal (orbital part). On the other hand, there was an increase in the dependency amount of the right frontal superior medial with the left superior frontal gyrus (medial orbital), and the left angular with the left rectus. Accordingly, several studies have reported functional connectivity changes of DMN regions in autism patients. [42,43,44]. DMN is one of the most important brain networks whose function changes under the influence of neurological disorders such as autism. These functional changes have a substantial impact on cognitive functions. [45].
Conclusion
This study presents a new approach for determining differentially expressed FC patterns between two groups, in which heterogeneity between subjects and the structure of dependency among edges are simultaneously considered. Simulation data indicated the high power and nearnominal type I error rates in the proposed model. Additionally, the application of the model on real data related to autism was also evaluated, and there was no significant difference in the functional connectivity network between the patient and healthy groups.
Availability of data and materials
The corresponding author can provide the codes of the method upon reasonable request. The data presented in this study are available from the ABIDE database at http://preprocessedconnectomesproject.org/abide/.
Abbreviations
 fMRI:

Functional magnetic resonance imaging
 FC:

Functional connectivity
 BOLD:

Bloodoxygenleveldependent
 FDR:

False discovery rate
 ROI:

Region of interest
 MNI:

Montreal Neurological Institute
 AAl:

Automated anatomical labelling
 MCMC:

Markov chain Monte Carlo
 FIQ:

Full intelligence quotient
 DMN:

Default mode network
References
Greicius M. Restingstate functional connectivity in neuropsychiatric disorders. Curr Opin Neurol. 2008;21(4):424–30.
Weng SJ, Wiggins JL, Peltier SJ, Carrasco M, Risi S, Lord C, et al. Alterations of resting state functional connectivity in the default network in adolescents with autism spectrum disorders. Brain Res. 2010;1313:202–14.
Supekar K, Menon V, Rubin D, Musen M, Greicius MD. Network analysis of intrinsic functional brain connectivity in Alzheimer’s disease. PLoS Comput Biol. 2008;4(6):e1000100.
Wu T, Wang L, Chen Y, Zhao C, Li K, Chan P. Changes of functional connectivity of the motor network in the resting state in Parkinson’s disease. Neurosci Lett. 2009;460(1):6–10.
He Y, Evans A. Graph theoretical modeling of brain connectivity. Curr Opin Neurol. 2010;23(4):341–50.
Fornito A, Zalesky A, Breakspear M. Graph analysis of the human connectome: promise, progress, and pitfalls. Neuroimage. 2013;80:426–44.
Simpson SL, Moussa MN, Laurienti PJ. An exponential random graph modeling approach to creating groupbased representative wholebrain connectivity networks. Neuroimage. 2012;60(2):1117–26.
Bullmore E, Sporns O. Complex brain networks: graph theoretical analysis of structural and functional systems. Nat Rev Neurosci. 2009;10(3):186–98.
Simpson SL, Lyday RG, Hayasaka S, Marsh AP, Laurienti PJ. A permutation testing framework to compare groups of brain networks. Front Comput Neurosci. 2013;7:171.
Chen S, Kang J, Xing Y, Wang G. A parsimonious statistical method to detect groupwise differentially expressed functional connectivity networks. Hum Brain Mapp. 2015;36(12):5196–206.
Kim J, Wozniak JR, Mueller BA, Pan W. Testing group differences in brain functional connectivity: using correlations or partial correlations? Brain Connect. 2015;5(4):214–31.
Zalesky A, Fornito A, Bullmore ET. Networkbased statistic: identifying differences in brain networks. Neuroimage. 2010;53(4):1197–207.
Castruccio S, Ombao H, Genton MG. A scalable multiresolution spatiotemporal model for brain activation and connectivity in fMRI data. Biometrics. 2018;74(3):823–33.
Kang H, Ombao H, Fonnesbeck C, Ding Z, Morgan VL. A Bayesian double fusion model for restingstate brain connectivity using joint functional and structural data. Brain Connect. 2017;7(4):219–27.
Wang R, Albert KM, Taylor WD, Boyd BD, Blaber J, Lyu I, et al. A bayesian approach to examining default mode network functional connectivity and cognitive performance in major depressive disorder. Psychiatry Res Neuroimaging. 2020;301:111102.
Li F, Zhang T, Wang Q, Gonzalez MZ, Maresh EL, Coan JA. Spatial Bayesian variable selection and grouping for highdimensional scalaronimage regression. Ann Appl Stat. 2015;9(2):687–713.
Rothman AJ, Levina E, Zhu J. Generalized thresholding of large covariance matrices. J Am Stat Assoc. 2009;104(485):177–86.
Shen X, Pan W, Zhu Y. Likelihoodbased selection and sharp parameter estimation. J Am Stat Assoc. 2012;107(497):223–32.
Cui Y, Leng C, Sun D. Sparse estimation of highdimensional correlation matrices. Comput Stat Data Anal. 2016;93:390–403.
Bickel PJ, Levina E. Regularized estimation of large covariance matrices. Ann Stat. 2008;36(1):199–227.
Ravikumar P, Wainwright MJ, Raskutti G, Yu B. Highdimensional covariance estimation by minimizing ℓ1penalized logdeterminant divergence. Electron J Stat. 2011;5:935–80.
Chen S, Kang J, Xing Y, Zhao Y, Milton DK. Estimating large covariance matrix with network topology for highdimensional biomedical data. Comput Stat Data Anal. 2018;127:82–95.
Chen S, Xing Y, Kang J, Kochunov P, Hong LE. Bayesian modeling of dependence in brain connectivity data. Biostatistics. 2020;21(2):269–86.
Fiecas M, Cribben I, Bahktiari R, Cummine J. A variance components model for statistical inference on functional connectivity networks. Neuroimage. 2017;149:256–66.
Mueller S, Wang D, Fox MD, Yeo BTT, Sepulcre J, Sabuncu MR, et al. Individual variability in functional connectivity architecture of the human brain. Neuron. 2013;77(3):586–95.
Wang D, Liu H. Functional connectivity architecture of the human brain: not all the same. Neurosci. 2014;20(5):432–8.
Nawaz U, Lee I, Beermann A, Eack S, Keshavan M, Brady R Jr. Individual variation in functional brain network topography is linked to schizophrenia symptomatology. Schizophr Bull. 2021;47(1):180–8.
Price RB, Lane S, Gates K, Kraynak TE, Horner MS, Thase ME, et al. Parsing heterogeneity in the brain connectivity of depressed and healthy adults during positive mood. Biol Psychiatry. 2017;81(4):347–57.
DiLernia A, Quevedo K, Camchong J, Lim K, Pan W, Zhang L. Penalized modelbased clustering of fMRI data. arXiv Prepr arXiv201006408. 2020.
Press WH, Teukolsky SA, Vetterling WT, Flannery BP. Numerical recipes with source code CDROM 3rd edition: the art of scientific computing: Cambridge University Press; 2007.
Di Martino A, Yan CG, Li Q, Denio E, Castellanos FX, Alaerts K, et al. The autism brain imaging data exchange: towards a largescale evaluation of the intrinsic brain architecture in autism. Mol Psychiatry. 2014;19(6):659–67.
Craddock C, Benhajali Y, Chu C, Chouinard F, Evans A, Jakab A, et al. The neuro bureau preprocessing initiative: open sharing of preprocessed neuroimaging data and derivatives. Front Neuroinform. 2013;7.
Friston KJ, Williams S, Howard R, Frackowiak RSJ, Turner R. Movementrelated effects in fMRI timeseries. Magn Reson Med. 1996;35(3):346–55.
TzourioMazoyer N, Landeau B, Papathanassiou D, Crivello F, Etard O, Delcroix N, et al. Automated anatomical labeling of activations in SPM using a macroscopic anatomical parcellation of the MNI MRI singlesubject brain. Neuroimage. 2002;15(1):273–89.
Chen S, Xing Y, Kang J. Latent and abnormal functional connectivity circuits in autism spectrum disorder. Front Neurosci. 2017;11:125.
Xu S, Li M, Yang C, Fang X, Ye M, Wei L, et al. Altered functional connectivity in children with lowfunction autism spectrum disorders. Front Neurosci. 2019;13:806.
Pan W, Kim J, Zhang Y, Shen X, Wei P. A powerful and adaptive association test for rare variants. Genetics. 2014;197(4):1081–95.
AlMomani M, Hussein AA, Ahmed SE. Penalty and related estimation strategies in the spatial error model. Statistica Neerlandica. 2017;71(1):4–30.
Noble S, Mejia M, Zalesky A, Scheinost D. Leveling up: improving power in fMRI by moving beyond clusterlevel inference. BioRxiv. 2021.
Chen H, Nomi JS, Uddin LQ, Duan X, Chen H. Intrinsic functional connectivity variance and statespecific underconnectivity in autism. Hum Brain Mapp. 2017;38(11):5740–55.
Chen S, Kang J, Wang G. An empirical Bayes normalization method for connectivity metrics in resting state fMRI. Front Neurosci. 2015;9:316.
Jung M, Kosaka H, Saito DN, Ishitobi M, Morita T, Inohara K, et al. Default mode network in young male adults with autism spectrum disorder: relationship with autism spectrum traits. Mol Autism. 2014;5(1):1–11.
Monk CS, Peltier SJ, Wiggins JL, Weng SJ, Carrasco M, Risi S, et al. Abnormalities of intrinsic functional connectivity in autism spectrum disorders. Neuroimage. 2009;47(2):764–72.
Murdaugh DL, Shinkareva SV, Deshpande HR, Wang J, Pennick MR, Kana RK. Differential deactivation during mentalizing and classification of autism based on default mode network connectivity. PLoS One. 2012;7(11):e50064.
Padmanabhan A, Lynch CJ, Schaer M, Menon V. The default mode network in autism. Biol Psychiatry Cogn Neurosci Neuroimaging. 2017;2(6):476–86.
Acknowledgements
We would thank the ABIDE team for sharing the fMRI data.
Funding
None.
Author information
Authors and Affiliations
Contributions
FP, HA developed the statistical method. FP and SMT wrote the simulation computer code and performed the analysis of autism data. FP and NB interpreted the results. FP wrote the draft of the manuscript. HA, HD and NB reviewed and edited the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
Written informed consent was obtained for all participants, and experimental protocols were approved by the Institutional Review Board at the NYU Langone Medical Center. The autism dataset was deidentified in accordance with HIPAA (Health Insurance Portability and Accountability) guidelines, and no protected health information included. All methods were carried out in accordance with relevant guidelines and regulations. The Ethics Committee of Shahid Beheshti University of Medical Sciences accepted this research (IR.SBMU.RETECH.REC.1399.1140).
Consent for publication
Not applicable.
Competing interests
The authors declare that they have no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Pourmotahari, F., Doosti, H., Borumandnia, N. et al. Grouplevel comparison of brain connectivity networks. BMC Med Res Methodol 22, 273 (2022). https://doi.org/10.1186/s12874022017128
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12874022017128
Keywords
 Connectivity
 Subject heterogeneity
 fMRI
 Statistical power
 Type I error rate