Overcoming the problems caused by collinearity in mixed-effects logistic model: determining the contribution of various types of violence on depression in pregnant women

Khalili, Sanaz; Faradmal, Javad; Mahjub, Hossein; Moeini, Babak; Ezzati-Rastegar, Khadijeh

doi:10.1186/s12874-021-01325-7

Research
Open access
Published: 28 July 2021

Sanaz Khalili¹, Javad Faradmal², Hossein Mahjub², Babak Moeini³ & Khadijeh Ezzati-Rastegar⁴

BMC Medical Research Methodology volume 21, Article number: 154 (2021)

Abstract

Background

Collinearity is a common and problematic phenomenon in public health studies. It inflates the variance of estimators and reduces the power of tests, and it can occur in any model. In this study, a new ridge mixed-effects logistic model (RMELM) is proposed to overcome the consequences of collinearity in correlated binary responses.

Methods

Parameters were estimated by maximizing a penalized log-likelihood, combining the expectation maximization (EM) algorithm with gradient ascent and Fisher-scoring methods. A simulation study was performed to compare the new model with the mixed-effects logistic model (MELM). Mean square error, relative bias, empirical power, and variance of random effects were used to evaluate RMELM. In addition, the contributions of various types of violence and of an intervention to depression among pregnant women experiencing intimate partner violence (IPV) were analyzed with the new and the previous models.

Results

The simulation study showed that the mean square errors of the fixed effects were lower for RMELM than for MELM and that empirical power was higher. In the IPV data, inflation in the variance of the estimators due to collinearity was clearly visible in the MELM, and RMELM adjusted the variances.

Conclusions

According to the simulation results and the analysis of the IPV data, the new estimator is appropriate for dealing with collinearity in the modelling of correlated binary responses.

Introduction

Intimate partner violence (IPV) against women is one of the major public health challenges in the world [1]. IPV is categorized into mental, physical, sexual, and financial types [2]. IPV can cause physical problems including bruising, fractures, trauma, and various sexually transmitted infections. It can also cause mental health problems in women, such as depression, anxiety, and even suicide [3]. Many women experience depression during pregnancy, and the risk increases under IPV [4, 5]. Some interventions may help to reduce the odds of depression in pregnant women under IPV. The mixed-effects logistic model (MELM) is commonly used to analyze such longitudinal studies with binary responses. Usually, this method estimates the fixed parameters by maximum likelihood and uses adaptive Gauss-Hermite quadrature to approximate the integral over the random effects [6, 7]. Modeling of correlated binary responses may nevertheless suffer from problems such as collinearity [8].

Collinearity refers to a linear relationship between predictor variables. Inherent relationships between variables in the real world, small sample size, the design of the model, and shared trends in predictor variables can all cause collinearity [9]. What makes collinearity an important problem in modeling is its effect on the variance of the estimators. When there is collinearity, the determinant of X^TX, where X is the design matrix, becomes small, which inflates the variance of the estimators. Biased decisions about predictor variables and wide confidence intervals are further consequences. In addition, collinearity makes the effects of predictor variables hard to separate, so it may be difficult to evaluate the relative importance of each predictor variable [8, 10, 11].
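To make the variance inflation concrete, consider the simplest case of two standardized predictors with correlation ρ. Their correlation matrix has determinant 1 − ρ², and in ordinary least squares the variance of each coefficient is multiplied by 1/(1 − ρ²) relative to the uncorrelated case. The following minimal Python sketch tabulates both quantities for a few correlation levels. The function names are hypothetical, and the linear-model setting is an illustrative assumption rather than the mixed-effects logistic model studied here.

```python
def correlation_determinant(rho):
    """Determinant of the 2x2 correlation matrix [[1, rho], [rho, 1]]."""
    return 1.0 - rho ** 2


def variance_inflation(rho):
    """Variance inflation factor for two standardized predictors (OLS)."""
    return 1.0 / correlation_determinant(rho)


if __name__ == "__main__":
    # As rho approaches 1 the determinant shrinks and the variance grows.
    for rho in (0.0, 0.7, 0.8, 0.9, 0.95):
        print(f"rho={rho:4.2f}  det={correlation_determinant(rho):.4f}  "
              f"inflation={variance_inflation(rho):.2f}")
```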

There are some simple methods to deal with collinearity: dropping collinear variables, centering predictor variables, and using dimension reduction methods such as principal component analysis. Despite their simplicity, each of these has its own disadvantages [12, 13]. The ridge estimator is one of the methods that has shown a desirable effect against the consequences of collinearity. In this method, the log-likelihood is penalized with a ridge penalty. The ridge estimator adds a constant value to the main diagonal of X^TX, which imposes some bias on the estimator but decreases its variance. In effect, it is a tradeoff between bias and variance [11, 12, 14, 15].
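The stabilizing effect of adding a constant to the diagonal can be seen in a two-predictor toy case. For standardized predictors with correlation ρ, the eigenvalues of the correlation matrix are 1 + |ρ| and 1 − |ρ|, and adding a constant k to the diagonal shifts both by k. The sketch below (Python; the function name and the values of k are illustrative assumptions) shows how the ratio of largest to smallest eigenvalue drops once k is positive.

```python
def condition_number(rho, k=0.0):
    """Eigenvalue ratio of [[1, rho], [rho, 1]] + k * I.

    The unpenalized eigenvalues are 1 + |rho| and 1 - |rho|;
    the ridge constant k is added to both.
    """
    largest = 1.0 + abs(rho) + k
    smallest = 1.0 - abs(rho) + k
    return largest / smallest


if __name__ == "__main__":
    for k in (0.0, 0.05, 0.1, 0.5):
        print(f"k={k:4.2f}  condition number={condition_number(0.95, k):.2f}")
```

A smaller ratio means a better-conditioned matrix to invert, which is the source of the variance reduction; the price is the bias introduced by k.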

Various studies have compared the performance of ridge, lasso, and Firth penalties. For instance, studies have shown that problems can arise if the lasso penalty is applied instead of the ridge penalty in the presence of collinearity. The first problem is variable selection: when variables are collinear, the lasso method randomly removes one of them from the model. The second problem is prediction accuracy: the prediction accuracy of the lasso method is lower than that of ridge [16], and the mean square error of the ridge method is lower than that of the lasso method [17, 18]. Also, in the presence of separation, the Firth penalty leads to more accurate estimates than ridge [19].

MELM uses the maximum likelihood estimator (MLE) to estimate the fixed effects. Under collinearity, therefore, the variance of the estimator may be inflated and important variables may fail to reach significance. Given the growing number of studies with correlated binary responses, such as longitudinal and cluster studies, this paper proposes a ridge estimator for correlated binary responses based on the Fahrmeir and Tutz method [20, 21]. The details of the method and estimators are introduced in the Method section. The analysis of the IPV data and the simulation study are presented in the Numerical study section. Finally, the findings are discussed and conclusions drawn in the Discussion and conclusions section.

Method

Suppose y_ij denotes the jth observation for the ith individual, i=1,2,...,n and j=1,2,...,n_i. The MELM is defined as:

$$ \log\left(\frac{{\pi}_{ij}}{1-{{\pi}_{ij}}}\right) = \mathbf{x}_{ij}^{T} \pmb{\beta} + {\mathbf{z}^{T}_{ij}} \mathbf{b}_{i}, $$

(1)

where x_ij and z_ij are the covariate vectors for the fixed and random effects, respectively, for the jth observation of the ith individual. X and Z are the design matrices for the fixed and random effects. The vectors of fixed and random effects are denoted by β_p×1 and b_i. The penalized log-likelihood for model (1), with the Breslow and Clayton integral approximation, has the form (2).

$$ \begin{aligned} l^{\lambda}=& \sum_{i=1}^{n} \sum_{j=1}^{n_{i}}\left[y_{ij}\left(\mathbf{x}_{ij}^{T} \pmb{\beta} + \mathbf{z}_{ij}^{T} \mathbf{b}_{i}\right)-\log\left(1 + \exp\left(\mathbf{x}_{ij}^{T} \pmb{\beta} + \mathbf{z}_{ij}^{T} \mathbf{b}_{i}\right)\right)\right]\\ &- \lambda ~\pmb{\beta}^{T} \pmb{\beta} - \frac{1}{2} \sum_{i=1}^{n} \mathbf{b}_{i}^{T} Q^{-1} \mathbf{b}_{i}, \end{aligned} $$

(2)

where Q is the covariance matrix of the random effects and b^T=(b₁^T,b₂^T,...,b_n^T). The ridge shrinkage parameter is denoted by λ. The fixed and random effects are stacked into one vector, δ^T=(β^T,b^T), and all subsequent calculations are performed on this vector; $\hat {\pmb {\delta }}$ denotes the vector maximizing (2). Let A=[X,Z] and U=diag(0,...,0,Q⁻¹,...,Q⁻¹), so that U is a block-diagonal matrix with p zeros followed by n copies of the inverse covariance matrix. The Fisher information matrix, $F^{\lambda }(\hat {\pmb {\delta }})$, is then ${{F}^{\lambda }}(\hat {\pmb {\delta }})=A^{T} \hat {\Psi } (\hat {\pmb {\delta }}) A +U + \lambda ^{(s-1)}$, where $ \hat {\Psi }(\hat {\pmb {\delta }}) = D (\hat {\pmb {\delta }}) ~ \nu ^{-1}(\hat {\pmb {\delta }})~ D^{T}(\hat {\pmb {\delta }})$, $D (\hat {\pmb {\delta }}) =\frac {\partial h(\pmb {\eta })}{\partial {\pmb {\eta }}}$ with h the response (inverse link) function, and $\nu (\hat {\pmb {\delta }})=cov(y|\pmb \delta)$. Here, $\eta _{ij}= \mathbf {x}_{ij}^{T} \pmb {\beta } + {\mathbf {z}^{T}_{ij}} \mathbf {b}_{i} $, and ${\pmb {\eta }}^{T}=(\eta _{11},..., \eta _{1n_{1}},..., \eta _{n1},..., \eta _{nn_{n}})$. The Fisher information matrix has the form:

$${{{F}^{\lambda}}}(\hat{\pmb {\delta}})= \left[ \begin{array}{ccccc} F^{\lambda}_{{\pmb \beta \pmb\beta}} & F^{\lambda}_{{\pmb \beta 1}} & F^{\lambda}_{{\pmb \beta 2}} &...& F^{\lambda}_{{\pmb \beta n}} \\ F^{\lambda}_{1 {\pmb \beta }} & F^{\lambda}_{11}&&&0\\ F^{\lambda}_{2 {\pmb \beta }}&& F^{\lambda}_{22}\\ \vdots&&&\ddots&\\ F^{\lambda}_{n \pmb \beta}&0&&&F^{\lambda}_{nn} \end{array} \right] $$
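As a rough illustration of the penalized log-likelihood in (2), the following Python sketch evaluates it for the special case of a random-intercept model, so that z_ij = 1 and Q reduces to a scalar variance. The function name, the argument layout, and the restriction to a random intercept are assumptions made for this example; it is not the authors' R implementation.

```python
import math


def penalized_loglik(y, x, beta, b, q, lam):
    """Penalized log-likelihood of Eq. (2) for a random-intercept model.

    y[i][j] : binary response of individual i at occasion j
    x[i][j] : list of fixed-effect covariates for that observation
    beta    : fixed-effect coefficients
    b[i]    : random intercept of individual i (so z_ij = 1)
    q       : variance of the random intercepts (scalar Q)
    lam     : ridge shrinkage parameter lambda
    """
    total = 0.0
    for i, (y_i, x_i) in enumerate(zip(y, x)):
        for y_ij, x_ij in zip(y_i, x_i):
            eta = sum(x_k * b_k for x_k, b_k in zip(x_ij, beta)) + b[i]
            # log(1 + exp(eta)), written to stay stable for large |eta|
            log1pexp = max(eta, 0.0) + math.log1p(math.exp(-abs(eta)))
            total += y_ij * eta - log1pexp
    ridge = lam * sum(b_k * b_k for b_k in beta)         # lambda * beta'beta
    random_part = 0.5 * sum(b_i * b_i for b_i in b) / q  # 0.5 * b'Q^{-1}b
    return total - ridge - random_part


# Hypothetical toy data: 2 individuals, 2 occasions, 2 covariates.
y = [[1, 0], [0, 1]]
x = [[[1.0, 0.5], [0.2, -0.3]], [[0.1, 0.4], [-1.0, 0.9]]]
print(penalized_loglik(y, x, beta=[0.3, -0.2], b=[0.1, -0.1], q=1.0, lam=0.5))
```

Only the fixed effects are shrunk by λ; the random effects are penalized through Q, which is what allows the two parts to be updated with different tools.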

For optimization, the penalized log-likelihood is differentiated with respect to δ; the resulting score function is denoted by s^λ(δ):

$$ \mathbf{s}^{\lambda}(\pmb \delta) =\left\{ \begin{array}{l} \frac{\partial l^{\lambda}}{\partial \pmb {\beta}}= \sum_{i=1}^{n} \sum_{j=1}^{n_{i}}(y_{ij} \mathbf{x}_{ij} - \pi_{ij} \mathbf{x}_{ij})- 2\lambda \pmb \beta \\ \\ \frac{\partial l^{\lambda}}{\partial \mathbf{b}_{i}}= \sum_{j=1}^{n_{i}}(y_{ij} \mathbf{z}_{ij} - \pi_{ij} \mathbf{z}_{ij})- Q^{-1} \mathbf{b}_{i} \end{array} \right. $$

(3)

Here, two optimization methods are combined to increase convergence speed. For estimating δ, the gradient ascent and Fisher-scoring methods are used:

$$\hat{\pmb \delta}^{s} = \left\{ \begin{array}{l} \hat{\pmb \delta}^{s - 1} +{\vartheta}^{s - 1} ~\mathbf{s}^{\lambda}\left(\hat{\pmb \delta}^{s - 1}\right)\\ \\ \hat{\pmb \delta}^{s - 1} + \left({F^{\lambda}}\left(\hat{\pmb \delta}^{s - 1}\right) \right)^{- 1} ~\mathbf{s}^{\lambda}\left(\hat{\pmb \delta}^{s - 1}\right) \end{array}. \right. $$

where ϑ is the step size:

$$\vartheta=\frac{\left(\mathbf{s}^{\lambda}\left(\hat{\pmb \delta}\right)\right)^{T} ~~\mathbf{s}^{\lambda}(\hat{\pmb \delta}) } {\left(\mathbf{s}^{\lambda}\left(\hat{\pmb \delta}\right)\right)^{T} ~~F^{\lambda}(\hat{\pmb \delta}) ~~\mathbf{s}^{\lambda}(\hat{\pmb \delta}) }~. $$
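The two updates can be sketched on a small problem. The Python code below applies them to a ridge-penalized logistic regression with two fixed effects and no random effects, using the score X'(y − π) − 2λβ, the information X'WX + 2λI, and the step size ϑ = s's / (s'Fs). The toy data, the function names, and the schedule (one gradient-ascent step followed by Fisher-scoring iterations) are assumptions for illustration; the text does not spell out how the two updates are interleaved.

```python
import math


def sigmoid(t):
    return 1.0 / (1.0 + math.exp(-t))


def objective(X, y, beta, lam):
    """Ridge-penalized log-likelihood (fixed effects only)."""
    value = 0.0
    for x_i, y_i in zip(X, y):
        eta = x_i[0] * beta[0] + x_i[1] * beta[1]
        value += y_i * eta - math.log(1.0 + math.exp(eta))
    return value - lam * (beta[0] ** 2 + beta[1] ** 2)


def score_and_information(X, y, beta, lam):
    """Score s = X'(y - pi) - 2*lam*beta and F = X'WX + 2*lam*I (2x2)."""
    s = [-2.0 * lam * beta[0], -2.0 * lam * beta[1]]
    F = [[2.0 * lam, 0.0], [0.0, 2.0 * lam]]
    for x_i, y_i in zip(X, y):
        pi = sigmoid(x_i[0] * beta[0] + x_i[1] * beta[1])
        w = pi * (1.0 - pi)
        for a in range(2):
            s[a] += (y_i - pi) * x_i[a]
            for c in range(2):
                F[a][c] += w * x_i[a] * x_i[c]
    return s, F


def gradient_step(beta, s, F):
    """beta + theta * s with step size theta = s's / (s'Fs)."""
    Fs = [F[0][0] * s[0] + F[0][1] * s[1], F[1][0] * s[0] + F[1][1] * s[1]]
    theta = (s[0] ** 2 + s[1] ** 2) / (s[0] * Fs[0] + s[1] * Fs[1])
    return [beta[0] + theta * s[0], beta[1] + theta * s[1]]


def fisher_step(beta, s, F):
    """beta + F^{-1} s, using the closed-form inverse of a 2x2 matrix."""
    det = F[0][0] * F[1][1] - F[0][1] * F[1][0]
    d0 = (F[1][1] * s[0] - F[0][1] * s[1]) / det
    d1 = (F[0][0] * s[1] - F[1][0] * s[0]) / det
    return [beta[0] + d0, beta[1] + d1]


# Hypothetical toy data: two strongly correlated predictors.
X = [[-1.2, -1.0], [-0.8, -0.9], [-0.3, -0.1], [0.1, 0.2],
     [0.4, 0.3], [0.9, 1.1], [1.3, 1.2], [-0.4, -0.6]]
y = [0, 0, 1, 0, 1, 1, 1, 0]
lam = 0.5

beta = [0.0, 0.0]
s, F = score_and_information(X, y, beta, lam)
beta = gradient_step(beta, s, F)      # one gradient-ascent step
for _ in range(25):                   # then Fisher-scoring iterations
    s, F = score_and_information(X, y, beta, lam)
    beta = fisher_step(beta, s, F)
print(beta)
```

The gradient step needs only a matrix-vector product, while the Fisher step requires solving a linear system; mixing them trades cheap early progress against fast local convergence.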

To estimate the variance component, the EM algorithm is used. The variance estimate at iteration s is:

$$ \hat{Q}^{(s)} = \frac{1}{n}\sum_{i=1}^{n} \left(\hat{\mathbf{v}}_{ii}^{(s)} + \hat{\mathbf{b}}_{i}^{(s)} \left(\hat{\mathbf{b}}_{i}^{(s)} \right)^{T}\right), $$

(4)

where $\mathbf {v}_{ii}= {F^{\lambda }}_{ii}^{-1} + {F^{\lambda }}_{ii}^{-1} ~{F^{\lambda }}_{i \pmb \beta }\left ({F^{\lambda }}_{\pmb \beta \pmb \beta } - \sum _{k=1}^{n}{F^{\lambda }}_{\pmb \beta k} ~ { F^{\lambda }}^{-1}_{kk} {F^{\lambda }}_{k \pmb \beta }\right)^{-1} {F^{\lambda }}_{\pmb \beta i} ~{F^{\lambda }}_{ii}^{-1}$ is built from the blocks of the Fisher information matrix.
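For a scalar random effect, the update in (4) is simply the average of the posterior variance terms plus the squared random-effect estimates. A minimal Python sketch, with a hypothetical function name and made-up inputs:

```python
def update_variance(v, b):
    """EM update of Eq. (4) for a scalar random effect.

    v[i] : variance term v_ii of individual i
    b[i] : current estimate of the random effect b_i
    """
    if len(v) != len(b) or not v:
        raise ValueError("v and b must be non-empty and of equal length")
    return sum(v_i + b_i * b_i for v_i, b_i in zip(v, b)) / len(b)


if __name__ == "__main__":
    # Hypothetical values for two individuals.
    print(update_variance([0.2, 0.3], [1.0, -1.0]))
```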

Shrinkage parameter

The shrinkage parameter was obtained through $ \lambda =\prod \limits _{k=1}^{p} \left (\frac {1}{m_{k}}\right)^{\frac {1}{p}}$, where p is the number of predictor variables. Here, ${m_{k}}=\sqrt {\frac {{\hat {\sigma }}^{2}}{{{\hat {\alpha }_{k}}}^{2}}}$, $\hat {\alpha }_{k}$ is the kth element of $\gamma \hat {\pmb \beta }$, and γ is the matrix of eigenvectors such that $X^{T} \hat {W}X= \gamma ^{T} \Lambda \gamma $, where Λ is a diagonal matrix holding the eigenvalues of $X^{T} \hat {W}X$ [22, 23]. A previous study showed that this method works well in reducing MSE [24]. It also has a closed form, which saves computation time. It was therefore chosen as the estimator of the shrinkage parameter.
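Because 1/m_k = |α̂_k| / σ̂, the formula is the geometric mean of these p ratios. The Python sketch below computes it from a given vector of α̂_k and a given σ̂²; how these two inputs are obtained from the fitted model is not shown, and the function name and example values are assumptions.

```python
import math


def shrinkage_parameter(alpha_hat, sigma2_hat):
    """lambda = prod_k (1/m_k)^(1/p), with m_k = sqrt(sigma2_hat / alpha_k^2).

    Every element of alpha_hat must be nonzero.
    """
    p = len(alpha_hat)
    log_sum = 0.0
    for a in alpha_hat:
        m_k = math.sqrt(sigma2_hat / a ** 2)
        log_sum += math.log(1.0 / m_k)
    # Geometric mean computed on the log scale for numerical stability.
    return math.exp(log_sum / p)


if __name__ == "__main__":
    print(shrinkage_parameter([1.0, 4.0], sigma2_hat=1.0))
```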

Hypothesis testing about regression coefficients

To test regression coefficients obtained through maximum likelihood, the square roots of the main diagonal elements of the inverse Fisher information matrix can be used as standard errors of the regression coefficients. The test statistic is then:

$$ t=\frac{\hat{\beta}}{SE({\hat{\beta}})}. $$

This test statistic follows a t-distribution. For penalized maximum likelihood estimators, the test statistic no longer follows a t-distribution. Some studies have proposed a non-exact t-test for linear ridge regression and logistic ridge regression [25, 26]. For logistic ridge regression, the variance is:

$$\begin{aligned} Var\left(\hat{\beta}\right)=&Var\left[\left(X^{T}WX+2\lambda I\right)^{-1}X^{T}W \xi\right]\\=&{\left(\frac{\partial^{2} l}{\partial \beta \partial \beta^{T}}\right)}^{-1}I\left(\pmb \beta\right){\left(\frac{\partial^{2} l}{\partial \beta \partial \beta^{T}}\right)}^{-1}\\=& \left(X^{T}WX+2\lambda I\right)^{-1}\left(X^{T}WX\right)\left(X^{T}WX+2\lambda I\right)^{-1} \end{aligned} $$

where $ W=diag\left [\hat {\pi }_{i}\left (1-\hat {\pi }_{i}\right)\right ]$ is an n×n matrix and ξ is a vector whose ith element is $\xi _{i}=logit\left [\hat {\pi }_{i}\right ]+ \frac {y_{i}-\hat {\pi }_{i}}{\hat {\pi }_{i}\left (1-\hat {\pi }_{i}\right)}$. The test statistic is then:

$$t^{\lambda}=\frac{\hat{\beta_{k}}}{SE\left(\hat{\beta_{k}}\right)}. $$
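With a single predictor, the sandwich variance above collapses to a scalar, which makes the effect of λ easy to see. The Python sketch below evaluates it and the resulting non-exact t statistic. The function names and numbers are hypothetical, and the one-predictor simplification is an assumption made only to keep the example free of matrix algebra.

```python
import math


def ridge_variance(xwx, lam):
    """Scalar sandwich variance: (xwx + 2*lam)^-1 * xwx * (xwx + 2*lam)^-1."""
    return xwx / (xwx + 2.0 * lam) ** 2


def ridge_t_statistic(beta_hat, xwx, lam):
    """Non-exact t statistic: estimate divided by its standard error."""
    return beta_hat / math.sqrt(ridge_variance(xwx, lam))


if __name__ == "__main__":
    # The variance shrinks as the shrinkage parameter grows.
    for lam in (0.0, 0.5, 1.0):
        print(lam, ridge_variance(4.0, lam), ridge_t_statistic(0.5, 4.0, lam))
```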

In this study, the last step of each iteration for estimating the fixed effects uses Fisher scoring, so the variance used in the non-exact t-test is:

$$\begin{aligned} Var\left(\hat{\beta}\right)=\left[E\left(\frac{\partial^{2} l^{\lambda}}{\partial \beta \partial \beta^{T}}\right)\right]^{-1}~I(\pmb \beta)~~ \left[E\left(\frac{\partial^{2} l^{\lambda}}{\partial \beta \partial \beta^{T}}\right)\right]^{-1}=\left[{I\left(\pmb \beta\right)}\right]^{-1} \end{aligned} $$

Numerical study

Intimate partner violence

In this study, 150 pregnant women under IPV who attended health centers in the suburbs of Hamadan City (Hamadan Province, Iran) were selected. The study was approved by the ethics committee and was conducted in accordance with the Declaration of Helsinki. The women were assigned to control and intervention groups. For the intervention group, 5 public health education sessions were held by a clinical psychologist over 5 weeks. The program administered in the intervention group comprised identifying the factors causing IPV and how to manage it, forming support groups of participants, keeping participants in contact with the consultant, providing management solutions, improving participants' communication skills, giving out booklets containing conflict management techniques, giving gift cards, and providing a free counseling session for the women's husbands.

Before the study started, a general mental health questionnaire (GHQ) was given to all participants, who completed it again at the end of the study. After data collection, the aim was to determine the effectiveness of the intervention and the contribution of the various types of violence to psychological outcomes in these women. Depression is an important problem in these women and was taken as the response variable: women with depression were coded 1 and the others 0. The main aim of the analysis was therefore to assess the effectiveness of the intervention and the effect of the types of violence on depression.

First, the types of violence were collected in a matrix V, and its correlation matrix, cor(V), was obtained. The correlations in cor(V) are medium to high, and most of them are above 0.5, which is a warning sign of collinearity between these predictors [8]. To confirm the presence of collinearity, the condition index was computed. Its value was 9.8, indicating collinearity between these variables.

$$ cor(V)= \begin{array}{c|cccc} & \text{Financial} & \text{Sexual} & \text{Physical} & \text{Psychological} \\ \hline \text{Financial} & 1 & 0.47 & 0.60 & 0.62 \\ \text{Sexual} & & 1 & 0.73 & 0.51 \\ \text{Physical} & & & 1 & 0.64 \\ \text{Psychological} & & & & 1 \end{array} $$
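A condition index can be illustrated directly on the published correlations. The Python sketch below fills in the symmetric matrix cor(V) and computes the square root of the ratio of its largest to its smallest eigenvalue, using plain power iteration so that no external library is needed. The definition of the index used here and the function names are assumptions; the value reported in the text was presumably computed from the data themselves, so the number printed by this sketch need not coincide with it.

```python
import math

# Symmetric version of cor(V); order: Financial, Sexual, Physical, Psychological.
COR_V = [
    [1.00, 0.47, 0.60, 0.62],
    [0.47, 1.00, 0.73, 0.51],
    [0.60, 0.73, 1.00, 0.64],
    [0.62, 0.51, 0.64, 1.00],
]


def largest_eigenvalue(matrix, iterations=2000):
    """Largest eigenvalue of a symmetric PSD matrix by power iteration."""
    n = len(matrix)
    v = [float(k + 1) for k in range(n)]  # arbitrary non-symmetric start
    for _ in range(iterations):
        w = [sum(matrix[r][c] * v[c] for c in range(n)) for r in range(n)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    w = [sum(matrix[r][c] * v[c] for c in range(n)) for r in range(n)]
    return sum(v[r] * w[r] for r in range(n))  # Rayleigh quotient


def condition_index(cor):
    """sqrt(largest eigenvalue / smallest eigenvalue) of a correlation matrix."""
    n = len(cor)
    lam_max = largest_eigenvalue(cor)
    # The eigenvalues of (lam_max * I - cor) are lam_max - lam_i, so its
    # largest eigenvalue corresponds to the smallest eigenvalue of cor.
    shifted = [[(lam_max if r == c else 0.0) - cor[r][c] for c in range(n)]
               for r in range(n)]
    lam_min = lam_max - largest_eigenvalue(shifted)
    return math.sqrt(lam_max / lam_min)


if __name__ == "__main__":
    print(round(condition_index(COR_V), 2))
```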

For modeling, time, intervention, and types of violence were considered, so the design matrix was defined as X=[Intervention,Time,V]. The condition number of this matrix was 14.9, which shows that collinearity is a concern. First, MELM was fitted to these data without accounting for collinearity. Then, the proposed model was fitted.

To conduct the global test of the null hypothesis that all coefficients are simultaneously zero, the likelihood ratio test (LRT) was used. For these data in MELM, LRT=293.91 and p−value=0.009, which indicates that the coefficients are not all simultaneously zero. As shown in the first part of Table 1, because of collinearity between the predictors, none of the predictors is significant at the 5% significance level; only psychological violence had a significant effect on depression at the 10% level. The inflation in standard errors is quite obvious in Table 1. The second part of Table 1 shows the results of the proposed model: the standard errors of RMELM are lower than those of MELM, and all variables became significant. The estimated variance of the random effects was 1.12 in MELM and 1.26 in RMELM.

Table 1 The impact of types of violence and intervention on depression in IPV women

According to the results presented in Table 1, the odds of depression in the control group were 55% higher than in the intervention group. The odds of depression also decreased over time: the odds at time 1 were 2.3 times those at time 2. Among the types of violence, financial violence increased the odds of depression the most: an increase in financial violence multiplied the odds of depression by 2.37. Sexual violence came next, with an increase in sexual violence raising the odds of depression by 90%. An increase in physical violence raised the odds of depression by 47%, and an increase in psychological violence raised them by 17%. All of these factors were significant (p−value<0.0001).
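Statements of this kind follow from exponentiating a logistic regression coefficient: exp(β) is the odds ratio for a one-unit increase, and (exp(β) − 1) × 100 is the percentage change in the odds. The Python sketch below shows the conversion. The function names are hypothetical, and the coefficient used is simply the log of the reported ratio 1.55, not a value taken from Table 1.

```python
import math


def odds_ratio(coef):
    """Odds ratio for a one-unit increase in a predictor."""
    return math.exp(coef)


def percent_change_in_odds(coef):
    """Percentage change in the odds for a one-unit increase."""
    return (math.exp(coef) - 1.0) * 100.0


if __name__ == "__main__":
    coef = math.log(1.55)  # log-odds coefficient matching "55% higher odds"
    print(round(odds_ratio(coef), 2), round(percent_change_in_odds(coef), 1))
```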

Simulation study

To assess the performance of the proposed RMELM, a simulation study was designed and conducted under different settings. Sample size, degree of collinearity between predictors, and correlation between responses were varied in the simulation. Here, $\eta _{ij}=\mathbf {x}_{ij}^{T} \pmb {\beta } + \mathbf {z}^{T}_{ij}b_{i}$ was generated with true values β^T=(0.2,0.4,−0.3). Because the predictor variables must be collinear, the correlation between them was set to ρ=(0.7, 0.8, 0.9, 0.95). The predictor variables were generated through $x_{ijk}=(1-\rho)^{\frac {1}{2}} ~ a_{ijk} + \rho ^{1/2} ~ a_{ijk}$, where i=1,2,...,n, j=1,2, k=1,2, and the a_ijk were generated from the standard normal distribution. To investigate the effect of correlation between responses, the intraclass correlation coefficient (ICC) was set to ICC=(0.2, 0.5, 0.8). RMELM and MELM were compared; MELM was fitted with glmer in the lme4 package [27] in R. To assess the performance of these models, relative bias, mean square error (MSE), and empirical power for the fixed effects, and the variance of the random effects, were used.
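A data-generating step of this kind might be sketched as follows in Python. Several choices are assumptions, since the text does not fix them: the second term of the generating formula is taken to be a standard normal component shared by all predictors of an observation (which is what produces pairwise correlation ρ), one predictor is generated per coefficient, the random effect is a random intercept, and the ICC is mapped to the random-intercept variance on the latent logistic scale, ICC = σ² / (σ² + π²/3). The function names are hypothetical.

```python
import math
import random


def simulate(n, rho, icc, beta, n_times=2, seed=0):
    """Simulate correlated binary responses with collinear predictors.

    Returns a list of (individual, occasion, x, y) tuples.
    """
    rng = random.Random(seed)
    # Assumption: ICC on the latent logistic scale, solved for sigma2.
    sigma2 = icc / (1.0 - icc) * math.pi ** 2 / 3.0
    rows = []
    for i in range(n):
        b_i = rng.gauss(0.0, math.sqrt(sigma2))  # random intercept
        for j in range(n_times):
            shared = rng.gauss(0.0, 1.0)  # component common to all predictors
            x = [math.sqrt(1.0 - rho) * rng.gauss(0.0, 1.0)
                 + math.sqrt(rho) * shared for _ in beta]
            eta = sum(x_k * b_k for x_k, b_k in zip(x, beta)) + b_i
            prob = 1.0 / (1.0 + math.exp(-eta))
            y = 1 if rng.random() < prob else 0
            rows.append((i, j, x, y))
    return rows


def correlation(u, v):
    """Sample Pearson correlation of two equal-length sequences."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    var_u = sum((a - mu) ** 2 for a in u)
    var_v = sum((b - mv) ** 2 for b in v)
    return cov / math.sqrt(var_u * var_v)


if __name__ == "__main__":
    data = simulate(n=30, rho=0.9, icc=0.2, beta=(0.2, 0.4, -0.3))
    print(len(data), data[0])
```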

Discussion and conclusions

In this study, RMELM was introduced for correlated binary responses and compared with MELM. Table 2 shows the comparative results of MELM and RMELM in terms of MSE and relative bias. For β₁, at n = 30 and ICC = 0.2, the MSE of the fixed effect estimator in MELM increased with the correlation: it was 2.24 times larger at a correlation of 0.95 than at 0.7. At n = 50, the MSE of the fixed effect estimator in MELM was smaller than at n = 30, and at n = 100 it decreased further. The MSE of the fixed effect estimator in MELM increased with ICC; the increase at ICC = 0.8 compared to ICC = 0.2 was quite clear. The changes in MSE for β₂ in MELM were similar to those for β₁. The MSE of the fixed effect estimator in MELM was small for β₃ compared to β₁ and β₂. The median relative bias of the fixed effect estimator in MELM was 7.5%.
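The two criteria in Table 2 can be computed from the estimates collected over simulation replicates. A minimal Python sketch follows, assuming the usual definitions (mean squared deviation from the true value, and mean estimate minus true value divided by the true value); the text does not state the formulas, and the function names and numbers are hypothetical.

```python
def mean_square_error(estimates, true_value):
    """Average squared deviation of the estimates from the true value."""
    return sum((e - true_value) ** 2 for e in estimates) / len(estimates)


def relative_bias(estimates, true_value):
    """(mean estimate - true value) / true value."""
    mean_estimate = sum(estimates) / len(estimates)
    return (mean_estimate - true_value) / true_value


if __name__ == "__main__":
    # Hypothetical estimates of a coefficient whose true value is 0.2.
    replicates = [0.05, 0.31, 0.22, 0.18, 0.40]
    print(mean_square_error(replicates, 0.2), relative_bias(replicates, 0.2))
```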

Table 2 Comparison of MELM, and RMELM in terms of MSE (relative bias) for fixed effects

The variation of MSE for the fixed effect estimators in RMELM was quite different from that in MELM. At n = 30 and across ICCs, the MSE of the fixed effect estimator in RMELM did not increase with correlation but followed a relatively constant trend. At n = 30 and ICC = 0.2, for estimating β₁, the MSE of the estimator in MELM was 11.5, 13.8, 21.9, and 19.4 times that in RMELM. This difference in MSE between the two models grew as ICC increased. As the sample size increased, the MSE of the fixed effect estimator in RMELM decreased. The changes for β₂ were similar to those for β₁. The MSE of the fixed effect estimator for β₃ in RMELM was lower than that for β₁ and β₂, and lower than that in MELM. The median relative bias of the fixed effect estimator in RMELM was 50%.

Table 3 shows the empirical power of these estimators. The empirical power of MELM was very small for β₁ and β₂ and increased by about 0.2 with increasing sample size. The empirical power of MELM for β₃ was higher than for the other two parameters, and its maximum values were obtained at n = 100 and ICC = 0.2; for larger ICCs, it decreased. The empirical power of RMELM for β₁ and β₂ was greater than that of MELM. For instance, for β₁ at n = 30 and ICC = 0.2, the empirical power of RMELM was 16, 19.25, 15, and 18.75 times that of MELM. Table 4 provides estimates of the variance of the random effects for MELM and RMELM. As ICC increased, the variance of the random effects increased in both models. With increasing sample size, the variance decreased for MELM.
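Empirical power is the share of simulation replicates in which the test rejects the null hypothesis for a coefficient whose true value is nonzero. A minimal Python sketch, assuming the rejections are decided by comparing p-values with a 5% level (the function name and p-values are hypothetical):

```python
def empirical_power(p_values, alpha=0.05):
    """Proportion of replicates whose p-value falls below alpha."""
    return sum(1 for p in p_values if p < alpha) / len(p_values)


if __name__ == "__main__":
    # Hypothetical p-values from four replicates.
    print(empirical_power([0.01, 0.20, 0.04, 0.50]))
```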

Table 3 Comparison of MELM, and RMELM in terms of empirical power

Table 4 Comparison of MELM and RMELM in terms of variance component

In this study, two methods were used to investigate the effect of different types of violence on depression in pregnant women under IPV. Because of collinearity between the types of violence, in MELM none of the predictor variables was significant at the 5% significance level and only one was significant at the 10% level. With the new method, the effect of all types of violence (financial, sexual, physical, and psychological) on depression was significant. These findings illustrate how collinearity influences the results of longitudinal studies with binary responses. The results obtained with the new estimator are consistent with previous studies in this area. Financial violence has been shown to influence depression in Brazilian pregnant women [28]. Physical violence has also been shown to affect depression in married women in Korea [29]. A study conducted in Tanzania found that emotional, physical, and sexual violence affected women’s depression [30].

The results of the simulation study showed that the new model has a lower MSE for the fixed effects than MELM. The new model also increased the empirical power considerably. In the numerical study, inflation in the variance of the fixed effects was evident in MELM, and a better estimation was obtained using RMELM.

Availability of data and materials

The R code written for the current study is available from the corresponding author on reasonable request. Data sharing is not applicable to this article because of the ethically sensitive nature of the IPV data.

Abbreviations

MSE: Mean square error
MLE: Maximum likelihood estimator
MELM: Mixed effects logistic model
RMELM: Ridge mixed effects logistic model
IPV: Intimate partner violence
ICC: Intraclass correlation
EM: Expectation maximization
LRT: Likelihood ratio test

References

1. Bessa MMM, Drezett J, Rolim M, de Abreu LC. Violence against women during pregnancy: sistematized revision. Reprodução Climatério. 2014; 29(2):71–9.
2. Khayat S, Dolatian M, Navidian A, Mahmoodi Z, Kasaeian A. Association between physical and sexual violence and mental health in suburban women of Zahedan: a cross-sectional study. J Clin Diagn Res. 2017; 11(12):IC01–5.
3. Chisholm CA, Bullock L, Ferguson II JEJ. Intimate partner violence and pregnancy: epidemiology and impact. Am J Obstet Gynecol. 2017; 217(2):141–4.
4. Martin SL, Li Y, Casanueva C, Harris-Britt A, Kupper LL, Cloutier S. Intimate partner violence and women’s depression before and during pregnancy. Violence Against Women. 2006; 12(3):221–39.
5. Zlotnick C, Capezza NM, Parker D. An interpersonally based intervention for low-income pregnant women with intimate partner violence: a pilot study. Arch Womens Ment Health. 2011; 14(1):55–65.
6. Bates D, Mächler M, Bolker B, Walker S. Fitting Linear Mixed-Effects Models Using lme4. J Stat Softw Artic. 2015; 67(1):1–48.
7. Bates D. Computational methods for mixed models. 2011. https://cran.r-project.org/web/packages/lme4/vignettes/Theory.pdf. Accessed 7 July 2020.
8. Dormann CF, Elith J, Bacher S, Buchmann C, Carl G, Carré G, et al. Collinearity: a review of methods to deal with it and a simulation study evaluating their performance. Ecography. 2013; 36(1):27–46.
9. Gujarati DN, Porter DC. Basic Econometrics. New York: McGraw Hill Inc; 2009.
10. Hannah J. A geometric approach to determinants. Am Math Mon. 1996; 103(5):401–9.
11. Kutner MH, Nachtsheim CJ, Neter J, Li W, et al. Applied linear statistical models, vol 5. New York: McGraw-Hill Irwin; 2005.
12. Hoerl AE, Kennard RW. Ridge regression: biased estimation for nonorthogonal problems. Technometrics. 1970; 12(1):55–67.
13. Morzuch BJ, Ruark GA. Principal components regression to mitigate the effects of multicollinearity. For Sci. 1991; 37(1):191–9.
14. Schaefer RL. Alternative estimators in logistic regression when the data are collinear. J Stat Comput Simul. 1986; 25(1-2):75–91.
15. Melissa E, Ferguson J, Reilly MP, Foulkes AS. Ridge regression for longitudinal biomarker data. Int J Biostat. 2011; 7(1):1–11.
16. Zou H, Hastie T. Regularization and variable selection via the elastic net. J R Stat Soc Ser B Stat Methodol. 2005; 67(2):301–20.
17. Curtis SM, Ghosh SK. A Bayesian approach to multicollinearity and the simultaneous selection and clustering of predictors in linear regression. J Stat Theory Pract. 2011; 5(4):715–35.
18. Maxwell O, Chukwudike CN, Chinedu OV, Valentine CO, Paul OC. Comparison of Different Parametric Methods in Handling Critical Multicollinearity: Monte Carlo Simulation Study. Asian J Probab Stat. 2019; 3(2):1–16.
19. Rahman MS, Sultana M. Performance of Firth-and logF-type penalized methods in risk prediction for small or sparse binary data. BMC Med Res Methodol. 2017; 17(1):33.
20. Fahrmeir L, Tutz G. Multivariate statistical modelling based on generalized linear models. New York: Springer Science & Business Media; 2013.
21. Groll A, Tutz G. Variable selection for generalized linear mixed models by L1-penalized estimation. Stat Comput. 2014; 24(2):137–54.
22. Kibria BG, Månsson K, Shukur G. Performance of some logistic ridge regression estimators. Comput Econ. 2012; 40(4):401–14.
23. Kibria BG. Performance of some new ridge regression estimators. Commun Stat Simul Comput. 2003; 32(2):419–35.
24. Månsson K, Shukur G, Golam Kibria B. A simulation study of some ridge regression estimators under different distributional assumptions. Commun Stat Simul Comput. 2010; 39(8):1639–70.
25. Halawa A, El Bassiouni M. Tests of regression coefficients under ridge regression models. J Stat Comput Simul. 2000; 65(1-4):341–56.
26. Cule E, Vineis P, De Iorio M. Significance testing in ridge regression for genetic data. BMC Bioinformatics. 2011; 12(1):372.
27. Bates D, Sarkar D, Bates MD, Matrix L. The lme4 package. R Package Version. 2007; 2(1):74.
28. Lovisi GM, Lopez JRR, Coutinho ESF, Patel V. Poverty, violence and depression during pregnancy: a survey of mothers attending a public hospital in Brazil. Psychol Med. 2005; 35(10):1485.
29. Kim J, Lee J. Prospective study on the reciprocal relationship between intimate partner violence and depression among women in Korea. Soc Sci Med. 2013; 99:42–8.
30. Rogathi JJ, Manongi R, Mushi D, Rasch V, Sigalla GN, Gammeltoft T, et al. Postpartum depression among women who have experienced intimate partner violence: A prospective cohort study at Moshi, Tanzania. J Affect Disord. 2017; 218:238–45.

Acknowledgements

The authors would like to thank the Vice Chancellor for Research and Technology of Hamadan University of Medical Sciences.

Funding

The study was a part of PhD thesis of Sanaz Khalili, funded by the Vice Chancellor for Research and Technology of Hamadan University of Medical Sciences [grant No.9804253413].

Author information

Authors and Affiliations

Department of Biostatistics School of Public Health, Hamadan University of Medical Sciences, Hamadan, Iran
Sanaz Khalili
Department of Biostatistics School of Public Health, Modeling of Noncommunicable Diseases Research Center, Hamadan University of Medical Sciences, Hamadan, Iran
Javad Faradmal & Hossein Mahjub
Social Determinants of Health Research Center, Hamadan University of Medical Sciences, Hamadan, Iran
Babak Moeini
Health Education and Promotion, Department of Public Health, Hamadan University of Medical Sciences, Hamadan, Iran
Khadijeh Ezzati-Rastegar

Contributions

SKh and JF developed the statistical method. SKh wrote the R code, performed the analysis of the IPV data and the simulation study, interpreted the results, and wrote the draft of the manuscript. JF and HM reviewed and edited the manuscript. BM and KhE-R designed the numerical study and performed the intervention and data collection. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Javad Faradmal.

Ethics declarations

Ethics approval and consent to participate

The real data part of the study was granted ethical approval by the Medical Sciences and Research Ethics Committee of Hamadan University (1396.478). Written informed consent was obtained from all participants.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

About this article

Cite this article

Khalili, S., Faradmal, J., Mahjub, H. et al. Overcoming the problems caused by collinearity in mixed-effects logistic model: determining the contribution of various types of violence on depression in pregnant women. BMC Med Res Methodol 21, 154 (2021). https://doi.org/10.1186/s12874-021-01325-7

Received: 24 January 2021
Accepted: 21 May 2021
Published: 28 July 2021
DOI: https://doi.org/10.1186/s12874-021-01325-7
