Skip Navigation
Skip to contents

J Korean Acad Nurs : Journal of Korean Academy of Nursing

OPEN ACCESS

Articles

Page Path
HOME > J Korean Acad Nurs > Volume 43(2); 2013 > Article
Original Article
An Introduction to Logistic Regression: From Basic Concepts to Interpretation with Particular Attention to Nursing Domain
Hyeoun-Ae Park
Journal of Korean Academy of Nursing 2013;43(2):154-164.
DOI: https://doi.org/10.4040/jkan.2013.43.2.154
Published online: April 30, 2013

College of Nursing and System Biomedical Informatics National Core Research Center, Seoul National University, Seoul, Korea.

Address reprint requests to: Park, Hyeoun-Ae. College of Nursing, Seoul National University, 103 Daehak-ro, Jongno-gu, Seoul 110-799, Korea. Tel: +82-10-3786-3284, Fax: +82-2-765-4103, hapark@snu.ac.kr
• Received: March 19, 2013   • Accepted: April 2, 2013

© 2013 Korean Society of Nursing Science

  • 1,321 Views
  • 41 Download
  • 166 Crossref
  • 237 Scopus
prev next
  • Purpose
    The purpose of this article is twofold: 1) introducing logistic regression (LR), a multivariable method for modeling the relationship between multiple independent variables and a categorical dependent variable, and 2) examining use and reporting of LR in the nursing literature.
  • Methods
    Text books on LR and research articles employing LR as main statistical analysis were reviewed. Twenty-three articles published between 2010 and 2011 in the Journal of Korean Academy of Nursing were analyzed for proper use and reporting of LR models.
  • Results
    Logistic regression from basic concepts such as odds, odds ratio, logit transformation and logistic curve, assumption, fitting, reporting and interpreting to cautions were presented. Substantial shortcomings were found in both use of LR and reporting of results. For many studies, sample size was not sufficiently large to call into question the accuracy of the regression model. Additionally, only one study reported validation analysis.
  • Conclusion
    Nursing researchers need to pay greater attention to guidelines concerning the use and reporting of LR models.
Multivariable methods of statistical analysis commonly appear in general health science literature (Bagley, White, & Golomb, 2001). The terms "multivariate analysis" and "multivariable analysis" are often used interchangeably in the literature. In the strict sense, multivariate analysis refers to simultaneously predicting multiple outcomes and multivariable analysis uses multiple variables to predict a single outcome (Katz, 1999).
The multivariable methods explore a relation between two or more predictor (independent) variables and one outcome (dependent) variable. The model describing the relationship expresses the predicted value of the outcome variable as a sum of products, each product formed by multiplying the value and coefficient of the independent variable. The coefficients are obtained as the best mathematical fit for the specified model. A coefficient indicates the impact of each independent variable on the outcome variable adjusting for all other independent variables. The model serves two purposes: (1) it can predict the value of the dependent variable for new values of the independent variables, and (2) it can help describe the relative contribution of each independent variable to the dependent variable, controlling for the influences of the other independent variables. The four main multivariable methods used in health science are linear regression, logistic regression, discriminant analysis, and proportional hazard regression.
The four multivariable methods have many mathematical similarities but differ in the expression and format of the outcome variable. In linear regression, the outcome variable is a continuous quantity, such as blood pressure. In logistic regression, the outcome variable is usually a binary event, such as alive versus dead, or case versus control. In discriminant analysis, the outcome variable is a category or group to which a subject belongs. For only two categories, discriminant analysis produces results similar to logistic regression. In proportional hazards regression, the outcome variable is the duration of time to the occurrence of a binary "failure" event (for example, death) during a follow-up period of observation.
The logistic regression is the most popular multivariable method used in health science (Tetrault, Sauler, Wells, & Concato, 2008). In this article logistic regression (LR) will be presented from basic concepts to interpretation. In addition, the use of LR in nursing literature will be examined by comparing the actual use of LR with published criteria for use and reporting.
Logistic regression sometimes called the logistic model or logit model, analyzes the relationship between multiple independent variables and a categorical dependent variable, and estimates the probability of occurrence of an event by fitting data to a logistic curve. There are two models of logistic regression, binary logistic regression and multinomial logistic regression. Binary logistic regression is typically used when the dependent variable is dichotomous and the independent variables are either continuous or categorical. When the dependent variable is not dichotomous and is comprised of more than two categories, a multinomial logistic regression can be employed.
As an illustrative example, consider how coronary heart disease (CHD) can be predicted by the level of serum cholesterol. The probability of CHD increases with the serum cholesterol level. However, the relationship between CHD and serum cholesterol is nonlinear and the probability of CHD changes very little at the low or high extremes of serum cholesterol. This pattern is typical because probabilities cannot lie outside the range from 0 to 1. The relationship can be described as an 'S'-shaped curve. The logistic model is popular because the logistic function, on which the logistic regression model is based, provides estimates in the range 0 to 1 and an appealing S-shaped description of the combined effect of several risk factors on the risk for an event (Kleinbaum & Klein, 2010).
1. Odds
Odds of an event are the ratio of the probability that an event will occur to the probability that it will not occur. If the probability of an event occurring is p, the probability of the event not occurring is (1-p). Then the corresponding odds is a value given by
odds ofEvent=p1-pjkan-43-154-e001.jpg
Since logistic regression calculates the probability of an event occurring over the probability of an event not occurring, the impact of independent variables is usually explained in terms of odds. With logistic regression the mean of the response variable p in terms of an explanatory variable x is modeled relating p and x through the equation p=α+βx.
Unfortunately, this is not a good model because extreme values of x will give values of α+βx that does not fall between 0 and 1. The logistic regression solution to this problem is to transform the odds using the natural logarithm (Peng, Lee & Ingersoll, 2002). With logistic regression we model the natural log odds as a linear function of the explanatory variable:
(1)
logity=lnodds=lnp1-p=a+βχjkan-43-154-e002.jpg
where p is the probability of interested outcome and x is the explanatory variable. The parameters of the logistic regression are α and β. This is the simple logistic model.
Taking the antilog of equation (1) on both sides, one can derive an equation for the prediction of the probability of the occurrence of interested outcome as
p=PY=interested outcome/X=χ, a specific vlaue=ea+βχ1+ea+βχ=11+e-a+βχjkan-43-154-e003.jpg
Extending the logic of the simple logistic regression to multiple predictors, one may construct a complex logistic regression as
logity=lnp1-p=a+β1χ1++βkχkjkan-43-154-e004.jpg
Therefore,
p=PY=interested outcome/X1=χ1,Xk=χk=ea+β1χ1++βkχk1+ea+β1χ1++βkχk=11+e-a+β1χ1++βkχkjkan-43-154-e005.jpg
2. Odds ratio
The odds ratio (OR) is a comparative measure of two odds relative to different events. For two events A and B, the corresponding odds of A occurring relative to B occurring is
odds ratioA vs.B=oddsAoddsB=pA1-pApB1-pBjkan-43-154-e006.jpg
An OR is a measure of association between an exposure and an outcome. The OR represents the odds that an outcome (e.g. disease or disorder) will occur given a particular exposure (e.g. health behavior, medical history), compared to the odds of the outcome occurring in the absence of that exposure.
When a logistic regression is calculated, the regression coefficient (b1) is the estimated increase in the logged odds of the outcome per unit increase in the value of the independent variable. In other words, the exponential function of the regression coefficient (eb1) is the OR associated with a one-unit increase in the independent variable. The OR can also be used to determine whether a particular exposure is a risk factor for a particular outcome, and to compare the magnitude of various risk factors for that outcome. OR=1 indicates exposure does not affect odds of outcome. OR>1 indicates exposure associated with higher odds of outcome. OR<1 indicates exposure associated with lower odds of outcome. For example, the variable smoking is coded as 0 (=no smoking) and 1 (=smoking), and the odds ratio for this variable is 3.2. Then, the odds for a positive outcome in smoking cases are 3.2 times higher than in non-smoking cases.
Logistic regression is one way to generalize the OR beyond two binary variables (Peng & So, 2002). Suppose we have a binary response variable Y and a binary predictor variable X, and in addition we have other predictor variables Z1, …, Zk that may or may not be binary. If we use multiple logistic regression to regress Y on X, Z1, …, Zk, then the estimated coefficient β̂x for X is related to a conditional OR. Specifically, at the population level
eβ̂x=PY=1X=0,Z1,,Zk/PY=0X=0,Z1,,ZkPY=1X=0,Z1,,Zk/PY=0X=0,Z1,,Zkjkan-43-154-e007.jpg
so eβ̂x is an estimate of this conditional odds ratio. The interpretation of eβ̂x is as an estimate of the OR between Y and X when the values of Z1, …, Zk are held fixed.
3. The logistic curve
Logistic regression is a method for fitting a regression curve, y =ƒ(x), when y consists of binary coded (0, 1- -failure, success) data. When the response is a binary (dichotomous) variable and x is numerical, logistic regression fits a logistic curve to the relationship between x and y. Logistic curve is an S-shaped or sigmoid curve, often used to model population growth (Eberhardt & Breiwick, 2012). A logistic curve starts with slow, linear growth, followed by exponential growth, which then slows again to a stable rate.
A simple logistic function is defined by the formula
y=ex1+ex=11+e-xjkan-43-154-e008.jpg
which is graphed in Figure 1.
To provide flexibility, the logistic function can be extended to the form
y=ea+βχ1+ea+βχ=11+e-a+βχjkan-43-154-e009.jpg
where α and β determine the logistic intercept and slope.
Logistic regression fits α and β, the regression coefficients. Figure 1 shows logistic function when α and β are 0 and 1, respectively. The logistic or logit function is used to transform an 'S'-shaped curve into an approximately straight line and to change the range of the proportion from 0 - 1 to -∞ - +∞ as
logity=lnodds=lnp1-p=α+βχjkan-43-154-e010.jpg
where p is the probability of interested outcome, α is the intercept parameter, β is a regression coefficient, and χ is a predictor.
Logistic regression does not require many of the principle assumptions of linear regression models that are based on ordinary least squares method-particularly regarding linearity of relationship between the dependent and independent variables, normality of the error distribution, homoscedasticity of the errors, and measurement level of the independent variables. Logistic regression can handle non-linear relationships between the dependent and independent variables, because it applies a non-linear log transformation of the linear regression. The error terms (the residuals) do not need to be multivariate normally distributed-although multivariate normality yields a more stable solution. The variance of errors can be heteroscedastic for each level of the independent variables. Logistic regression can handle not only continuous data but also discrete data as independent variables.
However, some other assumptions still apply (Bewick, Cheek, & Ball, 2005; Peng & So, 2002): First, logistic regression requires the dependent variable to be discrete mostly dichotomous. Second, since logistic regression estimates the probability of the event occurring (P(Y=1)), it is necessary to code the dependent variable accordingly. That is the desired outcome should be coded to be 1. Third, the model should be fitted correctly. It should not be over fitted with the meaningless variables included. Also it should not be under fitted with meaningful variable not included. Fourth, logistic regression requires each observation to be independent. Also the model should have little or no multicollinearity. That is, independent variables are not linear functions of each other. Fifth, whilst logistic regression does not require a linear relationship between the dependent and independent variables, it requires that the independent variables are linearly related to the log odds of an event. Lastly, logistic regression requires large sample sizes because maximum likelihood estimates are less powerful than ordinary least squares used for estimating the unknown parameters in a linear regression model.
Logistic regression model corresponds to data from either a cross-sectional, prospective, or retrospective case-control study (Hsieh, Bloch & Larsen, 1998). In the cross-sectional studies a random sample is taken from a population, and outcome and explanatory variables are collected simultaneously. The fitted probabilities from a logistic regression model are then estimates of proportions of an outcome in the underlying population.
In the prospective studies, a set of subjects are selected and the explanatory variables are observed. Subjects are then followed over some standard period (e.g. a month or a year) or episode (hospital stay) to determine the response outcome. In this case, the fitted probabilities are estimates of the probability of the response outcomes occurring.
In the retrospective case-control studies, separate samples of case and control groups are first assembled and potential explanatory variables are collected later often through their recollections. In this case the fitted probabilities do not have a direct interpretation since they are determined by the relative sample sizes for case and control groups. However, odds ratios can be estimated based on logistic regression.
Sample size calculation for logistic regression is a complicated problem, because there are so many factors involved in determining sample size such as statistical power, number of parameters to estimate, percentage of 1's, effect size, and standard error. There are many researchers suggesting different methods to calculate the required sample size for logistic regression (Hosmer & Lemeshow, 2000; Hsieh et al., 1998).
Hsieh et al. (1998) proposed a sample size formula for a simple logistic regression with a continuous variable with a normal distribution:
n=Z1-α2+Z1-β2P11-P1β*2jkan-43-154-e011.jpg
where n is the required total sample size, β* is the effect size to be tested the null hypothesis H0 : β1= 0 against the alternative H1 : β1*, where β* ≠ 0, P1 is the overall event rate at the mean of X, and Zu is the upper uth percentiles of the standard normal distribution.
When the covariate is a binary variable, the sample size formula for the total sample size required for comparing two independent event rates has the following form
n=Z1-α/2P1-PB+Z1-βP11-P1+P21-P21-BB2P1-P221-Bjkan-43-154-e012.jpg
where P= (1-B)P1+BP2 is the overall event rate; B is the proportion of the sample with X=1; P1 and P2 are the event rates at X= 0 and X=1, respectively.
For multiple logistic regression, Peduzzi , Concato, Kemper, Holford, & Feinstein (1996) suggested a very simple guideline for a minimum number of cases for logistic regression study. Let p be the smallest of the proportions of negative or positive cases in the population and k the number of independent variables, then the minimum number of cases to include is:
N=10 k / p
For example, if there are 5 explanatory variables to include in the model and the proportion of positive cases in the population is 0.25 (25%). The minimum number of cases required is
N=10 × 5 / 0.25=200
Long (1997) suggested that if the resulting number is less than 100 it should be increased to 100.
Although logistic regression model, logit (y)=α+βχ looks similar to a simple linear regression model, the underlying distribution is binomial and the parameters, α and β cannot be estimated in the same way as for simple linear regression. Instead, the parameters are usually estimated using the method of maximum likelihood of observing the sample values (Menard, 2001). Maximum likelihood will provide values of α and β which maximize the probability of obtaining the data set. It requires iterative computing with computer software.
The likelihood function is used to estimate the probability of observing the data, given the unknown parameters (α and β). A "likelihood" is a probability that the observed values of the dependent variable may be predicted from the observed values of the independent variables. The likelihood varies from 0 to 1 like any other probabilities. Practically, it is easier to work with the logarithm of the likelihood function. This function is known as the log-likelihood. Log-likelihood will be used for inference testing when comparing several models. The log likelihood varies from 0 to -∞ (it is negative because the natural log of any number less than 1 is negative).
In logistic regression, we observe binary outcome and predictors, and we wish to draw inferences about the probability of an event in the population. Suppose in a population from which we are sampling, each individual has the same probability p, that an event occurs. For each individual in our sample of size n, Yi =1 indicates that an event occurs for the ith subject, otherwise, Yi =0. The observed data are Y1, …, Yn and X1, …, Xn
The joint probability of the data (the likelihood) is given by
L=i=1npy/xYi1-py/x1-Yi=py/xi=1nYi1-py/xn-i=1nYijkan-43-154-e013.jpg
Natural logarithm of the likelihood is
l=logL=i=1nYilogpy/x+n-i=1nYilog1-py/xjkan-43-154-e014.jpg
In which
py/x=ea+βχ1+ea+βχjkan-43-154-e015.jpg
Estimating the parameters α and β is done using the first derivatives of log-likelihood, and solving them for α and β. For this, iterative computing is used. An arbitrary value for the coefficients (usually 0) is first chosen. Then log-likelihood is computed and variation of coefficients values observed. Reiteration is then performed until maximization of l (equivalent to maximizing L). The results are the maximum likelihood estimates of α and β.
There are several parts involved in the evaluation of the logistic regression model. First, the overall model (relationship between all of the independent variables and dependent variable) needs to be assessed. Second, the importance of each of the independent variables needs to be assessed. Third, predictive accuracy or discriminating ability of the model needs to be evaluated. Finally, the model needs to be validated.
1. Overall model evaluation

1) The likelihood ratio test

Overall fit of a model shows how strong a relationship between all of the independent variables, taken together, and dependent variable is. It can be assessed by comparing the fit of the two models with and without the independent variables. A logistic regression model with the k independent variables (the given model) is said to provide a better fit to the data if it demonstrates an improvement over the model with no independent variables (the null model). The overall fit of the model with k coefficients can be examined via a likelihood ratio test which tests the null hypothesis
H0 : β12 =… =βk = 0.
To do this, the deviance with just the intercept (-2 log likelihood of the null model) is compared to the deviance when the k independent variables have been added (-2 log likelihood of the given model). Likelihood of the null model is the likelihood of obtaining the observation if the independent variables had no effect on the outcome. Likelihood of the given model is the likelihood of obtaining the observations with all independent variables incorporated in the model.
The difference of these two yields a goodness of fit index G, χ2 statistic with k degrees of freedom (Bewick, Cheek, & Ball, 2005). This is a measure of how well all of the independent variables affect the outcome or dependent variable.
G=χ2=(-2 log likelihood of null model)-(-2 log likelihood of given model) An equivalent formula sometimes presented in the literature is
=-2loglikelihood of the null modellikelihood of the given modeljkan-43-154-e016.jpg
where the ratio of the maximum likelihood is calculated before taking the natural logarithm and multiplying by -2. The term 'likelihood ratio test' is used to describe this test. If the p-value for the overall model fit statistic is less than the conventional 0.05, then reject H0 with the conclusion that there is evidence that at least one of the independent variables contributes to the prediction of the outcome.

2) Chi-Square Goodness of Fit Tests

With logistic regression, instead of R2 as the statistics for overall fit of the linear regression model, deviance between observed values from the expected values is used. In linear regression, residuals can be defined as yi-i, where yi is the observed dependent variable for the ith subject, and i the corresponding prediction from the model. The same concept applies to logistic regression, where yi is equal to either 1 or 0, and the corresponding prediction from the model is as
i=expα+β1xi1++βkxik1+expα+β1xi1++βkxikjkan-43-154-e017.jpg
Chi-square test can be based on the residuals, yi-i (Peng & So, 2002). A standardized residual can be defined as
ri=yi-ii1-ijkan-43-154-e018.jpg
where the standard deviation of the residuals is i(1-i). One can then form a χ2 statistic as
χ2=i=1nri2jkan-43-154-e019.jpg
This statistic follows a χ2 distribution with n-(k+1) degrees of freedom, so that p-values can be calculated.

3) Hosmer - Lemeshow test

The Hosmer-Lemeshow test is to examine whether the observed proportions of events are similar to the predicted probabilities of occurrence in subgroups of the model population. The Hosmer-Lemeshow test is performed by dividing the predicted probabilities into deciles (10 groups based on percentile ranks) and then computing a Pearson Chi-square that compares the predicted to the observed frequencies in a 2-by-10 table.
The value of the test statistics is
H=g=110Og-Eg2Egjkan-43-154-e020.jpg
where Og and Eg denote the observed events, and expected events for the gth risk decile group. The test statistic asymptotically follows a χ2 distribution with 8 (number of groups -2) degrees of freedom. Small values (with large p-value closer to 1) indicate a good fit to the data, therefore, good overall model fit. Large values (with p<.05) indicate a poor fit to the data. Hosmer and Lemeshow do not recommend the use of this test when there is a small n less than 400 (Hosmer & Lemeshow, 2000).
2. Statistical significance of individual regression coefficients
If the overall model works well, the next question is how important each of the independent variables is. The logistic regression coefficient for the ith independent variable shows the change in the predicted log odds of having an outcome for one unit change in the ith independent variable, all other things being equal. That is, if the ith independent variable is changed 1 unit while all of the other predictors are held constant, log odds of outcome is expected to change bi units. There are a couple of different tests designed to assess the significance of an independent variable in logistic regression, the likelihood ratio test and the Wald statistic (Menard, 2001).

1) Likelihood ratio test

The likelihood-ratio test used to assess overall model fit can also be used to assess the contribution of individual predictors to a given model. The likelihood ratio test for a particular parameter compares the likelihood of obtaining the data when the parameter is zero (L0) with the likelihood (L1) of obtaining the data evaluated at the MLE of the parameter. The test statistic is calculated as follows:
G=-2lnL0L1=-2lnL0-lnL1jkan-43-154-e021.jpg
This statistics is compared with a χ2 distribution with 1 degree of freedom. To assess the contribution of individual predictors one can enter the predictors hierarchically, then compare each new model with the previous model to determine the contribution of each predictor.

2) Wald statistic

The Wald statistic can be used to assess the contribution of individual predictors or the significance of individual coefficients in a given model (Bewick et al., 2005). The Wald statistic is the ratio of the square of the regression coefficient to the square of the standard error of the coefficient. The Wald statistic is asymptotically distributed as a Chi-square distribution.
Wj=βj2SEβj2jkan-43-154-e022.jpg
Each Wald statistic is compared with a Chi-square with 1 degree of freedom. Wald statistics are easy to calculate but their reliability is questionable.

3) Odds ratios with 95% CI

Odds ratio with 95% confidence interval (CI) can be used to assess the contribution of individual predictors (Katz, 1999). It is important to note however, that unlike the p value, the 95% CI does not report a measure's statistical significance. It is used as a proxy for the presence of statistical significance if it does not overlap the null value (e.g. OR=1).
The 95% CI is used to estimate the precision of the OR. A large CI indicates a low level of precision of the OR, whereas a small CI indicates a higher precision of the OR. An approximate confidence interval for the population log odds ratio is
95% CI for the In (OR)=In (OR)±1.96×{SE In (OR)}
where ln(OR) is the sample log odds ratio, and SE ln(OR) is the standard error of the log odds ratio(Morris & Gardner, 1988). Taking the antilog, we get the 95% confidence interval for the odds ratio:
95% CI for OR=eIn (OR)=In (OR)±1.96×{SE In (OR)}
3. Predictive Accuracy and Discrimination

1) Classification table

The classification table is a method to evaluate the predictive accuracy of the logistic regression model (Peng & So, 2002). In this table the observed values for the dependent outcome and the predicted values (at a user defined cut-off value) are cross-classified. For example, if a cutoff value is 0.5, all predicted values above 0.5 can be classified as predicting an event, and all below 0.5 as not predicting the event. Then a two-by-two table of data can be constructed with dichotomous observed outcomes, and dichotomous predicted outcomes.
The table has following form.
If the logistic regression model has a good fit, we expect to see many counts in the a and d cells, and few in the b and c cells. In an analogy with medical diagnostic testing, we can consider sensitivity=a/(a+b) and specificity =d/(c+d). Higher sensitivity and specificity indicate a better fit of the model.

2) Discrimination with ROC curves

Extending the above two-by-two table idea, rather than selecting a single cutoff, the full range of cutoff values from 0 to 1 can be examined. For each possible cutoff value, a two-by-two table can be formed. Plotting the pairs of sensitivity and one minus specificity on a scatter plot provides an ROC (Receiver Operating Characteristic) curve. The area under this curve (AUC) provides an overall measure of fit of the model (Bewick, Cheek, & Ball, 2004). The AUC varies from 0.5 (no predictive ability) to 1.0 (perfect predictive ability). Larger AUC indicates better predictability of the model. Points above the diagonal dividing the ROC space represent good classification results (better than random), while points below represent the poor results (worse than random).
4. Validation of the logistic regression
Logistic regression models are frequently used to predict a dependent variable from a set of independent variables. An important question is whether results of the logistic regression analysis on the sample can be extended to the population the sample has been chosen from. This question is referred as model validation. In practice, a model cab be validated by deriving a model and estimating its coefficients in one data set, and then using this model to predict the outcome variable from the second data set, then check the residuals, and so on.
When a model is validated using the data on which the model was developed, it is likely to be over-estimated. Thus, the validity of model should be assessed by carrying out tests of goodness of fit and discrimination on a different data set (Giancristofaro & Salmaso, 2003). If the model is developed with a sub sample of observations and validated with the remaining sample, it is called internal validation. The most widely used methods for obtaining a good internal validation are data-splitting, repeated data-splitting, jackknife technique and bootstrapping (Harrell, Lee, & Mark, 1996).
If the validity is tested with a new independent data set from the same population or from a similar population, it is called external validation. Obtaining a new data set allows us to check the model in a different context. If the first model fits the second data set, there is some assurance of generalizability of the model. However, if the model does not fit the second data, the lack of fit can be either due to the different contexts of the two data sets, or true lack of fit of the first model.
In presenting the logistic regression results, following four types of information should be included: a) an overall evaluation of the logistic model; b) statistical tests of individual predictors; c) goodness-of-fit statistics; and d) an assessment of the predicted probabilities. Table 2, 3, and 4 are examples to illustrate the presentation of these four types of information.
Table 2 presents the statistical significance of individual regression coefficients (βs) tested using the Wald Chi-square statistic. According to Table 2, Cholesterol was a significant predictor for event (p<.05). The slope coefficient 1.48 represents the change in the log odds for a one unit increase in cholesterol. The test of the intercept (p<.05) was significant suggesting that the intercept should be included in the model. Odd ratio 4.04 indicates that the odds for a event increase 4.04 times when the value of the cholesterol is increased by 1 unit.
Table 3 presents three inferential statistical tests for overall model evaluation: the likelihood ratio, score, and Wald tests. All three tests yield similar conclusions for the given data set, namely that given logistic model with independent variables was more effective than the null model. Table 3 also presents an inferential goodness-of-fit test, the Hosmer-Lemeshow test. Hosmer-Lemeshow test statistics 7.76 was insignificant (p>.05), suggesting that the model was fit to the data well.
Table 4 presents the degree to which predicted probabilities agree with actual outcomes in a classification table. The overall correct prediction, 66.84% shows an improvement over the chance level which is 50%. With the classification table, sensitivity, specificity, false positive and false negative can be measured. Sensitivity measures the proportion of correctly classified events, whereas specificity measures the proportion of correctly classified nonevents. The false positive measures the proportion of observations misclassified as events over all of those classified as events. The false negative therefore measures the proportion of observations misclassified as nonevents over all of those classified as nonevents.
In logistic regression no assumptions are made about the distributions of the independent variables. However, the independent variables should not be highly correlated with one another because this could cause problems with estimation.
Studies with small to moderate samples size employing logistic regression overestimate the effect measure. Thus, large sample sizes are required for logistic regression to provide sufficient numbers in both categories of the outcome variable. Also, the more independent variables were included, the larger the sample size is required. With small sample sizes, the Hosmer-Lemeshow test has low power and is unlikely to detect subtle deviations from the logistic model. Hosmer and Lemeshow recommend sample sizes greater than 400 (Hosmer & Lemeshow, 2000).
Original research articles containing explicit mention of LR were searched in the Journal of Korean Academy of Nursing published between 2010 and 2012. In total 23 articles were identified. There are 11 articles performed logistic regression with secondary data from large surveys, six articles from the cross-sectional survey, four articles from the prospective studies and two from the retrospective case-control studies. To evaluate the research reports, a list of criteria from Bagley et al.'s (2001) study was used. None of the studies reported any interaction and only one study reported a validation of the model with threefold cross validation technique, so those columns have been omitted.
Table 5 shows the adherence to the guidelines for using and reporting LR for each of the 23 articles. Although it is important the logistic regression model includes all relevant variables, it is also important the model not start with more variables than are justified for the given number of observations (Peduzzi et al., 1996). Peduzzi et al. suggested that the number of the less common of the two possible outcomes divided by the number of independent variables should be at least 10 as a useful rule of thumb. For analysis of 23 articles, the number of events-per-variable ranged from 0.7 to 16409. Sixteen of 23 of the analyses had an events-per-variable ratio above 10. Sample size calculation was mentioned in 11 articles. Five out of 11 used G*Power to calculate the sample size.
For best results from the use of logistic regression, any given change in a continuous independent variable should have the same effect on the log-odds of a positive outcome. However, in all the studies with continuous or ordinal independent variables, none tested the conformity with the linear gradient for continuous variables. As noted before, no interactions were reported in any of 23 studies. However, it is unclear whether interactions were considered but not found to be significant or whether interactions were not considered. If two highly correlated variables are included in the model, then their estimated contributions may be imprecise. Thus collinearity should be tested. However, only 5 of the 23 studies mentioned collinearity with the details of testing. Again it is not clear whether collinearities was considered but not found to be significant or whether collinearities were not considered. As noted before, only one study reported the model validation.
All of the studies reported measures of statistical significance, typically confidence intervals and P-values for each of the independent variables. Sometimes, these statistics were reported only for those variables found to be significant. The statistical significance for the entire model was reported for 13 of the 23 analyses. Goodness of fit measures describing how well the entire model matches the observed values were reported in 9 articles.
Nearly all the articles explained how variables were selected for inclusion in the model. Most of the articles selected variables based on the literature review. However, 15 of 23 reported performing the statistical tests (such as bivariate analyses) before considering the variable for the models. None of the articles provided complete details on the coding for all the variables. However, it was possible to infer the coding from the textual description in all cases. Eight studies explicitly stated the fitting procedure. In one study the variables included in the model was determined in hierarchically grouped subsets.
Logistic regression is a type of multivariable analyses used with increasing frequency in the nursing domain because of its ability to model the relationship between dichotomous dependent variable and one or more independent variables. In this paper, logistic regression from basic concepts to interpretation of analysis results was introduced. In addition, twenty-three articles published between 2010 and 2011 in the Journal of Korean Academy of Nursing were examined to see if logistic regressions were properly used and reported.
It was found that there were substantial shortcomings in the use and reporting of logistical regression results. Most notably, one-thirds of the studies had not enough sample size with events-per-variables ratios below 10, suggesting that the regression results may not be reliable. Only one study reported internal validation analysis. Validation is a crucial step to test the regression model captured essential relationships in the domain of study. Another problem is that only five studies reported tests for collinearity. If there is high correlation between two independent variables, variance of the coefficients of these variables will be increased, with a consequent loss of statistical significance. In addition, none reported tests for any conformity with the linear gradient for continuous independent variables, and tests for interactions. These problems represent failures to perform important aspect of the analysis or failures to report details of the analyses. It is hard to distinguish these two possibilities by reviewing published articles.
Thus, proper use and report of this powerful and sophisticated modeling technique requires considerable care both in the specification of the model and in the estimation and interpretation of the model's coefficients.
  • 1. Bagley SC, White H, Golomb BA. Logistic regression in the medical literature: Standards for use and reporting, with particular attention to one medical domain. J Clin Epidemiol. 2001;54(10):979–985.PubMed
  • 2. Bewick V, Cheek L, Ball J. Statistics review 13: Receiver operating characteristic curves. Crit Care. 2004;8(6):508–512. http://dx.doi.org/10.1186/cc3000.PubMedPMC
  • 3. Bewick V, Cheek L, Ball J. Statistics review 14: Logistic regression. Crit Care. 2005;9(1):112–118. http://dx.doi.org/10.1186/cc3045.PubMedPMC
  • 4. Eberhardt LL, Breiwick JM. Models for population growth curves. ISRN Ecol. 2012;2012:815016. http://dx.doi.org/doi:10.5402/2012/815016.
  • 5. Giancristofaro RA, Salmaso L. Model performance analysis and model validation in logistic regression. Statistica. 2003;63(2):375–396.
  • 6. Harrell FE Jr, Lee KL, Mark DB. Multivariable prognostic models: issues in developing models, evaluating assumptions and adequacy, and measuring and reducing errors. Stat Med. 1996;15:361–387.ArticlePubMed
  • 7. Hosmer DW, Lemeshow S. Applied logistic regression. 2000;2nd ed. New York, NY, John Wiley & Sons Inc.
  • 8. Hsieh FY, Bloch DA, Larsen MD. A simple method of sample size calculation for linear and logistic regression. Stat Med. 1998;17(14):1623–1634.ArticlePubMed
  • 9. Katz MH. Multivariable analysis: A practical guide for clinicians. 1999;Cambridge, Cambridge University Press.
  • 10. Kleinbaum DG, Klein M. Logistic regression(statistics for biology and health). 2010;3rd ed. New York, NY, Springer-Verlag New York Inc.
  • 11. Long JS. Regression models for categorical and limited dependent vriables. 1997;Thousand Oaks, CA, Sage Publications.
  • 12. Menard SW. Applied logistic regression analysis (quantitative applications in the social sciences). 2001;2nd ed. Thousand Oaks, CA, Sage Publications.
  • 13. Morris JA, Gardner MJ. Calculating confidence intervals for relative risks (odds ratios) and standardised ratios and rates. Br Med J (Clin Res Ed). 1988;296(6632):1313–1316.PubMedPMC
  • 14. Peduzzi P, Concato J, Kemper E, Holford TR, Feinstein AR. A simulation study of the number of events per variable in logistic regression analysis. J Clin Epidemiol. 1996;49(12):1373–1379.ArticlePubMed
  • 15. Peng CJ, Lee KL, Ingersoll GM. An introduction to logistic regression analysis and reporting. J Educ Res. 2002;96(1):3–14.Article
  • 16. Peng CJ, So TH. Logistic regression analysis and reporting: A primer. Underst Stat. 2002;1(1):31–70.Article
  • 17. Tetrault JM, Sauler M, Wells CK, Concato J. Reporting of multivariable methods in the medical literature. J Investig Med. 2008;56(7):954–957. http://dx.doi.org/10.231/JIM.0b013e31818914ff.ArticlePubMedPDF
Figure 1
Graph of logistic curve where α=0 and β=1.
jkan-43-154-g001.jpg
Table 1
Sample Classification Table
jkan-43-154-i001.jpg

a, b, c, and d are number of observations in the corresponding cells.

Table 2
Example Output from Logistic Regression: Statistical Tests of Individual Predictors
jkan-43-154-i002.jpg

CI=Confidence interval; df=Degrees of freedom; OR=Odds ratio; SE=Standard error.

Table 3
Example Output from Logistic Regression: Overall Model Evaluation and Goodness-of-Fit Statistics
jkan-43-154-i003.jpg
Table 4
Example Output from Logistic Regression: A Classification Table
jkan-43-154-i004.jpg

Sensitivity=3/(3+57)=5.00%; Specificity=124/(6+124)=95.48%;

False positive=6/(6+124)=4.62%; False negative=57/(3+57)=95.00%.

Table 5
Adherence to Guidelines for Using and Reporting Logistic Regression
jkan-43-154-i005.jpg

NR=Not reported; VIF=Variance inflation factor; AUC=Area under curve; ROC=Receiver operationg characteristic; AIC=Akaike's information criterion; LL=Log-likelihood CI=Confidence interval; df=Degrees of freedom; OR=Odds ratio; SE=Standard error.

Figure & Data

REFERENCES

    Citations

    Citations to this article as recorded by  
    • The development of classification-based machine-learning models for the toxicity assessment of chemicals associated with plastic packaging
      Md Mobarak Hossain, Kunal Roy
      Journal of Hazardous Materials.2025; 484: 136702.     CrossRef
    • Development of a predictive model for severe peripartum hemorrhage in placenta accreta spectrum cases under neuraxial anesthesia: a multicenter retrospective analysis
      Yanan Li, Liang Li, Xiao Song, Fanqing Meng, Meiling Zhang, Yarong Li, Ran Chu
      Therapeutic Advances in Reproductive Health.2025;[Epub]     CrossRef
    • Innovative Credit Risk Assessment: Leveraging Social Media Data for Inclusive Credit Scoring in Indonesia’s Fintech Sector
      Andry Alamsyah, Aufa Azhari Hafidh, Annisa Dwiyanti Mulya
      Journal of Risk and Financial Management.2025; 18(2): 74.     CrossRef
    • Machine Learning as a Tool for Assessment and Management of Fraud Risk in Banking Transactions
      Antonio Dichev, Silvia Zarkova, Petko Angelov
      Journal of Risk and Financial Management.2025; 18(3): 130.     CrossRef
    • MLP-SVM: a hybrid approach for improving the performance of the classification model for health-related documents from social media using multi-layer perceptron and support vector machine
      Etana Fikadu, Teklu Urgessa, Mrinal Das
      Discover Applied Sciences.2025;[Epub]     CrossRef
    • Armed conflict and household food insecurity: impacts and coping strategies in the conflict-affected rural settings of Tigray, Ethiopia
      Hafte Gebreselassie Gebrihet, Yibrah Hagos Gebresilassie
      Cogent Social Sciences.2025;[Epub]     CrossRef
    • Employing Binary Logistic Regression in Modeling the Effectiveness of Agricultural Extension in Clove Farming: Facts and Findings from Sidrap Regency, Indonesia
      Hasim Hasim, Muslim Salam, Andi Amran Sulaiman, Muhammad Hatta Jamil, Hari Iswoyo, Pipi Diansari, Ariady Arsal, Andi Nixia Tenriawaru, Akhsan Akhsan, Ahmad Imam Muslim
      Sustainability.2025; 17(6): 2786.     CrossRef
    • Sociodemographic Characteristics of Internationally Educated Nurses Associated With Successful Outcomes in Canada: Quantitative Analysis
      Nasrin Alostaz, Jiajie Mo, Margaret Walton‐Roberts, Ruth Chen, Maria Pratt, Olive Wahoush
      Journal of Advanced Nursing.2024;[Epub]     CrossRef
    • Longitudinal Risk Analysis of Second Primary Cancer after Curative Treatment in Patients with Rectal Cancer
      Jiun-Yi Hsia, Chi-Chang Chang, Chung-Feng Liu, Chia-Lin Chou, Ching-Chieh Yang
      Diagnostics.2024; 14(13): 1461.     CrossRef
    • MODELLING THE FACTORS AFFECTING CRASH OCCURRENCE AND FREQUENCY RESULTED FROM MOBILE PHONE USE WHILE DRIVING: EVIDENCE FROM AL-NAJAF, IRAQ
      Firas Asad, Shaimaa Hadi
      Kufa Journal of Engineering.2024; 15(2): 74.     CrossRef
    • Factors That Affect Rural Financial Inclusion: A Case of Gambella Town, Gambell Peoples National Regional State, Ethiopia
      Chuol Jock Ruey
      KnE Social Sciences.2024;[Epub]     CrossRef
    • Learning strategies, self-efficacy beliefs and academic achievement of first-year preservice teachers: a person-centred approach
      Henna Vilppu, Eero Laakkonen, Anu Laine, Marko Lähteenmäki, Riitta-Leena Metsäpelto, Mirjamaija Mikkilä-Erdmann, Anu Warinowski
      European Journal of Psychology of Education.2024; 39(2): 1161.     CrossRef
    • Landscape dynamics and its related factors in the Citarum River Basin: a comparison of three algorithms with multivariate analysis
      Moh. Dede, Sunardi Sunardi, Kuok-Choy Lam, Susanti Withaningsih, Hendarmawan Hendarmawan, Teguh Husodo
      Geocarto International.2024;[Epub]     CrossRef
    • Analysis in Terms of Environmental Awareness of Farmers’ Decisions and Attitudes: Reducing Pesticide Use and Risks
      Ismail Bulent Gurbuz
      Sustainability.2024; 16(11): 4323.     CrossRef
    • Assessment of pedestrians’ red light violation behavior at signalized crosswalks in Kathmandu, Nepal
      Deepak Raj Shah, Rojee Pradhananga
      Transportation Research Interdisciplinary Perspectives.2024; 24: 101035.     CrossRef
    • Financial Stress, Family, Marital, and Life Satisfaction of Turkish Families During Covid-19: What Did We Learn?
      Selda Coşkuner Aktaş, Birgül Çiçek
      Journal of Family Issues.2024; 45(4): 1044.     CrossRef
    • Logistic regression and other statistical tools in diagnostic biomarker studies
      Dina Mohamed Ahmed Samir Elkahwagy, Caroline Joseph Kiriacos, Manar Mansour
      Clinical and Translational Oncology.2024; 26(9): 2172.     CrossRef
    • Machine learning applications in vadose zone hydrology: A review
      Xiang Li, John L. Nieber, Vipin Kumar
      Vadose Zone Journal.2024;[Epub]     CrossRef
    • Improving dengue fever predictions in Taiwan based on feature selection and random forests
      Chao-Yang Kuo, Wei-Wen Yang, Emily Chia-Yu Su
      BMC Infectious Diseases.2024;[Epub]     CrossRef
    • A logit model approach to the examination of parents’ willingness to contribute towards the cost of senior high school education: The case of parents in Tamale, Ghana
      William Kofi Nkegbe
      Social Sciences & Humanities Open.2024; 9: 100832.     CrossRef
    • PET radiomics-based lymphovascular invasion prediction in lung cancer using multiple segmentation and multi-machine learning algorithms
      Seyyed Ali Hosseini, Ghasem Hajianfar, Pardis Ghaffarian, Milad Seyfi, Elahe Hosseini, Atlas Haddadi Aval, Stijn Servaes, Mauro Hanaoka, Pedro Rosa-Neto, Sanjeev Chawla, Habib Zaidi, Mohammad Reza Ay
      Physical and Engineering Sciences in Medicine.2024; 47(4): 1613.     CrossRef
    • Boosting Institutional Identity on X Using NLP and Sentiment Analysis: King Faisal University as a Case Study
      Khalied M. Albarrak, Shaymaa E. Sorour
      Mathematics.2024; 12(12): 1806.     CrossRef
    • Faktor-Faktor yang Mempengaruhi Hasil Belajar Statistika Mahasiswa melalui Pemodelan Regresi Logistik Biner
      Filda Febrinita, Wahid Ibnu Zaman, Wahyu Dwi Puspitasari
      Kognitif: Jurnal Riset HOTS Pendidikan Matematika.2024; 4(1): 523.     CrossRef
    • What made you do moonlighting? A tale from the Indonesian labor force survey
      Padang Wicaksono, Dewi Kartika Sari, Reinaldy Sutanto
      Cogent Social Sciences.2024;[Epub]     CrossRef
    • CREDIT RISK ANALYSIS: AN ASSESSMENT OF THE PERFORMANCE OF SIX MACHINE LEARNING TECHNIQUES IN CREDIT SCORING MODELLING
      Abba Bello Muhammad, Ishaq O. Olawoyin, Abubakar Yahaya, S. U. Gulumbe, Abdullahi. A. Muhammad, Iliyasu Abubakar Salisu
      FUDMA JOURNAL OF SCIENCES.2024; 8(6): 163.     CrossRef
    • Reliable water quality prediction and parametric analysis using explainable AI models
      M. K. Nallakaruppan, E. Gangadevi, M. Lawanya Shri, Balamurugan Balusamy, Sweta Bhattacharya, Shitharth Selvarajan
      Scientific Reports.2024;[Epub]     CrossRef
    • Modeling the drivers of large herbivore distribution in human‐dominated southern African savannas
      Dionísio V. Roque, Thomas Göttert, Ulrich Zeller, Valério A. Macandza
      Ecosphere.2024;[Epub]     CrossRef
    • Antecedents of Savings Behaviour Among Rural Households: A Holistic Approach
      Mohd Abass Bhat, Geleta Demera Gomero, Shagufta Tariq Khan
      FIIB Business Review.2024; 13(1): 56.     CrossRef
    • The causal-effect model of input factor allocation on maize production: Using binary logistic regression in search for ways to be more productive
      Muslim Salam, Rusli M. Rukka, Muh An-Nashrullah K. Samma, A. Nixia Tenriawaru, Rahmadanih, Ahmad Imam Muslim, Hamed Noralla Bakheet Ali, Muhammad Ridwan
      Journal of Agriculture and Food Research.2024; 16: 101094.     CrossRef
    • Sensing Biomechanical Alterations in Red Blood Cells of Type 1 Diabetes Patients: Potential Markers for Microvascular Complications
      Riccardo Di Santo, Benedetta Niccolini, Alessandro Rizzi, Laura Bertini, Denise Pires Marafon, Maria Vaccaro, Federica Cristallo, Enrico Rosa, Linda Tartaglione, Laura Leo, Marco De Spirito, Gabriele Ciasca, Dario Pitocco
      Biosensors.2024; 14(12): 587.     CrossRef
    • Evaluating Rural Land Administration Institutional Efficiency from Modern Land Administration Systems Perspective: In Case of Awi Zone, North-Western Ethiopia
      Dereje Shiferaw, Mihret Yeneneh, Teha Benti
      Journal of Public Policy and Administration.2024; 8(3): 88.     CrossRef
    • Predicting increments in heavy metal contamination in farmland soil
      Jieh-Haur Chen, Meng-Fen Yeh, Jui-Pin Wang, Hsi-Hsien Wei
      Environment, Development and Sustainability.2024;[Epub]     CrossRef
    • Investigation of Risk Factors Associated with the African Swine Fever Outbreaks in the Nizhny Novgorod Region of Russia, 2011–2022
      Olga I. Zakharova, Andrey A. Blokhin, Ivan V. Yashin, Olga A. Burova, Denis V. Kolbasov, Fedor I. Korennoy, Jordi Casal
      Transboundary and Emerging Diseases.2023; 2023: 1.     CrossRef
    • Determinants of bamboo processors' utilization level: The case of Bahir Dar city and Injibara town, Ethiopia
      Tsadiku Setegne Dessie, Ahmed Mohammed Yimer, Mohammed Yimam Ali
      Advances in Bamboo Science.2023; 5: 100043.     CrossRef
    • Comprehensive breathing variability indices enhance the prediction of extubation failure in patients on mechanical ventilation
      Qing Pan, Haoyuan Zhang, Mengting Jiang, Gangmin Ning, Luping Fang, Huiqing Ge
      Computers in Biology and Medicine.2023; 153: 106459.     CrossRef
    • A Survey on Machine Learning in Hardware Security
      Troya Çağıl Köylü, Cezar Rodolfo Wedig Reinbrecht, Anteneh Gebregiorgis, Said Hamdioui, Mottaqiallah Taouil
      ACM Journal on Emerging Technologies in Computing Systems.2023; 19(2): 1.     CrossRef
    • Machine learning facilitated structural activity relationship approach for the discovery of novel inhibitors targeting EGFR
      Rekha Choudhary, Vinayak Walhekar, Amol Muthal, Dilip Kumar, Chandrakant Bagul, Ravindra Kulkarni
      Journal of Biomolecular Structure and Dynamics.2023; 41(22): 12445.     CrossRef
    • What Factors Influence Consumers to Buy Green Products? An Analysis through the Motivation–Opportunity–Ability Framework and Consumer Awareness
      Gizem Yener, Arzu Secer, Pascal L. Ghazalian
      Sustainability.2023; 15(18): 13872.     CrossRef
    • Relationships between Health Education, Health Behaviors, and Health Status among Migrants in China: A Cross-Sectional Study Based on the China Migrant Dynamic Survey
      Minji Kim, Hai Gu
      Healthcare.2023; 11(12): 1768.     CrossRef
    • Multi-View Learning to Unravel the Different Levels Underlying Hepatitis B Vaccine Response
      Fabio Affaticati, Esther Bartholomeus, Kerry Mullan, Pierre Van Damme, Philippe Beutels, Benson Ogunjimi, Kris Laukens, Pieter Meysman
      Vaccines.2023; 11(7): 1236.     CrossRef
    • Stacked Generalization Architecture for Predicting Publisher Behaviour from Highly Imbalanced User-Click Data Set for Click Fraud Detection
      Deepti Sisodia, Dilip Singh Sisodia
      New Generation Computing.2023; 41(3): 581.     CrossRef
    • Textual Feature Extraction Using Ant Colony Optimization for Hate Speech Classification
      Shilpa Gite, Shruti Patil, Deepak Dharrao, Madhuri Yadav, Sneha Basak, Arundarasi Rajendran, Ketan Kotecha
      Big Data and Cognitive Computing.2023; 7(1): 45.     CrossRef
    • Prevalence and Risk Factors for Poor Mental Health and Suicidal Ideation in the Nigerian Construction Industry
      Janet M. Nwaogu, Albert P. C. Chan, Riza Yosia Sunindijo, Amos Darko, Jackie Y. Yang, Dauda Salihu
      Journal of Construction Engineering and Management.2023;[Epub]     CrossRef
    • A comprehensive analysis for classification and regression of surface points based on geodesics and machine learning algorithms
      Vahide Bulut
      Engineering Computations.2023; 40(9/10): 2270.     CrossRef
    • Factors Influencing the Success of High-flow Nasal Cannula Treatment Applied to Respiratory Distress Patients with Hematologic Neoplasms
      Eun Hee Kim, Dong Yeon Kim, Yang Sook Yoo
      Asian Oncology Nursing.2023; 23(4): 185.     CrossRef
    • Data Balancing Techniques for Predicting Student Dropout Using Machine Learning
      Neema Mduma
      Data.2023; 8(3): 49.     CrossRef
    • Prevalence and severity of temporomandibular joint disorder in partially versus completely edentulous patients: A systematic review
      Pragati Rawat, Deepesh Saxena, Pratiksha A. Srivastava, Abhinav Sharma, Arka Swarnakar, Aditya Sharma
      The Journal of Indian Prosthodontic Society.2023; 23(3): 218.     CrossRef
    • An Explainable Hybrid Intelligent System for Prediction of Cardiovascular Disease
      Annwesha Banerjee Majumder, Somsubhra Gupta, Dharmpal Singh, Sourav Majumder
      Journal of Mines, Metals and Fuels.2023; : 687.     CrossRef
    • Asset Management of Wastewater Interceptors Adjacent to Bodies of Water
      Mohammad Damen Bani Fawwaz, Mohammad Najafi, Vinayak Kaushal
      Water.2023; 15(23): 4176.     CrossRef
    • Comparison of Machine Learning Strategies in Hazardous Asteroids Prediction
      Yao Wang
      Highlights in Science, Engineering and Technology.2023; 39: 201.     CrossRef
    • Healthcare predictive analytics using machine learning and deep learning techniques: a survey
      Mohammed Badawy, Nagy Ramadan, Hesham Ahmed Hefny
      Journal of Electrical Systems and Information Technology.2023;[Epub]     CrossRef
    • Analyzing physical and socio-economic factors for property crime incident in Addis Ababa, Ethiopia
      Yeshimar Yigzaw, Asnake Mekuriaw, Tadesse Amsalu
      Heliyon.2023; 9(2): e13282.     CrossRef
    • Mapping and Prediction of Urban Growth using Remote Sensing, Geographic Information System, and Statistical Techniques for Tiruppur Region, Tamil Nadu, India
      K. Elangovan, G. Krishnaraaju
      Journal of the Indian Society of Remote Sensing.2023; 51(8): 1657.     CrossRef
    • Development of a predictive model for workers' involvement in workplace accidents in an underground coal mine
      Kumar Arra, Yuga Raju Gunda, Suprakash Gupta
      Sādhanā.2023;[Epub]     CrossRef
    • A review on the significance of body temperature interpretation for early infectious disease diagnosis
      Nurul Izzati Darul Zaman, Yuan Wen Hau, Ming Chern Leong, Rania Hussien Ahmed Al-ashwal
      Artificial Intelligence Review.2023; 56(12): 15449.     CrossRef
    • Unraveling Structural Alerts in Marketed Drugs for Improving Adverse Outcome Pathway Framework of Drug-Induced QT Prolongation
      Wulin Long, Shihai Li, Yujie He, Jinzhu Lin, Menglong Li, Zhining Wen
      International Journal of Molecular Sciences.2023; 24(7): 6771.     CrossRef
    • The Influence of Street Morphology on Thermal Environment Based on ENVI-met Simulation: A Case Study of Hangzhou Core Area, China
      Jin Bao, Lihua Xu, Yijun Shi, Qiwei Ma, Zhangwei Lu
      ISPRS International Journal of Geo-Information.2023; 12(8): 303.     CrossRef
    • An Asymmetric Ensemble Method for Determining the Importance of Individual Factors of a Univariate Problem
      Jelena Mišić, Aleksandar Kemiveš, Milan Ranđelović, Dragan Ranđelović
      Symmetry.2023; 15(11): 2050.     CrossRef
    • Classification of WatSan Technologies Using Machine Learning Techniques
      Hala Al Nuaimi, Mohamed Abdelmagid, Ali Bouabid, Constantinos V. Chrysikopoulos, Maher Maalouf
      Water.2023; 15(15): 2829.     CrossRef
    • A Study of R-R Interval Transition Matrix Features for Machine Learning Algorithms in AFib Detection
      Sahil Patel, Maximilian Wang, Justin Guo, Georgia Smith, Cuixian Chen
      Sensors.2023; 23(7): 3700.     CrossRef
    • Determinants of viral load suppression among orphaned and vulnerable children living with HIV on ART in Tanzania
      Amal Ally, Amon Exavery, John Charles, Levina Kikoyo, Remmy Mseya, Asheri Barankena, Rose Fovo, Akwila Temu
      Frontiers in Public Health.2023;[Epub]     CrossRef
    • An Economic Analysis of the Factors Affecting the Organization and Cultivation of the Wheat Crop in Iraq Using Logistic Regression
      Noor Amer Ali, Ali Darub Kassar Al-Hiyali, Muhammed Khalid Muhammed
      IOP Conference Series: Earth and Environmental Science.2023; 1158(9): 092001.     CrossRef
    • An interpretable prediction model of illegal running into the opposite lane on curve sections of two-lane rural roads from drivers’ visual perceptions
      Li He, Bo Yu, Yuren Chen, Shan Bao, Kun Gao, You Kong
      Accident Analysis & Prevention.2023; 186: 107066.     CrossRef
    • Explainable Artificial Intelligence of Multi-Level Stacking Ensemble for Detection of Alzheimer’s Disease Based on Particle Swarm Optimization and the Sub-Scores of Cognitive Biomarkers
      Abdulaziz AlMohimeed, Redhwan M. A. Saad, Sherif Mostafa, Nora Mahmoud El-Rashidy, Sarah Farrag, Abdelkareem Gaballah, Mohamed Abd Elaziz, Shaker El-Sappagh, Hager Saleh
      IEEE Access.2023; 11: 123173.     CrossRef
    • Prediction of pine mistletoe infection using remote sensing imaging: A comparison of the artificial neural network model and logistic regression model
      Ayhan Usta, Murat Yilmaz
      Forest Pathology.2023;[Epub]     CrossRef
    • Hydrogeochemical characterization of the groundwater of Lahore region using supervised machine learning technique
      Sadia Ismail, M. Farooq Ahmed
      Environmental Monitoring and Assessment.2023;[Epub]     CrossRef
    • TÜRK SİGORTACILIK SEKTÖRÜNDE UFRS 17’YE GEÇİŞ SÜRECİ VE SİGORTA ŞİRKETLERİNE OLASI ETKİLERİ
      Gülcan ÇAĞIL, Cüneyt TUNÇ
      Finansal Araştırmalar ve Çalışmalar Dergisi.2022; : 55.     CrossRef
    • The effect of residential house rent on Urban households poverty status in Ethiopia: evidence from Wolkite town
      Endalkachew Kabtamu Mekonen
      Cogent Economics & Finance.2022;[Epub]     CrossRef
    • Local perception of watershed degradation in the upper Gibe basin, southwest Ethiopia: implications to sustainable watershed management strategies
      Fekadu Mengistu, Engdawork Assefa
      International Journal of River Basin Management.2022; 20(2): 235.     CrossRef
    • Pliable lasso for the multinomial logistic regression
      Theophilus Quachie Asenso, Hai Zhang, Yong Liang
      Communications in Statistics - Theory and Methods.2022; 51(11): 3596.     CrossRef
    • Consumer choice for purchasing imported apparel goods and its effect on perceived saving in Debre Markos district, Amhara Ethiopia: A logistic regression analysis
      Fasika Chekol, Yigardush Alimaw, Nitsuh Mengist, Ashebir Tsegaye
      Cogent Social Sciences.2022;[Epub]     CrossRef
    • A comparison of neural and non-neural machine learning models for food safety risk prediction with European Union RASFF data
      Alberto Nogales, Rodrigo Díaz-Morón, Álvaro J. García-Tejedor
      Food Control.2022; 134: 108697.     CrossRef
    • Assessing Landslide Susceptibility by Coupling Spatial Data Analysis and Logistic Model
      Antonio Ganga, Mario Elia, Ersilia D’Ambrosio, Simona Tripaldi, Gian Franco Capra, Francesco Gentile, Giovanni Sanesi
      Sustainability.2022; 14(14): 8426.     CrossRef
    • The Impact of Coping Strategies and Individual Resilience on Anxiety and Depression among Construction Supervisors
      Janet M. Nwaogu, Albert P. C. Chan
      Buildings.2022; 12(12): 2148.     CrossRef
    • Differences in levels of E. coli contamination of point of use drinking water in Bangladesh
      Md. Masud Hasan, Zahirul Hoque, Enamul Kabir, Shahadut Hossain, Mentore Vaccari
      PLOS ONE.2022; 17(5): e0267386.     CrossRef
    • Analysis of the tendency of transition between segments of green consumer behavior with a Markov chain approach
      Aries Susanty, Pradita Yusi Akshinta, M. Mujiya Ulkhaq, Nia Budi Puspitasari
      Journal of Modelling in Management.2022; 17(4): 1177.     CrossRef
    • DravidianCodeMix: sentiment analysis and offensive language identification dataset for Dravidian languages in code-mixed text
      Bharathi Raja Chakravarthi, Ruba Priyadharshini, Vigneshwaran Muralidaran, Navya Jose, Shardul Suryawanshi, Elizabeth Sherly, John P. McCrae
      Language Resources and Evaluation.2022; 56(3): 765.     CrossRef
    • Staff resilience and coping behavior as protective factors for mental health among construction tradesmen
      Janet Mayowa Nwaogu, Albert P.C. Chan, Mershack Opoku Tetteh
      Journal of Engineering, Design and Technology .2022; 20(3): 671.     CrossRef
    • Investigating Which Services are Effective on Recommendation of the Airline Companies
      Seden DOĞAN, Emrah ÖZKUL, Gamze KAYA
      Advances in Hospitality and Tourism Research (AHTR).2022; 10(1): 109.     CrossRef
    • Nature, patterns, and determinants of seasonal outmigration in the north-eastern part of Ethiopia
      Girma Senbetie Asefawu, Kumela Gudeta Nedessa
      Norsk Geografisk Tidsskrift - Norwegian Journal of Geography.2022; 76(3): 177.     CrossRef
    • Civil servants’ integrity in public sector: the case of Nepal
      Rajan Khanal, Anil Kumar Gupta, Prakash C. Bhattarai
      Heliyon.2022; 8(12): e12632.     CrossRef
    • Development of a prediction models for chemotherapy-induced adverse drug reactions: A retrospective observational study using electronic health records
      Jeongah On, Hyeoun-Ae Park, Sooyoung Yoo
      European Journal of Oncology Nursing.2022; 56: 102066.     CrossRef
    • Structure-Activity Relationship (SAR) Model for Predicting Teratogenic Risk of Antiseizure Medications in Pregnancy by Using Support Vector Machine
      Liyuan Kang, Yifei Duan, Cheng Chen, Shihai Li, Menglong Li, Lei Chen, Zhining Wen
      Frontiers in Pharmacology.2022;[Epub]     CrossRef
    • Preferences for service bundles in the energy sector – a survey of German private households
      André Hackbarth, Timo Tremml, Sabine Löbbe
      International Journal of Energy Sector Management.2022; 16(6): 1214.     CrossRef
    • Classification of imbalanced medical data: An empirical study of machine learning approaches
      Shikha Mundra, Shounak Vijay, Ankit Mundra, Punit Gupta, Mayank Kumar Goyal, Mandeep Kaur, Supriya Khaitan, Abha Kiran Rajpoot, Valentina Emilia Balas
      Journal of Intelligent & Fuzzy Systems.2022; 43(2): 1933.     CrossRef
    • Development and validation of a predictive scoring system for in-hospital mortality in COVID-19 Egyptian patients: a retrospective study
      Mohamed AbdelSalam Elgohary, Asmaa Ali, Thanaa A. El-Masry, Hani Faidah, Farkad Bantun, Ahmad M. Elkholy, Jaklin S. Fahim, Nabila N. Elgamal, Mohamed Emam Mohamed, Mohamed G. Seadawy, Amro M. Helal, Michel De Waard, Hesham M. Shishtawy, Maisra M. El-Bouse
      Scientific Reports.2022;[Epub]     CrossRef
    • Intelligent modeling of unconfined compressive strength (UCS) of hybrid cement-modified unsaturated soil with nanostructured quarry fines inclusion
      Kennedy C. Onyelowe, Fazal E. Jalal, Mudassir Iqbal, Zia Ur Rehman, Kizito Ibe
      Innovative Infrastructure Solutions.2022;[Epub]     CrossRef
    • Neutron/gamma pulse shape discrimination using short-time frequency transform
      Assem Abdelhakim, Ehab Elshazly
      Analog Integrated Circuits and Signal Processing.2022; 111(3): 387.     CrossRef
    • Paradox of Success-Mediated Conflicts: Analysing Attitudes of Local Communities Towards Successfully Reintroduced Tigers in India
      Manjari Malviya, Sankar Kalyanasundaram, Ramesh Krishnamurthy
      Frontiers in Conservation Science.2022;[Epub]     CrossRef
    • Simulation of Acute Pulmonary Hypertension in Beagle Dogs
      Miao Wang, YaTing Hu, BinBin Guo, Hong Tang
      International Heart Journal.2022; 63(3): 612.     CrossRef
    • An Analytical Predictive Models and Secure Web-Based Personalized Diabetes Monitoring System
      Radwa Marzouk, Ala Saleh Alluhaidan, Sahar A. El Rahman
      IEEE Access.2022; 10: 105657.     CrossRef
    • Practical Machine Learning Model to Predict the Recovery of Motor Function in Patients with Stroke
      Jeoung Kun Kim, Zhihan Lv, Donghwi Park, Min Cheol Chang
      European Neurology.2022; 85(4): 273.     CrossRef
    • Analyzing Various Factors Affecting Farmers’ Willingness to Adopt Soil Erosion Control Measures in the Sebeya Catchment, Rwanda
      Félicien Majoro, Umaru Garba Wali
      Sustainability.2022; 14(19): 12895.     CrossRef
    • Risk Factors Associated with Tuberculosis Among Men; A Study of South Africa
      Muziwandile Nhlakanipho Mlondo, Sileshi Fanta Melesse, Henry G Mwambi
      The Open Public Health Journal.2022;[Epub]     CrossRef
    • Artificial intelligence methods for improving the inventive design process, application in lattice structure case study
      Masih Hanifi, Hicham Chibane, Remy Houssin, Denis Cavallucci, Naser Ghannad
      Artificial Intelligence for Engineering Design, Analysis and Manufacturing.2022;[Epub]     CrossRef
    • TheImpacts of Apple-based Agroforestry Practices on the Livelihoods of Smallholder Farmers in Southern Ethiopia
      Tesfaye Gebeyehu Admasu, Amene Afework Jenberu
      Trees, Forests and People.2022; 7: 100205.     CrossRef
    • Relation of Dietary n-3 and n-6 Fatty Acid Intakes to Metabolic Syndrome in Middle-Aged People Depending on the Level of HbA1c: A Review of National Health and Nutrition Survey Data from 2014 to 2016
      Seo-Woo Park, Do-Yeong Kim, Gyeong-Tae Bak, Dae-Sung Hyun, Sung-Kyung Kim
      Medicina.2022; 58(8): 1017.     CrossRef
    • Prevalence and factors associated with pre‐frailty and frailty among Korean older adults with heart failure
      Youn‐Jung Son, Sang‐Wook Kim, Wang‐Soo Lee, Seung Yong Shin, Hoyoun Won, Jun Hwan Cho, Hyue Mee Kim, Joonhwa Hong, JiYeon Choi
      Journal of Advanced Nursing.2022; 78(10): 3235.     CrossRef
    • An Ensemble Heart Disease Prediction Model Bagged with Logistic Regression, Naïve Bayes and K Nearest Neighbour.
      Annwesha Banerjee Majumder, Somsubhra Gupta, Dharmpal Singh
      Journal of Physics: Conference Series.2022; 2286(1): 012017.     CrossRef
    • Motives for the introduction of agricultural innovations in Serbia with particular accent on beekeepers: The application of logistic regression
      Lidija Madžar
      Ekonomika poljoprivrede.2022; 69(1): 27.     CrossRef
    • Predicting Ischemic Stroke in Patients with Atrial Fibrillation Using Machine Learning
      Seonwoo Jung, Min-Keun Song, Eunjoo Lee, Sejin Bae, Yeon-Yong Kim, Doheon Lee, Myoung Jin Lee, Sunyong Yoo
      Frontiers in Bioscience-Landmark.2022;[Epub]     CrossRef
    • Quantitative Structure–Activity Relationship (QSAR) Model for the Severity Prediction of Drug-Induced Rhabdomyolysis by Using Random Forest
      Yifan Zhou, Shihai Li, Yiru Zhao, Mingkun Guo, Yuan Liu, Menglong Li, Zhining Wen
      Chemical Research in Toxicology.2021; 34(2): 514.     CrossRef
    • Knowledge extraction from pointer movements and its application to detect uncertainty
      Catia Cepeda, Maria Camila Dias, Dina Rindlisbacher, Hugo Gamboa, Marcus Cheetham
      Heliyon.2021; 7(1): e05873.     CrossRef
    • Predicting the Utilization of Mental Health Treatment with Various Machine Learning Algorithms
      Meera Sharma, Sonok Mahapatra, Adeethyia Shankar, Xiaodi Wang
      WSEAS TRANSACTIONS ON COMPUTERS.2021; 19: 285.     CrossRef
    • Talking about Sustainability: How the Media Construct the Public’s Understanding of Sustainable Food in Romania
      Valentina Marinescu, Bianca Fox, Darie Cristea, Daniela Roventa-Frumusani, Ramona Marinache, Silvia Branea
      Sustainability.2021; 13(9): 4609.     CrossRef
    • TÜRK SİGORTACILIK SEKTÖRÜNDE UFRS 17’YE GEÇİŞ SÜRECİ VE SİGORTA ŞİRKETLERİNE OLASI ETKİLERİ
      Cüneyt TUNÇ
      Finansal Araştırmalar ve Çalışmalar Dergisi.2021;[Epub]     CrossRef
    • Analysis of Unmet Healthcare Needs in Ireland: A Data Mining Approach
      Dr. Anatoli Nachev
      International Journal of Engineering and Advanced Technology.2021; 10(3): 81.     CrossRef
    • Performance analysis of Word Embeddings for Cyberbullying Detection
      Subbaraju Pericherla, E Ilavarasan
      IOP Conference Series: Materials Science and Engineering.2021; 1085(1): 012008.     CrossRef
    • The usage of logistic regression and artificial neural networks for evaluation and predicting property-liability insurers' solvency in Egypt
      Eid Elghaly Hassan, Diping Zhang
      Data Science in Finance and Economics.2021; 1(3): 215.     CrossRef
    • Predicting the Safety Climate in Construction Sites of Saudi Arabia: A Bootstrapped Multiple Ordinal Logistic Regression Modeling Approach
      Anas A. Makki, Ibrahim Mosly
      Applied Sciences.2021; 11(4): 1474.     CrossRef
    • Community participation in watershed management: analysis of the status and factors affecting community engagement in the upper Gibe basin, South West Ethiopia
      Fekadu Mengistu, Engdawork Assefa
      Journal of Environmental Planning and Management.2021; 64(2): 252.     CrossRef
    • AGRICULTURAL INNOVATIONS, EXTENSION, FINANCE AND RURAL LOANS IN THE REPUBLIC OF SERBIA: THE CASE OF LOGISTIC REGRESSION
      Lidija Madžar
      Annals of the Polish Association of Agricultural and Agribusiness Economists.2021; XXIII(4): 129.     CrossRef
    • An Approach Using Fuzzy Sets and Boosting Techniques to Predict Liver Disease
      Pushpendra Kumar, Ramjeevan Singh Thakur
      Computers, Materials & Continua.2021; 68(3): 3513.     CrossRef
    • Does engagement with frontline health workers improve maternal and child healthcare utilisation and outcomes in India?
      Anu Rammohan, Srinivas Goli, Shashi Kala Saroj, C. P. Abdul Jaleel
      Human Resources for Health.2021;[Epub]     CrossRef
    • Assessing risk factors associated with urban transit bus involved accident severity: a case study of a Middle East country
      Mehrdad Nasri, Kayvan Aghabayk
      International Journal of Crashworthiness.2021; 26(4): 413.     CrossRef
    • Detecting Information on the Spread of Dengue on Twitter Using Artificial Neural Networks
      Samina Amin, M. Irfan Uddin, M. Ali Zeb, Ala Abdulsalam Alarood, Marwan Mahmoud, Monagi H. Alkinani
      Computers, Materials & Continua.2021; 67(1): 1317.     CrossRef
    • Automatic Evaluation of Heart Condition According to the Sounds Emitted and Implementing Six Classification Methods
      Manuel A. Soto-Murillo, Jorge I. Galván-Tejada, Carlos E. Galván-Tejada, Jose M. Celaya-Padilla, Huizilopoztli Luna-García, Rafael Magallanes-Quintanar, Tania A. Gutiérrez-García, Hamurabi Gamboa-Rosales
      Healthcare.2021; 9(3): 317.     CrossRef
    • Visual light perceptions caused by medical linear accelerator: Findings of machine-learning algorithms in a prospective questionnaire-based case–control study
      Chao-Yang Kuo, Cheng-Chun Lee, Yuh-Lin Lee, Shueh-Chun Liou, Jia-Cheng Lee, Emily Chia-Yu Su, Yi-Wei Chen, Rongxiao Zhang
      PLOS ONE.2021; 16(2): e0247597.     CrossRef
    • Health-seeking behavior and waste management practices among women in major urban markets in Owerri, Nigeria
      Cyprian Ezedike, Eudora Ohazurike, Faisal C Emetumah, Okechukwu O Ajaegbu
      AIMS Public Health.2020; 7(1): 169.     CrossRef
    • A cross-sectional study to determine factors affecting dental and medical students’ preference for virtual learning during the COVID-19 outbreak
      Nosayba Al-Azzam, Lina Elsalem, Farai Gombedza
      Heliyon.2020; 6(12): e05704.     CrossRef
    • Machine Learning Algorithms in Cardiology Domain: A Systematic Review
      Aleksei Dudchenko, Matthias Ganzinger, Georgy Kopanitsa
      The Open Bioinformatics Journal.2020; 13(1): 25.     CrossRef
    • Effect of Succession Planning on Leadership Elevation
      Ammad Zafar, Ghazal Khawaja Hummayun Akhtar
      Pakistan Journal of Applied Social Sciences.2020; 11(2): 21.     CrossRef
    • Postpartum depression and its correlates in middle‐class women in Hunan, China
      Ziyao Guan, Wenhui Bai, Chunxiang Qin, Mei Sun, Xiaoling Bai, Siyuan Tang
      Asian Social Work and Policy Review.2020; 14(2): 107.     CrossRef
    • A Regression Model for Predicting the Likelihood of Reporting a Crime Based on the Victim’s Demographic Variables and Their Perceptions Towards the Police
      Malebogo Pulenyane, Tlhalitshi Volition Montshiwa
      Statistics, Politics and Policy.2020; 11(2): 167.     CrossRef
    • Gender differences in the relationships between sleep disturbances and academic performance among nursing students: A cross-sectional study
      Ollyvia Freeska Dwi Marta, Shu-Yu Kuo, Jacqueline Bloomfield, Hsin-Chien Lee, Faqih Ruhyanudin, Mia Yuangi Poynor, Ageng Brahmadhi, Indah Dwi Pratiwi, Nur Aini, Erma Wahyu Mashfufa, Faizul Hasan, Hsiao-Yean Chiu
      Nurse Education Today.2020; 85: 104270.     CrossRef
    • Prediction of physical violence in schizophrenia with machine learning algorithms
      Kevin Z. Wang, Ali Bani-Fatemi, Christopher Adanty, Ricardo Harripaul, John Griffiths, Nathan Kolla, Philip Gerretsen, Ariel Graff, Vincenzo De Luca
      Psychiatry Research.2020; 289: 112960.     CrossRef
    • Assessment of a Deep Learning Algorithm for the Detection of Rib Fractures on Whole-Body Trauma Computed Tomography
      Thomas Weikert, Luca Andre Noordtzij, Jens Bremerich, Bram Stieltjes, Victor Parmar, Joshy Cyriac, Gregor Sommer, Alexander Walter Sauter
      Korean Journal of Radiology.2020; 21(7): 891.     CrossRef
    • Application of logistic regression analysis in prediction of groundwater vulnerability in gold mining environment: a case of Ilesa gold mining area, southwestern, Nigeria
      K. A. N. Adiat, B. E. Akeredolu, A. A. Akinlalu, G. M. Olayanju
      Environmental Monitoring and Assessment.2020;[Epub]     CrossRef
    • Eco-Kansei design for retailing packaging: a current research progress
      A Aprilia, T Djatna, N S Indrasti, Sugiarto
      IOP Conference Series: Earth and Environmental Science.2020; 472(1): 012046.     CrossRef
    • Knowledge mining on the association between psychological capital and educational qualifications among hospitality employees
      Donald Douglas Atsa’am, Ersin Kuset Bodur
      Current Issues in Tourism.2020; 23(9): 1073.     CrossRef
    • Predicting Dysfunctional Human–Dog Dyads
      Rute Canejo-Teixeira, Pedro Armelim Almiro, Luís V. Baptista, Maria Manuela Grave Rodeia Espada Niza
      Anthrozoös.2020; 33(6): 743.     CrossRef
    • Entrepreneurship factor’s affecting the youth decision to continue their family farm
      S F Ayu, M Nauly
      Journal of Physics: Conference Series.2020; 1542(1): 012052.     CrossRef
    • Farmers’ Attitudes Towards the Use of Biomass as Renewable Energy—A Case Study from Southeastern Europe
      Prespa Ymeri, Csaba Gyuricza, Csaba Fogarassy
      Sustainability.2020; 12(10): 4009.     CrossRef
    • AI Approaches towards Prechtl’s Assessment of General Movements: A Systematic Literature Review
      Muhammad Tausif Irshad, Muhammad Adeel Nisar, Philip Gouverneur, Marion Rapp, Marcin Grzegorzek
      Sensors.2020; 20(18): 5321.     CrossRef
    • Logistic Regression and Least Absolute Shrinkage and Selection Operator
      Hyunyong Lee, Hun-Sung Kim
      Cardiovascular Prevention and Pharmacotherapy.2020; 2(4): 142.     CrossRef
    • A modular framework for estimating annual averaged power output generation of wind turbines
      Benjamin Wacker, Johann V. Seebaß, Jan Chr. Schlüter
      Energy Conversion and Management.2020; 221: 113149.     CrossRef
    • Interest to continue farming among agricultural students who is a child of a farmer
      S F Ayu, M Nauly
      IOP Conference Series: Earth and Environmental Science.2020; 454(1): 012027.     CrossRef
    • Assessment of Teachers’ Knowledge about Tuberculosis at Primary School in Balad City
      Wafaa S. Hasanain
      Journal of Physics: Conference Series.2020; 1591(1): 012001.     CrossRef
    • A Comparative Assessment of Credit Risk Model Based on Machine Learning ——a case study of bank loan data
      Yuelin Wang, Yihan Zhang, Yan Lu, Xinran Yu
      Procedia Computer Science.2020; 174: 141.     CrossRef
    • Machine Learning Algorithms in Cardiology Domain: A Systematic Review (Preprint)
      Georgy Kopanitsa, Aleksei Dudchenko, Matthias Ganzinger
      JMIR Medical Informatics.2019;[Epub]     CrossRef
    • Determinants of child labour practices in Ghana
      Lucy Twumwaah Afriyie, Bashiru I. I. Saeed, Abukari Alhassan
      Journal of Public Health.2019; 27(2): 211.     CrossRef
    • Mapping fecal pollution in rural groundwater supplies by means of artificial intelligence classifiers
      S. Díaz-Alcaide, P. Martínez-Santos
      Journal of Hydrology.2019; 577: 124006.     CrossRef
    • A Hybrid Computational Intelligence Approach to Groundwater Spring Potential Mapping
      Dieu Tien Bui, Ataollah Shirzadi, Kamran Chapi, Himan Shahabi, Biswajeet Pradhan, Binh Pham, Vijay Singh, Wei Chen, Khabat Khosravi, Baharin Bin Ahmad, Saro Lee
      Water.2019; 11(10): 2013.     CrossRef
    • Determinants of environmental conservation in Lake Tana Biosphere Reserve, Ethiopia
      Zemenu Bires, Sahil Raj
      Heliyon.2019; 5(7): e01997.     CrossRef
    • Drivers of Cooperation Activity in Kosovo’s Agriculture
      Shyhrete Muriqi, Maria Fekete-Farkas, Zsolt Baranyai
      Agriculture.2019; 9(5): 96.     CrossRef
    • Determinants of paying national health insurance premium with mobile phone in Ghana: a cross-sectional prospective study
      Joseph Marfo Boaheng, Eugenia Amporfu, Daniel Ansong, Anthony Kofi Osei-Fosu
      International Journal for Equity in Health.2019;[Epub]     CrossRef
    • Determining Factors Affecting Consumer’s Decision to Purchase Organic Chicken Meat
      F Kaygisiz, BA Bolat, D Bulut
      Brazilian Journal of Poultry Science.2019;[Epub]     CrossRef
    • Performance and cost-effectiveness of change burst metrics in predicting software faults
      Malanga Kennedy Ndenga, Ivaylo Ganchev, Jean Mehat, Franklin Wabwoba, Herman Akdag
      Knowledge and Information Systems.2019; 60(1): 275.     CrossRef
    • Detection of Timestamps Tampering in NTFS using Machine Learning
      Alji Mohamed, Chougdali Khalid
      Procedia Computer Science.2019; 160: 778.     CrossRef
    • Responding to the greatest challenges? Value creation in ecological startups
      Andreas Kuckertz, Elisabeth S.C. Berger, Anja Gaudig
      Journal of Cleaner Production.2019; 230: 1138.     CrossRef
    • Sewer Pipes Condition Prediction Models: A State-of-the-Art Review
      Mohammadreza Malek Mohammadi, Mohammad Najafi, Vinayak Kaushal, Ramtin Serajiantehrani, Nazanin Salehabadi, Taha Ashoori
      Infrastructures.2019; 4(4): 64.     CrossRef
    • TLUSBoost algorithm: a boosting solution for class imbalance problem
      Sujit Kumar, Saroj Kr. Biswas, Debashree Devi
      Soft Computing.2019; 23(21): 10755.     CrossRef
    • Analysis between GPA and TOEFL Score For Postgraduate Student of Sports Education using Bivariate Logistic Regression
      A Sofro, A Oktaviarina, E Mintarto, O Wiriawan
      Journal of Physics: Conference Series.2019; 1417(1): 012022.     CrossRef
    • Suicidal ideation in systemic lupus erythematosus: NR2A gene polymorphism, clinical and psychosocial factors
      R I Buji, N A Abdul Murad, L F Chan, T Maniam, M S Mohd Shahrir, M Rozita, A S Shamsul, R Mohamad Hussain, N Abdullah, R Jamal, N R Nik Jaafar
      Lupus.2018; 27(5): 744.     CrossRef
    • The use of geographic information system and 1860s cadastral data to model agricultural suitability before heavy mechanization. A case study from Malta
      Gianmarco Alberti, Reuben Grima, Nicholas C. Vella, Hugo Rebelo
      PLOS ONE.2018; 13(2): e0192039.     CrossRef
    • Drivers of eco-innovation in the manufacturing sector of Nigeria
      Maruf Sanni
      Technological Forecasting and Social Change.2018; 131: 303.     CrossRef
    • Spatial filtering pipeline evaluation of cortically coupled computer vision system for rapid serial visual presentation
      Zhengwei Wang, Graham Healy, Alan F. Smeaton, Tomas E. Ward
      Brain-Computer Interfaces.2018; 5(4): 132.     CrossRef
    • Logistic Regression and Growth Charts to Determine Children Nutritional and Stunting Status: A Review
      Margaretha Ohyver, Jurike V. Moniaga, Karina Restisa Yunidwi, Muhamad Irfan Setiawan
      Procedia Computer Science.2017; 116: 232.     CrossRef
    • Predictors of Unintentionally Severe Harm During Nonsuicidal Self‐Injury
      Trevor J. Buser, Juleen K. Buser, Corrine C. Rutt
      Journal of Counseling & Development.2017; 95(1): 14.     CrossRef
    • Women’s opinion on the justification of physical spousal violence: A quantitative approach to model the most vulnerable households in Bangladesh
      Raaj Kishore Biswas, Nusma Rahman, Enamul Kabir, Farabi Raihan, Fakir M Amirul Islam
      PLOS ONE.2017; 12(11): e0187884.     CrossRef
    • Well-testing model identification using time-series shapelets
      Rouhollah Ahmadi, Babak Aminshahidy, Jamal Shahrabi
      Journal of Petroleum Science and Engineering.2017; 149: 292.     CrossRef
    • Proposing stochastic probability-based math model and algorithms utilizing social networking and academic data for good fit students prediction
      Muhammad Fahim Uddin, Jeongkyu Lee
      Social Network Analysis and Mining.2017;[Epub]     CrossRef
    • Infrared assessment of knee instability in ACL deficient patients
      Aleksandar Matić, Suzana Petrović Savić, Branko Ristić, Vladan B. Stevanović, Goran Devedžić
      International Orthopaedics.2016; 40(2): 385.     CrossRef
    • Binary Logistic Model for Estimation of Mode Shift into Delhi Metro
      Vineet Chauhan, Hemant K. Suman, Nomesh B. Bolia
      The Open Transportation Journal.2016; 10(1): 124.     CrossRef
    • Analysis of the Factors Influencing the Use of Public Buses in Delhi
      Hemant K. Suman, Nomesh B. Bolia, Geetam Tiwari
      Journal of Urban Planning and Development.2016;[Epub]     CrossRef
    • Are predictors of future suicide attempts and the transition from suicidal ideation to suicide attempts shared or distinct: A 12-month prospective study among patients with depressive disorders
      Lai Fong Chan, Azhar Shah Shamsul, Thambu Maniam
      Psychiatry Research.2014; 220(3): 867.     CrossRef

    • Cite
      CITE
      export Copy Download
      Close
      Download Citation
      Download a citation file in RIS format that can be imported by all major citation management software, including EndNote, ProCite, RefWorks, and Reference Manager.

      Format:
      • RIS — For EndNote, ProCite, RefWorks, and most other reference management software
      • BibTeX — For JabRef, BibDesk, and other BibTeX-specific software
      Include:
      • Citation for the content below
      An Introduction to Logistic Regression: From Basic Concepts to Interpretation with Particular Attention to Nursing Domain
      J Korean Acad Nurs. 2013;43(2):154-164.   Published online April 30, 2013
      Close
    • XML DownloadXML Download
    Figure
    • 0
    We recommend
    An Introduction to Logistic Regression: From Basic Concepts to Interpretation with Particular Attention to Nursing Domain
    Image
    Figure 1 Graph of logistic curve where α=0 and β=1.
    An Introduction to Logistic Regression: From Basic Concepts to Interpretation with Particular Attention to Nursing Domain

    Sample Classification Table

    a, b, c, and d are number of observations in the corresponding cells.

    Example Output from Logistic Regression: Statistical Tests of Individual Predictors

    CI=Confidence interval; df=Degrees of freedom; OR=Odds ratio; SE=Standard error.

    Example Output from Logistic Regression: Overall Model Evaluation and Goodness-of-Fit Statistics

    Example Output from Logistic Regression: A Classification Table

    Sensitivity=3/(3+57)=5.00%; Specificity=124/(6+124)=95.48%;

    False positive=6/(6+124)=4.62%; False negative=57/(3+57)=95.00%.

    Adherence to Guidelines for Using and Reporting Logistic Regression

    NR=Not reported; VIF=Variance inflation factor; AUC=Area under curve; ROC=Receiver operationg characteristic; AIC=Akaike's information criterion; LL=Log-likelihood CI=Confidence interval; df=Degrees of freedom; OR=Odds ratio; SE=Standard error.

    Table 1 Sample Classification Table

    a, b, c, and d are number of observations in the corresponding cells.

    Table 2 Example Output from Logistic Regression: Statistical Tests of Individual Predictors

    CI=Confidence interval; df=Degrees of freedom; OR=Odds ratio; SE=Standard error.

    Table 3 Example Output from Logistic Regression: Overall Model Evaluation and Goodness-of-Fit Statistics

    Table 4 Example Output from Logistic Regression: A Classification Table

    Sensitivity=3/(3+57)=5.00%; Specificity=124/(6+124)=95.48%;

    False positive=6/(6+124)=4.62%; False negative=57/(3+57)=95.00%.

    Table 5 Adherence to Guidelines for Using and Reporting Logistic Regression

    NR=Not reported; VIF=Variance inflation factor; AUC=Area under curve; ROC=Receiver operationg characteristic; AIC=Akaike's information criterion; LL=Log-likelihood CI=Confidence interval; df=Degrees of freedom; OR=Odds ratio; SE=Standard error.


    J Korean Acad Nurs : Journal of Korean Academy of Nursing
    Close layer
    TOP