Welcome to the session on 'Multivariate Logistic Regression (Model Building)'.
Just like when you’re building a model using linear regression, one independent variable might not be enough to capture all the uncertainties of the target variable in logistic regression as well. So in order to make good and accurate predictions, you need multiple variables and that is what we’ll study in this session.
Before starting with multivariate logistic regression, the first question that arises is, “Do you need any extensions while moving from univariate to multivariate logistic regression?” Recall the equation used in the case of univariate logistic regression was:
The above equation has only one feature variable , for which the coefficient is . Now, if you have multiple features, say n, you can simply extend this equation with ‘n’ feature variables and ‘n’ corresponding coefficients such that the equation now becomes:
Recall this extension is similar to what you did while moving from simple to multiple linear regression.
In this session, you will learn how to:
We will use the ‘Telecom Churn’ dataset in this session to build a model using multivariate logistic regression. This will involve all the familiar steps such as:
Apart from the familiar old steps, you’ll also be introduced to something known as a confusion matrix and you’ll also learn how the accuracy is measured for a logistic regression model.
There are no prerequisites for this session other than the knowledge of the previous session and the previous module.