Survival Analysis for Telecom Churn using R

maddy picture maddy · Nov 22, 2014 · Viewed 9k times · Source

I am working on Telecom Churn problem and here is my dataset.

http://www.sgi.com/tech/mlc/db/churn.data

Names - http://www.sgi.com/tech/mlc/db/churn.names

I'm new to survival analysis.Given the training data,my idea to build a survival model to estimate the survival time along with predicting churn/non churn on test data based on the independent factors.Could anyone help me with the code or pointers on how to go about this problem.

To be precise,say my train data has got

customer call usage details,plan details,tenure of his account etc and whether did he churn or not.

Using general classification models,I can predict churn or not on test data.Now using Survival analysis,I want to predict the tenure of the survival in test data.

Thanks, Maddy

Answer

John Chrysostom picture John Chrysostom · Mar 18, 2015

If you're still interested (or for the benefit of those coming later), I've written a few guides specifically for conducting survival analysis on customer churn data using R. They cover a bunch of different analytical techniques, all with sample data and R code.

Basic survival analysis: http://daynebatten.com/2015/02/customer-churn-survival-analysis/

Basic cox regression: http://daynebatten.com/2015/02/customer-churn-cox-regression/

Time-dependent covariates in cox regression: http://daynebatten.com/2015/12/survival-analysis-customer-churn-time-varying-covariates/

Time-dependent coefficients in cox regression: http://daynebatten.com/2016/01/customer-churn-time-dependent-coefficients/

Restricted mean survival time (quantify the impact of churn in dollar terms): http://daynebatten.com/2015/03/customer-churn-restricted-mean-survival-time/

Pseudo-observations (quantify dollar gain/loss associated with the churn effects of variables): http://daynebatten.com/2015/03/customer-churn-pseudo-observations/

Please forgive the goofy images.