The Social Effect: Predicting Telecom Customer Churn with Call DataMichael E. Driscoll, Ph.D.Principal, FounderFebruary 16, 2010
Social Network Analysis with Telecom DataThe following slides describe an initial project analyzing a N. American telecom’s call data on a dedicated analytics platform:We describe the analysis of a slice of a telecom’s call history data from several million customers in the several major North American markets.We demonstrate the performance gain achieved by having a dedicated analytics platform (computation of  millions of relationships from tens of billions of events, spanning tens of TB of data, in less than one hour)We show  that social network influence is a powerful predictor of customer churn:  subscribers who experience a Telecom cancellation in their frequent calling network are 2x more likely to cancel themselves.We highlight one outbreak of cancellations in a metropolitan call network from May-June 2009.
Challenge:  Customer ChurnAcquisitionAttrition
Key Data:  Call Detail RecordsA slice of several billion call detail records (CDRs) from several million subscribers drawn from three major North American markets, for May-August 2009.
Call Quality AnalysisNo Relationship Between Dropped Calls and Customer ChurnNo significant correlation found between:  inferred dropped calls  (defined as consecutive calls to same number with < 20 s gap)library(ggplot2)qplot(Status, DroppedCalls,   data=CallHistory,geom="boxplot“)Box plot to shows log-normalized distributions of dropped call frequencies (drops per 100 calls) for 10k customers placed, faceted by active and cancelled subscribers.
What about social networks?
Social Network AnalysisNetwork is Generated from Call History DataCall history logs were pulled from the Greenplum warehouse.  These were parsed and outgoing numbers were associated with subscription ids.  The result is a row of data for every caller-callee connection meeting a low threshold (> 1 call and > 60 s talk-time per month).  The majority are between Telecom customers and other carriers (or land-lines).
Our Analytics WorkflowThree steps:  1.  Pull from DB, 2.  Analyze in R,  3.  Visualize in R + Graphviz
Our Tool:  The R Programming LanguageDownload R at http://www.r-project.org/
Getting Call Data Into R for Analysis	- from Files> Calls <- read.csv(“CallHistory.csv”,header=TRUE)	  from Databases> con <- dbConnect(driver,user,password,host,dbname)> Calls <- dbSendQuery(con, “SELECT * FROM call_history”)	  from the Web> con <- url('http://Telco.com/dump/CallHistory.csv')> Calls <- read.csv(con, header=TRUE)	   from previous R objects> load(‘CallHistory.RData’)
Social Network AnalysisMillions of edges analyzed in minutesFull analysis of a first-order outgoing call network for our slice (~ millions of customers, three months of call history) took less than one hour.This could be further improved with further parallelization of R code (currently SQL queries run parallel on Greenplum, R is run on master node).
Results:  People Have Small Call Networks (Three)The median size of a caller’s network is three,  while the mean size is five.
Results: Canceling Customers are 7x More Likely to be LinkedTypes of Callers (Nodes)active (A)cancelled (C)Types of Connections (Edges)A-AA-C or C-AC-CC-C edges are 7x more likely in call networks than what is expected by chance
Results:  A Customer With a Canceller in Their Network Churns at Twice the RateTypes of Connections (Edges)May C-AJune C-CIn essence, we are asking whether being connected to another canceller has any effect on one’s rate of cancellation.  It turns out that it does.   And if we only look at voluntary port-outs, we see that customers churn at 3x the rate.
From Data to Insights to ActionsIf we had known two customers’ calling networks…Could we have prevented four more from leaving?
The Emerging Analytics StackActionsApps (Email, Ad Campaigns)Analytics(R, SPSS, SAS, SAP)InsightsBig Data(HDFS or Parallel RDBMS) Data
ReferencesEnhancing Customer Knowledge at Optus, Teradata Case-Study (September 2009).IBM’s Analytics Tapped to Predict, Prevent Churn.  Telephony Online (April 2009).  The Elements of Statistical Learning, Hastie, Tibshirani, Friedman.  Springer. (February 2009).Study Shows Obesity Can Be Contagious, Gina Kolata, The New York Times (July 25, 2007)  [great example of homophily]ContactMichael E. Driscoll, Ph.D.med@dataspora.comFollow @datasporaon Twitter

More Related Content

PDF
Tesseract overview EY
PPTX
Social Network Analysis Helps Telecommunication Firms
PPT
Data Driven Health Care Enterprise
PPT
AI Science
PDF
25 Digital Transformation Case Studies In Retail
PDF
Social network analysis part ii
PPTX
Metaverse opportunities for the communications industry
PDF
5G Multi-Access Edge Compute
Tesseract overview EY
Social Network Analysis Helps Telecommunication Firms
Data Driven Health Care Enterprise
AI Science
25 Digital Transformation Case Studies In Retail
Social network analysis part ii
Metaverse opportunities for the communications industry
5G Multi-Access Edge Compute

What's hot (20)

PDF
Use of Analytics by Netflix - Case Study
PDF
Cavalry Ventures | Deep Dive: Generative AI
PDF
The Digital Culture Challenge: Closing the Employee-Leadership Gap
PDF
Artificial Intelligence in Telecom – Industry Adoption Analysis
PPTX
Photocert- InsurTech Innovation Award 2022
PPTX
Technology Vision 2022: Communications Industry | Accenture
PPTX
MG5705 AI in Marketing to share.pptx
PPTX
LLM presentation final
PDF
Marketing reimagined within the fourth industrial revolution
PDF
Media-Morphosis Transforming Media and Entertainment
PDF
Xây dựng kho dữ liệu trên nền tảng điện toán đám mây cho bệnh viện
 
PDF
GenAI in Research with Responsible AI
PDF
Value Untangled Slideshare
PDF
Accenture Tech Vision 2020 - Overview
PDF
Modern Recommendation for Advanced Practitioners
PPTX
Die fünf Quellen der Motivation bei der Nutzung von Enterprise Wikis - WI2013
PDF
From Customer Insights to Action
PDF
Black Box Global Corporate Presentation - Jul'23
PPTX
Technology Vision 2022: Communications Industry | Accenture
PDF
Tendencias, visión y futuro de los Clasificados Online.
Use of Analytics by Netflix - Case Study
Cavalry Ventures | Deep Dive: Generative AI
The Digital Culture Challenge: Closing the Employee-Leadership Gap
Artificial Intelligence in Telecom – Industry Adoption Analysis
Photocert- InsurTech Innovation Award 2022
Technology Vision 2022: Communications Industry | Accenture
MG5705 AI in Marketing to share.pptx
LLM presentation final
Marketing reimagined within the fourth industrial revolution
Media-Morphosis Transforming Media and Entertainment
Xây dựng kho dữ liệu trên nền tảng điện toán đám mây cho bệnh viện
 
GenAI in Research with Responsible AI
Value Untangled Slideshare
Accenture Tech Vision 2020 - Overview
Modern Recommendation for Advanced Practitioners
Die fünf Quellen der Motivation bei der Nutzung von Enterprise Wikis - WI2013
From Customer Insights to Action
Black Box Global Corporate Presentation - Jul'23
Technology Vision 2022: Communications Industry | Accenture
Tendencias, visión y futuro de los Clasificados Online.
Ad

Viewers also liked (18)

PDF
Applying sonamine social network analysis to telecommunications marketing
PDF
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
PPT
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...
PPT
Idiro Analytics - What is Rotational Churn and how can we tackle it?
PDF
Deriving economic value for CSPs with Big Data [read-only]
PPSX
Telco Churn Roi V3
PPT
Idiro Analytics - Social Network Analysis for Online Gaming
PPTX
Churn modelling
PDF
Big Telco - Yousun Jeong
PPT
Idiro Analytics - Analytics & Big Data
PPTX
Churn Analysis in Telecom Industry
PPTX
Predicting churn in telco industry: machine learning approach - Marko Mitić
PPTX
Decide on technology stack & data architecture
PDF
Leveraging Kafka for Big Data in Real Time Bidding, Analytics, ML & Campaign ...
PDF
Social network analysis & Big Data - Telecommunications and more
PDF
How to use your CRM for upselling and cross-selling
PPT
Big Data: Social Network Analysis
PDF
Big Data Analytics : A Social Network Approach
Applying sonamine social network analysis to telecommunications marketing
Subscriber Churn Prediction Model using Social Network Analysis In Telecommun...
Idiro Analytics - Identifying Families using Social Network Analysis and Big ...
Idiro Analytics - What is Rotational Churn and how can we tackle it?
Deriving economic value for CSPs with Big Data [read-only]
Telco Churn Roi V3
Idiro Analytics - Social Network Analysis for Online Gaming
Churn modelling
Big Telco - Yousun Jeong
Idiro Analytics - Analytics & Big Data
Churn Analysis in Telecom Industry
Predicting churn in telco industry: machine learning approach - Marko Mitić
Decide on technology stack & data architecture
Leveraging Kafka for Big Data in Real Time Bidding, Analytics, ML & Campaign ...
Social network analysis & Big Data - Telecommunications and more
How to use your CRM for upselling and cross-selling
Big Data: Social Network Analysis
Big Data Analytics : A Social Network Approach
Ad

Similar to Social Network Analysis for Telecoms (20)

PPT
Netsci10 report
PPTX
Churn Presentation22May2016
PPTX
Churn Management
PDF
Social Network Analysis White Paper
PDF
OpenText Big Data Analytics for Telecommunications - Solution Overview
PPSX
T-Mobile: Kiss Churn Goodbye with Data-Driven Campaign Management
PPTX
Solving churn challenge in Big Data environment - Jelena Pekez
PDF
Telecom analytics brochure
PPTX
Telcom churn .pptx
PDF
Churn Prediction in Practice
PDF
Roundtable: Best Practices in First Call Resolution (FCR)
PPSX
T-Mobile: Kiss Churn Goodbye with Data-Driven Campaign Management
PDF
Insights to Deliver a Connected Digital Customer Experience
PDF
Social can learn from tradicional customer service
PPTX
Telecom Churn Prediction Presentation
PDF
Network Quality and Customer Experience
PPTX
AI Applications in telecommunication industry
PPTX
CRM Analytics
PDF
Churn in the Telecommunications Industry
PPTX
Social_Analytics_and_SNA_Presentation.pptx
Netsci10 report
Churn Presentation22May2016
Churn Management
Social Network Analysis White Paper
OpenText Big Data Analytics for Telecommunications - Solution Overview
T-Mobile: Kiss Churn Goodbye with Data-Driven Campaign Management
Solving churn challenge in Big Data environment - Jelena Pekez
Telecom analytics brochure
Telcom churn .pptx
Churn Prediction in Practice
Roundtable: Best Practices in First Call Resolution (FCR)
T-Mobile: Kiss Churn Goodbye with Data-Driven Campaign Management
Insights to Deliver a Connected Digital Customer Experience
Social can learn from tradicional customer service
Telecom Churn Prediction Presentation
Network Quality and Customer Experience
AI Applications in telecommunication industry
CRM Analytics
Churn in the Telecommunications Industry
Social_Analytics_and_SNA_Presentation.pptx

Recently uploaded (20)

PPTX
Chapter 2 strategic Presentation (6).pptx
PDF
IFRS Green Book_Part B for professional pdf
PDF
The Future of Marketing: AI, Funnels & MBA Careers | My Annual IIM Lucknow Talk
PPTX
Oracle Cloud Infrastructure Overview July 2020 v2_EN20200717.pptx
PDF
France's Top 5 Promising EdTech Companies to Watch in 2025.pdf
PDF
Value-based IP Management at Siemens: A Cross-Divisional Analysis
PDF
Pink Cute Simple Group Project Presentation.pdf
PDF
The Influence of Historical Figures on Legal Communication (www.kiu.ac.ug)
PPTX
Market and Demand Analysis.pptx for Management students
PDF
The Impact of Historical Events on Legal Communication Styles (www.kiu.ac.ug)
PPTX
Understanding Procurement Strategies.pptx Your score increases as you pick a ...
PPTX
Supply Chain under WAR (Managing Supply Chain Amid Political Conflict).pptx
PDF
HQ #118 / 'Building Resilience While Climbing the Event Mountain
PPTX
Leadership and leader jobs and ch - 2.pptx
PPTX
PwC consulting Powerpoint Graphics 2014 templates
PDF
Chembond Chemicals Limited Presentation 2025
PPTX
IndustrialAIGuerillaInnovatorsARCPodcastEp3.pptx
PDF
The Dynamic CLOs Shaping the Future of the Legal Industry in 2025.pdf
PDF
QT INTRODUCTION chapters that help to study
PPT
BCG内部幻灯片撰写. slide template BCG.slide template
Chapter 2 strategic Presentation (6).pptx
IFRS Green Book_Part B for professional pdf
The Future of Marketing: AI, Funnels & MBA Careers | My Annual IIM Lucknow Talk
Oracle Cloud Infrastructure Overview July 2020 v2_EN20200717.pptx
France's Top 5 Promising EdTech Companies to Watch in 2025.pdf
Value-based IP Management at Siemens: A Cross-Divisional Analysis
Pink Cute Simple Group Project Presentation.pdf
The Influence of Historical Figures on Legal Communication (www.kiu.ac.ug)
Market and Demand Analysis.pptx for Management students
The Impact of Historical Events on Legal Communication Styles (www.kiu.ac.ug)
Understanding Procurement Strategies.pptx Your score increases as you pick a ...
Supply Chain under WAR (Managing Supply Chain Amid Political Conflict).pptx
HQ #118 / 'Building Resilience While Climbing the Event Mountain
Leadership and leader jobs and ch - 2.pptx
PwC consulting Powerpoint Graphics 2014 templates
Chembond Chemicals Limited Presentation 2025
IndustrialAIGuerillaInnovatorsARCPodcastEp3.pptx
The Dynamic CLOs Shaping the Future of the Legal Industry in 2025.pdf
QT INTRODUCTION chapters that help to study
BCG内部幻灯片撰写. slide template BCG.slide template

Social Network Analysis for Telecoms

  • 1. The Social Effect: Predicting Telecom Customer Churn with Call DataMichael E. Driscoll, Ph.D.Principal, FounderFebruary 16, 2010
  • 2. Social Network Analysis with Telecom DataThe following slides describe an initial project analyzing a N. American telecom’s call data on a dedicated analytics platform:We describe the analysis of a slice of a telecom’s call history data from several million customers in the several major North American markets.We demonstrate the performance gain achieved by having a dedicated analytics platform (computation of millions of relationships from tens of billions of events, spanning tens of TB of data, in less than one hour)We show that social network influence is a powerful predictor of customer churn: subscribers who experience a Telecom cancellation in their frequent calling network are 2x more likely to cancel themselves.We highlight one outbreak of cancellations in a metropolitan call network from May-June 2009.
  • 3. Challenge: Customer ChurnAcquisitionAttrition
  • 4. Key Data: Call Detail RecordsA slice of several billion call detail records (CDRs) from several million subscribers drawn from three major North American markets, for May-August 2009.
  • 5. Call Quality AnalysisNo Relationship Between Dropped Calls and Customer ChurnNo significant correlation found between: inferred dropped calls (defined as consecutive calls to same number with < 20 s gap)library(ggplot2)qplot(Status, DroppedCalls, data=CallHistory,geom="boxplot“)Box plot to shows log-normalized distributions of dropped call frequencies (drops per 100 calls) for 10k customers placed, faceted by active and cancelled subscribers.
  • 6. What about social networks?
  • 7. Social Network AnalysisNetwork is Generated from Call History DataCall history logs were pulled from the Greenplum warehouse. These were parsed and outgoing numbers were associated with subscription ids. The result is a row of data for every caller-callee connection meeting a low threshold (> 1 call and > 60 s talk-time per month). The majority are between Telecom customers and other carriers (or land-lines).
  • 8. Our Analytics WorkflowThree steps: 1. Pull from DB, 2. Analyze in R, 3. Visualize in R + Graphviz
  • 9. Our Tool: The R Programming LanguageDownload R at http://www.r-project.org/
  • 10. Getting Call Data Into R for Analysis - from Files> Calls <- read.csv(“CallHistory.csv”,header=TRUE) from Databases> con <- dbConnect(driver,user,password,host,dbname)> Calls <- dbSendQuery(con, “SELECT * FROM call_history”) from the Web> con <- url('http://Telco.com/dump/CallHistory.csv')> Calls <- read.csv(con, header=TRUE) from previous R objects> load(‘CallHistory.RData’)
  • 11. Social Network AnalysisMillions of edges analyzed in minutesFull analysis of a first-order outgoing call network for our slice (~ millions of customers, three months of call history) took less than one hour.This could be further improved with further parallelization of R code (currently SQL queries run parallel on Greenplum, R is run on master node).
  • 12. Results: People Have Small Call Networks (Three)The median size of a caller’s network is three, while the mean size is five.
  • 13. Results: Canceling Customers are 7x More Likely to be LinkedTypes of Callers (Nodes)active (A)cancelled (C)Types of Connections (Edges)A-AA-C or C-AC-CC-C edges are 7x more likely in call networks than what is expected by chance
  • 14. Results: A Customer With a Canceller in Their Network Churns at Twice the RateTypes of Connections (Edges)May C-AJune C-CIn essence, we are asking whether being connected to another canceller has any effect on one’s rate of cancellation. It turns out that it does. And if we only look at voluntary port-outs, we see that customers churn at 3x the rate.
  • 15. From Data to Insights to ActionsIf we had known two customers’ calling networks…Could we have prevented four more from leaving?
  • 16. The Emerging Analytics StackActionsApps (Email, Ad Campaigns)Analytics(R, SPSS, SAS, SAP)InsightsBig Data(HDFS or Parallel RDBMS) Data
  • 17. ReferencesEnhancing Customer Knowledge at Optus, Teradata Case-Study (September 2009).IBM’s Analytics Tapped to Predict, Prevent Churn. Telephony Online (April 2009). The Elements of Statistical Learning, Hastie, Tibshirani, Friedman. Springer. (February 2009).Study Shows Obesity Can Be Contagious, Gina Kolata, The New York Times (July 25, 2007) [great example of homophily]ContactMichael E. Driscoll, [email protected] @datasporaon Twitter

Editor's Notes

  • #4: Most telcos lose 1-2% of their customers every month.It’s 7x more expensive to acquire a customer, than to retain.
  • #14: Birds of a feather flock together; cancellers clump together, so do active users. Like vinegar and water, we see enrichment for “like-like” edges in our network, and dilution of “dissimilar” edges (the A-C or C-A). Upshot: people cancellationQuestion: is this all an artifact of family plans – where a bunch of subscribers quits together? In part yes, but the trends hold up even when we do a temporal analysis.
  • #15: Key take-home point here is that this analysis , looking at the May to June transition, removes
  • #17: The stack is loosely coupled: right tool for the right job. The need for a dedicated analytics RDBMS