Increasing New York student attendance with Kinvolved and Data Science
Richard Sheng
@rcsheng
Data Science and Strategic Analytics
TE Connectivity
NYC Data Science Academy
• Data Science & Strategic Analytics
• Investment Banking Associate
• NYU MBA
• Principal Consultant, SAP Data Science
• Application Engineer
Disclaimer: My views are my own
• Kinvolved was co-founded by a former educator (Teach For America, NYC, 2008) and a parent advocate. Miriam and Alex began this journey while graduate students at the Robert F. Wagner School of Public Service at NYU in 2012. They completed an accelerator in August 2013 and are currently based at the Blue Ridge Foundation in Brooklyn, NY.
• Stakeholders:
  - Kinvolved
  - School principals
  - External funders
• Goals:
  - Help drive adoption of Kinvolved's product to improve attendance rates, an early predictor of dropouts
• Impact [1]:
  - Estimated lost lifetime revenue for male dropouts between the ages of 25 and 34 is approximately $944 billion, and costs associated with poor health and criminal activity have been estimated at $24 billion
1. Source: http://www.attendanceworks.org/wordpress/wp-content/uploads/2010/04/Schoeneberger_2011.pdf
Data Science Academy Student Demo day--Richard Sheng, kinvolved school attendance
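# Load the 2009-2014 attendance export (UTF-16LE encoded; read.delim defaults to tab-separated fields).
# check.names = FALSE keeps year columns such as "13-14" from being renamed on import.
attnd09to14 <- read.delim("attendance-2009-2014.csv", as.is = TRUE, header = TRUE,
                          stringsAsFactors = FALSE, fill = TRUE, check.names = FALSE,
                          fileEncoding = "UTF-16LE")
# Date conversion (step not shown on the slide)
# Keep district-level rows: District equals School, and City differs from District.
dist_attnd09to14 <- subset(attnd09to14, District == School)
dist_attnd09to14 <- subset(dist_attnd09to14, City != District)
# Assumption: `districts` is not defined on the slide; taken here as the distinct District values.
districts <- unique(dist_attnd09to14$District)
districts2 <- districts[grep("^DISTRICT", districts)]
ds <- dist_attnd09to14[dist_attnd09to14$District %in% districts2, ]
# Keep the first two columns plus one column per school year.
school.years <- c("09-10", "10-11", "11-12", "12-13", "13-14")
coln <- c(1:2, which(colnames(ds) %in% school.years))
df <- ds[coln]

# Mapping requires the ggmap package; geocode() and get_googlemap() call Google's web services.
library(ggmap)
# Build one search string per school district (1 to 32) and geocode it.
newyork_ds <- paste("new york school district", 1:32)
ds_code <- geocode(newyork_ds)
# Attach 2013-14 attendance to the coordinates, ordered by district.
df3 <- df[order(df$District), c("District", "13-14")]
data <- cbind(df3, newyork_ds, ds_code)
colnames(data)[2] <- "attnd"
# Colour encodes attendance; point size is its inverse, so low attendance draws larger points.
ds_map <- ggmap(get_googlemap(center = 'new york', zoom = 11, maptype = 'terrain'), extent = 'device') +
  geom_point(data = data, aes(x = lon, y = lat, colour = attnd, size = 1/attnd)) +
  scale_colour_gradientn(colours = c("red", "blue")) +
  scale_size_area() +
  labs(title = "New York School Attendance - '13 to '14\n")
print(ds_map)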
Kinvolved found that the majority of student absenteeism was related to asthma.
• Level 4: Exceeding the proficiency standard
• Level 3: Meeting the proficiency standard
• Level 2: Meeting the basic standard
• Level 1: Scoring below the learning standard
• % Proficiency = (students at Level 3 + students at Level 4) / all students
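The proficiency rate defined above can be sketched in R as follows. This is a minimal illustration, not the code used for the slides: the function name `pct_proficient` and the student counts in the usage line are hypothetical.

```r
# Hypothetical helper: percentage of tested students scoring at Level 3 or Level 4.
pct_proficient <- function(level1, level2, level3, level4) {
  total <- level1 + level2 + level3 + level4
  100 * (level3 + level4) / total
}

# Made-up counts for one district (illustration only, not real data).
example_rate <- pct_proficient(level1 = 10, level2 = 20, level3 = 50, level4 = 20)
print(example_rate)
```

Passing counts per level rather than precomputed percentages keeps the denominator explicit, which matters when districts differ in the number of students tested.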
The proficiency map looks fairly similar to the attendance map in the problematic areas
At the district level alone, attendance accounts for 67% of the variance in exam results
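A share-of-variance figure like this is typically the R-squared of a linear regression of proficiency on attendance. The slides do not state the method, so the sketch below is an assumption. The data frame `toy` and all of its values are invented for illustration and are not the district data.

```r
# Toy district-level data (invented values, NOT the NYC figures).
toy <- data.frame(
  attendance  = c(88.5, 90.1, 91.7, 92.4, 93.0, 94.2, 95.1),
  proficiency = c(18.0, 25.5, 24.0, 33.0, 41.5, 39.0, 52.0)
)

# Fit proficiency as a linear function of attendance.
fit <- lm(proficiency ~ attendance, data = toy)

# R-squared: the share of variance in proficiency explained by the fitted line.
r2 <- summary(fit)$r.squared
print(r2)
```

With a single predictor, this value equals the squared Pearson correlation, `cor(toy$attendance, toy$proficiency)^2`. R-squared measures association, so on its own it does not show that attendance causes the exam results.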
Q&A
Richard Sheng
@rcsheng
rcsheng@gmail.com
Data Science and Strategic Analytics
TE Connectivity
NYC Data Science Academy
