Establishing an Ethics
Framework for Predictive
Analytics in Higher Education
Cyber Summit 2016, Banff
Stephen Childs, Institutional Analyst
October 27, 2016
Disclamer
 The content of this presentation represents my views only.
and not that of my employer, the University of Calgary.
 I am not qualified to accurately describe University of Calgary
policy in the areas discussed in this talk.
 Please contact the University if you have policy questions.
Data Abundance in Higher Education
3
Big Data, Big Problems
 Advancing technology
—Better data collection
—Handle more data
—Apply algorithms to data
 We know more about our students
 Can make predictions about their behavior
 Very few guidelines about this practice
Solutions
 Develop an ethics framework around student data.
 Build on existing guidelines.
 Build on the norms of service to students
 Do this now while these practices are new.
Outline
 Introduction
 Students and Student Data
 Predictive Analytics
 Existing Frameworks
 Next Steps
About Me
About My Office
 Office of Institutional Analysis
 https://oia.ucalgary.ca/
What OIA Does
About the University
Students
Student Data
 Application
 Student Information System
 LMS
 Unicard
 Surveys
 Residence
 Facilities
 Awarding Degrees
 Grades
 USRI
 IT usage
 Others…
Student Data
 Students can opt out of some data collection, but not all
 Student give us their data because they trust us
 We need to deserve that trust!
—Respect student privacy
—Transparency about how data is used
—Accountability
—Consultation
—Consider the Consequences
Privacy
Access to Data
Transparency and Accountability
 Internalize norms is not enough!
 How Universities use data should be known
—We aren’t corporations with competitive secrets
—We need to set up ways to report and share
 We need to be able describe what happened!
 Log events
 Version control your software
 Develop reporting methods
Consultation
Consider the Consequences
 Moving from institutional decision
making to acting on individual data
 Lathe of Heaven – a mad social
scientist
Predictive Analytics
Best Practices using Predictive Analytics
 Have to carefully present information to students
—Present a positive outlook
—Don’t personalize it – talk about a group of similar
students.
 The factors in the model may be less deterministic than
unobserved factors.
 Difference between causality and correlation.
 Beware the self-fulfilling prophecy
Cathy O’Neil
 @mathbabe, mathbabe.org
 Mathematician, former hedge-fund
quant
Weapons of Math Destruction
 Three factors make a model a WMD:
—Is the participant aware of the model? Is the model
opaque or invisible?
—Does the model work against the participant’s interest? Is
it unfair? Does it create feedback loops?
—Can the model scale?
Student Data Principles
 http://studentdataprinciples.org/
 Purpose and use of student data
 Timely access to data
 Data should not replace professional judgement.
 Data governance, security, breach notification
Student Data Pledge
 http://www.edtechmagazine.com/k12/article/2015/03/prote
ct-personal-student-information-pair-organizations-
recommends-commitment
 Don’t sell student data, use data to target ads, or profile
students for non-educational purposes
 Don’t collect more information or retain information longer
than necessary.
 Do disclose how, what and why
uCalgary Data Rules
 Freedom of Information and Privacy Act (1999)
—Students must be able to correct own info
—University must provide own info upon confirmation of ID
 Categories of Data Confidentiality
 Research Ethics Boards
—Data collection for University operations does not
generally fall under REB jurisdiction.
Financial Modeler’s Manifesto
 https://www.wilmott.com/financial-modelers-manifesto/
 Emanuel Derman and Paul Wilmott – January 7, 2009
 The Modelers’ Hippocratic Oath
— I will remember that I didn’t make the world, and it doesn’t satisfy my
equations.
— Though I will use models boldly to estimate value, I will not be overly
impressed by mathematics.
— I will never sacrifice reality for elegance without explaining why I have
done so.
— Nor will I give the people who use my model false comfort about its
accuracy. Instead, I will make explicit its assumptions and oversights.
— I understand that my work may have enormous effects on society and
the economy, many of them beyond my comprehension.
Responsible Use of Student Data in Higher Education
 http://gsd.su.domains/
 Opportunity to understand student learning and enhance
educational attainment.
 New questions about the ethical collection, use, and sharing
of information.
 Commitments to honor the integrity, discretion, and humanity
of students.
 Improve practice in light of accumulating information and
knowledge.
Maciej Cegłowski
 https://pinboard.in and @pinboard
 http://idlewords.com/talks/
 Two talks on Data in particular:
—http://idlewords.com/talks/deep_fried_data.htm
—http://idlewords.com/talks/haunted_by_data.htm
Basic Framework
 Safeguard Student Privacy
—Vendors; Monetizing Data
 Strong internal norms around data
 Consider and Measure Outcomes
 Work with Data Owners and Stewards
 Responsibility to Educate
 Consult with Students and Stakeholders
 Data should have a clear purpose
Next Steps
 Write down your norms/expectations for working with
Student data
 Set up a discussion with your co-workers about it.
 Seek out others who perform a similar role and discuss it.
 Discuss with the Student Data Steward at your institution.
 Send me your comments!
Continue the Conversation
 Follow me on twitter: @sechilds
 Stephen.Childs@ucalgary.ca or sechilds@gmail.com
 https://oia.ucalgary.ca/Contact
 https://www.meetup.com/PyData-Calgary/

More Related Content

PPTX
Managing confidential data
PDF
Privacy tool osha comments
PPTX
Assessing Digital Output in New Ways
PDF
Big Data for Library Services (2017)
PPTX
Confidential data management_key_concepts
PPTX
Introduction to Big Data Analytics
PPTX
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
PDF
Data and Knowledge as Commodities
Managing confidential data
Privacy tool osha comments
Assessing Digital Output in New Ways
Big Data for Library Services (2017)
Confidential data management_key_concepts
Introduction to Big Data Analytics
BROWN BAG TALK WITH MICAH ALTMAN, SOURCES OF BIG DATA FOR SOCIAL SCIENCES
Data and Knowledge as Commodities

What's hot (20)

PDF
Data science and ethics in fundraising
PPTX
Data science.chapter-1,2,3
PPTX
Text analytics
PPTX
Open Data Bay Area: Interesting Problems in Academic Data
PDF
Making data sharing count
PDF
Slides | Research data literacy and the library
PPTX
Data Discovery and Visualization
PPT
The Semantic Web: It's for Real
PPTX
Behind the scenes of data science
PPTX
Managing Your Research Data
PPT
Linking Data to Publications through Citation and Virtual Archives
PPTX
Research Ethics and Use of Restricted Access Data
PPTX
Fsci 2018 wednesday1_august_am6
PPTX
Methodologies for Addressing Privacy and Social Issues in Health Data: A Case...
PPTX
Noshir Contractor's view on the future of Linked Data
PPT
Introduction to Data Management and Sharing
PPSX
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEW
PPTX
Data Mining and Data Warehouse
PDF
Knowledge Representation on the Web
PPTX
Advancing Foundation and Practice of Software Analytics
Data science and ethics in fundraising
Data science.chapter-1,2,3
Text analytics
Open Data Bay Area: Interesting Problems in Academic Data
Making data sharing count
Slides | Research data literacy and the library
Data Discovery and Visualization
The Semantic Web: It's for Real
Behind the scenes of data science
Managing Your Research Data
Linking Data to Publications through Citation and Virtual Archives
Research Ethics and Use of Restricted Access Data
Fsci 2018 wednesday1_august_am6
Methodologies for Addressing Privacy and Social Issues in Health Data: A Case...
Noshir Contractor's view on the future of Linked Data
Introduction to Data Management and Sharing
USING BIGDATA WITH ACADEMIC LIBRARY SERVICES: A VIEW
Data Mining and Data Warehouse
Knowledge Representation on the Web
Advancing Foundation and Practice of Software Analytics
Ad

Similar to Cyber Summit 2016: Establishing an Ethics Framework for Predictive Analytics in Higher Education (20)

PDF
Learning Analytics in action: ethics and privacy issues in the classroom
PPTX
Mary Loftus #ILTAEdTech - Ways of Seeing Learning - 2017 v0.6
PPTX
Ways of seeing learning - 2017v1.0 - NUI Galway University of Limerick postgr...
PPTX
Probabilistic Graphical Models as Predictive Feedback for Students
PPTX
Aim analytics panel (2017 Fall): Privacy & ethics
PPTX
Balancing cost, quality and access with care in online education: the role of...
PDF
Griffiths lace workshop-eden-2016
PDF
Data informed decision making - Yaz El Hakim
PPTX
Creating an action plan for learning analytics
PDF
Learning Analytics
PPTX
Analysing analytics, what is learning analytics?
PPTX
Addressing the wicked problem of learning data privacy though principle and p...
PPTX
Using student data to inform support, pedagogy & curricula: ethical issues & ...
PPTX
Mapping the ethical implications of using student data – A South African cont...
PPTX
An invitation to a conversation: Towards a South African ethical use of stude...
PPTX
Building the learning analytics curriculum: Should we teach (a code of) ethics?
PDF
Lynema Data Collection and Privacy: Library and Learning Management Systems
PPTX
Lak2018: Scaling Nationally: Seven Lesson Learned
PPTX
Using Assessment data
PPTX
Collecting, measuring, analysing and using student data in open distance/dist...
Learning Analytics in action: ethics and privacy issues in the classroom
Mary Loftus #ILTAEdTech - Ways of Seeing Learning - 2017 v0.6
Ways of seeing learning - 2017v1.0 - NUI Galway University of Limerick postgr...
Probabilistic Graphical Models as Predictive Feedback for Students
Aim analytics panel (2017 Fall): Privacy & ethics
Balancing cost, quality and access with care in online education: the role of...
Griffiths lace workshop-eden-2016
Data informed decision making - Yaz El Hakim
Creating an action plan for learning analytics
Learning Analytics
Analysing analytics, what is learning analytics?
Addressing the wicked problem of learning data privacy though principle and p...
Using student data to inform support, pedagogy & curricula: ethical issues & ...
Mapping the ethical implications of using student data – A South African cont...
An invitation to a conversation: Towards a South African ethical use of stude...
Building the learning analytics curriculum: Should we teach (a code of) ethics?
Lynema Data Collection and Privacy: Library and Learning Management Systems
Lak2018: Scaling Nationally: Seven Lesson Learned
Using Assessment data
Collecting, measuring, analysing and using student data in open distance/dist...
Ad

More from Cybera Inc. (20)

PDF
Cyber Summit 2016: Technology, Education, and Democracy
PPTX
Cyber Summit 2016: Understanding Users' (In)Secure Behaviour
PPT
Cyber Summit 2016: Insider Threat Indicators: Human Behaviour
PPTX
Cyber Summit 2016: Research Data and the Canadian Innovation Challenge
PPTX
Cyber Summit 2016: Knowing More and Understanding Less in the Age of Big Data
PDF
Cyber Summit 2016: Privacy Issues in Big Data Sharing and Reuse
PDF
Cyber Summit 2016: The Data Tsunami vs The Network: How More Data Changes Eve...
PDF
Cyber Summit 2016: Issues and Challenges Facing Municipalities In Securing Data
PPT
Cyber Summit 2016: Using Law Responsibly: What Happens When Law Meets Technol...
PPT
Privacy, Security & Access to Data
PDF
Do Universities Dream of Big Data
PDF
Predicting the Future With Microsoft Bing
PPTX
Analytics 101: How to not fail at analytics
PPTX
Are MOOC's past their peak?
PPTX
Opening the doors of the laboratory
PPTX
Open City - Edmonton
PPTX
Unlocking the power of healthcare data
PPT
Checking in on Healthcare Data Analytics
PPTX
Open access and open data: international trends and strategic context
PPTX
I didn't know i was a geomatics company
Cyber Summit 2016: Technology, Education, and Democracy
Cyber Summit 2016: Understanding Users' (In)Secure Behaviour
Cyber Summit 2016: Insider Threat Indicators: Human Behaviour
Cyber Summit 2016: Research Data and the Canadian Innovation Challenge
Cyber Summit 2016: Knowing More and Understanding Less in the Age of Big Data
Cyber Summit 2016: Privacy Issues in Big Data Sharing and Reuse
Cyber Summit 2016: The Data Tsunami vs The Network: How More Data Changes Eve...
Cyber Summit 2016: Issues and Challenges Facing Municipalities In Securing Data
Cyber Summit 2016: Using Law Responsibly: What Happens When Law Meets Technol...
Privacy, Security & Access to Data
Do Universities Dream of Big Data
Predicting the Future With Microsoft Bing
Analytics 101: How to not fail at analytics
Are MOOC's past their peak?
Opening the doors of the laboratory
Open City - Edmonton
Unlocking the power of healthcare data
Checking in on Healthcare Data Analytics
Open access and open data: international trends and strategic context
I didn't know i was a geomatics company

Recently uploaded (20)

PDF
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
PPTX
Chapter 5: Probability Theory and Statistics
PPT
What is a Computer? Input Devices /output devices
PPTX
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
PDF
sbt 2.0: go big (Scala Days 2025 edition)
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
NewMind AI Weekly Chronicles – August ’25 Week III
PDF
Two-dimensional Klein-Gordon and Sine-Gordon numerical solutions based on dee...
PDF
Hindi spoken digit analysis for native and non-native speakers
PPTX
Configure Apache Mutual Authentication
PPTX
The various Industrial Revolutions .pptx
PPT
Galois Field Theory of Risk: A Perspective, Protocol, and Mathematical Backgr...
PDF
Architecture types and enterprise applications.pdf
PPT
Geologic Time for studying geology for geologist
PDF
Developing a website for English-speaking practice to English as a foreign la...
PPTX
Final SEM Unit 1 for mit wpu at pune .pptx
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
PDF
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
PDF
STKI Israel Market Study 2025 version august
PDF
A comparative study of natural language inference in Swahili using monolingua...
From MVP to Full-Scale Product A Startup’s Software Journey.pdf
Chapter 5: Probability Theory and Statistics
What is a Computer? Input Devices /output devices
MicrosoftCybserSecurityReferenceArchitecture-April-2025.pptx
sbt 2.0: go big (Scala Days 2025 edition)
1 - Historical Antecedents, Social Consideration.pdf
NewMind AI Weekly Chronicles – August ’25 Week III
Two-dimensional Klein-Gordon and Sine-Gordon numerical solutions based on dee...
Hindi spoken digit analysis for native and non-native speakers
Configure Apache Mutual Authentication
The various Industrial Revolutions .pptx
Galois Field Theory of Risk: A Perspective, Protocol, and Mathematical Backgr...
Architecture types and enterprise applications.pdf
Geologic Time for studying geology for geologist
Developing a website for English-speaking practice to English as a foreign la...
Final SEM Unit 1 for mit wpu at pune .pptx
A contest of sentiment analysis: k-nearest neighbor versus neural network
How ambidextrous entrepreneurial leaders react to the artificial intelligence...
STKI Israel Market Study 2025 version august
A comparative study of natural language inference in Swahili using monolingua...

Cyber Summit 2016: Establishing an Ethics Framework for Predictive Analytics in Higher Education

  • 1. Establishing an Ethics Framework for Predictive Analytics in Higher Education Cyber Summit 2016, Banff Stephen Childs, Institutional Analyst October 27, 2016
  • 2. Disclamer  The content of this presentation represents my views only. and not that of my employer, the University of Calgary.  I am not qualified to accurately describe University of Calgary policy in the areas discussed in this talk.  Please contact the University if you have policy questions.
  • 3. Data Abundance in Higher Education 3
  • 4. Big Data, Big Problems  Advancing technology —Better data collection —Handle more data —Apply algorithms to data  We know more about our students  Can make predictions about their behavior  Very few guidelines about this practice
  • 5. Solutions  Develop an ethics framework around student data.  Build on existing guidelines.  Build on the norms of service to students  Do this now while these practices are new.
  • 6. Outline  Introduction  Students and Student Data  Predictive Analytics  Existing Frameworks  Next Steps
  • 8. About My Office  Office of Institutional Analysis  https://oia.ucalgary.ca/
  • 12. Student Data  Application  Student Information System  LMS  Unicard  Surveys  Residence  Facilities  Awarding Degrees  Grades  USRI  IT usage  Others…
  • 13. Student Data  Students can opt out of some data collection, but not all  Student give us their data because they trust us  We need to deserve that trust! —Respect student privacy —Transparency about how data is used —Accountability —Consultation —Consider the Consequences
  • 16. Transparency and Accountability  Internalize norms is not enough!  How Universities use data should be known —We aren’t corporations with competitive secrets —We need to set up ways to report and share  We need to be able describe what happened!  Log events  Version control your software  Develop reporting methods
  • 18. Consider the Consequences  Moving from institutional decision making to acting on individual data  Lathe of Heaven – a mad social scientist
  • 20. Best Practices using Predictive Analytics  Have to carefully present information to students —Present a positive outlook —Don’t personalize it – talk about a group of similar students.  The factors in the model may be less deterministic than unobserved factors.  Difference between causality and correlation.  Beware the self-fulfilling prophecy
  • 21. Cathy O’Neil  @mathbabe, mathbabe.org  Mathematician, former hedge-fund quant
  • 22. Weapons of Math Destruction  Three factors make a model a WMD: —Is the participant aware of the model? Is the model opaque or invisible? —Does the model work against the participant’s interest? Is it unfair? Does it create feedback loops? —Can the model scale?
  • 23. Student Data Principles  http://studentdataprinciples.org/  Purpose and use of student data  Timely access to data  Data should not replace professional judgement.  Data governance, security, breach notification
  • 24. Student Data Pledge  http://www.edtechmagazine.com/k12/article/2015/03/prote ct-personal-student-information-pair-organizations- recommends-commitment  Don’t sell student data, use data to target ads, or profile students for non-educational purposes  Don’t collect more information or retain information longer than necessary.  Do disclose how, what and why
  • 25. uCalgary Data Rules  Freedom of Information and Privacy Act (1999) —Students must be able to correct own info —University must provide own info upon confirmation of ID  Categories of Data Confidentiality  Research Ethics Boards —Data collection for University operations does not generally fall under REB jurisdiction.
  • 26. Financial Modeler’s Manifesto  https://www.wilmott.com/financial-modelers-manifesto/  Emanuel Derman and Paul Wilmott – January 7, 2009  The Modelers’ Hippocratic Oath — I will remember that I didn’t make the world, and it doesn’t satisfy my equations. — Though I will use models boldly to estimate value, I will not be overly impressed by mathematics. — I will never sacrifice reality for elegance without explaining why I have done so. — Nor will I give the people who use my model false comfort about its accuracy. Instead, I will make explicit its assumptions and oversights. — I understand that my work may have enormous effects on society and the economy, many of them beyond my comprehension.
  • 27. Responsible Use of Student Data in Higher Education  http://gsd.su.domains/  Opportunity to understand student learning and enhance educational attainment.  New questions about the ethical collection, use, and sharing of information.  Commitments to honor the integrity, discretion, and humanity of students.  Improve practice in light of accumulating information and knowledge.
  • 28. Maciej Cegłowski  https://pinboard.in and @pinboard  http://idlewords.com/talks/  Two talks on Data in particular: —http://idlewords.com/talks/deep_fried_data.htm —http://idlewords.com/talks/haunted_by_data.htm
  • 29. Basic Framework  Safeguard Student Privacy —Vendors; Monetizing Data  Strong internal norms around data  Consider and Measure Outcomes  Work with Data Owners and Stewards  Responsibility to Educate  Consult with Students and Stakeholders  Data should have a clear purpose
  • 30. Next Steps  Write down your norms/expectations for working with Student data  Set up a discussion with your co-workers about it.  Seek out others who perform a similar role and discuss it.  Discuss with the Student Data Steward at your institution.  Send me your comments!
  • 31. Continue the Conversation  Follow me on twitter: @sechilds  [email protected] or [email protected]  https://oia.ucalgary.ca/Contact  https://www.meetup.com/PyData-Calgary/

Editor's Notes

  • #4: Higher Education Institutions have a lot of data. We have a lot of data relating to our students. Asymmetric power and information relationship
  • #5: Research Ethics Boards – don’t typically cover this kind of work. How do we know if our models are BAD and if they are harming students?
  • #6: Talk Objectives: Understand why responsible use of student data matters Give you resources and language to talk about it Start a conversation about student data
  • #8: Analyst and Researcher - MA Economics from WLU Handle Student Data and Build models of student behavior Parts of my job are becoming easier Better data science tools – mention PyData Calgary Not much guidance
  • #9: Office of Institutional Analysis Reports to Vice-Provost (Planning & Resource Allocation) …Who reports to the Provost & VP (Academic) …Who reports to the President
  • #10: We don’t own the institution’s data – but we become experts in it. (Particularly student data) Create reports for: SLT, Internal stakeholders, Government, Public Fact Book, USRI (instruction ratings), University Rankings, U-15 Institutional research is great – work with data, wide variety of tasks
  • #11: 50th Anniversary Year Eyes High Energizing Eyes High Better serve students and community
  • #12: Central focus of the University Attend for various reasons: job, skills, education -- University is one of the best paths to the life they want. Student pay the University to act as a gatekeeper Demographic changes, Different support needs Asymmetrical Power and Information
  • #13: Enter the system as soon as they apply
  • #15: Privacy is just the starting point Employees should only have access to data they need Security – need to mention @SwiftOnSecurity De-Identify Data where necessary
  • #16: Access to data his a hard problem To restrictive and nothing gets done To open and problems result Employees need to internalize norms about data Highly specific to institutional culture Need employees to use their own judgment
  • #17: Version control - Software Carpentry
  • #18: Are we hearing the students about how we use their data? Do they care about this? How can we consult with students and get a useful result?
  • #19: Worth reading - but don't watch the TV movie.
  • #20: Technology used in the business world Statistics, Econometrics, Machine Learning Universities have experts on all three. Expertise starting to move into admin Bad use of aggregate statistics can lead to bad policy. Bad use of individual data can lead to bad individual outcomes Individual outcomes can be systematically bad https://ischool.syr.edu/infospace/2013/11/13/using-predictive-analytics-to-understand-your-business/ https://ischool.syr.edu/infospace/wp-content/files/2013/12/crystal-ball-e1385997891512.jpg
  • #23: Talk about examples of WMD at this point: prison recidivism models used in sentencing,