SlideShare a Scribd company logo
9
Most read
An introduction
to time series
data with R and
Malawian
agricultural
data
COLLEEN M. FARRELLY,
MACHINE LEARNING LEAD
Who Am I?
Consulting machine learning lead, Mpuza
Industry researcher in topological data analysis, natural
language processing, and time series analytics
Co-author of The Shape of Data (No Starch Press)
Predictions
through Time
Sales volumes over quarters
Pricing of goods or food over weeks
Daily stock market trends
Yearly disease burden
Time
Dependency
Caveat…
Future system behavior depends on
current and past system states…
◦ Not independent data points
◦ Limits usage of machine learning
◦ Limits accuracy of far-off predictions
Analyzing Time
Series Data:
ARIMA
Moving averages
◦ Many types of models based on
averages over time
◦ Many that add in an autoregressive
piece to account for correlations
across time periods
◦ Prediction based on autoregression
and moving average
Analyzing
Time Series
Data: SSA
Another approach is to decompose
the time series (singular spectrum
analysis):
◦ Embed the time series
◦ Perform spectral decomposition
(singular value decomposition…)
◦ Group eigentriples and average
across the matrix diagonal
◦ Linear prediction of future time
periods
More
Advanced
Methods
Time-lag components
in machine learning
models (deep learning,
KNN, random forest…)
Partial differential
equation models (SIR
models for epidemics…)
Forecasting future
values
Detecting changes in
system behavior
Example Dataset: ARIMA and SSA
1961-2013 agricultural
land usage in Malawi
Cleaned up from World
Bank’s data on climate
change indicators by
country (Humanitarian
Data Exchange)
An introduction to time series data with R.pptx

More Related Content

Similar to An introduction to time series data with R.pptx (20)

PDF
Mastering Time Series Forecasting - Guide to Techniques, Applications, and Fu...
Data & Analytics Magazin
 
PPTX
Module 3 - Time Series.pptx
nikshaikh786
 
PPT
Enterprise_Planning_TimeSeries_And_Components
nanfei
 
PDF
Time series and forecasting from wikipedia
Monica Barros
 
PDF
Forecasting time series powerful and simple
Ivo Andreev
 
PDF
Time Series Analysis: Theory and Practice
Tetiana Ivanova
 
PPTX
Time series
amiyadash
 
PPTX
time_series and the forecastring age of RNNS.pptx
shahzebTariq11
 
PPTX
Presentation On Time Series Analysis in Mechine Learning
mahfuzur32785
 
PPTX
Long Memory presentation to SURF
Richard Hunt
 
PDF
Ac26185187
IJERA Editor
 
PDF
Time series forecasting with ARIMA
Yury Kashnitsky
 
PPTX
Unit-5 Time series data Analysis.pptx
Sheba41
 
PPTX
Time series analysis
Utkarsh Sharma
 
PDF
A Course in Time Series Analysis 1st Edition Pena D.
rewabhm44
 
PPTX
Lesson 5 arima
ankit_ppt
 
PDF
Demand time series analysis and forecasting
M Baddar
 
PDF
RDataMining slides-time-series-analysis
Yanchang Zhao
 
PPTX
Data Science and analytics, computer Science
MurugeswariC1
 
PDF
A Course in Time Series Analysis 1st Edition Pena D.
studyfortiev
 
Mastering Time Series Forecasting - Guide to Techniques, Applications, and Fu...
Data & Analytics Magazin
 
Module 3 - Time Series.pptx
nikshaikh786
 
Enterprise_Planning_TimeSeries_And_Components
nanfei
 
Time series and forecasting from wikipedia
Monica Barros
 
Forecasting time series powerful and simple
Ivo Andreev
 
Time Series Analysis: Theory and Practice
Tetiana Ivanova
 
Time series
amiyadash
 
time_series and the forecastring age of RNNS.pptx
shahzebTariq11
 
Presentation On Time Series Analysis in Mechine Learning
mahfuzur32785
 
Long Memory presentation to SURF
Richard Hunt
 
Ac26185187
IJERA Editor
 
Time series forecasting with ARIMA
Yury Kashnitsky
 
Unit-5 Time series data Analysis.pptx
Sheba41
 
Time series analysis
Utkarsh Sharma
 
A Course in Time Series Analysis 1st Edition Pena D.
rewabhm44
 
Lesson 5 arima
ankit_ppt
 
Demand time series analysis and forecasting
M Baddar
 
RDataMining slides-time-series-analysis
Yanchang Zhao
 
Data Science and analytics, computer Science
MurugeswariC1
 
A Course in Time Series Analysis 1st Edition Pena D.
studyfortiev
 

More from Colleen Farrelly (20)

PPTX
Generative AI for Social Good at Open Data Science East 2024
Colleen Farrelly
 
PPTX
Hands-On Network Science, PyData Global 2023
Colleen Farrelly
 
PPTX
Modeling Climate Change.pptx
Colleen Farrelly
 
PPTX
Natural Language Processing for Beginners.pptx
Colleen Farrelly
 
PPTX
The Shape of Data--ODSC.pptx
Colleen Farrelly
 
PPTX
Generative AI, WiDS 2023.pptx
Colleen Farrelly
 
PPTX
Emerging Technologies for Public Health in Remote Locations.pptx
Colleen Farrelly
 
PPTX
Applications of Forman-Ricci Curvature.pptx
Colleen Farrelly
 
PPTX
Geometry for Social Good.pptx
Colleen Farrelly
 
PPTX
Topology for Time Series.pptx
Colleen Farrelly
 
PPTX
An introduction to quantum machine learning.pptx
Colleen Farrelly
 
PPTX
NLP: Challenges and Opportunities in Underserved Areas
Colleen Farrelly
 
PPTX
Geometry, Data, and One Path Into Data Science.pptx
Colleen Farrelly
 
PPTX
Topological Data Analysis.pptx
Colleen Farrelly
 
PPTX
Transforming Text Data to Matrix Data via Embeddings.pptx
Colleen Farrelly
 
PPTX
Natural Language Processing in the Wild.pptx
Colleen Farrelly
 
PPTX
SAS Global 2021 Introduction to Natural Language Processing
Colleen Farrelly
 
PPTX
2021 American Mathematical Society Data Science Talk
Colleen Farrelly
 
PPTX
WIDS 2021--An Introduction to Network Science
Colleen Farrelly
 
PPTX
Technical aspects of writing poetry II--sounds
Colleen Farrelly
 
Generative AI for Social Good at Open Data Science East 2024
Colleen Farrelly
 
Hands-On Network Science, PyData Global 2023
Colleen Farrelly
 
Modeling Climate Change.pptx
Colleen Farrelly
 
Natural Language Processing for Beginners.pptx
Colleen Farrelly
 
The Shape of Data--ODSC.pptx
Colleen Farrelly
 
Generative AI, WiDS 2023.pptx
Colleen Farrelly
 
Emerging Technologies for Public Health in Remote Locations.pptx
Colleen Farrelly
 
Applications of Forman-Ricci Curvature.pptx
Colleen Farrelly
 
Geometry for Social Good.pptx
Colleen Farrelly
 
Topology for Time Series.pptx
Colleen Farrelly
 
An introduction to quantum machine learning.pptx
Colleen Farrelly
 
NLP: Challenges and Opportunities in Underserved Areas
Colleen Farrelly
 
Geometry, Data, and One Path Into Data Science.pptx
Colleen Farrelly
 
Topological Data Analysis.pptx
Colleen Farrelly
 
Transforming Text Data to Matrix Data via Embeddings.pptx
Colleen Farrelly
 
Natural Language Processing in the Wild.pptx
Colleen Farrelly
 
SAS Global 2021 Introduction to Natural Language Processing
Colleen Farrelly
 
2021 American Mathematical Society Data Science Talk
Colleen Farrelly
 
WIDS 2021--An Introduction to Network Science
Colleen Farrelly
 
Technical aspects of writing poetry II--sounds
Colleen Farrelly
 
Ad

Recently uploaded (20)

DOC
MATRIX_AMAN IRAWAN_20227479046.docbbbnnb
vanitafiani1
 
PDF
MusicVideoProjectRubric Animation production music video.pdf
ALBERTIANCASUGA
 
PPTX
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
PPTX
Numbers of a nation: how we estimate population statistics | Accessible slides
Office for National Statistics
 
PPTX
Introduction to Artificial Intelligence.pptx
StarToon1
 
PDF
How to Avoid 7 Costly Mainframe Migration Mistakes
JP Infra Pvt Ltd
 
PPT
1 DATALINK CONTROL and it's applications
karunanidhilithesh
 
PPTX
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 
PPTX
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
PDF
T2_01 Apuntes La Materia.pdfxxxxxxxxxxxxxxxxxxxxxxxxxxxxxskksk
mathiasdasilvabarcia
 
PPT
Lecture 2-1.ppt at a higher learning institution such as the university of Za...
rachealhantukumane52
 
PDF
Incident Response and Digital Forensics Certificate
VICTOR MAESTRE RAMIREZ
 
PDF
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
PDF
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
PDF
List of all the AI prompt cheat codes.pdf
Avijit Kumar Roy
 
PPTX
Usage of Power BI for Pharmaceutical Data analysis.pptx
Anisha Herala
 
PPTX
Climate Action.pptx action plan for climate
justfortalabat
 
PPTX
Resmed Rady Landis May 4th - analytics.pptx
Adrian Limanto
 
PDF
apidays Helsinki & North 2025 - APIs in the healthcare sector: hospitals inte...
apidays
 
PPTX
DATA-COLLECTION METHODS, TYPES AND SOURCES
biggdaad011
 
MATRIX_AMAN IRAWAN_20227479046.docbbbnnb
vanitafiani1
 
MusicVideoProjectRubric Animation production music video.pdf
ALBERTIANCASUGA
 
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
Numbers of a nation: how we estimate population statistics | Accessible slides
Office for National Statistics
 
Introduction to Artificial Intelligence.pptx
StarToon1
 
How to Avoid 7 Costly Mainframe Migration Mistakes
JP Infra Pvt Ltd
 
1 DATALINK CONTROL and it's applications
karunanidhilithesh
 
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
T2_01 Apuntes La Materia.pdfxxxxxxxxxxxxxxxxxxxxxxxxxxxxxskksk
mathiasdasilvabarcia
 
Lecture 2-1.ppt at a higher learning institution such as the university of Za...
rachealhantukumane52
 
Incident Response and Digital Forensics Certificate
VICTOR MAESTRE RAMIREZ
 
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
List of all the AI prompt cheat codes.pdf
Avijit Kumar Roy
 
Usage of Power BI for Pharmaceutical Data analysis.pptx
Anisha Herala
 
Climate Action.pptx action plan for climate
justfortalabat
 
Resmed Rady Landis May 4th - analytics.pptx
Adrian Limanto
 
apidays Helsinki & North 2025 - APIs in the healthcare sector: hospitals inte...
apidays
 
DATA-COLLECTION METHODS, TYPES AND SOURCES
biggdaad011
 
Ad

An introduction to time series data with R.pptx

  • 1. An introduction to time series data with R and Malawian agricultural data COLLEEN M. FARRELLY, MACHINE LEARNING LEAD
  • 2. Who Am I? Consulting machine learning lead, Mpuza Industry researcher in topological data analysis, natural language processing, and time series analytics Co-author of The Shape of Data (No Starch Press)
  • 3. Predictions through Time Sales volumes over quarters Pricing of goods or food over weeks Daily stock market trends Yearly disease burden
  • 4. Time Dependency Caveat… Future system behavior depends on current and past system states… ◦ Not independent data points ◦ Limits usage of machine learning ◦ Limits accuracy of far-off predictions
  • 5. Analyzing Time Series Data: ARIMA Moving averages ◦ Many types of models based on averages over time ◦ Many that add in an autoregressive piece to account for correlations across time periods ◦ Prediction based on autoregression and moving average
  • 6. Analyzing Time Series Data: SSA Another approach is to decompose the time series (singular spectrum analysis): ◦ Embed the time series ◦ Perform spectral decomposition (singular value decomposition…) ◦ Group eigentriples and average across the matrix diagonal ◦ Linear prediction of future time periods
  • 7. More Advanced Methods Time-lag components in machine learning models (deep learning, KNN, random forest…) Partial differential equation models (SIR models for epidemics…) Forecasting future values Detecting changes in system behavior
  • 8. Example Dataset: ARIMA and SSA 1961-2013 agricultural land usage in Malawi Cleaned up from World Bank’s data on climate change indicators by country (Humanitarian Data Exchange)