SlideShare a Scribd company logo
#DataTalkWhat is a Data Scientist?
LIVE TWEETCHAT
FEATURING:
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
Join our #DataTalk on Thursdays at 5 p.m. ET
This week, we tweeted with Dr. Michael Wu, the Chief Scientist at
Lithium, where he applies data-driven methodologies to investigate
the complex dynamics of the social web.
Check out all tweets from this Twitter chat:
ex.pn/scientist
What type of work does
a data scientist do?
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
“A data scientist’s work includes everything
from data infrastructure (capture, store,
process) to data service (retrieval).
#DataTalk
ex.pn/datatalk
Shanji Xiong
Global Chief Scientist, Experian
@ShanjiXiong
“A data scientist converts data
into business intelligence.
#DataTalk
ex.pn/datatalk
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
“A data scientist’s work includes: decision
science, business intelligence, customer
analytics, marketing analytics, fraud,
security, etc.
#DataTalk
ex.pn/datatalk
What are attributes of a
good data scientist?
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
To be a data scientist, you need the
technical expertise in computer science,
statistics, and knowledge/experience
with large data sets.
#DataTalk
ex.pn/datatalk
“
Data scientists should have good
intuition, strong coding capability, solid
training in statistics & machine learning.
#DataTalk
ex.pn/datatalk
“
Shanji Xiong
Global Chief Scientist, Experian
@ShanjiXiong
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
Educational background for data
scientists can be computational genomics,
astrophysicists, fluid dynamics, chemistry,
biophysics (like me) ...
#DataTalk
ex.pn/datatalk
“
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
To be a good data scientist, you need
more than tech expertise. You must be
a good communicator to explain
complex data/analysis.
#DataTalk
ex.pn/datatalk
“
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
Good data scientists also need to be
passionate about data. I’d highly value
curiosity, creativity and perseverance
when hiring one.
#DataTalk
ex.pn/datatalk
“
What kinds of companies have
(or need) data scientists?
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
Any company that already is invested
in modern big data infrastructure
will need data scientists to crunch the data.
“
ex.pn/datatalk
#DataTalk
All companies need to have
data scientists to stay competitive.“
ex.pn/datatalk
#DataTalk
Shanji Xiong
Global Chief Scientist, Experian
@ShanjiXiong
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
All businesses use data, and data will
grow so big that our brain and databases
eventually can’t handle … they all need
data scientists eventually.
“
ex.pn/datatalk
#DataTalk
What types of teams do data
scientists work with?
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
It all depends how the data organization
is structured within the enterprise:
independent team, hub & spoke,
or silo in dept.
ex.pn/datatalk
“
#DataTalk
Data scientists can work in R&D,
product development and support
business operations.
ex.pn/datatalk
“
#DataTalk
Shanji Xiong
Global Chief Scientist, Experian
@ShanjiXiong
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
SMBs use hub and spoke data teams
where they report to different departments,
but collaborate and work together,
so data expertise is shared.
ex.pn/datatalk
“
#DataTalk
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
Large companies typically have
entire teams of data scientists
within each department and they
usually don’t collaborate.
ex.pn/datatalk
“
#DataTalk
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
Personally, I work internally with
engineering, product, marketing,
best practice, service, consulting,
strategy, even sales and human resources.
ex.pn/datatalk
“
#DataTalk
I process data, build models, engage
with clients, and facilitate collaboration
among Experian Data Labs.
ex.pn/datatalk
“
#DataTalk
Shanji Xiong
Global Chief Scientist, Experian
@ShanjiXiong
What are some big challenges
that data scientists face?
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
One of the biggest challenges for
data scientists is communication.
Many data scientists speak tech & stats,
but they don’t speak business.
“
ex.pn/datatalk
#DataTalk
Challenges for data scientists:
data governance and what
data can be used for
“
ex.pn/datatalk
#DataTalk
Shanji Xiong
Global Chief Scientist, Experian
@ShanjiXiong
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
Other challenges for data scientists:
Data access, data integration, and motivation.“
ex.pn/datatalk
#DataTalk
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
If a company is starting a data science
initiative, their data scientist may not
have access to all data due to security
and compliance.
“
ex.pn/datatalk
#DataTalk
Is there an art and science to
working with big data?
Absolutely! Good intuition and
domain knowledge are the keys for
successful big data projects.
“
#DataTalk
ex.pn/datatalk
Shanji Xiong
Global Chief Scientist, Experian
@ShanjiXiong
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
There’s definitely science to working
with big data… there are rigorous
stats and implementation details you
learn from statistics and computer science.
“
#DataTalk
ex.pn/datatalk
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
There’s also art in working with big data,
and this only comes with years
of experience on working with big data.
“
#DataTalk
ex.pn/datatalk
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
Picking a good problem is sort of an art,
choosing right features from an infinite
number of features, too (feature engineering).
“
#DataTalk
ex.pn/datatalk
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
Exploratory data analysis (EDA):
getting a feel or a hunch for how the
data behaves is definitely an art.
“
#DataTalk
ex.pn/datatalk
How can data scientists make
a big impact in their business?
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
First, data scientists need to learn
about the business, so they have the
context to interpret the data and
result of models/analyses.
“
#DataTalk
ex.pn/datatalk
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
Second, they need to pick a good
problem: the most impactful
problem that can be addressed with
data they have access to.
“
#DataTalk
ex.pn/datatalk
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
Third, they must communicate
effectively and make businesses
understand the analysis and business
implication of the insight they found.
“
#DataTalk
ex.pn/datatalk
To be impactful, data scientists need
to keep an open mind and concentrate
efforts on most impactful problems.
“
#DataTalk
ex.pn/datatalk
Shanji Xiong
Global Chief Scientist, Experian
@ShanjiXiong
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
A good data scientist must be
a good communicator, storyteller,
teacher, etc. who can simplify
complex data science for business.
“
#DataTalk
ex.pn/datatalk
What are some big data trends?
Big data will be embraced by more
and more businesses. More decisions
will be driven by data and analytics.
#DataTalk
ex.pn/datatalk
“
Shanji Xiong
Global Chief Scientist, Experian
@ShanjiXiong
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
In past 5 years, most of the big data
tech operate at the infrastructure layer.
Now more people are focused on the
algorithm layer.
#DataTalk
ex.pn/datatalk
“
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
If there’s a big data trend, it’s the
shift from infrastructure to analytics/
algorithms on people’s big data asset.
#DataTalk
ex.pn/datatalk
“
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
It used to be that data scientists can do
everything about any data, now there is data
engineering, algorithm, decision science.
#DataTalk
ex.pn/datatalk
“
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
Now there’s experts in natural language
processing, image analysis, video/audio
processing, streaming data, etc.
#DataTalk
ex.pn/datatalk
“
Why should companies invest
more in data science?
Businesses invested in big data wisely
will have a huge competitive advantage
over their peers.
“
ex.pn/datatalk
#DataTalk
Shanji Xiong
Global Chief Scientist, Experian
@ShanjiXiong
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
Companies should invest more in data
science. They will need to eventually anyway.“
ex.pn/datatalk
#DataTalk
Any final tips for those who
want to work in data science?
Tips for new data scientists:
Keep an open mind, think outside the
box, and work hard. The future is bright.
#DataTalk
ex.pn/datatalk
“
Shanji Xiong
Global Chief Scientist, Experian
@ShanjiXiong
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
Final Tips: Learn the tech and stats,
learn the business context, learn to
communicate the tech/stats to business
to bridge the gap
#DataTalk
ex.pn/datatalk
“
Dr. Michael Wu
Chief Scientist, Lithium
@mich8elwu
Be patient, follow your passion
(which should be data), and pick a good
problem to solve.
#DataTalk
ex.pn/datatalk
“
Join our #DataTalk on Twitter
on Thursdays at 5 p.m. ET.
experian.com/datatalk

More Related Content

What's hot (20)

PDF
What data scientists really do, according to 50 data scientists
Hugo Bowne-Anderson
 
PDF
Data science vs. Data scientist by Jothi Periasamy
Peter Kua
 
PPTX
Idiots guide to setting up a data science team
Ashish Bansal
 
PDF
Building Data Science Teams
EMC
 
PDF
data scientist the sexiest job of the 21st century
Frank Kienle
 
PDF
2017 06-14-getting started with data science
Thinkful
 
PPTX
DISUMMIT - Rishi Nalin Kumar from Datakind
DigitYser
 
PPTX
Lessons Learned The Hard Way: 32+ Data Science Interviews
Gregory Kamradt
 
PDF
Is Data Scientist still the sexiest job of 21st century? Find Out!
Edureka!
 
PPTX
DISUMMIT Keynote presentation from Kirk Borne - From Sensors to Sense-Making
DigitYser
 
PDF
Big Data Maturity Model and Governance
IMC Institute
 
PDF
Data Science towards the Digital Enterprise
Jake Bouma
 
PDF
Thinkful DC - Intro to Data Science
TJ Stalcup
 
PDF
Data Science Applications | Data Science For Beginners | Data Science Trainin...
Edureka!
 
PPTX
Introduction to Data Science
Laguna State Polytechnic University
 
PDF
How to build a data science team 20115.03.13v6
Zhihao Lin
 
PPTX
5 ways to get more from data science
Tyrone Systems
 
PDF
Big data
Claire Choong
 
PPTX
Project management for Big Data projects
Sandeep Kumar, PMPÂŽ
 
PPTX
introduction to data science
bhavesh lande
 
What data scientists really do, according to 50 data scientists
Hugo Bowne-Anderson
 
Data science vs. Data scientist by Jothi Periasamy
Peter Kua
 
Idiots guide to setting up a data science team
Ashish Bansal
 
Building Data Science Teams
EMC
 
data scientist the sexiest job of the 21st century
Frank Kienle
 
2017 06-14-getting started with data science
Thinkful
 
DISUMMIT - Rishi Nalin Kumar from Datakind
DigitYser
 
Lessons Learned The Hard Way: 32+ Data Science Interviews
Gregory Kamradt
 
Is Data Scientist still the sexiest job of 21st century? Find Out!
Edureka!
 
DISUMMIT Keynote presentation from Kirk Borne - From Sensors to Sense-Making
DigitYser
 
Big Data Maturity Model and Governance
IMC Institute
 
Data Science towards the Digital Enterprise
Jake Bouma
 
Thinkful DC - Intro to Data Science
TJ Stalcup
 
Data Science Applications | Data Science For Beginners | Data Science Trainin...
Edureka!
 
Introduction to Data Science
Laguna State Polytechnic University
 
How to build a data science team 20115.03.13v6
Zhihao Lin
 
5 ways to get more from data science
Tyrone Systems
 
Big data
Claire Choong
 
Project management for Big Data projects
Sandeep Kumar, PMPÂŽ
 
introduction to data science
bhavesh lande
 

Viewers also liked (20)

PDF
Be a Data Scientist in 8 steps!
PromptCloud
 
PPTX
5 Job Skills Every Data Scientist Must Possess
Multisoft Virtual Academy
 
PPTX
Data Scientist: The Sexiest Job in the 21st Century
Lyn Fenex
 
PDF
Learn Data Science
Ryan
 
PDF
Experian State of the Automotive Finance Market
Experian_US
 
PDF
The path to be a data scientist
Poo Kuan Hoong
 
PDF
Data Science Thailand Meetup#11
Data Science Thailand
 
PPTX
The Future of Information - Experian Knows Big Data Analytics
Experian Global Decision Analytics
 
PDF
Predictive Data Analytics to Help Your Customers
Experian_US
 
PPTX
Standardizing +113 million Merchant Names in Financial Services with Greenplu...
Data Science London
 
PPTX
Introduction to Data Engineering
Vivek Aanand Ganesan
 
PPT
How to conduct a social network analysis: A tool for empowering teams and wor...
Jeromy Anglim
 
PPTX
GĂśteborg university(condensed)
Zenodia Charpy
 
PDF
How to become a data scientist in 6 months
Tetiana Ivanova
 
PDF
2 Aplicaciones Concretas para tu Data Driven Cross Channel Marketing - Miguel...
Miguel Poyatos
 
KEY
Intro to Data Science for Enterprise Big Data
Paco Nathan
 
PPTX
10 R Packages to Win Kaggle Competitions
DataRobot
 
PDF
Big Data: The Force That’s Good for Consumers and Society
Experian_US
 
PDF
Myths and Mathemagical Superpowers of Data Scientists
David Pittman
 
PDF
How to Become a Data Scientist
ryanorban
 
Be a Data Scientist in 8 steps!
PromptCloud
 
5 Job Skills Every Data Scientist Must Possess
Multisoft Virtual Academy
 
Data Scientist: The Sexiest Job in the 21st Century
Lyn Fenex
 
Learn Data Science
Ryan
 
Experian State of the Automotive Finance Market
Experian_US
 
The path to be a data scientist
Poo Kuan Hoong
 
Data Science Thailand Meetup#11
Data Science Thailand
 
The Future of Information - Experian Knows Big Data Analytics
Experian Global Decision Analytics
 
Predictive Data Analytics to Help Your Customers
Experian_US
 
Standardizing +113 million Merchant Names in Financial Services with Greenplu...
Data Science London
 
Introduction to Data Engineering
Vivek Aanand Ganesan
 
How to conduct a social network analysis: A tool for empowering teams and wor...
Jeromy Anglim
 
GĂśteborg university(condensed)
Zenodia Charpy
 
How to become a data scientist in 6 months
Tetiana Ivanova
 
2 Aplicaciones Concretas para tu Data Driven Cross Channel Marketing - Miguel...
Miguel Poyatos
 
Intro to Data Science for Enterprise Big Data
Paco Nathan
 
10 R Packages to Win Kaggle Competitions
DataRobot
 
Big Data: The Force That’s Good for Consumers and Society
Experian_US
 
Myths and Mathemagical Superpowers of Data Scientists
David Pittman
 
How to Become a Data Scientist
ryanorban
 
Ad

Similar to What is a Data Scientist (20)

PPTX
How to start thinking like a data scientist
Debashish Jana
 
PDF
What's the profile of a data scientist?
BICC Thomas More
 
PDF
Ultimate Data Science Cheat Sheet For Success
Julie Bowie
 
PPTX
Week1day2 (1)
Shaon Datta
 
PDF
Lean Analytics: How to get more out of your data science team
Digital Transformation EXPO Event Series
 
PDF
Achieving Business Success with Data.pdf
Data Science Council of America
 
PPT
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Shiv Shakti Ghosh
 
PDF
Data science tutorial
Aakashdata
 
PPTX
Data science
DeekshaSrivas
 
PPTX
Data Scientist
Prince Barai
 
PDF
1030 track1 bennett
Rising Media, Inc.
 
PDF
Who is a data scientist
prateek kumar
 
PPTX
Data science a glance
Adekunle Babatunde Anthony
 
PPTX
Data science
CHARANJEET SINGH AHLUWALIA
 
PPTX
Introduction to Big Data and Data Science
Feyzi R. Bagirov
 
PDF
What Managers Need to Know about Data Science
Annie Flippo
 
PDF
Introduction-to-Data-Science.pdf
mallikarjuntalakal
 
PDF
Introduction-to-Data-Science.pdf
ikenossama03
 
PPTX
intro to data science Clustering and visualization of data science subfields ...
jybufgofasfbkpoovh
 
PDF
So you want to be a Data Scientist?
Mohd Izhar Firdaus Ismail
 
How to start thinking like a data scientist
Debashish Jana
 
What's the profile of a data scientist?
BICC Thomas More
 
Ultimate Data Science Cheat Sheet For Success
Julie Bowie
 
Week1day2 (1)
Shaon Datta
 
Lean Analytics: How to get more out of your data science team
Digital Transformation EXPO Event Series
 
Achieving Business Success with Data.pdf
Data Science Council of America
 
Colloquium(7)_DataScience:ShivShaktiGhosh&MohitGarg
Shiv Shakti Ghosh
 
Data science tutorial
Aakashdata
 
Data science
DeekshaSrivas
 
Data Scientist
Prince Barai
 
1030 track1 bennett
Rising Media, Inc.
 
Who is a data scientist
prateek kumar
 
Data science a glance
Adekunle Babatunde Anthony
 
Introduction to Big Data and Data Science
Feyzi R. Bagirov
 
What Managers Need to Know about Data Science
Annie Flippo
 
Introduction-to-Data-Science.pdf
mallikarjuntalakal
 
Introduction-to-Data-Science.pdf
ikenossama03
 
intro to data science Clustering and visualization of data science subfields ...
jybufgofasfbkpoovh
 
So you want to be a Data Scientist?
Mohd Izhar Firdaus Ismail
 
Ad

More from Experian_US (20)

PPTX
2016 Holiday Spending Survey
Experian_US
 
PPTX
Experian financial blogger partners survey results
Experian_US
 
PDF
Credit & Money Tips for Military Families
Experian_US
 
PDF
Pay Off Debt: How to Pay Down Debt Faster
Experian_US
 
PDF
Ways to Improve Your Credit Scores
Experian_US
 
PDF
The Benefits of Community Banking
Experian_US
 
PDF
Ways to Teach Your Kids About Money
Experian_US
 
PPTX
Ecs college graduate survey report final
Experian_US
 
PDF
Investing 101: How to Prepare for Retirement
Experian_US
 
PDF
How to Tackle Student Loan Debt
Experian_US
 
PPTX
Experian Consumer Newlywed Survey
Experian_US
 
PPTX
Experian Consumer Homebuying Survey 2016
Experian_US
 
PDF
Women and Money: Building Wealth and Banishing Fear
Experian_US
 
PPTX
Experian Millennial Credit & Finance Survey Report Part II
Experian_US
 
PDF
Women, the Workplace and Money: How to Take Action Today and Plan for Tomorro...
Experian_US
 
PPTX
Experian Consumer Tax Survey
Experian_US
 
PDF
Ways to Give Thanks and Pay it Forward
Experian_US
 
PDF
Ways to Control Emotional Spending
Experian_US
 
PDF
Healthy and Frugal Holiday Eats
Experian_US
 
PDF
How to Save on Holiday Travel
Experian_US
 
2016 Holiday Spending Survey
Experian_US
 
Experian financial blogger partners survey results
Experian_US
 
Credit & Money Tips for Military Families
Experian_US
 
Pay Off Debt: How to Pay Down Debt Faster
Experian_US
 
Ways to Improve Your Credit Scores
Experian_US
 
The Benefits of Community Banking
Experian_US
 
Ways to Teach Your Kids About Money
Experian_US
 
Ecs college graduate survey report final
Experian_US
 
Investing 101: How to Prepare for Retirement
Experian_US
 
How to Tackle Student Loan Debt
Experian_US
 
Experian Consumer Newlywed Survey
Experian_US
 
Experian Consumer Homebuying Survey 2016
Experian_US
 
Women and Money: Building Wealth and Banishing Fear
Experian_US
 
Experian Millennial Credit & Finance Survey Report Part II
Experian_US
 
Women, the Workplace and Money: How to Take Action Today and Plan for Tomorro...
Experian_US
 
Experian Consumer Tax Survey
Experian_US
 
Ways to Give Thanks and Pay it Forward
Experian_US
 
Ways to Control Emotional Spending
Experian_US
 
Healthy and Frugal Holiday Eats
Experian_US
 
How to Save on Holiday Travel
Experian_US
 

Recently uploaded (20)

PPTX
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 
PDF
1750162332_Snapshot-of-Indias-oil-Gas-data-May-2025.pdf
sandeep718278
 
PDF
Driving Employee Engagement in a Hybrid World.pdf
Mia scott
 
PDF
JavaScript - Good or Bad? Tips for Google Tag Manager
📊 Markus Baersch
 
PDF
apidays Singapore 2025 - How APIs can make - or break - trust in your AI by S...
apidays
 
PPTX
05_Jelle Baats_Tekst.pptx_AI_Barometer_Release_Event
FinTech Belgium
 
PDF
apidays Singapore 2025 - Streaming Lakehouse with Kafka, Flink and Iceberg by...
apidays
 
PPTX
How to Add Columns and Rows in an R Data Frame
subhashenia
 
PDF
apidays Singapore 2025 - Building a Federated Future, Alex Szomora (GSMA)
apidays
 
PPTX
BinarySearchTree in datastructures in detail
kichokuttu
 
PPTX
03_Ariane BERCKMOES_Ethias.pptx_AIBarometer_release_event
FinTech Belgium
 
PDF
Technical-Report-GPS_GIS_RS-for-MSF-finalv2.pdf
KPycho
 
PDF
Using AI/ML for Space Biology Research
VICTOR MAESTRE RAMIREZ
 
PDF
NIS2 Compliance for MSPs: Roadmap, Benefits & Cybersecurity Trends (2025 Guide)
GRC Kompas
 
PPTX
apidays Singapore 2025 - Designing for Change, Julie Schiller (Google)
apidays
 
PPTX
Powerful Uses of Data Analytics You Should Know
subhashenia
 
PDF
Business implication of Artificial Intelligence.pdf
VishalChugh12
 
PPTX
04_Tamás Marton_Intuitech .pptx_AI_Barometer_2025
FinTech Belgium
 
PPTX
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
PDF
Research Methodology Overview Introduction
ayeshagul29594
 
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 
1750162332_Snapshot-of-Indias-oil-Gas-data-May-2025.pdf
sandeep718278
 
Driving Employee Engagement in a Hybrid World.pdf
Mia scott
 
JavaScript - Good or Bad? Tips for Google Tag Manager
📊 Markus Baersch
 
apidays Singapore 2025 - How APIs can make - or break - trust in your AI by S...
apidays
 
05_Jelle Baats_Tekst.pptx_AI_Barometer_Release_Event
FinTech Belgium
 
apidays Singapore 2025 - Streaming Lakehouse with Kafka, Flink and Iceberg by...
apidays
 
How to Add Columns and Rows in an R Data Frame
subhashenia
 
apidays Singapore 2025 - Building a Federated Future, Alex Szomora (GSMA)
apidays
 
BinarySearchTree in datastructures in detail
kichokuttu
 
03_Ariane BERCKMOES_Ethias.pptx_AIBarometer_release_event
FinTech Belgium
 
Technical-Report-GPS_GIS_RS-for-MSF-finalv2.pdf
KPycho
 
Using AI/ML for Space Biology Research
VICTOR MAESTRE RAMIREZ
 
NIS2 Compliance for MSPs: Roadmap, Benefits & Cybersecurity Trends (2025 Guide)
GRC Kompas
 
apidays Singapore 2025 - Designing for Change, Julie Schiller (Google)
apidays
 
Powerful Uses of Data Analytics You Should Know
subhashenia
 
Business implication of Artificial Intelligence.pdf
VishalChugh12
 
04_Tamás Marton_Intuitech .pptx_AI_Barometer_2025
FinTech Belgium
 
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
Research Methodology Overview Introduction
ayeshagul29594
 

What is a Data Scientist

  • 1. #DataTalkWhat is a Data Scientist? LIVE TWEETCHAT FEATURING: Dr. Michael Wu Chief Scientist, Lithium @mich8elwu
  • 2. Join our #DataTalk on Thursdays at 5 p.m. ET This week, we tweeted with Dr. Michael Wu, the Chief Scientist at Lithium, where he applies data-driven methodologies to investigate the complex dynamics of the social web. Check out all tweets from this Twitter chat: ex.pn/scientist
  • 3. What type of work does a data scientist do?
  • 4. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu “A data scientist’s work includes everything from data infrastructure (capture, store, process) to data service (retrieval). #DataTalk ex.pn/datatalk
  • 5. Shanji Xiong Global Chief Scientist, Experian @ShanjiXiong “A data scientist converts data into business intelligence. #DataTalk ex.pn/datatalk
  • 6. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu “A data scientist’s work includes: decision science, business intelligence, customer analytics, marketing analytics, fraud, security, etc. #DataTalk ex.pn/datatalk
  • 7. What are attributes of a good data scientist?
  • 8. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu To be a data scientist, you need the technical expertise in computer science, statistics, and knowledge/experience with large data sets. #DataTalk ex.pn/datatalk “
  • 9. Data scientists should have good intuition, strong coding capability, solid training in statistics & machine learning. #DataTalk ex.pn/datatalk “ Shanji Xiong Global Chief Scientist, Experian @ShanjiXiong
  • 10. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu Educational background for data scientists can be computational genomics, astrophysicists, fluid dynamics, chemistry, biophysics (like me) ... #DataTalk ex.pn/datatalk “
  • 11. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu To be a good data scientist, you need more than tech expertise. You must be a good communicator to explain complex data/analysis. #DataTalk ex.pn/datatalk “
  • 12. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu Good data scientists also need to be passionate about data. I’d highly value curiosity, creativity and perseverance when hiring one. #DataTalk ex.pn/datatalk “
  • 13. What kinds of companies have (or need) data scientists?
  • 14. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu Any company that already is invested in modern big data infrastructure will need data scientists to crunch the data. “ ex.pn/datatalk #DataTalk
  • 15. All companies need to have data scientists to stay competitive.“ ex.pn/datatalk #DataTalk Shanji Xiong Global Chief Scientist, Experian @ShanjiXiong
  • 16. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu All businesses use data, and data will grow so big that our brain and databases eventually can’t handle … they all need data scientists eventually. “ ex.pn/datatalk #DataTalk
  • 17. What types of teams do data scientists work with?
  • 18. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu It all depends how the data organization is structured within the enterprise: independent team, hub & spoke, or silo in dept. ex.pn/datatalk “ #DataTalk
  • 19. Data scientists can work in R&D, product development and support business operations. ex.pn/datatalk “ #DataTalk Shanji Xiong Global Chief Scientist, Experian @ShanjiXiong
  • 20. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu SMBs use hub and spoke data teams where they report to different departments, but collaborate and work together, so data expertise is shared. ex.pn/datatalk “ #DataTalk
  • 21. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu Large companies typically have entire teams of data scientists within each department and they usually don’t collaborate. ex.pn/datatalk “ #DataTalk
  • 22. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu Personally, I work internally with engineering, product, marketing, best practice, service, consulting, strategy, even sales and human resources. ex.pn/datatalk “ #DataTalk
  • 23. I process data, build models, engage with clients, and facilitate collaboration among Experian Data Labs. ex.pn/datatalk “ #DataTalk Shanji Xiong Global Chief Scientist, Experian @ShanjiXiong
  • 24. What are some big challenges that data scientists face?
  • 25. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu One of the biggest challenges for data scientists is communication. Many data scientists speak tech & stats, but they don’t speak business. “ ex.pn/datatalk #DataTalk
  • 26. Challenges for data scientists: data governance and what data can be used for “ ex.pn/datatalk #DataTalk Shanji Xiong Global Chief Scientist, Experian @ShanjiXiong
  • 27. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu Other challenges for data scientists: Data access, data integration, and motivation.“ ex.pn/datatalk #DataTalk
  • 28. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu If a company is starting a data science initiative, their data scientist may not have access to all data due to security and compliance. “ ex.pn/datatalk #DataTalk
  • 29. Is there an art and science to working with big data?
  • 30. Absolutely! Good intuition and domain knowledge are the keys for successful big data projects. “ #DataTalk ex.pn/datatalk Shanji Xiong Global Chief Scientist, Experian @ShanjiXiong
  • 31. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu There’s definitely science to working with big data… there are rigorous stats and implementation details you learn from statistics and computer science. “ #DataTalk ex.pn/datatalk
  • 32. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu There’s also art in working with big data, and this only comes with years of experience on working with big data. “ #DataTalk ex.pn/datatalk
  • 33. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu Picking a good problem is sort of an art, choosing right features from an infinite number of features, too (feature engineering). “ #DataTalk ex.pn/datatalk
  • 34. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu Exploratory data analysis (EDA): getting a feel or a hunch for how the data behaves is definitely an art. “ #DataTalk ex.pn/datatalk
  • 35. How can data scientists make a big impact in their business?
  • 36. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu First, data scientists need to learn about the business, so they have the context to interpret the data and result of models/analyses. “ #DataTalk ex.pn/datatalk
  • 37. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu Second, they need to pick a good problem: the most impactful problem that can be addressed with data they have access to. “ #DataTalk ex.pn/datatalk
  • 38. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu Third, they must communicate effectively and make businesses understand the analysis and business implication of the insight they found. “ #DataTalk ex.pn/datatalk
  • 39. To be impactful, data scientists need to keep an open mind and concentrate efforts on most impactful problems. “ #DataTalk ex.pn/datatalk Shanji Xiong Global Chief Scientist, Experian @ShanjiXiong
  • 40. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu A good data scientist must be a good communicator, storyteller, teacher, etc. who can simplify complex data science for business. “ #DataTalk ex.pn/datatalk
  • 41. What are some big data trends?
  • 42. Big data will be embraced by more and more businesses. More decisions will be driven by data and analytics. #DataTalk ex.pn/datatalk “ Shanji Xiong Global Chief Scientist, Experian @ShanjiXiong
  • 43. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu In past 5 years, most of the big data tech operate at the infrastructure layer. Now more people are focused on the algorithm layer. #DataTalk ex.pn/datatalk “
  • 44. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu If there’s a big data trend, it’s the shift from infrastructure to analytics/ algorithms on people’s big data asset. #DataTalk ex.pn/datatalk “
  • 45. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu It used to be that data scientists can do everything about any data, now there is data engineering, algorithm, decision science. #DataTalk ex.pn/datatalk “
  • 46. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu Now there’s experts in natural language processing, image analysis, video/audio processing, streaming data, etc. #DataTalk ex.pn/datatalk “
  • 47. Why should companies invest more in data science?
  • 48. Businesses invested in big data wisely will have a huge competitive advantage over their peers. “ ex.pn/datatalk #DataTalk Shanji Xiong Global Chief Scientist, Experian @ShanjiXiong
  • 49. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu Companies should invest more in data science. They will need to eventually anyway.“ ex.pn/datatalk #DataTalk
  • 50. Any final tips for those who want to work in data science?
  • 51. Tips for new data scientists: Keep an open mind, think outside the box, and work hard. The future is bright. #DataTalk ex.pn/datatalk “ Shanji Xiong Global Chief Scientist, Experian @ShanjiXiong
  • 52. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu Final Tips: Learn the tech and stats, learn the business context, learn to communicate the tech/stats to business to bridge the gap #DataTalk ex.pn/datatalk “
  • 53. Dr. Michael Wu Chief Scientist, Lithium @mich8elwu Be patient, follow your passion (which should be data), and pick a good problem to solve. #DataTalk ex.pn/datatalk “
  • 54. Join our #DataTalk on Twitter on Thursdays at 5 p.m. ET. experian.com/datatalk