SlideShare a Scribd company logo
Data Science Company 
DataScience for e-commerce 
Infofarm - Seminar 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
25/11/2014
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Agenda 
• About us 
• What is Data Science? 
e-commerce vs Data Science vs BigData 
• Example Data Science applications in e-commerce 
some inspiration to see your opportunities… 
• Applying Data Science 
how to get started with all this?
About us 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Speakers 
• Niels Trescinski 
e-commerce Consultant 
– Fenego (Intershop) 
– Elision (Hybris) 
• Günther Van Roey 
Technical (IT) Consultant 
– InfoFarm (BigData & Data Science) 
– XT-i (software development and integration) 
– PHPro (website development)
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
InfoFarm - Team 
• Mixed skills team 
– 2 Data Scientist 
• Mathematics 
• Statistics 
– 4 BigData Consultants 
– 1 Infra specialist 
– n Cronos colleagues 
with various background 
• Certifications 
– CCDH - Cloudera Certified Hadoop Developer 
– CCAD - Cloudera Certified Hadoop Administrator 
– OCJP – Oracle Certified Java Programmer
InfoFarm + Fenego & Elision – e-commerce! 
Highly focused on 
e-commerce 
Business Knowledge 
Highly focused on 
Data Science and 
Big Data 
Technical Knowledge 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Introduction: what is Data Science? 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
What is data science? 
• Data Scientist: “A person who is better at statistics than 
any software engineer and better at software 
engineering than any statistician” 
- Josh Wills 
• “Getting meaning from data” 
Finding patterns (data mining) 
• Complementing business 
knowledge with figures
Data Science & Big Data 
• Relevance for e-commerce - use data to: 
– Increment conversion 
– Increment operational efficiency 
– Understand your customers’ needs 
– Make better offers 
– Make better recommendations 
– … 
• Many successful online business thank their position to 
smart data usage: 
– Google was the first search engine that didn’t index by keyword 
– Amazon is the e-commerce leader thanks to BigData 
– NetFlix is a world leader in personalized recommendations 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Data Science & Big Data 
• Most of us don’t run a business like the ones referred to in 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
stereotypical Big Data cases 
• Big Data does not necessarily means or requires much data 
• Data Science is very affordable to companies of all sizes 
• Typical Data Science projects are 10’s of man-days of work
Data Science & Big Data 
• Non-structured data: weblogs, social media content, … 
• Secondary use of data sources is the key 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
– eg: Weblogs 
• Are there to log webserver activity 
• But can also tell you how people find, compare and choose products! 
– eg: ERP / Cash register software 
• Prints bills 
• But can also tell you what products are typically bought together in a shop 
• Many data is present, valuable information is hidden in it!
Topics not covered in this seminar 
• Very interesting topics that we will gladly 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
elaborate upon another time: 
– Statistical Tools (R, SPSS, …) 
– Mathematical models 
– Machine Learning Techniques (Clustering, Classification, …) 
– BigData Tools & Platforms (Hadoop, Spark, …) 
– Data processing tools (Pig, Hive, …)
Example Data Science applications: 
#1: Recommendations 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Recommendations – Why? How? 
– Why? 
• Attempt to cross-sell or up-sell 
• Provide customers with alternatives that might please them even more 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
– Traditional approach 
• No recommendations at all 
• Products in the same category 
• Manually managed cross-selling opportunities per product 
– Why are these approaches fundamentally flawed? 
• They all start from the seller perspective, not the customer! 
• “We know what you should be buying” 
• Manual recommendations are too costly and time-consuming to 
maintain – even impossible with large catalogs
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Recommendations 
– Online vs Offline 
• Main focus on online, but why? 
• Who knows best what products to recommend? 
• Learn from your data, don’t take decisions based on a feeling. 
– Time based recommendations 
• Recommend or cross sell different products depending on 
– season? 
– holiday? 
– weather? 
– Customer based recommendations 
• Learn from your customers and their past. 
• Android vs iOS smartphones.
Showing (too) 
similar products? 
No color alternatives? 
No glossy/matte 
alternatives? 
No product 
Recommendations – Traditional approach 
recommendations 
at all 
(Link to category 
without match with 
specific product) 
Which roller would be 
appropriate? 
No primer + paint 
combo? 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Recommendations – what does Amazon do? 
Cross-selling 
as realized with other 
(similar?) customers 
Starts from customer 
point of view! 
Recommendations 
based on perceived 
customer journeys 
Re-use the product 
comparisons that 
previous customers 
did! 
DATA 
DRIVEN! 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Recommendations – Other ideas 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
• Data Science ideas 
– “x % of the people who looked at this item eventually bought product X or Y” 
– Get cross-selling information from ERP in the physical shops and let this feed the 
webshop recommendations! 
– Similar product in different price ranges 
(“best-buy alternative”, “deluxe alternative”) 
– ... 
• This is very achievable for a webshop of any size 
– Just generate ideas, and test to see what actually increases sales! 
• Secondary use of various kinds of non-structured data = BigData ! 
– Weblogs of e-commerce site (use to deduct customer journeys) 
– ERP info with bills and/or invoices (use to deduct cross-selling in physical shops) 
– Product information (product categorization, …)
Example Data Science applications: 
#2: Physical stores vs webshop 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Impact physical store on online? 
– Are online sales higher when physical store is nearby? 
– Where to open a new store? 
– How to approach your customers to motivate 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Impact physical shops - Why bother? 
• Determine strength of online brand vs physical brand 
– Is online sales driven by brand awareness? 
– Or is there quite a balance between the two? 
– Omni-channel strategy? 
• Know what would be the impact of opening/closing a 
physical shop, also on the online business 
– Support management decisions with facts & figures 
• Depends heavily on sector/product/case/… 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Impact physical shops - example 
• Analysis for a retailer: Physical shops vs online sales 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Impact physical shops - example 
• Impact of opening a physical shop on local online sales 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
(brand awareness?)
Impact physical shops – now what? 
• Use this correlation information: 
– As extra input for determining new shop locations 
– Publish folders focusing on online in non-covered areas 
– Use popup-stores to get brand awareness and drive online sales 
– Discounts per region 
– Google Adwords campaigns focusing on regions with limited 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
brand presence 
– Customer segmentation based on this information
Example Data Science applications: 
#3: Dynamic Pricing 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Dynamic prices 
– End of life products? 
– Relevancy of products. 
– (Local) competition. 
– Customer!
Dynamic Prices – some ideas 
• Auto-combination special offers based on cross-selling 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
info 
• Monitor stock & manage promotions accordingly 
– Example: stock of calendars in December 
(value decreases over time…) 
– Example: Customer history: needs incentive to buy? 
Why not give a small 
discount if bought 
together? 
Testing will show if 
and for which 
products and 
customers this 
increases revenue!
Dynamic Prices – some ideas 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
• Pricing vs competition 
scraping competition websites 
• Analysis of tenders vs deals 
– What type of deals do we typically win, and which not? 
= Data mining on CRM data! 
– How can we optimize our chances to make a deal? 
Which tenders should we invest in? What offer should we make? 
• Remark: in B2C scenarios, can be difficult / unwanted to 
use dynamic prices. Mind the legal impact!
Example Data Science applications: 
#4: Personalized offerings 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Personalized offering 
– Loyal (online) customer vs new customers. 
– Browsing habits and patterns. 
– Spending patterns. 
– Personalized discounts and/or content?
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Personalized offerings 
• Customer should be central in the webshop 
– Provide a truly personalized shopping experience 
– Like high-end physical shops with personal approach to VIP 
customers 
• Gather data about your customer 
– Surfing history – what products where looked at? How long? … 
– What products were bought? When? 
– Brand preference? 
– Product-segment preference? (budget, high-end, best-buy?) 
– Abandoned shopping carts 
• Take action based on information mined from this data 
– Triggered e-mails, personal recommendations, …
Personalized offerings – some ideas 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
• Imply social media 
– Are there any connections of our customer that wrote product 
recommendations that might convince him to buy? 
– Do we know the shopping behaviour of some of the customers’ 
connections? Are they in line with his/hers? Can we use this to 
make better recommendations? 
• Anticipate customer behaviour 
– Use all customer contact moments 
eg: if customer calls customer service, they should know what 
products the customer was looking at during his last visit to the 
webshop 
– Prediction model (surfing behaviour vs % deal making) 
eg: 
Low chance? Go to checkout immediately. 
High chance? Offer extra cross-selling opportunities
Example Data Science applications: 
#5: Gather external data 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Gather external data, zoom & magnify 
– Explore search trends within Google. 
– Detect what is hot on social media. 
– Magnify to the results and set clear goals/actions. 
– Take action! 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be
Gather and use external data 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
• example: how to sell a Smartwatch? 
– It’s a new product, how to market it effectively? 
– eg: SEO in line with trending topics on twitter, facebook posts, … 
– eg: SEO in line with used search terms 
• Added value: combining external data sources with own data 
• Some ideas 
– Find and follow your contacts on LinkedIn 
previous/future employers of your contacts may be great prospects for 
your B2B business! 
– Use weather info to adapt the featured product offering 
Data Science exercise: do we find any correlation between the weather 
and the product sales figures?
Example Data Science applications: 
#6: Anticipatory shipping 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Anticipatory shipping 
– Patent pending by Amazon. 
– Ships an order before it is placed. 
– Order history, search, wish list and click behaviour!
Anticipatory shipping 
• High-tech? Actually not complex at all … 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
• Steps: 
– Gather many info on past orders 
(customer info, country, product info, price, product group, 
product combinations, time of day, season, …) 
– Build a prediction model predicting “cancelled or not” based on 
all this information 
– Assess the quality of the model by training it with 90% of your 
historical orders and testing it with 10% of your historical orders 
– Pass each new order’s info and predict the likelihood of this 
order getting cancelled (0 .. 100%) and act accordingly
Example Data Science applications: 
#7: Customer Service optimizations 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Customer service 
– Losing sales/conversion/money by poor customer service. 
– Optimize information for all communication channels. 
– Which issues are your customers concerned with? 
– Allocate resources better!
Customer Service – Some Ideas 
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
• Text mining 
– Mood analysis: detect negative messages on social media, forum, … 
Put TODO on action list of customer care to contact with certain priority 
– Auto-classification of e-mails, letters, messages: 
Is this e-mail a question or a complaint? 
Is it about the quality of the product or financial (wrong invoice, …)? 
Automatic routing of messages to the right person! (operational optimization) 
• Social media 
– Social media status of customer (scoring based on profile) 
What’s would be the impact of this customer being unhappy about our service? 
• Omnichannel insights 
– What did this customer buy of look at? 
– How did he rate the last bought products? 
– Which contacts (mail, phone, …) did we have and what seems to be the most 
effective deal trigger?
Applying Data Science 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye
Veldkant 33A, Kontich ● info@infofarm.be ● www.infofarm.be 
Applying Data Science 
• Data Science does not replace business knowledge 
– Need to find balance between the two 
– Confirm or deny assumed business knowledge 
– Detect changing trends early (customer behaviour, …) 
• Not a development cycle, rather exploratory process: 
– Formulate hypotheses 
– Data mining and modeling 
– A/B testing (test new idea on x % of your customers/products/…) 
– Conclusions: did the test group show better conversion? 
– Rollout or cancel and start over! 
• Potential issues 
– Privacy law and other legal restrictions 
– Feedback loops, information leakage, wrong assumptions 
eg: trying to gather customer preferences when an order could as well have 
been a gift to someone else (perfume, …)
Questions? 
Veldkant 33A, Kontich ● info@infofarmDa.btae S●ciwewncwe. inCfoomfaprman.bye

More Related Content

What's hot (20)

PPTX
Data science
SwapnilDahake2
 
PPTX
NOSQL Databases types and Uses
Suvradeep Rudra
 
PPTX
Content Management System - CMS
Jaime Delgado
 
PDF
NOSQL- Presentation on NoSQL
Ramakant Soni
 
PPTX
PPT on Data Science Using Python
NishantKumar1179
 
PDF
Data Visualization With Tableau | Edureka
Edureka!
 
PPTX
Introduction to Hadoop
Dr. C.V. Suresh Babu
 
PPTX
Big Data PPT by Rohit Dubey
Rohit Dubey
 
PPTX
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
Simplilearn
 
PDF
Sequence Modelling with Deep Learning
Natasha Latysheva
 
PDF
Big Data
Seminar Links
 
PDF
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Edureka!
 
PDF
Hadoop Overview & Architecture
EMC
 
PPTX
Introduction to Big Data
Srinath Perera
 
PPSX
A Seminar on NoSQL Databases.
Navdeep Charan
 
PDF
GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
Andre Muscat
 
PDF
WEB I - 01 - Introduction to Web Development
Randy Connolly
 
PDF
Vector database
Guy Korland
 
PPTX
Database management system of facebook
Saidur Rahman
 
PDF
CS6007 information retrieval - 5 units notes
Anandh Arumugakan
 
Data science
SwapnilDahake2
 
NOSQL Databases types and Uses
Suvradeep Rudra
 
Content Management System - CMS
Jaime Delgado
 
NOSQL- Presentation on NoSQL
Ramakant Soni
 
PPT on Data Science Using Python
NishantKumar1179
 
Data Visualization With Tableau | Edureka
Edureka!
 
Introduction to Hadoop
Dr. C.V. Suresh Babu
 
Big Data PPT by Rohit Dubey
Rohit Dubey
 
What Is Data Science? | Introduction to Data Science | Data Science For Begin...
Simplilearn
 
Sequence Modelling with Deep Learning
Natasha Latysheva
 
Big Data
Seminar Links
 
Data Science Tutorial | Introduction To Data Science | Data Science Training ...
Edureka!
 
Hadoop Overview & Architecture
EMC
 
Introduction to Big Data
Srinath Perera
 
A Seminar on NoSQL Databases.
Navdeep Charan
 
GENERATIVE AI, THE FUTURE OF PRODUCTIVITY
Andre Muscat
 
WEB I - 01 - Introduction to Web Development
Randy Connolly
 
Vector database
Guy Korland
 
Database management system of facebook
Saidur Rahman
 
CS6007 information retrieval - 5 units notes
Anandh Arumugakan
 

Viewers also liked (20)

PPTX
Data mining with Google analytics
Greg Bray
 
PDF
Data Science and Machine Learning for eCommerce and Retail
Andrei Lopatenko
 
PDF
Applying machine learning to product categorization
Sushant Shankar
 
PDF
Magento scalability from the trenches (Meet Magento Sweden 2016)
Divante
 
PDF
How Data Science can increase Ecommerce profits
Romexsoft
 
PDF
Surprising failure factors when implementing eCommerce and Omnichannel eBusiness
Divante
 
PDF
Omnichannel Customer Experience
Divante
 
PDF
Machine Learning in Ecommerce
David Jones
 
DOCX
Boosting conversion rates on ecommerce using deep learning algorithms
Armando Vieira
 
PDF
E-commerce product classification with deep learning
Christopher Bonnett Ph.D
 
PPTX
Digital analytics with R - Sydney Users of R Forum - May 2015
Johann de Boer
 
PDF
Marketing Analytics with R Lifting Campaign Success Rates
Revolution Analytics
 
PPTX
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
PyData
 
PPTX
Google Analytics Data Mining with R
Tatvic Analytics
 
PDF
Big Data in Retail - Examples in Action
David Pittman
 
PPTX
Retail Detail OmniChannel Congress 2015 - Data Science for e-commerce
InfoFarm
 
PPTX
Data Driven Decisions seminar
InfoFarm
 
PPTX
Boosting big data with apache spark
InfoFarm
 
PPTX
Big Data with Apache Hadoop
InfoFarm
 
PPTX
SF Data Science: Developing Data Products
Peter Skomoroch
 
Data mining with Google analytics
Greg Bray
 
Data Science and Machine Learning for eCommerce and Retail
Andrei Lopatenko
 
Applying machine learning to product categorization
Sushant Shankar
 
Magento scalability from the trenches (Meet Magento Sweden 2016)
Divante
 
How Data Science can increase Ecommerce profits
Romexsoft
 
Surprising failure factors when implementing eCommerce and Omnichannel eBusiness
Divante
 
Omnichannel Customer Experience
Divante
 
Machine Learning in Ecommerce
David Jones
 
Boosting conversion rates on ecommerce using deep learning algorithms
Armando Vieira
 
E-commerce product classification with deep learning
Christopher Bonnett Ph.D
 
Digital analytics with R - Sydney Users of R Forum - May 2015
Johann de Boer
 
Marketing Analytics with R Lifting Campaign Success Rates
Revolution Analytics
 
Learn to Build an App to Find Similar Images using Deep Learning- Piotr Teterwak
PyData
 
Google Analytics Data Mining with R
Tatvic Analytics
 
Big Data in Retail - Examples in Action
David Pittman
 
Retail Detail OmniChannel Congress 2015 - Data Science for e-commerce
InfoFarm
 
Data Driven Decisions seminar
InfoFarm
 
Boosting big data with apache spark
InfoFarm
 
Big Data with Apache Hadoop
InfoFarm
 
SF Data Science: Developing Data Products
Peter Skomoroch
 
Ad

Similar to Data Science for e-commerce (20)

PPTX
Harvesting business Value with Data Science
InfoFarm
 
PPTX
Introduction to (Big) Data Science
InfoFarm
 
PPTX
User-Centered Information Architecture for e-Commerce
World IA Day Copenhagen
 
PDF
User-Centered Information Architecture for e-commerce
Søren Engelbrecht
 
PDF
Know Your Market - Know Your Customer: What Web Data Reveals if You Know Wher...
Connotate
 
PPTX
Schneider 1. E-Business Across World & Time.pptx
MuhtasimRosul
 
PPTX
Air Force Research Laboratory Small Business Hub Collider - Small Business Ma...
Billy Grill
 
PPTX
6 steps to create a data-driven content strategy
Smart Insights
 
PDF
Data Science for Digital Commerce
Manish Gupta, Ph.D.
 
PPTX
Content Marketing For Building And Construction Products
Leigh Simpson
 
PDF
Qhuba office depot schelle - seminar omni channel marketing 2014-11-25 brief
Faces of Content
 
PDF
Online Industry - New Era
Dr. Ranbijay Kumar
 
PDF
Big Data in e-Commerce
Divante
 
PDF
Business Intelligence Projects Portfolio
Alina Krekhovets
 
PPT
New Marketing for the New Economy - Kotler
Filipe Mello
 
PPT
Motarme NUI Galway Technology Marketing Seminar Dec 2012
Motarme Marketing Technology
 
PDF
Machine Learning for retail and ecommerce
Andrei Lopatenko
 
PPT
Research retail software products
Suresh Koujalagi
 
PPTX
AI in Retail
Subrat Panda, PhD
 
PDF
IDM Assignment revision certificate Nov '11
Steve Kemish
 
Harvesting business Value with Data Science
InfoFarm
 
Introduction to (Big) Data Science
InfoFarm
 
User-Centered Information Architecture for e-Commerce
World IA Day Copenhagen
 
User-Centered Information Architecture for e-commerce
Søren Engelbrecht
 
Know Your Market - Know Your Customer: What Web Data Reveals if You Know Wher...
Connotate
 
Schneider 1. E-Business Across World & Time.pptx
MuhtasimRosul
 
Air Force Research Laboratory Small Business Hub Collider - Small Business Ma...
Billy Grill
 
6 steps to create a data-driven content strategy
Smart Insights
 
Data Science for Digital Commerce
Manish Gupta, Ph.D.
 
Content Marketing For Building And Construction Products
Leigh Simpson
 
Qhuba office depot schelle - seminar omni channel marketing 2014-11-25 brief
Faces of Content
 
Online Industry - New Era
Dr. Ranbijay Kumar
 
Big Data in e-Commerce
Divante
 
Business Intelligence Projects Portfolio
Alina Krekhovets
 
New Marketing for the New Economy - Kotler
Filipe Mello
 
Motarme NUI Galway Technology Marketing Seminar Dec 2012
Motarme Marketing Technology
 
Machine Learning for retail and ecommerce
Andrei Lopatenko
 
Research retail software products
Suresh Koujalagi
 
AI in Retail
Subrat Panda, PhD
 
IDM Assignment revision certificate Nov '11
Steve Kemish
 
Ad

Recently uploaded (20)

PDF
AUDITABILITY & COMPLIANCE OF AI SYSTEMS IN HEALTHCARE
GAHI Youssef
 
PDF
Early_Diabetes_Detection_using_Machine_L.pdf
maria879693
 
PPTX
Climate Action.pptx action plan for climate
justfortalabat
 
PPTX
Advanced_NLP_with_Transformers_PPT_final 50.pptx
Shiwani Gupta
 
PPTX
Numbers of a nation: how we estimate population statistics | Accessible slides
Office for National Statistics
 
PPTX
GenAI-Introduction-to-Copilot-for-Bing-March-2025-FOR-HUB.pptx
cleydsonborges1
 
PDF
Web Scraping with Google Gemini 2.0 .pdf
Tamanna
 
PDF
What does good look like - CRAP Brighton 8 July 2025
Jan Kierzyk
 
PPTX
apidays Singapore 2025 - From Data to Insights: Building AI-Powered Data APIs...
apidays
 
PDF
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
PPTX
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
PPTX
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
PDF
R Cookbook - Processing and Manipulating Geological spatial data with R.pdf
OtnielSimopiaref2
 
PPTX
apidays Singapore 2025 - Designing for Change, Julie Schiller (Google)
apidays
 
PPT
deep dive data management sharepoint apps.ppt
novaprofk
 
PPTX
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
PDF
WEF_Future_of_Global_Fintech_Second_Edition_2025.pdf
AproximacionAlFuturo
 
PPTX
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 
PPTX
Dr djdjjdsjsjsjsjsjsjjsjdjdjdjdjjd1.pptx
Nandy31
 
PPTX
ER_Model_Relationship_in_DBMS_Presentation.pptx
dharaadhvaryu1992
 
AUDITABILITY & COMPLIANCE OF AI SYSTEMS IN HEALTHCARE
GAHI Youssef
 
Early_Diabetes_Detection_using_Machine_L.pdf
maria879693
 
Climate Action.pptx action plan for climate
justfortalabat
 
Advanced_NLP_with_Transformers_PPT_final 50.pptx
Shiwani Gupta
 
Numbers of a nation: how we estimate population statistics | Accessible slides
Office for National Statistics
 
GenAI-Introduction-to-Copilot-for-Bing-March-2025-FOR-HUB.pptx
cleydsonborges1
 
Web Scraping with Google Gemini 2.0 .pdf
Tamanna
 
What does good look like - CRAP Brighton 8 July 2025
Jan Kierzyk
 
apidays Singapore 2025 - From Data to Insights: Building AI-Powered Data APIs...
apidays
 
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
apidays Munich 2025 - Building an AWS Serverless Application with Terraform, ...
apidays
 
R Cookbook - Processing and Manipulating Geological spatial data with R.pdf
OtnielSimopiaref2
 
apidays Singapore 2025 - Designing for Change, Julie Schiller (Google)
apidays
 
deep dive data management sharepoint apps.ppt
novaprofk
 
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
WEF_Future_of_Global_Fintech_Second_Edition_2025.pdf
AproximacionAlFuturo
 
apidays Helsinki & North 2025 - API access control strategies beyond JWT bear...
apidays
 
Dr djdjjdsjsjsjsjsjsjjsjdjdjdjdjjd1.pptx
Nandy31
 
ER_Model_Relationship_in_DBMS_Presentation.pptx
dharaadhvaryu1992
 

Data Science for e-commerce

  • 1. Data Science Company DataScience for e-commerce Infofarm - Seminar Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be 25/11/2014
  • 2. Veldkant 33A, Kontich [email protected] ● www.infofarm.be Agenda • About us • What is Data Science? e-commerce vs Data Science vs BigData • Example Data Science applications in e-commerce some inspiration to see your opportunities… • Applying Data Science how to get started with all this?
  • 3. About us Veldkant 33A, Kontich ● [email protected] S●ciwewncwe. inCfoomfaprman.bye
  • 4. Veldkant 33A, Kontich [email protected] ● www.infofarm.be Speakers • Niels Trescinski e-commerce Consultant – Fenego (Intershop) – Elision (Hybris) • Günther Van Roey Technical (IT) Consultant – InfoFarm (BigData & Data Science) – XT-i (software development and integration) – PHPro (website development)
  • 5. Veldkant 33A, Kontich [email protected] ● www.infofarm.be InfoFarm - Team • Mixed skills team – 2 Data Scientist • Mathematics • Statistics – 4 BigData Consultants – 1 Infra specialist – n Cronos colleagues with various background • Certifications – CCDH - Cloudera Certified Hadoop Developer – CCAD - Cloudera Certified Hadoop Administrator – OCJP – Oracle Certified Java Programmer
  • 6. InfoFarm + Fenego & Elision – e-commerce! Highly focused on e-commerce Business Knowledge Highly focused on Data Science and Big Data Technical Knowledge Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be
  • 7. Introduction: what is Data Science? Veldkant 33A, Kontich ● [email protected] S●ciwewncwe. inCfoomfaprman.bye
  • 8. Veldkant 33A, Kontich [email protected] ● www.infofarm.be What is data science? • Data Scientist: “A person who is better at statistics than any software engineer and better at software engineering than any statistician” - Josh Wills • “Getting meaning from data” Finding patterns (data mining) • Complementing business knowledge with figures
  • 9. Data Science & Big Data • Relevance for e-commerce - use data to: – Increment conversion – Increment operational efficiency – Understand your customers’ needs – Make better offers – Make better recommendations – … • Many successful online business thank their position to smart data usage: – Google was the first search engine that didn’t index by keyword – Amazon is the e-commerce leader thanks to BigData – NetFlix is a world leader in personalized recommendations Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be
  • 10. Data Science & Big Data • Most of us don’t run a business like the ones referred to in Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be stereotypical Big Data cases • Big Data does not necessarily means or requires much data • Data Science is very affordable to companies of all sizes • Typical Data Science projects are 10’s of man-days of work
  • 11. Data Science & Big Data • Non-structured data: weblogs, social media content, … • Secondary use of data sources is the key Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be – eg: Weblogs • Are there to log webserver activity • But can also tell you how people find, compare and choose products! – eg: ERP / Cash register software • Prints bills • But can also tell you what products are typically bought together in a shop • Many data is present, valuable information is hidden in it!
  • 12. Topics not covered in this seminar • Very interesting topics that we will gladly Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be elaborate upon another time: – Statistical Tools (R, SPSS, …) – Mathematical models – Machine Learning Techniques (Clustering, Classification, …) – BigData Tools & Platforms (Hadoop, Spark, …) – Data processing tools (Pig, Hive, …)
  • 13. Example Data Science applications: #1: Recommendations Veldkant 33A, Kontich ● [email protected] S●ciwewncwe. inCfoomfaprman.bye
  • 14. Recommendations – Why? How? – Why? • Attempt to cross-sell or up-sell • Provide customers with alternatives that might please them even more Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be – Traditional approach • No recommendations at all • Products in the same category • Manually managed cross-selling opportunities per product – Why are these approaches fundamentally flawed? • They all start from the seller perspective, not the customer! • “We know what you should be buying” • Manual recommendations are too costly and time-consuming to maintain – even impossible with large catalogs
  • 15. Veldkant 33A, Kontich [email protected] ● www.infofarm.be Recommendations – Online vs Offline • Main focus on online, but why? • Who knows best what products to recommend? • Learn from your data, don’t take decisions based on a feeling. – Time based recommendations • Recommend or cross sell different products depending on – season? – holiday? – weather? – Customer based recommendations • Learn from your customers and their past. • Android vs iOS smartphones.
  • 16. Showing (too) similar products? No color alternatives? No glossy/matte alternatives? No product Recommendations – Traditional approach recommendations at all (Link to category without match with specific product) Which roller would be appropriate? No primer + paint combo? Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be
  • 17. Recommendations – what does Amazon do? Cross-selling as realized with other (similar?) customers Starts from customer point of view! Recommendations based on perceived customer journeys Re-use the product comparisons that previous customers did! DATA DRIVEN! Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be
  • 18. Recommendations – Other ideas Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be • Data Science ideas – “x % of the people who looked at this item eventually bought product X or Y” – Get cross-selling information from ERP in the physical shops and let this feed the webshop recommendations! – Similar product in different price ranges (“best-buy alternative”, “deluxe alternative”) – ... • This is very achievable for a webshop of any size – Just generate ideas, and test to see what actually increases sales! • Secondary use of various kinds of non-structured data = BigData ! – Weblogs of e-commerce site (use to deduct customer journeys) – ERP info with bills and/or invoices (use to deduct cross-selling in physical shops) – Product information (product categorization, …)
  • 19. Example Data Science applications: #2: Physical stores vs webshop Veldkant 33A, Kontich ● [email protected] S●ciwewncwe. inCfoomfaprman.bye
  • 20. Impact physical store on online? – Are online sales higher when physical store is nearby? – Where to open a new store? – How to approach your customers to motivate Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be
  • 21. Impact physical shops - Why bother? • Determine strength of online brand vs physical brand – Is online sales driven by brand awareness? – Or is there quite a balance between the two? – Omni-channel strategy? • Know what would be the impact of opening/closing a physical shop, also on the online business – Support management decisions with facts & figures • Depends heavily on sector/product/case/… Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be
  • 22. Impact physical shops - example • Analysis for a retailer: Physical shops vs online sales Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be
  • 23. Impact physical shops - example • Impact of opening a physical shop on local online sales Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be (brand awareness?)
  • 24. Impact physical shops – now what? • Use this correlation information: – As extra input for determining new shop locations – Publish folders focusing on online in non-covered areas – Use popup-stores to get brand awareness and drive online sales – Discounts per region – Google Adwords campaigns focusing on regions with limited Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be brand presence – Customer segmentation based on this information
  • 25. Example Data Science applications: #3: Dynamic Pricing Veldkant 33A, Kontich ● [email protected] S●ciwewncwe. inCfoomfaprman.bye
  • 26. Veldkant 33A, Kontich [email protected] ● www.infofarm.be Dynamic prices – End of life products? – Relevancy of products. – (Local) competition. – Customer!
  • 27. Dynamic Prices – some ideas • Auto-combination special offers based on cross-selling Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be info • Monitor stock & manage promotions accordingly – Example: stock of calendars in December (value decreases over time…) – Example: Customer history: needs incentive to buy? Why not give a small discount if bought together? Testing will show if and for which products and customers this increases revenue!
  • 28. Dynamic Prices – some ideas Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be • Pricing vs competition scraping competition websites • Analysis of tenders vs deals – What type of deals do we typically win, and which not? = Data mining on CRM data! – How can we optimize our chances to make a deal? Which tenders should we invest in? What offer should we make? • Remark: in B2C scenarios, can be difficult / unwanted to use dynamic prices. Mind the legal impact!
  • 29. Example Data Science applications: #4: Personalized offerings Veldkant 33A, Kontich ● [email protected] S●ciwewncwe. inCfoomfaprman.bye
  • 30. Veldkant 33A, Kontich [email protected] ● www.infofarm.be Personalized offering – Loyal (online) customer vs new customers. – Browsing habits and patterns. – Spending patterns. – Personalized discounts and/or content?
  • 31. Veldkant 33A, Kontich [email protected] ● www.infofarm.be Personalized offerings • Customer should be central in the webshop – Provide a truly personalized shopping experience – Like high-end physical shops with personal approach to VIP customers • Gather data about your customer – Surfing history – what products where looked at? How long? … – What products were bought? When? – Brand preference? – Product-segment preference? (budget, high-end, best-buy?) – Abandoned shopping carts • Take action based on information mined from this data – Triggered e-mails, personal recommendations, …
  • 32. Personalized offerings – some ideas Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be • Imply social media – Are there any connections of our customer that wrote product recommendations that might convince him to buy? – Do we know the shopping behaviour of some of the customers’ connections? Are they in line with his/hers? Can we use this to make better recommendations? • Anticipate customer behaviour – Use all customer contact moments eg: if customer calls customer service, they should know what products the customer was looking at during his last visit to the webshop – Prediction model (surfing behaviour vs % deal making) eg: Low chance? Go to checkout immediately. High chance? Offer extra cross-selling opportunities
  • 33. Example Data Science applications: #5: Gather external data Veldkant 33A, Kontich ● [email protected] S●ciwewncwe. inCfoomfaprman.bye
  • 34. Gather external data, zoom & magnify – Explore search trends within Google. – Detect what is hot on social media. – Magnify to the results and set clear goals/actions. – Take action! Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be
  • 35. Gather and use external data Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be • example: how to sell a Smartwatch? – It’s a new product, how to market it effectively? – eg: SEO in line with trending topics on twitter, facebook posts, … – eg: SEO in line with used search terms • Added value: combining external data sources with own data • Some ideas – Find and follow your contacts on LinkedIn previous/future employers of your contacts may be great prospects for your B2B business! – Use weather info to adapt the featured product offering Data Science exercise: do we find any correlation between the weather and the product sales figures?
  • 36. Example Data Science applications: #6: Anticipatory shipping Veldkant 33A, Kontich ● [email protected] S●ciwewncwe. inCfoomfaprman.bye
  • 37. Veldkant 33A, Kontich [email protected] ● www.infofarm.be Anticipatory shipping – Patent pending by Amazon. – Ships an order before it is placed. – Order history, search, wish list and click behaviour!
  • 38. Anticipatory shipping • High-tech? Actually not complex at all … Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be • Steps: – Gather many info on past orders (customer info, country, product info, price, product group, product combinations, time of day, season, …) – Build a prediction model predicting “cancelled or not” based on all this information – Assess the quality of the model by training it with 90% of your historical orders and testing it with 10% of your historical orders – Pass each new order’s info and predict the likelihood of this order getting cancelled (0 .. 100%) and act accordingly
  • 39. Example Data Science applications: #7: Customer Service optimizations Veldkant 33A, Kontich ● [email protected] S●ciwewncwe. inCfoomfaprman.bye
  • 40. Veldkant 33A, Kontich [email protected] ● www.infofarm.be Customer service – Losing sales/conversion/money by poor customer service. – Optimize information for all communication channels. – Which issues are your customers concerned with? – Allocate resources better!
  • 41. Customer Service – Some Ideas Veldkant 33A, Kontich ● [email protected] ● www.infofarm.be • Text mining – Mood analysis: detect negative messages on social media, forum, … Put TODO on action list of customer care to contact with certain priority – Auto-classification of e-mails, letters, messages: Is this e-mail a question or a complaint? Is it about the quality of the product or financial (wrong invoice, …)? Automatic routing of messages to the right person! (operational optimization) • Social media – Social media status of customer (scoring based on profile) What’s would be the impact of this customer being unhappy about our service? • Omnichannel insights – What did this customer buy of look at? – How did he rate the last bought products? – Which contacts (mail, phone, …) did we have and what seems to be the most effective deal trigger?
  • 42. Applying Data Science Veldkant 33A, Kontich ● [email protected] S●ciwewncwe. inCfoomfaprman.bye
  • 43. Veldkant 33A, Kontich [email protected] ● www.infofarm.be Applying Data Science • Data Science does not replace business knowledge – Need to find balance between the two – Confirm or deny assumed business knowledge – Detect changing trends early (customer behaviour, …) • Not a development cycle, rather exploratory process: – Formulate hypotheses – Data mining and modeling – A/B testing (test new idea on x % of your customers/products/…) – Conclusions: did the test group show better conversion? – Rollout or cancel and start over! • Potential issues – Privacy law and other legal restrictions – Feedback loops, information leakage, wrong assumptions eg: trying to gather customer preferences when an order could as well have been a gift to someone else (perfume, …)
  • 44. Questions? Veldkant 33A, Kontich ● [email protected] S●ciwewncwe. inCfoomfaprman.bye