2. About me
• Education
• NCU (MIS)、NCCU (CS)
• Work Experience
• Telecom big data Innovation
• AI projects
• Retail marketing technology
• User Group
• TW Spark User Group
• TW Hadoop User Group
• Taiwan Data Engineer Association Director
• Research
• Big Data/ ML/ AIOT/ AI Columnist
2
6. 視覺化資料分析
欄位名稱 說明
Sales(銷量) Unit sales (in thousands) at each location
CompPrice(競爭者價格) Price charged by competitor at each location
Income(收入等級) Community income level (in thousands of dollars)
Advertising(廣告預算) Local advertising budget for company at each location (in thousands of dollars)
Population(人口) Population size in region (in thousands)
Price(價格) Price company charges for car seats at each site
ShelveLoc
A factor with levels Bad, Good and Medium indicating the quality of the shelving
location for the car seats at each site
Age(年齡) Average age of the local population
Education(教育程度) Education level at each location
Urban
A factor with levels No and Yes to indicate whether the store is in an urban or
rural location
US A factor with levels No and Yes to indicate whether the store is in the US or not
6
14. 補充練習需要的套件
• 套件說明
• pip install pyod
• pip install yellowbrick
• pip install lightgbm
• pip install prince
14
(Python library for detecting anomalous/outlying objects.)
(Yellowbrick is a suite of visual analysis and diagnostic tools designed to
facilitate machine learning with scikit-learn.)
(Compiled library that is included in the wheel file supports both GPU and CPU versions out of the box.)
https://pypi.org/project/lightgbm/
(Prince is a library for doing factor analysis. This includes a variety of methods including principal
component analysis (PCA) and correspondence analysis (CA) .)
15. 補充練習需要的套件
• 套件說明
• pip install sentence-transformers
• pip install plotly
• pip install shap
• 一鍵安裝
15
(This framework provides an easy method to compute dense vector representations
for sentences, paragraphs, and images.)
https://pypi.org/project/sentence-transformers
(To explain the output of any machine learning model.)
(An open-source, interactive data visualization library for Python.)
請注意,使用 plotly 的 3D 顯示,需要在 jupyterlab 3.x 的版本