SlideShare a Scribd company logo
10
Most read
14
Most read
23
Most read
Introduction to
Clinical SAS Programming
Introduction to clinical sas programming
Source: http://www.cdisc.org/standards-and-implementations
CDISC – The Clinical Data Interchange
Consortium, non-profitable organization, that
defines the world-wide standards for
representing clinical data, required by the
regulatory authorities as part of submissions
Source: www.bioforum.co.il
Source: http://www.cros.it/statistical-analysis.html
Career in Clinical Data Management
• Data Entry Operator
• Data Validation Executive
• QA Executive
• Data Manager
• QA Manager
• Statistical Programmer
• Statistician
• Data Reviewer
• Data Base Designer
• Medical Writer
• Head –Data Management
What is the job of Clinical SAS Programmer?
• Manage hundreds of datasets
• Thousands of data points
• Understand all interdependencies of the data points
• Deliver accurate, timely, and reproducible analyses that determine
the approval of novel therapies
• Conclusion
• Clinical SAS Programming is a difficult mental activity and requires
uninterrupted concentration.
Source: “Motivating Clinical SAS Programmers” from Daniel Boisvert
Attributes which makes Programmers Competent
• SAS technical skills
• Clinical trial understanding
• Basic understanding of statistics in clinical trial
• Industry data standards and guidelines
• Analyst mindset (planning, execution, problem solving and decision
making skill)
• Soft skill (communication skill and basic etiquette)
• Industry collaboration (conferences, microblogging sites etc.)
Source: “Competent statistical programmer: Need of business process outsourcing industry” from Imran Khan
Source: http://www.lexjansen.com/nesug/nesug96/NESUG96061.pdf
Documents Data Submission
Protocol, SAP
Case Report Form (CRF)
Table Shells
SAS Dataset format or
others (xpt, xls, xlsx)
Tables, Figures, Listings (TFLs)
Define.xml, Review Guide
Clinical Reporting Summary
• Input: reading in the source data
• Analysis: determining the analysis result
• Output: presenting the analysis result
Source: http://www.phusewiki.org/wiki/index.php?title=Clinical_Reporting_Summary
SAS
• SAS (Statistical Analysis Software)
was founded in 1976 by James
Goodnight and several colleagues
from North Carolina State University
• Originally designed to mine
agricultural research, SAS's software
was quickly adopted by corporate,
government, and academic
customers.
• $3 billion business-analytics
juggernaut, with 12,000 employees
and an unbroken 35-year track
record of revenue growth
Source: Forbes, “Roundup Of Analytics, Big Data & Business Intelligence Forecasts And Market Estimates, 2015”
Source: http://cos.name/2010/12/think-sas-2/
What is SAS?
It’s not only a statistical software,
but also a Business Analytics and
Business Intelligence Software
• 1.传统SAS(编程驱动)
• 1-1 基础模块(Base SAS):包括类似于PL/L的第四代编程语言data steps、SQL、ODS、XML Engine、Macro以及大量的内置函数(支
持Perl 正则表达式)和过程步(procedures)等。对一个SAS程序员来说,这个BASE模块几乎就是全部(说学SAS,就是先学这个,这不
需要统计背景或者计算机背景)。又,狭义的SAS Base仅指data steps。
• 1-2 数据存取模块(SAS/ACCESS):支持大量的PC文件(除文本文件外,还包括Excel、SPSS的.sav、Stata的.dta等等)以及所有主流
的关系数据库及ERP系统(Oracle、SAP、SQL Server、DB2、MySQL等等)
• 1-3 作图模块(SAS/GRAPH):SAS作图功能强大,只是模样够土。新版SAS 9.2在这块有不少让人兴奋的改进,比如支持ODS、TrueColor、
ActiveX、SVG(Scalable Vector Graphics)等,另外还加了一个graph editor,新潮不少。
• 1-4 统计分析模块(SAS/STAT ) :包括回归模型、方差分析模型、混合模型、贝叶斯分析、分类数据分析、多元分析(主成分和因子
分析等)、判别分析、聚类分析、生存分析、非参数分析等,多是我不熟悉的领域,只管罗列。有个Stat Studio支持R。
• 1-5 时间序列与计量经济学模块(SAS/ETS ):同上,X11、X12、ARIMA、PANEL、AUTOREG、……。又,做资产组合等投资分析也是
在这个模块。
• 1-6 矩阵运算模块(SAS/IML) 1-7 运筹学模块(SAS/OR) 1-8 地理信息系统模块(SAS/GIS) …… 以上部分是大部分SAS高校用户能接触到的东
西,主要用命令行实现功能。下面则是有GUI的客户端工具,一般都用它们作计算引擎。
• 2. 客户端工具
• 2-1 元数据管理(SAS Management Console,SMC):元数据(metadata)就是关于数据的数据。最简单的,一个数据集的变量属性就是
metadata。SMC是SAS产品元数据管理的统一中心。
• 2-2 ETL工具(SAS Data Integration Studio):ETL就是数据抽取(Extract)、转换(Transform)和加载(Load)。完成企业数据处理的工具还有
SAS OLAP Cube Studio、SAS Information Map Studio等。
• 2-3 数据挖掘模块(SAS Enterprise Miner,EM) :SAS的重磅产品之一。5以上是Java客户端版本,用户体验增进不少。
• 2-4 综合分析工具包(SAS Enterprise Guide,EG): 有完善的GUI界面,完成SAS从数据整合、分析到报表的一系列功能。EG与JMP,是
SAS公司的两个明星产品。
• 3. 其他
• 3-1 统计探索软件包JMP:这是一个独立于Base SAS的软件,由SAS公司的二把手John Sall主管,界面很炫,功能很强,让
业务人员爱不释手那种。JMP9支持R。
Source: http://cos.name/2010/12/think-sas-2/
Why use SAS?
• SAS is very efficient with data manipulation if you know what you're
doing. It's been designed to work with sequential tapes so it is built with the
assumption that data access is expensive. Makes wonders when you work
truly massive datasets.
• SAS is good at opening up gigantic data sets even on computer which
do not have a lot of computing power. Essentially data sets that would
crash most programs on a given computer in a heart beat can load in SAS.
• SAS as a company is smart and designs its products at corporate cost
centers. This includes doing things like company wide installations and
setting up its platform in a way that makes it easy for corporate it
departments to setup a company wide SAS infrastructure.
Source: How to start using SAS from SARBAJIT MUKHERJEE
From 2016 TIOBE
How to learn SAS?
• Learn SAS ® in 50 minutes
http://support.sas.com/resources/papers/proceedings11/054-2011.pdf
• Book: Learning SAS by Example: A Programmer's Guide
• For more recourses, check:
https://web.stanford.edu/group/ssds/cgi-
bin/drupal/files/Guides/Resources_for_Learning_SAS.pdf
Source: How to start using SAS from SARBAJIT MUKHERJEE
SAS Data Step
• The DATA step is a separate language for performing programming tasks such as data
manipulation, i.e. cleaning and editing, and data restructuring.
• The Implied Loop of the DATA Step (ILDS), internal looping. One important
consequence of the ILDS is that it may make sense to place code before the line that
reads data.
• Program Data Vector (PDV)
• it is a logical area in memory where SAS builds a data set, one observation at a time
• contains current values for all variables
• maintains two automatic variables, _N_ and _ERROR_
Source: http://www2.sas.com/proceedings/sugi31/246-31.pdf
Source: http://www.mwsug.org/proceedings/2013/BB/MWSUG-2013-BB03.pdf
SAS dataset
format
Data Step logic
SAS Code
SAS will read in three lines
automatically by it’s ILDS
Raw data
Source: http://www.lexjansen.com/nesug/nesug04/pm/pm07.pdf
The macro facility is a tool for
text substitution, which reduces
the amount of text entered for
common tasks.
Two components of Macro facility:
1.the macro processor, which is the
portion of the system that does the
work and
2.macro language, the syntax used
to communicate with the macro
processor
Source: http://www.lexjansen.com/nesug/nesug04/pm/pm07.pdf
When the word scanner detects a
macro trigger, ampersand (&) or
percent (%), it sends information
and temporarily turns processing
over to the macro processor.
Source: http://www.lexjansen.com/nesug/nesug04/pm/pm07.pdf
Source: http://www.lexjansen.com/nesug/nesug04/pm/pm07.pdf

More Related Content

What's hot (20)

PPTX
Sas demo
rvmfinishingschool
 
PDF
Clinical sas training overview
Great Online Training
 
PPTX
SAS Clinical Online Training
Manga SubbuNaidu
 
PDF
A Roadmap for SAS Programmers to Clinical Statistical Programming
Mohammad Majharul Alam
 
PDF
CDISC-CDASH
Gowthami6789
 
PPTX
Clinical Data Management: Best Practices and Key Considerations
ClinosolIndia
 
PDF
Discrepany Management_Katalyst HLS
Katalyst HLS
 
PDF
INTERPRETING CDISC ADaM IG THROUGH USERS INTERPRETATION
Angelo Tinazzi
 
PPT
SDTM - Adverse Events Vs. Clinical Events
Vijayaraghava Karpurapu
 
PDF
Case Report Form (CRF) Design Tips
Perficient
 
PPT
SAS Proc SQL
guest2160992
 
PPTX
CRF / e-CRF Completion Guidline (CCG).pptx
Siro Clinical Research Institute
 
PPTX
SDTM (Study Data Tabulation Model)
SWAROOP KUMAR K
 
PPTX
Clinical data-management-overview
Acri India
 
PPT
SAS Functions
guest2160992
 
PPT
CDISC SDTM Domain Presentation
Ankur Sharma
 
PPTX
Clinical Data Management
DABBETA DIVYA
 
PPTX
How to create edit checks in medidata rave painlessly
Weihong Yang
 
PPTX
Clinical Data Management
Shray Jali
 
Clinical sas training overview
Great Online Training
 
SAS Clinical Online Training
Manga SubbuNaidu
 
A Roadmap for SAS Programmers to Clinical Statistical Programming
Mohammad Majharul Alam
 
CDISC-CDASH
Gowthami6789
 
Clinical Data Management: Best Practices and Key Considerations
ClinosolIndia
 
Discrepany Management_Katalyst HLS
Katalyst HLS
 
INTERPRETING CDISC ADaM IG THROUGH USERS INTERPRETATION
Angelo Tinazzi
 
SDTM - Adverse Events Vs. Clinical Events
Vijayaraghava Karpurapu
 
Case Report Form (CRF) Design Tips
Perficient
 
SAS Proc SQL
guest2160992
 
CRF / e-CRF Completion Guidline (CCG).pptx
Siro Clinical Research Institute
 
SDTM (Study Data Tabulation Model)
SWAROOP KUMAR K
 
Clinical data-management-overview
Acri India
 
SAS Functions
guest2160992
 
CDISC SDTM Domain Presentation
Ankur Sharma
 
Clinical Data Management
DABBETA DIVYA
 
How to create edit checks in medidata rave painlessly
Weihong Yang
 
Clinical Data Management
Shray Jali
 

Viewers also liked (18)

PDF
Cmu experimental design
ray4hz
 
PPT
Motivating Clinical Sas Programmers
Daniel Boisvert
 
PDF
Clinical sas programmer
ray4hz
 
PPT
SAS Macros part 1
venkatam
 
PDF
Clinical Trials - An Introduction
Dr Purnendu Sekhar Das
 
PPT
d-Wise | SAS Clinical Data Integration
d-Wise Technologies
 
PPTX
ADM Server
Fitriana Nur Dhewayani
 
PPTX
Post-lock Data Flow: From CRF to FDA
Brook White, PMP
 
PDF
Sas cheat
imaduddin91
 
PPTX
Epoch Research Institute : Introduction to CR
Epoch Research Institute India Pvt. Ltd.
 
PPT
SAS Macros part 2
venkatam
 
PPT
When best to use the %let statement, the symput routine, or the into clause t...
Arthur8898
 
PDF
Professor Peivand Pirouzi - International clinical study and safety reporting...
Pharmaceutical Compliance Inspection unit, Crown College of Canada
 
PPT
Interventional+Procedures
dhavalshah4424
 
PPT
SDTM modelling: from study protocol to SDTM-compliant datasets
Angelo Tinazzi
 
PPT
SAS Macros
guest2160992
 
PPTX
Top 10 clinical data manager interview questions and answers
cadderlux
 
Cmu experimental design
ray4hz
 
Motivating Clinical Sas Programmers
Daniel Boisvert
 
Clinical sas programmer
ray4hz
 
SAS Macros part 1
venkatam
 
Clinical Trials - An Introduction
Dr Purnendu Sekhar Das
 
d-Wise | SAS Clinical Data Integration
d-Wise Technologies
 
Post-lock Data Flow: From CRF to FDA
Brook White, PMP
 
Sas cheat
imaduddin91
 
Epoch Research Institute : Introduction to CR
Epoch Research Institute India Pvt. Ltd.
 
SAS Macros part 2
venkatam
 
When best to use the %let statement, the symput routine, or the into clause t...
Arthur8898
 
Professor Peivand Pirouzi - International clinical study and safety reporting...
Pharmaceutical Compliance Inspection unit, Crown College of Canada
 
Interventional+Procedures
dhavalshah4424
 
SDTM modelling: from study protocol to SDTM-compliant datasets
Angelo Tinazzi
 
SAS Macros
guest2160992
 
Top 10 clinical data manager interview questions and answers
cadderlux
 
Ad

Similar to Introduction to clinical sas programming (20)

PDF
Sharpening Your SAS Skills 1st Edition Sunil Gupta
hemamrsaaban
 
PPTX
Sas base programmer
Aspire Techsoft Academy
 
PDF
Sasbasecontent2
lakshmicse14
 
PPT
Harshad sas
shivankitude
 
PPTX
BAS 150 Lesson 1 Lecture
Wake Tech BAS
 
PPT
Hechsp 001 Chapter 1
Brian Kelly
 
PPTX
BAS 150 Lesson 3 Lecture
Wake Tech BAS
 
PDF
What Is SAS | SAS Tutorial For Beginners | SAS Training | SAS Programming | E...
Edureka!
 
PDF
Data Application Software Demonstration PPt
chasingalexis06
 
PPT
BASE SAS Training presentation of coding
shamarites
 
PPTX
Clinical SAS Training by Aspire Techsoft
Aspire Techsoft Academy
 
PDF
SAS Online Training
revanthonlineenquiry
 
PDF
Whats so important about sas
Sollers College
 
PDF
Introduction to-sas-1211594349119006-8
thotakoti
 
DOCX
Sample Questions The following sample questions are not in.docx
todd331
 
PDF
Sas language reference concepts
imaduddin91
 
PDF
Habits of Effective SAS Programmers
Sunil Gupta
 
PPT
8323 Stats - Lesson 1 - 03 Introduction To Sas 2008
untellectualism
 
PDF
SAS Base Programming Certification course in Pune - Aspire Techsoft
Aspire Techsoft Academy
 
PDF
Ampersand Academy - SAS Course Curriculum
Ampersand Academy
 
Sharpening Your SAS Skills 1st Edition Sunil Gupta
hemamrsaaban
 
Sas base programmer
Aspire Techsoft Academy
 
Sasbasecontent2
lakshmicse14
 
Harshad sas
shivankitude
 
BAS 150 Lesson 1 Lecture
Wake Tech BAS
 
Hechsp 001 Chapter 1
Brian Kelly
 
BAS 150 Lesson 3 Lecture
Wake Tech BAS
 
What Is SAS | SAS Tutorial For Beginners | SAS Training | SAS Programming | E...
Edureka!
 
Data Application Software Demonstration PPt
chasingalexis06
 
BASE SAS Training presentation of coding
shamarites
 
Clinical SAS Training by Aspire Techsoft
Aspire Techsoft Academy
 
SAS Online Training
revanthonlineenquiry
 
Whats so important about sas
Sollers College
 
Introduction to-sas-1211594349119006-8
thotakoti
 
Sample Questions The following sample questions are not in.docx
todd331
 
Sas language reference concepts
imaduddin91
 
Habits of Effective SAS Programmers
Sunil Gupta
 
8323 Stats - Lesson 1 - 03 Introduction To Sas 2008
untellectualism
 
SAS Base Programming Certification course in Pune - Aspire Techsoft
Aspire Techsoft Academy
 
Ampersand Academy - SAS Course Curriculum
Ampersand Academy
 
Ad

More from ray4hz (7)

PDF
Moving from programmer to statistician
ray4hz
 
PDF
Senior sas programmer
ray4hz
 
PDF
Career path for sas programmer
ray4hz
 
PDF
尽管去做——无压工作的艺术
ray4hz
 
PDF
Visualization hang zhong
ray4hz
 
PDF
Cdisc sdtm implementation_process _v1
ray4hz
 
PPT
Test your macro
ray4hz
 
Moving from programmer to statistician
ray4hz
 
Senior sas programmer
ray4hz
 
Career path for sas programmer
ray4hz
 
尽管去做——无压工作的艺术
ray4hz
 
Visualization hang zhong
ray4hz
 
Cdisc sdtm implementation_process _v1
ray4hz
 
Test your macro
ray4hz
 

Recently uploaded (20)

PPT
tuberculosiship-2106031cyyfuftufufufivifviviv
AkshaiRam
 
PDF
Web Scraping with Google Gemini 2.0 .pdf
Tamanna
 
PPTX
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
PPTX
apidays Singapore 2025 - The Quest for the Greenest LLM , Jean Philippe Ehre...
apidays
 
PPTX
apidays Singapore 2025 - From Data to Insights: Building AI-Powered Data APIs...
apidays
 
PPTX
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
PDF
What does good look like - CRAP Brighton 8 July 2025
Jan Kierzyk
 
PPT
AI Future trends and opportunities_oct7v1.ppt
SHIKHAKMEHTA
 
PPTX
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
PPTX
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
PDF
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
PPTX
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
PDF
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
PPTX
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 
PDF
AUDITABILITY & COMPLIANCE OF AI SYSTEMS IN HEALTHCARE
GAHI Youssef
 
PPTX
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 
PDF
Development and validation of the Japanese version of the Organizational Matt...
Yoga Tokuyoshi
 
PPTX
apidays Helsinki & North 2025 - Running a Successful API Program: Best Practi...
apidays
 
PDF
How to Connect Your On-Premises Site to AWS Using Site-to-Site VPN.pdf
Tamanna
 
PDF
Context Engineering for AI Agents, approaches, memories.pdf
Tamanna
 
tuberculosiship-2106031cyyfuftufufufivifviviv
AkshaiRam
 
Web Scraping with Google Gemini 2.0 .pdf
Tamanna
 
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
apidays Singapore 2025 - The Quest for the Greenest LLM , Jean Philippe Ehre...
apidays
 
apidays Singapore 2025 - From Data to Insights: Building AI-Powered Data APIs...
apidays
 
apidays Helsinki & North 2025 - Agentic AI: A Friend or Foe?, Merja Kajava (A...
apidays
 
What does good look like - CRAP Brighton 8 July 2025
Jan Kierzyk
 
AI Future trends and opportunities_oct7v1.ppt
SHIKHAKMEHTA
 
apidays Helsinki & North 2025 - Vero APIs - Experiences of API development in...
apidays
 
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
Avatar for apidays apidays PRO June 07, 2025 0 5 apidays Helsinki & North 2...
apidays
 
SlideEgg_501298-Agentic AI.pptx agentic ai
530BYManoj
 
The European Business Wallet: Why It Matters and How It Powers the EUDI Ecosy...
Lal Chandran
 
b6057ea5-8e8c-4415-90c0-ed8e9666ffcd.pptx
Anees487379
 
AUDITABILITY & COMPLIANCE OF AI SYSTEMS IN HEALTHCARE
GAHI Youssef
 
Exploring Multilingual Embeddings for Italian Semantic Search: A Pretrained a...
Sease
 
Development and validation of the Japanese version of the Organizational Matt...
Yoga Tokuyoshi
 
apidays Helsinki & North 2025 - Running a Successful API Program: Best Practi...
apidays
 
How to Connect Your On-Premises Site to AWS Using Site-to-Site VPN.pdf
Tamanna
 
Context Engineering for AI Agents, approaches, memories.pdf
Tamanna
 

Introduction to clinical sas programming

  • 3. Source: http://www.cdisc.org/standards-and-implementations CDISC – The Clinical Data Interchange Consortium, non-profitable organization, that defines the world-wide standards for representing clinical data, required by the regulatory authorities as part of submissions
  • 6. Career in Clinical Data Management • Data Entry Operator • Data Validation Executive • QA Executive • Data Manager • QA Manager • Statistical Programmer • Statistician • Data Reviewer • Data Base Designer • Medical Writer • Head –Data Management
  • 7. What is the job of Clinical SAS Programmer? • Manage hundreds of datasets • Thousands of data points • Understand all interdependencies of the data points • Deliver accurate, timely, and reproducible analyses that determine the approval of novel therapies • Conclusion • Clinical SAS Programming is a difficult mental activity and requires uninterrupted concentration. Source: “Motivating Clinical SAS Programmers” from Daniel Boisvert
  • 8. Attributes which makes Programmers Competent • SAS technical skills • Clinical trial understanding • Basic understanding of statistics in clinical trial • Industry data standards and guidelines • Analyst mindset (planning, execution, problem solving and decision making skill) • Soft skill (communication skill and basic etiquette) • Industry collaboration (conferences, microblogging sites etc.) Source: “Competent statistical programmer: Need of business process outsourcing industry” from Imran Khan
  • 10. Documents Data Submission Protocol, SAP Case Report Form (CRF) Table Shells SAS Dataset format or others (xpt, xls, xlsx) Tables, Figures, Listings (TFLs) Define.xml, Review Guide
  • 11. Clinical Reporting Summary • Input: reading in the source data • Analysis: determining the analysis result • Output: presenting the analysis result Source: http://www.phusewiki.org/wiki/index.php?title=Clinical_Reporting_Summary
  • 12. SAS • SAS (Statistical Analysis Software) was founded in 1976 by James Goodnight and several colleagues from North Carolina State University • Originally designed to mine agricultural research, SAS's software was quickly adopted by corporate, government, and academic customers. • $3 billion business-analytics juggernaut, with 12,000 employees and an unbroken 35-year track record of revenue growth Source: Forbes, “Roundup Of Analytics, Big Data & Business Intelligence Forecasts And Market Estimates, 2015”
  • 13. Source: http://cos.name/2010/12/think-sas-2/ What is SAS? It’s not only a statistical software, but also a Business Analytics and Business Intelligence Software
  • 14. • 1.传统SAS(编程驱动) • 1-1 基础模块(Base SAS):包括类似于PL/L的第四代编程语言data steps、SQL、ODS、XML Engine、Macro以及大量的内置函数(支 持Perl 正则表达式)和过程步(procedures)等。对一个SAS程序员来说,这个BASE模块几乎就是全部(说学SAS,就是先学这个,这不 需要统计背景或者计算机背景)。又,狭义的SAS Base仅指data steps。 • 1-2 数据存取模块(SAS/ACCESS):支持大量的PC文件(除文本文件外,还包括Excel、SPSS的.sav、Stata的.dta等等)以及所有主流 的关系数据库及ERP系统(Oracle、SAP、SQL Server、DB2、MySQL等等) • 1-3 作图模块(SAS/GRAPH):SAS作图功能强大,只是模样够土。新版SAS 9.2在这块有不少让人兴奋的改进,比如支持ODS、TrueColor、 ActiveX、SVG(Scalable Vector Graphics)等,另外还加了一个graph editor,新潮不少。 • 1-4 统计分析模块(SAS/STAT ) :包括回归模型、方差分析模型、混合模型、贝叶斯分析、分类数据分析、多元分析(主成分和因子 分析等)、判别分析、聚类分析、生存分析、非参数分析等,多是我不熟悉的领域,只管罗列。有个Stat Studio支持R。 • 1-5 时间序列与计量经济学模块(SAS/ETS ):同上,X11、X12、ARIMA、PANEL、AUTOREG、……。又,做资产组合等投资分析也是 在这个模块。 • 1-6 矩阵运算模块(SAS/IML) 1-7 运筹学模块(SAS/OR) 1-8 地理信息系统模块(SAS/GIS) …… 以上部分是大部分SAS高校用户能接触到的东 西,主要用命令行实现功能。下面则是有GUI的客户端工具,一般都用它们作计算引擎。 • 2. 客户端工具 • 2-1 元数据管理(SAS Management Console,SMC):元数据(metadata)就是关于数据的数据。最简单的,一个数据集的变量属性就是 metadata。SMC是SAS产品元数据管理的统一中心。 • 2-2 ETL工具(SAS Data Integration Studio):ETL就是数据抽取(Extract)、转换(Transform)和加载(Load)。完成企业数据处理的工具还有 SAS OLAP Cube Studio、SAS Information Map Studio等。 • 2-3 数据挖掘模块(SAS Enterprise Miner,EM) :SAS的重磅产品之一。5以上是Java客户端版本,用户体验增进不少。 • 2-4 综合分析工具包(SAS Enterprise Guide,EG): 有完善的GUI界面,完成SAS从数据整合、分析到报表的一系列功能。EG与JMP,是 SAS公司的两个明星产品。 • 3. 其他 • 3-1 统计探索软件包JMP:这是一个独立于Base SAS的软件,由SAS公司的二把手John Sall主管,界面很炫,功能很强,让 业务人员爱不释手那种。JMP9支持R。 Source: http://cos.name/2010/12/think-sas-2/
  • 15. Why use SAS? • SAS is very efficient with data manipulation if you know what you're doing. It's been designed to work with sequential tapes so it is built with the assumption that data access is expensive. Makes wonders when you work truly massive datasets. • SAS is good at opening up gigantic data sets even on computer which do not have a lot of computing power. Essentially data sets that would crash most programs on a given computer in a heart beat can load in SAS. • SAS as a company is smart and designs its products at corporate cost centers. This includes doing things like company wide installations and setting up its platform in a way that makes it easy for corporate it departments to setup a company wide SAS infrastructure. Source: How to start using SAS from SARBAJIT MUKHERJEE From 2016 TIOBE
  • 16. How to learn SAS? • Learn SAS ® in 50 minutes http://support.sas.com/resources/papers/proceedings11/054-2011.pdf • Book: Learning SAS by Example: A Programmer's Guide • For more recourses, check: https://web.stanford.edu/group/ssds/cgi- bin/drupal/files/Guides/Resources_for_Learning_SAS.pdf
  • 17. Source: How to start using SAS from SARBAJIT MUKHERJEE
  • 18. SAS Data Step • The DATA step is a separate language for performing programming tasks such as data manipulation, i.e. cleaning and editing, and data restructuring. • The Implied Loop of the DATA Step (ILDS), internal looping. One important consequence of the ILDS is that it may make sense to place code before the line that reads data. • Program Data Vector (PDV) • it is a logical area in memory where SAS builds a data set, one observation at a time • contains current values for all variables • maintains two automatic variables, _N_ and _ERROR_ Source: http://www2.sas.com/proceedings/sugi31/246-31.pdf
  • 20. Source: http://www.lexjansen.com/nesug/nesug04/pm/pm07.pdf The macro facility is a tool for text substitution, which reduces the amount of text entered for common tasks. Two components of Macro facility: 1.the macro processor, which is the portion of the system that does the work and 2.macro language, the syntax used to communicate with the macro processor
  • 21. Source: http://www.lexjansen.com/nesug/nesug04/pm/pm07.pdf When the word scanner detects a macro trigger, ampersand (&) or percent (%), it sends information and temporarily turns processing over to the macro processor.