SlideShare a Scribd company logo
© 2014 DataStreams Corp. All Rights Reserved. 
DataStreams Corp. 
"Always find the better value of your data" 
www.datastreams.co.kr 
Corporate Overview
© 2014 DataStreams Corp. All Rights Reserved. 
Name 
Data Streams Corp. 
CEO 
Mr. Young-sang Lee 
Business 
Area 
Data Integration Solutions Development and Sales 
Data Quality Solutions Development and Sales 
Data Warehouse / BI / FDS / Forensic / Audit Consulting and Construction 
Big Data Analytic Consulting and Platform Construction 
Data Governance Platform Development/Consulting/Sales 
Data Migration Consulting and Construction 
Large Volume Data Batch Processing Improvement Consulting and Construction 
Data Standardization and Quality Management Consulting and System Construction 
Data Architecture Consulting 
Office 
Address 
HQ Chungho-nais B/D 6F, 28 Saimdang-ro, Seocho-gu, Seoul, Korea 
R&D U-Spacemall #2 B-601, 670 Daewangpangyo-ro, Bundang-gu, Seongnam, Korea 
China Office Room 1216, 12th Floor, Intersection of Hopson Kirin Society Building 2, 
Wang Jing Fu Tong West Street, Wangjing, Chaoyang District, Bejing 
Contact 
Tel) +82-2-3473-9077 / Fax) +82-2-3473-9084 
Investor 
JAFCO ASIA 
Capital 
USD 2M 
Sales Amount 
USD 19M (2013) 
Established 
Sep 19, 2001 
Employees 
125 
1 
Company Profile
© 2014 DataStreams Corp. All Rights Reserved. 
Company History 
2 
2012 
•Established R&D Center in Pangyo Techno Valley 
•Released Social Cube for SNS Data Analytics 
•Participated in Original Technology Development Project for Next Generation Memory Based Big Data Analytics and Management 
2009 
•TeraStream™, Selected as Standard Data Integration Tool by The Korea Federation of Banks 
•Selected as Contractor for Building Resource Management Data Standardization and Meta-data Management System by Ministry of National Defense 
•Released DeltaStream™, QualityStream™, and ImpactStream™ 
2007 
•Awarded for Excellent Venture Company by Deputy Prime Minister 
•MetaStream™, Awarded for Digital Business Innovation by SMBA 
•Released TeraStream™ Version 2.0 
•JAFCO, Japan invested 4 million USD 
2005 
•Mr. Young-sang Lee, CEO, Was Awarded a Grand Prize for Korea Digital Competitiveness 
•TeraStream™ Won New Technology Certification from Ministry of Knowledge Economy 
•Released MetaStream™ Version 1.0 
•Acquired KDB Solution Co., Ltd., Korea’s First Meta-data Management Solution Company 
•TeraStream™ Version 1.4, Acquired GS(Good Software) Certification 
2003 
•KEB selected TeraStream™ as Standard Batch/ETL Solution for Next Generation Banking System 
•First Worldwide Sales Contract of FACT™ 
•Presented FACT™ to Oracle Open World 2003 in San Francisco, USA 
•Released MetaStream™ Version 2.7 
•CEO, Mr. Young-sang Lee, Was Elected as a Chairman of KOSEA(Korea Software Enterprise Association) 
•Released TeraStream™ Version 3.2 
2010 
•TeraStream™ Version 2.2, Acquired GS Certification 
•Changed Company Name to DataStreams Corp. 
•Contracted with Intellectual Property Office for Enterprise Data Quality Management 
2008 
•Awarded for Top Private Company for Population and Housing Census by Deputy Prime Minister 
•TeraStream™, Selected as Standard ETL Tool by Ministry of Government Administration and Home Affairs 
2006 
•Selected as Technically Innovated Company of 2004 by SMBA 
•Selected as Technically Innovated Company of 2004 by Small & Medium Business Administration(SMBA) 
•TeraStream™, Selected for a Next Generation Banking Data Migration Tool by Shinhan Bank 
2004 
•Registered TeraStream™ as a Trademark 
•Released TeraStream™ Designer Version 1.1 
•First TeraStream™ V.1.1 Contract with National Statistics Office 
2002 
2011 
•Awarded Prime Minister Citation for SW Achievement 
•Selected as an ATC(Advanced Technology Center) by Ministry of Knowledge Economy 
•Established China Office in Beijing 
2013 
•TeraStream™ for Hadoop, Selected as Base Solution for Building Government-Wide Big Data Infrastructure 
•Acquired Patent for Readable Data Encryption and Decryption 
•CEO, Mr. Young-sang Lee, Was Awarded Digital Management Innovation Prize 
•Joined Int’l SOFT China in Beijing 
• MetaStream™ Version 3.0, Acquired GS Certification 
• Launched DQ Appliance 
2001 
•Released TeraStream™ Beta Version 
•Innovative Data Solutions Corp., Was Established 
2014 
•TeraStream™, Was Awarded 2014 Korea Software Award by Ministrer of Science, ICT and Future 
•DataStreams Is Listed on KONEX(Korea New Exchange)
© 2014 DataStreams Corp. All Rights Reserved. 
Organizational Structure 
Present Condition of Engineering Employees 
96 
Consultants 
Data Governance 
36 
Data Integration & Migration 
Big Data Management 
DW & BI 
SNS Analytics 
Engineers 
Meta Data & Data Quality Management Solution 
60 
Data Integration & Migration Solution 
Big Data Management Platform 
Number 
Of 
Employees 
Total 
Consultants/Developers 
Management/Sales 
125 
96 
29 
Engineering Lev. 
Total 
Consultants 
Engineers 
Total 
96 
36 
60 
Professional 
27 
18 
9 
Qualified 
26 
11 
15 
Intermediate 
19 
3 
16 
Beginning 
24 
4 
20 
3 
•Government Offices 
•Banking 
•Manufacturing Business 
•Logistics/Services 
•Planning Products 
•Presales & Consulting 
•DW/BI 
•Big Data 
•SNS 
•QA(Quality Assurance) 
•Marketing 
•Overseas Sales 
•Overseas Corp. 
•HR/General Affairs 
•Financial Admin. 
•Knowledge Mgmt. 
•Sales Support 
•PI 
•DI Technical Support 
•DQ Technical Support 
•DI 
•RTI 
•DQ 
•UI 
CEO 
Auditor 
Counselor 
Sales Div. 
PPC Div. 
Business Consulting Div. 
Global Business Div. 
Management Support Div. 
Technical Service Div. 
R&D Center
© 2014 DataStreams Corp. All Rights Reserved. 
Business Area 
Data Governance 
• Data Governance Architecture 
• Data Quality Management 
• Meta Data Management 
• Master Data Management 
• Data Quality Appliance 
Data Integration 
• High Performance ETL ∙ Batch 
• Data Integration 
• Deferred(Near Real Time) 
• CDC, Real Time Data Transition 
• High Speed Data Extraction 
• High Speed Data Sort 
• Data Integration with Hadoop 
• Data Integration with Grid 
• Test Data Management 
Big Data 
• Big Data Platform with Hadoop 
• Big Data Analysis & Visualization 
• Structured & Unstructured Data Analysis 
• SNS Data Analysis ∙ Monitoring 
Consulting 
• ISP & Big Data Consulting 
• Fraud Detection System(FDS) Consulting 
• DW ∙ CRM ∙ BI Consulting 
• Data Integration & Migration Consulting 
• Data Standardization ∙ Quality Management ∙ Architecture Consulting 
• Master Data Management Consulting 
• Data Lineage Management Consulting 
DW/BI 
• Building DW ∙ CRM ∙ BI 
• QPI Methodology 
• Fraud Detection System(FDS) 
• Information System Planning(ISP) 
• Alternative Trading System Consulting 
• Transaction Cost Data Analysis Framework 
• Transaction Cost Data Analysis Framework & Consulting(TCA) 
• Financial Analysis Services 
DataStreams Is a Company Which Has Expertise in Data Processing and Analysis to Provide Total Data Management Services in Data Integration and Quality Management. 
Data Lineage Management 
• Data Lineage Analysis Platform 
• Visualization for Data Lineage 
• Relative Tool, Program & Script Language Analysis 
• Table Column Search & Monitoring 
4
© 2014 DataStreams Corp. All Rights Reserved. 
Market Recognition & Share 
5 
60% 
25% 
10% 
5% 
80% 
15% 
5% 
Korean Market Share 
for ETL Solution 
Korean Market Share 
for Data Migration 
(Banking Industry) 
DataStreams Corp. 
IBM 
Informatica 
Others 
* The market share for ETL solutions is self-researched in 2013. 
55% 
30% 
15% 
Korean Market Share 
for Metadata Management 
W Company (Korean) 
G Company (Korean) 
ETL 
Data 
Migration 
Metadata 
Management 
No. 1 Total Data Management Technologies in Korea 
Vendor Report of Magic Quadrant for Data Integration and Data Quality Tools 
(2013) 
Reference URL : http://www.citia.co.uk/ 
Mentioned DataStreams’ capabilities of offering wide range of data integration 
products through ETL, CDC and near-real time technologies.
© 2014 DataStreams Corp. All Rights Reserved. 
Private 
Banking / 
Finance Companies 
Public Finance 
Companies 
Government & 
Public / 
Educational 
Institutions 
Enterprises 
Major Domestic Customers 
6
© 2014 DataStreams Corp. All Rights Reserved. 
DataStreams Is Exporting & Expanding… 
Columbia 
Banco Colpatria 
Bogota City Government 
Credibanco 
China 
Kookmin Bank 
Hana Bank 
USA Merklenet, Inc. CSC Consulting Bisys Comcast Merkle Data Tech 
USA 
Airweb 
Sungard 
American Airlines 
Highmark, Inc. 
Mexico 
Sodexhopass 
Procesar 
Peru Banco Ripley 
Chile Banco Estado de Chile 
Australia National Wealth Management (MLC/NAB) 
Spain Procecard Tecnocom Telefonica Soluciones ITnow! 
Germany Accenture GmbH 
India Reliance Industry 
Indonesia Excelcom Aviva Telkomsel Hana Bank 
Global Customers 
7 
Japan with Reliable Business Partners 
U.S.A. BellaDati(US) 
Vietnam 
HIPT 
FPT IS 
Lac Viet 
QTSC 
EU 
BellaDati(CZE) 
Gibkie(RUS) 
IMBI(Europe, N. Africa) 
China China Mobile Fuchen Telecom & Banking / Insurance Companies
© 2014 DataStreams Corp. All Rights Reserved. 
Data Integration Product Line – 1/2 
제품 이미지 
TeraStreamTM 
•Data Extraction at High-Speed 
•Data Transformation/Conversion 
•Data Load 
•Meta Data Management 
•Real-time Monitoring 
•Shortening Processing Time 
•Dropping System Load by Improved Processing 
•Shortening Development Time by Integrated GUI 
•Cost Reduction for Data Integration Process 
•Securing Data Consistency with TeraNRTTM 
•Efficient Use of Resource by Distributed Processing 
Main Features 
Benefit 
제품 이미지 
DeltaStreamTM 
•Real-time Change Data Extraction 
•Data Transformation/Conversion 
•Immediate Data Load 
•Real-time Fault Handling 
•ETL Supported 
•Minimizing Use of DB Engine and System Resource by Using Transaction Log in DBMS 
•Extract Data without Any Effect on Existing On- line Tasks 
•Copy Data at High-Speed through Real-time Data Extraction 
제품 이미지 
TeraNRTTM 
•Change Data Extraction(NRT) Scheduling 
•NRT Monitoring 
•Column Change Impact Analysis 
•Automatic Script Creation 
•Automatic Verification 
•Increase of Data Consistency and Work Efficiency by NRT Extraction for Large Volume Data 
•Time and Cost Reduction by Parallel Process and Large Volume Data Processing at High-Speed 
•Securing Work Credibility by Using Automatic Verification and Missing Data Revision 
The No. 1 Data Integration Solution in Korea to Extract, Transform, and Load the Data from the Source DB to the Target System in Various Environments 
The Real-time Data Processing Solution to Transfer the Transaction Log for Change Data Information in DBMS to the Target System by Choosing CDC(Change Data Capture) Method 
The Near Real-time(NRT) Data Processing Solution to Capture Change Data in Source DBMS to Load the Transformed and Changed Data to the Target System by NRT Method 
8 
Main Features 
Main Features 
Benefit 
Benefit
© 2014 DataStreams Corp. All Rights Reserved. 
Data Integration Product Line – 2/2 
TeraStreamTM for Hadoop 
•Can Use Outstanding Features of TeraStreamTM 
•Can Use Variable Function of TeraStreamTM 
•Big Data Analytics by Supporting Various Tools 
•Map/Reduce by GUI 
•Distributed Processing Integrated Monitoring for Multiple Nodes 
•Easier and More Convenient Big Data Processing by Combining Strengths between TeraStreamTM and the Hadoop 
•Manpower Reduction for Development through the Interfaces of TeraStreamTM and Hadoop 
제품 이미지 
TeraTDSTM 
•RSC Encryption as Readable Form 
•Maintaining Attributes of Original Data 
•Guarantee Uniqueness of Data 
•High-Speed Data Extraction and Conversion 
•Support Easy Conversion Patterns by Users 
•Blocking Private Information Leakage from Test and Development DB 
•Provision with Easiness for Data Structure in Test and Development DB 
•Preparation for Relevant Laws and IT Audits 
The Test Data Management Solution to Convert the Original Data with Core Private Customer Information in a Readable Form over Maintaining Relations among the Tables in Development or Test System Structure Phases 
The Specialized Solution for Big Data Processing by Combining TeraStreamTM and the Hadoop Eco-system to Provide Distributed File System and Platform 
9 
Main Features 
Benefit 
Main Features 
Benefit 
FACTTM 
•Extracting Data from Various Commercial DBMS 
•Append or Overwrite in Data Output 
•Extracting Fixed and Variable Data 
•Extracting CLOB / BLOB Data 
•Providing Development Conveniences through Various Syntax of ANSI-SQL 
•Easy Loading of Extracted SAMFILE by Automatically Creating DBMS Load Script File 
•Preventing Problems with Data Extraction by Modifying Carriage Return Value 
The High-Speed Data Extraction Solution to Apply the Extracted Data in External System ETL, Batch, Data Conversion by Extracting the Source Data from Various RDBMS at High-Speed 
Main Features 
Benefit
© 2014 DataStreams Corp. All Rights Reserved. 
Data Governance Product Line – 1/2 
제품 이미지 
MetaStreamTM 
•Data Standardization Management 
•Modeling Management 
•Support Various DBMS 
•Transfer Management 
•Data Quality Management 
•Redundancy Prevention of Meta-data 
•Meta-data Application Increase for Business Development 
•Traceability Improvement for the Past Data 
•Redundancy Prevention of R&R 
•Time Reduction for Impact Analysis and Accuracy Improvement 
제품 이미지 
QualityStreamTM 
•Profiling Analysis Target DB 
•Management and Analysis of Business Rules 
•Integrated Management of Quality Diagnosis 
•Error Data Verification and Analysis 
•Six Sigma Based Statistical Management 
•Definition of Data Quality Management Process 
•Consistent Reduction of Error Rate 
•Consistent Management of Core Target for Data Quality Management 
•Structural Data Quality Management through Integrated Repository 
제품 이미지 
ImpactStreamTM 
•Program AS-IS Analysis 
•DB AS-IS Analysis 
•Program/DB Impact Analysis 
•Creation Out-put / Provision with Excel Report 
•Efficient Integration for Business Applications 
•Maintenance of IT Application Development and Management Information 
•Improving Outsourcing Control 
•Improving Comprehension for Whole Application Road Map 
•Development Productivity Improvement and Maintenance Cost Reduction 
The Meta-data Management Solution to Realize the Structural Data Quality Management by Meta-data Life Cycle Management Features such as Meta-data Extraction, Standardization Management, Mapping Management, Standardization Observation, and Provision with Statistics 
The Data Quality Management Solution to Secure Consistent Data Quality Level by Result Analysis of Quality Diagnosis through Accessing the Analysis Target Data 
The Solution for Impact Analysis due to Application Changes by Building Application Knowledge Information Structure to Improve Comprehension and Readability from Application Source Codes Which Are Changed and Managed Continuously 
10 
Main Features 
Benefit 
Main Features 
Main Features 
Benefit 
Benefit
© 2014 DataStreams Corp. All Rights Reserved. 
Data Governance Product Line – 2/2 
제품 이미지 
MasterStreamTM 
•Enterprise Master Data Governance 
•Master Data Quality Management 
•Pre/Post Verification by Set-up and Application of Business Rules 
•Business Process Control by Data Value 
•Efficiency Improvement by Sharing Core Data 
•Quick Decision Making by Statistical Analysis 
•Maintenance Cost Reduction by Operation System Improvement 
•Scalability Improvement for System Installation/Changes 
Q-TrackTM 
•Extract Mapping Information among Tables 
•Extract Legacy Table Lay-out Information of Original System 
•Extract Meta Information of TeraStreamTM 
•Extract Program Information (Stored Procedure or Shell with SQL) 
•Extract Undiscovered Data Problems 
•Accurate and Meaningful Analysis of Various ETL Environments 
•Provision with Communication Base between Parties 
The Master Data Management(MDM) Solution to Create and Manage Core Enterprise Data, Master Data, Continuously for Each Business Process Flow 
DQ Appliance 
•Verification Target Extraction / Change Management 
•Initial Data Verification 
•NRT Data Verification 
•Meta-data Management 
•Profiling / Rule / Verification Results / Modification Management 
•Data Governance Based Data Quality Management 
•Securing NRT Data Quality Verification Basis 
•Provision with Outstanding Performance and Management Conveniences through the Optimized Dedicated Server for DB and File Processing 
•Quality Verification Performance Improvement 
Takes Near Real-time(NRT) Quality Verification Based on High-Speed Extraction and Verification, and Change Data, by Building Data Store in the Separate Machine to Get Over Limitation in Time and Space of Existing Data Quality Management Systems 
The Solution to Provide Intuitive, Accurate, and Meaningful Lineage Information by Flow Visualization Regarding Creation, Transformation, and Use of Data from Operating System to Data Warehouse/Unit System 
11 
Main Features 
Benefit 
Main Features 
Main Features 
Benefit 
Benefit
© 2014 DataStreams Corp. All Rights Reserved. 
Real-Time Ecosystem 
Hadoop Ecosystem 
Analytic & Visualization Ecosystem 
Real-Time Collection (Storm) 
CEP 
(Esper) 
Collecting and Searching Social Data 
(Splunk, Marklogic) 
In- Memory NoSql (Redis) 
Document NoSql (Mongo) 
Data Store 
SQL 
(RDBMS) 
Distributed Database (Hbase) 
Workflow Management (Oozie) 
Data Collection (chukwa, Flume, Scribe) 
Structured Data Collection (Sqoop, hiho) 
Serialization 
(Avro) 
Real-time SQL Query (Impala, Tajo) 
Data Anaysis (Pig, Hive) 
Data Mining 
(Mahout) 
Metadata Management 
(HCatalog) 
Distributed Data Process (MapReduce) 
Document 
(D3) 
Visualization 
Graphic (Giraph, 
Gremlin) 
Analytic Engine 
(Complex Network Theory) 
Analytic Store 
(Greenplum, Exa, 
Nettiza) 
DS Solution 
Collecting Social (Social Cube) 
Structured Data Collection 
(FACT) 
Meta-data Management (MetaStream) 
Collection (DeltaStream) 
Tera 
Stream 
for 
Hadoop (HBASE) 
TeraStream for Hadoop (MAP/REDUCE) 
TeraStream for Hadoop (HIVE) 
TeraStream for 
Hadoop (NOSQL) 
TeraStream for Hadoop (RDBS) 
Unstructured Data Collection (Social Cube) 
Big Data Platform 
12
© 2014 DataStreams Corp. All Rights Reserved. 
Business System 
Real Time Monitoring 
ODS 
DM 
•Performance Tuning 
•Emergency Management 
DW 
Data Governance Architecture (m-DOSA) 
Load 
Clean 
Transform 
Extract 
Meta-data Management 
Data Quality Management 
Impact Analysis 
Master 
Data Management 
Enterprise Data Management Solutions 
13 
Multidimensional 
Mart 
Subject Mart 
Periodic Mart 
Summary Table 
Data Lineage Monitor 
Distribute 
Security 
Security
© 2014 DataStreams Corp. All Rights Reserved. 
Data Governance Solution Framework 
14
© 2014 DataStreams Corp. All Rights Reserved. 
Business Service & Consulting References
© 2014 DataStreams Corp. All Rights Reserved. 
Introduction 
Effect 
• Performed Information System DW Batch within 6 Hours, and Simultaneously Loaded the Data to DM 
• Support Bilateral ETL between Current ODW, New ODW, and EDW 
• Used in Whole Range of Data Processing and Data Integration by Applying Various Business Logic 
Customer’s Issue 
Implementation 
System Architecture 
Extract 
24-Hour 
DB SPLIT 
Profit Management 
RISK 
KPI 
EUC 
Current ODW 
ORACLE 
EDW 
Sybase ASIQ 
New ODW 
ORACLE 
Bilateral ETL 
ETL 
ETL 
ETL 
Batch 
Batch 
Batch 
FTP Transfer After Extract 
Accounting 
System 
Information System 
DW 
DM & Sub System 
Opened in Feb, 2005 
Batch 
• Needed Conversion of Mainframe and IMS HDB 
• Bilateral ETL Processing 
• On Time Processing of Large Volume Data Batch 
• Converted Mainframe Data to UNIX Data - Size of Data : 1~1.5TB 
• Support Korean Conversion and Cleansing 
• Perform Accounting System Batch, EDW ETL, Information System ETL and Batch - Daily Changed Data : 200GB (From Accounting System to New ODW) / Converted within 1½ Hours 
• Data Transformation and Delivery from DM & Sub System and New ETL Information System to Various Mart through EDW Server(Data HUB) (EUC, KPI, RISK Management, IFRS, BaselⅡ) 
Business Service & Consulting 
16
© 2014 DataStreams Corp. All Rights Reserved. 
17 
Electronic Voucher DW Performance Improvement for Ministry of Health and Welfare 
 Reduced Statistics Provision Time : Within A Few Seconds~Minutes from 1~6 Days 
 Shortened Information Process : Simplifying Data Processing Procedures and Direct Acquisition of Information by Person in Charge 
 Direct Inquiry and Editing through DW Construction 
 Data Credibility Improvement by Consistent Data Provision 
Introduction 
Effect 
• Deploying Work from Original Source to ODS 
• Constructing and Modeling ODS, DW, and DM for Statistics Analysis 
• Voucher System (DB1 → New DW Server) 
• Server - OS : AIX 5.3 (AS-IS and TO-BE are same) - CPU : Power5, 2.1GHz, 6-core , IBM P Series 
- MEM : 12 GB 
- H/W : 1TB 
• Easy Maintenance by Simple Logic 
•Lack of Data Consistency 
•Lack of Data Instantaneity 
•Impossible to Check Illegal Approval 
•Disharmony between Administrators and Work Sites due to a Lack of Statistical Data Credibility 
Implementation 
Customer’s Issue 
System Architecture 
Electronic Voucher Statistics Analysis System DW Server 
Original System 
Source DB 
(Oracle) 
- ODS Data Conversion 
- Update/Insert to DW 
- 1:1 Mapping Load 
- Work Deploy 
- Loading to ODS 
IBM P Series 
Voucher 
Service 
Illegal 
Approval 
Pregnancy 
Birth 
Manpower of 
Providers 
Target DB 
(Oracle) 
FACT™ 
ODS 
DM 
DW 
ETL 
- Data Transformation for ODS and DW 
- Update/Insert to DW 
ETL 
ETL 
Business Service & Consulting 
Introduction 
Effect
© 2014 DataStreams Corp. All Rights Reserved. 
Business Service & Consulting 
18 
Information System Construction for Korea National Open University 
• Academic Affairs Statistics Automation : Needed Personnel and Time Reduction of Manual Labor for Approx. 15 Days 
• Academic Affairs Computerization : Needed Administration Statistics System 
Customer’s issue 
System Architecture 
University 
Administration 
Graduate 
School 
Academic Affairs 
Electronic 
Approval 
Tutoring 
Administrative 
Work 
Institute 
of 
Lifelong 
Education 
Academic 
Affairs 
Graduation 
Grade 
Registration 
Admission 
Information System Server 
TeraStream™ 
ODS 
DW 
DM 
Data Extract/Load 
ETL Control 
Data Extract/Load 
Data Extract/Load 
• Extracting and Transforming Data for Academic and Administrative Affairs from Oracle DB to Provide Structured/Unstructured Statistics Reports After Loading the Data to ODS, DW, and DM of Oracle DB 
• Academic Affairs Data(Early/Changed) : 20GB/1GB, Total 4 Hours for Change/Load 
• Administration Work Data(Early/Changed) : 5GB/100MB, Total 1 Hours for Change/Load 
Implementation 
Tool 
Total Working Time 
Academic Affairs Statistics System 
• Within 4 Hours 
University Administration Statistics System 
• Within 1 Hour 
Reduced Working Time from 15 Days to 4 Hours 
 Reduced Time for Academic Affairs such as Grade, Registration, and Admission, from 15 Days to 4 Hours 
 The Statistics of University Administration Is Finished within 1 Hour 
Introduction 
Effect 
Data Extract/Load 
Data 
Extract/ Transform/Load
© 2014 DataStreams Corp. All Rights Reserved. 
Hadoop System of Price Index for NSO 
Customer’s Issue 
Implementation 
 Difference between Actual Price and Announced Monthly Price Index 
 Problem of Huge Volume Data Processing 
 Lack of Professional Engineers for Hadoop System 
 Introduced Hadoop System - Fast Process for Huge Volume of Data such as Internal Data and External(SNS) Data by Hadoop System 
 Introduced TeraStream™ for Hadoop - Easy Use of Hadoop System through Convenient Features Focused on Developers 
- Increase of Development Conveniences by Easy GUI 
System Architecture 
Data 
Collection 
Data Storage Area 
Data 
Analysis 
External Data 
Internal Data 
Data Analysis 
DBMS 
TeraStream™ for Hadoop 
Unstructured / Structured Data HDFS 
TeraStream™ for Hadoop Engine 
19 
National Statistical Office(NSO) Used to Announce Market Price Index based on Monthly Price Research for 250~450 Goods in the Market Before. However, There Were Differences between Actual Price and Announced Monthly Price Index because of Time Gap. NSO Solved This Difference Problem by Performing the Project by Introducing Hadoop Eco-system. 
Business Service & Consulting
© 2014 DataStreams Corp. All Rights Reserved. 
Support e-Document Filter for Filtering Various File Formats such as MS Office, HWP, PDF, e-mail Formats(EML, PST, OST), and DB File 
Save Extract Results with Text Format for Search Raw Data in Local File by Document Filter in Local Files System or Distributed File System 
Create Index by Considering Parallel Indexing Implementation in Hadoop Distributed File System and Morphology of Korean Language through Open Search Library Which Supports Hadoop or Open Search Engines 
Provide Various File Search Features based on Hadoop System 
Implemented Extendable Data Analysis System for Integrated Search for e-mail Files, HWP Format Files, and MS Office Files based on Hadoop System 
Business Service & Consulting 
Big Data Based Data Analysis System for National Tax Service 
20
© 2014 DataStreams Corp. All Rights Reserved. 
Shifted Paradigm of Railway Use to Collect Travel Information with Each Railway Station as the Center and to Provide Potential Customers with the Customized Information 
Potential Customers 
[Real Time Travel Info] 
[Travel Analysis∙Satisfaction Predict] 
[Travel Destination Rank] 
(Monthly/Weekly/Daily) 
① 
Interest(Recognition) 
Attention(Planning) 
Target Information Retrieval 
and Using 
② 
Access and Information Retrieval 
Information Provider(Device) 
Big Data Platform 
③ 
Collecting Potential Customers’ Information 
Convergence Data Focused on Railway 
Provision with Customized Information 
① 
Information of Device Is Composed of Easily Accessible and Usable Living Contents 
Induce Purchase of Goods in the Process of Information Use by Potential Customers 
① Access Information Provider Device and Search 
- Access through Mobile and Internet 
- Required Information Retrieval(e.g. Travel Prediction) 
② Information Use by Potential Customers 
- [Before Event Occurrence] Provide Travel Destination Rank and Issues Focused on Railway → Causing Interest 
- [Event Occurrence] Travel Destination Analysis and Predict 
Customized Satisfaction Info → Inducing to Planning and Target Information Retrieval 
※ Event : Vacation, Long Holiday, Business Travel, etc. 
③ Information Collection and Feedback based on Big Data Platform 
- Collection and Analysis of Customers’ Issue, Reaction, Type, and Behaviors and Feedback by Aggregation of Various Data Type 
Business Service & Consulting 
Big Data Consulting for Korail 
21
DataStreams : Corporate Overview

More Related Content

PDF
TeraStream - Data Integration/Migration/ETL/Batch Tool
DataStreams
 
PDF
Slides: Using Analytics and Fraud Management To Increase Revenues and Differe...
DATAVERSITY
 
PDF
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Denodo
 
PPTX
RFT for Business Intelligence and Data Strategy
SustainableEnergyAut
 
PPTX
The New Enterprise Data Platform
Krishnan Parasuraman
 
PDF
What Is My Enterprise Data Maturity 2021
DATAVERSITY
 
PDF
Speed Matters - Intelligent Strategies to Accelerate Data-Driven Decisions
DATAVERSITY
 
PDF
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
DATAVERSITY
 
TeraStream - Data Integration/Migration/ETL/Batch Tool
DataStreams
 
Slides: Using Analytics and Fraud Management To Increase Revenues and Differe...
DATAVERSITY
 
Accelerating Data-Driven Enterprise Transformation in Banking, Financial Serv...
Denodo
 
RFT for Business Intelligence and Data Strategy
SustainableEnergyAut
 
The New Enterprise Data Platform
Krishnan Parasuraman
 
What Is My Enterprise Data Maturity 2021
DATAVERSITY
 
Speed Matters - Intelligent Strategies to Accelerate Data-Driven Decisions
DATAVERSITY
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
DATAVERSITY
 

What's hot (20)

PDF
Why You Need to Govern Big Data
IBM Analytics
 
PDF
ADV Slides: Modern Analytic Data Architecture Maturity Modeling
DATAVERSITY
 
PDF
Simplifying Big Data Analytics for the Business
Teradata Aster
 
PDF
ADV Slides: The Data Needed to Evolve an Enterprise Artificial Intelligence S...
DATAVERSITY
 
PDF
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...
DATAVERSITY
 
PPTX
McKinsey Big Data Overview
optier
 
PDF
Telco Big Data 2012 Highlights
Alan Quayle
 
PDF
ADV Slides: Data Curation for Artificial Intelligence Strategies
DATAVERSITY
 
PDF
Platforming the Major Analytic Use Cases for Modern Engineering
DATAVERSITY
 
PDF
Modern Integrated Data Environment - Whitepaper | Qubole
Vasu S
 
PPTX
Digital Transformation: How to Build an Analytics-Driven Culture
Alexander Loth
 
PPTX
How Data is Driving AI Innovation
Matt Turner
 
PDF
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
DATAVERSITY
 
PDF
Telco Big Data Workshop Sample
Alan Quayle
 
PDF
Slides: Data Governance Reality Check
DATAVERSITY
 
PDF
DataEd Slides: Leveraging Data Management Technologies
DATAVERSITY
 
PDF
Slides: Accelerate and Assure the Adoption of Cloud Data Platforms Using Inte...
DATAVERSITY
 
PDF
DAS Slides: Graph Databases — Practical Use Cases
DATAVERSITY
 
PPTX
Integrate Big Data into Your Organization with Informatica and Perficient
Perficient, Inc.
 
PDF
Big Data World Forum
bigdatawf
 
Why You Need to Govern Big Data
IBM Analytics
 
ADV Slides: Modern Analytic Data Architecture Maturity Modeling
DATAVERSITY
 
Simplifying Big Data Analytics for the Business
Teradata Aster
 
ADV Slides: The Data Needed to Evolve an Enterprise Artificial Intelligence S...
DATAVERSITY
 
How to Crunch Petabytes with Hadoop and Big Data Using InfoSphere BigInsights...
DATAVERSITY
 
McKinsey Big Data Overview
optier
 
Telco Big Data 2012 Highlights
Alan Quayle
 
ADV Slides: Data Curation for Artificial Intelligence Strategies
DATAVERSITY
 
Platforming the Major Analytic Use Cases for Modern Engineering
DATAVERSITY
 
Modern Integrated Data Environment - Whitepaper | Qubole
Vasu S
 
Digital Transformation: How to Build an Analytics-Driven Culture
Alexander Loth
 
How Data is Driving AI Innovation
Matt Turner
 
Webinar: Decoding the Mystery - How to Know if You Need a Data Catalog, a Dat...
DATAVERSITY
 
Telco Big Data Workshop Sample
Alan Quayle
 
Slides: Data Governance Reality Check
DATAVERSITY
 
DataEd Slides: Leveraging Data Management Technologies
DATAVERSITY
 
Slides: Accelerate and Assure the Adoption of Cloud Data Platforms Using Inte...
DATAVERSITY
 
DAS Slides: Graph Databases — Practical Use Cases
DATAVERSITY
 
Integrate Big Data into Your Organization with Informatica and Perficient
Perficient, Inc.
 
Big Data World Forum
bigdatawf
 
Ad

Viewers also liked (20)

PPTX
(주)데이터스트림즈 발표자료: 실시간 IoT 기반의 빅데이터 분석 서비스
DataStreams
 
PPTX
Tera stream for datastreams
치민 최
 
PPTX
건설공업
sooo02
 
PDF
PEB Steel Brochure (Korean version_Dec 2014)
PEB Steel Buildings
 
PDF
07 강태욱
KOS-ROBOT
 
PPTX
Augmented reality (Access virtual world)
chirag thakkar
 
PPTX
취업준비자들의 블루오션 3D 프린팅
Chiwon Song
 
PDF
빅데이터미래전략세미나발표자료 빅데이터기술현황및전망-황승구-20120410
Peter Woo
 
PPTX
증강현실과 가상현실
Changhwan Yoon
 
PPTX
Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Lear...
DataWorks Summit/Hadoop Summit
 
PDF
TeraStream for ETL
치민 최
 
PDF
ICT 상용화 06 스마트 기기 부품, 외장 제조 개론
Junsang Dong
 
PDF
공공분야에서의 드론 활용 - 진정회 대표 (엑스드론)
brainerymakers
 
PPTX
3D 프린팅 연구/리서치. 3D printing research korean version
Jiwoo Seo
 
PDF
드론 분야의 기술 트렌드 및 발전 방향 - 김태호 실장
brainerymakers
 
PDF
2016 kcd 세미나 발표자료. 구글포토로 바라본 인공지능과 머신러닝
JungGeun Lee
 
PPTX
가상현실(Vr)과 증강현실(ar)
Heesung Youn
 
PPTX
[IGC2015] 스마일게이트 김용하-VR? AR? 차세대 게임의 기반 기술
강 민우
 
PDF
IoT 제품 리뷰 - 약 20개의 IoT 제품 리뷰
봉조 김
 
PDF
[Issue&trend] ict와 3 d 프린팅에 의한 제3차 산업혁명
Soomin(Simon) Shim
 
(주)데이터스트림즈 발표자료: 실시간 IoT 기반의 빅데이터 분석 서비스
DataStreams
 
Tera stream for datastreams
치민 최
 
건설공업
sooo02
 
PEB Steel Brochure (Korean version_Dec 2014)
PEB Steel Buildings
 
07 강태욱
KOS-ROBOT
 
Augmented reality (Access virtual world)
chirag thakkar
 
취업준비자들의 블루오션 3D 프린팅
Chiwon Song
 
빅데이터미래전략세미나발표자료 빅데이터기술현황및전망-황승구-20120410
Peter Woo
 
증강현실과 가상현실
Changhwan Yoon
 
Large Scale Health Telemetry and Analytics with MQTT, Hadoop and Machine Lear...
DataWorks Summit/Hadoop Summit
 
TeraStream for ETL
치민 최
 
ICT 상용화 06 스마트 기기 부품, 외장 제조 개론
Junsang Dong
 
공공분야에서의 드론 활용 - 진정회 대표 (엑스드론)
brainerymakers
 
3D 프린팅 연구/리서치. 3D printing research korean version
Jiwoo Seo
 
드론 분야의 기술 트렌드 및 발전 방향 - 김태호 실장
brainerymakers
 
2016 kcd 세미나 발표자료. 구글포토로 바라본 인공지능과 머신러닝
JungGeun Lee
 
가상현실(Vr)과 증강현실(ar)
Heesung Youn
 
[IGC2015] 스마일게이트 김용하-VR? AR? 차세대 게임의 기반 기술
강 민우
 
IoT 제품 리뷰 - 약 20개의 IoT 제품 리뷰
봉조 김
 
[Issue&trend] ict와 3 d 프린팅에 의한 제3차 산업혁명
Soomin(Simon) Shim
 
Ad

Similar to DataStreams : Corporate Overview (20)

PDF
Tera stream ETL
Nguyễn Nguyễn Mạnh Trung
 
PPTX
Hortonworks Oracle Big Data Integration
Hortonworks
 
PDF
Capgemini Leap Data Transformation Framework with Cloudera
Capgemini
 
PPTX
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Cloudera, Inc.
 
PPTX
The Big Data Ecosystem for Financial Services
DataStax
 
PPTX
Datamensional Business Intelligence and Data Services
Datamensional
 
PDF
EVOLVING PATTERNS IN BIG DATA - NEIL AVERY
Big Data Week
 
PPTX
Smarter Management for Your Data Growth
RainStor
 
PPTX
Navigating the World of User Data Management and Data Discovery
DataWorks Summit/Hadoop Summit
 
PDF
2018 Big Data Trends: Liberate, Integrate, and Trust Your Data
Precisely
 
PDF
DAMA Big Data & The Cloud 2012-01-19
Robert J. Abate, CBIP, CDMP
 
PDF
IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...
In-Memory Computing Summit
 
PDF
The Maturity Model: Taking the Growing Pains Out of Hadoop
Inside Analysis
 
PDF
Monetizing Big Data with Streaming Analytics for Telecoms Service Providers
Cubic Corporation
 
PDF
Rev_3 Components of a Data Warehouse
Ryan Andhavarapu
 
PDF
Driving Business Transformation with Real-Time Analytics Using Apache Kafka a...
confluent
 
PDF
Teradata Aster: Big Data Discovery Made Easy
TIBCO Spotfire
 
PDF
Modern Data Challenges require Modern Graph Technology
Neo4j
 
PPTX
Skillwise Big Data part 2
Skillwise Group
 
PDF
Integrating Structure and Analytics with Unstructured Data
DATAVERSITY
 
Hortonworks Oracle Big Data Integration
Hortonworks
 
Capgemini Leap Data Transformation Framework with Cloudera
Capgemini
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Cloudera, Inc.
 
The Big Data Ecosystem for Financial Services
DataStax
 
Datamensional Business Intelligence and Data Services
Datamensional
 
EVOLVING PATTERNS IN BIG DATA - NEIL AVERY
Big Data Week
 
Smarter Management for Your Data Growth
RainStor
 
Navigating the World of User Data Management and Data Discovery
DataWorks Summit/Hadoop Summit
 
2018 Big Data Trends: Liberate, Integrate, and Trust Your Data
Precisely
 
DAMA Big Data & The Cloud 2012-01-19
Robert J. Abate, CBIP, CDMP
 
IMCSummit 2015 - Day 2 Developer Track - The Internet of Analytics – Discover...
In-Memory Computing Summit
 
The Maturity Model: Taking the Growing Pains Out of Hadoop
Inside Analysis
 
Monetizing Big Data with Streaming Analytics for Telecoms Service Providers
Cubic Corporation
 
Rev_3 Components of a Data Warehouse
Ryan Andhavarapu
 
Driving Business Transformation with Real-Time Analytics Using Apache Kafka a...
confluent
 
Teradata Aster: Big Data Discovery Made Easy
TIBCO Spotfire
 
Modern Data Challenges require Modern Graph Technology
Neo4j
 
Skillwise Big Data part 2
Skillwise Group
 
Integrating Structure and Analytics with Unstructured Data
DATAVERSITY
 

Recently uploaded (20)

PPTX
INFO8116 - Week 10 - Slides.pptx data analutics
guddipatel10
 
PPTX
M1-T1.pptxM1-T1.pptxM1-T1.pptxM1-T1.pptx
teodoroferiarevanojr
 
PDF
D9110.pdfdsfvsdfvsdfvsdfvfvfsvfsvffsdfvsdfvsd
minhn6673
 
PDF
Key_Statistical_Techniques_in_Analytics_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PPTX
Databricks-DE-Associate Certification Questions-june-2024.pptx
pedelli41
 
PPTX
Introduction-to-Python-Programming-Language (1).pptx
dhyeysapariya
 
PPTX
Data-Users-in-Database-Management-Systems (1).pptx
dharmik832021
 
PDF
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
PPTX
Introduction to Data Analytics and Data Science
KavithaCIT
 
PDF
SUMMER INTERNSHIP REPORT[1] (AutoRecovered) (6) (1).pdf
pandeydiksha814
 
PPTX
Web dev -ppt that helps us understand web technology
shubhragoyal12
 
PDF
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
PPTX
Probability systematic sampling methods.pptx
PrakashRajput19
 
PDF
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
PPTX
Fuzzy_Membership_Functions_Presentation.pptx
pythoncrazy2024
 
PPTX
Introduction to computer chapter one 2017.pptx
mensunmarley
 
PPTX
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 
PDF
Practical Measurement Systems Analysis (Gage R&R) for design
Rob Schubert
 
PDF
Classifcation using Machine Learning and deep learning
bhaveshagrawal35
 
PPT
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 
INFO8116 - Week 10 - Slides.pptx data analutics
guddipatel10
 
M1-T1.pptxM1-T1.pptxM1-T1.pptxM1-T1.pptx
teodoroferiarevanojr
 
D9110.pdfdsfvsdfvsdfvsdfvfvfsvfsvffsdfvsdfvsd
minhn6673
 
Key_Statistical_Techniques_in_Analytics_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
Databricks-DE-Associate Certification Questions-june-2024.pptx
pedelli41
 
Introduction-to-Python-Programming-Language (1).pptx
dhyeysapariya
 
Data-Users-in-Database-Management-Systems (1).pptx
dharmik832021
 
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
Introduction to Data Analytics and Data Science
KavithaCIT
 
SUMMER INTERNSHIP REPORT[1] (AutoRecovered) (6) (1).pdf
pandeydiksha814
 
Web dev -ppt that helps us understand web technology
shubhragoyal12
 
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
Probability systematic sampling methods.pptx
PrakashRajput19
 
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
Fuzzy_Membership_Functions_Presentation.pptx
pythoncrazy2024
 
Introduction to computer chapter one 2017.pptx
mensunmarley
 
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 
Practical Measurement Systems Analysis (Gage R&R) for design
Rob Schubert
 
Classifcation using Machine Learning and deep learning
bhaveshagrawal35
 
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 

DataStreams : Corporate Overview

  • 1. © 2014 DataStreams Corp. All Rights Reserved. DataStreams Corp. "Always find the better value of your data" www.datastreams.co.kr Corporate Overview
  • 2. © 2014 DataStreams Corp. All Rights Reserved. Name Data Streams Corp. CEO Mr. Young-sang Lee Business Area Data Integration Solutions Development and Sales Data Quality Solutions Development and Sales Data Warehouse / BI / FDS / Forensic / Audit Consulting and Construction Big Data Analytic Consulting and Platform Construction Data Governance Platform Development/Consulting/Sales Data Migration Consulting and Construction Large Volume Data Batch Processing Improvement Consulting and Construction Data Standardization and Quality Management Consulting and System Construction Data Architecture Consulting Office Address HQ Chungho-nais B/D 6F, 28 Saimdang-ro, Seocho-gu, Seoul, Korea R&D U-Spacemall #2 B-601, 670 Daewangpangyo-ro, Bundang-gu, Seongnam, Korea China Office Room 1216, 12th Floor, Intersection of Hopson Kirin Society Building 2, Wang Jing Fu Tong West Street, Wangjing, Chaoyang District, Bejing Contact Tel) +82-2-3473-9077 / Fax) +82-2-3473-9084 Investor JAFCO ASIA Capital USD 2M Sales Amount USD 19M (2013) Established Sep 19, 2001 Employees 125 1 Company Profile
  • 3. © 2014 DataStreams Corp. All Rights Reserved. Company History 2 2012 •Established R&D Center in Pangyo Techno Valley •Released Social Cube for SNS Data Analytics •Participated in Original Technology Development Project for Next Generation Memory Based Big Data Analytics and Management 2009 •TeraStream™, Selected as Standard Data Integration Tool by The Korea Federation of Banks •Selected as Contractor for Building Resource Management Data Standardization and Meta-data Management System by Ministry of National Defense •Released DeltaStream™, QualityStream™, and ImpactStream™ 2007 •Awarded for Excellent Venture Company by Deputy Prime Minister •MetaStream™, Awarded for Digital Business Innovation by SMBA •Released TeraStream™ Version 2.0 •JAFCO, Japan invested 4 million USD 2005 •Mr. Young-sang Lee, CEO, Was Awarded a Grand Prize for Korea Digital Competitiveness •TeraStream™ Won New Technology Certification from Ministry of Knowledge Economy •Released MetaStream™ Version 1.0 •Acquired KDB Solution Co., Ltd., Korea’s First Meta-data Management Solution Company •TeraStream™ Version 1.4, Acquired GS(Good Software) Certification 2003 •KEB selected TeraStream™ as Standard Batch/ETL Solution for Next Generation Banking System •First Worldwide Sales Contract of FACT™ •Presented FACT™ to Oracle Open World 2003 in San Francisco, USA •Released MetaStream™ Version 2.7 •CEO, Mr. Young-sang Lee, Was Elected as a Chairman of KOSEA(Korea Software Enterprise Association) •Released TeraStream™ Version 3.2 2010 •TeraStream™ Version 2.2, Acquired GS Certification •Changed Company Name to DataStreams Corp. •Contracted with Intellectual Property Office for Enterprise Data Quality Management 2008 •Awarded for Top Private Company for Population and Housing Census by Deputy Prime Minister •TeraStream™, Selected as Standard ETL Tool by Ministry of Government Administration and Home Affairs 2006 •Selected as Technically Innovated Company of 2004 by SMBA •Selected as Technically Innovated Company of 2004 by Small & Medium Business Administration(SMBA) •TeraStream™, Selected for a Next Generation Banking Data Migration Tool by Shinhan Bank 2004 •Registered TeraStream™ as a Trademark •Released TeraStream™ Designer Version 1.1 •First TeraStream™ V.1.1 Contract with National Statistics Office 2002 2011 •Awarded Prime Minister Citation for SW Achievement •Selected as an ATC(Advanced Technology Center) by Ministry of Knowledge Economy •Established China Office in Beijing 2013 •TeraStream™ for Hadoop, Selected as Base Solution for Building Government-Wide Big Data Infrastructure •Acquired Patent for Readable Data Encryption and Decryption •CEO, Mr. Young-sang Lee, Was Awarded Digital Management Innovation Prize •Joined Int’l SOFT China in Beijing • MetaStream™ Version 3.0, Acquired GS Certification • Launched DQ Appliance 2001 •Released TeraStream™ Beta Version •Innovative Data Solutions Corp., Was Established 2014 •TeraStream™, Was Awarded 2014 Korea Software Award by Ministrer of Science, ICT and Future •DataStreams Is Listed on KONEX(Korea New Exchange)
  • 4. © 2014 DataStreams Corp. All Rights Reserved. Organizational Structure Present Condition of Engineering Employees 96 Consultants Data Governance 36 Data Integration & Migration Big Data Management DW & BI SNS Analytics Engineers Meta Data & Data Quality Management Solution 60 Data Integration & Migration Solution Big Data Management Platform Number Of Employees Total Consultants/Developers Management/Sales 125 96 29 Engineering Lev. Total Consultants Engineers Total 96 36 60 Professional 27 18 9 Qualified 26 11 15 Intermediate 19 3 16 Beginning 24 4 20 3 •Government Offices •Banking •Manufacturing Business •Logistics/Services •Planning Products •Presales & Consulting •DW/BI •Big Data •SNS •QA(Quality Assurance) •Marketing •Overseas Sales •Overseas Corp. •HR/General Affairs •Financial Admin. •Knowledge Mgmt. •Sales Support •PI •DI Technical Support •DQ Technical Support •DI •RTI •DQ •UI CEO Auditor Counselor Sales Div. PPC Div. Business Consulting Div. Global Business Div. Management Support Div. Technical Service Div. R&D Center
  • 5. © 2014 DataStreams Corp. All Rights Reserved. Business Area Data Governance • Data Governance Architecture • Data Quality Management • Meta Data Management • Master Data Management • Data Quality Appliance Data Integration • High Performance ETL ∙ Batch • Data Integration • Deferred(Near Real Time) • CDC, Real Time Data Transition • High Speed Data Extraction • High Speed Data Sort • Data Integration with Hadoop • Data Integration with Grid • Test Data Management Big Data • Big Data Platform with Hadoop • Big Data Analysis & Visualization • Structured & Unstructured Data Analysis • SNS Data Analysis ∙ Monitoring Consulting • ISP & Big Data Consulting • Fraud Detection System(FDS) Consulting • DW ∙ CRM ∙ BI Consulting • Data Integration & Migration Consulting • Data Standardization ∙ Quality Management ∙ Architecture Consulting • Master Data Management Consulting • Data Lineage Management Consulting DW/BI • Building DW ∙ CRM ∙ BI • QPI Methodology • Fraud Detection System(FDS) • Information System Planning(ISP) • Alternative Trading System Consulting • Transaction Cost Data Analysis Framework • Transaction Cost Data Analysis Framework & Consulting(TCA) • Financial Analysis Services DataStreams Is a Company Which Has Expertise in Data Processing and Analysis to Provide Total Data Management Services in Data Integration and Quality Management. Data Lineage Management • Data Lineage Analysis Platform • Visualization for Data Lineage • Relative Tool, Program & Script Language Analysis • Table Column Search & Monitoring 4
  • 6. © 2014 DataStreams Corp. All Rights Reserved. Market Recognition & Share 5 60% 25% 10% 5% 80% 15% 5% Korean Market Share for ETL Solution Korean Market Share for Data Migration (Banking Industry) DataStreams Corp. IBM Informatica Others * The market share for ETL solutions is self-researched in 2013. 55% 30% 15% Korean Market Share for Metadata Management W Company (Korean) G Company (Korean) ETL Data Migration Metadata Management No. 1 Total Data Management Technologies in Korea Vendor Report of Magic Quadrant for Data Integration and Data Quality Tools (2013) Reference URL : http://www.citia.co.uk/ Mentioned DataStreams’ capabilities of offering wide range of data integration products through ETL, CDC and near-real time technologies.
  • 7. © 2014 DataStreams Corp. All Rights Reserved. Private Banking / Finance Companies Public Finance Companies Government & Public / Educational Institutions Enterprises Major Domestic Customers 6
  • 8. © 2014 DataStreams Corp. All Rights Reserved. DataStreams Is Exporting & Expanding… Columbia Banco Colpatria Bogota City Government Credibanco China Kookmin Bank Hana Bank USA Merklenet, Inc. CSC Consulting Bisys Comcast Merkle Data Tech USA Airweb Sungard American Airlines Highmark, Inc. Mexico Sodexhopass Procesar Peru Banco Ripley Chile Banco Estado de Chile Australia National Wealth Management (MLC/NAB) Spain Procecard Tecnocom Telefonica Soluciones ITnow! Germany Accenture GmbH India Reliance Industry Indonesia Excelcom Aviva Telkomsel Hana Bank Global Customers 7 Japan with Reliable Business Partners U.S.A. BellaDati(US) Vietnam HIPT FPT IS Lac Viet QTSC EU BellaDati(CZE) Gibkie(RUS) IMBI(Europe, N. Africa) China China Mobile Fuchen Telecom & Banking / Insurance Companies
  • 9. © 2014 DataStreams Corp. All Rights Reserved. Data Integration Product Line – 1/2 제품 이미지 TeraStreamTM •Data Extraction at High-Speed •Data Transformation/Conversion •Data Load •Meta Data Management •Real-time Monitoring •Shortening Processing Time •Dropping System Load by Improved Processing •Shortening Development Time by Integrated GUI •Cost Reduction for Data Integration Process •Securing Data Consistency with TeraNRTTM •Efficient Use of Resource by Distributed Processing Main Features Benefit 제품 이미지 DeltaStreamTM •Real-time Change Data Extraction •Data Transformation/Conversion •Immediate Data Load •Real-time Fault Handling •ETL Supported •Minimizing Use of DB Engine and System Resource by Using Transaction Log in DBMS •Extract Data without Any Effect on Existing On- line Tasks •Copy Data at High-Speed through Real-time Data Extraction 제품 이미지 TeraNRTTM •Change Data Extraction(NRT) Scheduling •NRT Monitoring •Column Change Impact Analysis •Automatic Script Creation •Automatic Verification •Increase of Data Consistency and Work Efficiency by NRT Extraction for Large Volume Data •Time and Cost Reduction by Parallel Process and Large Volume Data Processing at High-Speed •Securing Work Credibility by Using Automatic Verification and Missing Data Revision The No. 1 Data Integration Solution in Korea to Extract, Transform, and Load the Data from the Source DB to the Target System in Various Environments The Real-time Data Processing Solution to Transfer the Transaction Log for Change Data Information in DBMS to the Target System by Choosing CDC(Change Data Capture) Method The Near Real-time(NRT) Data Processing Solution to Capture Change Data in Source DBMS to Load the Transformed and Changed Data to the Target System by NRT Method 8 Main Features Main Features Benefit Benefit
  • 10. © 2014 DataStreams Corp. All Rights Reserved. Data Integration Product Line – 2/2 TeraStreamTM for Hadoop •Can Use Outstanding Features of TeraStreamTM •Can Use Variable Function of TeraStreamTM •Big Data Analytics by Supporting Various Tools •Map/Reduce by GUI •Distributed Processing Integrated Monitoring for Multiple Nodes •Easier and More Convenient Big Data Processing by Combining Strengths between TeraStreamTM and the Hadoop •Manpower Reduction for Development through the Interfaces of TeraStreamTM and Hadoop 제품 이미지 TeraTDSTM •RSC Encryption as Readable Form •Maintaining Attributes of Original Data •Guarantee Uniqueness of Data •High-Speed Data Extraction and Conversion •Support Easy Conversion Patterns by Users •Blocking Private Information Leakage from Test and Development DB •Provision with Easiness for Data Structure in Test and Development DB •Preparation for Relevant Laws and IT Audits The Test Data Management Solution to Convert the Original Data with Core Private Customer Information in a Readable Form over Maintaining Relations among the Tables in Development or Test System Structure Phases The Specialized Solution for Big Data Processing by Combining TeraStreamTM and the Hadoop Eco-system to Provide Distributed File System and Platform 9 Main Features Benefit Main Features Benefit FACTTM •Extracting Data from Various Commercial DBMS •Append or Overwrite in Data Output •Extracting Fixed and Variable Data •Extracting CLOB / BLOB Data •Providing Development Conveniences through Various Syntax of ANSI-SQL •Easy Loading of Extracted SAMFILE by Automatically Creating DBMS Load Script File •Preventing Problems with Data Extraction by Modifying Carriage Return Value The High-Speed Data Extraction Solution to Apply the Extracted Data in External System ETL, Batch, Data Conversion by Extracting the Source Data from Various RDBMS at High-Speed Main Features Benefit
  • 11. © 2014 DataStreams Corp. All Rights Reserved. Data Governance Product Line – 1/2 제품 이미지 MetaStreamTM •Data Standardization Management •Modeling Management •Support Various DBMS •Transfer Management •Data Quality Management •Redundancy Prevention of Meta-data •Meta-data Application Increase for Business Development •Traceability Improvement for the Past Data •Redundancy Prevention of R&R •Time Reduction for Impact Analysis and Accuracy Improvement 제품 이미지 QualityStreamTM •Profiling Analysis Target DB •Management and Analysis of Business Rules •Integrated Management of Quality Diagnosis •Error Data Verification and Analysis •Six Sigma Based Statistical Management •Definition of Data Quality Management Process •Consistent Reduction of Error Rate •Consistent Management of Core Target for Data Quality Management •Structural Data Quality Management through Integrated Repository 제품 이미지 ImpactStreamTM •Program AS-IS Analysis •DB AS-IS Analysis •Program/DB Impact Analysis •Creation Out-put / Provision with Excel Report •Efficient Integration for Business Applications •Maintenance of IT Application Development and Management Information •Improving Outsourcing Control •Improving Comprehension for Whole Application Road Map •Development Productivity Improvement and Maintenance Cost Reduction The Meta-data Management Solution to Realize the Structural Data Quality Management by Meta-data Life Cycle Management Features such as Meta-data Extraction, Standardization Management, Mapping Management, Standardization Observation, and Provision with Statistics The Data Quality Management Solution to Secure Consistent Data Quality Level by Result Analysis of Quality Diagnosis through Accessing the Analysis Target Data The Solution for Impact Analysis due to Application Changes by Building Application Knowledge Information Structure to Improve Comprehension and Readability from Application Source Codes Which Are Changed and Managed Continuously 10 Main Features Benefit Main Features Main Features Benefit Benefit
  • 12. © 2014 DataStreams Corp. All Rights Reserved. Data Governance Product Line – 2/2 제품 이미지 MasterStreamTM •Enterprise Master Data Governance •Master Data Quality Management •Pre/Post Verification by Set-up and Application of Business Rules •Business Process Control by Data Value •Efficiency Improvement by Sharing Core Data •Quick Decision Making by Statistical Analysis •Maintenance Cost Reduction by Operation System Improvement •Scalability Improvement for System Installation/Changes Q-TrackTM •Extract Mapping Information among Tables •Extract Legacy Table Lay-out Information of Original System •Extract Meta Information of TeraStreamTM •Extract Program Information (Stored Procedure or Shell with SQL) •Extract Undiscovered Data Problems •Accurate and Meaningful Analysis of Various ETL Environments •Provision with Communication Base between Parties The Master Data Management(MDM) Solution to Create and Manage Core Enterprise Data, Master Data, Continuously for Each Business Process Flow DQ Appliance •Verification Target Extraction / Change Management •Initial Data Verification •NRT Data Verification •Meta-data Management •Profiling / Rule / Verification Results / Modification Management •Data Governance Based Data Quality Management •Securing NRT Data Quality Verification Basis •Provision with Outstanding Performance and Management Conveniences through the Optimized Dedicated Server for DB and File Processing •Quality Verification Performance Improvement Takes Near Real-time(NRT) Quality Verification Based on High-Speed Extraction and Verification, and Change Data, by Building Data Store in the Separate Machine to Get Over Limitation in Time and Space of Existing Data Quality Management Systems The Solution to Provide Intuitive, Accurate, and Meaningful Lineage Information by Flow Visualization Regarding Creation, Transformation, and Use of Data from Operating System to Data Warehouse/Unit System 11 Main Features Benefit Main Features Main Features Benefit Benefit
  • 13. © 2014 DataStreams Corp. All Rights Reserved. Real-Time Ecosystem Hadoop Ecosystem Analytic & Visualization Ecosystem Real-Time Collection (Storm) CEP (Esper) Collecting and Searching Social Data (Splunk, Marklogic) In- Memory NoSql (Redis) Document NoSql (Mongo) Data Store SQL (RDBMS) Distributed Database (Hbase) Workflow Management (Oozie) Data Collection (chukwa, Flume, Scribe) Structured Data Collection (Sqoop, hiho) Serialization (Avro) Real-time SQL Query (Impala, Tajo) Data Anaysis (Pig, Hive) Data Mining (Mahout) Metadata Management (HCatalog) Distributed Data Process (MapReduce) Document (D3) Visualization Graphic (Giraph, Gremlin) Analytic Engine (Complex Network Theory) Analytic Store (Greenplum, Exa, Nettiza) DS Solution Collecting Social (Social Cube) Structured Data Collection (FACT) Meta-data Management (MetaStream) Collection (DeltaStream) Tera Stream for Hadoop (HBASE) TeraStream for Hadoop (MAP/REDUCE) TeraStream for Hadoop (HIVE) TeraStream for Hadoop (NOSQL) TeraStream for Hadoop (RDBS) Unstructured Data Collection (Social Cube) Big Data Platform 12
  • 14. © 2014 DataStreams Corp. All Rights Reserved. Business System Real Time Monitoring ODS DM •Performance Tuning •Emergency Management DW Data Governance Architecture (m-DOSA) Load Clean Transform Extract Meta-data Management Data Quality Management Impact Analysis Master Data Management Enterprise Data Management Solutions 13 Multidimensional Mart Subject Mart Periodic Mart Summary Table Data Lineage Monitor Distribute Security Security
  • 15. © 2014 DataStreams Corp. All Rights Reserved. Data Governance Solution Framework 14
  • 16. © 2014 DataStreams Corp. All Rights Reserved. Business Service & Consulting References
  • 17. © 2014 DataStreams Corp. All Rights Reserved. Introduction Effect • Performed Information System DW Batch within 6 Hours, and Simultaneously Loaded the Data to DM • Support Bilateral ETL between Current ODW, New ODW, and EDW • Used in Whole Range of Data Processing and Data Integration by Applying Various Business Logic Customer’s Issue Implementation System Architecture Extract 24-Hour DB SPLIT Profit Management RISK KPI EUC Current ODW ORACLE EDW Sybase ASIQ New ODW ORACLE Bilateral ETL ETL ETL ETL Batch Batch Batch FTP Transfer After Extract Accounting System Information System DW DM & Sub System Opened in Feb, 2005 Batch • Needed Conversion of Mainframe and IMS HDB • Bilateral ETL Processing • On Time Processing of Large Volume Data Batch • Converted Mainframe Data to UNIX Data - Size of Data : 1~1.5TB • Support Korean Conversion and Cleansing • Perform Accounting System Batch, EDW ETL, Information System ETL and Batch - Daily Changed Data : 200GB (From Accounting System to New ODW) / Converted within 1½ Hours • Data Transformation and Delivery from DM & Sub System and New ETL Information System to Various Mart through EDW Server(Data HUB) (EUC, KPI, RISK Management, IFRS, BaselⅡ) Business Service & Consulting 16
  • 18. © 2014 DataStreams Corp. All Rights Reserved. 17 Electronic Voucher DW Performance Improvement for Ministry of Health and Welfare  Reduced Statistics Provision Time : Within A Few Seconds~Minutes from 1~6 Days  Shortened Information Process : Simplifying Data Processing Procedures and Direct Acquisition of Information by Person in Charge  Direct Inquiry and Editing through DW Construction  Data Credibility Improvement by Consistent Data Provision Introduction Effect • Deploying Work from Original Source to ODS • Constructing and Modeling ODS, DW, and DM for Statistics Analysis • Voucher System (DB1 → New DW Server) • Server - OS : AIX 5.3 (AS-IS and TO-BE are same) - CPU : Power5, 2.1GHz, 6-core , IBM P Series - MEM : 12 GB - H/W : 1TB • Easy Maintenance by Simple Logic •Lack of Data Consistency •Lack of Data Instantaneity •Impossible to Check Illegal Approval •Disharmony between Administrators and Work Sites due to a Lack of Statistical Data Credibility Implementation Customer’s Issue System Architecture Electronic Voucher Statistics Analysis System DW Server Original System Source DB (Oracle) - ODS Data Conversion - Update/Insert to DW - 1:1 Mapping Load - Work Deploy - Loading to ODS IBM P Series Voucher Service Illegal Approval Pregnancy Birth Manpower of Providers Target DB (Oracle) FACT™ ODS DM DW ETL - Data Transformation for ODS and DW - Update/Insert to DW ETL ETL Business Service & Consulting Introduction Effect
  • 19. © 2014 DataStreams Corp. All Rights Reserved. Business Service & Consulting 18 Information System Construction for Korea National Open University • Academic Affairs Statistics Automation : Needed Personnel and Time Reduction of Manual Labor for Approx. 15 Days • Academic Affairs Computerization : Needed Administration Statistics System Customer’s issue System Architecture University Administration Graduate School Academic Affairs Electronic Approval Tutoring Administrative Work Institute of Lifelong Education Academic Affairs Graduation Grade Registration Admission Information System Server TeraStream™ ODS DW DM Data Extract/Load ETL Control Data Extract/Load Data Extract/Load • Extracting and Transforming Data for Academic and Administrative Affairs from Oracle DB to Provide Structured/Unstructured Statistics Reports After Loading the Data to ODS, DW, and DM of Oracle DB • Academic Affairs Data(Early/Changed) : 20GB/1GB, Total 4 Hours for Change/Load • Administration Work Data(Early/Changed) : 5GB/100MB, Total 1 Hours for Change/Load Implementation Tool Total Working Time Academic Affairs Statistics System • Within 4 Hours University Administration Statistics System • Within 1 Hour Reduced Working Time from 15 Days to 4 Hours  Reduced Time for Academic Affairs such as Grade, Registration, and Admission, from 15 Days to 4 Hours  The Statistics of University Administration Is Finished within 1 Hour Introduction Effect Data Extract/Load Data Extract/ Transform/Load
  • 20. © 2014 DataStreams Corp. All Rights Reserved. Hadoop System of Price Index for NSO Customer’s Issue Implementation  Difference between Actual Price and Announced Monthly Price Index  Problem of Huge Volume Data Processing  Lack of Professional Engineers for Hadoop System  Introduced Hadoop System - Fast Process for Huge Volume of Data such as Internal Data and External(SNS) Data by Hadoop System  Introduced TeraStream™ for Hadoop - Easy Use of Hadoop System through Convenient Features Focused on Developers - Increase of Development Conveniences by Easy GUI System Architecture Data Collection Data Storage Area Data Analysis External Data Internal Data Data Analysis DBMS TeraStream™ for Hadoop Unstructured / Structured Data HDFS TeraStream™ for Hadoop Engine 19 National Statistical Office(NSO) Used to Announce Market Price Index based on Monthly Price Research for 250~450 Goods in the Market Before. However, There Were Differences between Actual Price and Announced Monthly Price Index because of Time Gap. NSO Solved This Difference Problem by Performing the Project by Introducing Hadoop Eco-system. Business Service & Consulting
  • 21. © 2014 DataStreams Corp. All Rights Reserved. Support e-Document Filter for Filtering Various File Formats such as MS Office, HWP, PDF, e-mail Formats(EML, PST, OST), and DB File Save Extract Results with Text Format for Search Raw Data in Local File by Document Filter in Local Files System or Distributed File System Create Index by Considering Parallel Indexing Implementation in Hadoop Distributed File System and Morphology of Korean Language through Open Search Library Which Supports Hadoop or Open Search Engines Provide Various File Search Features based on Hadoop System Implemented Extendable Data Analysis System for Integrated Search for e-mail Files, HWP Format Files, and MS Office Files based on Hadoop System Business Service & Consulting Big Data Based Data Analysis System for National Tax Service 20
  • 22. © 2014 DataStreams Corp. All Rights Reserved. Shifted Paradigm of Railway Use to Collect Travel Information with Each Railway Station as the Center and to Provide Potential Customers with the Customized Information Potential Customers [Real Time Travel Info] [Travel Analysis∙Satisfaction Predict] [Travel Destination Rank] (Monthly/Weekly/Daily) ① Interest(Recognition) Attention(Planning) Target Information Retrieval and Using ② Access and Information Retrieval Information Provider(Device) Big Data Platform ③ Collecting Potential Customers’ Information Convergence Data Focused on Railway Provision with Customized Information ① Information of Device Is Composed of Easily Accessible and Usable Living Contents Induce Purchase of Goods in the Process of Information Use by Potential Customers ① Access Information Provider Device and Search - Access through Mobile and Internet - Required Information Retrieval(e.g. Travel Prediction) ② Information Use by Potential Customers - [Before Event Occurrence] Provide Travel Destination Rank and Issues Focused on Railway → Causing Interest - [Event Occurrence] Travel Destination Analysis and Predict Customized Satisfaction Info → Inducing to Planning and Target Information Retrieval ※ Event : Vacation, Long Holiday, Business Travel, etc. ③ Information Collection and Feedback based on Big Data Platform - Collection and Analysis of Customers’ Issue, Reaction, Type, and Behaviors and Feedback by Aggregation of Various Data Type Business Service & Consulting Big Data Consulting for Korail 21