SlideShare a Scribd company logo
Q. The CDO Agenda,
how can Data Architecture help?
Phill Radley,
Chief Data Architect
26 / October / 2016
Data Management
Specialist Group
1/25
Answer
• A lot
• A bit
• Not much
The architects answer…
It depends
…..State of the business
…..State of enterprise architecture
……What’s going on externally
Do you have or need a CDO ?
AGENDA
Framing View of the Industry
Discuss the CDO role
The organisation of BT & IT Challenges
Examples of what we’re doing with data
(in the absence of a CDO)
The Long View of Big Data
Data “Bigness” =
 ( Volume, Velocity, Variety)
1990 Y2K
Mainframe (1st Platform)
1960 09
First Research cluster   Production Cluster
HAAS = Hadoop as a Service
14
Proprietary, Monolithic
Batch, Interactive
COBOL/ISAM/IDMS
Linked Record sets Client-Server Applications +
RDBMS
(2nd Platform )
OPEN ! 3GL, 4GL
PC & Servers
on premise
RELATIONAL
1606
scale out infrastructure
(3rd Platform)
Clusters, Data hub, pipelines
Mobile
Social
Big Data
Cloud
?
cost/performance
VVV crunch
What does a Chief Data Office(r) do ?
Evangelise
• Culture change to “data driven organisation”
• Self-Service Data & Analytics
Centralise ( tackle the silo problem )
“Year 1: Build the House”
“Year 2: Throw Open the doors”
Facilitate
• Educate
• Design Pattern Cookbook for the Enterprise
• Briefings – All Hands Calls, Leadership Team Mtgs, Hackathons….
• Tooling
• SKOOL on GITHUB (tool to simplify transferring tables from Oracle to Hadoop)
“Understanding the Chief Data Officer”
O’Reilly – Julie Steele of Silicon Valley Data Science
A Good CDO Role Model ?
Joy Bonaguro
City of San Francisco
data.sfgov.org
Cataloguing Data Assets
Facilitating data sharing
Building enabling infrastructure
BT Group Structure 1/Apr/2016
Customers
Chief Architects Office Enterprise Architecture
Data Architecture
For BT Group
~ 90K FTE in 61 countries, serving 180 countries
Research & Innovation
Legacy Systems Architecture in each BT Business Unit
Analytics
Data
Warehouse
ESB
CRM
Service Management
Network Management
Networks
& IT
Customers
 
• Hundreds of systems in each business unit grouped
into 3 operational areas (CRM/Service Mgt/Network Mgt)
• Data Warehouse per business unit
• Client – Server applications running on
servers in BT Data Centres (~ 35K hosts)
• Mainframe applications (in Openreach)
• Total Storage ~ 25PB
• Lots of event / time series data
– Network Alarms & Telemetry
– Netflow Traffic Events, Security events
– Call Detail Records, web clicks,
– mobile handset data (GPS, Apps, browsing..)
• Business Unit CIOs manage IT investment roadmap, each business
unit deploys a “stack release” quarterly
Field Engineers
Challenges - Complexity
Example from BT Global Services
Design for Release 17 of
Repair Systems for 1 product family
Where’s the Master Data ?
Which flows are data replication ?
Which flows are transactional ?
x 70 Similar “system stacks”
x 4 Releases / yr
Data Replication pinball
Revenue
Assurance
Customer
Details
Archive
Order
Mgt.
CRM
Billing
25K
20K 75K
10K
10K
10K
35K
Challenges – Risk & Compliance
Challenge – Agility Opportunity - Scaling
What does Data architecture do…? 1. Sort the basics
Adopt/Adapt a framework
Establish Lists(systems, data landscape….)
DAMA DMBOK.. TOGAF…
What does Data architecture do…? 2. Develop Vision


 

CRM
Hive
Meta
Store
RDBMS
Web/APP
Server
 Map 
Reduce
code

BI Tools
Tableau, Zoomdata…
(HIVE TABLE ACCESS)
HDFS
Impala
+ Sentry
Wrangling & Discovery
Data Science
Datameer, HUE…
(HDFS FILE ACCESS)
Flume
Golden
Gate
 

ERP
RDBMS
Web/APP
Server
 Map 
Reduce
code

sqoop
 

DW
RDBMS
Web/APP
Server
 Map 
Reduce
code

sqoop







1. Event Ingestion from
Networks/IT/Web servers
Collection with flume agents
landing in HDFS files 2. DB Table transfer using sqoop
(map/reduce) jobs, landing in HDFS files
Active
Directory
FILES
TABLES
snapshotCDC snapshot

Data
Scientists
SQL
analysts
business
users
What does Data architecture do…? 3A. Build the data house
• Following a presentation to the TSO Leadership team Dec 2013 an initial inovestment in
a production cluster was agreed backed by a plan to launch in Feb 2014
• 60 nodes optimised for Hadoop map/reduce deployed in BT Data Centre in Sheffield
(6TB local disks, 1:1 core:spindle ratio, 8GB for JVM per map/reduce slot
• Existing linux 3rd line team tasked with running basic (Min. Viable Product) Hadoop
Cluster as a shared service platform
BT HaaS Release 1: 60 Nodes ~ 2 PB Feb 2014 Linux 3rd Line  Hadoop Admin
What does Data architecture do…? 3B. Build the data house
HAAS Platform
Hadoop Cluster B (Openreach only)
Order form
(SharePoint)

script

email
Active
Directory
Tennant
“Project Owner”

User
admin
Standard
User Admin
Process
Hadoop
Cluster A HAASA AP 00307_12126
HIVE
HDFS
sentry
Job queue
HUE Impala
Flume
BI Server
Create
Hadoop
Features
“HAASA AP 00307_12126
Is ready for you to use”
existing
Business APP
12126 .
Oracle
DB
APP extends footprint in HaaS
http FS
Kerberos
Datameer
Analytics
Review
Board
Platform
Admin
ARB
User Access
Systems Access
Sqoop
Create
Security
Group
HAASA AP 00101_2029
Faults
4369
Orders
3531
CRM
2029
 hree existing business applications (CRM, Orders, Faults) extended into HaaS 
RDBMS
Customer
Table
RDBMS
Orders
Table
RDBMS
Faults
Table
T_CustomerHive DB
HAASA
AP 00101_2029
sqoop
V_Customer
HAASA AP 00202_3531
T_OrdersHive DB
HAASA
AP 0202_3531
sqoop
V_Orders
HAASA AP 00303_4369
T_FaultsHive DB
HAASA
AP 0303_4369
sqoop
V_Faults
Business
Data
Stewards

Business Analysts / Data Scientists

CRM

Orders

Faults
Governing Access to Data on the Platform ** WIP **
1. Browse & select data
2. Get Steward Approval
3. Create VIEWs & GRANTs
4. Recommend joins/ Views
Data Catalogue
(Million Table Meta-store)
Cloudera
“Resident”
Solution
Architect
What does Data architecture do…? 3. Educate
BT HaaS Cookbook
snip.bt.com/haascook
Design patterns to
ease project on boarding
included in “Learning Pathways”
Research & Innovation
Data Scientists
Dec 2015 3rd BT Data Science Week
(50 @ Adastral)
Business Awareness
Sep 2014
UK Hadoop User Group
(200 @ BT Centre)
IT Operations
Jan 2014
RESOPS training week
(Research + IT Ops Adastral)
Architecture
Hadoop Summit Mar 2014
(Doug Cutting- Cloudera+BT)
Big Data Data Centre of Excellence
Cardiff / Bangalore
20 designers / developers
working on > 50 opportunities & projects
published open source “skool” utility
Q & A
Phill Radley
Chief Data Architect
phillip.radley@bt.com

More Related Content

What's hot (20)

PDF
Designing Fast Data Architecture for Big Data using Logical Data Warehouse a...
Denodo
 
PDF
Denodo DataFest 2017: Outpace Your Competition with Real-Time Responses
Denodo
 
PDF
Architecture for Real-Time and Batch Big Data Analytics
Nir Rubinstein
 
PPTX
Seamless, Real-Time Data Integration with Connect
Precisely
 
PDF
Denodo DataFest 2016: Big Data Virtualization in the Cloud
Denodo
 
PDF
Apache Kafka® and the Data Mesh
ConfluentInc1
 
PPTX
Data Lakehouse, Data Mesh, and Data Fabric (r1)
James Serra
 
PPTX
Big Data Management: What's New, What's Different, and What You Need To Know
SnapLogic
 
PDF
Data platform architecture
Sudheer Kondla
 
PDF
Ibm machine learning for z os
Cuneyt Goksu
 
PPTX
Modernize & Automate Analytics Data Pipelines
Carole Gunst
 
PDF
Where does Fast Data Strategy Fit within IT Projects
Denodo
 
PPTX
Big-Data Server Farm Architecture
Jordan Chung
 
PDF
Hitachi Data Systems Hadoop Solution
Hitachi Vantara
 
PDF
Modernizing Data Management Through Metadata
MANTA
 
PDF
Data Mesh Part 4 Monolith to Mesh
Jeffrey T. Pollock
 
PDF
Architecture of Big Data Solutions
Guido Schmutz
 
PDF
Denodo DataFest 2017: Edge Computing: Collecting vs. Connecting to Streaming ...
Denodo
 
PDF
New World Hadoop Architectures (& What Problems They Really Solve) for Oracle...
Rittman Analytics
 
PDF
Presto – Today and Beyond – The Open Source SQL Engine for Querying all Data...
Dipti Borkar
 
Designing Fast Data Architecture for Big Data using Logical Data Warehouse a...
Denodo
 
Denodo DataFest 2017: Outpace Your Competition with Real-Time Responses
Denodo
 
Architecture for Real-Time and Batch Big Data Analytics
Nir Rubinstein
 
Seamless, Real-Time Data Integration with Connect
Precisely
 
Denodo DataFest 2016: Big Data Virtualization in the Cloud
Denodo
 
Apache Kafka® and the Data Mesh
ConfluentInc1
 
Data Lakehouse, Data Mesh, and Data Fabric (r1)
James Serra
 
Big Data Management: What's New, What's Different, and What You Need To Know
SnapLogic
 
Data platform architecture
Sudheer Kondla
 
Ibm machine learning for z os
Cuneyt Goksu
 
Modernize & Automate Analytics Data Pipelines
Carole Gunst
 
Where does Fast Data Strategy Fit within IT Projects
Denodo
 
Big-Data Server Farm Architecture
Jordan Chung
 
Hitachi Data Systems Hadoop Solution
Hitachi Vantara
 
Modernizing Data Management Through Metadata
MANTA
 
Data Mesh Part 4 Monolith to Mesh
Jeffrey T. Pollock
 
Architecture of Big Data Solutions
Guido Schmutz
 
Denodo DataFest 2017: Edge Computing: Collecting vs. Connecting to Streaming ...
Denodo
 
New World Hadoop Architectures (& What Problems They Really Solve) for Oracle...
Rittman Analytics
 
Presto – Today and Beyond – The Open Source SQL Engine for Querying all Data...
Dipti Borkar
 

Viewers also liked (9)

PDF
Beyond SQL: Managing Events and Relationships in Social Care
BCS Data Management Specialist Group
 
PDF
The CDO Challenge 24-11-16
BCS Data Management Specialist Group
 
PDF
Nick Keen Data governance in the Environment Agency
BCS Data Management Specialist Group
 
PPTX
Moving from 3rd Normal Form to a web enabled world 22-9-15
BCS Data Management Specialist Group
 
PDF
Nigel Turner data governance is not boring
BCS Data Management Specialist Group
 
PDF
John Stuart-Clarke - beginning the data governance journey - 8th june 2016
BCS Data Management Specialist Group
 
PDF
Michael Bironneau Data governance and the IoT
BCS Data Management Specialist Group
 
PPTX
Big Data Analytics, Dave Shuttleworth - 22-9-15
BCS Data Management Specialist Group
 
PDF
Nicola Askham Key concepts in data governance
BCS Data Management Specialist Group
 
Beyond SQL: Managing Events and Relationships in Social Care
BCS Data Management Specialist Group
 
The CDO Challenge 24-11-16
BCS Data Management Specialist Group
 
Nick Keen Data governance in the Environment Agency
BCS Data Management Specialist Group
 
Moving from 3rd Normal Form to a web enabled world 22-9-15
BCS Data Management Specialist Group
 
Nigel Turner data governance is not boring
BCS Data Management Specialist Group
 
John Stuart-Clarke - beginning the data governance journey - 8th june 2016
BCS Data Management Specialist Group
 
Michael Bironneau Data governance and the IoT
BCS Data Management Specialist Group
 
Big Data Analytics, Dave Shuttleworth - 22-9-15
BCS Data Management Specialist Group
 
Nicola Askham Key concepts in data governance
BCS Data Management Specialist Group
 
Ad

Similar to The CDO Agenda: how data architecture can help? (20)

PPTX
DA_01_Intro.pptx
Alok Mohapatra
 
PDF
Modern data warehouse
Stephen Alex
 
PDF
Modern data warehouse
Stephen Alex
 
PDF
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
Hortonworks
 
PDF
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Innovative Management Services
 
PDF
Modern Data Architecture
Mark Hewitt
 
PDF
Building a Modern Data Architecture with Enterprise Hadoop
Slim Baltagi
 
PPTX
Big Data Practice_Planning_steps_RK
Rajesh Jayarman
 
PDF
Aioug big data and hadoop
AiougVizagChapter
 
PDF
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
email2jl
 
PDF
Creating a Next-Generation Big Data Architecture
Perficient, Inc.
 
PDF
Data Architecture Strategies: Data Architecture for Digital Transformation
DATAVERSITY
 
PDF
The Modern Data Architecture for Advanced Business Intelligence with Hortonwo...
Hortonworks
 
PDF
Hadoop at the Center: The Next Generation of Hadoop
Adam Muise
 
PDF
Big Data Paris - A Modern Enterprise Architecture
MongoDB
 
PDF
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
Hortonworks
 
PDF
Data Structuring PowerPoint Presentation Slides
SlideTeam
 
PDF
Big Data Analytics: Architectural Perspective
Sumit Kalra
 
PDF
Data-Ed Online Webinar: Data Architecture Requirements
DATAVERSITY
 
PDF
Architecting Modern Data Platforms
Ankit Rathi
 
DA_01_Intro.pptx
Alok Mohapatra
 
Modern data warehouse
Stephen Alex
 
Modern data warehouse
Stephen Alex
 
The Value of the Modern Data Architecture with Apache Hadoop and Teradata
Hortonworks
 
Open-BDA Hadoop Summit 2014 - Mr. Slim Baltagi (Building a Modern Data Archit...
Innovative Management Services
 
Modern Data Architecture
Mark Hewitt
 
Building a Modern Data Architecture with Enterprise Hadoop
Slim Baltagi
 
Big Data Practice_Planning_steps_RK
Rajesh Jayarman
 
Aioug big data and hadoop
AiougVizagChapter
 
Creatinganext generationbigdataarchitecture-141204150317-conversion-gate02
email2jl
 
Creating a Next-Generation Big Data Architecture
Perficient, Inc.
 
Data Architecture Strategies: Data Architecture for Digital Transformation
DATAVERSITY
 
The Modern Data Architecture for Advanced Business Intelligence with Hortonwo...
Hortonworks
 
Hadoop at the Center: The Next Generation of Hadoop
Adam Muise
 
Big Data Paris - A Modern Enterprise Architecture
MongoDB
 
Modern Data Architecture for a Data Lake with Informatica and Hortonworks Dat...
Hortonworks
 
Data Structuring PowerPoint Presentation Slides
SlideTeam
 
Big Data Analytics: Architectural Perspective
Sumit Kalra
 
Data-Ed Online Webinar: Data Architecture Requirements
DATAVERSITY
 
Architecting Modern Data Platforms
Ankit Rathi
 
Ad

Recently uploaded (20)

PPTX
Solution+Architecture+Review+-+Sample.pptx
manuvratsingh1
 
PDF
apidays Munich 2025 - Making Sense of AI-Ready APIs in a Buzzword World, Andr...
apidays
 
PPTX
Introduction-to-Python-Programming-Language (1).pptx
dhyeysapariya
 
PDF
apidays Munich 2025 - The Double Life of the API Product Manager, Emmanuel Pa...
apidays
 
PPT
introdution to python with a very little difficulty
HUZAIFABINABDULLAH
 
PPTX
short term internship project on Data visualization
JMJCollegeComputerde
 
PPTX
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
PPTX
IP_Journal_Articles_2025IP_Journal_Articles_2025
mishell212144
 
PPTX
Data-Users-in-Database-Management-Systems (1).pptx
dharmik832021
 
PPTX
MR and reffffffvvvvvvvfversal_083605.pptx
manjeshjain
 
PPT
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 
PDF
McKinsey - Global Energy Perspective 2023_11.pdf
niyudha
 
PPTX
Fluvial_Civilizations_Presentation (1).pptx
alisslovemendoza7
 
PDF
apidays Munich 2025 - The Physics of Requirement Sciences Through Application...
apidays
 
PPTX
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
PDF
apidays Munich 2025 - Integrate Your APIs into the New AI Marketplace, Senthi...
apidays
 
PDF
Classifcation using Machine Learning and deep learning
bhaveshagrawal35
 
PPTX
Insurance-Analytics-Branch-Dashboard (1).pptx
trivenisapate02
 
PPTX
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 
PDF
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
Solution+Architecture+Review+-+Sample.pptx
manuvratsingh1
 
apidays Munich 2025 - Making Sense of AI-Ready APIs in a Buzzword World, Andr...
apidays
 
Introduction-to-Python-Programming-Language (1).pptx
dhyeysapariya
 
apidays Munich 2025 - The Double Life of the API Product Manager, Emmanuel Pa...
apidays
 
introdution to python with a very little difficulty
HUZAIFABINABDULLAH
 
short term internship project on Data visualization
JMJCollegeComputerde
 
Multiscale Segmentation of Survey Respondents: Seeing the Trees and the Fores...
Sione Palu
 
IP_Journal_Articles_2025IP_Journal_Articles_2025
mishell212144
 
Data-Users-in-Database-Management-Systems (1).pptx
dharmik832021
 
MR and reffffffvvvvvvvfversal_083605.pptx
manjeshjain
 
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 
McKinsey - Global Energy Perspective 2023_11.pdf
niyudha
 
Fluvial_Civilizations_Presentation (1).pptx
alisslovemendoza7
 
apidays Munich 2025 - The Physics of Requirement Sciences Through Application...
apidays
 
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
apidays Munich 2025 - Integrate Your APIs into the New AI Marketplace, Senthi...
apidays
 
Classifcation using Machine Learning and deep learning
bhaveshagrawal35
 
Insurance-Analytics-Branch-Dashboard (1).pptx
trivenisapate02
 
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 

The CDO Agenda: how data architecture can help?

  • 1. Q. The CDO Agenda, how can Data Architecture help? Phill Radley, Chief Data Architect 26 / October / 2016 Data Management Specialist Group 1/25
  • 2. Answer • A lot • A bit • Not much
  • 3. The architects answer… It depends …..State of the business …..State of enterprise architecture ……What’s going on externally Do you have or need a CDO ?
  • 4. AGENDA Framing View of the Industry Discuss the CDO role The organisation of BT & IT Challenges Examples of what we’re doing with data (in the absence of a CDO)
  • 5. The Long View of Big Data Data “Bigness” =  ( Volume, Velocity, Variety) 1990 Y2K Mainframe (1st Platform) 1960 09 First Research cluster   Production Cluster HAAS = Hadoop as a Service 14 Proprietary, Monolithic Batch, Interactive COBOL/ISAM/IDMS Linked Record sets Client-Server Applications + RDBMS (2nd Platform ) OPEN ! 3GL, 4GL PC & Servers on premise RELATIONAL 1606 scale out infrastructure (3rd Platform) Clusters, Data hub, pipelines Mobile Social Big Data Cloud ? cost/performance VVV crunch
  • 6. What does a Chief Data Office(r) do ? Evangelise • Culture change to “data driven organisation” • Self-Service Data & Analytics Centralise ( tackle the silo problem ) “Year 1: Build the House” “Year 2: Throw Open the doors” Facilitate • Educate • Design Pattern Cookbook for the Enterprise • Briefings – All Hands Calls, Leadership Team Mtgs, Hackathons…. • Tooling • SKOOL on GITHUB (tool to simplify transferring tables from Oracle to Hadoop) “Understanding the Chief Data Officer” O’Reilly – Julie Steele of Silicon Valley Data Science
  • 7. A Good CDO Role Model ? Joy Bonaguro City of San Francisco data.sfgov.org Cataloguing Data Assets Facilitating data sharing Building enabling infrastructure
  • 8. BT Group Structure 1/Apr/2016 Customers Chief Architects Office Enterprise Architecture Data Architecture For BT Group ~ 90K FTE in 61 countries, serving 180 countries Research & Innovation
  • 9. Legacy Systems Architecture in each BT Business Unit Analytics Data Warehouse ESB CRM Service Management Network Management Networks & IT Customers   • Hundreds of systems in each business unit grouped into 3 operational areas (CRM/Service Mgt/Network Mgt) • Data Warehouse per business unit • Client – Server applications running on servers in BT Data Centres (~ 35K hosts) • Mainframe applications (in Openreach) • Total Storage ~ 25PB • Lots of event / time series data – Network Alarms & Telemetry – Netflow Traffic Events, Security events – Call Detail Records, web clicks, – mobile handset data (GPS, Apps, browsing..) • Business Unit CIOs manage IT investment roadmap, each business unit deploys a “stack release” quarterly Field Engineers
  • 10. Challenges - Complexity Example from BT Global Services Design for Release 17 of Repair Systems for 1 product family Where’s the Master Data ? Which flows are data replication ? Which flows are transactional ? x 70 Similar “system stacks” x 4 Releases / yr
  • 12. Challenges – Risk & Compliance
  • 13. Challenge – Agility Opportunity - Scaling
  • 14. What does Data architecture do…? 1. Sort the basics Adopt/Adapt a framework Establish Lists(systems, data landscape….) DAMA DMBOK.. TOGAF…
  • 15. What does Data architecture do…? 2. Develop Vision      CRM Hive Meta Store RDBMS Web/APP Server  Map  Reduce code  BI Tools Tableau, Zoomdata… (HIVE TABLE ACCESS) HDFS Impala + Sentry Wrangling & Discovery Data Science Datameer, HUE… (HDFS FILE ACCESS) Flume Golden Gate    ERP RDBMS Web/APP Server  Map  Reduce code  sqoop    DW RDBMS Web/APP Server  Map  Reduce code  sqoop        1. Event Ingestion from Networks/IT/Web servers Collection with flume agents landing in HDFS files 2. DB Table transfer using sqoop (map/reduce) jobs, landing in HDFS files Active Directory FILES TABLES snapshotCDC snapshot  Data Scientists SQL analysts business users
  • 16. What does Data architecture do…? 3A. Build the data house • Following a presentation to the TSO Leadership team Dec 2013 an initial inovestment in a production cluster was agreed backed by a plan to launch in Feb 2014 • 60 nodes optimised for Hadoop map/reduce deployed in BT Data Centre in Sheffield (6TB local disks, 1:1 core:spindle ratio, 8GB for JVM per map/reduce slot • Existing linux 3rd line team tasked with running basic (Min. Viable Product) Hadoop Cluster as a shared service platform BT HaaS Release 1: 60 Nodes ~ 2 PB Feb 2014 Linux 3rd Line  Hadoop Admin
  • 17. What does Data architecture do…? 3B. Build the data house HAAS Platform Hadoop Cluster B (Openreach only) Order form (SharePoint)  script  email Active Directory Tennant “Project Owner”  User admin Standard User Admin Process Hadoop Cluster A HAASA AP 00307_12126 HIVE HDFS sentry Job queue HUE Impala Flume BI Server Create Hadoop Features “HAASA AP 00307_12126 Is ready for you to use” existing Business APP 12126 . Oracle DB APP extends footprint in HaaS http FS Kerberos Datameer Analytics Review Board Platform Admin ARB User Access Systems Access Sqoop Create Security Group
  • 18. HAASA AP 00101_2029 Faults 4369 Orders 3531 CRM 2029  hree existing business applications (CRM, Orders, Faults) extended into HaaS  RDBMS Customer Table RDBMS Orders Table RDBMS Faults Table T_CustomerHive DB HAASA AP 00101_2029 sqoop V_Customer HAASA AP 00202_3531 T_OrdersHive DB HAASA AP 0202_3531 sqoop V_Orders HAASA AP 00303_4369 T_FaultsHive DB HAASA AP 0303_4369 sqoop V_Faults Business Data Stewards  Business Analysts / Data Scientists  CRM  Orders  Faults Governing Access to Data on the Platform ** WIP ** 1. Browse & select data 2. Get Steward Approval 3. Create VIEWs & GRANTs 4. Recommend joins/ Views Data Catalogue (Million Table Meta-store)
  • 19. Cloudera “Resident” Solution Architect What does Data architecture do…? 3. Educate BT HaaS Cookbook snip.bt.com/haascook Design patterns to ease project on boarding included in “Learning Pathways” Research & Innovation Data Scientists Dec 2015 3rd BT Data Science Week (50 @ Adastral) Business Awareness Sep 2014 UK Hadoop User Group (200 @ BT Centre) IT Operations Jan 2014 RESOPS training week (Research + IT Ops Adastral) Architecture Hadoop Summit Mar 2014 (Doug Cutting- Cloudera+BT) Big Data Data Centre of Excellence Cardiff / Bangalore 20 designers / developers working on > 50 opportunities & projects published open source “skool” utility
  • 20. Q & A Phill Radley Chief Data Architect [email protected]