SlideShare a Scribd company logo
HTAP Queries & Data Fabrics
Atif Rahman
@mantaq10
7th December, 2018
PostgreSQL Down Under
Melbourne, Australia
The agenda
OLTP vs OLAP vs HTAP
The Problem Statement
Data Fabrics
Key PostgreSQL Features
Foreign Data Wrappers
Distributed Cache
Background
Patterns
Components
OLTP vs OLAP vs HTAP
OLTP
OLAP
HTAP
Smaller
transactions
Lots of
them
Lots of
Updates
ACID (Transactional)
CRUD (Commands)
complex
queries
Large
working sets
Bulk loads &
offloads
INDEXING
AGGREGATIONS
Mixed
workloads
on the same
system
Analytics on
‘inflight’
transactional
data
Addresses
resource
contention
FEDERATION
SYNCHRONISATION
Hybrid Transactional / Analytical Processing
Data Organisation
Row wise Writes
OLTP
Column Wise Reads
OLAP
OLTP OLAP
Analytics
Latency
Data
Redundancy
Single platform for
multimodel HTAP
HTAP
HTAP Queries
Data & Application Integration
Data Integration
(ETL/ELT)
Application Integration
(ESB / API)
Data Virtualisation
(DV)
Low Fidelity View
Integration Type Physical Movement and
Consolidation
Synchronization and
Propagation
Abstraction, Virtual
Consolidation, Federation
Purpose Database to Database Application to Application Database to Application
Agility* Weeks, Months Minutes, Hours Hours, Days
Repository Warehouse / Lake Transactional System Semantic Layer
Run Time* Typically Scheduled Event Driven Typically OnDemand
Data Warehouse vs Data Lakes
Data Warehouse Data Lake
• Schema on write
• (early binding)
• BI and analysts
• Arguably better governed.
• MPP / SMP databases
• Schema on read
• (Late binding)
• Data scientists
• Arguably more flexible
• MapReduce et al.
HTAP Queries
Data Warehouse vs Data Lakes
Data Warehouse Data Lake
Metro Uber
HTAP Queries
HTAP Queries
The Problems
• ACID
• Atomicity
• Concurrency
• Isolation
• Durability
• BASE
• Basic Availability
• Soft States
• Eventual Consistency
• CAP Theorem (Consistency vs Availability vs Partitions)
• CQRS (Command Query Response Segregation)
[Distributed Cache]
Metrics &
Analytics
Complex
Alerts
Scheduled
Feeds
Adhoc
Queries
Detail
Data
Logical Unified
Data Model
Subject Area
Logical Data
Models
Enterprise Data Fabric Cluster
Unified queries
Business &
Exceptions Rules
Normalisation and
Standardisation Rules
Top Down Data
Quality (and profiling)
Connectors (JDBC,
APIs, ODBC, etc)
Schedulers and
Refresh Jobs Rules
Query Planners &
Optimiser(s)
Data Fabric Architecture
• Schema Store
• Distributed Cache
• Query Federation
• Query Optimisation
• Semantic Normalisation
Foreign Data Wrappers
One database to rule them all,
One database to find them,
One database to bring them all,
And in a wrapper bind them.
Foreign Data Wrappers
• Uses the standard compliant SQL/MED
• *Data Type Translations (SQL, NoSQL etc)
• *Push Down Predicates
• WHERE and ORDER BY are propagated
• Required COLUMNS
• *Supports CRUD
• *Two Way Joins
• Import Foreign Schemas
*May vary based on specific wrapper
(Reflections) the PostgreSQL way
TABLE_ALPHA
A
Database
Cluster
Shared
Buffer
Pool TABLE_ALPHA
A
BEGIN;
INSERT INTO
TABLE_ALPHA
VALUES (‘B’);
COMMIT;
TABLE_ALPHA
A B
Dirty
Page(s)
Without WAL Logs
OS Cache
Use WAL Logs for Cache Synchronisation
TABLE_ALPHA
A
TABLE_ALPHA
A
TABLE_ALPHA
A B
Dirty
Page(s)
BEGIN;
INSERT INTO
TABLE_ALPHA
VALUES (‘B’);
COMMIT;
WAL
Checkpoints
write back to
disk
pg_prewarm()
Loads OS and
buffer cache(s)
Distributed Cache with PostgreSQL
Disk
Cache
Disk
Cache
FDW
Cache
FDW
Cache
Distributed PG Cluster with
Federated queries loaded
directed to Cache
BI APP DS MSG
Disk
Distributed Cache
Disk FDW FDW
Distributed PG Cluster
integrated With Distributed
Cache Technologies
BI APP DS MSG
JDBC, pgmemcache
Apache Ignite
Dremio
Terracotta
memcached
Some Technologies
Key PostgreSQL Features for Fabrics
Schema Store
Distributed Cache
Query Federation
Query Optimisation
Semantic Normalisation
Dirty Pages
pg_prewarm()
pgmemcache
Persistence WAL
Database
CTEs,
Least Cost Routing
ORM + FRM
Foreign Data Wrappers
Key Takeaways
• Cloud migrations
• Jurisdictions (privacy and availability zones)
• Computing at the edge of the network.
• Data Virtualisation
• Rights to be Forgotten (GDPR)
• Query Lineage & Audit across ALL data
• Areas for future development.
HTAP OLAP OLTP

More Related Content

PDF
How Netflix Tunes EC2 Instances for Performance
Brendan Gregg
 
PPTX
Spark etl
Imran Rashid
 
PDF
Multiple Sites and Disaster Recovery with Ceph: Andrew Hatfield, Red Hat
OpenStack
 
PDF
Nosql data models
Viet-Trung TRAN
 
PPTX
How to Actually Tune Your Spark Jobs So They Work
Ilya Ganelin
 
PPTX
Optimizing Apache Spark SQL Joins
Databricks
 
PPTX
Analyzing 1.2 Million Network Packets per Second in Real-time
DataWorks Summit
 
PDF
Apache Spark Data Source V2 with Wenchen Fan and Gengliang Wang
Databricks
 
How Netflix Tunes EC2 Instances for Performance
Brendan Gregg
 
Spark etl
Imran Rashid
 
Multiple Sites and Disaster Recovery with Ceph: Andrew Hatfield, Red Hat
OpenStack
 
Nosql data models
Viet-Trung TRAN
 
How to Actually Tune Your Spark Jobs So They Work
Ilya Ganelin
 
Optimizing Apache Spark SQL Joins
Databricks
 
Analyzing 1.2 Million Network Packets per Second in Real-time
DataWorks Summit
 
Apache Spark Data Source V2 with Wenchen Fan and Gengliang Wang
Databricks
 

What's hot (20)

PDF
A Thorough Comparison of Delta Lake, Iceberg and Hudi
Databricks
 
PDF
10 Tips for Configuring Your Builds with Bamboo Specs
Atlassian
 
PDF
Deep Dive: Memory Management in Apache Spark
Databricks
 
PPTX
Apache flink
Ahmed Nader
 
PDF
Best Practices for the Most Impactful Oracle Database 18c and 19c Features
Markus Michalewicz
 
PDF
Oracle Active Data Guard: Best Practices and New Features Deep Dive
Glen Hawkins
 
PPTX
Introduction to NoSQL Databases
Derek Stainer
 
PDF
Speed up UDFs with GPUs using the RAPIDS Accelerator
Databricks
 
PDF
My First 100 days with an Exadata (PPT)
Gustavo Rene Antunez
 
PPTX
NoSQL Architecture Overview
Christopher Foot
 
PDF
Apache Flink 101 - the rise of stream processing and beyond
Bowen Li
 
PPTX
Grafana Mimir and VictoriaMetrics_ Performance Tests.pptx
RomanKhavronenko
 
PDF
Introduction to Apache Flink - Fast and reliable big data processing
Till Rohrmann
 
PPTX
What is ETL testing & how to enforce it in Data Wharehouse
BugRaptors
 
PDF
Time to-live: How to Perform Automatic State Cleanup in Apache Flink - Andrey...
Flink Forward
 
PDF
Introduction to Spark with Python
Gokhan Atil
 
PDF
Introduction to Apache NiFi 1.11.4
Timothy Spann
 
PDF
How to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform
Databricks
 
PPTX
Understanding SQL Trace, TKPROF and Execution Plan for beginners
Carlos Sierra
 
A Thorough Comparison of Delta Lake, Iceberg and Hudi
Databricks
 
10 Tips for Configuring Your Builds with Bamboo Specs
Atlassian
 
Deep Dive: Memory Management in Apache Spark
Databricks
 
Apache flink
Ahmed Nader
 
Best Practices for the Most Impactful Oracle Database 18c and 19c Features
Markus Michalewicz
 
Oracle Active Data Guard: Best Practices and New Features Deep Dive
Glen Hawkins
 
Introduction to NoSQL Databases
Derek Stainer
 
Speed up UDFs with GPUs using the RAPIDS Accelerator
Databricks
 
My First 100 days with an Exadata (PPT)
Gustavo Rene Antunez
 
NoSQL Architecture Overview
Christopher Foot
 
Apache Flink 101 - the rise of stream processing and beyond
Bowen Li
 
Grafana Mimir and VictoriaMetrics_ Performance Tests.pptx
RomanKhavronenko
 
Introduction to Apache Flink - Fast and reliable big data processing
Till Rohrmann
 
What is ETL testing & how to enforce it in Data Wharehouse
BugRaptors
 
Time to-live: How to Perform Automatic State Cleanup in Apache Flink - Andrey...
Flink Forward
 
Introduction to Spark with Python
Gokhan Atil
 
Introduction to Apache NiFi 1.11.4
Timothy Spann
 
How to Utilize MLflow and Kubernetes to Build an Enterprise ML Platform
Databricks
 
Understanding SQL Trace, TKPROF and Execution Plan for beginners
Carlos Sierra
 
Ad

Similar to HTAP Queries (20)

PDF
Drill architecture 20120913
jasonfrantz
 
PPT
Postgres for the Future
EDB
 
PDF
Open Source SQL Databases
Emanuel Calvo
 
PDF
Postgres.foreign.data.wrappers.2015
EDB
 
PDF
The Central View of your Data with Postgres
EDB
 
PPTX
PostgreSQL 10: What to Look For
Amit Langote
 
PDF
Understanding Presto - Presto meetup @ Tokyo #1
Sadayuki Furuhashi
 
PPTX
Unit II Hadoop Ecosystem_Updated.pptx
BhavanaHotchandani
 
PDF
Two Leading Approaches to Data Virtualization, and Which Scales Better? by Da...
ScyllaDB
 
PDF
Understanding and building big data Architectures - NoSQL
Hyderabad Scalability Meetup
 
PDF
Software Developer Portfolio: Backend Architecture & Performance Optimization
kiwoong (daniel) kim
 
PDF
FOSSASIA 2015 - 10 Features your developers are missing when stuck with Propr...
Ashnikbiz
 
PDF
Heterogenous Persistence
Jervin Real
 
PDF
Building a Complex, Real-Time Data Management Application
Jonathan Katz
 
PDF
Datalake Architecture
TechYugadi IT Solutions & Consulting
 
ODP
HadoopDB
Miguel Pastor
 
PPT
Hive @ Hadoop day seattle_2010
nzhang
 
PDF
NoSQL and Spatial Database Capabilities using PostgreSQL
EDB
 
PDF
PostgreSQL as a Big Data Platform
Chris Travers
 
PDF
The Future of Fast Databases: Lessons from a Decade of QuestDB
javier ramirez
 
Drill architecture 20120913
jasonfrantz
 
Postgres for the Future
EDB
 
Open Source SQL Databases
Emanuel Calvo
 
Postgres.foreign.data.wrappers.2015
EDB
 
The Central View of your Data with Postgres
EDB
 
PostgreSQL 10: What to Look For
Amit Langote
 
Understanding Presto - Presto meetup @ Tokyo #1
Sadayuki Furuhashi
 
Unit II Hadoop Ecosystem_Updated.pptx
BhavanaHotchandani
 
Two Leading Approaches to Data Virtualization, and Which Scales Better? by Da...
ScyllaDB
 
Understanding and building big data Architectures - NoSQL
Hyderabad Scalability Meetup
 
Software Developer Portfolio: Backend Architecture & Performance Optimization
kiwoong (daniel) kim
 
FOSSASIA 2015 - 10 Features your developers are missing when stuck with Propr...
Ashnikbiz
 
Heterogenous Persistence
Jervin Real
 
Building a Complex, Real-Time Data Management Application
Jonathan Katz
 
HadoopDB
Miguel Pastor
 
Hive @ Hadoop day seattle_2010
nzhang
 
NoSQL and Spatial Database Capabilities using PostgreSQL
EDB
 
PostgreSQL as a Big Data Platform
Chris Travers
 
The Future of Fast Databases: Lessons from a Decade of QuestDB
javier ramirez
 
Ad

More from Atif Shaikh (10)

PPTX
Patterns and Packages in PostgreSQL for Privacy Preservation
Atif Shaikh
 
PPTX
Privacy Preserved Data Augmentation using Enterprise Data Fabric
Atif Shaikh
 
PPTX
Data Infrastructure for Your Retail Digital Strategy
Atif Shaikh
 
PPTX
Agile Analytics: Discovering Expectations
Atif Shaikh
 
PPTX
Agile Big Data Practices
Atif Shaikh
 
PPTX
Agile Analytics
Atif Shaikh
 
PPTX
Transforming Organizations to Better Leverage Analytics
Atif Shaikh
 
PPTX
Introduction to Knowledge Management
Atif Shaikh
 
PPTX
Hr Analytics
Atif Shaikh
 
PPTX
Strategy by Measurement
Atif Shaikh
 
Patterns and Packages in PostgreSQL for Privacy Preservation
Atif Shaikh
 
Privacy Preserved Data Augmentation using Enterprise Data Fabric
Atif Shaikh
 
Data Infrastructure for Your Retail Digital Strategy
Atif Shaikh
 
Agile Analytics: Discovering Expectations
Atif Shaikh
 
Agile Big Data Practices
Atif Shaikh
 
Agile Analytics
Atif Shaikh
 
Transforming Organizations to Better Leverage Analytics
Atif Shaikh
 
Introduction to Knowledge Management
Atif Shaikh
 
Hr Analytics
Atif Shaikh
 
Strategy by Measurement
Atif Shaikh
 

Recently uploaded (20)

PPTX
HSE WEEKLY REPORT for dummies and lazzzzy.pptx
ahmedibrahim691723
 
PPTX
Probability systematic sampling methods.pptx
PrakashRajput19
 
PDF
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
PPTX
IP_Journal_Articles_2025IP_Journal_Articles_2025
mishell212144
 
PPT
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 
PDF
Fundamentals and Techniques of Biophysics and Molecular Biology (Pranav Kumar...
RohitKumar868624
 
PDF
Technical Writing Module-I Complete Notes.pdf
VedprakashArya13
 
PPTX
Introduction-to-Python-Programming-Language (1).pptx
dhyeysapariya
 
PPTX
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 
PPTX
Blue and Dark Blue Modern Technology Presentation.pptx
ap177979
 
PDF
Key_Statistical_Techniques_in_Analytics_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
PPTX
Introduction to Biostatistics Presentation.pptx
AtemJoshua
 
PDF
An Uncut Conversation With Grok | PDF Document
Mike Hydes
 
PPTX
Web dev -ppt that helps us understand web technology
shubhragoyal12
 
PPTX
Fluvial_Civilizations_Presentation (1).pptx
alisslovemendoza7
 
PPTX
Fuzzy_Membership_Functions_Presentation.pptx
pythoncrazy2024
 
PDF
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
PPTX
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
PPTX
Introduction to Data Analytics and Data Science
KavithaCIT
 
PDF
Blue Futuristic Cyber Security Presentation.pdf
tanvikhunt1003
 
HSE WEEKLY REPORT for dummies and lazzzzy.pptx
ahmedibrahim691723
 
Probability systematic sampling methods.pptx
PrakashRajput19
 
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
IP_Journal_Articles_2025IP_Journal_Articles_2025
mishell212144
 
From Vision to Reality: The Digital India Revolution
Harsh Bharvadiya
 
Fundamentals and Techniques of Biophysics and Molecular Biology (Pranav Kumar...
RohitKumar868624
 
Technical Writing Module-I Complete Notes.pdf
VedprakashArya13
 
Introduction-to-Python-Programming-Language (1).pptx
dhyeysapariya
 
Presentation (1) (1).pptx k8hhfftuiiigff
karthikjagath2005
 
Blue and Dark Blue Modern Technology Presentation.pptx
ap177979
 
Key_Statistical_Techniques_in_Analytics_by_CA_Suvidha_Chaplot.pdf
CA Suvidha Chaplot
 
Introduction to Biostatistics Presentation.pptx
AtemJoshua
 
An Uncut Conversation With Grok | PDF Document
Mike Hydes
 
Web dev -ppt that helps us understand web technology
shubhragoyal12
 
Fluvial_Civilizations_Presentation (1).pptx
alisslovemendoza7
 
Fuzzy_Membership_Functions_Presentation.pptx
pythoncrazy2024
 
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
Introduction to Data Analytics and Data Science
KavithaCIT
 
Blue Futuristic Cyber Security Presentation.pdf
tanvikhunt1003
 

HTAP Queries

  • 1. HTAP Queries & Data Fabrics Atif Rahman @mantaq10 7th December, 2018 PostgreSQL Down Under Melbourne, Australia
  • 2. The agenda OLTP vs OLAP vs HTAP The Problem Statement Data Fabrics Key PostgreSQL Features Foreign Data Wrappers Distributed Cache Background Patterns Components
  • 3. OLTP vs OLAP vs HTAP OLTP OLAP HTAP Smaller transactions Lots of them Lots of Updates ACID (Transactional) CRUD (Commands) complex queries Large working sets Bulk loads & offloads INDEXING AGGREGATIONS Mixed workloads on the same system Analytics on ‘inflight’ transactional data Addresses resource contention FEDERATION SYNCHRONISATION
  • 4. Hybrid Transactional / Analytical Processing Data Organisation Row wise Writes OLTP Column Wise Reads OLAP OLTP OLAP Analytics Latency Data Redundancy Single platform for multimodel HTAP HTAP
  • 6. Data & Application Integration Data Integration (ETL/ELT) Application Integration (ESB / API) Data Virtualisation (DV) Low Fidelity View Integration Type Physical Movement and Consolidation Synchronization and Propagation Abstraction, Virtual Consolidation, Federation Purpose Database to Database Application to Application Database to Application Agility* Weeks, Months Minutes, Hours Hours, Days Repository Warehouse / Lake Transactional System Semantic Layer Run Time* Typically Scheduled Event Driven Typically OnDemand
  • 7. Data Warehouse vs Data Lakes Data Warehouse Data Lake • Schema on write • (early binding) • BI and analysts • Arguably better governed. • MPP / SMP databases • Schema on read • (Late binding) • Data scientists • Arguably more flexible • MapReduce et al.
  • 9. Data Warehouse vs Data Lakes Data Warehouse Data Lake Metro Uber
  • 12. The Problems • ACID • Atomicity • Concurrency • Isolation • Durability • BASE • Basic Availability • Soft States • Eventual Consistency • CAP Theorem (Consistency vs Availability vs Partitions) • CQRS (Command Query Response Segregation)
  • 13. [Distributed Cache] Metrics & Analytics Complex Alerts Scheduled Feeds Adhoc Queries Detail Data Logical Unified Data Model Subject Area Logical Data Models Enterprise Data Fabric Cluster Unified queries Business & Exceptions Rules Normalisation and Standardisation Rules Top Down Data Quality (and profiling) Connectors (JDBC, APIs, ODBC, etc) Schedulers and Refresh Jobs Rules Query Planners & Optimiser(s) Data Fabric Architecture • Schema Store • Distributed Cache • Query Federation • Query Optimisation • Semantic Normalisation
  • 14. Foreign Data Wrappers One database to rule them all, One database to find them, One database to bring them all, And in a wrapper bind them.
  • 15. Foreign Data Wrappers • Uses the standard compliant SQL/MED • *Data Type Translations (SQL, NoSQL etc) • *Push Down Predicates • WHERE and ORDER BY are propagated • Required COLUMNS • *Supports CRUD • *Two Way Joins • Import Foreign Schemas *May vary based on specific wrapper
  • 16. (Reflections) the PostgreSQL way TABLE_ALPHA A Database Cluster Shared Buffer Pool TABLE_ALPHA A BEGIN; INSERT INTO TABLE_ALPHA VALUES (‘B’); COMMIT; TABLE_ALPHA A B Dirty Page(s) Without WAL Logs OS Cache Use WAL Logs for Cache Synchronisation TABLE_ALPHA A TABLE_ALPHA A TABLE_ALPHA A B Dirty Page(s) BEGIN; INSERT INTO TABLE_ALPHA VALUES (‘B’); COMMIT; WAL Checkpoints write back to disk pg_prewarm() Loads OS and buffer cache(s)
  • 17. Distributed Cache with PostgreSQL Disk Cache Disk Cache FDW Cache FDW Cache Distributed PG Cluster with Federated queries loaded directed to Cache BI APP DS MSG Disk Distributed Cache Disk FDW FDW Distributed PG Cluster integrated With Distributed Cache Technologies BI APP DS MSG JDBC, pgmemcache Apache Ignite Dremio Terracotta memcached Some Technologies
  • 18. Key PostgreSQL Features for Fabrics Schema Store Distributed Cache Query Federation Query Optimisation Semantic Normalisation Dirty Pages pg_prewarm() pgmemcache Persistence WAL Database CTEs, Least Cost Routing ORM + FRM Foreign Data Wrappers
  • 19. Key Takeaways • Cloud migrations • Jurisdictions (privacy and availability zones) • Computing at the edge of the network. • Data Virtualisation • Rights to be Forgotten (GDPR) • Query Lineage & Audit across ALL data • Areas for future development.

Editor's Notes

  • #3: Anyone heard of HTAP before? Big Data
  • #4: They freak out when you try to run batch queries. Finicky about offload windows They Typically have huge backlogs Emerging pattern. Atomic All operations in a transaction succeed or every operation is rolled back. Consistent On the completion of a transaction, the database is structurally sound. Isolated Transactions do not contend with one another. Contentious access to data is moderated by the database so that transactions appear to run sequentially. Durable The results of applying a transaction are permanent, even in the presence of failures. asic Availability The database appears to work most of the time. Soft-state Stores don’t have to be write-consistent, nor do different replicas have to be mutually consistent all the time. Eventual consistency Stores exhibit consistency at some later point (e.g., lazily at read time).
  • #5: NOT A NEW CONCEPT
  • #6: The scary santa claus!
  • #8: We need both but we didn’t realize that until recently. Late Binding
  • #9: It started as an epic battle! Society was divided…
  • #10: The metro and the uber We need both but we didn’t realize that until recently. Late Binding
  • #11: Issues in CAP theorem BASE is not ACID CQRS is well suited to edge cases, not many instances where it worked for the norms. Other issues in distributed systems remain or have become more specialised problems Two approaches to overcome operational copmexlity: Multimodel databases or tightly-integrated middleware over multiple single-model data stores Caveat: In order for a custom data model to support concurrent updates, the database must be able to synchronize updates across multiple keys. ACID transactions, if they are sufficiently performant, allow such synchronization.[12] JSON documents, graphs, and relational tables can all be implemented in a manner that inherits the horizontal scalability and fault-tolerance of the underlying data store.
  • #13: Caveat: In order for a custom data model to support concurrent updates, the database must be able to synchronize updates across multiple keys. ACID transactions, if they are sufficiently performant, allow such synchronization.[12] JSON documents, graphs, and relational tables can all be implemented in a manner that inherits the horizontal scalability and fault-tolerance of the underlying data store.
  • #15: OLAP systems are great are low volume transactions but typically complex and long running queries with larger working sets Differentiated between adhoc queries vs operational queries. Massively parallel processing engines came out Spins off like Vertica, Greenplum, Netezza and Teradata all were based on PostgreSQL at some point. Architectural drivers Reliability Short queries but very frequent Transaction Support is critical (ACID) Security Typically lots of ad-hoc writes Not great for complex and bulk reads Resource contention Indexes and other data structures. Data is typically in 3NF kind of models Although with PostgreSQL, we know this is not true. JSON, etc.
  • #16: OLAP systems are great are low volume transactions but typically complex and long running queries with larger working sets Differentiated between adhoc queries vs operational queries. Massively parallel processing engines came out Spins off like Vertica, Greenplum, Netezza and Teradata all were based on PostgreSQL at some point. Architectural drivers Reliability Short queries but very frequent Transaction Support is critical (ACID) Security Typically lots of ad-hoc writes Not great for complex and bulk reads Resource contention Indexes and other data structures. Data is typically in 3NF kind of models Although with PostgreSQL, we know this is not true. JSON, etc.
  • #17: OLAP systems are great are low volume transactions but typically complex and long running queries with larger working sets Differentiated between adhoc queries vs operational queries. Massively parallel processing engines came out Spins off like Vertica, Greenplum, Netezza and Teradata all were based on PostgreSQL at some point. Architectural drivers Reliability Short queries but very frequent Transaction Support is critical (ACID) Security Typically lots of ad-hoc writes Not great for complex and bulk reads Resource contention Indexes and other data structures. Data is typically in 3NF kind of models Although with PostgreSQL, we know this is not true. JSON, etc.
  • #20: OLAP systems are great are low volume transactions but typically complex and long running queries with larger working sets Differentiated between adhoc queries vs operational queries. Massively parallel processing engines came out Spins off like Vertica, Greenplum, Netezza and Teradata all were based on PostgreSQL at some point. Architectural drivers Reliability Short queries but very frequent Transaction Support is critical (ACID) Security Typically lots of ad-hoc writes Not great for complex and bulk reads Resource contention Indexes and other data structures. Data is typically in 3NF kind of models Although with PostgreSQL, we know this is not true. JSON, etc.