Leveraging the Power of Cassandra:
Operational Reporting & Interactive
Analysis

Ernesto Ongaro
BI Consultant Jaspersoft
Agenda

• Requirements for Cassandra reporting and analysis
• Current state of reporting and analysis
• Architectural approaches
• Demo
• Q&A

©2013 Jaspersoft Corporation. Proprietary and Confidential

Requirements for Cassandra reporting and analysis

• People want access to the data in Cassandra
• Most consumers of data are not technical
• Traditional reporting and analytics tools don’t work with Cassandra
• Building reports from scratch is not easy or fun
• Providing ad-hoc analytics is very complicated

Current State of Reporting & Analytics

• Connectors are for RDBMS only
• Expensive
• Desktop
• Standalone

Advantages to using a reporting and analysis framework

Capability                Build it yourself   Use a framework
Visual report designer    ✖                   ✔
Security                  ✖                   ✔
Scheduling                ✖                   ✔
Web access                ✖                   ✔
API                       ✖                   ✔
Self-service queries      ✖                   ✔
Charting libraries        ✖                   ✔
Metadata layer            ✖                   ✔
Input controls            ✖                   ✔
Flexibility               ✔                   ✔

Architectural Approaches

• Four methods to visualize your Cassandra data
① ETL approach (Extract, Transform, Load)
② Direct access reports and dashboards
③ Direct access data exploration
④ Any of approaches 1-3 combined with Hadoop Hive

1 – ETL Approach

• Most traditional approach
• Data is extracted via batch method
• Option with most connectors
• ETL process is most robust option

[Diagram labels: BI Platform, SQL, ETL, RDBMS]
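
The batch flow on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not JaspersoftETL itself: the source rows are a hard-coded stand-in for a batch read from Cassandra, SQLite stands in for the reporting RDBMS, and the table and column names (`daily_avg`, `sensor_id`, `day`, `reading`) are hypothetical.

```python
import sqlite3
from collections import defaultdict

# Hypothetical rows standing in for a batch read from a Cassandra table.
CASSANDRA_ROWS = [
    {"sensor_id": "s1", "day": "2013-10-01", "reading": 20.5},
    {"sensor_id": "s1", "day": "2013-10-01", "reading": 21.5},
    {"sensor_id": "s2", "day": "2013-10-01", "reading": 18.0},
    {"sensor_id": "s1", "day": "2013-10-02", "reading": 19.0},
]


def extract(rows):
    """Extract: yield the source rows in a single batch pass."""
    yield from rows


def transform(rows):
    """Transform: reduce raw readings to one average per sensor per day."""
    totals = defaultdict(lambda: [0.0, 0])  # (sensor, day) -> [sum, count]
    for row in rows:
        key = (row["sensor_id"], row["day"])
        totals[key][0] += row["reading"]
        totals[key][1] += 1
    return [(s, d, total / n) for (s, d), (total, n) in sorted(totals.items())]


def load(conn, records):
    """Load: write the aggregated records into the reporting database."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS daily_avg ("
        "sensor_id TEXT, day TEXT, avg_reading REAL, "
        "PRIMARY KEY (sensor_id, day))"
    )
    conn.executemany("INSERT OR REPLACE INTO daily_avg VALUES (?, ?, ?)", records)
    conn.commit()


# SQLite (in memory) stands in for the RDBMS that the BI platform queries.
conn = sqlite3.connect(":memory:")
load(conn, transform(extract(CASSANDRA_ROWS)))

# The BI platform then reports against the RDBMS with plain SQL.
report = conn.execute(
    "SELECT sensor_id, day, avg_reading FROM daily_avg ORDER BY sensor_id, day"
).fetchall()
```

Because the load step uses `INSERT OR REPLACE`, rerunning the batch overwrites earlier aggregates instead of duplicating them, which is one common way to keep a scheduled ETL job repeatable.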

JaspersoftETL

• Powered by
• Over 450 connectors
• Data quality, transformations, aggregations

2 – Direct Access Reports and Dashboards

• Reports are developed using Jaspersoft Studio (Eclipse-based designer)
• Lowest latency
• Good supplement to ETL when “near time” is required
• Connector based on https://github.com/Netflix/astyanax

[Diagram labels: BI Platform, CQL3 Native Connector]
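
As a rough illustration of direct access, the sketch below turns report input-control values into a CQL3 `SELECT` with bind markers. It only builds the statement text and its parameters. How the statement is executed depends on the connector and driver in use, which is not shown here. The keyspace, table, and column names (`metrics.readings`, `sensor_id`, `ts`, `reading`) are hypothetical.

```python
from datetime import datetime


def build_report_query(sensor_id, since=None, limit=100):
    """Build a CQL3 SELECT and its bind values from report input controls.

    Assumes a hypothetical table metrics.readings whose partition key is
    sensor_id and whose clustering column is ts.
    """
    cql = "SELECT sensor_id, ts, reading FROM metrics.readings WHERE sensor_id = ?"
    params = [sensor_id]
    if since is not None:
        # Range restriction on the clustering column within one partition.
        cql += " AND ts >= ?"
        params.append(since)
    # LIMIT is written as a validated integer literal rather than bound.
    cql += " LIMIT " + str(int(limit))
    return cql, params


cql, params = build_report_query("s1", since=datetime(2013, 10, 1), limit=50)
```

Restricting the query to one partition key plus a range on the clustering column keeps it within what Cassandra can serve efficiently, which is what makes this approach suitable for low-latency reports.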
Example Dashboard

3 – Direct Access Exploration

• Allows users to explore data (vs. pre-defined reports and dashboards)
• Loads the results of a query into memory, where further filtering, grouping and aggregation occur

[Diagram labels: In-Memory OLAP Engine, BI Platform, CQL3 Native Connector]
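
The idea of exploring a result set in memory can be sketched as follows. This is a minimal stand-in for the in-memory engine, not its actual implementation: the rows are a hard-coded substitute for the result of one CQL query, and the field names (`region`, `product`, `revenue`) are hypothetical.

```python
from collections import defaultdict

# Hypothetical result set, as if one query's rows had been loaded into memory.
RESULT_SET = [
    {"region": "EU", "product": "A", "revenue": 100.0},
    {"region": "EU", "product": "B", "revenue": 50.0},
    {"region": "US", "product": "A", "revenue": 70.0},
    {"region": "US", "product": "A", "revenue": 30.0},
]


def explore(rows, group_by, measure, where=None):
    """Filter rows, then sum `measure` for each distinct `group_by` combination."""
    totals = defaultdict(float)
    for row in rows:
        if where is not None and not where(row):
            continue  # filtering happens in memory, not in Cassandra
        key = tuple(row[dim] for dim in group_by)
        totals[key] += row[measure]
    return dict(totals)


# Two different views over the same loaded data, with no further queries.
by_region = explore(RESULT_SET, ["region"], "revenue")
product_a = explore(
    RESULT_SET, ["region", "product"], "revenue",
    where=lambda r: r["product"] == "A",
)
```

Each new grouping or filter reuses the rows already in memory, so the user can pivot freely, at the cost of holding the whole result set in the BI platform's memory.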

Example OLAP View

4 – Hadoop Hive

• Good for massive data
• Batch process
• Native Hadoop Hive connector as well

[Diagram labels: BI Platform, HQL, SQL, ETL, RDBMS, libhive]
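
A batch rollup of the kind Hive suits might look like the sketch below, which only assembles the HiveQL text. It assumes the Cassandra data is already exposed to Hive as a table, and the table and column names (`readings`, `daily_avg`, `sensor_id`, `ts`, `reading`) are hypothetical. Submitting the statement to Hive is left out.

```python
def build_daily_rollup_hql(source_table, target_table):
    """Return a HiveQL statement that rebuilds a daily-average rollup in one batch job."""
    return (
        f"INSERT OVERWRITE TABLE {target_table} "
        f"SELECT sensor_id, to_date(ts) AS day, avg(reading) AS avg_reading "
        f"FROM {source_table} "
        f"GROUP BY sensor_id, to_date(ts)"
    )


# Hypothetical table names; the resulting string would be run as a Hive job.
hql = build_daily_rollup_hql("readings", "daily_avg")
```

`INSERT OVERWRITE` replaces the target table's contents on every run, which fits the batch nature of this approach: the rollup is rebuilt on a schedule rather than updated row by row.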
Demonstration

Demo flow:
• Example Dashboard + report
• Jaspersoft Studio
• Ad-hoc Exploration

Demo environment:
• Jaspersoft 5.5 – runs on Tomcat 7
• DataStax Enterprise 3.1 (Cassandra 1.2.10.1)

Questions?
www.jaspersoft.com
BigData@jaspersoft.com

Conclusion

• Four different ways to get insights from Cassandra
• Commercial open-source software
• Get started at http://jaspersoft.com
• Thank you!


C* Summit EU 2013: Leveraging the Power of Cassandra: Operational Reporting and Interactive Analysis
