Lead Technical Evangelist, DataStax Enterprise
@RachelPedreschi
Rachel Pedreschi

Enabling Search in your Cassandra Application with DataStax Enterprise
What is Search?
Confidential
The bright blue butterfly hangs on the breeze.

[the] [bright] [blue] [butterfly] [hangs] [on] [the] [breeze]
Terms
Credit: https://developer.apple.com/library/mac/documentation/userexperience/conceptual/SearchKitConcepts/searchKit_basics/searchKit_basics.html
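The slide above splits a sentence into terms. As a rough sketch of that idea (not how Lucene's analyzers are actually implemented, which also handle things like stop words and stemming), the Python below lowercases a sentence, splits it into terms, and builds a small inverted index from terms to document ids. The second document and the function names are hypothetical, added only for illustration.

```python
import re
from collections import defaultdict

def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())

def build_inverted_index(docs):
    """Map each term to the sorted list of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in tokenize(text):
            index[term].add(doc_id)
    return {term: sorted(ids) for term, ids in index.items()}

docs = {
    1: "The bright blue butterfly hangs on the breeze.",
    2: "A blue breeze",  # hypothetical second document
}

terms = tokenize(docs[1])
index = build_inverted_index(docs)

print(terms)               # the terms from the slide, in order
print(index["blue"])       # ids of documents containing "blue"
print(index["butterfly"])  # ids of documents containing "butterfly"
```

Searching for a term then becomes a dictionary lookup rather than a scan of every document, which is the basic reason a search index answers keyword queries quickly.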
What is Solr Missing?
Not a database
Doesn’t cluster
Not transparently sharded
Requires ETL to ingest application data
Doesn’t reindex
Cassandra
✓Highly available
✓Linear scalability
✓Low latency OLTP queries
+ =
It's Not ETL, It's Replication!
[Diagram: cluster nodes labeled C*, C*, and C*/Solr]
Data stored in Cassandra
Indexes stored in Solr/Lucene
[Diagram: Solr and Cassandra shown side by side, each split into Memory and Disk. Labels: Coordinator, Shard Router, Commit Log, Memtable, SSTables, RAM Buffer, Index Segments.]
UPDATE videos
SET tags = {'cat tubes', 'Al Gore''s Internet', 'NoSQL Fairytales'}
WHERE videoid = b3a76c6b-7c7f-4af6-964f-803a9283c401;
OSS Solr
[Diagram: Memory and Disk areas containing a RAM Buffer and Index Segments, with regions labeled Not Searchable and Searchable.]
DSE Search
[Diagram: Memory and Disk areas containing a RAM Buffer and Index Segments, labeled Searchable only.]
Let’s see this in action!
// Videos by id
CREATE TABLE videos (
videoid uuid,
userid uuid,
name text,
description text,
location text,
location_type int,
preview_image_location text,
tags set<text>,
added_date timestamp,
PRIMARY KEY (videoid)
);
How can we search on tags?
// Index for tag keywords
CREATE TABLE videos_by_tag (
tag text,
videoid uuid,
added_date timestamp,
userid uuid,
name text,
preview_image_location text,
tagged_date timestamp,
PRIMARY KEY (tag, videoid)
);
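The videos_by_tag table above is an index the application has to maintain itself: every tag on a video becomes its own row. A minimal Python sketch of that fan-out follows. The column names come from the two CREATE TABLE statements; the sample values and the function name rows_for_videos_by_tag are hypothetical, and no database driver is involved.

```python
from datetime import datetime, timezone
from uuid import UUID

def rows_for_videos_by_tag(video, tagged_date):
    """Return one videos_by_tag row (as a dict) for each tag on the video."""
    return [
        {
            "tag": tag,
            "videoid": video["videoid"],
            "added_date": video["added_date"],
            "userid": video["userid"],
            "name": video["name"],
            "preview_image_location": video["preview_image_location"],
            "tagged_date": tagged_date,
        }
        for tag in sorted(video["tags"])
    ]

# Hypothetical sample video; only the column names come from the schema above.
video = {
    "videoid": UUID("b3a76c6b-7c7f-4af6-964f-803a9283c401"),
    "userid": UUID("00000000-0000-0000-0000-000000000001"),
    "name": "Example video",
    "preview_image_location": "/previews/example.png",
    "added_date": datetime(2015, 1, 1, tzinfo=timezone.utc),
    "tags": {"cat tubes", "NoSQL Fairytales"},
}

rows = rows_for_videos_by_tag(
    video, tagged_date=datetime(2015, 1, 2, tzinfo=timezone.utc)
)
for row in rows:
    print(row["tag"], row["videoid"])
```

Each of these rows would need its own write alongside the write to videos, and the table only answers lookups by exact tag, since tag is the partition key.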
// Videos by id
CREATE TABLE videos (
videoid uuid,
userid uuid,
name text,
description text,
location text,
location_type int,
preview_image_location text,
tags set<text>,
added_date timestamp,
PRIMARY KEY (videoid)
);
Or this?
But what about this?
Behind the scenes…
dsetool
schema.xml
solrconfig.xml
CQL Query
$ dsetool create_core killrvideo.videos generateResources=true
<?xml version="1.0" encoding="UTF-8" standalone="no"?>
<schema name="autoSolrSchema" version="1.5">
<types>
…
<fields>
<field indexed="true" multiValued="false" name="added_date" stored="true" type="TrieDateField"/>
<field indexed="true" multiValued="false" name="location" stored="true" type="TextField"/>
<field indexed="true" multiValued="false" name="preview_image_location" stored="true" type="TextField"/>
<field indexed="true" multiValued="false" name="name" termVectors="true" stored="true" type="TextField"/>
<field indexed="true" multiValued="true" name="tags" termVectors="true" stored="true" type="TextField"/>
<field indexed="true" multiValued="false" name="userid" stored="true" type="UUIDField"/>
<field indexed="true" multiValued="false" name="videoid" stored="true" type="UUIDField"/>
<field indexed="true" multiValued="false" name="location_type" stored="true" type="TrieIntField"/>
<field indexed="true" multiValued="false" name="description" termVectors="true" stored="true" type="TextField"/>
</fields>
<uniqueKey>videoid</uniqueKey>
</schema>
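Comparing the generated schema with the videos table suggests a simple correspondence between CQL column types and Solr field types. The Python sketch below only restates what is visible in this one generated file; it is not the logic dsetool uses, the mapping is not a complete list of supported types, and it ignores attributes such as termVectors. The names CQL_TO_SOLR and solr_field are hypothetical.

```python
# Mapping read off the generated schema above; an assumption, not a full list.
CQL_TO_SOLR = {
    "uuid": "UUIDField",
    "text": "TextField",
    "timestamp": "TrieDateField",
    "int": "TrieIntField",
}

def solr_field(name, cql_type):
    """Describe the Solr field that corresponds to a CQL column in this example."""
    # A CQL set<...> column shows up as a multiValued field of the element type.
    multi_valued = cql_type.startswith("set<") and cql_type.endswith(">")
    base_type = cql_type[4:-1] if multi_valued else cql_type
    return {
        "name": name,
        "type": CQL_TO_SOLR[base_type],
        "multiValued": multi_valued,
    }

columns = [
    ("videoid", "uuid"),
    ("name", "text"),
    ("tags", "set<text>"),
    ("added_date", "timestamp"),
    ("location_type", "int"),
]
fields = [solr_field(name, cql_type) for name, cql_type in columns]
for field in fields:
    print(field)
```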
<!--
=======
Copyright DataStax, Inc.
Please see the included license file for details.
-->
<!--
For more details about configurations options that may appear in
this file, see http://wiki.apache.org/solr/SolrConfigXml.
-->
<config>
<!-- In all configuration below, a prefix of "solr." for class names
is an alias that causes solr to search appropriate packages,
including org.apache.solr.(search|update|request|core|analysis)
You may also specify a fully qualified Java classname if you
have your own custom plugins.
-->
…
SELECT * FROM killrvideo.videos
WHERE solr_query='name:*';
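The query above passes a Solr query string through the solr_query column in ordinary CQL. As a small illustration, the Python helper below builds such a statement as text and doubles any single quotes so the Solr query stays inside the CQL string literal. The helper name solr_query_cql and the second example query are hypothetical; in a real application, bound parameters in a driver would usually be preferable to string building, and the keyspace and table names here are not validated.

```python
def solr_query_cql(keyspace, table, solr_query):
    """Build a CQL SELECT that passes a Solr query string via solr_query."""
    # CQL string literals escape a single quote by doubling it.
    escaped = solr_query.replace("'", "''")
    return f"SELECT * FROM {keyspace}.{table} WHERE solr_query='{escaped}';"

# The query from the slide: every row whose name field has a value.
print(solr_query_cql("killrvideo", "videos", "name:*"))

# Hypothetical example: a phrase query against the tags field.
print(solr_query_cql("killrvideo", "videos", 'tags:"cat tubes"'))
```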
Thank you!