SlideShare a Scribd company logo
1 | © Copyright 2024 Zilliz
1
Step-by-Step - RAG 101
Tim Spann @ Zilliz
Principal Developer Advocate
2 | © Copyright 2024 Zilliz
2
Slides
XX
3 | © Copyright 2024 Zilliz
3
3 | © Copyright 10/22/23 Zilliz
3 | © Copyright 2024 Zilliz
Tim Spann
Principal Developer
Advocate, Zilliz
tim.spann@zilliz.com
https://www.linkedin.com/in/timothyspann/
https://x.com/PaaSDev
4 | © Copyright 2024 Zilliz
4
Agenda About Zilliz, Milvus and Vector Databases
01
Level 100  Basic RAG
References
03
02
5 | © Copyright 2024 Zilliz
5 | © Copyright 9/25/23 Zilliz
5
VECTOR DATABASE
6 | © Copyright 2024 Zilliz
6 | © Copyright 9/25/23 Zilliz
6
The Forrester Wave™ Vector
Database Providers, Q3 2024
Zilliz is the right partner for
your Vector Database
needs.
7 | © Copyright 2024 Zilliz
7 | © Copyright 9/25/23 Zilliz
7
Milvus is an Open-Source Vector Database to
store, index, manage, and use the massive
number of embedding vectors generated by
deep neural networks and LLMs.
contributors
283
stars
30K
docker pulls
67M
forks
2.8K
+
Milvus: The most widely-adopted vector database
8 | © Copyright Zilliz
8
Easy Open RAG Stack Highlighted
Framework
Hardware
Infrastructure
Embedding Models LLMs
Software Infrastructure
Vector Database
9 | © Copyright Zilliz
9
10 | © Copyright Zilliz
10
Image from Nvidia
Vector Search Overview
Vn, 1
…
…
…
1
2
3
4
5
Transform into
Vectors
Unstructured Data
Images
User Generated
Content
Video
Documents
Audio
Vector Embeddings
Perform
Approximate
Nearest Neighbor
Similarity Search
Perform Query
Get Results
Store in Vector Database
How Similarity Search Works
12 | © Copyright 2024 Zilliz
12 | © Copyright 9/25/23 Zilliz
12
RAG
Retrieval Augmented Generation)
13 | © Copyright 2024 Zilliz
13
Basic Idea
Use RAG to force the LLM to work with your
data by injecting it via a vector database like
Milvus
14 | © Copyright 2024 Zilliz
14
Basic RAG Architecture
1
5
Retrieval-Augmented Generation (RAG)
2024
A technique that combines the
strength of retrieval-based and
generative models:
● Improve accuracy and relevance
● Eliminate hallucination
● Provide domain-specific
knowledge
1
6
RAG : an economic perspective
2024
A business model that bridges public
data and private data
● Data sovereignty
● You can't and shouldn't give your
private data to others
17 | © Copyright 2024 Zilliz
17 | © Copyright 9/25/23 Zilliz
17
Your Turn
Time To Build a Simple RAG App
18 | © Copyright 2024 Zilliz
18
Documents From Here
https://bit.ly/3U7Okgu
19 | © Copyright 2024 Zilliz
19
Notebook From Here
https://bit.ly/3BV4IKX
20 | © Copyright Zilliz
20
21 | © Copyright Zilliz
21
22 | © Copyright Zilliz
22
Python SDK Connect…
23 | © Copyright Zilliz
23
Choose Your Embedding Function
24 | © Copyright Zilliz
24
Vector Embedding
25 | © Copyright Zilliz
25
26 | © Copyright Zilliz
26
27 | © Copyright Zilliz
27
28 | © Copyright Zilliz
28
29 | © Copyright Zilliz
29
30 | © Copyright Zilliz
30
31 | © Copyright Zilliz
31
32 | © Copyright Zilliz
32
33 | © Copyright 2024 Zilliz
33 | © Copyright 9/25/23 Zilliz
33
Advanced RAG
34 | © Copyright Zilliz
34
https://bit.ly/4eFdMlK https://bit.ly/3BLeLCx
35 | © Copyright Zilliz
35
https://bit.ly/3zXW8dX https://bit.ly/3NuK5ru
36 | © Copyright Zilliz
36
https://bit.ly/4gZ4Lpn
Metadata Filtering
Hybrid Search
Agents
https://bit.ly/3UbqUqx
https://bit.ly/3YpKd1K
Smart Chunking
Embedding Model Choice
37 | © Copyright 2024 Zilliz
37 | © Copyright 9/25/23 Zilliz
37
Resources
38 | © Copyright Zilliz
38
Wednesday, October 23, 2024
5:30 PM to 8:30 PM EDT NYC
https://bit.ly/403JIvy
Thursday, October 24, 2024
6:30 PM to 8:00 PM EDT Princeton
https://bit.ly/3NopXY2
November 5-7, 10-12, 2024: CloudX
https://www.developerweek.com/cloudx/
November 15, 2024: Build Stuff
https://www.developerweek.com/cloudx/speakers/
November 19, 2024: Xtreme Python
https://xtremepython.dev/2024/
November 21, 2024: bigdata Conference
November 21, 2024: Unstructured Data NYC
39 | © Copyright Zilliz
39
Vector Database Resources
Give Milvus a Star! Chat with me on Discord!
https://github.com/milvus-io/milvus
40
Unstructured Data Meetup
https://www.meetup.com/unstructured-data-meetup-new-york/
This meetup is for people working in unstructured data. Speakers will come present about related topics
such as vector databases, LLMs, and managing data at scale. The intended audience of this group
includes roles like machine learning engineers, data scientists, data engineers, software engineers, and
PMs.
This meetup was formerly Milvus Meetup, and is sponsored by Zilliz maintainers of Milvus.
https://medium.com/@tspann/shining-some-light-on-the-new-milvus-lite-5a0565eb5dd9
https://medium.com/@tspann/unstructured-data-processing-with-a-raspberry-pi-ai-kit-c959dd7fff47
Raspberry Pi AI Kit Hailo
Edge AI
43 | © Copyright 2024 Zilliz
43
43
This week in Milvus, Towhee, Attu, GPT
Cache, Gen AI, LLM, Apache NiFi, Apache
Flink, Apache Kafka, ML, AI, Apache Spark,
Apache Iceberg, Python, Java, Vector DB
and Open Source friends.
https://bit.ly/32dAJft
https://github.com/milvus-io/milvus
AIM Weekly by Tim Spann
44 | © Copyright 2024 Zilliz
44
milvus.io
github.com/milvus-io/
@milvusio
@paasDev
/in/timothyspann
Connect with me!
Thank you!
45 | © Copyright 2024 Zilliz
45
Milvus
Open Source Self-Managed
Zilliz Cloud
SaaS Fully-Managed
github.com/milvus-io/milvus
Getting Started with Vector Databases
zilliz.com/cloud
29K - Star us on GitHub!
46 | © Copyright 2024 Zilliz
46
Get started for free
zilliz.com/cloud
47 | © Copyright 2024 Zilliz
47 | © Copyright 9/25/23 Zilliz
47
Deep-Dive
48 | © Copyright Zilliz
48
Vector Similarity Measures: L2 Euclidean)
Queen = [0.3, 0.9]
King = [0.5, 0.7]
d(Queen, King) = √(0.3-0.5)2
+ (0.9-0.7)2
= √(0.2)2
+ (0.2)2
= √0.04 + 0.04
= √0.08 ≅ 0.28
49 | © Copyright Zilliz
49
Vector Similarity Measures: Inner Product IP
Queen = [0.3, 0.9]
King = [0.5, 0.7]
Queen · King = (0.3*0.5) + (0.9*0.7)
= 0.15 + 0.63 = 0.78
50 | © Copyright Zilliz
50
Queen = [0.3, 0.9]
King = [0.5, 0.7]
Vector Similarity Measures: Cosine
𝚹
cos(Queen, King) = (0.3*0.5)+(0.9*0.7)
√0.32
+0.92
* √0.52
+0.72
= 0.15+0.63 _
√0.9 * √0.74
= 0.78 _
√0.666
≅ 0.03
51 | © Copyright Zilliz
51
Hybrid Search
52 | © Copyright Zilliz
52

More Related Content

PDF
08-13-2024 NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)
Timothy Spann
 
PDF
NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)
Zilliz
 
PDF
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
Timothy Spann
 
PDF
09-03-2024_UnstructuredDataAndAIDiscussion.pdf
Timothy Spann
 
PDF
2025-02-24 - AWS meetup - Zilliz presentation.pdf
Ivan Tang
 
PDF
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
Timothy Spann
 
PDF
tspann08-Nov-2024_PyDataNYC_Unstructured Data Processing with a Raspberry Pi ...
Timothy Spann
 
PDF
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
Zilliz
 
08-13-2024 NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)
Timothy Spann
 
NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)
Zilliz
 
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
Timothy Spann
 
09-03-2024_UnstructuredDataAndAIDiscussion.pdf
Timothy Spann
 
2025-02-24 - AWS meetup - Zilliz presentation.pdf
Ivan Tang
 
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
Timothy Spann
 
tspann08-Nov-2024_PyDataNYC_Unstructured Data Processing with a Raspberry Pi ...
Timothy Spann
 
Hands-on Tutorial: Building an Agent to Reason about Private Data with OpenAI...
Zilliz
 

Similar to 17-October-2024 NYC AI Camp - Step-by-Step RAG 101 (20)

PDF
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
Zilliz
 
PDF
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Zilliz
 
PDF
09-19-2024 AI Camp Hybrid Seach - Milvus for Vector Database
Timothy Spann
 
PDF
09-25-2024 NJX Venture Summit Introduction to Unstructured Data
Timothy Spann
 
PDF
Open Source Milvus Vector Database v 2.6
Zilliz
 
PDF
2025-04-05 - Block71 Event - The Landscape of GenAI and Ecosystem.pdf
Ivan Tang
 
PDF
09-18-2024 NYC Meetup Vector Databases 102
Timothy Spann
 
PDF
Multimodal Retrieval-Augmented Generation (RAG) with Vector Database
Zilliz
 
PDF
09-12-2024 - Milvus, Vector database used for Sensor Data RAG
Timothy Spann
 
PDF
Introduction to Large Language Model Customization.pdf
Zilliz
 
PDF
NYCMeetup07-25-2024-Unstructured Data Processing From Cloud to Edge
Timothy Spann
 
PDF
Scaling Vector Search: How Milvus Handles Billions+
Zilliz
 
PDF
2024-OCT-23 NYC Meetup - Unstructured Data Meetup - Unstructured Halloween
Timothy Spann
 
PDF
06-20-2024-AI Camp Meetup-Unstructured Data and Vector Databases
Timothy Spann
 
PDF
Milvus: Scaling Vector Data Solutions for Gen AI
Zilliz
 
PDF
Advanced Retrieval Augmented Generation Techniques
Zilliz
 
PDF
06-18-2024-Princeton Meetup-Introduction to Milvus
Timothy Spann
 
PDF
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
PDF
Chunking, Embeddings, and Vector Databases
Zilliz
 
PDF
GraphRAG Agents with Neo4j, Milvus and GPT4
Zilliz
 
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
Zilliz
 
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Zilliz
 
09-19-2024 AI Camp Hybrid Seach - Milvus for Vector Database
Timothy Spann
 
09-25-2024 NJX Venture Summit Introduction to Unstructured Data
Timothy Spann
 
Open Source Milvus Vector Database v 2.6
Zilliz
 
2025-04-05 - Block71 Event - The Landscape of GenAI and Ecosystem.pdf
Ivan Tang
 
09-18-2024 NYC Meetup Vector Databases 102
Timothy Spann
 
Multimodal Retrieval-Augmented Generation (RAG) with Vector Database
Zilliz
 
09-12-2024 - Milvus, Vector database used for Sensor Data RAG
Timothy Spann
 
Introduction to Large Language Model Customization.pdf
Zilliz
 
NYCMeetup07-25-2024-Unstructured Data Processing From Cloud to Edge
Timothy Spann
 
Scaling Vector Search: How Milvus Handles Billions+
Zilliz
 
2024-OCT-23 NYC Meetup - Unstructured Data Meetup - Unstructured Halloween
Timothy Spann
 
06-20-2024-AI Camp Meetup-Unstructured Data and Vector Databases
Timothy Spann
 
Milvus: Scaling Vector Data Solutions for Gen AI
Zilliz
 
Advanced Retrieval Augmented Generation Techniques
Zilliz
 
06-18-2024-Princeton Meetup-Introduction to Milvus
Timothy Spann
 
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
Chunking, Embeddings, and Vector Databases
Zilliz
 
GraphRAG Agents with Neo4j, Milvus and GPT4
Zilliz
 
Ad

More from Timothy Spann (20)

PDF
14May2025_TSPANN_FromAirQualityUnstructuredData.pdf
Timothy Spann
 
PDF
Streaming AI Pipelines with Apache NiFi and Snowflake NYC 2025
Timothy Spann
 
PDF
2025-03-03-Philly-AAAI-GoodData-Build Secure RAG Apps With Open LLM
Timothy Spann
 
PDF
Conf42_IoT_Dec2024_Building IoT Applications With Open Source
Timothy Spann
 
PDF
2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight
Timothy Spann
 
PDF
2024Nov20-BigDataEU-RealTimeAIWithOpenSource
Timothy Spann
 
PDF
TSPANN-2024-Nov-CloudX-Adding Generative AI to Real-Time Streaming Pipelines
Timothy Spann
 
PDF
2024-Nov-BuildStuff-Adding Generative AI to Real-Time Streaming Pipelines
Timothy Spann
 
PDF
14 November 2024 - Conf 42 - Prompt Engineering - Codeless Generative AI Pipe...
Timothy Spann
 
PDF
2024 Nov 05 - Linux Foundation TAC TALK With Milvus
Timothy Spann
 
PPTX
tspann06-NOV-2024_AI-Alliance_NYC_ intro to Data Prep Kit and Open Source RAG
Timothy Spann
 
PDF
DBTA Round Table with Zilliz and Airbyte - Unstructured Data Engineering
Timothy Spann
 
PDF
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
Timothy Spann
 
PDF
2024-10-04 - Grace Hopper Celebration Open Source Day - Stefan
Timothy Spann
 
PDF
01-Oct-2024_PES-VectorDatabasesAndAI.pdf
Timothy Spann
 
PDF
09-26-2024 Conf 42 Kube Native: Unleashing the Potential of Cloud Native Open...
Timothy Spann
 
PDF
08-15-2024 - AI Camp Meetup - Human Pose Estimation in Real-Time Utilizing Ed...
Timothy Spann
 
PDF
Unstructured Data Meetup - NYC - Qarbine - Milvus 13-Aug-2024
Timothy Spann
 
PDF
Milvus Vector Database: Integrating Semantic Search Capabilities with .NET an...
Timothy Spann
 
PDF
Unstructured Data Processing from Cloud to Edge Webinar
Timothy Spann
 
14May2025_TSPANN_FromAirQualityUnstructuredData.pdf
Timothy Spann
 
Streaming AI Pipelines with Apache NiFi and Snowflake NYC 2025
Timothy Spann
 
2025-03-03-Philly-AAAI-GoodData-Build Secure RAG Apps With Open LLM
Timothy Spann
 
Conf42_IoT_Dec2024_Building IoT Applications With Open Source
Timothy Spann
 
2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight
Timothy Spann
 
2024Nov20-BigDataEU-RealTimeAIWithOpenSource
Timothy Spann
 
TSPANN-2024-Nov-CloudX-Adding Generative AI to Real-Time Streaming Pipelines
Timothy Spann
 
2024-Nov-BuildStuff-Adding Generative AI to Real-Time Streaming Pipelines
Timothy Spann
 
14 November 2024 - Conf 42 - Prompt Engineering - Codeless Generative AI Pipe...
Timothy Spann
 
2024 Nov 05 - Linux Foundation TAC TALK With Milvus
Timothy Spann
 
tspann06-NOV-2024_AI-Alliance_NYC_ intro to Data Prep Kit and Open Source RAG
Timothy Spann
 
DBTA Round Table with Zilliz and Airbyte - Unstructured Data Engineering
Timothy Spann
 
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
Timothy Spann
 
2024-10-04 - Grace Hopper Celebration Open Source Day - Stefan
Timothy Spann
 
01-Oct-2024_PES-VectorDatabasesAndAI.pdf
Timothy Spann
 
09-26-2024 Conf 42 Kube Native: Unleashing the Potential of Cloud Native Open...
Timothy Spann
 
08-15-2024 - AI Camp Meetup - Human Pose Estimation in Real-Time Utilizing Ed...
Timothy Spann
 
Unstructured Data Meetup - NYC - Qarbine - Milvus 13-Aug-2024
Timothy Spann
 
Milvus Vector Database: Integrating Semantic Search Capabilities with .NET an...
Timothy Spann
 
Unstructured Data Processing from Cloud to Edge Webinar
Timothy Spann
 
Ad

Recently uploaded (20)

PDF
TIC ACTIVIDAD 1geeeeeeeeeeeeeeeeeeeeeeeeeeeeeer3.pdf
Thais Ruiz
 
PDF
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
PDF
Mastering Financial Analysis Materials.pdf
SalamiAbdullahi
 
PDF
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
PPTX
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
PPTX
Pipeline Automatic Leak Detection for Water Distribution Systems
Sione Palu
 
PPTX
Databricks-DE-Associate Certification Questions-june-2024.pptx
pedelli41
 
PPTX
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 
PPTX
Introduction to Biostatistics Presentation.pptx
AtemJoshua
 
PDF
blockchain123456789012345678901234567890
tanvikhunt1003
 
PPTX
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
PDF
Technical Writing Module-I Complete Notes.pdf
VedprakashArya13
 
PPTX
INFO8116 - Week 10 - Slides.pptx big data architecture
guddipatel10
 
PPTX
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
PPT
Grade 5 PPT_Science_Q2_W6_Methods of reproduction.ppt
AaronBaluyut
 
PPTX
White Blue Simple Modern Enhancing Sales Strategy Presentation_20250724_21093...
RamNeymarjr
 
PPTX
Blue and Dark Blue Modern Technology Presentation.pptx
ap177979
 
PPTX
Fuzzy_Membership_Functions_Presentation.pptx
pythoncrazy2024
 
PDF
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
PDF
Classifcation using Machine Learning and deep learning
bhaveshagrawal35
 
TIC ACTIVIDAD 1geeeeeeeeeeeeeeeeeeeeeeeeeeeeeer3.pdf
Thais Ruiz
 
The_Future_of_Data_Analytics_by_CA_Suvidha_Chaplot_UPDATED.pdf
CA Suvidha Chaplot
 
Mastering Financial Analysis Materials.pdf
SalamiAbdullahi
 
WISE main accomplishments for ISQOLS award July 2025.pdf
StatsCommunications
 
short term project on AI Driven Data Analytics
JMJCollegeComputerde
 
Pipeline Automatic Leak Detection for Water Distribution Systems
Sione Palu
 
Databricks-DE-Associate Certification Questions-june-2024.pptx
pedelli41
 
World-population.pptx fire bunberbpeople
umutunsalnsl4402
 
Introduction to Biostatistics Presentation.pptx
AtemJoshua
 
blockchain123456789012345678901234567890
tanvikhunt1003
 
lecture 13 mind test academy it skills.pptx
ggesjmrasoolpark
 
Technical Writing Module-I Complete Notes.pdf
VedprakashArya13
 
INFO8116 - Week 10 - Slides.pptx big data architecture
guddipatel10
 
Future_of_AI_Presentation for everyone.pptx
boranamanju07
 
Grade 5 PPT_Science_Q2_W6_Methods of reproduction.ppt
AaronBaluyut
 
White Blue Simple Modern Enhancing Sales Strategy Presentation_20250724_21093...
RamNeymarjr
 
Blue and Dark Blue Modern Technology Presentation.pptx
ap177979
 
Fuzzy_Membership_Functions_Presentation.pptx
pythoncrazy2024
 
717629748-Databricks-Certified-Data-Engineer-Professional-Dumps-by-Ball-21-03...
pedelli41
 
Classifcation using Machine Learning and deep learning
bhaveshagrawal35
 

17-October-2024 NYC AI Camp - Step-by-Step RAG 101

  • 1. 1 | © Copyright 2024 Zilliz 1 Step-by-Step - RAG 101 Tim Spann @ Zilliz Principal Developer Advocate
  • 2. 2 | © Copyright 2024 Zilliz 2 Slides XX
  • 3. 3 | © Copyright 2024 Zilliz 3 3 | © Copyright 10/22/23 Zilliz 3 | © Copyright 2024 Zilliz Tim Spann Principal Developer Advocate, Zilliz [email protected] https://www.linkedin.com/in/timothyspann/ https://x.com/PaaSDev
  • 4. 4 | © Copyright 2024 Zilliz 4 Agenda About Zilliz, Milvus and Vector Databases 01 Level 100  Basic RAG References 03 02
  • 5. 5 | © Copyright 2024 Zilliz 5 | © Copyright 9/25/23 Zilliz 5 VECTOR DATABASE
  • 6. 6 | © Copyright 2024 Zilliz 6 | © Copyright 9/25/23 Zilliz 6 The Forrester Wave™ Vector Database Providers, Q3 2024 Zilliz is the right partner for your Vector Database needs.
  • 7. 7 | © Copyright 2024 Zilliz 7 | © Copyright 9/25/23 Zilliz 7 Milvus is an Open-Source Vector Database to store, index, manage, and use the massive number of embedding vectors generated by deep neural networks and LLMs. contributors 283 stars 30K docker pulls 67M forks 2.8K + Milvus: The most widely-adopted vector database
  • 8. 8 | © Copyright Zilliz 8 Easy Open RAG Stack Highlighted Framework Hardware Infrastructure Embedding Models LLMs Software Infrastructure Vector Database
  • 9. 9 | © Copyright Zilliz 9
  • 10. 10 | © Copyright Zilliz 10 Image from Nvidia Vector Search Overview
  • 11. Vn, 1 … … … 1 2 3 4 5 Transform into Vectors Unstructured Data Images User Generated Content Video Documents Audio Vector Embeddings Perform Approximate Nearest Neighbor Similarity Search Perform Query Get Results Store in Vector Database How Similarity Search Works
  • 12. 12 | © Copyright 2024 Zilliz 12 | © Copyright 9/25/23 Zilliz 12 RAG Retrieval Augmented Generation)
  • 13. 13 | © Copyright 2024 Zilliz 13 Basic Idea Use RAG to force the LLM to work with your data by injecting it via a vector database like Milvus
  • 14. 14 | © Copyright 2024 Zilliz 14 Basic RAG Architecture
  • 15. 1 5 Retrieval-Augmented Generation (RAG) 2024 A technique that combines the strength of retrieval-based and generative models: ● Improve accuracy and relevance ● Eliminate hallucination ● Provide domain-specific knowledge
  • 16. 1 6 RAG : an economic perspective 2024 A business model that bridges public data and private data ● Data sovereignty ● You can't and shouldn't give your private data to others
  • 17. 17 | © Copyright 2024 Zilliz 17 | © Copyright 9/25/23 Zilliz 17 Your Turn Time To Build a Simple RAG App
  • 18. 18 | © Copyright 2024 Zilliz 18 Documents From Here https://bit.ly/3U7Okgu
  • 19. 19 | © Copyright 2024 Zilliz 19 Notebook From Here https://bit.ly/3BV4IKX
  • 20. 20 | © Copyright Zilliz 20
  • 21. 21 | © Copyright Zilliz 21
  • 22. 22 | © Copyright Zilliz 22 Python SDK Connect…
  • 23. 23 | © Copyright Zilliz 23 Choose Your Embedding Function
  • 24. 24 | © Copyright Zilliz 24 Vector Embedding
  • 25. 25 | © Copyright Zilliz 25
  • 26. 26 | © Copyright Zilliz 26
  • 27. 27 | © Copyright Zilliz 27
  • 28. 28 | © Copyright Zilliz 28
  • 29. 29 | © Copyright Zilliz 29
  • 30. 30 | © Copyright Zilliz 30
  • 31. 31 | © Copyright Zilliz 31
  • 32. 32 | © Copyright Zilliz 32
  • 33. 33 | © Copyright 2024 Zilliz 33 | © Copyright 9/25/23 Zilliz 33 Advanced RAG
  • 34. 34 | © Copyright Zilliz 34 https://bit.ly/4eFdMlK https://bit.ly/3BLeLCx
  • 35. 35 | © Copyright Zilliz 35 https://bit.ly/3zXW8dX https://bit.ly/3NuK5ru
  • 36. 36 | © Copyright Zilliz 36 https://bit.ly/4gZ4Lpn Metadata Filtering Hybrid Search Agents https://bit.ly/3UbqUqx https://bit.ly/3YpKd1K Smart Chunking Embedding Model Choice
  • 37. 37 | © Copyright 2024 Zilliz 37 | © Copyright 9/25/23 Zilliz 37 Resources
  • 38. 38 | © Copyright Zilliz 38 Wednesday, October 23, 2024 5:30 PM to 8:30 PM EDT NYC https://bit.ly/403JIvy Thursday, October 24, 2024 6:30 PM to 8:00 PM EDT Princeton https://bit.ly/3NopXY2 November 5-7, 10-12, 2024: CloudX https://www.developerweek.com/cloudx/ November 15, 2024: Build Stuff https://www.developerweek.com/cloudx/speakers/ November 19, 2024: Xtreme Python https://xtremepython.dev/2024/ November 21, 2024: bigdata Conference November 21, 2024: Unstructured Data NYC
  • 39. 39 | © Copyright Zilliz 39 Vector Database Resources Give Milvus a Star! Chat with me on Discord! https://github.com/milvus-io/milvus
  • 40. 40 Unstructured Data Meetup https://www.meetup.com/unstructured-data-meetup-new-york/ This meetup is for people working in unstructured data. Speakers will come present about related topics such as vector databases, LLMs, and managing data at scale. The intended audience of this group includes roles like machine learning engineers, data scientists, data engineers, software engineers, and PMs. This meetup was formerly Milvus Meetup, and is sponsored by Zilliz maintainers of Milvus.
  • 43. 43 | © Copyright 2024 Zilliz 43 43 This week in Milvus, Towhee, Attu, GPT Cache, Gen AI, LLM, Apache NiFi, Apache Flink, Apache Kafka, ML, AI, Apache Spark, Apache Iceberg, Python, Java, Vector DB and Open Source friends. https://bit.ly/32dAJft https://github.com/milvus-io/milvus AIM Weekly by Tim Spann
  • 44. 44 | © Copyright 2024 Zilliz 44 milvus.io github.com/milvus-io/ @milvusio @paasDev /in/timothyspann Connect with me! Thank you!
  • 45. 45 | © Copyright 2024 Zilliz 45 Milvus Open Source Self-Managed Zilliz Cloud SaaS Fully-Managed github.com/milvus-io/milvus Getting Started with Vector Databases zilliz.com/cloud 29K - Star us on GitHub!
  • 46. 46 | © Copyright 2024 Zilliz 46 Get started for free zilliz.com/cloud
  • 47. 47 | © Copyright 2024 Zilliz 47 | © Copyright 9/25/23 Zilliz 47 Deep-Dive
  • 48. 48 | © Copyright Zilliz 48 Vector Similarity Measures: L2 Euclidean) Queen = [0.3, 0.9] King = [0.5, 0.7] d(Queen, King) = √(0.3-0.5)2 + (0.9-0.7)2 = √(0.2)2 + (0.2)2 = √0.04 + 0.04 = √0.08 ≅ 0.28
  • 49. 49 | © Copyright Zilliz 49 Vector Similarity Measures: Inner Product IP Queen = [0.3, 0.9] King = [0.5, 0.7] Queen · King = (0.3*0.5) + (0.9*0.7) = 0.15 + 0.63 = 0.78
  • 50. 50 | © Copyright Zilliz 50 Queen = [0.3, 0.9] King = [0.5, 0.7] Vector Similarity Measures: Cosine 𝚹 cos(Queen, King) = (0.3*0.5)+(0.9*0.7) √0.32 +0.92 * √0.52 +0.72 = 0.15+0.63 _ √0.9 * √0.74 = 0.78 _ √0.666 ≅ 0.03
  • 51. 51 | © Copyright Zilliz 51 Hybrid Search
  • 52. 52 | © Copyright Zilliz 52