SlideShare a Scribd company logo
1 | © Copyright 2024 Zilliz
1
Codeless Generative AI Pipelines:
Prompt Automation
Tim Spann @ Zilliz
2 | © Copyright 2024 Zilliz
2
2 | © Copyright 10/22/23 Zilliz
2 | © Copyright 2024 Zilliz
Tim Spann
Principal Developer Advocate, Milvus
tim@datainmotion.dev
https://www.linkedin.com/in/timothyspann/
https://x.com/PaaSDev
3 | © Copyright 2024 Zilliz
3
These Slides
X
4 | © Copyright 2024 Zilliz
4
The Challenge of Unstructured Data
● Problem: Unstructured data comes in lots of forms, no easy
way to interact with it all
● Solution: Vector embeddings
● How: Neural networks e.g. embedding models
Vector
Databases
5 | © Copyright 2024 Zilliz
5
Unstructured Data is Everywhere
Unstructured data is any data that does not conform
to a predefined data model.
Currently, 90% of unstructured data is never
analyzed.
Images Videos and
more!
Text
Vn, 1
…
…
…
1
2
3
4
5
Transform into
Vectors
Unstructured Data
Images
User Generated
Content
Video
Documents
Audio
Vector Embeddings
Perform
Approximate
Nearest Neighbor
Similarity Search
Perform Query
Get Results
Store in Vector Database
How Similarity Search Works
7 | © Copyright Zilliz
7
About Milvus
Milvus is an open-source vector database for
GenAI projects. pip install on your laptop, plug into
popular AI dev tools, and push to production with
a single line of code.
29K
GitHub Stars
25M
Downloads
250
Contributors
2,600
Forks
Easy Setup
Pip-install to start coding in a notebook within seconds
Integration
Plug into OpenAI, Langchain, LlmaIndex, and many more
Reusable Code
Write once, and deploy with one line of code into the production
environment
Feature-rich
Dense & sparse embeddings, filtering, reranking and beyond
8 | © Copyright Zilliz
8
pip install pymilvus
Milvus Lite
9
Retrieval-Augmented Generation (RAG)
2024
A technique that combines the
strength of retrieval-based and
generative models:
● Improve accuracy and relevance
● Eliminate hallucination
● Provide domain-specific
knowledge
1
0
RAG : an economic perspective
2024
A business model that bridges public
data and private data
● Data sovereignty
● You can't and shouldn't give your
private data to others
11 | © Copyright Zilliz
11
12 | © Copyright Zilliz
12
DataFlow Pipelines Can Help
External Context Ingest
Ingesting, routing, clean, enrich, transforming,
parsing, chunking and vectorizing structured,
unstructured, semistructured, binary data and
documents
Prompt engineering
Crafting and structuring queries to optimize
LLM responses
Context Retrieval
Enhancing LLM with external context such as
Retrieval Augmented Generation (RAG)
Roundtrip Interface
Act as a Discord, REST, Kafka, SQL, Slack bot to
roundtrip discussions
13 | © Copyright Zilliz
13
https://medium.com/cloudera-inc/getting-ready-for-apache-nifi-2-0-5a5e6a67f450
NiFi 2.0.0 Features
● Python Integration
● Parameters
● JDK 21+
● JSON Flow Serialization
● Rules Engine for Development
Assistance
● Run Process Group as Stateless
● flow.json.gz
https://cwiki.apache.org/confluence/display/NIFI/NiFi+2.0+Release+Goals
14 | © Copyright Zilliz
14
Extract Company Names
● Python 3.10+
● Hugging Face, NLP, SpaCY, PyTorch
https://github.com/tspannhw/FLaNK-python-ExtractCompanyName-processor
15 | © Copyright Zilliz
15
CaptionImage
● Python 3.10+
● Hugging Face
● Salesforce/blip-image-captioning-large
● Generate Captions for Images
● Adds captions to FlowFile Attributes
● Does not require download or copies of
your images
https://github.com/tspannhw/FLaNK-python-processors
16 | © Copyright Zilliz
16
RESNetImageClassification
● Python 3.10+
● Hugging Face
● Transformers
● Pytorch
● Datasets
● microsoft/resnet-50
● Adds classification label to FlowFile
Attributes
● Does not require download or copies of
your images
https://github.com/tspannhw/FLaNK-python-processors
17 | © Copyright Zilliz
17
NSFWImageDetection
● Python 3.10+
● Hugging Face
● Transformers
● Falconsai/nsfw_image_detection
● Adds normal and nsfw to FlowFile
Attributes
● Gives score on safety of image
● Does not require download or copies of
your images
https://github.com/tspannhw/FLaNK-python-processors
18 | © Copyright Zilliz
18
FacialEmotionsImageDetection
● Python 3.10+
● Hugging Face
● Transformers
● facial_emotions_image_detection
● Image Classification
● Adds labels/scores to FlowFile Attributes
● Does not require download or copies of
your images
https://github.com/tspannhw/FLaNK-python-processors
19 | © Copyright Zilliz
19
Letʼs do a metamorphosis on your data. Donʼt fear changing data.
You don’t need to be a brilliant writer to stream
data.
Franz Kafka was a German-speaking
Bohemian novelist and short-story writer,
widely regarded as one of the major figures of
20th-century literature. His work fuses
elements of realism and the fantastic.
Wikipedia
YES, FRANZ, IT’S KAFKA
20 | © Copyright Zilliz
20
20
Streams
Replication
Manager (SRM)
• Event Replication engine for Kafka
• Supports active-active, multi-cluster,
cross DC replication scenarios
• Leverage Kafka Connect for
scalability and HA
• Replicate data and configurations
(ACL, partitioning, new topics, etc)
• Offset translation for simplified
failover
• Integrate replication monitoring with
SMM
21 | © Copyright Zilliz
21
Open Source Edition
•Apache NiFi in
Docker
•Try new features
quickly
•Develop applications
locally
● Docker NiFi
○ docker run --name nifi -p 8443:8443 -d -e
SINGLE_USER_CREDENTIALS_USERNAME=admin -e
SINGLE_USER_CREDENTIALS_PASSWORD=ctsBtRBKHRAx69EqUghv
vgEvjnaLjFEB apache/nifi:latest
● Licensed under the ASF License
● Unsupported
● NiFi 1.28 and NiFi 2.0.0
https://hub.docker.com/r/apache/nifi
22 | © Copyright Zilliz
22
https://medium.com/cloudera-inc/streaming-street-cams-to-yolo-v8-with-python-and-nifi-to-minio-s3-3277e73723ce
Street Cameras
23 | © Copyright Zilliz
23 | © Copyright Zilliz
23
RESOURCES
24 | © Copyright Zilliz
24
Vector Database Resources
Give Milvus a Star! Chat with me on Discord!
https://github.com/milvus-io/milvus
https://medium.com/@tspann/unstructured-street-data-in-new-york-8d3cde0a1e5b
https://medium.com/@tspann/not-every-field-is-just-text-numbers-or-vectors-976231e90e4d
https://medium.com/@tspann/unstructured-data-processing-with-a-raspberry-pi-ai-kit-c959dd7fff47
Raspberry Pi AI Kit Hailo
Edge AI
28 | © Copyright 2024 Zilliz
28
28
This week in Milvus, Towhee, Attu, GPT
Cache, Gen AI, LLM, Apache NiFi, Apache
Flink, Apache Kafka, ML, AI, Apache Spark,
Apache Iceberg, Python, Java, Vector DB
and Open Source friends.
https://bit.ly/32dAJft
https://github.com/milvus-io/milvus
AIM Weekly by Tim Spann

More Related Content

Similar to 14 November 2024 - Conf 42 - Prompt Engineering - Codeless Generative AI Pipelines_ Prompt Automation (20)

PDF
GraphRAG Agents with Neo4j, Milvus and GPT4
Zilliz
 
PDF
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Zilliz
 
PDF
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
PDF
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
Timothy Spann
 
PDF
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
Timothy Spann
 
PDF
2024 Feb AI Meetup NYC GenAI_LLMs_ML_Data Codeless Generative AI Pipelines
Timothy Spann
 
PDF
09-12-2024 - Milvus, Vector database used for Sensor Data RAG
Timothy Spann
 
PDF
Multi-agent Systems with Mistral AI, Milvus and Llama-agents
Zilliz
 
PPTX
10 Big Data Technologies you Didn't Know About
Jesus Rodriguez
 
PDF
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Zilliz
 
PDF
Advanced Retrieval Augmented Generation Techniques
Zilliz
 
PDF
Building an Agentic RAG locally with Ollama and Milvus
Zilliz
 
PDF
Multi-agent Systems with Mistral AI, Milvus and Llama-agents
Zilliz
 
PDF
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
Timothy Spann
 
PDF
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
Zilliz
 
PDF
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
Zilliz
 
PDF
Using LLM Agents with Llama 3.2, LangGraph and Milvus
Zilliz
 
PDF
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
Timothy Spann
 
PDF
14May2025_TSPANN_FromAirQualityUnstructuredData.pdf
Timothy Spann
 
PPTX
Devclub.lv - Introduction to stream processing
Nicolas Fränkel
 
GraphRAG Agents with Neo4j, Milvus and GPT4
Zilliz
 
Building RAG with self-deployed Milvus vector database and Snowpark Container...
Zilliz
 
Building Production Ready Search Pipelines with Spark and Milvus
Zilliz
 
April 2024 - NLIT Cloudera Real-Time LLM Streaming 2024
Timothy Spann
 
06-12-2024-BudapestDataForum-BuildingReal-timePipelineswithFLaNK AIM
Timothy Spann
 
2024 Feb AI Meetup NYC GenAI_LLMs_ML_Data Codeless Generative AI Pipelines
Timothy Spann
 
09-12-2024 - Milvus, Vector database used for Sensor Data RAG
Timothy Spann
 
Multi-agent Systems with Mistral AI, Milvus and Llama-agents
Zilliz
 
10 Big Data Technologies you Didn't Know About
Jesus Rodriguez
 
Tirana Tech Meetup - Agentic RAG with Milvus, Llama3 and Ollama
Zilliz
 
Advanced Retrieval Augmented Generation Techniques
Zilliz
 
Building an Agentic RAG locally with Ollama and Milvus
Zilliz
 
Multi-agent Systems with Mistral AI, Milvus and Llama-agents
Zilliz
 
2024-10-28 All Things Open - Advanced Retrieval Augmented Generation (RAG) Te...
Timothy Spann
 
Supercharge Spark: Unleashing Big Data Potential with Milvus for RAG systems
Zilliz
 
Agentic AI in Action: Real-Time Vision, Memory & Autonomy with Browser Use & ...
Zilliz
 
Using LLM Agents with Llama 3.2, LangGraph and Milvus
Zilliz
 
10-25-2024_BITS_NYC_Unstructured Data and LLM_ What, Why and How
Timothy Spann
 
14May2025_TSPANN_FromAirQualityUnstructuredData.pdf
Timothy Spann
 
Devclub.lv - Introduction to stream processing
Nicolas Fränkel
 

More from Timothy Spann (20)

PDF
Streaming AI Pipelines with Apache NiFi and Snowflake NYC 2025
Timothy Spann
 
PDF
Conf42_IoT_Dec2024_Building IoT Applications With Open Source
Timothy Spann
 
PDF
2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight
Timothy Spann
 
PDF
2024Nov20-BigDataEU-RealTimeAIWithOpenSource
Timothy Spann
 
PDF
2024 Nov 05 - Linux Foundation TAC TALK With Milvus
Timothy Spann
 
PDF
tspann08-Nov-2024_PyDataNYC_Unstructured Data Processing with a Raspberry Pi ...
Timothy Spann
 
PDF
2024-OCT-23 NYC Meetup - Unstructured Data Meetup - Unstructured Halloween
Timothy Spann
 
PDF
DBTA Round Table with Zilliz and Airbyte - Unstructured Data Engineering
Timothy Spann
 
PDF
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
Timothy Spann
 
PDF
2024-10-04 - Grace Hopper Celebration Open Source Day - Stefan
Timothy Spann
 
PDF
01-Oct-2024_PES-VectorDatabasesAndAI.pdf
Timothy Spann
 
PDF
09-25-2024 NJX Venture Summit Introduction to Unstructured Data
Timothy Spann
 
PDF
09-19-2024 AI Camp Hybrid Seach - Milvus for Vector Database
Timothy Spann
 
PDF
09-18-2024 NYC Meetup Vector Databases 102
Timothy Spann
 
PDF
09-26-2024 Conf 42 Kube Native: Unleashing the Potential of Cloud Native Open...
Timothy Spann
 
PDF
09-03-2024_UnstructuredDataAndAIDiscussion.pdf
Timothy Spann
 
PDF
08-15-2024 - AI Camp Meetup - Human Pose Estimation in Real-Time Utilizing Ed...
Timothy Spann
 
PDF
Unstructured Data Meetup - NYC - Qarbine - Milvus 13-Aug-2024
Timothy Spann
 
PDF
08-13-2024 NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)
Timothy Spann
 
PDF
Milvus Vector Database: Integrating Semantic Search Capabilities with .NET an...
Timothy Spann
 
Streaming AI Pipelines with Apache NiFi and Snowflake NYC 2025
Timothy Spann
 
Conf42_IoT_Dec2024_Building IoT Applications With Open Source
Timothy Spann
 
2024 Dec 05 - PyData Global - Tutorial Its In The Air Tonight
Timothy Spann
 
2024Nov20-BigDataEU-RealTimeAIWithOpenSource
Timothy Spann
 
2024 Nov 05 - Linux Foundation TAC TALK With Milvus
Timothy Spann
 
tspann08-Nov-2024_PyDataNYC_Unstructured Data Processing with a Raspberry Pi ...
Timothy Spann
 
2024-OCT-23 NYC Meetup - Unstructured Data Meetup - Unstructured Halloween
Timothy Spann
 
DBTA Round Table with Zilliz and Airbyte - Unstructured Data Engineering
Timothy Spann
 
11-OCT-2024_AI_101_CryptoOracle_UnstructuredData
Timothy Spann
 
2024-10-04 - Grace Hopper Celebration Open Source Day - Stefan
Timothy Spann
 
01-Oct-2024_PES-VectorDatabasesAndAI.pdf
Timothy Spann
 
09-25-2024 NJX Venture Summit Introduction to Unstructured Data
Timothy Spann
 
09-19-2024 AI Camp Hybrid Seach - Milvus for Vector Database
Timothy Spann
 
09-18-2024 NYC Meetup Vector Databases 102
Timothy Spann
 
09-26-2024 Conf 42 Kube Native: Unleashing the Potential of Cloud Native Open...
Timothy Spann
 
09-03-2024_UnstructuredDataAndAIDiscussion.pdf
Timothy Spann
 
08-15-2024 - AI Camp Meetup - Human Pose Estimation in Real-Time Utilizing Ed...
Timothy Spann
 
Unstructured Data Meetup - NYC - Qarbine - Milvus 13-Aug-2024
Timothy Spann
 
08-13-2024 NYC Meetup Unstructured Data Processing From Cloud to Edge (Milvus)
Timothy Spann
 
Milvus Vector Database: Integrating Semantic Search Capabilities with .NET an...
Timothy Spann
 
Ad

Recently uploaded (20)

PDF
Copia de Strategic Roadmap Infographics by Slidesgo.pptx (1).pdf
ssuserd4c6911
 
PPTX
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
PPTX
AI Presentation Tool Pitch Deck Presentation.pptx
ShyamPanthavoor1
 
PPTX
ER_Model_Relationship_in_DBMS_Presentation.pptx
dharaadhvaryu1992
 
PPTX
Aict presentation on dpplppp sjdhfh.pptx
vabaso5932
 
PDF
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
PDF
Merits and Demerits of DBMS over File System & 3-Tier Architecture in DBMS
MD RIZWAN MOLLA
 
PPTX
The _Operations_on_Functions_Addition subtruction Multiplication and Division...
mdregaspi24
 
PDF
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
PDF
Early_Diabetes_Detection_using_Machine_L.pdf
maria879693
 
PDF
Choosing the Right Database for Indexing.pdf
Tamanna
 
PDF
apidays Helsinki & North 2025 - How (not) to run a Graphql Stewardship Group,...
apidays
 
PDF
Product Management in HealthTech (Case Studies from SnappDoctor)
Hamed Shams
 
PPTX
Climate Action.pptx action plan for climate
justfortalabat
 
PDF
apidays Helsinki & North 2025 - API-Powered Journeys: Mobility in an API-Driv...
apidays
 
PDF
Context Engineering for AI Agents, approaches, memories.pdf
Tamanna
 
PPT
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
PDF
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
PPTX
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
PDF
Web Scraping with Google Gemini 2.0 .pdf
Tamanna
 
Copia de Strategic Roadmap Infographics by Slidesgo.pptx (1).pdf
ssuserd4c6911
 
Module-5-Measures-of-Central-Tendency-Grouped-Data-1.pptx
lacsonjhoma0407
 
AI Presentation Tool Pitch Deck Presentation.pptx
ShyamPanthavoor1
 
ER_Model_Relationship_in_DBMS_Presentation.pptx
dharaadhvaryu1992
 
Aict presentation on dpplppp sjdhfh.pptx
vabaso5932
 
apidays Helsinki & North 2025 - REST in Peace? Hunting the Dominant Design fo...
apidays
 
Merits and Demerits of DBMS over File System & 3-Tier Architecture in DBMS
MD RIZWAN MOLLA
 
The _Operations_on_Functions_Addition subtruction Multiplication and Division...
mdregaspi24
 
Building Production-Ready AI Agents with LangGraph.pdf
Tamanna
 
Early_Diabetes_Detection_using_Machine_L.pdf
maria879693
 
Choosing the Right Database for Indexing.pdf
Tamanna
 
apidays Helsinki & North 2025 - How (not) to run a Graphql Stewardship Group,...
apidays
 
Product Management in HealthTech (Case Studies from SnappDoctor)
Hamed Shams
 
Climate Action.pptx action plan for climate
justfortalabat
 
apidays Helsinki & North 2025 - API-Powered Journeys: Mobility in an API-Driv...
apidays
 
Context Engineering for AI Agents, approaches, memories.pdf
Tamanna
 
Growth of Public Expendituuure_55423.ppt
NavyaDeora
 
OPPOTUS - Malaysias on Malaysia 1Q2025.pdf
Oppotus
 
apidays Munich 2025 - Building Telco-Aware Apps with Open Gateway APIs, Subhr...
apidays
 
Web Scraping with Google Gemini 2.0 .pdf
Tamanna
 
Ad

14 November 2024 - Conf 42 - Prompt Engineering - Codeless Generative AI Pipelines_ Prompt Automation

  • 1. 1 | © Copyright 2024 Zilliz 1 Codeless Generative AI Pipelines: Prompt Automation Tim Spann @ Zilliz
  • 2. 2 | © Copyright 2024 Zilliz 2 2 | © Copyright 10/22/23 Zilliz 2 | © Copyright 2024 Zilliz Tim Spann Principal Developer Advocate, Milvus [email protected] https://www.linkedin.com/in/timothyspann/ https://x.com/PaaSDev
  • 3. 3 | © Copyright 2024 Zilliz 3 These Slides X
  • 4. 4 | © Copyright 2024 Zilliz 4 The Challenge of Unstructured Data ● Problem: Unstructured data comes in lots of forms, no easy way to interact with it all ● Solution: Vector embeddings ● How: Neural networks e.g. embedding models Vector Databases
  • 5. 5 | © Copyright 2024 Zilliz 5 Unstructured Data is Everywhere Unstructured data is any data that does not conform to a predefined data model. Currently, 90% of unstructured data is never analyzed. Images Videos and more! Text
  • 6. Vn, 1 … … … 1 2 3 4 5 Transform into Vectors Unstructured Data Images User Generated Content Video Documents Audio Vector Embeddings Perform Approximate Nearest Neighbor Similarity Search Perform Query Get Results Store in Vector Database How Similarity Search Works
  • 7. 7 | © Copyright Zilliz 7 About Milvus Milvus is an open-source vector database for GenAI projects. pip install on your laptop, plug into popular AI dev tools, and push to production with a single line of code. 29K GitHub Stars 25M Downloads 250 Contributors 2,600 Forks Easy Setup Pip-install to start coding in a notebook within seconds Integration Plug into OpenAI, Langchain, LlmaIndex, and many more Reusable Code Write once, and deploy with one line of code into the production environment Feature-rich Dense & sparse embeddings, filtering, reranking and beyond
  • 8. 8 | © Copyright Zilliz 8 pip install pymilvus Milvus Lite
  • 9. 9 Retrieval-Augmented Generation (RAG) 2024 A technique that combines the strength of retrieval-based and generative models: ● Improve accuracy and relevance ● Eliminate hallucination ● Provide domain-specific knowledge
  • 10. 1 0 RAG : an economic perspective 2024 A business model that bridges public data and private data ● Data sovereignty ● You can't and shouldn't give your private data to others
  • 11. 11 | © Copyright Zilliz 11
  • 12. 12 | © Copyright Zilliz 12 DataFlow Pipelines Can Help External Context Ingest Ingesting, routing, clean, enrich, transforming, parsing, chunking and vectorizing structured, unstructured, semistructured, binary data and documents Prompt engineering Crafting and structuring queries to optimize LLM responses Context Retrieval Enhancing LLM with external context such as Retrieval Augmented Generation (RAG) Roundtrip Interface Act as a Discord, REST, Kafka, SQL, Slack bot to roundtrip discussions
  • 13. 13 | © Copyright Zilliz 13 https://medium.com/cloudera-inc/getting-ready-for-apache-nifi-2-0-5a5e6a67f450 NiFi 2.0.0 Features ● Python Integration ● Parameters ● JDK 21+ ● JSON Flow Serialization ● Rules Engine for Development Assistance ● Run Process Group as Stateless ● flow.json.gz https://cwiki.apache.org/confluence/display/NIFI/NiFi+2.0+Release+Goals
  • 14. 14 | © Copyright Zilliz 14 Extract Company Names ● Python 3.10+ ● Hugging Face, NLP, SpaCY, PyTorch https://github.com/tspannhw/FLaNK-python-ExtractCompanyName-processor
  • 15. 15 | © Copyright Zilliz 15 CaptionImage ● Python 3.10+ ● Hugging Face ● Salesforce/blip-image-captioning-large ● Generate Captions for Images ● Adds captions to FlowFile Attributes ● Does not require download or copies of your images https://github.com/tspannhw/FLaNK-python-processors
  • 16. 16 | © Copyright Zilliz 16 RESNetImageClassification ● Python 3.10+ ● Hugging Face ● Transformers ● Pytorch ● Datasets ● microsoft/resnet-50 ● Adds classification label to FlowFile Attributes ● Does not require download or copies of your images https://github.com/tspannhw/FLaNK-python-processors
  • 17. 17 | © Copyright Zilliz 17 NSFWImageDetection ● Python 3.10+ ● Hugging Face ● Transformers ● Falconsai/nsfw_image_detection ● Adds normal and nsfw to FlowFile Attributes ● Gives score on safety of image ● Does not require download or copies of your images https://github.com/tspannhw/FLaNK-python-processors
  • 18. 18 | © Copyright Zilliz 18 FacialEmotionsImageDetection ● Python 3.10+ ● Hugging Face ● Transformers ● facial_emotions_image_detection ● Image Classification ● Adds labels/scores to FlowFile Attributes ● Does not require download or copies of your images https://github.com/tspannhw/FLaNK-python-processors
  • 19. 19 | © Copyright Zilliz 19 Letʼs do a metamorphosis on your data. Donʼt fear changing data. You don’t need to be a brilliant writer to stream data. Franz Kafka was a German-speaking Bohemian novelist and short-story writer, widely regarded as one of the major figures of 20th-century literature. His work fuses elements of realism and the fantastic. Wikipedia YES, FRANZ, IT’S KAFKA
  • 20. 20 | © Copyright Zilliz 20 20 Streams Replication Manager (SRM) • Event Replication engine for Kafka • Supports active-active, multi-cluster, cross DC replication scenarios • Leverage Kafka Connect for scalability and HA • Replicate data and configurations (ACL, partitioning, new topics, etc) • Offset translation for simplified failover • Integrate replication monitoring with SMM
  • 21. 21 | © Copyright Zilliz 21 Open Source Edition •Apache NiFi in Docker •Try new features quickly •Develop applications locally ● Docker NiFi ○ docker run --name nifi -p 8443:8443 -d -e SINGLE_USER_CREDENTIALS_USERNAME=admin -e SINGLE_USER_CREDENTIALS_PASSWORD=ctsBtRBKHRAx69EqUghv vgEvjnaLjFEB apache/nifi:latest ● Licensed under the ASF License ● Unsupported ● NiFi 1.28 and NiFi 2.0.0 https://hub.docker.com/r/apache/nifi
  • 22. 22 | © Copyright Zilliz 22 https://medium.com/cloudera-inc/streaming-street-cams-to-yolo-v8-with-python-and-nifi-to-minio-s3-3277e73723ce Street Cameras
  • 23. 23 | © Copyright Zilliz 23 | © Copyright Zilliz 23 RESOURCES
  • 24. 24 | © Copyright Zilliz 24 Vector Database Resources Give Milvus a Star! Chat with me on Discord! https://github.com/milvus-io/milvus
  • 28. 28 | © Copyright 2024 Zilliz 28 28 This week in Milvus, Towhee, Attu, GPT Cache, Gen AI, LLM, Apache NiFi, Apache Flink, Apache Kafka, ML, AI, Apache Spark, Apache Iceberg, Python, Java, Vector DB and Open Source friends. https://bit.ly/32dAJft https://github.com/milvus-io/milvus AIM Weekly by Tim Spann