GenAISummit 2024 May 28 Sri Ambati Keynote: AGI Belongs to The Community in Open Source
AGI -
The making of the world’s most powerful AI will be
Open Source and Community led
Like Music, Math is generational wealth of
the human species!
Like Math, AGI will belong to the Community
in Open Source!
Open Source is freedom, not free!
Open Source is the defense of the
community with code
Open Source is the defense of the
community with AI
Data Scientist = 100% data
Business Unit = 85% data, 15% storytelling
Executive = 15% data, 85% storytelling
Board = 99% storytelling
GrandPa = 100% storytelling
Our (LLMs) journey to here
Time is the only non-renewable resource
H2O.ai Confidential
Democratize AI with H2O.ai
8 OF THE TOP 10 BANKS
7 OF THE TOP 10 INSURANCE COMPANIES
4 OF THE TOP 10 MANUFACTURING COMPANIES
222 OF THE FORTUNE 500
CUSTOMER OBSESSION
MAKER CULTURE
Commonwealth Bank, Goldman Sachs,
Wells Fargo, NVIDIA, Capital One, Nexus
Ventures, New York Life
2M
Community
30
Kaggle Grandmasters
FOUNDED IN 2012, SILICON VALLEY
$256M FUNDRAISED, SUSTAINABLE
H2O products are backed by 10% of the World’s Data Science
Grandmasters and a Team of Experts who are relentless in
solving critical problems.
KAGGLE GRANDMASTERS World’s Best Data Scientists
2 Quadruple GMs
1 Triple GM
7 Double GMs
#1 KGM
#3 KGM
5 Top 10 Globally
10 Kaggle Masters
H2O.ai KGM HIGHLIGHTS
ā—2023 H2O KGMs win 3
healthcare-related
competitions
ā—July 2023 H2O KGMs make ā€œTop
GenAI Scientistsā€ list
ā—Oct 2023 H2O paper accepted
at EMNLP 2023
ā—Nov 2023 H2O KGMs win 1st
place in Kaggle Science Exam
competition
ā—Jan 2024 H2O KGM places 2nd
in Detect AI Text competition
ā—Feb 2024 introduced
foundational model H2O-
Danube 1.8b
BC is Before Covid
Every Nation needs to be an AI Nation
Prompt is your intellectual property
Everyone needs their own GPT
Every organization needs to own its GPT
Open source crushed the GPT trademark
Open source crushed GPT trademark, not fully!
https://tsdr.uspto.gov/documentviewer?caseId=sn97733259&docId=FREF20240206125856&linkId=1#docIndex=1&page=1
Democratize AI
Democratize AGI
Gaia Benchmark for General Intelligence agents
OS-Copilot / FRIDAY
Autogen
Coding General Assistants
SWE-Bench
OpenDevin
Eureka
https://gaia-benchmark-leaderboard.hf.space/?__theme=light
dream
AI for Good
AI for Good
AI for Climate Change
AI to Protect WildLife and Pets
AI To Manage Pandemic Supply Chain
AI to Model WildFire Behaviors
AI to Predict Hurricanes
AI for H2O (Water)
AI to Democratize Health
AI to Educate and Upskill our Public Services
Responsible AI and Fairness
AI to predict Hurricanes
AI to fight Climate Change
AI to democratize health
our generation’s revolution.
Co-create
Generative AI
Predictive AI
Generative AI
AWARD-WINNING AUTOML reaches
a new audience with chat & function
calling assistants for ML
Enhanced AI for Documents
Automation for data labeling
Automation of code for AI Apps
CUSTOMGPTS: H2O foundational
models, fine-tuned models, personality, on
top of predictive models
Convergence of World’s best Predictive and Generative AI
GenAI Apps
Enterprise h2oGPTe: Python
APIs and UI for chat, RAG
customGPTs
Function-calling & Agents
Prompt Tuning
Foundational Models
document and data labeling
on-premise, air-gapped,
cloud VPC | SaaS
GenAI AppStudio
Structured
Datasets
Unstructured
Datasets
ETL / Prep for LLMs
Documents → QA Pairs
Fine Tuning LLMs
(& Prompts)
End Users
Vector DB (Embeddings)
myGPT
R. A. G.
Talk to your Data
Document QA
Document Chat
Image/Video Chat
LLM Query
GenAI Apps
LLM
Data Studio
AI Engines MLOps
EvalStudio
AI Apps
+ LLMs
Integration
LLMOps
API
Prompt
Studio
Continuous
Feedback
Parsing . Chunking
Indexing . Embeddings
LLM Agents
Enterprise h2oGPTe (RAG)
Make Your
Own Eval
Predictive AI Layer Predictive + Gen AI
h2o-functions
World’s Best AI to do Gen AI
Tabular
Data
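The parsing, chunking, indexing, embedding and retrieval stages named on this slide can be illustrated with a toy retrieval-augmented generation (RAG) loop. This is a minimal sketch under stated assumptions, not the h2oGPTe implementation: the bag-of-words "embedding", the in-memory list standing in for a vector DB, and every function name here are hypothetical stand-ins chosen so the example runs with the standard library only.

```python
import math
import re
from collections import Counter


def chunk(text, size=12, overlap=4):
    """Split text into overlapping windows of `size` words."""
    words = text.split()
    if not words:
        return []
    step = size - overlap
    stop = max(len(words) - overlap, 1)
    return [" ".join(words[i:i + size]) for i in range(0, stop, step)]


def embed(text):
    """Toy embedding: a bag-of-words count vector (assumption: a real
    system would call an embedding model here)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_index(documents):
    """Index every chunk as (text, vector); stands in for a vector DB."""
    return [(c, embed(c)) for doc in documents for c in chunk(doc)]


def retrieve(index, question, k=2):
    """Return the k chunks most similar to the question."""
    q = embed(question)
    ranked = sorted(index, key=lambda item: cosine(q, item[1]), reverse=True)
    return [text for text, _ in ranked[:k]]


def build_prompt(question, contexts):
    """Assemble the grounded prompt that would be sent to an LLM."""
    joined = "\n".join(f"- {c}" for c in contexts)
    return (
        "Answer using only the context below.\n"
        f"Context:\n{joined}\n"
        f"Question: {question}"
    )


if __name__ == "__main__":
    docs = [
        "H2O Danube is a small language model with 1.8 billion parameters.",
        "Wildfires spread faster in dry and windy weather conditions.",
    ]
    index = build_index(docs)
    question = "How many parameters does Danube have?"
    print(build_prompt(question, retrieve(index, question, k=1)))
```

Swapping `embed` for a learned embedding model and the list for a real vector store changes the quality of retrieval, not the shape of the loop.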
Raise a forest, not just a tree.
Make ecosystems
Tech Ecosystem of H2O GenAI
OPEN SOURCE MODELS
VECTOR DATABASES
CLOSED SOURCE MODELS
LLM INFERENCING HARDWARE MODEL
HOSTING
CLOUD PROVIDERS
Algorithms
Design
Data
Domain
Business
Applications
Data is a
Team Sport
Teamwork makes the Dream Work
Thank you, Community.
Gratitude.
Makers, past, present and future!
H2O Movement
Community and Customers!
To all greatness, our greetings
endaro mahanubhavulu
LLM-Based Evaluation
Prompt/Context and
Document Embedding
Dimensionality Reduction
and Clustering
Cluster/Topic
Summarization
Auto Prompt Generation
Prompt Perturbation
• Invariance
• Adversarial
Robustness
• Performance Drop
Stratified Sampling
Weakness Detection
• Weak clusters/topics
Resilience
• Performance under
distribution shift
Interpretability Outcome Analysis: Cluster
Eval
Input-Output
Visualization
Diagnostics
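Two of the checks listed on this slide, prompt perturbation (invariance) and weakness detection over clusters/topics, can be sketched in a few lines. This is an illustrative sketch, not H2O EvalStudio code: `model` is any hypothetical callable mapping a prompt string to an answer string, the perturbations are deliberately simple, and the 0.8 threshold is an arbitrary assumption.

```python
def perturb(prompt):
    """Return simple meaning-preserving variants of a prompt
    (case changes, padding, spacing before the final punctuation)."""
    return [
        prompt.lower(),
        prompt.upper(),
        "  " + prompt + "  ",
        prompt.rstrip("?.!") + " ?",
    ]


def invariance_rate(model, prompt, normalize=str.strip):
    """Fraction of perturbed prompts whose answer matches the answer
    given for the original prompt."""
    baseline = normalize(model(prompt))
    variants = perturb(prompt)
    same = sum(normalize(model(v)) == baseline for v in variants)
    return same / len(variants)


def weak_clusters(scores_by_cluster, threshold=0.8):
    """Flag clusters/topics whose mean evaluation score is below threshold."""
    return sorted(
        name
        for name, scores in scores_by_cluster.items()
        if sum(scores) / len(scores) < threshold
    )


if __name__ == "__main__":
    # Hypothetical stand-in models: one robust to surface changes, one not.
    def robust(p):
        return "4" if "2+2" in p.replace(" ", "") else "unknown"

    def brittle(p):
        return "4" if p == "What is 2+2?" else "unknown"

    print(invariance_rate(robust, "What is 2+2?"))
    print(invariance_rate(brittle, "What is 2+2?"))
    print(weak_clusters({"billing": [0.9, 1.0], "legal": [0.5, 0.7]}))
```

A low invariance rate points at brittleness to surface form, while the weak-cluster list points at topics where targeted prompts or fine-tuning data are most needed.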
Open Source Foundation Models
H2O Danube 2 LLM 1.8B, 2T Tokens, 4K seq length
fp8 precision during training (quantization)
Grouped Query Attention
(Dropped Windowing; Mistral Tokenizer)
Compute: 7,600 GPU hrs (8xH100, 45 days)
Inference is efficient. Embeddable.
Cost efficient! (lot less than you think!)
Ready to pre-train on your data!
Democratizing Foundation LLMs
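The compute figures on this slide can be cross-checked with simple arithmetic. The sketch below assumes that "8xH100" means eight GPUs and that the 45 days are wall-clock time; both are readings of the slide, not confirmed facts. The hourly price in the usage example is a placeholder, not a quoted rate.

```python
def wall_clock_gpu_hours(num_gpus, days):
    """GPU-hours available if every GPU runs for the whole period."""
    return num_gpus * days * 24


def implied_utilization(reported_gpu_hours, num_gpus, days):
    """Share of the available GPU-hours that the reported figure represents."""
    return reported_gpu_hours / wall_clock_gpu_hours(num_gpus, days)


def training_cost(gpu_hours, price_per_gpu_hour):
    """Cost estimate for a given hourly GPU price (price is an input)."""
    return gpu_hours * price_per_gpu_hour


if __name__ == "__main__":
    available = wall_clock_gpu_hours(num_gpus=8, days=45)
    print(available)  # 8640
    print(round(implied_utilization(7600, 8, 45), 2))  # 0.88
    # Placeholder price of 2.0 per GPU-hour, for illustration only.
    print(training_cost(7600, price_per_gpu_hour=2.0))  # 15200.0
```

Under these assumptions, 7,600 GPU hours is about 88% of the 8,640 hours that eight GPUs provide over 45 days, so the two figures on the slide are roughly consistent.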
Using LLM Studio
DPO,
with good chat data
Fine-Tuned for Chat
Guardrail LLMs and Gateway LLMs
LLMs to safeguard an LLM
Costs and Latencies of AI
Training Costs
Inference Cost & latency
vLLM
Data
Labeling,
Synthetic
Feedback
Private
On-prem,
behind your firewall
Sovereign Data
Sovereign AI
PII Detection
Detect patterns of personal identification
https://www.kaggle.com/competitions/pii-detection-removal-from-educational-data/discussion/481135
LLM Generated Content Detection
Easier to detect human generated
Prompt Recovery and Intent discovery
Your own post-trained, fine-tuned content
Post-train and fine-tune LLMs on small hardware
Early SLM (Danube 2) applications
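The "detect patterns of personal identification" item above can be illustrated with a rule-based detector. This is only a sketch: the three patterns and the label names are illustrative assumptions, and learned token classifiers generally handle free text (names, addresses) far better than regular expressions.

```python
import re

# Illustrative patterns only; real PII detection needs broader coverage.
PII_PATTERNS = {
    "EMAIL": re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b"),
    "PHONE": re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b"),
    "URL": re.compile(r"https?://\S+"),
}


def find_pii(text):
    """Return sorted (start, end, label) spans for every pattern match."""
    hits = []
    for label, pattern in PII_PATTERNS.items():
        for match in pattern.finditer(text):
            hits.append((match.start(), match.end(), label))
    return sorted(hits)


def redact(text):
    """Replace each detected span with its label, skipping overlaps."""
    pieces, last = [], 0
    for start, end, label in find_pii(text):
        if start < last:
            continue  # overlaps a span that was already redacted
        pieces.append(text[last:start])
        pieces.append(f"[{label}]")
        last = end
    pieces.append(text[last:])
    return "".join(pieces)


if __name__ == "__main__":
    print(redact("Contact jane.doe@example.com or 555-123-4567."))
    # Contact [EMAIL] or [PHONE].
```

A small fine-tuned model can replace `find_pii` while `redact` stays unchanged, which is one way the small-model applications on this slide could be wired in.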
Open Source h2oGPT
Fine-tuning and RAG of Best-in-class Open Source LLMs
Open Source H2O LLM Studio
Best-in-class Studio to fine-tune LLMs
Enterprise h2oGPTe
Best-in-class RAG and API to power GenAI Apps
H2O EvalStudio
Evaluation-led building of LLM-powered Applications
Open Source H2O EvalGPT
Evaluate LLMs for public and private benchmarks
Converging Predictive AI and Generative AI
H2O Agents
Converging Predictive AI and Generative AI
H2O Document AI
Prompt powered AI for Documents
H2O Label Genie
ETL for GenAI
H2O Document Diff
Document Difference Engine powered by GenAI
Data sanskrit. plural of datum.
to give.
to impact with energy.
mineral rich in energy.
AI will usher in abundance
AI will usher in abundance
abundance of time
abundance of space
outer space, inner space
abundance of matter & energy
With great powers, comes great…
Responsible AI.
BC is Before Covid
Long time ago, there once were -
Virus, Wars and Superstitions
Long time ago, there once were -
Virus, Wars and Fake News
AI can make a difference.
We can make a difference.
busy being born and busy dying.
- Bob Dylan
this will be fun!
Gratitude
purpose needs voice
Make data and AI your first-class assets
Thank You: GenAI Open Source Ecosystem
h2oGPT (Open Source GPTs need RAG)
PyMuPDF
BGE, Instructor embeddings
ragas, evalGPT.ai, hf/leaderboard, llm-eval
hnswlib, Milvus, Chroma
LangChain, LlamaIndex, Open Interpreter
vLLMs, tgi-0.9
HuggingFace, H2O LLMStudio, GGUF, AWQ, Gradio
Alpaca LoRA, llama.cpp, AutoGPTQ, hf/TheBloke
OpenAssistant
Llama 2, Falcon, Mistral, GPT-3, Eleuther, BigScience
PyTorch, DeepSpeed
“Attention is All You Need” team and Transformers
CUDA, ROCm
Python, Go, Rust, K8s, Clouds
# others I might have missed
# @ylecun
# @ Dr. Ebtesam Almazrouei
# @tloen/alpaca-lora @ecjwg
ETL for LLMs
H2O Data Studio
Label Genie powered by LLMs
Hydrogen Torch
Labeling as a Service
Evaluation led LLM Development
evalGPT.ai
ragas, retrieval
Evaluation as a Service
Eval Studio
When Generation is abundant
Curation becomes valuable
AI to curate AI
AI to do AI
customer is the moat.
Co-creation
Gov GenAI App Store
RAG + Gen AI Use cases
Customer Experience, Contact Centers
Document Processing
Generating Marketing
Generating Code and Apps
Procurement, RFPs, Contracts, Inventory
Portfolio Recommendation (Earnings Calls)
Custom GPTs
Meeting Summaries, Tumor Board Summary
Multi-modal, Speech
Document Extraction, ETL for LLMs
Labeling, Translation
Ask Data. BI.
AI for ALL
AI for Sales People
AI for Investors
AI for Manufacturers
AI for Retailers
AI for Hospitals
AI for Transportation
AI for Urban Planning
AI for Financial Services
AI for Real Estate
AI for Energy
AI for Tourism
AI for Hospitality
AI for Oil and Gas
AI for Nature
AI for Future
AI for Good
Government | Scam Shield | LLM based Scam Prevention Service
Governments | Virtual Advisor | LLM based conversation support services
Governments | Blueprint Designer | LLM based design and construction
Tourism | Tour Personalization | LLM based travel and tour recommendations
Tourism | Language Assist | LLM based multilingual assistance
Governments | Waste Management | GenAI based waste optimization
Smart City | Dynamic Road Tax | Services to provide route recommendations
Manufacturing | Green Builder | LLMs for Carbon reduction plans
Hospitals | Healthcare AI | Best public health advice using GenAI
Hospitals | Disease Advice | LLMs for patient care and disease advice
Real Estate | Tenancy Screener | LLM for person risk profiling
Security | Edge Surveillance | Local LLMs in drones for Intelligent Surveillance
Gen AI App Store - Applications Powered by LLMs
AI App Store - Everything AI in one place
AI for ALL
AI for Sales People
AI for Investors
AI for Manufacturers
AI for Retailers
AI for Hospitals
AI for Transportation
AI for Urban Planning
AI for Financial Services
AI for Real Estate
AI for Energy
AI for Tourism
AI for Hospitality
AI for Oil and Gas
AI for Nature
AI for Future
AI for Good
Scaling the AI & GenAI for 100s of Use Cases
Data
LLMs /
Models
AI + LLM
Apps
GenAI
App Store
Model
Deployments
ATL AI
AI Access
Regions
100s of Use Cases
AIaaS Ecosystem with H2O.AI at AT&T
AI Governance
Center
Any AI/ML library
Build your own models
Import
Models
Create App/
Deploy
Model Microservice
ā— Model Validation
ā—‹ Adversarial analysis
ā—‹ Backtesting
ā—‹ What-if analysis
ā— Suggest models
ā—‹ Model catalog
ā— Suggest features
ā—‹ Feature store
ā— Ensemble strategies
ā— Model benchmarking
Create AI
App Store
Model Store
H2O Feature
Store
App Store
Model Analytics Platform
AI Bias eval, MLI
AI Ops
Application
Deploy AI
1
2 3
Self Service Data
Ingestion
H2O - 3
Data Store
on Prem
H2O Olympics
2.5
H2O Olympics
Responsible AI
AI/ML User
Model Store
H2O
WAVE
H2O WAVE
H2O Feature
Store
HDFS
Cloudera
H2O MLOps
Model Repository
Model Deployment
Model Monitoring
High Availability Model Hosting
GenAI (LLMs)
Integration
4
Customer and Community
Love is the greatest force in nature.
From Data Science to Strategy Science
Silence
The quieter you become the more we can hear
Ram Dass
listen
silent
universal
individual
you can do anything you set
your mind to.
eminem.
Courage
whose world is this?
https://www.youtube.com/watch?v=ahY62u62Uw8
Transform
manager to a leader
caterpillar to a butterfly!
Rise of Data Scientist
From Data to Strategy Science
CDAO now works closely with CEO
Intelligence is the Edge
The only wisdom we can hope to acquire
Is the wisdom of humility: humility is endless
TS Eliot said that.
You can be in my dream
If I can be in yours.
Bob Dylan said that.
You can be in my selfie
If I can be in yours.
