SlideShare a Scribd company logo
© 2023, Amazon Web Services, Inc. or its affiliates.
© 2023, Amazon Web Services, Inc. or its affiliates.
Massimo Re Ferrè
Senior Principal Technologist, AWS
Generative AI for the rest of us
© 2023, Amazon Web Services, Inc. or its affiliates. 2
Mainframes
Zooming out a bit
Technology wave #1
Data center
© 2023, Amazon Web Services, Inc. or its affiliates. 3
Mainframes
Zooming out a bit
Personal Computers
Technology wave #2
Technology wave #1
Data center
© 2023, Amazon Web Services, Inc. or its affiliates. 4
Mainframes
Zooming out a bit
Phyisical Servers
Virtual Machines
Personal Computers
Technology wave #2
Technology wave #1
Data center
© 2023, Amazon Web Services, Inc. or its affiliates. 5
Mainframes
Zooming out a bit
Phyisical Servers
Virtual Machines
Personal Computers
Technology wave #2
Technology wave #1
T
e
c
h
n
o
l
o
g
y
d
e
l
i
v
e
r
y
m
o
d
e
l
Data center
Cloud
© 2023, Amazon Web Services, Inc. or its affiliates. 6
Mainframes
Zooming out a bit
Phyisical Servers
Virtual Machines
Personal Computers
Containers
Functions
Technology wave #2
Technology wave #1
T
e
c
h
n
o
l
o
g
y
d
e
l
i
v
e
r
y
m
o
d
e
l
Data center
Cloud
© 2023, Amazon Web Services, Inc. or its affiliates. 7
Mainframes
Zooming out a bit
Phyisical Servers
Virtual Machines
Personal Computers
Containers
Functions
Generative AI
Technology wave #3
Technology wave #2
Technology wave #1
T
e
c
h
n
o
l
o
g
y
d
e
l
i
v
e
r
y
m
o
d
e
l
Data center
Cloud
© 2023, Amazon Web Services, Inc. or its affiliates.
© 2023, Amazon Web Services, Inc. or its affiliates.
What is Generative AI?
8
© 2023, Amazon Web Services, Inc. or its affiliates.
What is Generative AI (in simple terms)
9
- Traditional AI/ML: “Is this a picture of Rome or Florence?”
- [ Discriminative ]
- Gen AI: “Compare Rome Vs. Florence for someone interested in history”
- [ Generative ]
© 2023, Amazon Web Services, Inc. or its affiliates.
Gen AI “prompt”
10
A T I T S V E R Y C O R E ( T H E L L M - L A R G E L A N G U A G E M O D E L ) , G E N A I I S A F A K E . B U T A U S E F U L O N E
submit
© 2023, Amazon Web Services, Inc. or its affiliates.
This is how I like to think about an LLM
11
* or any profession that has nothing to do with a job in IT for that matter
q An LLM is akin to a … windsurfer professional*
qVery proficient in English
qAnd that had memorized all Wikipedia and all IT forums out there (and a lot more)
q They know Stack Overflow inside out! But don’t have a window to check the
weather (or a watch to check the time, etc)
q On their own, they have no relation to reality (beyond what they read)
q But they are great at generating free form content based on what they know
“have seen"
© 2023, Amazon Web Services, Inc. or its affiliates.
Why is Gen AI useful? [ the builder view ]
Source of unstructured knowledge
How can I use this knowledge and reason about it to create a new asset?
An asset being a piece of code, a whole program, a blog, an architecture, a troubleshooting
workflow, a db query and more outside of the IT realm (a poem, a picture, a receipt …)
© 2023, Amazon Web Services, Inc. or its affiliates.
Why is Gen AI useful? [ the builder view ]
Read and
memorize it all
(LOL – yeah sure)
(1)
Source of unstructured knowledge
You
© 2023, Amazon Web Services, Inc. or its affiliates.
Why is Gen AI useful? [ the builder view ]
You
Read and
memorize it all
(LOL – yeah sure)
Search engines (possibly
not relevant and still hard
- you are the integrator
and generator of a new
asset – text or code)
(1)
(2)
Source of unstructured knowledge
asset
© 2023, Amazon Web Services, Inc. or its affiliates.
Why is Gen AI useful? [ the builder view ]
15
You
Read and
memorize it all
(LOL – yeah sure)
LLM
Train on it
(doable)
(1)
(2)
(3a)
Natural language
conversation
(3b)
Source of unstructured knowledge
asset
asset
Search engines (possibly
not relevant and still hard
- you are the integrator
and generator of a new
asset – text or code)
© 2023, Amazon Web Services, Inc. or its affiliates.
© 2023, Amazon Web Services, Inc. or its affiliates.
My first Gen AI application
16
© 2023, Amazon Web Services, Inc. or its affiliates.
Real life use case
17
B A C K G R O U N D : I H A T E W H A T S A P P V O C A L M E S S A G E S
!!!!
© 2023, Amazon Web Services, Inc. or its affiliates.
Real life use case – the ClickOps version
18
B A C K G R O U N D : I H A T E W H A T S A P P V O C A L M E S S A G E S
Audio
file
Text
file
LLM
Audio to text translation Text summarization
Text
file
© 2023, Amazon Web Services, Inc. or its affiliates.
Real life use case – the ClickOps version
19
B A C K G R O U N D : I H A T E W H A T S A P P V O C A L M E S S A G E S
Prompt
Output
(generated asset)
© 2023, Amazon Web Services, Inc. or its affiliates.
Real life use case – the application version
20
M Y F I R S T ( N O N T U T O R I A L - B A S E D H E L L O - W O R L D ) G E N E R A T I V E A I A P P L I C A T I O N
© 2023, Amazon Web Services, Inc. or its affiliates. 21
Real life use case – the application version
L A M B D A C A L L S A N E X T E R N A L L L M S E R V I C E
https://it20.info/2023/08/building-a-generative-ai-application-using-aws-step-functions/
© 2023, Amazon Web Services, Inc. or its affiliates.
© 2023, Amazon Web Services, Inc. or its affiliates.
Making LLMs useful
22
© 2023, Amazon Web Services, Inc. or its affiliates. 23
Why are people talking about things like Agents, Tools, RAG..
q The LLM is just one (fundamental) component of Generative AI
q The LLM could hallucinate, don’t have knowledge of recent / private / live
information, can’t do advanced math, may have limited reasoning capabilities, etc.
q You need something to complement its capabilities and guide/help it
q Especially for “real” business use cases that go beyond “toying around”
© 2023, Amazon Web Services, Inc. or its affiliates. 24
Why are people talking about things like Agents, Tools, RAG..
T H E R E ’ R E T W O D I M E N S I O N S T H E L L M O P E R A T E S I N ( L E V E L O F A B S T R A C T I O N A N D D O M A I N S )
Developing code
Debugging code
Living life
Deploying code
Domains
Writing a novel
Organizing travels
© 2023, Amazon Web Services, Inc. or its affiliates. 25
Why are people talking about things like Agents, Tools, RAG..
T H E R E ’ R E T W O D I M E N S I O N S T H E L L M O P E R A T E S I N ( L E V E L O F A B S T R A C T I O N A N D D O M A I N S )
Developing code
Debugging code
Deploying code
Domains
Autocomplete a
function method
Build a new ERP
from scratch
Resolve an error
message
Rearchitect the
app to avoid this
error at scale
Suggest what I
could do today
Organize my
whole life for the
next 10 years
Level of abstraction
Simple task Complex task
Writing a novel
Living life
Organizing travels
Tell me how long
it takes driving
from Florence to
Rome
Plan in details all
my 1-year long
sabbatical
© 2023, Amazon Web Services, Inc. or its affiliates. 26
Why are people talking about things like Agents, Tools, RAG..
T H E R E ’ R E T W O D I M E N S I O N S T H E L L M O P E R A T E S I N ( L E V E L O F A B S T R A C T I O N A N D D O M A I N S )
Domains
Level of abstraction
Simple task Complex task
Progressive complexity
Completion Chat Reasoning Acting
à
à
à
coverage
Domain
A
function
of
the
corpus
data
© 2023, Amazon Web Services, Inc. or its affiliates. 27
Why are people talking about things like Agents, Tools, RAG..
T H E R E ’ R E T W O D I M E N S I O N S T H E L L M O P E R A T E S I N ( L E V E L O F A B S T R A C T I O N A N D D O M A I N S )
Domains
Level of abstraction
Simple task Complex task
Large Language model
Smaller
purpose
built/tuned
model
Models may need to be
helped / guided to achieve
goals where task complexity
is too high or simply for
missing domain knowledge
© 2023, Amazon Web Services, Inc. or its affiliates. 28
Why are people talking about things like Agents, Tools, RAG..
A N E X A M P L E O F C O T ( C H A I N O F T H O U G H T S )
https://arxiv.org/abs/2201.11903
But sometimes in-prompt Chain of Thoughts (CoT) isn’t enough for the LLM to reason properly
W
elcom
e to
the
m
agic world
of
“prom
pt engineering”
© 2023, Amazon Web Services, Inc. or its affiliates. 29
Why are people talking about things like Agents, Tools, RAG..
A N E X A M P L E O F T H E F A C T C H E C K I N G W I T H P R O M P T C H A I N I N G P R O C E S S
https://it20.info/2023/6/the-dark-zone-between-the-magic-genai-experience-and-the-large-language-model/
Q: What is the biggest clock in the world?
© 2023, Amazon Web Services, Inc. or its affiliates. 30
Why are people talking about things like Agents, Tools, RAG..
LLM
A N E X A M P L E O F T O O L S
You
“what’s the weather like
today in Rome?”
Math function
code
Web search
code
“Calculate <very complex
formula>”
(1a)
(2a)
(1b)
(2b)
© 2023, Amazon Web Services, Inc. or its affiliates. 31
Why are people talking about things like Agents, Tools, RAG..
A N E X A M P L E O F R E A C T ( R E A S O N I N G A N D A C T I N G )
https://arxiv.org/abs/2210.03629
© 2023, Amazon Web Services, Inc. or its affiliates. 32
Why are people talking about things like Agents, Tools, RAG..
LLM
A N E X A M P L E O F R E A C T ( R E A S O N I N G A N D A C T I N G )
You
Iterating
reasoning
code
“Write the solution for
<very complex task>” (1)
(2)
https://arxiv.org/abs/2210.03629
Tool
© 2023, Amazon Web Services, Inc. or its affiliates. 33
Why are people talking about things like Agents, Tools, RAG..
LLM
Vector
DB
A N E X A M P L E O F R A G ( R E T R I E V A L - A U G M E N T E D G E N E R A T I O N )
You
Private corpus of data
embedding
“Write a draft email on <specific
company secret topic>”
(1)
(2)
© 2023, Amazon Web Services, Inc. or its affiliates.
© 2023, Amazon Web Services, Inc. or its affiliates.
Prompt context Vs. RAG Vs. fine-tuning
34
© 2023, Amazon Web Services, Inc. or its affiliates. 35
Prompt context Vs. RAG Vs. fine-tuning: I am lost
q Fair. There are three ways to increase an LLM answer precision and correctness
1. Provide context in the prompt
2. Augment the LLM with an external source of vectorized data at inference time (RAG)
3. Fine tune the LLM with additional data
q There isn’t a global right or wrong approach. As often happens, it depends
q Also they are not mutually exclusive
q They could (and often should) be used together to achieve optimal results
© 2023, Amazon Web Services, Inc. or its affiliates. 36
q Rate of the change of the data source
q Limits, cost, latency, speed of prompt context tokens
q Cost of fine tuning
q including the work required to “prepare the data”
q Cost of creating and maintaining the vector store
Prompt context Vs. RAG Vs. fine-tuning: when to use what?
© 2023, Amazon Web Services, Inc. or its affiliates. 37
q Complexity of the architecture
q fine tuning may make the architecture easier (with an upfront fine-tuning investment)
q Shape and location of the data source
q Precision of the outcome
q no absolute rules exist, testing may be required
q Personal experience of the team building the solution
q “I have always used RAG and that’s what I am comfortable with”
Prompt context Vs. RAG Vs. fine-tuning: when to use what?
© 2023, Amazon Web Services, Inc. or its affiliates.
© 2023, Amazon Web Services, Inc. or its affiliates.
Who’s Gen AI for?
38
© 2023, Amazon Web Services, Inc. or its affiliates. 39
Who’s Gen AI for?
q For the developer that is writing code
q e.g. code assistants e.g. AWS CodeWhisperer
q For the developer that wants to use English as a programming language
q e.g. the example of the WhatsApp vocal messages
q For the ops person that does not want to write a SQL query to extract data
q e.g. https://www.honeycomb.io/blog/introducing-query-assistant
© 2023, Amazon Web Services, Inc. or its affiliates. 40
Who’s Gen AI for?
q For the business analyst that wants to create a report off of a spreadsheet
q For the journalist that wants to draft an article on a specific topic
q Etc. etc.
q Come see me later to chat about the story of my plumber impressed by “chat …
chat …. chat something” (true story)
© 2023, Amazon Web Services, Inc. or its affiliates. 41
Conclusions
q Get ready for this new wave. It’s coming and (I think) it’s staying.
q LLMs have moved the needle of the art of possible
q But LLMs alone are not enough. You need to … make LLMs useful.
q Gen AI is for everyone, not just for “builders”. It’s for “consumers” of tech too.
q Go explore! Go build!
© 2023, Amazon Web Services, Inc. or its affiliates.
© 2023, Amazon Web Services, Inc. or its affiliates.
Massimo Re Ferrè
Senior Principal Technologist, AWS
Twitter: @mreferre
E-mail: mreferre@amazon.com
Thanks!

More Related Content

What's hot (20)

PDF
AI and ML Series - Introduction to Generative AI and LLMs - Session 1
DianaGray10
 
PDF
Exploring Opportunities in the Generative AI Value Chain.pdf
Dung Hoang
 
PDF
Leveraging Generative AI & Best practices
DianaGray10
 
PDF
Unlocking the Power of Generative AI An Executive's Guide.pdf
PremNaraindas1
 
PDF
Large Language Models Bootcamp
Data Science Dojo
 
PPTX
Using Generative AI
Mark DeLoura
 
PPTX
Generative AI Risks & Concerns
Ajitesh Kumar
 
PDF
Generative-AI-in-enterprise-20230615.pdf
Liming Zhu
 
PDF
Suresh Poopandi_Generative AI On AWS-MidWestCommunityDay-Final.pdf
AWS Chicago
 
PDF
An Introduction to Generative AI - May 18, 2023
CoriFaklaris1
 
PDF
Large Language Models - Chat AI.pdf
David Rostcheck
 
PDF
Introduction to LLMs
Loic Merckel
 
PDF
Building NLP applications with Transformers
Julien SIMON
 
PDF
Intro to LLMs
Loic Merckel
 
PDF
Let's talk about GPT: A crash course in Generative AI for researchers
Steven Van Vaerenbergh
 
PDF
Generative AI: Past, Present, and Future – A Practitioner's Perspective
Huahai Yang
 
PPTX
The Future of AI is Generative not Discriminative 5/26/2021
Steve Omohundro
 
PPTX
Generative AI and law.pptx
Chris Marsden
 
PDF
generative-ai-fundamentals and Large language models
AdventureWorld5
 
PPTX
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...
DataScienceConferenc1
 
AI and ML Series - Introduction to Generative AI and LLMs - Session 1
DianaGray10
 
Exploring Opportunities in the Generative AI Value Chain.pdf
Dung Hoang
 
Leveraging Generative AI & Best practices
DianaGray10
 
Unlocking the Power of Generative AI An Executive's Guide.pdf
PremNaraindas1
 
Large Language Models Bootcamp
Data Science Dojo
 
Using Generative AI
Mark DeLoura
 
Generative AI Risks & Concerns
Ajitesh Kumar
 
Generative-AI-in-enterprise-20230615.pdf
Liming Zhu
 
Suresh Poopandi_Generative AI On AWS-MidWestCommunityDay-Final.pdf
AWS Chicago
 
An Introduction to Generative AI - May 18, 2023
CoriFaklaris1
 
Large Language Models - Chat AI.pdf
David Rostcheck
 
Introduction to LLMs
Loic Merckel
 
Building NLP applications with Transformers
Julien SIMON
 
Intro to LLMs
Loic Merckel
 
Let's talk about GPT: A crash course in Generative AI for researchers
Steven Van Vaerenbergh
 
Generative AI: Past, Present, and Future – A Practitioner's Perspective
Huahai Yang
 
The Future of AI is Generative not Discriminative 5/26/2021
Steve Omohundro
 
Generative AI and law.pptx
Chris Marsden
 
generative-ai-fundamentals and Large language models
AdventureWorld5
 
[DSC DACH 23] ChatGPT and Beyond: How generative AI is Changing the way peopl...
DataScienceConferenc1
 

Similar to Generative AI for the rest of us (20)

PDF
The Rise of the Planet of the Agents: LLM-based AI Agents and Cloud Security ...
Cloud Village
 
PDF
'The Art & Science of LLM Reliability - Building Trustworthy AI Systems' by M...
Daniel Zivkovic
 
PPTX
Introduction to LLMs and their relevance for Official Statistics
Dario Buono
 
PDF
Automated Workflows and AI Agents with Amazon Bedrock
Tilores
 
PPTX
AI Agents and their implications for Enterprise AI Use-cases
Debmalya Biswas
 
PPTX
apidays New York 2025 - AI for All by Ananya Upadhyay (United Rentals, Inc.)
apidays
 
PDF
EIS-Webinar-Agent-Approaches-2024-08-21.pdf
Earley Information Science
 
PDF
Java Developers - What Lies Ahead in the AI era
Emily Jiang
 
PDF
OutSystems User Group November 2024
mail496323
 
PDF
AI FCE online presentation_japie swanepoel
japie swanepoel
 
PDF
LLM-based Multi-Agent Systems to Replace Traditional Software
Ivo Andreev
 
PDF
Large Language Models, Data & APIs - Integrating Generative AI Power into you...
NETUserGroupBern
 
PDF
Introduction to AI/ML with AWS
Suman Debnath
 
PPTX
Los ordenadores ya no son las simples calculadoras numéricas que eran antes
AlfonsoMelndez1
 
PPTX
GPT, LLM, RAG, and RAG in Action: Understanding the Future of AI-Powered Info...
Muralidharan Deenathayalan
 
PDF
Devoxx Morocco 2024 - The Future Beyond LLMs: Exploring Agentic AI
Raphaël Semeteys
 
PDF
Vertex AI Agent Builder - GDG Alicante - Julio 2024
Nicolás Lopéz
 
PDF
AWS Summit Singapore 2019 | The Smart Way to Build an AI & ML Strategy for Yo...
AWS Summits
 
PDF
Enhancing SEO Content Writing with AI: Opportunities & Challenges
Search Engine Journal
 
PPTX
AI Agents, such as Autogen at Tide Sprint
Nathan Bijnens
 
The Rise of the Planet of the Agents: LLM-based AI Agents and Cloud Security ...
Cloud Village
 
'The Art & Science of LLM Reliability - Building Trustworthy AI Systems' by M...
Daniel Zivkovic
 
Introduction to LLMs and their relevance for Official Statistics
Dario Buono
 
Automated Workflows and AI Agents with Amazon Bedrock
Tilores
 
AI Agents and their implications for Enterprise AI Use-cases
Debmalya Biswas
 
apidays New York 2025 - AI for All by Ananya Upadhyay (United Rentals, Inc.)
apidays
 
EIS-Webinar-Agent-Approaches-2024-08-21.pdf
Earley Information Science
 
Java Developers - What Lies Ahead in the AI era
Emily Jiang
 
OutSystems User Group November 2024
mail496323
 
AI FCE online presentation_japie swanepoel
japie swanepoel
 
LLM-based Multi-Agent Systems to Replace Traditional Software
Ivo Andreev
 
Large Language Models, Data & APIs - Integrating Generative AI Power into you...
NETUserGroupBern
 
Introduction to AI/ML with AWS
Suman Debnath
 
Los ordenadores ya no son las simples calculadoras numéricas que eran antes
AlfonsoMelndez1
 
GPT, LLM, RAG, and RAG in Action: Understanding the Future of AI-Powered Info...
Muralidharan Deenathayalan
 
Devoxx Morocco 2024 - The Future Beyond LLMs: Exploring Agentic AI
Raphaël Semeteys
 
Vertex AI Agent Builder - GDG Alicante - Julio 2024
Nicolás Lopéz
 
AWS Summit Singapore 2019 | The Smart Way to Build an AI & ML Strategy for Yo...
AWS Summits
 
Enhancing SEO Content Writing with AI: Opportunities & Challenges
Search Engine Journal
 
AI Agents, such as Autogen at Tide Sprint
Nathan Bijnens
 
Ad

More from Massimo Ferre' (14)

PDF
IDI_2023_MRF-Final.pdf
Massimo Ferre'
 
PDF
Web Assembly (on the server)
Massimo Ferre'
 
PDF
IDI 2022: Making sense of the '17 ways to run containers on AWS'
Massimo Ferre'
 
PDF
Codemotion 2020 - Containers Meet Serverless
Massimo Ferre'
 
PDF
ContainerDay 2020 - Using Docker as a frontend for Amazon ECS and AWS Fargate
Massimo Ferre'
 
PDF
IDI 2020 - Containers Meet Serverless
Massimo Ferre'
 
PDF
Containers Meetup (AWS+CNCF) Milano Jan 15th 2020
Massimo Ferre'
 
PDF
Breaking the monolith (an example)
Massimo Ferre'
 
PDF
From 0 to Blue-Green deployments on AWS Fargate
Massimo Ferre'
 
PDF
AWS Summit Stockholm - Fargate: deploy containers, not infrastructure
Massimo Ferre'
 
PPTX
AWS Summit London 2019 - Containers on AWS
Massimo Ferre'
 
PPTX
Meetup CNCF Torino - Amazon EKS March 29th 2019
Massimo Ferre'
 
PPTX
End-to-end CI/CD deployments of containerized applications using AWS services
Massimo Ferre'
 
PPTX
Containers at AWS: State of the Union
Massimo Ferre'
 
IDI_2023_MRF-Final.pdf
Massimo Ferre'
 
Web Assembly (on the server)
Massimo Ferre'
 
IDI 2022: Making sense of the '17 ways to run containers on AWS'
Massimo Ferre'
 
Codemotion 2020 - Containers Meet Serverless
Massimo Ferre'
 
ContainerDay 2020 - Using Docker as a frontend for Amazon ECS and AWS Fargate
Massimo Ferre'
 
IDI 2020 - Containers Meet Serverless
Massimo Ferre'
 
Containers Meetup (AWS+CNCF) Milano Jan 15th 2020
Massimo Ferre'
 
Breaking the monolith (an example)
Massimo Ferre'
 
From 0 to Blue-Green deployments on AWS Fargate
Massimo Ferre'
 
AWS Summit Stockholm - Fargate: deploy containers, not infrastructure
Massimo Ferre'
 
AWS Summit London 2019 - Containers on AWS
Massimo Ferre'
 
Meetup CNCF Torino - Amazon EKS March 29th 2019
Massimo Ferre'
 
End-to-end CI/CD deployments of containerized applications using AWS services
Massimo Ferre'
 
Containers at AWS: State of the Union
Massimo Ferre'
 
Ad

Recently uploaded (20)

PDF
GTU Civil Engineering All Semester Syllabus.pdf
Vimal Bhojani
 
PPTX
artificial intelligence applications in Geomatics
NawrasShatnawi1
 
PDF
International Journal of Information Technology Convergence and services (IJI...
ijitcsjournal4
 
PPTX
265587293-NFPA 101 Life safety code-PPT-1.pptx
chandermwason
 
PPT
Carmon_Remote Sensing GIS by Mahesh kumar
DhananjayM6
 
PDF
Biomechanics of Gait: Engineering Solutions for Rehabilitation (www.kiu.ac.ug)
publication11
 
PPTX
Day2 B2 Best.pptx
helenjenefa1
 
DOC
MRRS Strength and Durability of Concrete
CivilMythili
 
PDF
Introduction to Productivity and Quality
মোঃ ফুরকান উদ্দিন জুয়েল
 
PPTX
原版一样(Acadia毕业证书)加拿大阿卡迪亚大学毕业证办理方法
Taqyea
 
PPTX
Green Building & Energy Conservation ppt
Sagar Sarangi
 
PPTX
fatigue in aircraft structures-221113192308-0ad6dc8c.pptx
aviatecofficial
 
PPTX
MobileComputingMANET2023 MobileComputingMANET2023.pptx
masterfake98765
 
PPTX
DATA BASE MANAGEMENT AND RELATIONAL DATA
gomathisankariv2
 
PDF
Water Industry Process Automation & Control Monthly July 2025
Water Industry Process Automation & Control
 
PPTX
Mechanical Design of shell and tube heat exchangers as per ASME Sec VIII Divi...
shahveer210504
 
PDF
Basic_Concepts_in_Clinical_Biochemistry_2018كيمياء_عملي.pdf
AdelLoin
 
PDF
Viol_Alessandro_Presentazione_prelaurea.pdf
dsecqyvhbowrzxshhf
 
PDF
AI TECHNIQUES FOR IDENTIFYING ALTERATIONS IN THE HUMAN GUT MICROBIOME IN MULT...
vidyalalltv1
 
PDF
Set Relation Function Practice session 24.05.2025.pdf
DrStephenStrange4
 
GTU Civil Engineering All Semester Syllabus.pdf
Vimal Bhojani
 
artificial intelligence applications in Geomatics
NawrasShatnawi1
 
International Journal of Information Technology Convergence and services (IJI...
ijitcsjournal4
 
265587293-NFPA 101 Life safety code-PPT-1.pptx
chandermwason
 
Carmon_Remote Sensing GIS by Mahesh kumar
DhananjayM6
 
Biomechanics of Gait: Engineering Solutions for Rehabilitation (www.kiu.ac.ug)
publication11
 
Day2 B2 Best.pptx
helenjenefa1
 
MRRS Strength and Durability of Concrete
CivilMythili
 
Introduction to Productivity and Quality
মোঃ ফুরকান উদ্দিন জুয়েল
 
原版一样(Acadia毕业证书)加拿大阿卡迪亚大学毕业证办理方法
Taqyea
 
Green Building & Energy Conservation ppt
Sagar Sarangi
 
fatigue in aircraft structures-221113192308-0ad6dc8c.pptx
aviatecofficial
 
MobileComputingMANET2023 MobileComputingMANET2023.pptx
masterfake98765
 
DATA BASE MANAGEMENT AND RELATIONAL DATA
gomathisankariv2
 
Water Industry Process Automation & Control Monthly July 2025
Water Industry Process Automation & Control
 
Mechanical Design of shell and tube heat exchangers as per ASME Sec VIII Divi...
shahveer210504
 
Basic_Concepts_in_Clinical_Biochemistry_2018كيمياء_عملي.pdf
AdelLoin
 
Viol_Alessandro_Presentazione_prelaurea.pdf
dsecqyvhbowrzxshhf
 
AI TECHNIQUES FOR IDENTIFYING ALTERATIONS IN THE HUMAN GUT MICROBIOME IN MULT...
vidyalalltv1
 
Set Relation Function Practice session 24.05.2025.pdf
DrStephenStrange4
 

Generative AI for the rest of us

  • 1. © 2023, Amazon Web Services, Inc. or its affiliates. © 2023, Amazon Web Services, Inc. or its affiliates. Massimo Re Ferrè Senior Principal Technologist, AWS Generative AI for the rest of us
  • 2. © 2023, Amazon Web Services, Inc. or its affiliates. 2 Mainframes Zooming out a bit Technology wave #1 Data center
  • 3. © 2023, Amazon Web Services, Inc. or its affiliates. 3 Mainframes Zooming out a bit Personal Computers Technology wave #2 Technology wave #1 Data center
  • 4. © 2023, Amazon Web Services, Inc. or its affiliates. 4 Mainframes Zooming out a bit Phyisical Servers Virtual Machines Personal Computers Technology wave #2 Technology wave #1 Data center
  • 5. © 2023, Amazon Web Services, Inc. or its affiliates. 5 Mainframes Zooming out a bit Phyisical Servers Virtual Machines Personal Computers Technology wave #2 Technology wave #1 T e c h n o l o g y d e l i v e r y m o d e l Data center Cloud
  • 6. © 2023, Amazon Web Services, Inc. or its affiliates. 6 Mainframes Zooming out a bit Phyisical Servers Virtual Machines Personal Computers Containers Functions Technology wave #2 Technology wave #1 T e c h n o l o g y d e l i v e r y m o d e l Data center Cloud
  • 7. © 2023, Amazon Web Services, Inc. or its affiliates. 7 Mainframes Zooming out a bit Phyisical Servers Virtual Machines Personal Computers Containers Functions Generative AI Technology wave #3 Technology wave #2 Technology wave #1 T e c h n o l o g y d e l i v e r y m o d e l Data center Cloud
  • 8. © 2023, Amazon Web Services, Inc. or its affiliates. © 2023, Amazon Web Services, Inc. or its affiliates. What is Generative AI? 8
  • 9. © 2023, Amazon Web Services, Inc. or its affiliates. What is Generative AI (in simple terms) 9 - Traditional AI/ML: “Is this a picture of Rome or Florence?” - [ Discriminative ] - Gen AI: “Compare Rome Vs. Florence for someone interested in history” - [ Generative ]
  • 10. © 2023, Amazon Web Services, Inc. or its affiliates. Gen AI “prompt” 10 A T I T S V E R Y C O R E ( T H E L L M - L A R G E L A N G U A G E M O D E L ) , G E N A I I S A F A K E . B U T A U S E F U L O N E submit
  • 11. © 2023, Amazon Web Services, Inc. or its affiliates. This is how I like to think about an LLM 11 * or any profession that has nothing to do with a job in IT for that matter q An LLM is akin to a … windsurfer professional* qVery proficient in English qAnd that had memorized all Wikipedia and all IT forums out there (and a lot more) q They know Stack Overflow inside out! But don’t have a window to check the weather (or a watch to check the time, etc) q On their own, they have no relation to reality (beyond what they read) q But they are great at generating free form content based on what they know “have seen"
  • 12. © 2023, Amazon Web Services, Inc. or its affiliates. Why is Gen AI useful? [ the builder view ] Source of unstructured knowledge How can I use this knowledge and reason about it to create a new asset? An asset being a piece of code, a whole program, a blog, an architecture, a troubleshooting workflow, a db query and more outside of the IT realm (a poem, a picture, a receipt …)
  • 13. © 2023, Amazon Web Services, Inc. or its affiliates. Why is Gen AI useful? [ the builder view ] Read and memorize it all (LOL – yeah sure) (1) Source of unstructured knowledge You
  • 14. © 2023, Amazon Web Services, Inc. or its affiliates. Why is Gen AI useful? [ the builder view ] You Read and memorize it all (LOL – yeah sure) Search engines (possibly not relevant and still hard - you are the integrator and generator of a new asset – text or code) (1) (2) Source of unstructured knowledge asset
  • 15. © 2023, Amazon Web Services, Inc. or its affiliates. Why is Gen AI useful? [ the builder view ] 15 You Read and memorize it all (LOL – yeah sure) LLM Train on it (doable) (1) (2) (3a) Natural language conversation (3b) Source of unstructured knowledge asset asset Search engines (possibly not relevant and still hard - you are the integrator and generator of a new asset – text or code)
  • 16. © 2023, Amazon Web Services, Inc. or its affiliates. © 2023, Amazon Web Services, Inc. or its affiliates. My first Gen AI application 16
  • 17. © 2023, Amazon Web Services, Inc. or its affiliates. Real life use case 17 B A C K G R O U N D : I H A T E W H A T S A P P V O C A L M E S S A G E S !!!!
  • 18. © 2023, Amazon Web Services, Inc. or its affiliates. Real life use case – the ClickOps version 18 B A C K G R O U N D : I H A T E W H A T S A P P V O C A L M E S S A G E S Audio file Text file LLM Audio to text translation Text summarization Text file
  • 19. © 2023, Amazon Web Services, Inc. or its affiliates. Real life use case – the ClickOps version 19 B A C K G R O U N D : I H A T E W H A T S A P P V O C A L M E S S A G E S Prompt Output (generated asset)
  • 20. © 2023, Amazon Web Services, Inc. or its affiliates. Real life use case – the application version 20 M Y F I R S T ( N O N T U T O R I A L - B A S E D H E L L O - W O R L D ) G E N E R A T I V E A I A P P L I C A T I O N
  • 21. © 2023, Amazon Web Services, Inc. or its affiliates. 21 Real life use case – the application version L A M B D A C A L L S A N E X T E R N A L L L M S E R V I C E https://it20.info/2023/08/building-a-generative-ai-application-using-aws-step-functions/
  • 22. © 2023, Amazon Web Services, Inc. or its affiliates. © 2023, Amazon Web Services, Inc. or its affiliates. Making LLMs useful 22
  • 23. © 2023, Amazon Web Services, Inc. or its affiliates. 23 Why are people talking about things like Agents, Tools, RAG.. q The LLM is just one (fundamental) component of Generative AI q The LLM could hallucinate, don’t have knowledge of recent / private / live information, can’t do advanced math, may have limited reasoning capabilities, etc. q You need something to complement its capabilities and guide/help it q Especially for “real” business use cases that go beyond “toying around”
  • 24. © 2023, Amazon Web Services, Inc. or its affiliates. 24 Why are people talking about things like Agents, Tools, RAG.. T H E R E ’ R E T W O D I M E N S I O N S T H E L L M O P E R A T E S I N ( L E V E L O F A B S T R A C T I O N A N D D O M A I N S ) Developing code Debugging code Living life Deploying code Domains Writing a novel Organizing travels
  • 25. © 2023, Amazon Web Services, Inc. or its affiliates. 25 Why are people talking about things like Agents, Tools, RAG.. T H E R E ’ R E T W O D I M E N S I O N S T H E L L M O P E R A T E S I N ( L E V E L O F A B S T R A C T I O N A N D D O M A I N S ) Developing code Debugging code Deploying code Domains Autocomplete a function method Build a new ERP from scratch Resolve an error message Rearchitect the app to avoid this error at scale Suggest what I could do today Organize my whole life for the next 10 years Level of abstraction Simple task Complex task Writing a novel Living life Organizing travels Tell me how long it takes driving from Florence to Rome Plan in details all my 1-year long sabbatical
  • 26. © 2023, Amazon Web Services, Inc. or its affiliates. 26 Why are people talking about things like Agents, Tools, RAG.. T H E R E ’ R E T W O D I M E N S I O N S T H E L L M O P E R A T E S I N ( L E V E L O F A B S T R A C T I O N A N D D O M A I N S ) Domains Level of abstraction Simple task Complex task Progressive complexity Completion Chat Reasoning Acting à à à coverage Domain A function of the corpus data
  • 27. © 2023, Amazon Web Services, Inc. or its affiliates. 27 Why are people talking about things like Agents, Tools, RAG.. T H E R E ’ R E T W O D I M E N S I O N S T H E L L M O P E R A T E S I N ( L E V E L O F A B S T R A C T I O N A N D D O M A I N S ) Domains Level of abstraction Simple task Complex task Large Language model Smaller purpose built/tuned model Models may need to be helped / guided to achieve goals where task complexity is too high or simply for missing domain knowledge
  • 28. © 2023, Amazon Web Services, Inc. or its affiliates. 28 Why are people talking about things like Agents, Tools, RAG.. A N E X A M P L E O F C O T ( C H A I N O F T H O U G H T S ) https://arxiv.org/abs/2201.11903 But sometimes in-prompt Chain of Thoughts (CoT) isn’t enough for the LLM to reason properly W elcom e to the m agic world of “prom pt engineering”
  • 29. © 2023, Amazon Web Services, Inc. or its affiliates. 29 Why are people talking about things like Agents, Tools, RAG.. A N E X A M P L E O F T H E F A C T C H E C K I N G W I T H P R O M P T C H A I N I N G P R O C E S S https://it20.info/2023/6/the-dark-zone-between-the-magic-genai-experience-and-the-large-language-model/ Q: What is the biggest clock in the world?
  • 30. © 2023, Amazon Web Services, Inc. or its affiliates. 30 Why are people talking about things like Agents, Tools, RAG.. LLM A N E X A M P L E O F T O O L S You “what’s the weather like today in Rome?” Math function code Web search code “Calculate <very complex formula>” (1a) (2a) (1b) (2b)
  • 31. © 2023, Amazon Web Services, Inc. or its affiliates. 31 Why are people talking about things like Agents, Tools, RAG.. A N E X A M P L E O F R E A C T ( R E A S O N I N G A N D A C T I N G ) https://arxiv.org/abs/2210.03629
  • 32. © 2023, Amazon Web Services, Inc. or its affiliates. 32 Why are people talking about things like Agents, Tools, RAG.. LLM A N E X A M P L E O F R E A C T ( R E A S O N I N G A N D A C T I N G ) You Iterating reasoning code “Write the solution for <very complex task>” (1) (2) https://arxiv.org/abs/2210.03629 Tool
  • 33. © 2023, Amazon Web Services, Inc. or its affiliates. 33 Why are people talking about things like Agents, Tools, RAG.. LLM Vector DB A N E X A M P L E O F R A G ( R E T R I E V A L - A U G M E N T E D G E N E R A T I O N ) You Private corpus of data embedding “Write a draft email on <specific company secret topic>” (1) (2)
  • 34. © 2023, Amazon Web Services, Inc. or its affiliates. © 2023, Amazon Web Services, Inc. or its affiliates. Prompt context Vs. RAG Vs. fine-tuning 34
  • 35. © 2023, Amazon Web Services, Inc. or its affiliates. 35 Prompt context Vs. RAG Vs. fine-tuning: I am lost q Fair. There are three ways to increase an LLM answer precision and correctness 1. Provide context in the prompt 2. Augment the LLM with an external source of vectorized data at inference time (RAG) 3. Fine tune the LLM with additional data q There isn’t a global right or wrong approach. As often happens, it depends q Also they are not mutually exclusive q They could (and often should) be used together to achieve optimal results
  • 36. © 2023, Amazon Web Services, Inc. or its affiliates. 36 q Rate of the change of the data source q Limits, cost, latency, speed of prompt context tokens q Cost of fine tuning q including the work required to “prepare the data” q Cost of creating and maintaining the vector store Prompt context Vs. RAG Vs. fine-tuning: when to use what?
  • 37. © 2023, Amazon Web Services, Inc. or its affiliates. 37 q Complexity of the architecture q fine tuning may make the architecture easier (with an upfront fine-tuning investment) q Shape and location of the data source q Precision of the outcome q no absolute rules exist, testing may be required q Personal experience of the team building the solution q “I have always used RAG and that’s what I am comfortable with” Prompt context Vs. RAG Vs. fine-tuning: when to use what?
  • 38. © 2023, Amazon Web Services, Inc. or its affiliates. © 2023, Amazon Web Services, Inc. or its affiliates. Who’s Gen AI for? 38
  • 39. © 2023, Amazon Web Services, Inc. or its affiliates. 39 Who’s Gen AI for? q For the developer that is writing code q e.g. code assistants e.g. AWS CodeWhisperer q For the developer that wants to use English as a programming language q e.g. the example of the WhatsApp vocal messages q For the ops person that does not want to write a SQL query to extract data q e.g. https://www.honeycomb.io/blog/introducing-query-assistant
  • 40. © 2023, Amazon Web Services, Inc. or its affiliates. 40 Who’s Gen AI for? q For the business analyst that wants to create a report off of a spreadsheet q For the journalist that wants to draft an article on a specific topic q Etc. etc. q Come see me later to chat about the story of my plumber impressed by “chat … chat …. chat something” (true story)
  • 41. © 2023, Amazon Web Services, Inc. or its affiliates. 41 Conclusions q Get ready for this new wave. It’s coming and (I think) it’s staying. q LLMs have moved the needle of the art of possible q But LLMs alone are not enough. You need to … make LLMs useful. q Gen AI is for everyone, not just for “builders”. It’s for “consumers” of tech too. q Go explore! Go build!
  • 42. © 2023, Amazon Web Services, Inc. or its affiliates. © 2023, Amazon Web Services, Inc. or its affiliates. Massimo Re Ferrè Senior Principal Technologist, AWS Twitter: @mreferre E-mail: [email protected] Thanks!