SlideShare a Scribd company logo
AWS RDS
MIGRATION
Presented by Hardik Shah
Website www.blazeclan.com
Follow us @clouditbetter
Contact +91 9890 802 529
KEY TAKEAWAYS
Migrating Databases
Migrating minimal databases with minimal downtime to AWS RDS, Amazon Redshift and
Amazon Aurora
On Premise to Cloud
Migration of databases to same and different engines and from on premise to cloud
Schema Conversion
Schema conversion from Oracle and SQL Server to MySQL and Aurora
Traditional Approach= Time, Cost
Commercial tool for migration/replication
Application Downtime
Legacy Schema Objects
Introducing AWS RDS Migration Tool
Easy to setup and start migration in less than
15 mins
No downtime of applications during migration
Replicate from EC2 -> RDS or vice versa
Move data to same or different database
engines
Cost effective and no upfront cost
Keep your Apps running during the
Migration
Amazon RDS Migration
Tool consists of a Web-based
console and a replication server
to replicate data across
heterogeneous data sources.
Amazon RDS Migration Tool can
execute replication between
enterprise databases including
Oracle, Microsoft SQL Server,
and IBM DB2.
Replication is log based, which
means that only the changes
are read. This reduces the
impact on the source
databases.
Amazon RDS Migration
Tool can carry out two types of
replication: Full Load and
Change Processing (CDC).
Load data
efficiently and
quickly to
operational data
stores/
warehouses
Create
copies of
production
databases
Distribute
data
across
databases
Amazon
RDS
Migration
Tool has
high
throughput,
speed, and
scale.
Full Load: The full
load process
creates files or
tables at the target
database,
automatically
defines the
metadata that is
required at the
target, and
populates the
tables with data
from the source.
Change
Processing (CDC):
Change processing
captures changes
in the source data
or metadata as
they occur and
applies them to the
target database as
soon as possible in
near-real-time.
Features
Load reduction: It is recommended that you have a copy of all or of a subset of a collection on a different
server to reduce the load on the main server.
Improved service: Users of the copy of the information may get better access to the copy of the data
than to the original.
Security considerations: Some users might be allowed access to a subset of the data and only this
subset is made available as a replicated copy to those users.
Geographic distribution: The enterprise (for example, a chain of retail stores or warehouses)
may be widely distributed and each node uses primarily its own subset of the data (in addition
to all of the data being available at a central location for less common use).
Disaster Recovery: A copy of the main data is required for rapid failover (the capability to
switch over to a redundant or standby computer server, in case of failure of the main
system).
Support the need for implementing "cloud" computing.
Replication
During replication, a collection
of data is copied from system
A to system B. A is known as
the source (for this collection),
B is known as the target. A
system can be either a source
or a target or even both (within
certain restrictions). When a
number of sources and targets
and data collections are
defined, the replication
topology can be quite
complex.
Integrity: Make sure that the data in
the target actually reflects the
completed result of a change in the
source and not some intermediate
invalid result.
Latency: How out-of-date is the
copy?
Consistency: Make sure that if
the change affects several
different tables or rows, the
copy reflects a consistent state
all were changed or none).
The first two issues are the
responsibility of the replicator.
While some latency is
unavoidable in any system, a
good replicator will aim not to
exceed several seconds of
latency as a general rule.
Replication Tasks
The definition of a task consists of:
Specifying the source and target databases
Specifying the source and target tables to be kept in sync
Specifying the relevant source table columns
Specifying filtering conditions (if any) for each source table, as Boolean predicates on the values one or
more source columns (the predicates are in SQLite syntax)
Listing the target table columns and (optionally) specifying their data types and values (as expressions or
functions over the values of one or more source or target columns, using SQL syntax). If not specified, the
same column names and values as the source tables are used, with default mapping of the source DBMS
data types onto the target DBMS data types. Amazon RDS Migration Tool automatically takes care of the
required filtering, transformations and computations during the Load or CDC execution.
Replication Tasks
The simplest specification of a task may not mention of the target data, with only the source tables (or
ALL, or a mask) specified. In this case, the target tables are identical to the source tables, using the
default mappings between the source and target DBMS data types. In this way, the entire definition
process could be accomplished by a single click, referred to as "Click to Replicate".
Once a task is defined, it can be activated immediately. The target tables with the necessary metadata
definitions are automatically created and loaded, and the CDC is activated. The replication activity can
then be monitored, stopped, or restarted using the Amazon RDS Migration Console.
Full Load & CDC
The full load process creates files or tables at the
target database, automatically defines the metadata
that is required at the target, and populates the tables
with data from the source. Unlike the CDC process
the data is loaded one entire table or file at a time for
efficiency purposes.
The Load process can be interrupted and when
restarted it continues from wherever it was stopped.
New tables can be added to an existing target
without reloading the existing tables. Similarly,
columns in previously-populated target tables can be
added or dropped without requiring reloading.
CDC operates by reading the recovery log file of the source
database management system and grouping together the
entries for each transaction. Various techniques are employed
to ensure that this is done in an efficient manner without
seriously impacting the latency of the target data.
The Change Data Capture (CDC) process captures
changes in the source data or metadata as they occur
and applies them to the target database as soon as
possible in near-real-time. The changes are captured
and applied as units of single committed transactions,
and several different target tables can be updated as the
result of a single source commit.
Defining Global Transformation
Use Global Transformations to make similar changes to multiple tables, owners, and columns in the same
task.
You may need to use this option when you want to change the names of all tables. You can change the
names using wild cards and patterns. For example, you may want to change the names of the tables
from account_% to ac_%. This is helpful when replicating data from an Microsoft SQL Server database to
an Oracle database where the Microsoft SQL Server database has a limit of 128 characters for a table
name and the Oracle database has a limit of 31 characters.
You may also need to change a specific data type in the source to a different data type in the target for
many or all of the tables in the task. Global transformation will accomplish this without having to define a
transformation for each table individually.
Global Transformation types
Rename
Schema
Rename
Table
Rename
Column
Add
Column
Drop
Column
Convert
Data Type
Select this if you
want to change
the schema name
for multiple tables.
Select this if you
want to change
the name of
multiple tables.
Select this if you
want to change
the name of
multiple columns.
Select this if you
want to add a
column with a
similar name to
multiple tables.
Select this if you
want to drop a
column with a
similar name from
multiple tables.
Select this if you
want to change a
specific data type to
a different one
across multiple
tables.
THANK YOU
Follow Us:

More Related Content

Viewers also liked (16)

PDF
Big Data Building Blocks with AWS Cloud
Blazeclan Technologies Private Limited
 
PDF
Analyze Amazon CloudFront, S3 & ELB Logs with Cloudlytics - Part 1
Blazeclan Technologies Private Limited
 
PDF
Micro services on AWS
Blazeclan Technologies Private Limited
 
PPTX
Solving Big Data problems on AWS by Rajnish Malik
Blazeclan Technologies Private Limited
 
PDF
Cloudlytics Reporting: Analyze Amazon CloudFront, S3 & ELB Logs - Part 2
Blazeclan Technologies Private Limited
 
PDF
Productive Expansion on Amazon Web Services with BlazeClan
Blazeclan Technologies Private Limited
 
PPTX
Enterprise Cloud for your Business Applications
Blazeclan Technologies Private Limited
 
PDF
[TechTalks] Learning Configuration Management with SaltStack (Advanced Concepts)
Blazeclan Technologies Private Limited
 
PPTX
Overview of AWS Services for Media Content
Blazeclan Technologies Private Limited
 
PPTX
Life of data from generation to visualization using big data
Blazeclan Technologies Private Limited
 
PPTX
Overview of AWS Services for your Enterprise
Blazeclan Technologies Private Limited
 
PDF
[TechTalks] Effects of UI/ UX Designs on Customer Satisfaction & Loyalty
Blazeclan Technologies Private Limited
 
PDF
Amazon CloudFront Complete with Blazeclan's Media Solution Stack
Blazeclan Technologies Private Limited
 
PDF
Solving Big Data Industry Use Cases with AWS Cloud Computing
Blazeclan Technologies Private Limited
 
Big Data Building Blocks with AWS Cloud
Blazeclan Technologies Private Limited
 
Analyze Amazon CloudFront, S3 & ELB Logs with Cloudlytics - Part 1
Blazeclan Technologies Private Limited
 
Solving Big Data problems on AWS by Rajnish Malik
Blazeclan Technologies Private Limited
 
Cloudlytics Reporting: Analyze Amazon CloudFront, S3 & ELB Logs - Part 2
Blazeclan Technologies Private Limited
 
Productive Expansion on Amazon Web Services with BlazeClan
Blazeclan Technologies Private Limited
 
Enterprise Cloud for your Business Applications
Blazeclan Technologies Private Limited
 
[TechTalks] Learning Configuration Management with SaltStack (Advanced Concepts)
Blazeclan Technologies Private Limited
 
Overview of AWS Services for Media Content
Blazeclan Technologies Private Limited
 
Life of data from generation to visualization using big data
Blazeclan Technologies Private Limited
 
Overview of AWS Services for your Enterprise
Blazeclan Technologies Private Limited
 
[TechTalks] Effects of UI/ UX Designs on Customer Satisfaction & Loyalty
Blazeclan Technologies Private Limited
 
Amazon CloudFront Complete with Blazeclan's Media Solution Stack
Blazeclan Technologies Private Limited
 
Solving Big Data Industry Use Cases with AWS Cloud Computing
Blazeclan Technologies Private Limited
 

Similar to AWS RDS Migration Tool (20)

DOCX
Cassandra data modelling best practices
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 
PPTX
PPT SQL CLASS.pptx
AngeOuattara
 
PPTX
Introduction of ssis
deepakk073
 
PPTX
Azure Data Factory Data Flows Training (Sept 2020 Update)
Mark Kromer
 
PPT
Overview of query evaluation
avniS
 
PDF
What is Scalability and How can affect on overall system performance of database
Alireza Kamrani
 
PPT
Ms sql server architecture
Ajeet Singh
 
PPTX
introductionofssis-130418034853-phpapp01.pptx
YashaswiniSrinivasan1
 
PPTX
Azure Data Fundamentals DP 900 Full Course
Piyush sachdeva
 
PPTX
Apache Cassandra 2.0
Joe Stein
 
PDF
AWS ETL
clintonlmcvicker
 
DOCX
Microsoft Fabric data warehouse by dataplatr
ajaykumar405166
 
PDF
strategies-for-migrating-oracle-database-to-aws
Abdul Sathar Sait
 
PPTX
Data warehouse physical design
Er. Nawaraj Bhandari
 
POTX
Test Data Transfer Tool
Hai Nguyen
 
PPT
Chapter 4 event it theory programming.pptx
kmkkali41
 
PPT
Chapter02
sasa_eldoby
 
PPTX
2nd chapter dbms.pptx
kavitha623544
 
PPT
[PHPUGPH] PHP Roadshow - MySQL
Cherrie Ann Domingo
 
PPTX
Athena & AWS Glue for AWS Data analytics.pptx
krnaween
 
Cassandra data modelling best practices
Sandeep Sharma IIMK Smart City,IoT,Bigdata,Cloud,BI,DW
 
PPT SQL CLASS.pptx
AngeOuattara
 
Introduction of ssis
deepakk073
 
Azure Data Factory Data Flows Training (Sept 2020 Update)
Mark Kromer
 
Overview of query evaluation
avniS
 
What is Scalability and How can affect on overall system performance of database
Alireza Kamrani
 
Ms sql server architecture
Ajeet Singh
 
introductionofssis-130418034853-phpapp01.pptx
YashaswiniSrinivasan1
 
Azure Data Fundamentals DP 900 Full Course
Piyush sachdeva
 
Apache Cassandra 2.0
Joe Stein
 
Microsoft Fabric data warehouse by dataplatr
ajaykumar405166
 
strategies-for-migrating-oracle-database-to-aws
Abdul Sathar Sait
 
Data warehouse physical design
Er. Nawaraj Bhandari
 
Test Data Transfer Tool
Hai Nguyen
 
Chapter 4 event it theory programming.pptx
kmkkali41
 
Chapter02
sasa_eldoby
 
2nd chapter dbms.pptx
kavitha623544
 
[PHPUGPH] PHP Roadshow - MySQL
Cherrie Ann Domingo
 
Athena & AWS Glue for AWS Data analytics.pptx
krnaween
 
Ad

More from Blazeclan Technologies Private Limited (12)

PDF
2020 Recap | Clan's Transformational Journey In The New Normal
Blazeclan Technologies Private Limited
 
PDF
Reminiscing 2019 And Heading Toward A Brighter Future!
Blazeclan Technologies Private Limited
 
PDF
AWS Managed Services - BlazeClan Technologies
Blazeclan Technologies Private Limited
 
PDF
Cloudlytics: In Depth S3 & CloudFront Log Analysis - Featuring Reports
Blazeclan Technologies Private Limited
 
PDF
Amazon Reshift as your Data Warehouse Solution
Blazeclan Technologies Private Limited
 
PDF
Testing Framework on AWS Cloud - Solution Set
Blazeclan Technologies Private Limited
 
PDF
Cloud for Media - A Complete Solution Stack for Faster Cloud Adoption
Blazeclan Technologies Private Limited
 
PDF
5 Points to Consider - Enterprise Road Map to AWS Cloud
Blazeclan Technologies Private Limited
 
PDF
How cloud is fueling growth for online gaming
Blazeclan Technologies Private Limited
 
PDF
A guide on Aws Security Token Service
Blazeclan Technologies Private Limited
 
PDF
Working and Features of HTML5 and PhoneGap - An Overview
Blazeclan Technologies Private Limited
 
PDF
Cloud Migration Strategy - IT Transformation with Cloud
Blazeclan Technologies Private Limited
 
2020 Recap | Clan's Transformational Journey In The New Normal
Blazeclan Technologies Private Limited
 
Reminiscing 2019 And Heading Toward A Brighter Future!
Blazeclan Technologies Private Limited
 
AWS Managed Services - BlazeClan Technologies
Blazeclan Technologies Private Limited
 
Cloudlytics: In Depth S3 & CloudFront Log Analysis - Featuring Reports
Blazeclan Technologies Private Limited
 
Amazon Reshift as your Data Warehouse Solution
Blazeclan Technologies Private Limited
 
Testing Framework on AWS Cloud - Solution Set
Blazeclan Technologies Private Limited
 
Cloud for Media - A Complete Solution Stack for Faster Cloud Adoption
Blazeclan Technologies Private Limited
 
5 Points to Consider - Enterprise Road Map to AWS Cloud
Blazeclan Technologies Private Limited
 
How cloud is fueling growth for online gaming
Blazeclan Technologies Private Limited
 
A guide on Aws Security Token Service
Blazeclan Technologies Private Limited
 
Working and Features of HTML5 and PhoneGap - An Overview
Blazeclan Technologies Private Limited
 
Cloud Migration Strategy - IT Transformation with Cloud
Blazeclan Technologies Private Limited
 
Ad

Recently uploaded (20)

PDF
TrustArc Webinar - Navigating Data Privacy in LATAM: Laws, Trends, and Compli...
TrustArc
 
PPTX
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
PDF
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
PDF
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
PPTX
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 
PPTX
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
PDF
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
PDF
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
PPTX
Agentic AI in Healthcare Driving the Next Wave of Digital Transformation
danielle hunter
 
PDF
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
PDF
Market Insight : ETH Dominance Returns
CIFDAQ
 
PPTX
The Future of AI & Machine Learning.pptx
pritsen4700
 
PDF
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
PPTX
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
PDF
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
PDF
introduction to computer hardware and sofeware
chauhanshraddha2007
 
PDF
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
PDF
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
PPTX
Farrell_Programming Logic and Design slides_10e_ch02_PowerPoint.pptx
bashnahara11
 
PDF
Per Axbom: The spectacular lies of maps
Nexer Digital
 
TrustArc Webinar - Navigating Data Privacy in LATAM: Laws, Trends, and Compli...
TrustArc
 
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
Make GenAI investments go further with the Dell AI Factory
Principled Technologies
 
AI in Daily Life: How Artificial Intelligence Helps Us Every Day
vanshrpatil7
 
Applied-Statistics-Mastering-Data-Driven-Decisions.pptx
parmaryashparmaryash
 
OFFOFFBOX™ – A New Era for African Film | Startup Presentation
ambaicciwalkerbrian
 
GDG Cloud Munich - Intro - Luiz Carneiro - #BuildWithAI - July - Abdel.pdf
Luiz Carneiro
 
Agentic AI in Healthcare Driving the Next Wave of Digital Transformation
danielle hunter
 
Responsible AI and AI Ethics - By Sylvester Ebhonu
Sylvester Ebhonu
 
Market Insight : ETH Dominance Returns
CIFDAQ
 
The Future of AI & Machine Learning.pptx
pritsen4700
 
How ETL Control Logic Keeps Your Pipelines Safe and Reliable.pdf
Stryv Solutions Pvt. Ltd.
 
Dev Dives: Automate, test, and deploy in one place—with Unified Developer Exp...
AndreeaTom
 
Research-Fundamentals-and-Topic-Development.pdf
ayesha butalia
 
introduction to computer hardware and sofeware
chauhanshraddha2007
 
AI Unleashed - Shaping the Future -Starting Today - AIOUG Yatra 2025 - For Co...
Sandesh Rao
 
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
Farrell_Programming Logic and Design slides_10e_ch02_PowerPoint.pptx
bashnahara11
 
Per Axbom: The spectacular lies of maps
Nexer Digital
 

AWS RDS Migration Tool

  • 1. AWS RDS MIGRATION Presented by Hardik Shah Website www.blazeclan.com Follow us @clouditbetter Contact +91 9890 802 529
  • 2. KEY TAKEAWAYS Migrating Databases Migrating minimal databases with minimal downtime to AWS RDS, Amazon Redshift and Amazon Aurora On Premise to Cloud Migration of databases to same and different engines and from on premise to cloud Schema Conversion Schema conversion from Oracle and SQL Server to MySQL and Aurora
  • 3. Traditional Approach= Time, Cost Commercial tool for migration/replication Application Downtime Legacy Schema Objects
  • 4. Introducing AWS RDS Migration Tool Easy to setup and start migration in less than 15 mins No downtime of applications during migration Replicate from EC2 -> RDS or vice versa Move data to same or different database engines Cost effective and no upfront cost
  • 5. Keep your Apps running during the Migration
  • 6. Amazon RDS Migration Tool consists of a Web-based console and a replication server to replicate data across heterogeneous data sources. Amazon RDS Migration Tool can execute replication between enterprise databases including Oracle, Microsoft SQL Server, and IBM DB2. Replication is log based, which means that only the changes are read. This reduces the impact on the source databases. Amazon RDS Migration Tool can carry out two types of replication: Full Load and Change Processing (CDC).
  • 7. Load data efficiently and quickly to operational data stores/ warehouses Create copies of production databases Distribute data across databases Amazon RDS Migration Tool has high throughput, speed, and scale. Full Load: The full load process creates files or tables at the target database, automatically defines the metadata that is required at the target, and populates the tables with data from the source. Change Processing (CDC): Change processing captures changes in the source data or metadata as they occur and applies them to the target database as soon as possible in near-real-time. Features
  • 8. Load reduction: It is recommended that you have a copy of all or of a subset of a collection on a different server to reduce the load on the main server. Improved service: Users of the copy of the information may get better access to the copy of the data than to the original. Security considerations: Some users might be allowed access to a subset of the data and only this subset is made available as a replicated copy to those users. Geographic distribution: The enterprise (for example, a chain of retail stores or warehouses) may be widely distributed and each node uses primarily its own subset of the data (in addition to all of the data being available at a central location for less common use). Disaster Recovery: A copy of the main data is required for rapid failover (the capability to switch over to a redundant or standby computer server, in case of failure of the main system). Support the need for implementing "cloud" computing. Replication
  • 9. During replication, a collection of data is copied from system A to system B. A is known as the source (for this collection), B is known as the target. A system can be either a source or a target or even both (within certain restrictions). When a number of sources and targets and data collections are defined, the replication topology can be quite complex. Integrity: Make sure that the data in the target actually reflects the completed result of a change in the source and not some intermediate invalid result. Latency: How out-of-date is the copy? Consistency: Make sure that if the change affects several different tables or rows, the copy reflects a consistent state all were changed or none). The first two issues are the responsibility of the replicator. While some latency is unavoidable in any system, a good replicator will aim not to exceed several seconds of latency as a general rule.
  • 10. Replication Tasks The definition of a task consists of: Specifying the source and target databases Specifying the source and target tables to be kept in sync Specifying the relevant source table columns Specifying filtering conditions (if any) for each source table, as Boolean predicates on the values one or more source columns (the predicates are in SQLite syntax) Listing the target table columns and (optionally) specifying their data types and values (as expressions or functions over the values of one or more source or target columns, using SQL syntax). If not specified, the same column names and values as the source tables are used, with default mapping of the source DBMS data types onto the target DBMS data types. Amazon RDS Migration Tool automatically takes care of the required filtering, transformations and computations during the Load or CDC execution.
  • 11. Replication Tasks The simplest specification of a task may not mention of the target data, with only the source tables (or ALL, or a mask) specified. In this case, the target tables are identical to the source tables, using the default mappings between the source and target DBMS data types. In this way, the entire definition process could be accomplished by a single click, referred to as "Click to Replicate". Once a task is defined, it can be activated immediately. The target tables with the necessary metadata definitions are automatically created and loaded, and the CDC is activated. The replication activity can then be monitored, stopped, or restarted using the Amazon RDS Migration Console.
  • 12. Full Load & CDC The full load process creates files or tables at the target database, automatically defines the metadata that is required at the target, and populates the tables with data from the source. Unlike the CDC process the data is loaded one entire table or file at a time for efficiency purposes. The Load process can be interrupted and when restarted it continues from wherever it was stopped. New tables can be added to an existing target without reloading the existing tables. Similarly, columns in previously-populated target tables can be added or dropped without requiring reloading. CDC operates by reading the recovery log file of the source database management system and grouping together the entries for each transaction. Various techniques are employed to ensure that this is done in an efficient manner without seriously impacting the latency of the target data. The Change Data Capture (CDC) process captures changes in the source data or metadata as they occur and applies them to the target database as soon as possible in near-real-time. The changes are captured and applied as units of single committed transactions, and several different target tables can be updated as the result of a single source commit.
  • 13. Defining Global Transformation Use Global Transformations to make similar changes to multiple tables, owners, and columns in the same task. You may need to use this option when you want to change the names of all tables. You can change the names using wild cards and patterns. For example, you may want to change the names of the tables from account_% to ac_%. This is helpful when replicating data from an Microsoft SQL Server database to an Oracle database where the Microsoft SQL Server database has a limit of 128 characters for a table name and the Oracle database has a limit of 31 characters. You may also need to change a specific data type in the source to a different data type in the target for many or all of the tables in the task. Global transformation will accomplish this without having to define a transformation for each table individually.
  • 14. Global Transformation types Rename Schema Rename Table Rename Column Add Column Drop Column Convert Data Type Select this if you want to change the schema name for multiple tables. Select this if you want to change the name of multiple tables. Select this if you want to change the name of multiple columns. Select this if you want to add a column with a similar name to multiple tables. Select this if you want to drop a column with a similar name from multiple tables. Select this if you want to change a specific data type to a different one across multiple tables.