ETL Metadata Injection
David Fombella Pombal
http://anonymousbi.wordpress.com
@Pentaho_Fan
About me
David Fombella Pombal
BI Consultant, Technical Developmental Editor and Pentaho Community Guy
Introduction
•The ETL Metadata Injection step injects step metadata into a template transformation. Instead of entering ETL metadata statically in a step dialog, you pass it in at run time. This makes it possible to handle repetitive ETL workloads, such as loading text files or migrating data.
Problem description
•We need to load cost files whose header is dynamic
•Budget files contain 12 months of information
•Forecast files contain between 1 and 12 months
•3 columns are fixed: Entity, Account and Cost Centre
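As a rough illustration of the metadata that has to be worked out at run time for such files, the sketch below reads only the header line and splits it into fixed columns and month columns. It is plain Python, not PDI. The semicolon delimiter, the month labels, the helper name `describe_header` and the assumption that the three fixed columns come first are all made up for the example.

```python
import csv
import io

# Assumption: Entity, Account and Cost Centre are the first three columns.
FIXED_COLUMNS = 3


def describe_header(text, delimiter=";"):
    """Return (fixed_fields, month_fields) taken from the first line of a cost file."""
    header = next(csv.reader(io.StringIO(text), delimiter=delimiter))
    fixed = header[:FIXED_COLUMNS]
    months = header[FIXED_COLUMNS:]
    # Budget files carry 12 month columns, forecast files between 1 and 12.
    if not 1 <= len(months) <= 12:
        raise ValueError(f"expected 1 to 12 month columns, got {len(months)}")
    return fixed, months


# Hypothetical forecast file with three month columns.
sample = "Entity;Account;Cost Centre;2014-10;2014-11;2014-12\nE1;A100;CC1;10;20;30\n"
fixed, months = describe_header(sample)
print(fixed)
print(months)
```

The two lists are the kind of per-file information that would be passed to the template transformation instead of being typed into a step dialog.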
File Types
•Budget file containing 2015 (12 months)
•Forecast file October 2014 (3 months)
Template transformation (Text file input)
Template transformation (Row normaliser)
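The slide itself is a screenshot, so the exact configuration is not recoverable here. As a minimal sketch of what a row normaliser does conceptually, the Python below turns one wide row with several month columns into one narrow row per month. The function name `normalise` and the output field names `Month` and `Amount` are assumptions for illustration, not the names used in the template.

```python
def normalise(row, fixed_fields, month_fields):
    """Turn one wide row (a dict) into one narrow row per month column."""
    base = {name: row[name] for name in fixed_fields}
    # Each month column becomes its own row, tagged with the column it came from.
    return [{**base, "Month": month, "Amount": row[month]} for month in month_fields]


# Hypothetical wide row from a forecast file with two month columns.
wide = {
    "Entity": "E1",
    "Account": "A100",
    "Cost Centre": "CC1",
    "2014-10": "10",
    "2014-11": "20",
}
rows = normalise(wide, ["Entity", "Account", "Cost Centre"], ["2014-10", "2014-11"])
for r in rows:
    print(r)
```

Because the list of month columns differs from file to file, it is this list that has to be supplied at run time.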
Load file Metadata Injection 1
Load file Metadata Injection 2
Load file Metadata Injection 3
Load file after Injection
Non-Native Metadata Injection
Metadata Injection is one of the most powerful features of PDI. However, only some steps support it. PDI has hundreds of steps, with more being added weekly, so there will always be a scenario where the step you want to inject into doesn't support it.
So what do you do? Use the API. The API is how PDI itself sets the metadata for a step.
The steps are:
1. Open the transformation file
2. Find the step(s) you want to change
3. Configure them accordingly
4. Save the file
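The four steps above can be sketched in a few lines. Note that this is a swapped-in technique: the slides use the PDI API, whereas the sketch below edits the transformation file directly as XML with Python's standard library. It assumes the file is XML in which each step is a `<step>` element with a `<name>` child; the function name `set_step_setting`, the `filename` setting and the file names are hypothetical.

```python
import os
import tempfile
import xml.etree.ElementTree as ET


def set_step_setting(ktr_in, ktr_out, step_name, tag, value):
    """Set one child element of every step called step_name; return how many changed."""
    tree = ET.parse(ktr_in)                        # 1. open the transformation file
    changed = 0
    for step in tree.getroot().iter("step"):       # 2. find the step(s) to change
        if step.findtext("name") == step_name:
            node = step.find(tag)
            if node is None:
                node = ET.SubElement(step, tag)
            node.text = value                      # 3. configure it
            changed += 1
    if changed == 0:
        raise LookupError(f"no step named {step_name!r} in {ktr_in}")
    tree.write(ktr_out, encoding="UTF-8", xml_declaration=True)  # 4. save the file
    return changed


# Hypothetical, heavily simplified template used only to exercise the function.
template = (
    "<transformation><step><name>Load file</name>"
    "<type>Dummy</type></step></transformation>"
)
with tempfile.TemporaryDirectory() as tmp:
    src = os.path.join(tmp, "template.ktr")
    dst = os.path.join(tmp, "injected.ktr")
    with open(src, "w", encoding="utf-8") as fh:
        fh.write(template)
    set_step_setting(src, dst, "Load file", "filename", "forecast.csv")
    result = ET.parse(dst).getroot().find("step").findtext("filename")
print(result)
```

Writing to a separate output file leaves the template untouched, so the same template can be reused for the next input file.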
Non-Native Metadata Injection (template)
Non-Native Metadata Injection (Inject 1)
Non-Native Metadata Injection (Inject 2)
Non-Native Metadata Injection (Inject 3)
Non-Native Metadata Injection
Thanks to…

ETL Metadata Injection with Pentaho Data Integration
