Pentaho

 View Only

 PDI ETL Job

  • Pentaho
  • Kettle
  • Pentaho
  • Pentaho Data Integration PDI
Rajesh Manivannan's profile image
Rajesh Manivannan posted 09-24-2019 13:21

Hello,

 

I am new to PDI, to explore PDI I installed the trial version. Trying to understand how to complete the database connection. Our requirements are copying production data to QA by randomizing personally identifiable information.

Could you please help me to understand about:

  1. Database connection
  2. How to create a ETL job to copy production data from MySQL database ?
  3. Randomizing procedure
  4. Loading to target MySQL database

Thanks

Rajesh

 

=============================================

09252019:

 

By watching a video in you tube, I'm able to connect to the database.

 

Created a simple transformation by selecting a table in source database. Still trying to understand how to use "Replace in String" to generate random data. Or is there any other option to define random data generator in PDI from actual value ?

 

Thanks !!

Rajesh

/pentaho Datamigration

 

 

 

 


#Kettle
#Pentaho
#PentahoDataIntegrationPDI
Attachment  View in library
PDI ETL Job 47 KB
Paulo Pires's profile image
Paulo Pires

Hi Rajesh,

 

you can look at https://help.pentaho.com/Documentation/8.3/Setup/Define_data_connections

 

Best regards

Ana Gonzalez's profile image
Ana Gonzalez

In your PDI directory installation, in ~/data-integration/samples/trans, you have an example on how to use the Replace in string step, the transformation name is Replace in string - Simple example.ktr you can take a look at it.

Regards