Hitachi Vantara Certified Specialist - Pentaho Data Integration Implementation Questions and Answers
A transformation is running in a production environment and you want to monitor it in real time.
Which tool should you use?
You have completed a successful installation of a Pentaho server on Linux.
You now need to write a script to run the Pentaho server as a service.
Which two files should you call from the script? (Choose two.)
Choose 2 answers
Which statement is true for a transformation?
You have a string field in your dataset where you need to extract characters 1-5 only.
Which two steps will accomplish this task? (Choose two.)
Choose 2 answers
You need to load data from many CSV files into a database and you want to minimize the number of PDI jobs and transformations that need to be maintained.
In which two scenarios is Metadata injection the recommend option? (Choose two.)
Choose 2 answers
You need to process data on the nodes within a Hadoop cluster. To accomplish this task, you write a mapper and reducer transformation and use the Pentaho MapReduce entry to execute the MapReduce job on the cluster.
In this scenario, which two steps are required within the transformations? (Choose two.)
Choose 2 answers
You have a PDI input step that generates data within a transformation.
Which two statements are true about downstream steps in this scenario? (Choose two.)
Choose 2 answers
You have multiple transformations that read and process data from multiple text files. You identity a series of steps that are common across transformations and you want to re-use them to avoid duplication of code.
How doyou accomplish this?
A Big Data customer wants to run POI transformations on Spark on their production Hadoop cluster using Pentaho's Adaptive Execution Layer (AEL)
What are two steps for installing AEL? (Choose two.)
Choose 2 answers