Interview Questions.

Top 100+ Pentaho BI Interview Questions And Answers

Question 1. Explain Pentaho?

Answer :

Pentaho addresses the barriers that block an organization's ability to get value from all of its data. The platform is designed to ensure that every member of a team, from developers to business users, can easily turn data into value.

Question 2. Mention Major Features Of Pentaho?

Answer :

Direct Analytics on MongoDB: It enables business analysts and IT to access, analyze, and visualize MongoDB data.

Science Pack: Pentaho's Data Science Pack operationalizes analytical modeling and machine learning while allowing data scientists and developers to offload the labor of data preparation to Pentaho Data Integration.

Full YARN Support for Hadoop: Pentaho's YARN integration enables organizations to exploit the full computing power of Hadoop while leveraging existing skillsets and technology investments.

Question 3. Define Pentaho BI Project?

Answer :

The Pentaho BI Project is an ongoing effort by the Open Source community to provide organizations with best-in-class solutions for their enterprise Business Intelligence (BI) needs.

Question 4. What Major Applications Does The Pentaho BI Project Comprise?

Answer :

The Pentaho BI Project encompasses the following major application areas:

Business Intelligence Platform
Data Mining
Reporting
Dashboards
Question 5. Who Benefits From The Pentaho BI Project?

Answer :

Java developers who generally use project components to rapidly assemble custom BI solutions
ISVs who can enhance the value and capability of their solutions by embedding BI functionality
End-Users who can quickly deploy packaged BI solutions which are either comparable or superior to traditional commercial offerings at a dramatically lower cost.
Question 6. Is Pentaho A Trademark?

Answer :

Yes, Pentaho is a trademark.

Question 7. What Do You Understand By Pentaho Metadata?

Answer :

Pentaho Metadata is a piece of the Pentaho BI Platform designed to make it easier for users to access information in business terms.

Question 8. How Does Pentaho Metadata Work?

Answer :

With the help of Pentaho's open source metadata capabilities, administrators can define a layer of abstraction that presents database information to business users in familiar business terms.
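
The idea of such an abstraction layer can be sketched in a few lines of Python. This is only an illustration of the concept, not Pentaho's metadata format: the business terms, table names, and column names below are all hypothetical.

```python
# Hypothetical mapping from business terms to physical columns (illustration only).
BUSINESS_MODEL = {
    "Customer Name": "cust.c_nm",
    "Order Total": "ord.tot_amt",
}
# Hypothetical physical tables and join condition hidden from the business user.
TABLES = "orders ord JOIN customers cust ON ord.cust_id = cust.id"


def build_query(business_fields):
    """Translate a list of business terms into a SQL SELECT over physical columns."""
    columns = [f'{BUSINESS_MODEL[name]} AS "{name}"' for name in business_fields]
    return f"SELECT {', '.join(columns)} FROM {TABLES}"


query = build_query(["Customer Name", "Order Total"])
print(query)
```

The business user only picks "Customer Name" and "Order Total"; the cryptic column names and the join stay inside the layer that the administrator maintains.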

Question 9. What Is The Pentaho Reporting Evaluation?

Answer :

Pentaho Reporting Evaluation is a particular package of a subset of the Pentaho Reporting capabilities, designed for typical first-phase evaluation activities such as accessing sample data, creating and editing reports, and viewing and interacting with reports.

Question 10. Explain MDX?

Answer :

Multidimensional Expressions (MDX) is a query language for OLAP databases, much as SQL is a query language for relational databases. It is also a calculation language, with syntax similar to spreadsheet formulas.

Question 11. Define Tuple?

Answer :

A finite ordered list of elements is called a tuple.

Question 12. What Kind Of Data Does A Cube Contain?

Answer :

The Cube will contain the following data:

3 Fact fields: Sales, Costs and Discounts
Time Dimension: with the following hierarchy: Year, Quarter and Month
2 Customer Dimensions: one with location (Region, Country) and the other with Customer Group and Customer Name
Product Dimension: containing a Product Name
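
As a rough illustration of how such a cube is addressed, the Python sketch below aggregates a handful of made-up fact rows by a chosen set of dimension levels. It is simplified (the Customer Group and Customer Name dimension is left out), the sample rows are hypothetical, and it does not reflect how an OLAP engine stores data. Each key in the result is a tuple that identifies one cell.

```python
from collections import defaultdict

# Hypothetical fact rows:
# (year, quarter, month, region, country, product, sales, costs, discounts)
FACTS = [
    (2023, "Q1", "Jan", "Europe", "Belgium", "Widget", 100.0, 60.0, 5.0),
    (2023, "Q1", "Feb", "Europe", "Belgium", "Widget", 150.0, 90.0, 0.0),
    (2023, "Q2", "Apr", "Europe", "France", "Gadget", 200.0, 120.0, 10.0),
]


def aggregate(facts, key_indexes):
    """Sum Sales, Costs and Discounts for each combination of the chosen levels."""
    cells = defaultdict(lambda: [0.0, 0.0, 0.0])
    for row in facts:
        key = tuple(row[i] for i in key_indexes)
        for j, value in enumerate(row[6:9]):
            cells[key][j] += value
    return {key: tuple(totals) for key, totals in cells.items()}


# Roll up to (Year, Quarter, Region); each key is a tuple addressing one cell.
by_quarter_region = aggregate(FACTS, key_indexes=(0, 1, 3))
print(by_quarter_region[(2023, "Q1", "Europe")])
```

Choosing different `key_indexes` corresponds to slicing and dicing the same facts along other dimensions.
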
Question 13. Differentiate Between Transformations And Jobs?

Answer :

Transformations are about moving and transforming rows from source to target.
Jobs are more about high-level flow control.
Question 14. How To Do A Database Join With PDI?

Answer :

If we want to join 2 tables from the same database, we can use a "Table Input" step and do the join in SQL itself.
If we want to join 2 tables that are not in the same database, we can use the "Database Join" step.
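
The second case can be sketched outside PDI as a per-row lookup: for every row coming from one database, a parameterized query is run against the other. The Python example below is a minimal illustration under that assumption; the two in-memory SQLite databases, the table names, and the data are hypothetical stand-ins for two separate servers.

```python
import sqlite3

# Two separate in-memory databases stand in for two different servers.
orders_db = sqlite3.connect(":memory:")
orders_db.execute("CREATE TABLE orders (id INTEGER, customer_id INTEGER, amount REAL)")
orders_db.executemany("INSERT INTO orders VALUES (?, ?, ?)", [(1, 10, 99.5), (2, 11, 20.0)])

customers_db = sqlite3.connect(":memory:")
customers_db.execute("CREATE TABLE customers (id INTEGER, name TEXT)")
customers_db.executemany("INSERT INTO customers VALUES (?, ?)", [(10, "Acme"), (11, "Globex")])


def join_across_databases():
    """For each order row, look up the customer with a parameterized query."""
    joined = []
    for order_id, customer_id, amount in orders_db.execute(
        "SELECT id, customer_id, amount FROM orders ORDER BY id"
    ):
        match = customers_db.execute(
            "SELECT name FROM customers WHERE id = ?", (customer_id,)
        ).fetchone()
        joined.append((order_id, match[0] if match else None, amount))
    return joined


result = join_across_databases()
print(result)
```

When both tables live in the same database, a single SQL `JOIN` is usually the better choice, since it avoids one lookup query per input row.
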
Question 15. How To Sequentialize Transformations?

Answer :

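It is not possible within a transformation, as in PDI transformations all of the steps run in parallel, so we can't sequentialize them. To run whole transformations one after another, call them from a job, which executes its entries in sequence.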

Question 16. How Can We Use Database Connections From The Repository?

Answer :

We can create a new transformation, or close and re-open the ones we have loaded in Spoon.

Question 17. How Do You Insert Booleans Into A MySQL Database? PDI Encodes A Boolean As 'Y' Or 'N' And This Can't Be Inserted Into A BIT(1) Column In MySQL.

Answer :

BIT is not a standard SQL data type. It's not even standard on MySQL, as the meaning (core definition) changed from MySQL version 4 to 5.

Also, a BIT uses 2 bytes on MySQL. That's why in PDI we made the safe choice and went for a char(1) to store a boolean. There is a simple workaround available: change the data type with a Select Values step to "Integer" in the metadata tab. This converts it to 1 for "true" and 0 for "false", just like MySQL expects.
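
The conversion that the workaround performs can be expressed as a small Python function. This is a sketch of the mapping only, not of PDI internals; the function name and the handling of null values are assumptions made for the example.

```python
def boolean_flag_to_int(flag):
    """Map the 'Y'/'N' encoding to 1/0; None is passed through as SQL NULL."""
    if flag is None:
        return None
    mapping = {"Y": 1, "N": 0}
    try:
        return mapping[flag.strip().upper()]
    except KeyError:
        raise ValueError(f"not a Y/N flag: {flag!r}")


converted = [boolean_flag_to_int(v) for v in ["Y", "N", "y", None]]
print(converted)
```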

Question 18. By Default All Steps In A Transformation Run In Parallel, How Can We Make It So That 1 Row Gets Processed Completely Until The End Before The Next Row Is Processed?

Answer :

This is not possible, as in PDI transformations all of the steps run in parallel. So we can't sequentialize them. This would require architectural changes to PDI, and sequential processing would also result in very slow processing.
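
The execution model described here, where every step runs at the same time and rows stream between steps through buffers, can be imitated with threads and queues. The Python sketch below is an analogy rather than PDI's implementation; the function names, the buffer size, and the two sample steps are hypothetical.

```python
import queue
import threading

DONE = object()  # sentinel marking the end of the row stream


def run_step(transform, inbox, outbox):
    """Read rows from inbox, apply transform, and pass results downstream."""
    while True:
        row = inbox.get()
        if row is DONE:
            outbox.put(DONE)
            return
        outbox.put(transform(row))


def run_pipeline(rows, transforms, buffer_size=2):
    """Start one thread per step; all steps run concurrently on streaming rows."""
    buffers = [queue.Queue(maxsize=buffer_size) for _ in range(len(transforms) + 1)]
    threads = [
        threading.Thread(target=run_step, args=(t, buffers[i], buffers[i + 1]))
        for i, t in enumerate(transforms)
    ]

    def feed():
        for row in rows:
            buffers[0].put(row)
        buffers[0].put(DONE)

    threads.append(threading.Thread(target=feed))
    for thread in threads:
        thread.start()

    # Drain the last buffer in the main thread until the sentinel arrives.
    results = []
    while True:
        row = buffers[-1].get()
        if row is DONE:
            break
        results.append(row)
    for thread in threads:
        thread.join()
    return results


output = run_pipeline(range(5), [lambda x: x + 1, lambda x: x * 10])
print(output)
```

While the second step is still working on one row, the first step is already handling the next, so no single row is carried to the end before the following one starts.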

Question 19. Why Can’t We Duplicate Fieldnames In A Single Row?

Answer :

We can't. PDI will complain in most cases if we have duplicate fieldnames. Before PDI v2.5.0 we were able to force duplicate fields, but even then only the first value of the duplicate fields could ever be used.

Question 20. What Are The Benefits Of Pentaho?

Answer :

1. Open source
2. Has a community that supports its users
3. Runs well on multiple platforms (Windows, Linux, Macintosh, Solaris, Unix, and so on)
4. Offers a complete package covering reporting, ETL for data warehouse management, OLAP server, data mining, and dashboards

Question 21. Differentiate Between Arguments And Variables?

Answer :

Arguments are command line arguments that we would normally specify during batch processing.

Variables are environment or PDI variables that we would normally set in a previous transformation in a job.

Question 22. What Are The Applications Of Pentaho?

Answer :

1. Pentaho Suite

BI Platform (JBoss Portal)
Pentaho Dashboard
JFreeReport
Mondrian
Kettle
Weka
2. All are built on the Java platform

Question 23. What Do You Understand By The Term Pentaho Dashboard?

Answer :

Pentaho Dashboards give business users the critical information they need to understand and improve organizational performance.

Question 24. What Is The Use Of Pentaho Reporting?

Answer :

Pentaho Reporting allows organizations to easily access, format and deliver information to employees, customers and partners.

Question 25. Define Pentaho Schema Workbench?

Answer :

Pentaho Schema Workbench provides a graphical interface for designing OLAP cubes for Pentaho Analysis.

Question 26. Define Pentaho Data Mining?

Answer :

Pentaho Data Mining uses the Waikato Environment for Knowledge Analysis (Weka) to search data for patterns. It has functions for data processing, regression analysis, classification methods, and so on.

Question 27. Brief About Pentaho Report Designer?

Answer :

It is a visual, banded report writer. It has various features such as subreports, charts, graphs, and so on.

Question 28. What Do You Understand By The Term ETL?

Answer :

ETL (Extract, Transform, Load) is an entry-level tool for data manipulation.

Question 29. What Do You Understand By Hierarchical Navigation?

Answer :

A hierarchical navigation menu allows the user to go directly to a section of the site several levels below the top.

Question 30. What Are The Steps To Decrypt A Folder Or File?

Answer :

Right-click the folder or file we want to decrypt, and then click Properties.
Click the General tab, and then click Advanced.
Clear the "Encrypt contents to secure data" check box, click OK, and then click OK again.
Question 31. Explain Encrypting File System?

Answer :

It is the technology which enables files to be transparently encrypted to protect confidential data from attackers with physical access to the computer.

Question 32. What Do You Mean By Repository?

Answer :

A repository is a storage area where we can store data safely without any harm.

Question 33. Explain Why We Need An ETL Tool?

Answer :

An ETL tool is used to get data from many source systems such as RDBMS, SAP, and so on, and convert it based on the user requirement. It is required when data flows across many systems.

Question 34. What Is The ETL Process? Write The Steps Also?

Answer :

ETL is the extraction, transformation, and loading process. The steps are:

1 – define the source
2 – define the target
3 – create the mapping
4 – create the session
5 – create the workflow
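
The three phases themselves can be sketched in plain Python, independent of any ETL tool. In this minimal example the CSV text, the `sales` table, and the cleaning rules are all hypothetical; the source and target are defined first, and the `transform` function plays the role of the mapping.

```python
import csv
import io
import sqlite3

# Hypothetical source data with inconsistent formatting.
SOURCE_CSV = "id,name,amount\n1, alice ,10.5\n2,BOB,7\n"


def extract(text):
    """Extract: read raw rows from the source."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: convert types and normalize names."""
    return [(int(r["id"]), r["name"].strip().title(), float(r["amount"])) for r in rows]


def load(conn, rows):
    """Load: write the cleaned rows into the target table."""
    conn.execute("CREATE TABLE IF NOT EXISTS sales (id INTEGER, name TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    conn.commit()


target = sqlite3.connect(":memory:")
load(target, transform(extract(SOURCE_CSV)))
loaded = target.execute("SELECT id, name, amount FROM sales ORDER BY id").fetchall()
print(loaded)
```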

Question 35. What Is Metadata?

Answer :

Metadata is stored in the repository by associating information with individual objects in the repository.

Question 36. What Are Snapshots?

Answer :

Snapshots are read-only copies of a master table located on a remote node, which can be periodically refreshed to reflect changes made to the master table.

Question 37. What Is Data Staging?

Answer :

Data staging is actually a group of procedures used to prepare source system data for loading a data warehouse.

Question 38. Differentiate Between Full Load And Incremental Load?

Answer :

Full Load means completely erasing the contents of one or more tables and reloading them with fresh data.
Incremental Load means applying ongoing changes to one or more tables based on a predefined schedule.
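
The difference can be sketched with a dictionary standing in for the target table. The change-record format (`op`, `id`, `row`) is an assumption made for this example, not a standard.

```python
def full_load(target, source_rows):
    """Erase the target table and refill it from the source."""
    target.clear()
    target.update({row["id"]: row for row in source_rows})


def incremental_load(target, changes):
    """Apply only the changes captured since the previous run."""
    for change in changes:
        if change["op"] == "delete":
            target.pop(change["id"], None)
        else:  # insert or update
            target[change["id"]] = change["row"]


table = {}
full_load(table, [{"id": 1, "qty": 5}, {"id": 2, "qty": 8}])
incremental_load(table, [
    {"op": "upsert", "id": 2, "row": {"id": 2, "qty": 9}},
    {"op": "upsert", "id": 3, "row": {"id": 3, "qty": 1}},
    {"op": "delete", "id": 1},
])
print(table)
```

A full load is simpler but touches every row on each run, while an incremental load moves only the changed rows and depends on the changes being tracked reliably.
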
Question 39. Define Mapping?

Answer :

Dataflow from source to target is called a mapping.

Question 40. Explain Session?

Answer :

It is a set of instructions which tell when and how to move data from the respective source to the target.

Question 41. What Is Workflow?

Answer :

It is a set of instructions which tell the Informatica server how to execute the task.

Question 42. Define Mapplet?

Answer :

It creates and configures a set of transformations.

Question 43. What Do You Understand By Three Tier Data Warehouse?

Answer :

A data warehouse is said to be a three-tier system where a middle system provides usable data in a secure way to end users. On either side of this middle system are the end users and the back-end data stores.

Question 44. What Is ODS?

Answer :

ODS is Operational Data Store, which sits between the data warehouse and the staging area.

Question 45. Differentiate Between ETL Tool And OLAP Tool?

Answer :

An ETL tool is used for extracting data from the legacy system and loading it into a specified database, with some processing to cleanse the data.

An OLAP tool is used for reporting. Here data is available in a multidimensional model, so we can write simple queries to extract data from the database.

Question 46. What Is XML?

Answer :

XML is Extensible Markup Language, which defines a set of rules for encoding documents in a format that is both human readable and machine readable.

Question 47. What Are Various Tools In ETL?

Answer :

Ab Initio, DataStage, Informatica, Cognos Decision Stream, etc.

Question 48. Define MDX?

Answer :

MDX is Multidimensional Expressions, the main query language implemented by Mondrian.

Question 49. Define Multi-dimensional Cube?

Answer :

It is a cube for viewing data in which we can slice and dice the data. It has a time dimension, locations, and figures.

Question 50. How Do You Duplicate A Field In A Row In A Transformation?

Answer :

Several solutions exist:

Use a "Select Values" step, renaming a field while also selecting the original one. The result will be that the original field is duplicated under another name.

It will look as follows:

This will duplicate fieldA to fieldB and fieldC.

Use a Calculator step and use e.g. the NVL(A,B) operation as follows:

This will have the same effect as the first solution: 3 fields in the output which are copies of each other: fieldA, fieldB, and fieldC.

Use a JavaScript step to copy the field:

This will have the same effect as the previous solutions: 3 fields in the output which are copies of each other: fieldA, fieldB, and fieldC.
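
The end result of all three approaches can be sketched in Python with a row modelled as a dictionary. This is not PDI code and does not stand in for the step configuration; the function name is hypothetical, and the check for existing names only mirrors the rule that fieldnames within a row must be unique.

```python
def duplicate_field(row, source, copies):
    """Return a new row with `source` copied under each name in `copies`."""
    duplicated = dict(row)
    for name in copies:
        if name in duplicated:
            raise ValueError(f"field name already exists: {name}")
        duplicated[name] = row[source]
    return duplicated


row = duplicate_field({"fieldA": 42}, "fieldA", ["fieldB", "fieldC"])
print(row)
```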

Question 51. Why Can’t I Duplicate Fieldnames In A Single Row?

Answer :

You can't. PDI will complain in most cases if you have duplicate fieldnames. Before PDI v2.5.0 you were able to force duplicate fields, but even then only the first value of the duplicate fields could ever be used.

Question 52. I’ve Got A Transformation That Doesn’t Run Fast Enough, But It Is Hard To Tell In What Order To Optimize The Steps. What Should I Do?

Answer :

Transformations stream data through their steps:

That means that the slowest step is going to determine the speed of a transformation.
So you optimize the slowest steps first. How can you tell which step is the slowest? Look at the size of the input buffer in the log view.
In the latest 3.1.0-M1 nightly build you will also find a graphical overview of this: http://www.ibridge.be/?p=92
(the "graph" button at the bottom of the log view will show the details).
A slow step will have consistently large input buffer sizes. A fast step will consistently have low input buffer sizes.
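
Why rows pile up in front of the slowest step can be seen in a small tick-based simulation. This Python sketch is a toy model and not a description of PDI's engine: it assumes the source emits one row per tick, that a step with cost `c` needs `c` ticks per row, and that buffers are unbounded, which a real engine would not allow.

```python
def simulate(step_costs, total_rows):
    """Return the peak number of rows waiting in each step's input buffer."""
    n = len(step_costs)
    waiting = [0] * n    # rows in each step's input buffer (including the one in work)
    progress = [0] * n   # ticks already spent on the row currently in work
    peak = [0] * n
    emitted = finished = 0
    while finished < total_rows:
        if emitted < total_rows:  # the source produces one row per tick
            waiting[0] += 1
            emitted += 1
        # Visit steps from last to first so a row advances at most one step per tick.
        for i in reversed(range(n)):
            if waiting[i] == 0:
                continue
            progress[i] += 1
            if progress[i] == step_costs[i]:
                progress[i] = 0
                waiting[i] -= 1
                if i + 1 < n:
                    waiting[i + 1] += 1
                else:
                    finished += 1
        peak = [max(p, w) for p, w in zip(peak, waiting)]
    return peak


# The middle step is three times slower than its neighbours.
peak = simulate(step_costs=[1, 3, 1], total_rows=10)
print(peak)
```

In this model the largest buffer builds up in front of the slow middle step, which is the same signal the log view gives for a real transformation.
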
Question 53. We Will Be Using PDI Integrated In A Web Application Deployed On An Application Server. We've Created A JNDI Datasource In Our Application Server. Of Course Spoon Doesn't Run In The Context Of The Application Server, So How Can We Use The JNDI Data Source In PDI?

Answer :

If you look in the PDI main directory you will see a sub-directory "simple-jndi", which contains a file called "jdbc.properties". You need to change this file so that the JNDI information matches the one you use in your application server.

After that, in the connection tab of Spoon, set the "Method of access" to JNDI, the "Connection type" to the type of database you're using, and the "Connection name" to the name of the JNDI datasource (as used in "jdbc.properties").



