Questions tagged [pentaho]

The Pentaho BI Suite is an open source business intelligence suite with integrated reporting, dashboard, data mining, workflow, and ETL capabilities. It is primarily written in Java.

The latest GA release of Pentaho Suite has four primary downloadable products.

PDI - Pentaho Data Integration is a graphical extract, transform, and load (ETL) solution that uses an innovative metadata-driven approach. PDI includes the DI Server, a design tool (aka Spoon), three utilities, and several plugins.

PBA - Pentaho Business Analytics formally known as Pentaho Business Intelligence (BI) is mainly used for dashboard development.

PRD - Pentaho Report Designer is used for developing business (B2B and B2C) reports.

PSW - Pentaho Schema workbench is used for developing OLAP cubes and schemas.

3854 questions
78
votes
9 answers

Pentaho Data Integration SQL connection

I am using Pentaho Data Integration and I am trying to connect to my database via MySQL but when I do I get this error..... Error connecting to database [devdb2] : org.pentaho.di.core.exception.KettleDatabaseException: Error occured while trying to…
user979331
  • 11,039
  • 73
  • 223
  • 418
50
votes
10 answers

How to get last 7 days data from current datetime to last 7 days in sql server

Hi I am loading table A data from sql server to mysql using pentaho when loading data i need to get only last 7 days data from sql server A table to mysql In sql server createddate column data type is like datetime AND In mysql created_on column…
SRI
  • 531
  • 1
  • 4
  • 11
38
votes
9 answers

Pentaho vs Microsoft BI Stack

My company is heavily invested in the MS BI Stack (SQL Server Reporting Services, -Analysis Services and -Integration Services), but I want to have a look at what the seemingly most talked about open-source alternative Pentaho is like. I've…
Tomas
  • 3,384
  • 2
  • 26
  • 28
27
votes
5 answers

Books to start learning big data

I would like to start learning about the big data technologies. I want to work in this area in the future. Does anyone know good books to start learning about it? Hadoop, HBase. Beginner - intermediate - advanced - Thanks in advance
Gunter Amorim
  • 77
  • 1
  • 5
  • 14
23
votes
5 answers

"RecursionError: maximum recursion depth exceeded" from ssl.py: `super(SSLContext, SSLContext).options.__set__(self, value)`

I am using Python 3.6.5 on the following remote server setup: Server: Windows 10 Python: 3.6.5 Requests: 2.18.4 Pentaho: 8.0 When I run request.get against URLs in the server's command prompt, it gets the JSON as expected: >>> import…
user3871
  • 12,432
  • 33
  • 128
  • 268
22
votes
1 answer

Using Pentaho Kettle, how do I load multiple tables from a single table while keeping referential integrity?

Need to load data from a single file with a 100,000+ records into multiple tables on MySQL maintaining the relationships defined in the file/tables; meaning the relationships already match. The solution should work on the latest version of MySQL,…
blunders
  • 3,619
  • 10
  • 43
  • 65
20
votes
2 answers

Using Pentaho Kettle, how do I automatically retry rest requests which fail due to connection hiccups?

How can we make Pentaho retry rest requests on connection errors? We have a Pentaho BI system which, among numerous data sources, is querying a particular REST api for over 20k query variations each run. Predicatably, on most runs a few of these…
ms-tg
  • 2,688
  • 23
  • 18
17
votes
5 answers

If Pentaho is Open Source, can I just use it?

I see that Pentaho wants to charge me for their software. How can I get to the underlying Open Source software for dashboards to see what it can do without having to deal with Pentaho marketing folks?
Suds
17
votes
9 answers

Unable to connect to oracle via Pentaho. 'oracle.jdbc.driver.OracleDriver' could not be found

Hi I've just installed Pentaho Data Integration v. 5.01 and I'm trying to connect to a database. However, I'm unable to test the connection as I get the error (pasted below). I have JDK 1.7 installed on the system as well, however I can't seem to…
Joshua1729
  • 775
  • 4
  • 10
  • 19
15
votes
4 answers

Kettle / Pentaho Data Integration - unable to create a Database Connection (XulException: java.lang.reflect.InvocationTargetException)

Having finally got kettle to start and not hang, I still cannot use it to much avail, as when I try to create a new Database Connection (after creating a new Transformation) I get this error: org.pentaho.ui.xul.XulException:…
Blew my stack
  • 261
  • 1
  • 4
  • 9
15
votes
2 answers

Pass DB Connection parameters to a Kettle a.k.a PDI table Input step dynamically from Excel

I have a requirement such that whenever i run my Kettle job, the database connection parameters must be taken dynamically from an excel source on each run. Say i have an excel with column names : HostName, Username, Database, Password. i want to…
Ritesh
  • 237
  • 1
  • 4
  • 13
14
votes
3 answers

Pentaho kettle: how to set up tests for transformations/jobs?

I've been using Pentaho Kettle for quite a while and previously the transformations and jobs i've made (using spoon) have been quite simple load from db, rename etc, input to stuff to another db. But now i've been doing transformations that do a bit…
hannesh
  • 502
  • 10
  • 15
14
votes
2 answers

Fast alternative to split in R

I'm partitioning a data frame with split() in order to use parLapply() to call a function on each partition in parallel. The data frame has 1.3 million rows and 20 cols. I'm splitting/partitioning by two columns, both character type. Looks like…
argoneus
  • 701
  • 1
  • 7
  • 14
13
votes
1 answer

Error while exporting clob field from oracle to text field on Azure DB

I have a CLOB field on an Oracle Table that stores a PDF file. When I try export this field to SQL Server Db on Azure I got this error: 2017/01/19 11:14:32 - ImpostoRenda 2.0 - ERROR (version 6.1.0.1-196, build 1 from 2016-04-07 12.08.49 by…
Renatto Machado
  • 1,534
  • 2
  • 16
  • 33
13
votes
3 answers

COPY column order

I'm trying to use COPY with HEADER option but my header line in file is in different order than the column order specified in database. Is the column name order necessary in my file ?? My code is as below: COPY table_name ( …
sneha
  • 169
  • 1
  • 4
  • 11
1
2 3
99 100