I’ve been hearing a blot of buzz from clients about Pentaho. One large bank mentioned that they loaded their Hadoop environment in seconds with Pentaho versus several minutes with other leading ETL tools. So I decided to do some research on Pentaho to see what all the fuss was about.
It turns out that Pentaho offers a tightly integrated suite for business intelligence and data integration.
Pentaho Business Analytics offers reporting, dashboarding, and analytical capabilities based on in-memory technology. The tool offers strong visualization capabilities including support for scatter plots, heat grids, and geo-mapping.
Pentaho Business Analytics leverages Pentaho Data Integration to provide analytics for data residing in relational databases, NoSQL stores, Hadoop, and business intelligence appliances. Pentaho Data Integration is an ETL tool that supports connectivity to OLTP, analytical and NoSQL databases such as Oracle, Greenplum, Teradata, Netezza, Apache Cassandra, and MongoDB as well as to unstructured and semi-structured sources such as Hadoop, Excel, XML, and RSS feeds.
The Pentaho platform comes in two basic flavors: a community edition that is open source, and an enterprise edition that includes product support and advanced features.
Pentaho has been getting strong traction with organizations due to its strong big data support, open source heritage, and cost effectiveness.