{"id":741,"date":"2011-09-18T21:24:13","date_gmt":"2011-09-19T03:24:13","guid":{"rendered":"http:\/\/www.mooreds.com\/wordpress\/?p=741"},"modified":"2011-09-11T21:36:06","modified_gmt":"2011-09-12T03:36:06","slug":"pentaho-data-integration-is-damn-cool","status":"publish","type":"post","link":"https:\/\/www.mooreds.com\/wordpress\/archives\/741","title":{"rendered":"Pentaho Data Integration is damn cool"},"content":{"rendered":"<p>I have worked on two small projects with Pentaho Data Integration.\u00a0 If you&#8217;re looking for a business intelligence tool that lets you manipulate large amounts of data in a performant way, you definitely want to take a look at this.\u00a0 The version I&#8217;m working with is a couple of revisions back, but the <a href=\"http:\/\/wiki.pentaho.com\/display\/EAI\/Latest+Pentaho+Data+Integration+%28aka+Kettle%29+Documentation\">online support<\/a> is pretty good.\u00a0 It&#8217;s way more developer-efficient than writing java, though debugging is more difficult.<\/p>\n<p>Why is it so cool?\u00a0 It lets you focus on your problem&#8211;validating and transforming your data&#8211;rather than the mechanics of it (where do the CSV files live?\u00a0 what fields did I just add?\u00a0 how do I parse this fixed width file?).\u00a0 You can also <a href=\"http:\/\/wiki.pentaho.com\/display\/EAI\/User+Defined+Java+Class\">call out to Java<\/a> if you need to.<\/p>\n<p>There is a bit of a learning curve, especially around the difference between transformations and jobs.\u00a0 I bought my first tech book of 2011, <a href=\"http:\/\/www.wiley.com\/WileyCDA\/WileyTitle\/productCd-0470635177.html\">Pentaho Kettle Solutions<\/a>.\u00a0 These projects weren&#8217;t even using Pentaho for its sweet spot, ETLing to a data warehouse, but I have found this to be an invaluable tool for moving data from text files to databases while cleaning up and processing it.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I have worked on two small projects with Pentaho Data Integration.\u00a0 If you&#8217;re looking for a business intelligence tool that lets you manipulate large amounts of data in a performant [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[9,5],"tags":[],"class_list":["post-741","post","type-post","status-publish","format-standard","hentry","category-ides","category-java"],"_links":{"self":[{"href":"https:\/\/www.mooreds.com\/wordpress\/wp-json\/wp\/v2\/posts\/741","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.mooreds.com\/wordpress\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.mooreds.com\/wordpress\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.mooreds.com\/wordpress\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.mooreds.com\/wordpress\/wp-json\/wp\/v2\/comments?post=741"}],"version-history":[{"count":1,"href":"https:\/\/www.mooreds.com\/wordpress\/wp-json\/wp\/v2\/posts\/741\/revisions"}],"predecessor-version":[{"id":742,"href":"https:\/\/www.mooreds.com\/wordpress\/wp-json\/wp\/v2\/posts\/741\/revisions\/742"}],"wp:attachment":[{"href":"https:\/\/www.mooreds.com\/wordpress\/wp-json\/wp\/v2\/media?parent=741"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.mooreds.com\/wordpress\/wp-json\/wp\/v2\/categories?post=741"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.mooreds.com\/wordpress\/wp-json\/wp\/v2\/tags?post=741"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}