Thursday, July 13, 2017

Speed up inserts for Pentaho on MySQL

I work with Pentaho data integration a fair amount at work.  When I try to run jobs locally the insert speed is ridiculously slow.  I found this article that gives a good way to speed up the inserts.

TLDR: add useServerPrepStmts=false, rewriteBatchedStatements=true and useCompression=true to database connection options and things speed up quite a bit on the insert side.  I still need to find a way to may insert/update faster.

Thursday, November 15, 2012

Crazy joins and insane SQL

I am trying to write reports at work off of data that is not really conducive to reporting.  The biggest problem is a field that has values separated by commas.  It can hold an agent's name or the queue or queues that have access to work it.  I found this article on stack overflow, http://stackoverflow.com/questions/10019557/join-sql-server-tables-on-a-like-statement, and it is helping me join to a reference table, but I still can't obviously get rows with more than one value in the table to count only once, but I have also been running into a problem of trying to join the agent ID in the column to our agent hierarchy data.  I starting using substring and charindex before I found this method for the queues.  I just realized I need to use this same technique for this join.  I need to test this in the morning to see if I can fix the problem with this technique instead of a nasty case statement with substrings and charindexes.

Wednesday, October 17, 2012

First post

I am just starting out in this realm.  Most of my posts will probably be SQL Server or virtualization related.  I am not much of a writer, so a lot of this may be just me venting frustration or putting things down here so I can find them later.