<?xml version="1.0" encoding="UTF-8"?>
<!--Generated by Squarespace Site Server v5.11.81 (http://www.squarespace.com/) on Fri, 10 Feb 2012 15:29:07 GMT--><?xml-stylesheet type="text/css" href="/universal/styles/feed.css"?><rss version="2.0"><channel><title>OCDQ Blog Feed - Comments</title><link>http://www.ocdqblog.com/home/</link><description>Obsessive-Compulsive Data Quality Blog</description><copyright>Copyright Jim Harris 2009-2011</copyright><language>en-US</language><generator>Squarespace Site Server v5.11.81 (http://www.squarespace.com/)</generator><item><title>Jimothy Richards comments on HoardaBytes and the Big Data Lebowski</title><author>Jimothy Richards</author><pubDate>Fri, 03 Feb 2012 04:56:46 +0000</pubDate><link>http://www.ocdqblog.com/home/hoardabytes-and-the-big-data-lebowski.html#comments</link><guid isPermaLink="false">327252:3438475:comment/16769072</guid><description><![CDATA[<p>The ability for companies to handle big data represents exciting innovation where large relational databases with high price tags are sometimes replaced with flat files, technologies like Hadoop and intelligent parsers to create analytics from massive amounts of data.</p>]]></description></item><item><title>Jim Harris comments on A Swift Kick in the AAS</title><author>Jim Harris</author><pubDate>Thu, 02 Feb 2012 21:51:44 +0000</pubDate><link>http://www.ocdqblog.com/home/a-swift-kick-in-the-aas.html#comments</link><guid isPermaLink="false">327252:3438475:comment/16766162</guid><description><![CDATA[<p>From the <a href="http://lnkd.in/3yw7i8" title="LinkedIn Group for Enterprise CIO Forum" rel="nofollow">LinkedIn Group for Enterprise CIO Forum</a>, <b>Pearl Zhu</b> commented:</p><p>“Hi, Jim, nice to see you expand the data wind into cloud wings, the blog is resourceful by puling and well-organizing a few good cloud blogs to deliver more holistic view of cloudification:</p><p>-- Legacy Infrastructure Modernization and Convergence: Cloud provides the unprecedented opportunities for business to re-visit their EA, ITA, DA, and manage their information system with elasticity and agility</p><p>-- Application Portfolio Rationalization: SAAS solutions may also help business to revitalize their application portfolio via consolidation, integration, rationalization and optimization</p><p>-- From CapEx to OpEx: Beyond fast delivery and easy provisioning, Cloud has the economical flexibility to allow businesses to experiment with new innovations, delight customers and engage employees”</p><p><b>And I responded:</b></p><p>Thanks for the additional insights about the benefits of cloudification, Pearl :-)</p><p><br/>And <b>Paul Calento</b> commented:</p><p>“Specific to your mention of <em>converged infrastructure</em>, one of the challenges isn’t the opportunity itself, but the fact that for many companies the term doesn’t reflect <em>harmony</em> but (at least in some part) orphaned systems and applications that exist and can&#39;t be merely ripped and replaced. </p><p>In that scenario, instead of calling it <em>converged infrastructure</em> you could call it <em>duct tape and baling twine infrastructure.</em> </p><p>One key, perhaps to resolving this and the <em>as a service</em> trend is starting with hard choices during the asset evaluation and management state.”</p>]]></description></item><item><title>Sarah Michaels comments on The Johari Window of Data Quality</title><author>Sarah Michaels</author><pubDate>Thu, 26 Jan 2012 12:59:41 +0000</pubDate><link>http://www.ocdqblog.com/home/the-johari-window-of-data-quality.html#comments</link><guid isPermaLink="false">327252:3438475:comment/16701579</guid><description><![CDATA[<p>I’ve never heard of the Johari Window, but this is very interesting indeed. Thanks for sharing this.</p>]]></description></item><item><title>Jim Harris comments on DQ-View: MetaData makes BettahMusic</title><author>Jim Harris</author><pubDate>Wed, 25 Jan 2012 14:48:14 +0000</pubDate><link>http://www.ocdqblog.com/home/dq-view-metadata-makes-bettahmusic.html#comments</link><guid isPermaLink="false">327252:3438475:comment/16688244</guid><description><![CDATA[<p>Thanks for your comments, Jen and Crysta.</p><p><b><i>@Jen</i></b> — Yes, I too have used a library card catalog analogy in the past, as well as a music analogy based on albums and album covers, back when albums were compact disks or vinyl records that you could use for a show-and-tell presentation in real life. I came up with the iTunes analogy after being chided by a much younger member of my extended family who was confused why there were no songs in my iTunes library — and even more confused by my attempted explanation of music that didn’t come from the Internet :-)    </p><p><b><i>@Crysta</i></b> — Pandora (which I am listening to as I type this) is another fascinating example since metadata is needed in order for Pandora to create radio stations for you by artist, track, or composer, as well as the great metadata that they have for lyrics (my favorite feature), artist biographies, and similar artists. </p><p>Pandora’s <a href="http://en.wikipedia.org/wiki/Music_Genome_Project" title="Wikipedia article about the Music Genome Project" rel="nofollow">Music Genome Project</a> is a great example of the power of combining metadata and data, which uses almost 400 song attributes and a complex data mining algorithm to organize them, which also discovers other songs and artists similar to the ones you indicate that you like (data quality geeks like me would call this a data matching user feedback process).</p>]]></description></item><item><title>Crysta Anderson comments on DQ-View: MetaData makes BettahMusic</title><author>Crysta Anderson</author><pubDate>Tue, 24 Jan 2012 17:28:19 +0000</pubDate><link>http://www.ocdqblog.com/home/dq-view-metadata-makes-bettahmusic.html#comments</link><guid isPermaLink="false">327252:3438475:comment/16664725</guid><description><![CDATA[<p>Really nice example of real-world metadata application. By this logic, Pandora only exists because of the available metadata.</p>]]></description></item><item><title>Jen Besser comments on DQ-View: MetaData makes BettahMusic</title><author>Jen Besser</author><pubDate>Tue, 24 Jan 2012 16:37:37 +0000</pubDate><link>http://www.ocdqblog.com/home/dq-view-metadata-makes-bettahmusic.html#comments</link><guid isPermaLink="false">327252:3438475:comment/16664385</guid><description><![CDATA[<p>As always, Jim, thanks for making data management topics relevant and fun.  I&#39;ve been using the library card catalog analogy to explain metadata, but that is a dated analogy.  Thanks for supplying a more up to date, everyday analogy.</p>]]></description></item><item><title>Jim Harris comments on Big Data el Memorioso</title><author>Jim Harris</author><pubDate>Sat, 21 Jan 2012 20:09:59 +0000</pubDate><link>http://www.ocdqblog.com/home/big-data-el-memorioso.html#comments</link><guid isPermaLink="false">327252:3438475:comment/16636728</guid><description><![CDATA[<p>From the <a href="http://www.linkedin.com/groups?gid=45685" title="TDWI Business Intelligence and Data Warehousing LinkedIn Group" rel="nofollow">TDWI Business Intelligence and Data Warehousing LinkedIn Group</a>, <b>Lalitha Nataraj</b> commented:</p><p>“Nice one. Can you find me some examples of business problems which have been tackled by big data?”</p><p><b>And I responded:</b></p><p>Thanks for your comment and question, Lalitha. </p><p>The easiest examples of business problems tackled by big data come from some of the biggest Internet companies. </p><p>Google, whose description of their proprietary MapReduce algorithm served as the inspiration for the open source development of Hadoop, is a company built on big data with their indexing and ranking of web pages driving their search engine dominance, and their per-click advertising business model that still accounts for approximately 80% of their financial success. Amazon and Apple are excellent examples of platforms built on the big data of online sales transactions and recommendation engines, and the Facebook empire is built on the big data of social networking. </p><p>Of course, these companies are also examples of the potential dark side of big data, as shown by the data privacy implications and related legal and government regulatory challenges plaguing Google and Facebook. </p><p>However, companies of all sizes, and in all industries, are starting to investigate how big data can help with their business problems, and so in the coming months and years, I believe that we will be hearing more of those more practical stories and real-world case studies. </p><p>Best Regards, </p><p>Jim</p><p><br/>And <b>Jaime Rubio</b> commented: More examples out of the cloud, but on the ground, where you’d have to tap Big Data: </p><p>- Financial transactions (fraud detection, accounting) <br/>- Insurance liability calculation <br/>- Cell phone calls analysis for rate plan design <br/>- Daily purchases on a big retail chain <br/>- Votation polls</p><p><b>And I responded:</b> Thanks Jaime for providing some excellent additional examples of the usefulness of Big Data.</p>]]></description></item><item><title>Mark Troester comments on Big Data el Memorioso</title><author>Mark Troester</author><pubDate>Fri, 20 Jan 2012 21:46:24 +0000</pubDate><link>http://www.ocdqblog.com/home/big-data-el-memorioso.html#comments</link><guid isPermaLink="false">327252:3438475:comment/16633283</guid><description><![CDATA[<p>I think his helps illustrate that one size does not fit all. You can&#39;t take a singular approach to how you design for big data. It&#39;s all about identifying relevance and understanding that relevance can change over time. </p><p>There are certain situations where it makes sense to leverage all of the data, and now with high performance computing capabilities that include in-memory, in-DB and grid, it&#39;s possible to build and deploy rich models using all data in a short amount of time. Not only can you leverage rich models, but you can deploy a large number of models that leverage many variables so that you get optimal results. On the other hand, there are situations where you need to filter out the extraneous information and the more intelligent you can be about identifying the relevant information the better. </p><p>The traditional approach is to grab the data, cleanse it and land it somewhere before processing or analyzing the data. We suggest that you leverage analytics up front to determine what data is relevant as it streams in, with relevance based on your organizational knowledge or context. That helps you determine what data should be acted upon immediately, where it should be stored, etc. And of course there are considerations about using visual analytic techniques to help you determine relevance and guide your analysis, but that&#39;s an entire subject just on its own!</p><p>Mark Troester<br/>SAS<br/>Twitter: <a href="http://twitter.com/mtroester" title="twitter.com/mtroester" rel="nofollow">@mtroester</a></p>]]></description></item><item><title>Jim Harris comments on Data Governance Frameworks are like Jigsaw Puzzles</title><author>Jim Harris</author><pubDate>Fri, 20 Jan 2012 00:00:46 +0000</pubDate><link>http://www.ocdqblog.com/home/data-governance-frameworks-are-like-jigsaw-puzzles.html#comments</link><guid isPermaLink="false">327252:3438475:comment/16628204</guid><description><![CDATA[<p>From the <a href="http://www.linkedin.com/groups?gid=748817" title="LinkedIn Group for Data Governance &amp; Data Quality" rel="nofollow">LinkedIn Group for Data Governance &amp; Data Quality</a>, <b>John Adler</b> commented:</p><p>“Great work Jim, thanks! This is one area that organizational/technical/business tactics are key. </p><p>The biggest challenge is getting from maturity level 0 to level 1 - puzzle analogy: corners and edges first. Developing repeatable and proven approaches that are tailored to each company&#39;s unique situation is the thing that makes establishing Data Governance so challenging (and fun). This is a cross discipline activity that requires Business, Organizational, <a href="http://en.wikipedia.org/wiki/Intrapreneurship" title="Wikipedia article about Intrapreneurship" rel="nofollow">Intrapreneurial</a>, Technical, and Interpersonal smarts. A tall order indeed.”</p><p><b>And I responded:</b></p><p>Thanks for your great comment, John. </p><p>Corners and edges first, indeed. And, as you noted, each company&#39;s unique situation will determine where those corners and edges will be as the data governance journey begins. </p><p>Best Regards, </p><p>Jim</p>]]></description></item><item><title>Jim Harris comments on Big Data el Memorioso</title><author>Jim Harris</author><pubDate>Thu, 19 Jan 2012 23:51:06 +0000</pubDate><link>http://www.ocdqblog.com/home/big-data-el-memorioso.html#comments</link><guid isPermaLink="false">327252:3438475:comment/16628169</guid><description><![CDATA[<p>From the <a href="http://lnkd.in/uhRDwR" title="LinkedIn Group for Enterprise CIO Forum" rel="nofollow">LinkedIn Group for Enterprise CIO Forum</a>, <b>Pearl Zhu</b> commented:</p><p>“Hi, Jim, enjoy your big data wisdom starting with an interesting little story. I would say, big data could mean more of a culture evolution than a technology evolution, the point is: instead of counting on the gut feeling to make decisions, big data now encourages us to pursue data-driven patterns and facts, then, as you pointed out, big data is the way to the end, not the end, the purpose of analytics is not to get stuck to the details, but accumulate the wisdom, in order to spark more innovative products or services, optimize customer experience or improve the working environment, etc.”</p><p><b>And I responded:</b></p><p>Thanks for your excellent (as always) comment, Pearl. </p><p>I definitely agree with you that big data is more of a corporate cultural evolution than a technology evolution. But since technology is almost always easier to work with than corporate culture, I fear that some organizations will simply try throwing technology (e.g., Hadoop and NoSQL) at big data only to be disappointed with the results, or lack thereof. </p><p>Best Regards, </p><p>Jim</p>]]></description></item></channel></rss>
