<?xml version="1.0" encoding="UTF-8"?>
<!--Generated by Squarespace V5 Site Server v5.13.166 (http://www.squarespace.com) on Wed, 19 Jun 2013 12:13:37 GMT--><rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:wfw="http://wellformedweb.org/CommentAPI/" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:dc="http://purl.org/dc/elements/1.1/" version="2.0"><channel><title>OCDQ Blog Feed</title><link>http://www.ocdqblog.com/home/</link><description>Obsessive-Compulsive Data Quality Blog</description><lastBuildDate>Tue, 18 Jun 2013 14:57:50 +0000</lastBuildDate><copyright>Copyright Jim Harris 2009-2011</copyright><language>en-US</language><generator>Squarespace V5 Site Server v5.13.166 (http://www.squarespace.com)</generator><item><title>DQ-Tip: “An information centric organization...”</title><category>Books</category><category>DQ-Tip</category><category>Data Governance</category><category>Data Quality</category><dc:creator>Jim Harris</dc:creator><pubDate>Tue, 18 Jun 2013 15:00:00 +0000</pubDate><link>http://www.ocdqblog.com/home/dq-tip-an-information-centric-organization.html</link><guid isPermaLink="false">327252:3438475:33917334</guid><description><![CDATA[<p><em>Data Quality (DQ) Tips is an <a title="ocdqblog.com/about-ocdq" href="http://www.ocdqblog.com/about-ocdq/">OCDQ</a> regular segment.  Each DQ-Tip is a clear and concise data quality pearl of wisdom.</em></p>
<blockquote>
<p><strong>“An information centric organization is an organization driven from high-quality, complete, and timely information that is relevant to its goals.”</strong></p>
</blockquote>
<p>This <a title="ocdqblog.com/home/tag/dq-tip" href="http://www.ocdqblog.com/home/tag/dq-tip">DQ-Tip</a> is from the new book <a title="amazon.com/Patterns-Information-Management-Mandy-Chessell/dp/0133155501" href="http://www.amazon.com/Patterns-Information-Management-Mandy-Chessell/dp/0133155501" target="_blank"><em>Patterns of Information Management</em></a> by Mandy Chessell and Harald Smith.</p>
<p>“An organization exists for a purpose,” Chessell and Smith explained.  “It has targets to achieve and long-term aspirations.  An organization needs to make good use of its information to achieve its goals.”  In order to do this, they recommend that you define an information strategy that lays out why, what, and how your organization will manage its information:</p>
<ul>
<li><strong>Why</strong> — The business imperatives that drive the need to be information centric, which helps focus information management efforts on the activities that deliver value to the organization.</li>
</ul>
<ul>
<li><strong>What</strong> — The type of information that you must manage to deliver on those business imperatives, which includes the subject areas to cover, which attributes within each subject area that need to be managed, the valid values for those attributes, and the information management policies (such as retention and protection) that the organization wants to implement.</li>
</ul>
<ul>
<li><strong>How</strong> — The information management principles that provide the general rules for how information is to be managed by the information systems and the people using them along with how information flows between them.</li>
</ul>
<p>Developing an information strategy, according to Chessell and Smith, “creates a set of objectives for the organization, which guides the investment in information management technology and related solutions that support the business.  Starting with the business imperatives ensures the information management strategy is aligned with the needs of the organization, making it easier to demonstrate its relevance and value.”</p>
<p>Chessell and Smith also noted that “technology alone is not sufficient to ensure the quality, consistency, and flexibility of an organization’s information.  Classify the people connected to the organization according to their information needs and skills, provide common channels of communication and knowledge sharing about information, and user interfaces and reports through which they can access the information as appropriate.”</p>
<p>Chessell and Smith explained that the attitudes and skills of the organization’s people will be what enables the right behaviors in everyday operations, which is a major determination of the success of an information management program.</p>
<p> </p>
<h2>Related Posts</h2>
<p><a title="ocdqblog.com/home/dq-tip-the-quality-of-information-is-directly-related-to.html" href="http://www.ocdqblog.com/home/dq-tip-the-quality-of-information-is-directly-related-to.html">DQ-Tip: “The quality of information is directly related to...”</a></p>
<p><a title="ocdqblog.com/home/dq-tip-undisputable-fact-about-the-value-and-use-of-data.html" href="http://www.ocdqblog.com/home/dq-tip-undisputable-fact-about-the-value-and-use-of-data.html">DQ-Tip: “Undisputable fact about the value and use of data...”</a></p>
<p><a title="ocdqblog.com/home/dq-tip-data-quality-tools-do-not-solve-data-quality-problems.html" href="http://www.ocdqblog.com/home/dq-tip-data-quality-tools-do-not-solve-data-quality-problems.html">DQ-Tip: “Data quality tools do not solve data quality problems...”</a></p>
<p><a title="ocdqblog.com/home/dq-tip-there-is-no-such-thing-as-data-accuracy.html" href="http://www.ocdqblog.com/home/dq-tip-there-is-no-such-thing-as-data-accuracy.html">DQ-Tip: “There is no such thing as data accuracy...”</a></p>
<p><a title="ocdqblog.com/home/dq-tip-data-quality-is-primarily-about-context-not-accuracy.html" href="http://www.ocdqblog.com/home/dq-tip-data-quality-is-primarily-about-context-not-accuracy.html">DQ-Tip: “Data quality is primarily about context not accuracy...”</a></p>
<p><a title="ocdqblog.com/home/dq-tip-there-is-no-point-in-monitoring-data-quality.html" href="http://www.ocdqblog.com/home/dq-tip-there-is-no-point-in-monitoring-data-quality.html">DQ-Tip: “There is no point in monitoring data quality...”</a></p>
<p><a title="ocdqblog.com/home/dq-tip-dont-pass-bad-data-on-to-the-next-person.html" href="http://www.ocdqblog.com/home/dq-tip-dont-pass-bad-data-on-to-the-next-person.html">DQ-Tip: “Don't pass bad data on to the next person...”</a></p>
<p><a title="ocdqblog.com/home/dq-tip-go-talk-with-the-people-using-the-data.html" href="http://www.ocdqblog.com/home/dq-tip-go-talk-with-the-people-using-the-data.html">DQ-Tip: “...Go talk with the people using the data”</a></p>
<p><a title="ocdqblog.com/home/dq-tip-data-quality-is-about-more-than-just-improving-your-d.html" href="http://www.ocdqblog.com/home/dq-tip-data-quality-is-about-more-than-just-improving-your-d.html">DQ-Tip: “Data quality is about more than just improving your data...”</a></p>
<p><a title="ocdqblog.com/home/dq-tip-start-where-you-are.html" href="http://www.ocdqblog.com/home/dq-tip-start-where-you-are.html">DQ-Tip: “Start where you are...”</a></p>

<p>
<a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.ocdqblog.com/home/dq-tip-an-information-centric-organization.html" data-text="DQ-Tip: “An information centric organization...” #DataQuality #DataGovernance" data-count="vertical" data-via="ocdqblog">Tweet</a><script type="text/javascript" src="http://platform.twitter.com/widgets.js"></script>
<script type="text/javascript" src="http://platform.linkedin.com/in.js"></script><script type="in/share" data-url="http://www.ocdqblog.com/home/dq-tip-an-information-centric-organization.html" data-counter="top"></script>
</p>]]></description><wfw:commentRss>http://www.ocdqblog.com/home/rss-comments-entry-33917334.xml</wfw:commentRss></item><item><title>Sometimes Worse Data Quality is Better</title><category>Data Quality</category><category>Debates</category><category>Philosophy</category><dc:creator>Jim Harris</dc:creator><pubDate>Tue, 11 Jun 2013 14:00:00 +0000</pubDate><link>http://www.ocdqblog.com/home/sometimes-worse-data-quality-is-better.html</link><guid isPermaLink="false">327252:3438475:33886825</guid><description><![CDATA[<p>Continuing a theme from three previous posts, which discussed <a title="ocdqblog.com/home/data-quality-and-the-ok-plateau.html" href="http://www.ocdqblog.com/home/data-quality-and-the-ok-plateau.html">when it’s okay to call data quality as good as it needs to get</a>, the occasional times <a title="ocdqblog.com/home/when-poor-data-quality-kills.html" href="http://www.ocdqblog.com/home/when-poor-data-quality-kills.html">when perfect data quality is necessary</a>, and the <a title="ocdqblog.com/home/the-costs-and-profits-of-poor-data-quality.html" href="http://www.ocdqblog.com/home/the-costs-and-profits-of-poor-data-quality.html" target="_blank">costs and profits of poor data quality</a>, in this blog post I want to provide three examples of when the world of consumer electronics proved that sometimes worse data quality is better.</p>
<p> </p>
<h2>When the Betamax Bet on Video Busted</h2>
<p>While it seems like a long time ago in a galaxy far, far away, during the 1970s and 1980s a <a title="Wikipedia article about videotape format war" href="http://en.wikipedia.org/wiki/Videotape_format_war" target="_blank">videotape format war</a> waged between Betamax and VHS.  Betamax was widely believed to provide superior video data quality.</p>
<p>But a blank Betamax tape allowed users to record up to two hours of high-quality video, whereas a VHS tape allowed users to record up to four hours of slightly lower quality video.  Consumers consistently chose quantity over quality — and especially since lower quality also meant a lower price.  Betamax tapes and machines remained more expensive based on betting that consumers would be willing to pay a premium for higher-quality video.</p>
<p>The VHS victory demonstrated how people often choose <a title="ocdqblog.com/home/data-quality-and-big-data.html" href="http://www.ocdqblog.com/home/data-quality-and-big-data.html">quantity over quality</a>, so it doesn’t always pay to have better data quality.</p>
<p> </p>
<h2>When Lossless Lost to Lossy Audio</h2>
<p>Much to the dismay of those working in the data quality profession, most people do not care about the quality of their data unless it becomes bad enough for them to pay attention to — and complain about.</p>
<p>An excellent example is bitrate, which refers to the number of bits — or the amount of data — that are processed over a certain amount of time.  In his article <a title="lifehacker.com/5810575/does-bitrate-really-make-a-difference-in-my-music" href="http://lifehacker.com/5810575/does-bitrate-really-make-a-difference-in-my-music" target="_blank"><em>Does Bitrate Really Make a Difference In My Music?</em></a>, Whitson Gordon examined the common debate about <strong>lossless</strong> versus <strong>lossy</strong> audio formats.</p>
<p>Using the example of ripping a track from a CD to a hard drive, a lossless format means the track is <strong>not compressed</strong> to the point where any of its data is lost, retaining, for all intents and purposes, the same audio data quality as the original CD track.</p>
<p>By contrast, a lossy format compresses the track so that it takes up less space by <strong>intentionally deleting</strong> some of its data, reducing audio data quality.  Audiophiles often claim anything other than vinyl records sound lousy because they are so lossy.</p>
<p>However, like truth, beauty, and art, <a title="ocdqblog.com/home/data-myopia-and-business-relativity.html" href="http://www.ocdqblog.com/home/data-myopia-and-business-relativity.html">data quality can be said to be</a> in the eyes — or the ears — of the beholder.  So, if your favorite music sounds fine to you in MP3 file format, then not only do you not need vinyl records, audio tapes, and CDs anymore, but if you consider MP3 files <strong>good enough</strong>, then you will not pay more attention to (or pay more money for) audio data quality.</p>
<p> </p>
<h2>When Digital Killed the Photograph Star</h2>
<p>The Eastman Kodak Company, commonly known as Kodak, which was founded by George Eastman in 1888 and dominated the photograph industry for most of the 20th century, filed for bankruptcy in January 2012.  The primary reason was that Kodak, which had previously pioneered innovations like celluloid film and color photography, failed to embrace the industry’s transition to digital photography, despite the fact that Kodak invented some of the core technology used in current digital cameras.</p>
<p>Why?  Because Kodak believed that the data quality of digital photographs would be generally unacceptable to consumers as a replacement for film photographs.  In much the same way that Betamax assumed consumers wanted higher-quality video, Kodak assumed consumers would always want to use higher-quality photographs to capture their “Kodak moments.”</p>
<p>In fairness to Kodak, <a title="ocdqblog.com/home/the-age-of-the-mobile-device.html" href="http://www.ocdqblog.com/home/the-age-of-the-mobile-device.html">mobile devices are causing a massive — <em>and rapid</em> — disruption</a> to many well-established business models, creating a brave new digital world, and obviously not just for photography.  However, when digital killed the photograph star, it proved, once again, that sometimes worse data quality is better.</p>
<p>  </p>
<h2>Related Posts</h2>
<table width="873" border="0" cellspacing="3" cellpadding="3">
<tbody>
<tr>
<td valign="top" width="435">
<p><a title="ocdqblog.com/home/data-quality-and-the-ok-plateau.html" href="http://www.ocdqblog.com/home/data-quality-and-the-ok-plateau.html">Data Quality and the OK Plateau</a></p>
<p><a title="ocdqblog.com/home/when-poor-data-quality-kills.html" href="http://www.ocdqblog.com/home/when-poor-data-quality-kills.html">When Poor Data Quality Kills</a></p>
<p><a title="ocdqblog.com/home/the-costs-and-profits-of-poor-data-quality.html" href="http://www.ocdqblog.com/home/the-costs-and-profits-of-poor-data-quality.html">The Costs and Profits of Poor Data Quality</a></p>
<p><a title="ocdqblog.com/home/promoting-poor-data-quality.html" href="http://www.ocdqblog.com/home/promoting-poor-data-quality.html">Promoting Poor Data Quality</a></p>
<p><a title="ocdqblog.com/home/data-quality-and-the-cupertino-effect.html" href="http://www.ocdqblog.com/home/data-quality-and-the-cupertino-effect.html">Data Quality and the Cupertino Effect</a></p>
<p><a title="ocdqblog.com/home/the-data-quality-wager.html" href="http://www.ocdqblog.com/home/the-data-quality-wager.html">The Data Quality Wager</a></p>
<p><a title="ocdqblog.com/home/how-data-cleansing-saves-lives.html" href="http://www.ocdqblog.com/home/how-data-cleansing-saves-lives.html">How Data Cleansing Saves Lives</a></p>
<p><a title="ocdqblog.com/home/the-dichotomy-paradox-data-quality-and-zero-defects.html" href="http://www.ocdqblog.com/home/the-dichotomy-paradox-data-quality-and-zero-defects.html">The Dichotomy Paradox, Data Quality and Zero Defects</a></p>
<p><a title="ocdqblog.com/home/data-quality-and-miracle-exceptions.html" href="http://www.ocdqblog.com/home/data-quality-and-miracle-exceptions.html">Data Quality and Miracle Exceptions</a></p>
<p><a title="ocdqblog.com/home/data-quality-quo-vadimus.html" href="http://www.ocdqblog.com/home/data-quality-quo-vadimus.html">Data Quality: Quo Vadimus?</a></p>
</td>
<td valign="top" width="435">
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=11810" href="http://www.dataroundtable.com/?p=11810" target="_blank">The Seventh Law of Data Quality</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=1711" href="http://www.dataroundtable.com/?p=1711" target="_blank">A Tale of Two Q’s</a></p>
<p><a class="offsite-link-inline" title="Paleolithic Rhythm and Data Quality" href="http://www.dataroundtable.com/?p=7948" target="_blank">Paleolithic Rhythm and Data Quality</a></p>
<p><a class="offsite-link-inline" title="Groundhog Data Quality Day by Jim Harris on the Data Roundtable" href="http://www.dataroundtable.com/?p=6031" target="_blank">Groundhog Data Quality Day</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=1560" href="http://www.dataroundtable.com/?p=1560" target="_blank">Data Quality and The Middle Way</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=12780" href="http://www.dataroundtable.com/?p=12780" target="_blank">Stop Poor Data Quality STOP</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=12366" href="http://www.dataroundtable.com/?p=12366" target="_blank">When Poor Data Quality Calls</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=8537" href="http://www.dataroundtable.com/?p=8537" target="_blank">Freudian Data Quality</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=3348" href="http://www.dataroundtable.com/?p=3348" target="_blank">Predictably Poor Data Quality</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=8998" href="http://www.dataroundtable.com/?p=8998" target="_blank">Satisficing Data Quality</a></p>
</td>
</tr>
</tbody>
</table>

<p>
<a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.ocdqblog.com/home/sometimes-worse-data-quality-is-better.html" data-text="Sometimes Worse Data Quality is Better #DataQuality" data-count="vertical" data-via="ocdqblog">Tweet</a><script type="text/javascript" src="http://platform.twitter.com/widgets.js"></script>
<script type="text/javascript" src="http://platform.linkedin.com/in.js"></script><script type="in/share" data-url="http://www.ocdqblog.com/home/sometimes-worse-data-quality-is-better.html" data-counter="top"></script>
</p>]]></description><wfw:commentRss>http://www.ocdqblog.com/home/rss-comments-entry-33886825.xml</wfw:commentRss></item><item><title>i blog of Data glad and big</title><category>Big Data</category><category>Blogs</category><category>Books</category><category>Data Quality</category><category>Data Science</category><category>Debates</category><category>Philosophy</category><dc:creator>Jim Harris</dc:creator><pubDate>Thu, 06 Jun 2013 15:00:00 +0000</pubDate><link>http://www.ocdqblog.com/home/i-blog-of-data-glad-and-big.html</link><guid isPermaLink="false">327252:3438475:33856925</guid><description><![CDATA[<p>I recently blogged about the need to balance <a title="ocdqblog.com/home/hoardabytes-and-the-big-data-lebowski.html" href="http://www.ocdqblog.com/home/hoardabytes-and-the-big-data-lebowski.html">the hype of big data</a> with <a title="ocdqblog.com/home/the-laugh-in-effect-of-big-data.html" href="http://www.ocdqblog.com/home/the-laugh-in-effect-of-big-data.html">some anti-hype</a>.  My hope was, like a collision of matter and anti-matter, the hype and anti-hype would cancel each other out, transitioning our energy into a more productive discussion about big data.  But, of course, few things in human discourse ever reach such an equilibrium, or can maintain it for very long.</p>
<p>For example, Quentin Hardy recently blogged about <a title="blogs.nytimes.com/2013/06/01/why-big-data-is-not-truth" href="http://bits.blogs.nytimes.com/2013/06/01/why-big-data-is-not-truth/?smid=pl-share" target="_blank">six big data myths</a> based on a conference presentation by Kate Crawford, who herself also recently blogged about <a title="blogs.hbr.org/cs/2013/04/the_hidden_biases_in_big_data.html" href="http://blogs.hbr.org/cs/2013/04/the_hidden_biases_in_big_data.html" target="_blank">the hidden biases in big data</a>.  “I call B.S. on all of it,” Derrick Harris blogged in his <a title="gigaom.com/2013/05/28/if-youre-disappointed-with-big-data-youre-not-paying-attention" href="http://gigaom.com/2013/05/28/if-youre-disappointed-with-big-data-youre-not-paying-attention/" target="_blank">response to the backlash against big data</a>.  “It might be provocative to call into question one of the hottest tech movements in generations, but it’s not really fair.  That’s because how companies and people benefit from big data, <a title="ocdqblog.com/home/demystifying-data-science.html" href="http://www.ocdqblog.com/home/demystifying-data-science.html">data science</a> or whatever else they choose to call the movement toward <a title="ocdqblog.com/home/our-increasingly-data-constructed-world.html" href="http://www.ocdqblog.com/home/our-increasingly-data-constructed-world.html">a data-centric world</a> is directly related to what they expect going in.  Arguing that big data isn’t all it’s cracked up to be is a strawman, pure and simple — because <a title="ocdqblog.com/home/magic-elephants-data-psychics-and-invisible-gorillas.html" href="http://www.ocdqblog.com/home/magic-elephants-data-psychics-and-invisible-gorillas.html">no one should think it’s magic</a> to begin with.”</p>
<p>In their new book <a title="amazon.com/Big-Data-Revolution-Transform-Think/dp/0544002695" href="http://www.amazon.com/Big-Data-Revolution-Transform-Think/dp/0544002695" target="_blank"><em>Big Data: A Revolution That Will Transform How We Live, Work, and Think</em></a>, Viktor Mayer-Schonberger and Kenneth Cukier explained that “like so many new technologies, big data will surely become a victim of Silicon Valley’s notorious hype cycle: after being feted on the cover of magazines and at industry conferences, the trend will be dismissed and many of the data-smitten startups will flounder.  But both the infatuation and the damnation profoundly misunderstand the importance of what is taking place.  Just as <a title="ocdqblog.com/home/keep-looking-up-insights-in-data.html" href="http://www.ocdqblog.com/home/keep-looking-up-insights-in-data.html">the telescope enabled us to comprehend the universe</a> and the microscope allowed us to understand germs, the new techniques for <a title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1187/Predictive-Analytics-the-Data-Effect-and-Jed-Clampett.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1187/Predictive-Analytics-the-Data-Effect-and-Jed-Clampett.aspx" target="_blank">collecting and analyzing huge bodies of data</a> will help us make sense of our world in ways we are just starting to appreciate.  The <a title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1139/Rage-against-the-Machines-Learning.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1139/Rage-against-the-Machines-Learning.aspx" target="_blank">real revolution is not in the machines</a> that calculate data, but in data itself and how we use it.”</p>
<p>Although there have been numerous critical technology factors making the era of big data possible, such as increases in the amount of computing power, decreases in the cost of data storage, increased network bandwidth, parallel processing frameworks (e.g., Hadoop), scalable and distributed models (e.g., cloud computing), and other techniques (e.g., in-memory computing), Mayer-Schonberger and Cukier argued that “something more important changed too, something subtle.  There was a shift in mindset about how data could be used.  Data was no longer regarded as static and stale, whose usefulness was finished once the <a title="ocdqblog.com/home/data-myopia-and-business-relativity.html" href="http://www.ocdqblog.com/home/data-myopia-and-business-relativity.html">purpose for which it was collected</a> was achieved.  Rather, <a title="ocdqblog.com/home/the-data-cold-war.html" href="http://www.ocdqblog.com/home/the-data-cold-war.html">data became a raw material of business</a>, a vital economic input, used to create a new form of economic value.”</p>
<p>“In fact, with the right mindset, data can be cleverly used to become a fountain of innovation and new services.  The data can reveal secrets to those with the humility, the willingness, and the tools to listen.”</p>
<p>Pondering this big data war of words reminded me of the <a title="wikipedia.org/wiki/E._E._Cummings" href="http://en.wikipedia.org/wiki/E._E._Cummings" target="_blank">E. E. Cummings</a> poem <a title="poets.org/viewmedia.php/prmMID/15408" href="http://www.poets.org/viewmedia.php/prmMID/15408" target="_blank"><em>i sing of Olaf glad and big</em></a>, which sings of Olaf, a conscientious objector forced into military service, who passively endures brutal torture inflicted upon him by training officers, while calmly responding (<strong>pardon the profanity</strong>): “I will not kiss your fucking flag” and “there is some shit I will not eat.”</p>
<p>Without question, <a title="ocdqblog.com/home/the-graystone-effects-of-big-data.html" href="http://www.ocdqblog.com/home/the-graystone-effects-of-big-data.html">big data has both positive and negative aspects</a>, but the seeming unwillingness of either side in the big data war of words to “kiss each other’s flag,” so to speak, is not as concerning to me as is the conscientious objection to big data and data science expanding into realms where people and businesses were not used to enduring its influence.  For example, some will feel that <a title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1196/Data-Separates-Science-from-Superstition.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1196/Data-Separates-Science-from-Superstition.aspx" target="_blank">data-driven audits of their decision-making</a> is like brutal torture inflicted upon their less-than <a title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1111/Data-Driven-Intuition.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1111/Data-Driven-Intuition.aspx" target="_blank">data-driven intuition</a>.</p>
<p>E.E. Cummings sang the praises of Olaf “because unless statistics lie, he was more brave than me.”  i blog of Data glad and big, but I fear that, regardless of how big it is, “there is some data I will not believe” will be a common refrain by people who will lack the humility and willingness to listen to data, and who will not be brave enough to admit that <a title="openmethodology.org/blogs/information-development/2013/05/29/headaches-data-analysis-and-negativity-bias" href="http://mike2.openmethodology.org/blogs/information-development/2013/05/29/headaches-data-analysis-and-negativity-bias/" target="_blank">statistics don’t always lie</a>.</p>
<p> </p>
<h2>Related Posts</h2>
<table width="873" border="0" cellspacing="3" cellpadding="3">
<tbody>
<tr>
<td valign="top" width="435">
<p><a title="ocdqblog.com/home/the-need-for-data-philosophers.html" href="http://www.ocdqblog.com/home/the-need-for-data-philosophers.html">The Need for Data Philosophers</a></p>
<p><a title="ocdqblog.com/home/on-philosophy-science-and-data.html" href="http://www.ocdqblog.com/home/on-philosophy-science-and-data.html">On Philosophy, Science, and Data</a></p>
<p><a title="ocdqblog.com/home/demystifying-data-science.html" href="http://www.ocdqblog.com/home/demystifying-data-science.html">OCDQ Radio - Demystifying Data Science</a></p>
<p><a title="ocdqblog.com/home/data-quality-and-big-data.html" href="http://www.ocdqblog.com/home/data-quality-and-big-data.html">OCDQ Radio - Data Quality and Big Data </a></p>
<p><a title="ocdqblog.com/home/big-data-and-the-infinite-inbox.html" href="http://www.ocdqblog.com/home/big-data-and-the-infinite-inbox.html">Big Data and the Infinite Inbox</a></p>
<p><a title="ocdqblog.com/home/the-laugh-in-effect-of-big-data.html" href="http://www.ocdqblog.com/home/the-laugh-in-effect-of-big-data.html">The Laugh-In Effect of Big Data</a></p>
<p><a title="ocdqblog.com/home/hoardabytes-and-the-big-data-lebowski.html" href="http://www.ocdqblog.com/home/hoardabytes-and-the-big-data-lebowski.html">HoardaBytes and the Big Data Lebowski</a></p>
<p><a title="ocdqblog.com/home/magic-elephants-data-psychics-and-invisible-gorillas.html" href="http://www.ocdqblog.com/home/magic-elephants-data-psychics-and-invisible-gorillas.html">Magic Elephants, Data Psychics, and Invisible Gorillas</a></p>
<p><a title="ocdqblog.com/home/will-big-data-be-blinded-by-data-science.html" href="http://www.ocdqblog.com/home/will-big-data-be-blinded-by-data-science.html">Will Big Data be Blinded by Data Science?</a></p>
<p><a title="ocdqblog.com/home/the-graystone-effects-of-big-data.html" href="http://www.ocdqblog.com/home/the-graystone-effects-of-big-data.html">The Graystone Effects of Big Data</a></p>
<p><a title="ocdqblog.com/home/information-overload-revisited.html" href="http://www.ocdqblog.com/home/information-overload-revisited.html">Information Overload Revisited</a></p>
<p><a title="ocdqblog.com/home/exercise-better-data-management.html" href="http://www.ocdqblog.com/home/exercise-better-data-management.html">Exercise Better Data Management</a></p>
<p><a title="ocdqblog.com/home/a-tale-of-two-datas.html" href="http://www.ocdqblog.com/home/a-tale-of-two-datas.html">A Tale of Two Datas</a></p>
<p><a title="ocdqblog.com/home/our-increasingly-data-constructed-world.html" href="http://www.ocdqblog.com/home/our-increasingly-data-constructed-world.html">Our Increasingly Data-Constructed World</a></p>
<p><a title="ocdqblog.com/home/the-wisdom-of-crowds-friends-and-experts.html" href="http://www.ocdqblog.com/home/the-wisdom-of-crowds-friends-and-experts.html">The Wisdom of Crowds, Friends, and Experts</a></p>
</td>
<td valign="top" width="435">
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1196/Data-Separates-Science-from-Superstition.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1196/Data-Separates-Science-from-Superstition.aspx" target="_blank">Data Separates Science from Superstition</a></p>
<p><a class="offsite-link-inline" title="openmethodology.org/blogs/information-development/2013/05/29/headaches-data-analysis-and-negativity-bias" href="http://mike2.openmethodology.org/blogs/information-development/2013/05/29/headaches-data-analysis-and-negativity-bias/" target="_blank">Headaches, Data Analysis, and Negativity Bias</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1181/Why-Data-Science-Storytelling-Needs-a-Good-Editor.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1181/Why-Data-Science-Storytelling-Needs-a-Good-Editor.aspx" target="_blank">Why Data Science Storytelling Needs a Good Editor</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1187/Predictive-Analytics-the-Data-Effect-and-Jed-Clampett.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1187/Predictive-Analytics-the-Data-Effect-and-Jed-Clampett.aspx" target="_blank">Predictive Analytics, the Data Effect, and Jed Clampett</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1139/Rage-against-the-Machines-Learning.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1139/Rage-against-the-Machines-Learning.aspx" target="_blank">Rage against the Machines Learning</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1175/The-Flying-Monkeys-of-Big-Data.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1175/The-Flying-Monkeys-of-Big-Data.aspx" target="_blank">The Flying Monkeys of Big Data</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=12862" href="http://www.dataroundtable.com/?p=12862" target="_blank">Cargo Cult Data Science</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1149/Speed-Up-Your-Data-to-Slow-Down-Your-Decisions.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1149/Speed-Up-Your-Data-to-Slow-Down-Your-Decisions.aspx" target="_blank">Speed Up Your Data to Slow Down Your Decisions</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1123/It-s-Not-about-being-Data-Driven.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1123/It-s-Not-about-being-Data-Driven.aspx" target="_blank">It’s Not about being Data-Driven</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1129/Big-Data-Sporks-and-Decision-Frames.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1129/Big-Data-Sporks-and-Decision-Frames.aspx" target="_blank">Big Data, Sporks, and Decision Frames</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=10689" href="http://www.dataroundtable.com/?p=10689" target="_blank">Big Data: Structure and Quality</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1088/Darth-Vader-Big-Data-and-Predictive-Analytics.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1088/Darth-Vader-Big-Data-and-Predictive-Analytics.aspx" target="_blank">Darth Vader, Big Data, and Predictive Analytics</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1117/Big-Data-Predictive-Analytics-and-the-Ideal-Chronicler.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1117/Big-Data-Predictive-Analytics-and-the-Ideal-Chronicler.aspx" target="_blank">Big Data, Predictive Analytics, and the Ideal Chronicler</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=10993" href="http://www.dataroundtable.com/?p=10993" target="_blank">What Magic Tricks teach us about Data Science</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=10951" href="http://www.dataroundtable.com/?p=10951" target="_blank">What Mozart for Babies teaches us about Data Science</a></p>
</td>
</tr>
</tbody>
</table>

<p>
<a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.ocdqblog.com/home/i-blog-of-data-glad-and-big.html" data-text="i blog of Data glad and big #DataQuality #BigData #DataScience" data-count="vertical" data-via="ocdqblog">Tweet</a><script type="text/javascript" src="http://platform.twitter.com/widgets.js"></script>
<script type="text/javascript" src="http://platform.linkedin.com/in.js"></script><script type="in/share" data-url="http://www.ocdqblog.com/home/i-blog-of-data-glad-and-big.html" data-counter="top"></script>
</p>]]></description><wfw:commentRss>http://www.ocdqblog.com/home/rss-comments-entry-33856925.xml</wfw:commentRss></item><item><title>Business Analytics for Midsize Businesses</title><category>Big Data</category><category>Books</category><category>Business Intelligence</category><category>IBM for Midsize Business</category><category>Sponsored Blog Posts</category><dc:creator>Jim Harris</dc:creator><pubDate>Thu, 30 May 2013 22:30:00 +0000</pubDate><link>http://www.ocdqblog.com/home/business-analytics-for-midsize-businesses.html</link><guid isPermaLink="false">327252:3438475:33827845</guid><description><![CDATA[<p>As this <a title="opentracker.net/article/25-definitions-big-data" href="http://www.opentracker.net/article/25-definitions-big-data" target="_blank">growing list of definitions for big data</a> attests, big data evangelist and IBM thought leader James Kobielus rightfully warns that <a title="ibmbigdatahub.com/blog/big-data-danger-definitional-overkill" href="http://www.ibmbigdatahub.com/blog/big-data-danger-definitional-overkill" target="_blank">big data is in danger of definitional overkill</a>.  But most midsize business owners are less concerned about defining big data as they are about, as Laurie McCabe recently blogged, determining <a title="lauriemccabe.com/2013/04/30/is-big-data-relevant-for-smbs" href="http://goo.gl/ipWDb" target="_blank">whether big data is relevant</a> for their business.</p>
<p>“The fact of the matter is, <em>big</em> is a relative term,” McCabe explained, “relative to the amount of information that your organization needs to sift through to find the insights you need to operate the business more proactively and profitably.”</p>
<p>McCabe also noted that this is not just a problem for big businesses, since getting better insights from the data you already have is a challenge for businesses of all sizes.  Midsize businesses “may not be dealing with terabytes of data,” McCabe explained, “but many are finding that tools that used to suffice—such as Excel spreadsheets—fall short even when it comes to analyzing internal transactional databases.”  McCabe also provided recommendations for how midsize businesses can <a title="lauriemccabe.com/2013/05/10/putting-big-data-to-work-for-smbs" href="http://goo.gl/8yFbh" target="_blank">put big data to work</a>.</p>
<p>The recent IBM study <a title="public.dhe.ibm.com/software/info/television/advertising/sp/IBMBusinessAnalyticsCaseStudy-1-22-13.pdf" href="http://goo.gl/X54Gg" target="_blank"><em>The Case for Business Analytics in Midsize Firms</em></a> lists big data as one of the trends making a compelling case for the growing importance of business analytics for midsize businesses.  The study also noted that important functional data continues to live in departmental spreadsheets, and state-of-the-art business analytics solutions are needed to make it easy to pull all that data, along with data from other sources, together in a meaningful way.  Despite the common misconception that such solutions are too expensive for midsize businesses, solutions are now available that can deliver analytics capabilities to help overcome big data challenges without requiring a big upfront investment in hardware or software.</p>
<p>Phil Simon, author of <a title="amazon.com/Too-Big-Ignore-Business-Wiley/dp/1118638174" href="http://www.amazon.com/Too-Big-Ignore-Business-Wiley/dp/1118638174" target="_blank"><em>Too Big to Ignore: The Business Case for Big Data</em></a>, recently blogged about <a title="philsimon.com/blog/big-data/reporting-vs-analytics" href="http://www.philsimon.com/blog/big-data/reporting-vs-analytics/" target="_blank">reporting versus analytics</a>, explaining the essence of analytics is it goes beyond the <em>what</em> and <em>where</em> provided by reporting, and tries to explain the <em>why</em>.</p>
<p>Big data isn’t the only reason why analytics is becoming more of a necessity.  But with the barriers to what it costs and where it can be deployed becoming easier to overcome, business analytics is becoming more commonplace in midsize businesses.</p>
<p> </p>
<p><img style="float: left;" src="http://www.ocdqblog.com/storage/website-images/IBM%20Logo.jpg" alt="" width="188" height="76" border="0" /></p>
<p style="text-align: right;"><em>This post was written as part of the <a title="IBM Midsize Business Solutions" href="http://goo.gl/t3fgW" target="_blank">IBM for Midsize Business</a> program, which provides midsize businesses with the tools, expertise and solutions they need to become engines of a smarter planet. I’ve been compensated to contribute to this program, but the opinions expressed in this post are my own and don’t necessarily represent IBM’s positions, strategies, or opinions.</em></p>
<p> </p>
<p style="text-align: center;">To participate in the <strong>2013 IBM Big Data Study</strong> launching in June, register via the following link: <a title="https://www14.software.ibm.com/webapp/iwm/web/signup.do?source=csuite-NA&amp;S_PKG=ov12573" href="http://goo.gl/dkf0H" target="_blank">http://goo.gl/dkf0H</a></p>
<p> </p>
<h2>Related Posts</h2>
<table width="873" border="0" cellspacing="3" cellpadding="3">
<tbody>
<tr>
<td valign="top" width="435">
<p><a title="ocdqblog.com/home/the-big-datastillery.html" href="http://www.ocdqblog.com/home/the-big-datastillery.html">The Big Datastillery</a></p>
<p><a title="ocdqblog.com/home/smart-big-data-adoption-for-midsize-businesses.html" href="http://www.ocdqblog.com/home/smart-big-data-adoption-for-midsize-businesses.html">Smart Big Data Adoption for Midsize Businesses</a></p>
<p><a title="ocdqblog.com/home/big-data-is-not-just-for-big-businesses.html" href="http://www.ocdqblog.com/home/big-data-is-not-just-for-big-businesses.html">Big Data is not just for Big Businesses</a></p>
<p><a title="ocdqblog.com/home/big-data-lessons-from-orbitz.html" href="http://www.ocdqblog.com/home/big-data-lessons-from-orbitz.html">Big Data Lessons from Orbitz</a></p>
<p><a title="ocdqblog.com/home/the-graystone-effects-of-big-data.html" href="http://www.ocdqblog.com/home/the-graystone-effects-of-big-data.html">The Graystone Effects of Big Data</a></p>
<p><a title="ocdqblog.com/home/will-big-data-be-blinded-by-data-science.html" href="http://www.ocdqblog.com/home/will-big-data-be-blinded-by-data-science.html">Will Big Data be Blinded by Data Science?</a></p>
<p><a title="ocdqblog.com/home/the-laugh-in-effect-of-big-data.html" href="http://www.ocdqblog.com/home/the-laugh-in-effect-of-big-data.html">The Laugh-In Effect of Big Data</a></p>
<p><a title="ocdqblog.com/home/social-business-is-more-than-social-marketing.html" href="http://www.ocdqblog.com/home/social-business-is-more-than-social-marketing.html">Social Business is more than Social Marketing</a></p>
<p><a title="ocdqblog.com/home/social-media-marketing-from-monologues-to-dialogues.html" href="http://www.ocdqblog.com/home/social-media-marketing-from-monologues-to-dialogues.html">Social Media Marketing: From Monologues to Dialogues</a></p>
<p><a title="ocdqblog.com/home/social-media-for-midsize-businesses.html" href="http://www.ocdqblog.com/home/social-media-for-midsize-businesses.html">Social Media for Midsize Businesses</a></p>
</td>
<td valign="top" width="435">
<p><a title="ocdqblog.com/home/business-intelligence-for-midsize-businesses.html" href="http://www.ocdqblog.com/home/business-intelligence-for-midsize-businesses.html">Business Intelligence for Midsize Businesses</a></p>
<p><a title="ocdqblog.com/home/barriers-to-cloud-adoption.html" href="http://www.ocdqblog.com/home/barriers-to-cloud-adoption.html">Barriers to Cloud Adoption</a></p>
<p><a title="ocdqblog.com/home/leveraging-the-cloud-for-application-development.html" href="http://www.ocdqblog.com/home/leveraging-the-cloud-for-application-development.html">Leveraging the Cloud for Application Development</a></p>
<p><a title="ocdqblog.com/home/cloud-computing-for-midsize-businesses.html" href="http://www.ocdqblog.com/home/cloud-computing-for-midsize-businesses.html">Cloud Computing for Midsize Businesses</a></p>
<p><a title="ocdqblog.com/home/cloud-computing-is-the-new-nimbyism.html" href="http://www.ocdqblog.com/home/cloud-computing-is-the-new-nimbyism.html">Cloud Computing is the New Nimbyism</a></p>
<p><a title="ocdqblog.com/home/devising-a-mobile-device-strategy.html" href="http://www.ocdqblog.com/home/devising-a-mobile-device-strategy.html">Devising a Mobile Device Strategy</a></p>
<p><a title="ocdqblog.com/home/the-age-of-the-mobile-device.html" href="http://www.ocdqblog.com/home/the-age-of-the-mobile-device.html">The Age of the Mobile Device</a></p>
<p><a title="ocdqblog.com/home/word-of-mouth-has-become-word-of-data.html" href="http://www.ocdqblog.com/home/word-of-mouth-has-become-word-of-data.html">Word of Mouth has become Word of Data</a></p>
<p><a title="ocdqblog.com/home/information-asymmetry-versus-empowered-customers.html" href="http://www.ocdqblog.com/home/information-asymmetry-versus-empowered-customers.html">Information Asymmetry versus Empowered Customers</a></p>
<p><a title="ocdqblog.com/home/talking-business-about-the-weather.html" href="http://www.ocdqblog.com/home/talking-business-about-the-weather.html">Talking Business about the Weather</a></p>
</td>
</tr>
</tbody>
</table>

<p>
<a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.ocdqblog.com/home/business-analytics-for-midsize-businesses.html" data-text="Business Analytics for Midsize Businesses #BigData #Analytics #IBMsmb" data-count="vertical" data-via="ocdqblog">Tweet</a><script type="text/javascript" src="http://platform.twitter.com/widgets.js"></script>
<script type="text/javascript" src="http://platform.linkedin.com/in.js"></script><script type="in/share" data-url="http://www.ocdqblog.com/home/business-analytics-for-midsize-businesses.html" data-counter="top"></script>
</p>

<!-- Start of StatCounter Code for Default Guide -->
<script type="text/javascript">
var sc_project=7819984;
var sc_invisible=1;
var sc_security="b93f7b04";
</script>
<script type="text/javascript"
src="http://www.statcounter.com/counter/counter.js"></script>
<noscript><div class="statcounter"><a title="tumblr visitor"
href="http://statcounter.com/tumblr/" target="_blank"><img
class="statcounter"
src="http://c.statcounter.com/7819984/0/b93f7b04/1/"
alt="tumblr visitor"></a></div></noscript>
<!-- End of StatCounter Code for Default Guide -->]]></description><wfw:commentRss>http://www.ocdqblog.com/home/rss-comments-entry-33827845.xml</wfw:commentRss></item><item><title>The Need for Data Philosophers</title><category>Big Data</category><category>Books</category><category>Data Quality</category><category>Data Science</category><category>Debates</category><category>Philosophy</category><dc:creator>Jim Harris</dc:creator><pubDate>Thu, 16 May 2013 22:00:00 +0000</pubDate><link>http://www.ocdqblog.com/home/the-need-for-data-philosophers.html</link><guid isPermaLink="false">327252:3438475:33723146</guid><description><![CDATA[<p>In my post <a title="ocdqblog.com/home/on-philosophy-science-and-data.html" href="http://www.ocdqblog.com/home/on-philosophy-science-and-data.html"><em>On Philosophy, Science, and Data</em></a>, I explained that although some argue philosophy only reigns in the absence of data while science reigns in the analysis of data, a conceptual bridge still remains between analysis and insight, the crossing of which is itself a philosophical exercise.  Therefore, I argued that an endless oscillation persists between science and philosophy, which is why, despite the fact that all we hear about is the need for <strong>data scientists</strong>, there’s also a need for data philosophers.</p>
<p>Of course, the debate between science and philosophy is a very old one, as is the argument we need both.  In my previous post, I slightly paraphrased <a title="wikipedia.org/wiki/Immanuel_Kant" href="http://en.wikipedia.org/wiki/Immanuel_Kant" target="_blank">Immanuel Kant</a> (“perception without conception is blind and conception without perception is empty”) by saying that science without philosophy is blind and philosophy without science is empty.</p>
<p>In his book <a title="amazon.com/Cosmic-Apprentice-Dispatches-Edges-Science/dp/081668135X" href="http://www.amazon.com/Cosmic-Apprentice-Dispatches-Edges-Science/dp/081668135X" target="_blank"><em>Cosmic Apprentice: Dispatches from the Edges of Science</em></a>, Dorion Sagan explained that science and philosophy hang “in a kind of odd balance, watching each other, holding hands.  Science’s eye for detail, buttressed by philosophy’s broad view, makes for a kind of <a title="wikipedia.org/wiki/Alembic" href="http://en.wikipedia.org/wiki/Alembic" target="_blank">alembic</a>, an antidote to both.  Although philosophy isn’t fiction, it can be more personal, creative and open, a kind of counterbalance for science even as it argues that science, with its emphasis on a kind of impersonal materialism, provides a crucial reality check for philosophy and a tendency to over-theorize that’s inimical to the scientific spirit.  Ideally, in the search for truth, science and philosophy, the impersonal and autobiographical, can keep each other honest in a kind of open circuit.”</p>
<p>“Science’s spirit is philosophical,” Sagan concluded.  “It is the spirit of questioning, of curiosity, of critical inquiry combined with fact-checking.  It is the spirit of being able to admit you’re wrong, of appealing to data, not authority.”</p>
<p>“Science,” as his father <a title="wikipedia.org/wiki/Carl_Sagan" href="http://en.wikipedia.org/wiki/Carl_Sagan" target="_blank">Carl Sagan</a> said, “is a way of thinking much more than it is a body of knowledge.”  By extension, we could say that data science is about a way of thinking much more than it is about big data or <a title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1123/It-s-Not-about-being-Data-Driven.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1123/It-s-Not-about-being-Data-Driven.aspx" target="_blank">about being data-driven</a>.</p>
<p>I have previously blogged that science has always been about <a title="openmethodology.org/blogs/information-development/2013/04/30/bigger-questions-not-bigger-data" href="http://mike2.openmethodology.org/blogs/information-development/2013/04/30/bigger-questions-not-bigger-data/" target="_blank">bigger questions, not bigger data</a>.  As <a title="wikipedia.org/wiki/Claude_Lévi-Strauss" href="http://en.wikipedia.org/wiki/Claude_Lévi-Strauss" target="_blank">Claude Lévi-Strauss</a> said, “the scientist is not a person who gives the right answers, but one who asks the right questions.”  As far as data science goes, what are the right questions?  Data scientist <a title="melindathielbar.com/2013/03/17/three-questions-that-can-make-data-science-built-to-last" href="http://melindathielbar.com/2013/03/17/three-questions-that-can-make-data-science-built-to-last/" target="_blank">Melinda Thielbar proposes three key questions</a> (Actionable? Verifiable? Repeatable?).</p>
<p>Here again we see the interdependence of science and philosophy.  “Philosophy,” <a title="wikipedia.org/wiki/Marilyn_McCord_Adams" href="http://en.wikipedia.org/wiki/Marilyn_McCord_Adams" target="_blank">Marilyn McCord Adams</a> said, “is thinking really hard about the most important questions and trying to bring analytic clarity both to the questions and the answers.”</p>
<p>“Philosophy is critical thinking,” <a title="wikipedia.org/wiki/Don_Cupitt" href="http://en.wikipedia.org/wiki/Don_Cupitt" target="_blank">Don Cupitt</a> said. “Trying to become aware of how one’s own thinking works, of all the things one takes for granted, of the way in which one’s own thinking shapes the things one’s thinking about.”  Yes, even a data scientist’s own thinking could shape the things they are thinking scientifically about.  Big data evangelist <a title="ibmbigdatahub.com/blog/data-scientist-bias-backlash-and-brutal-self-criticism" href="http://www.ibmbigdatahub.com/blog/data-scientist-bias-backlash-and-brutal-self-criticism" target="_blank">James Kobielus recently blogged</a> about five biases that may crop up in a data scientist’s work (Cognitive, Selection, Sampling, Modeling, Funding).</p>
<p>“Data science has a bright future ahead,” explained <a title="mashable.com/2013/05/14/hilary-mason-data" href="http://mashable.com/2013/05/14/hilary-mason-data/" target="_blank">Hilary Mason in a recent interview</a>.  “There will only be more data, and more of a need for people who can find meaning and value in that data.  We’re also starting to see a greater need for <strong>data engineers</strong>, people to build infrastructure around data and algorithms, and <strong>data artists</strong>, people who can visualize the data.”</p>
<p>I agree with Mason, and I would add that we are also starting to see a greater need for <strong>data philosophers</strong>, people who can, borrowing the words that <a title="wikipedia.org/wiki/Anthony_Kenny" href="http://en.wikipedia.org/wiki/Anthony_Kenny" target="_blank">Anthony Kenny</a> used to define philosophy, “think as clearly as possible about the most fundamental concepts that reach through all the disciplines.”</p>
<p> </p>
<h2>Related Posts</h2>
<table width="873" border="0" cellspacing="3" cellpadding="3">
<tbody>
<tr>
<td valign="top" width="435">
<p><a title="ocdqblog.com/home/on-philosophy-science-and-data.html" href="http://www.ocdqblog.com/home/on-philosophy-science-and-data.html">On Philosophy, Science, and Data</a></p>
<p><a title="ocdqblog.com/home/demystifying-data-science.html" href="http://www.ocdqblog.com/home/demystifying-data-science.html">OCDQ Radio - Demystifying Data Science</a></p>
<p><a title="ocdqblog.com/home/data-quality-and-big-data.html" href="http://www.ocdqblog.com/home/data-quality-and-big-data.html">OCDQ Radio - Data Quality and Big Data </a></p>
<p><a title="ocdqblog.com/home/big-data-and-the-infinite-inbox.html" href="http://www.ocdqblog.com/home/big-data-and-the-infinite-inbox.html">Big Data and the Infinite Inbox</a></p>
<p><a title="ocdqblog.com/home/the-laugh-in-effect-of-big-data.html" href="http://www.ocdqblog.com/home/the-laugh-in-effect-of-big-data.html">The Laugh-In Effect of Big Data</a></p>
<p><a title="ocdqblog.com/home/hoardabytes-and-the-big-data-lebowski.html" href="http://www.ocdqblog.com/home/hoardabytes-and-the-big-data-lebowski.html">HoardaBytes and the Big Data Lebowski</a></p>
<p><a title="ocdqblog.com/home/magic-elephants-data-psychics-and-invisible-gorillas.html" href="http://www.ocdqblog.com/home/magic-elephants-data-psychics-and-invisible-gorillas.html">Magic Elephants, Data Psychics, and Invisible Gorillas</a></p>
<p><a title="ocdqblog.com/home/the-graystone-effects-of-big-data.html" href="http://www.ocdqblog.com/home/the-graystone-effects-of-big-data.html">The Graystone Effects of Big Data</a></p>
<p><a title="ocdqblog.com/home/information-overload-revisited.html" href="http://www.ocdqblog.com/home/information-overload-revisited.html">Information Overload Revisited</a></p>
<p><a title="ocdqblog.com/home/exercise-better-data-management.html" href="http://www.ocdqblog.com/home/exercise-better-data-management.html">Exercise Better Data Management</a></p>
<p><a title="ocdqblog.com/home/a-tale-of-two-datas.html" href="http://www.ocdqblog.com/home/a-tale-of-two-datas.html">A Tale of Two Datas</a></p>
<p><a title="ocdqblog.com/home/the-wisdom-of-crowds-friends-and-experts.html" href="http://www.ocdqblog.com/home/the-wisdom-of-crowds-friends-and-experts.html">The Wisdom of Crowds, Friends, and Experts</a></p>
</td>
<td valign="top" width="435">
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1181/Why-Data-Science-Storytelling-Needs-a-Good-Editor.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1181/Why-Data-Science-Storytelling-Needs-a-Good-Editor.aspx" target="_blank">Why Data Science Storytelling Needs a Good Editor</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1187/Predictive-Analytics-the-Data-Effect-and-Jed-Clampett.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1187/Predictive-Analytics-the-Data-Effect-and-Jed-Clampett.aspx" target="_blank">Predictive Analytics, the Data Effect, and Jed Clampett</a></p>
<p><a class="offsite-link-inline" title="openmethodology.org/blogs/information-development/2013/04/30/bigger-questions-not-bigger-data" href="http://mike2.openmethodology.org/blogs/information-development/2013/04/30/bigger-questions-not-bigger-data/" target="_blank">Bigger Questions, not Bigger Data</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1175/The-Flying-Monkeys-of-Big-Data.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1175/The-Flying-Monkeys-of-Big-Data.aspx" target="_blank">The Flying Monkeys of Big Data</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=12862" href="http://www.dataroundtable.com/?p=12862" target="_blank">Cargo Cult Data Science</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1149/Speed-Up-Your-Data-to-Slow-Down-Your-Decisions.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1149/Speed-Up-Your-Data-to-Slow-Down-Your-Decisions.aspx" target="_blank">Speed Up Your Data to Slow Down Your Decisions</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1123/It-s-Not-about-being-Data-Driven.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1123/It-s-Not-about-being-Data-Driven.aspx" target="_blank">It’s Not about being Data-Driven</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1129/Big-Data-Sporks-and-Decision-Frames.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1129/Big-Data-Sporks-and-Decision-Frames.aspx" target="_blank">Big Data, Sporks, and Decision Frames</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=10689" href="http://www.dataroundtable.com/?p=10689" target="_blank">Big Data: Structure and Quality</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1088/Darth-Vader-Big-Data-and-Predictive-Analytics.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1088/Darth-Vader-Big-Data-and-Predictive-Analytics.aspx" target="_blank">Darth Vader, Big Data, and Predictive Analytics</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=10993" href="http://www.dataroundtable.com/?p=10993" target="_blank">What Magic Tricks teach us about Data Science</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=10951" href="http://www.dataroundtable.com/?p=10951" target="_blank">What Mozart for Babies teaches us about Data Science</a></p>
</td>
</tr>
</tbody>
</table>

<p>
<a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.ocdqblog.com/home/the-need-for-data-philosophers.html" data-text="The Need for Data Philosophers #DataQuality #BigData #DataScience #FutureOfDS" data-count="vertical" data-via="ocdqblog">Tweet</a><script type="text/javascript" src="http://platform.twitter.com/widgets.js"></script>
<script type="text/javascript" src="http://platform.linkedin.com/in.js"></script><script type="in/share" data-url="http://www.ocdqblog.com/home/the-need-for-data-philosophers.html" data-counter="top"></script>
</p>]]></description><wfw:commentRss>http://www.ocdqblog.com/home/rss-comments-entry-33723146.xml</wfw:commentRss></item><item><title>Keep Looking Up Insights in Data</title><category>Books</category><category>Data Quality</category><category>Debates</category><category>Philosophy</category><dc:creator>Jim Harris</dc:creator><pubDate>Tue, 07 May 2013 14:00:00 +0000</pubDate><link>http://www.ocdqblog.com/home/keep-looking-up-insights-in-data.html</link><guid isPermaLink="false">327252:3438475:33610563</guid><description><![CDATA[<p>In a previous post, I used the history of the <a title="wikipedia.org/wiki/Hubble_Space_Telescope" href="http://en.wikipedia.org/wiki/Hubble_Space_Telescope" target="_blank">Hubble Space Telescope</a> to explain <a title="ocdqblog.com/home/how-data-cleansing-saves-lives.html" href="http://www.ocdqblog.com/home/how-data-cleansing-saves-lives.html">how data cleansing saves lives</a>, based on a true story I read in the book&nbsp;<a title="amazon.com/Space-Chronicles-Facing-Ultimate-Frontier/dp/0393082105" href="http://www.amazon.com/Space-Chronicles-Facing-Ultimate-Frontier/dp/0393082105" target="_blank"><em>Space Chronicles: Facing the Ultimate Frontier</em></a> by Neil deGrasse Tyson. &nbsp;In this post, Hubble and Tyson once again provide the inspiration for an insightful metaphor about data quality.</p>
<p>Hubble is one of dozens of space telescopes of assorted sizes and shapes orbiting the Earth. &nbsp;&ldquo;Each one,&rdquo; Tyson explained, &ldquo;provides a view of the cosmos that is unobstructed, unblemished, and undiminished by Earth&rsquo;s turbulent and murky atmosphere. &nbsp;They are designed to detect bands of light invisible to the human eye, some of which never penetrate Earth&rsquo;s atmosphere. &nbsp;Hubble is the first and only space telescope to observe the universe using primarily visible light. &nbsp;Its stunningly crisp, colorful, and detailed images of the cosmos make Hubble a kind of supreme version of the human eye in space.&rdquo;</p>
<p>This is how we&rsquo;d like the quality of data to be when we&rsquo;re looking for business insights. &nbsp;High-quality data provides stunningly crisp, colorful, and detailed images of the business cosmos, acting&nbsp;as a kind of supreme version of the human eye in data.</p>
<p>However, despite their less-than-perfect vision, the limitations of Earth-based telescopes still facilitated significant scientific breakthroughs long before Hubble became the first space telescope in 1990.</p>
<p>In 1609, when the Italian physicist and astronomer <a title="wikipedia.org/wiki/Galileo_Galilei" href="http://en.wikipedia.org/wiki/Galileo_Galilei" target="_blank">Galileo Galilei</a> turned a telescope of his own design to the sky, as Tyson explained, he &ldquo;heralded a new era of technology-aided discovery, whereby the capacities of the human senses could be extended, revealing the natural world in unprecedented, even heretical ways. &nbsp;The fact that Galileo revealed the Sun to have spots, the planet Jupiter to have satellites [its four moons: Callisto, Ganymede, Europa, Io], and Earth not to be the center of all celestial motion was enough to unsettle centuries of Aristotelian teachings by the Catholic Church and to put Galileo under house arrest.&rdquo;</p>
<p>And in 1964, another Earth-based telescope, this one operated by the American astronomers Arno Penzias and Robert Wilson at AT&amp;T Bell Labs, was responsible for what is widely considered the most important single discovery in astrophysics, what&rsquo;s now known as <a title="wikipedia.org/wiki/Cosmic_microwave_background_radiation" href="http://en.wikipedia.org/wiki/Cosmic_microwave_background_radiation" target="_blank">cosmic microwave background radiation</a>, and for which Penzias and Wilson won the 1978 Nobel Prize in Physics.</p>
<p>Recently, I&rsquo;ve blogged about how there are times&nbsp;<a title="ocdqblog.com/home/when-poor-data-quality-kills.html" href="http://www.ocdqblog.com/home/when-poor-data-quality-kills.html">when perfect data quality is necessary</a>, when we need the equivalent of a space telescope, and times <a title="ocdqblog.com/home/data-quality-and-the-ok-plateau.html" href="http://www.ocdqblog.com/home/data-quality-and-the-ok-plateau.html">when okay data quality is good enough</a>, when the equivalent of an Earth-based telescope will do.</p>
<p>What I would like you to take away from this post is that perfect data quality is not a prerequisite for the discovery of new business insights. &nbsp;Even when data doesn&rsquo;t provide a perfect view of the business cosmos, even when it&rsquo;s partially obstructed, blemished, or diminished by the turbulent and murky atmosphere of poor quality, data can still provide business insights.</p>
<p>This doesn&rsquo;t mean that you should settle for poor data quality, just that <a title="ocdqblog.com/home/to-our-data-perfectionists.html" href="http://www.ocdqblog.com/home/to-our-data-perfectionists.html">you shouldn’t demand perfection</a> before using data.</p>
<p>Tyson ends each episode of his <a title="startalkradio.net" href="http://www.startalkradio.net/" target="_blank">StarTalk Radio</a> program by saying &ldquo;keep looking up,&rdquo; so I will end this blog post by saying, even when its quality isn&rsquo;t perfect, keep looking up insights in data.</p>
<p>&nbsp;</p>
<h2>Related Posts</h2>
<table border="0" cellspacing="3" cellpadding="3" width="873">
<tbody>
<tr>
<td width="435" valign="top">
<p><a title="ocdqblog.com/home/data-quality-and-the-ok-plateau.html" href="http://www.ocdqblog.com/home/data-quality-and-the-ok-plateau.html">Data Quality and the OK Plateau</a></p>
<p><a title="ocdqblog.com/home/when-poor-data-quality-kills.html" href="http://www.ocdqblog.com/home/when-poor-data-quality-kills.html">When Poor Data Quality Kills</a></p>
<p><a title="ocdqblog.com/home/how-data-cleansing-saves-lives.html" href="http://www.ocdqblog.com/home/how-data-cleansing-saves-lives.html">How Data Cleansing Saves Lives</a></p>
<p><a title="ocdqblog.com/home/the-dichotomy-paradox-data-quality-and-zero-defects.html" href="http://www.ocdqblog.com/home/the-dichotomy-paradox-data-quality-and-zero-defects.html">The Dichotomy Paradox, Data Quality and Zero Defects</a></p>
<p><a title="ocdqblog.com/home/the-asymptote-of-data-quality.html" href="http://www.ocdqblog.com/home/the-asymptote-of-data-quality.html">The Asymptote of Data Quality</a></p>
<p><a title="ocdqblog.com/home/to-our-data-perfectionists.html" href="http://www.ocdqblog.com/home/to-our-data-perfectionists.html">To Our Data Perfectionists</a></p>
<p><a title="ocdqblog.com/home/data-quality-and-miracle-exceptions.html" href="http://www.ocdqblog.com/home/data-quality-and-miracle-exceptions.html">Data Quality and Miracle Exceptions</a></p>
<p><a title="ocdqblog.com/home/data-quality-quo-vadimus.html" href="http://www.ocdqblog.com/home/data-quality-quo-vadimus.html">Data Quality: Quo Vadimus?</a></p>
</td>
<td width="435" valign="top">
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=11810" href="http://www.dataroundtable.com/?p=11810" target="_blank">The Seventh Law of Data Quality</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=1711" href="http://www.dataroundtable.com/?p=1711" target="_blank">A Tale of Two Q&rsquo;s</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=1560" href="http://www.dataroundtable.com/?p=1560" target="_blank">Data Quality and The Middle Way</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=12780" href="http://www.dataroundtable.com/?p=12780" target="_blank">Stop Poor Data Quality STOP</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=8537" href="http://www.dataroundtable.com/?p=8537" target="_blank">Freudian Data Quality</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=3348" href="http://www.dataroundtable.com/?p=3348" target="_blank">Predictably Poor Data Quality</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=12719" href="http://www.dataroundtable.com/?p=12719" target="_blank">This isn&rsquo;t Jeopardy</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=8998" href="http://www.dataroundtable.com/?p=8998" target="_blank">Satisficing Data Quality</a></p>
</td>
</tr>
</tbody>
</table>

<p>
<a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.ocdqblog.com/home/keep-looking-up-insights-in-data.html" data-text="Keep Looking Up Insights in Data #DataQuality" data-count="vertical" data-via="ocdqblog">Tweet</a><script type="text/javascript" src="http://platform.twitter.com/widgets.js"></script>
<script type="text/javascript" src="http://platform.linkedin.com/in.js"></script><script type="in/share" data-url="http://www.ocdqblog.com/home/keep-looking-up-insights-in-data.html" data-counter="top"></script>
</p>]]></description><wfw:commentRss>http://www.ocdqblog.com/home/rss-comments-entry-33610563.xml</wfw:commentRss></item><item><title>Business Intelligence for Midsize Businesses</title><category>Books</category><category>Business Intelligence</category><category>Cloud</category><category>IBM for Midsize Business</category><category>IT</category><category>Lyndsay Wise</category><category>Mobile</category><category>Open Source</category><category>Sponsored Blog Posts</category><dc:creator>Jim Harris</dc:creator><pubDate>Tue, 30 Apr 2013 22:00:00 +0000</pubDate><link>http://www.ocdqblog.com/home/business-intelligence-for-midsize-businesses.html</link><guid isPermaLink="false">327252:3438475:33520806</guid><description><![CDATA[<p><em>Business intelligence</em> is one of those phrases that everyone agrees is something all organizations, regardless of their size, should be doing. &nbsp;After all, no organization would admit to doing&nbsp;<em>business stupidity</em>. &nbsp;Nor, I presume, would any vendor admit to selling it.</p>
<p>But not everyone seems to agree on what the phrase&nbsp;means. &nbsp;Personally, I have always defined business intelligence as the data analytics performed in support of making informed business decisions (i.e., for me, business intelligence = decision support).</p>
<p>Oftentimes, this analytics is performed on data integrated, cleansed, and consolidated into a repository (e.g., a data warehouse). &nbsp;Other times, it&rsquo;s performed on a single data set (e.g., a customer information file). &nbsp;Either way, business decision makers interact with the analytical results via static reports, data visualizations, dynamic dashboards, and ad hoc querying and reporting tools.</p>
<p>But robust business intelligence and analytics solutions used to be perceived as something only implemented by big businesses, as evinced in the big price tags usually associated with them. &nbsp;However, free and open source software,&nbsp;<a title="ocdqblog.com/home/cloud-computing-for-midsize-businesses.html" href="http://www.ocdqblog.com/home/cloud-computing-for-midsize-businesses.html">cloud computing</a>,&nbsp;<a title="ocdqblog.com/home/devising-a-mobile-device-strategy.html" href="http://www.ocdqblog.com/home/devising-a-mobile-device-strategy.html">mobile</a>, <a title="ocdqblog.com/home/social-business-is-more-than-social-marketing.html" href="http://www.ocdqblog.com/home/social-business-is-more-than-social-marketing.html">social</a>, and a variety of&nbsp;<a title="ocdqblog.com/home/a-swift-kick-in-the-aas.html" href="http://www.ocdqblog.com/home/a-swift-kick-in-the-aas.html">as-a-service</a> technologies drove&nbsp;<a title="ocdqblog.com/home/the-diffusion-of-the-consumerization-of-it.html" href="http://www.ocdqblog.com/home/the-diffusion-of-the-consumerization-of-it.html">the consumerization of IT</a>, driving down the costs of solutions,&nbsp;enabling small and midsize businesses to afford them. &nbsp;Additionally, the&nbsp;open data movement lead to a wealth of free public data sets that can be incorporated into business intelligence and analytics solutions (examples can be found at&nbsp;<a href="http://www.kdnuggets.com/datasets/" target="_blank">kdnuggets.com/datasets</a>).</p>
<p><a title="wiseanalytics.com" href="http://www.wiseanalytics.com/" target="_blank">Lyndsay Wise</a>, author of the insightful book&nbsp;<em><a title="wiseanalytics.com/book/book.php" href="http://wiseanalytics.com/book/book.php" target="_blank">Using Open Source Platforms for Business Intelligence</a></em> (to listen to a podcast about the book, click here:&nbsp;<a title="ocdqblog.com/home/open-source-business-intelligence.html" href="http://www.ocdqblog.com/home/open-source-business-intelligence.html">OSBI on OCDQ Radio</a>),&nbsp;recently blogged about&nbsp;<a title="wiseanalytics.com/blog/2013/03/24/the-time-is-ripe-for-business-intelligence-bi-for-smbs" href="http://www.wiseanalytics.com/blog/2013/03/24/the-time-is-ripe-for-business-intelligence-bi-for-smbs/" target="_blank">business intelligence for small and midsize businesses</a>.</p>
<p>Wise advised that &ldquo;recent market changes have shifted the market in favor of small and midsize businesses. &nbsp;Before this, most were limited by requirements for large infrastructures, high-cost licensing, and limited solution availability. &nbsp;With this newly added flexibility and access to lower price points, business intelligence and analytics solutions are no longer out of reach.&rdquo;</p>
<p>&nbsp;</p>
<p><img style="float: left;" src="http://www.ocdqblog.com/storage/website-images/IBM%20Logo.jpg" border="0" alt="" width="188" height="76" /></p>
<p style="text-align: right;"><em>This post was written as part of the <a title="IBM Midsize Business Solutions" href="http://goo.gl/t3fgW" target="_blank">IBM for Midsize Business</a> program, which provides midsize businesses with the tools, expertise and solutions they need to become engines of a smarter planet. I&rsquo;ve been compensated to contribute to this program, but the opinions expressed in this post are my own and don&rsquo;t necessarily represent IBM&rsquo;s positions, strategies, or opinions.</em></p>
<p>&nbsp;</p>
<h2>Related Posts</h2>
<table border="0" cellspacing="3" cellpadding="3" width="873">
<tbody>
<tr>
<td width="435" valign="top">
<p><a title="ocdqblog.com/home/the-big-datastillery.html" href="http://www.ocdqblog.com/home/the-big-datastillery.html">The Big Datastillery</a></p>
<p><a title="ocdqblog.com/home/smart-big-data-adoption-for-midsize-businesses.html" href="http://www.ocdqblog.com/home/smart-big-data-adoption-for-midsize-businesses.html">Smart Big Data Adoption for Midsize Businesses</a></p>
<p><a title="ocdqblog.com/home/big-data-is-not-just-for-big-businesses.html" href="http://www.ocdqblog.com/home/big-data-is-not-just-for-big-businesses.html">Big Data is not just for Big Businesses</a></p>
<p><a title="ocdqblog.com/home/big-data-lessons-from-orbitz.html" href="http://www.ocdqblog.com/home/big-data-lessons-from-orbitz.html">Big Data Lessons from Orbitz</a></p>
<p><a title="ocdqblog.com/home/the-graystone-effects-of-big-data.html" href="http://www.ocdqblog.com/home/the-graystone-effects-of-big-data.html">The Graystone Effects of Big Data</a></p>
<p><a title="ocdqblog.com/home/will-big-data-be-blinded-by-data-science.html" href="http://www.ocdqblog.com/home/will-big-data-be-blinded-by-data-science.html">Will Big Data be Blinded by Data Science?</a></p>
<p><a title="ocdqblog.com/home/social-business-is-more-than-social-marketing.html" href="http://www.ocdqblog.com/home/social-business-is-more-than-social-marketing.html">Social Business is more than Social Marketing</a></p>
<p><a title="ocdqblog.com/home/social-media-marketing-from-monologues-to-dialogues.html" href="http://www.ocdqblog.com/home/social-media-marketing-from-monologues-to-dialogues.html">Social Media Marketing: From Monologues to Dialogues</a></p>
<p><a title="ocdqblog.com/home/social-media-for-midsize-businesses.html" href="http://www.ocdqblog.com/home/social-media-for-midsize-businesses.html">Social Media for Midsize Businesses</a></p>
</td>
<td width="435" valign="top">
<p><a title="ocdqblog.com/home/barriers-to-cloud-adoption.html" href="http://www.ocdqblog.com/home/barriers-to-cloud-adoption.html">Barriers to Cloud Adoption</a></p>
<p><a title="ocdqblog.com/home/leveraging-the-cloud-for-application-development.html" href="http://www.ocdqblog.com/home/leveraging-the-cloud-for-application-development.html">Leveraging the Cloud for Application Development</a></p>
<p><a title="ocdqblog.com/home/cloud-computing-for-midsize-businesses.html" href="http://www.ocdqblog.com/home/cloud-computing-for-midsize-businesses.html">Cloud Computing for Midsize Businesses</a></p>
<p><a title="ocdqblog.com/home/cloud-computing-is-the-new-nimbyism.html" href="http://www.ocdqblog.com/home/cloud-computing-is-the-new-nimbyism.html">Cloud Computing is the New Nimbyism</a></p>
<p><a title="ocdqblog.com/home/devising-a-mobile-device-strategy.html" href="http://www.ocdqblog.com/home/devising-a-mobile-device-strategy.html">Devising a Mobile Device Strategy</a></p>
<p><a title="ocdqblog.com/home/the-age-of-the-mobile-device.html" href="http://www.ocdqblog.com/home/the-age-of-the-mobile-device.html">The Age of the Mobile Device</a></p>
<p><a title="ocdqblog.com/home/word-of-mouth-has-become-word-of-data.html" href="http://www.ocdqblog.com/home/word-of-mouth-has-become-word-of-data.html">Word of Mouth has become Word of Data</a></p>
<p><a title="ocdqblog.com/home/information-asymmetry-versus-empowered-customers.html" href="http://www.ocdqblog.com/home/information-asymmetry-versus-empowered-customers.html">Information Asymmetry versus Empowered Customers</a></p>
<p><a title="ocdqblog.com/home/talking-business-about-the-weather.html" href="http://www.ocdqblog.com/home/talking-business-about-the-weather.html">Talking Business about the Weather</a></p>
</td>
</tr>
</tbody>
</table>

<p>
<a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.ocdqblog.com/home/business-intelligence-for-midsize-businesses.html" data-text="Business Intelligence for Midsize Businesses #BusinessIntelligence #IBMsmb" data-count="vertical" data-via="ocdqblog">Tweet</a><script type="text/javascript" src="http://platform.twitter.com/widgets.js"></script>
<script type="text/javascript" src="http://platform.linkedin.com/in.js"></script><script type="in/share" data-url="http://www.ocdqblog.com/home/business-intelligence-for-midsize-businesses.html" data-counter="top"></script>
</p>

<!-- Start of StatCounter Code for Default Guide -->
<script type="text/javascript">
var sc_project=7819984;
var sc_invisible=1;
var sc_security="b93f7b04";
</script>
<script type="text/javascript"
src="http://www.statcounter.com/counter/counter.js"></script>
<noscript><div class="statcounter"><a title="tumblr visitor"
href="http://statcounter.com/tumblr/" target="_blank"><img
class="statcounter"
src="http://c.statcounter.com/7819984/0/b93f7b04/1/"
alt="tumblr visitor"></a></div></noscript>
<!-- End of StatCounter Code for Default Guide -->]]></description><wfw:commentRss>http://www.ocdqblog.com/home/rss-comments-entry-33520806.xml</wfw:commentRss></item><item><title>The Laugh-In Effect of Big Data</title><category>Big Data</category><category>Books</category><category>Data Quality</category><category>Humor</category><category>Philosophy</category><dc:creator>Jim Harris</dc:creator><pubDate>Tue, 23 Apr 2013 14:00:00 +0000</pubDate><link>http://www.ocdqblog.com/home/the-laugh-in-effect-of-big-data.html</link><guid isPermaLink="false">327252:3438475:33420175</guid><description><![CDATA[<p>Although I am an advocate for data science and big data done right, lately I have been <strong>sounding the Anti-Hype Horn</strong> with blog posts offering <a title="openmethodology.org/blogs/information-development/2013/03/28/a-contrarians-view-of-unstructured-data" href="http://mike2.openmethodology.org/blogs/information-development/2013/03/28/a-contrarians-view-of-unstructured-data/" target="_blank">a contrarian’s view of unstructured data</a>, forewarning you about <a title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1175/The-Flying-Monkeys-of-Big-Data.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1175/The-Flying-Monkeys-of-Big-Data.aspx" target="_blank">the flying monkeys of big data</a>, cautioning you against performing <a title="dataroundtable.com/?p=12862" href="http://www.dataroundtable.com/?p=12862" target="_blank">Cargo Cult Data Science</a>, and inviting you to ponder the perils of <a title="ocdqblog.com/home/big-data-and-the-infinite-inbox.html" href="http://www.ocdqblog.com/home/big-data-and-the-infinite-inbox.html">the Infinite Inbox</a>.</p>
<p><a title="ocdqblog.com/home/hoardabytes-and-the-big-data-lebowski.html" href="http://www.ocdqblog.com/home/hoardabytes-and-the-big-data-lebowski.html">The hype of big data</a> has resulted in a lot of people and vendors extolling its virtues with stories about how <a title="ocdqblog.com/home/the-data-cold-war.html" href="http://www.ocdqblog.com/home/the-data-cold-war.html">Internet companies</a>, <a title="technologyreview.com/featuredstory/508856/obamas-data-techniques-will-rule-future-elections" href="http://www.technologyreview.com/featuredstory/508856/obamas-data-techniques-will-rule-future-elections/" target="_blank">political campaigns</a>, and <a title="forbes.com/sites/oreillymedia/2012/02/07/apache-hadoop-what-you-need-to-know-about-this-important-big-data-tool" href="http://www.forbes.com/sites/oreillymedia/2012/02/07/apache-hadoop-what-you-need-to-know-about-this-important-big-data-tool/" target="_blank">new technologies</a> have profited, or otherwise benefited, from big data.  These stories are served up as alleged business cases for investing in big data and data science.  Although some of these stories are fluff pieces, many of them accurately, and in some cases comprehensively, describe a real-world application of big data and data science.  However, these messages most often lack a critically important component — <strong>applicability to your specific business</strong>.  In <a title="amazon.com/Made-Stick-Ideas-Survive-Others/dp/1400064287" href="http://www.amazon.com/Made-Stick-Ideas-Survive-Others/dp/1400064287" target="_blank"><em>Made to Stick: Why Some Ideas Survive and Others Die</em></a>, Chip Heath and Dan Heath explained that “an accurate but useless idea is still useless.  If a message can’t be used to make predictions or decisions, it is without value, no matter how accurate or comprehensive it is.”</p>
<p><a title="Wikipedia article about Rowan &amp; Martin’s Laugh-In" href="http://en.wikipedia.org/wiki/Rowan_%26_Martin%27s_Laugh-In" target="_blank"><em>Rowan &amp; Martin’s Laugh-In</em></a> was an American sketch comedy television series, which aired from 1968 to 1973.  One of the recurring characters portrayed by <a title="wikipedia.org/wiki/Arte_Johnson" href="http://en.wikipedia.org/wiki/Arte_Johnson" target="_blank">Arte Johnson</a> was Wolfgang the German soldier, who would often comment on the previous comedy sketch by saying (in a heavy and long-drawn-out German accent): “Very interesting . . . but stupid!”</p>
<p>From now on whenever someone shares another interesting story masquerading as a solid business case for big data that lacks any applicability beyond the specific scenario in the story, a common phenomenon I call <strong>The Laugh-In Effect of Big Data</strong>, my unapologetic response will resoundingly be: “Very interesting . . . but stupid!”</p>
<p> </p>
<h2>Related Posts</h2>
<table border="0" cellspacing="3" cellpadding="3" width="873">
<tbody>
<tr>
<td width="435" valign="top">
<p><a title="ocdqblog.com/home/on-philosophy-science-and-data.html" href="http://www.ocdqblog.com/home/on-philosophy-science-and-data.html">On Philosophy, Science, and Data</a></p>
<p><a title="ocdqblog.com/home/demystifying-data-science.html" href="http://www.ocdqblog.com/home/demystifying-data-science.html">OCDQ Radio - Demystifying Data Science</a></p>
<p><a title="ocdqblog.com/home/data-quality-and-big-data.html" href="http://www.ocdqblog.com/home/data-quality-and-big-data.html">OCDQ Radio - Data Quality and Big Data </a></p>
<p><a title="ocdqblog.com/home/big-data-and-the-infinite-inbox.html" href="http://www.ocdqblog.com/home/big-data-and-the-infinite-inbox.html">Big Data and the Infinite Inbox</a></p>
<p><a title="ocdqblog.com/home/hoardabytes-and-the-big-data-lebowski.html" href="http://www.ocdqblog.com/home/hoardabytes-and-the-big-data-lebowski.html">HoardaBytes and the Big Data Lebowski</a></p>
<p><a title="ocdqblog.com/home/will-big-data-be-blinded-by-data-science.html" href="http://www.ocdqblog.com/home/will-big-data-be-blinded-by-data-science.html">Will Big Data be Blinded by Data Science?</a></p>
<p><a title="ocdqblog.com/home/data-silence.html" href="http://www.ocdqblog.com/home/data-silence.html">Data Silence</a></p>
<p><a title="ocdqblog.com/home/magic-elephants-data-psychics-and-invisible-gorillas.html" href="http://www.ocdqblog.com/home/magic-elephants-data-psychics-and-invisible-gorillas.html">Magic Elephants, Data Psychics, and Invisible Gorillas</a></p>
<p><a title="ocdqblog.com/home/the-graystone-effects-of-big-data.html" href="http://www.ocdqblog.com/home/the-graystone-effects-of-big-data.html">The Graystone Effects of Big Data</a></p>
<p><a title="ocdqblog.com/home/big-data-el-memorioso.html" href="http://www.ocdqblog.com/home/big-data-el-memorioso.html">Big Data el Memorioso</a></p>
<p><a title="ocdqblog.com/home/information-overload-revisited.html" href="http://www.ocdqblog.com/home/information-overload-revisited.html">Information Overload Revisited</a></p>
<p><a title="ocdqblog.com/home/exercise-better-data-management.html" href="http://www.ocdqblog.com/home/exercise-better-data-management.html">Exercise Better Data Management</a></p>
<p><a title="ocdqblog.com/home/a-tale-of-two-datas.html" href="http://www.ocdqblog.com/home/a-tale-of-two-datas.html">A Tale of Two Datas</a></p>
<p><a title="http://www.ocdqblog.com/home/dot-collectors-and-dot-connectors.html" href="http://www.ocdqblog.com/home/dot-collectors-and-dot-connectors.html">Dot Collectors and Dot Connectors</a></p>
<p><a title="ocdqblog.com/home/the-wisdom-of-crowds-friends-and-experts.html" href="http://www.ocdqblog.com/home/the-wisdom-of-crowds-friends-and-experts.html">The Wisdom of Crowds, Friends, and Experts</a></p>
</td>
<td width="435" valign="top">
<p><a class="offsite-link-inline" title="openmethodology.org/blogs/information-development/2013/03/28/a-contrarians-view-of-unstructured-data/" href="http://mike2.openmethodology.org/blogs/information-development/2013/03/28/a-contrarians-view-of-unstructured-data/">A Contrarian’s View of Unstructured Data</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1175/The-Flying-Monkeys-of-Big-Data.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1175/The-Flying-Monkeys-of-Big-Data.aspx">The Flying Monkeys of Big Data</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=12862" href="http://www.dataroundtable.com/?p=12862">Cargo Cult Data Science</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=12194" href="http://www.dataroundtable.com/?p=12194" target="_blank">A Statistically Significant Resolution for 2013</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1149/Speed-Up-Your-Data-to-Slow-Down-Your-Decisions.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1149/Speed-Up-Your-Data-to-Slow-Down-Your-Decisions.aspx">Speed Up Your Data to Slow Down Your Decisions</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1139/Rage-against-the-Machines-Learning.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1139/Rage-against-the-Machines-Learning.aspx">Rage against the Machines Learning</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1123/It-s-Not-about-being-Data-Driven.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1123/It-s-Not-about-being-Data-Driven.aspx" target="_blank">It’s Not about being Data-Driven</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1129/Big-Data-Sporks-and-Decision-Frames.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1129/Big-Data-Sporks-and-Decision-Frames.aspx" target="_blank">Big Data, Sporks, and Decision Frames</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=10689" href="http://www.dataroundtable.com/?p=10689" target="_blank">Big Data: Structure and Quality</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1088/Darth-Vader-Big-Data-and-Predictive-Analytics.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1088/Darth-Vader-Big-Data-and-Predictive-Analytics.aspx" target="_blank">Darth Vader, Big Data, and Predictive Analytics</a></p>
<p><a class="offsite-link-inline" title="bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1117/Big-Data-Predictive-Analytics-and-the-Ideal-Chronicler.aspx" href="http://bigdata.pervasive.com/Blog/Big-Data-Blog/EntryId/1117/Big-Data-Predictive-Analytics-and-the-Ideal-Chronicler.aspx" target="_blank">Big Data, Predictive Analytics, and the Ideal Chronicler</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=10190" href="http://www.dataroundtable.com/?p=10190" target="_blank">The Big Data Theory</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=10144" href="http://www.dataroundtable.com/?p=10144" target="_blank">Swimming in Big Data</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=10993" href="http://www.dataroundtable.com/?p=10993" target="_blank">What Magic Tricks teach us about Data Science</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=10951" href="http://www.dataroundtable.com/?p=10951" target="_blank">What Mozart for Babies teaches us about Data Science</a></p>
</td>
</tr>
</tbody>
</table>

<p>
<a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.ocdqblog.com/home/the-laugh-in-effect-of-big-data.html" data-text="The Laugh-In Effect of Big Data #DataQuality #BigData" data-count="vertical" data-via="ocdqblog">Tweet</a><script type="text/javascript" src="http://platform.twitter.com/widgets.js"></script>
<script type="text/javascript" src="http://platform.linkedin.com/in.js"></script><script type="in/share" data-url="http://www.ocdqblog.com/home/the-laugh-in-effect-of-big-data.html" data-counter="top"></script>
</p>]]></description><wfw:commentRss>http://www.ocdqblog.com/home/rss-comments-entry-33420175.xml</wfw:commentRss></item><item><title>The Costs and Profits of Poor Data Quality</title><category>Data Quality</category><category>Debates</category><category>Philosophy</category><dc:creator>Jim Harris</dc:creator><pubDate>Tue, 16 Apr 2013 20:00:00 +0000</pubDate><link>http://www.ocdqblog.com/home/the-costs-and-profits-of-poor-data-quality.html</link><guid isPermaLink="false">327252:3438475:33393932</guid><description><![CDATA[<p>Continuing the theme of my two previous posts, which discussed <a title="ocdqblog.com/home/data-quality-and-the-ok-plateau.html" href="http://www.ocdqblog.com/home/data-quality-and-the-ok-plateau.html">when it’s okay to call data quality as good as it needs to get</a> and <a title="ocdqblog.com/home/when-poor-data-quality-kills.html" href="http://www.ocdqblog.com/home/when-poor-data-quality-kills.html">when perfect data quality is necessary</a>, in this post I want to briefly discuss the costs — and <a title="ocdqblog.com/home/promoting-poor-data-quality.html" href="http://www.ocdqblog.com/home/promoting-poor-data-quality.html">profits</a> — of poor data quality.</p>
<p>Loraine Lawson interviewed Ted Friedman of Gartner Research about <a title="itbusinessedge.com/interviews/how-to-measure-the-cost-of-data-quality-problems.html" href="http://www.itbusinessedge.com/interviews/how-to-measure-the-cost-of-data-quality-problems.html" target="_blank"><em>How to Measure the Cost of Data Quality Problems</em></a>, such as the costs associated with reduced productivity, redundancies, business processes breaking down because of data quality issues, <a title="ocdqblog.com/home/solvency-ii-and-data-quality.html" href="http://www.ocdqblog.com/home/solvency-ii-and-data-quality.html">regulatory compliance risks</a>, and lost business opportunities.  <a title="dataqualitybook.com/?p=300" href="http://dataqualitybook.com/?p=300" target="_blank">David Loshin blogged</a> about the challenge of estimating the cost of poor data quality, noting that many estimates, upon close examination, seem to rely exclusively on <a title="dataroundtable.com/?p=8384" href="http://www.dataroundtable.com/?p=8384" target="_blank">anecdotal evidence</a>.</p>
<p>A recent <em>Mental Floss</em> article recounted <a title="mentalfloss.com/article/49935/10-very-costly-typos" href="http://mentalfloss.com/article/49935/10-very-costly-typos" target="_blank"><em>10 Very Costly Typos</em></a>, including the 1962 $80 million dollar missing hyphen in the programming code that led to the destruction of the <a title="wikipedia.org/wiki/Mariner_1" href="http://en.wikipedia.org/wiki/Mariner_1" target="_blank">Mariner 1 spacecraft</a>, the 2007 Roswell, New Mexico car dealership promotion where instead of 1 out of 50,000 scratch lottery tickets revealing a $1,000 cash grand prize, <em>all of the tickets</em> were printed as grand-prize winners, which would have been a $50 million payout, but $250,000 in Walmart gift certificates were given out instead, and, more recently, the March 2013 typographical error in the price of pay-per-ride cards on 160,000 maps and posters that cost New York City’s Transportation Authority approximately $500,000.</p>
<p>Although we often only think about the costs of poor data quality, the article also shared some 2010 research performed by Harvard University claiming that Google profits an estimated $497 million dollars a year from people mistyping the names of popular websites and landing on <a title="wikipedia.org/wiki/Typosquatting" href="http://en.wikipedia.org/wiki/Typosquatting" target="_blank">typosquatter sites</a>, which just happen to be conveniently littered with Google ads.</p>
<p>Poor data quality has also long played an important role in improving Google Search, where misspellings of search terms entered by users (and not just <a title="ocdqblog.com/home/data-quality-and-the-cupertino-effect.html" href="http://www.ocdqblog.com/home/data-quality-and-the-cupertino-effect.html">a spellchecker program</a>) is leveraged by the algorithm providing the <em>Did you mean</em>, <em>Including results for</em>, and <em>Search instead for</em> help text displayed at the top of the first page of Google Search results.</p>
<p>What examples (or calculation methods) can you provide about either the costs or profits associated with poor data quality?</p>
<p> </p>
<h2>Related Posts</h2>
<table border="0" cellspacing="3" cellpadding="3" width="873">
<tbody>
<tr>
<td width="435" valign="top">
<p><a title="ocdqblog.com/home/promoting-poor-data-quality.html" href="http://www.ocdqblog.com/home/promoting-poor-data-quality.html">Promoting Poor Data Quality</a></p>
<p><a title="ocdqblog.com/home/data-quality-and-the-cupertino-effect.html" href="http://www.ocdqblog.com/home/data-quality-and-the-cupertino-effect.html">Data Quality and the Cupertino Effect</a></p>
<p><a title="ocdqblog.com/home/the-data-quality-wager.html" href="http://www.ocdqblog.com/home/the-data-quality-wager.html">The Data Quality Wager</a></p>
<p><a title="ocdqblog.com/home/data-quality-and-the-ok-plateau.html" href="http://www.ocdqblog.com/home/data-quality-and-the-ok-plateau.html">Data Quality and the OK Plateau</a></p>
<p><a title="ocdqblog.com/home/when-poor-data-quality-kills.html" href="http://www.ocdqblog.com/home/when-poor-data-quality-kills.html">When Poor Data Quality Kills</a></p>
<p><a title="ocdqblog.com/home/how-data-cleansing-saves-lives.html" href="http://www.ocdqblog.com/home/how-data-cleansing-saves-lives.html">How Data Cleansing Saves Lives</a></p>
<p><a title="ocdqblog.com/home/the-dichotomy-paradox-data-quality-and-zero-defects.html" href="http://www.ocdqblog.com/home/the-dichotomy-paradox-data-quality-and-zero-defects.html">The Dichotomy Paradox, Data Quality and Zero Defects</a></p>
<p><a title="ocdqblog.com/home/data-quality-and-miracle-exceptions.html" href="http://www.ocdqblog.com/home/data-quality-and-miracle-exceptions.html">Data Quality and Miracle Exceptions</a></p>
<p><a title="ocdqblog.com/home/data-quality-quo-vadimus.html" href="http://www.ocdqblog.com/home/data-quality-quo-vadimus.html">Data Quality: Quo Vadimus?</a></p>
<p><a title="ocdqblog.com/home/data-and-its-relationships-with-quality.html" href="http://www.ocdqblog.com/home/data-and-its-relationships-with-quality.html">Data and its Relationships with Quality</a></p>
</td>
<td width="435" valign="top">
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=11810" href="http://www.dataroundtable.com/?p=11810" target="_blank">The Seventh Law of Data Quality</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=1711" href="http://www.dataroundtable.com/?p=1711" target="_blank">A Tale of Two Q’s</a></p>
<p><a class="offsite-link-inline" title="Paleolithic Rhythm and Data Quality" href="http://www.dataroundtable.com/?p=7948" target="_blank">Paleolithic Rhythm and Data Quality</a></p>
<p><a class="offsite-link-inline" title="Groundhog Data Quality Day by Jim Harris on the Data Roundtable" href="http://www.dataroundtable.com/?p=6031" target="_blank">Groundhog Data Quality Day</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=1560" href="http://www.dataroundtable.com/?p=1560" target="_blank">Data Quality and The Middle Way</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=12780" href="http://www.dataroundtable.com/?p=12780" target="_blank">Stop Poor Data Quality STOP</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=12366" href="http://www.dataroundtable.com/?p=12366" target="_blank">When Poor Data Quality Calls</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=8537" href="http://www.dataroundtable.com/?p=8537" target="_blank">Freudian Data Quality</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=3348" href="http://www.dataroundtable.com/?p=3348" target="_blank">Predictably Poor Data Quality</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=8998" href="http://www.dataroundtable.com/?p=8998" target="_blank">Satisficing Data Quality</a></p>
</td>
</tr>
</tbody>
</table>

<p>
<a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.ocdqblog.com/home/the-costs-and-profits-of-poor-data-quality.html" data-text="The Costs and Profits of Poor Data Quality #DataQuality" data-count="vertical" data-via="ocdqblog">Tweet</a><script type="text/javascript" src="http://platform.twitter.com/widgets.js"></script>
<script type="text/javascript" src="http://platform.linkedin.com/in.js"></script><script type="in/share" data-url="http://www.ocdqblog.com/home/the-costs-and-profits-of-poor-data-quality.html" data-counter="top"></script>
</p>]]></description><wfw:commentRss>http://www.ocdqblog.com/home/rss-comments-entry-33393932.xml</wfw:commentRss></item><item><title>When Poor Data Quality Kills</title><category>Books</category><category>Data Matching</category><category>Data Quality</category><category>Debates</category><dc:creator>Jim Harris</dc:creator><pubDate>Tue, 09 Apr 2013 20:00:00 +0000</pubDate><link>http://www.ocdqblog.com/home/when-poor-data-quality-kills.html</link><guid isPermaLink="false">327252:3438475:33273670</guid><description><![CDATA[<p>In my <a title="ocdqblog.com/home/data-quality-and-the-ok-plateau.html" href="http://www.ocdqblog.com/home/data-quality-and-the-ok-plateau.html">previous post</a>, I made the argument that many times it’s okay to call data quality as good as it needs to get, as opposed to <a title="ocdqblog.com/home/to-our-data-perfectionists.html" href="http://www.ocdqblog.com/home/to-our-data-perfectionists.html">demanding data perfection</a>.  However, a balanced perspective demands acknowledging there are times when nothing less than perfect data quality is necessary.  In fact, there are times when poor data quality can have deadly consequences.</p>
<p>In his book <a title="amazon.com/Information-History-Theory-Flood/dp/0375423729" href="http://www.amazon.com/Information-History-Theory-Flood/dp/0375423729" target="_blank"><em>The Information: A History, a Theory, a Flood</em></a>, James Gleick explained “pharmaceutical names are a special case: a subindustry has emerged to coin them, research them, and vet them.  In the United States, the Food and Drug Administration reviews proposed drug names for possible collisions, and this process is complex and uncertain.  <strong>Mistakes cause death</strong>.”</p>
<p>“Methadone, for opiate dependence, has been administrated in place of Metadate, for attention-deficit disorder, and Taxcol, a cancer drug, for Taxotere, a different cancer drug, with fatal results.  Doctors fear both look-alike errors and sound-alike errors: Zantac/Xanax; Verelan/Virilon.  Linguists devise scientific measures of the <em>distance</em> between names.  But Lamictal and Lamisil and Ludiomil and Lomotil are all approved drug names.”</p>
<p>All <a title="ocdqblog.com/home/the-art-of-data-matching.html" href="http://www.ocdqblog.com/home/the-art-of-data-matching.html">data matching techniques</a>, such as edit distance functions, phonetic comparisons, and more complex algorithms, provide a way to represent (e.g., numeric probabilities, weighted percentages, odds ratios, etc.) the likelihood that two non-exact matching data items are the same.  No matter what data quality software vendors tell you, all data matching techniques are susceptible to <strong>false negatives</strong> (data that <em>did not match</em>, but <em>should have</em>) and <strong>false positives</strong> (data that <em>matched</em>, but <em>should not have</em>).</p>
<p>This pharmaceutical example is one case where a false positive could be deadly, a time when poor data quality kills.  Admittedly, this is an extreme example.  What other examples can you offer where perfect data quality is actually a necessity?</p>
<p> </p>
<h2>Related Posts</h2>
<table border="0" cellspacing="3" cellpadding="3" width="873">
<tbody>
<tr>
<td width="435" valign="top">
<p><a title="ocdqblog.com/home/data-quality-and-the-ok-plateau.html" href="http://www.ocdqblog.com/home/data-quality-and-the-ok-plateau.html">Data Quality and the OK Plateau</a></p>
<p><a title="ocdqblog.com/home/how-data-cleansing-saves-lives.html" href="http://www.ocdqblog.com/home/how-data-cleansing-saves-lives.html">How Data Cleansing Saves Lives</a></p>
<p><a title="ocdqblog.com/home/what-going-to-the-dentist-taught-me-about-data-quality.html" href="http://www.ocdqblog.com/home/what-going-to-the-dentist-taught-me-about-data-quality.html">What going to the dentist taught me about data quality</a></p>
<p><a title="ocdqblog.com/home/the-dichotomy-paradox-data-quality-and-zero-defects.html" href="http://www.ocdqblog.com/home/the-dichotomy-paradox-data-quality-and-zero-defects.html">The Dichotomy Paradox, Data Quality and Zero Defects</a></p>
<p><a title="ocdqblog.com/home/the-asymptote-of-data-quality.html" href="http://www.ocdqblog.com/home/the-asymptote-of-data-quality.html">The Asymptote of Data Quality</a></p>
<p><a title="ocdqblog.com/home/to-our-data-perfectionists.html" href="http://www.ocdqblog.com/home/to-our-data-perfectionists.html">To Our Data Perfectionists</a></p>
<p><a title="ocdqblog.com/home/data-quality-and-miracle-exceptions.html" href="http://www.ocdqblog.com/home/data-quality-and-miracle-exceptions.html">Data Quality and Miracle Exceptions</a></p>
<p><a title="ocdqblog.com/home/data-quality-quo-vadimus.html" href="http://www.ocdqblog.com/home/data-quality-quo-vadimus.html">Data Quality: Quo Vadimus?</a></p>
<p><a title="ocdqblog.com/home/data-and-its-relationships-with-quality.html" href="http://www.ocdqblog.com/home/data-and-its-relationships-with-quality.html">Data and its Relationships with Quality</a></p>
<p><a title="ocdqblog.com/home/dq-tip-there-is-no-such-thing-as-data-accuracy.html" href="http://www.ocdqblog.com/home/dq-tip-there-is-no-such-thing-as-data-accuracy.html">DQ-Tip: “There is no such thing as data accuracy...”</a></p>
</td>
<td width="435" valign="top">
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=11810" href="http://www.dataroundtable.com/?p=11810" target="_blank">The Seventh Law of Data Quality</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=1711" href="http://www.dataroundtable.com/?p=1711" target="_blank">A Tale of Two Q’s</a></p>
<p><a class="offsite-link-inline" title="Paleolithic Rhythm and Data Quality" href="http://www.dataroundtable.com/?p=7948" target="_blank">Paleolithic Rhythm and Data Quality</a></p>
<p><a class="offsite-link-inline" title="Groundhog Data Quality Day by Jim Harris on the Data Roundtable" href="http://www.dataroundtable.com/?p=6031" target="_blank">Groundhog Data Quality Day</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=1560" href="http://www.dataroundtable.com/?p=1560" target="_blank">Data Quality and The Middle Way</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=12780" href="http://www.dataroundtable.com/?p=12780" target="_blank">Stop Poor Data Quality STOP</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=12366" href="http://www.dataroundtable.com/?p=12366" target="_blank">When Poor Data Quality Calls</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=8537" href="http://www.dataroundtable.com/?p=8537" target="_blank">Freudian Data Quality</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=3348" href="http://www.dataroundtable.com/?p=3348" target="_blank">Predictably Poor Data Quality</a></p>
<p><a class="offsite-link-inline" title="dataroundtable.com/?p=8998" href="http://www.dataroundtable.com/?p=8998" target="_blank">Satisficing Data Quality</a></p>
</td>
</tr>
</tbody>
</table>

<p>
<a href="http://twitter.com/share" class="twitter-share-button" data-url="http://www.ocdqblog.com/home/when-poor-data-quality-kills.html" data-text="When Poor Data Quality Kills #DataQuality" data-count="vertical" data-via="ocdqblog">Tweet</a><script type="text/javascript" src="http://platform.twitter.com/widgets.js"></script>
<script type="text/javascript" src="http://platform.linkedin.com/in.js"></script><script type="in/share" data-url="http://www.ocdqblog.com/home/when-poor-data-quality-kills.html" data-counter="top"></script>
</p>]]></description><wfw:commentRss>http://www.ocdqblog.com/home/rss-comments-entry-33273670.xml</wfw:commentRss></item></channel></rss>