OCDQ Blog
  • Home
  • Blog
  • Podcast
  • Best of OCDQ
  • Published Articles
  • OCDQ Jim Harris Testimonials
  • Contact
  • RSS

OCDQ Blog

  • Home/
  • Blog/
  • Podcast/
  • Best of OCDQ/
  • Published Articles/
  • About/
    • OCDQ
    • Jim Harris
    • Testimonials
  • Contact/
  • RSS/
iStock_000032135210Large.png

OCDQ Blog

Obsessive-Compulsive Data Quality by Jim Harris

OCDQ Blog

Obsessive-Compulsive Data Quality by Jim Harris

OCDQ Blog

  • Home/
  • Blog/
  • Podcast/
  • Best of OCDQ/
  • Published Articles/
  • About/
    • OCDQ
    • Jim Harris
    • Testimonials
  • Contact/
  • RSS/
June 25, 2012

Metadata, Data Quality, and the Stroop Test

June 25, 2012/ Jim Harris

In psychology, the Stroop Effect is a demonstration of the reaction time of a task.  The most commonly used example is what is known as the Stroop Test, which compares the time needed to name colors when they are printed in an ink color that matches their name (e.g., green, yellow, red, blue, brown, purple) with the time needed to name the same colors when they are printed in an ink color that does not match their name (e.g., blue, red, purple, green, brown, yellow).  Naming the color of the word takes longer, and is more prone to errors, when the ink color does not match the name of the color.

The Stroop Test, where colors do not match their names, reminds me of the relationship between metadata and data quality if I view the ink color as the metadata and the name of the color as the data, given that understanding data takes longer, and is more prone to errors, when the metadata does not match the data, or when the metadata is ambiguous.

Unlike the Stroop Test, where poor metadata (ink color) obfuscates good data (name of the color), data quality issues can also be caused when good metadata is undermined by poor data (e.g., data entry errors like an email address being entered into a postal address field).  And, of course, even when the entered data matches the metadata (or automatic data-to-metadata matching is enabled by drop-down boxes), more insidious data quality issues can be caused by the complex challenge of data accuracy.

Additionally, the point of view paradox can turn data quality debates about fitness for the purpose of use even more colorful than the Stroop Test, such as when data that one user sees as red and green, another user sees as crimson and chartreuse.

But hopefully we can all agree that good data quality begins with good metadata, because better metadata makes data better.

 

Related Posts

You Say Potato and I Say Tater Tot

The Metadata Continuum

The Metadata Crisis

Let’s Meta a Data

What’s the Meta with your Data?

DQ-View: MetaData makes BettahMusic

Who Framed Data Entry?

Data Quality and the Cupertino Effect

DQ-Tip: “There is no such thing as data accuracy...”

DQ-Tip: “Data quality is primarily about context not accuracy...”

DQ-BE: Data Quality Airlines

Data Quality and the Q Test

Tweet

June 25, 2012/ Jim Harris/ 3 Comments
Data Quality, Debates
Accuracy, Metadata

Jim Harris

  • Big Data Lessons from Orbitz
  • The Return of the Dumb Terminal
  • Home/
  • Blog/
  • Podcast/
  • Best of OCDQ/
  • Published Articles/
  • About/
    • OCDQ
    • Jim Harris
    • Testimonials
  • Contact/
  • RSS/

OCDQ Blog

Obsessive-Compulsive Data Quality (OCDQ) is a blog offering a vendor-neutral perspective on data quality and its related disciplines.

Jim Harris

Jim Harris is the OCDQ Blogger-in-Chief.

Home Blog Podcast Videos Best of OCDQ Published Articles About Contact

© 2022, Jim Harris.

Powered by Squarespace