Cookies help us display personalized product recommendations and ensure you have great shopping experience.

By using this site, you agree to the Privacy Policy and Terms of Use.
Accept
SmartData CollectiveSmartData Collective
  • Analytics
    AnalyticsShow More
    data analytics in ecommerce
    Analytics Technology Drives Conversions for Your eCommerce Site
    5 Min Read
    CRM Analytics
    CRM Analytics Helps Content Creators Develop an Edge in a Saturated Market
    5 Min Read
    data analytics and commerce media
    Leveraging Commerce Media & Data Analytics in Ecommerce
    8 Min Read
    big data in healthcare
    Leveraging Big Data and Analytics to Enhance Patient-Centered Care
    5 Min Read
    instagram visibility
    Data Analytics Plays a Key Role in Improving Instagram Visibility
    7 Min Read
  • Big Data
  • BI
  • Exclusive
  • IT
  • Marketing
  • Software
Search
© 2008-23 SmartData Collective. All Rights Reserved.
Reading: Data Governance Begins at the Spreadsheet
Share
Notification Show More
Font ResizerAa
SmartData CollectiveSmartData Collective
Font ResizerAa
Search
  • About
  • Help
  • Privacy
Follow US
© 2008-23 SmartData Collective. All Rights Reserved.
SmartData Collective > Data Management > Best Practices > Data Governance Begins at the Spreadsheet
AnalyticsBest PracticesBig DataBusiness IntelligenceData ManagementPolicy and Governance

Data Governance Begins at the Spreadsheet

boblambert12
Last updated: May 3, 2013 5:36 pm
boblambert12
6 Min Read
data spreadsheet
SHARE

Data management professionals have long and sometimes rather Quixotically driven organizations to “get past the spreadsheet culture.” Maybe that’s misguided. The recent furor over a widely read social science paper may show how we can look to scientific peer review for a way to govern data, spreadsheets and all.

data spreadsheetRecently, it was found that a key study underpinning debt-reduction as a driver of economic growth based its conclusions on a flawed spreadsheet. As this ArsTechnica article describes, Carmen Reinhart and Kenneth Rogoff’s Growth in a Time of Debt seemingly proved a connection between “high levels of debt and negative average economic growth”. But, per a recent study by Thomas Herndon, Michael Ash, and Robert Pollin, it turns out that the study’s conclusions drew from a Microsoft Excel formula mistake, questionable data exclusions, and non-standard weightings of base data. The ArsTechnica piece finds those conclusions fade to a more ambiguous outcome with errors and apparent biases corrected.

Data management professionals have long and sometimes rather Quixotically driven organizations to “get past the spreadsheet culture.” Maybe that’s misguided. The recent furor over a widely read social science paper may show how we can look to scientific peer review for a way to govern data, spreadsheets and all.

data spreadsheetRecently, it was found that a key study underpinning debt-reduction as a driver of economic growth based its conclusions on a flawed spreadsheet. As this ArsTechnica article describes, Carmen Reinhart and Kenneth Rogoff’s Growth in a Time of Debt seemingly proved a connection between “high levels of debt and negative average economic growth”. But, per a recent study by Thomas Herndon, Michael Ash, and Robert Pollin, it turns out that the study’s conclusions drew from a Microsoft Excel formula mistake, questionable data exclusions, and non-standard weightings of base data. The ArsTechnica piece finds those conclusions fade to a more ambiguous outcome with errors and apparent biases corrected.

More Read

Should the Entire Internet Be Encrypted?

5 Reasons Why Small and Medium-Sized Businesses Should Take Data Protection More Seriously
Optimizing Health Care With Big Data
Reinventing the BI Solution You Already Have – A Series of Unfortunate Data Warehousing/Business Intelligence Events #1
Big Data Makes Custom Decals More Useful Marketing Tools

I’m not trying to make a political point here, but clearly this mistake must be distressing to politicians who cited the study as a basis for economic policy proposals. It is equally a cautionary tale to those in business whose complex spreadsheets drive their analyses, plans, and decisions.

As Jim King astutely points out, the spreadsheet is today’s de facto business analysis tool of choice due to its “low technical requirement, intuitive and flexible calculation capability, and business-expert-oriented easy solution to 80% of BI problems”. In my experience, business people view advanced BI and data visualization projects as data delivery platforms for Excel. “Can I download that report into Excel” might be the most-asked question in BI presentations to end-users. How can organizations address the risk that they might base big decisions on invalid spreadsheets?

An excerpt from the title page of Growth in a Time of Debt, published as an National Bureau of Economic Research “working paper” points the way forward:

“NBER working papers are circulated for discussion and comment purposes. They have not been peer reviewed or been subject to the review by the NBER Board of Directors that accompanies official NBER publications.”

Scientific papers published in reputable journals endure rigorous peer review, in which editors distribute the submitted draft to peer scientists for evaluation and comment. Reviews are detailed and comments can be harsh, sometimes calling for the authors to put in months of rework before resubmission. Growth in a Time of Debt hadn’t been through that process, and shouldn’t have been relied upon yet as a guide for policy makers.

The moral for business is this: make peer review a part of key spreadsheet analyses. Every important spreadsheet should undergo review by three or four peer analysts, and be corrected according to the results of their review before use as a basis for decision-making. Here is a list adapted from one description of peer review (see page 8, here) that spreadsheet reviewers might use:

  • Are the question or questions answered by the spreadsheet clear?
  • Was the approach appropriate?
  • Does the spreadsheet integrate data from appropriate sources?
  • Are the spreadsheet design, methods and analyses appropriate to the question being studied?
  • Does the spreadsheet add to existing knowledge, or does it repeat other previous documents that might have answered the same questions?
  • Are the methods described clearly enough for reviewers to understand and replicate?
  • Are calculations, statistical analyses, and levels of significance appropriate and correct?
  • Could presentation of the results be improved and do they answer the question?
  • If non-public information was involved, was ethics approval gained and was the analysis ethical?
  • Are the conclusions appropriate?

The interesting thing about these review questions is that they beg larger ones, which is where data governance comes in. Instead of bemoaning persistent business dependence on “spreadmarts”, data governance advocates should forget about the tool and focus on data practices by helping the business define standards for data usage. What are appropriate data sources for a spreadsheet? What is ethical use of non-public information? What are appropriate calculations, statistical analyses, and level of significance? And so on.

Maybe data governance will take hold in more organizations when it starts promoting age-old practices applied in science to the spreadsheets upon which business depends.

(image: spreadsheet / shutterstock)

TAGGED:spreadsheet
Share This Article
Facebook Twitter Pinterest LinkedIn
Share

Follow us on Facebook

Latest News

trusted data management
The Future of Trusted Data Management: Striking a Balance between AI and Human Collaboration
Artificial Intelligence Big Data Data Management
data analytics in ecommerce
Analytics Technology Drives Conversions for Your eCommerce Site
Analytics Exclusive
data grids in big data apps
Best Practices for Integrating Data Grids into Data-Intensive Apps
Big Data Exclusive
AI helps create discord server bots
AI-Driven Discord Bots Can Track Server Stats
Artificial Intelligence Exclusive

Stay Connected

1.2kFollowersLike
33.7kFollowersFollow
222FollowersPin

You Might also Like

The Road to Self-Service BI

2 Min Read
BI tools
AnalyticsBest PracticesBig DataBusiness IntelligenceCulture/LeadershipData ManagementJobs

Could Business Computing Be Done by Users Without Technical Experience?

4 Min Read

Desktop Spreadsheet Alternatives to Know

7 Min Read

Hacking the Budget

6 Min Read

SmartData Collective is one of the largest & trusted community covering technical content about Big Data, BI, Cloud, Analytics, Artificial Intelligence, IoT & more.

ai chatbot
The Art of Conversation: Enhancing Chatbots with Advanced AI Prompts
Chatbots
giveaway chatbots
How To Get An Award Winning Giveaway Bot
Big Data Chatbots Exclusive

Quick Link

  • About
  • Contact
  • Privacy
Follow US
© 2008-24 SmartData Collective. All Rights Reserved.
Go to mobile version
Welcome Back!

Sign in to your account

Username or Email Address
Password

Lost your password?