Paul L. Caron
Dean


Sunday, September 22, 2019

How Excel's Automatic Data Formatting Can Cause Errors In Published Research

Washington Post, An Alarming Number of Scientific Papers Contain Excel Errors:

ExcelA surprisingly high number of scientific papers in the field of genetics contain errors introduced by Microsoft Excel, according to an analysis recently published in the journal Genome Biology.

A team of Australian researchers analyzed nearly 3,600 genetics papers published in a number of leading scientific journals — like Nature, Science and PLoS One. As is common practice in the field, these papers all came with supplementary files containing lists of genes used in the research.

The Australian researchers found that roughly 1 in 5 of these papers included errors in their gene lists that were due to Excel automatically converting gene names to things like calendar dates or random numbers. ...

Gene[T]he researchers note that there's no way to permanently disable automatic date formatting within Excel. Researchers still have to remember to manually format columns to "Text" before you type anything in new Excel sheets — every. single. time. ...

Genetics isn't the only field where a life's work can potentially be undermined by a spreadsheet error. Harvard economists Carmen Reinhart and Kenneth Rogoff famously made an Excel goof — omitting a few rows of data from a calculation — that caused them to drastically overstate the negative GDP impact of high debt burdens. Researchers in other fields occasionally have to issue retractions after finding Excel errors as well. ...

[O]ne perfectly free spreadsheet program did not have any issues storing the gene names as typed — Google Sheets.

For the time being, the only fix for the issue is for researchers and journal editors to remain vigilant when working with their data files. Even better, they could abandon Excel completely in favor of programs and languages that were built for statistical research, like R and Python.

https://taxprof.typepad.com/taxprof_blog/2019/09/how-excels-automatic-data-formatting-can-cause-errors-in-published-research.html

Legal Ed News, Legal Ed Scholarship, Legal Education | Permalink

Comments

Classic problems: researchers not fully understanding the tools they are using and how that impacts their work, and not carefully proofing what they publish.

Posted by: ruralcounsel | Sep 23, 2019 3:37:35 AM

If you have a brain, you know to take studies with a grain of salt. They're outcome-oriented and the data can be manipulated to say what you want.

Reminds me of that study where they mixed up the numbers and it was the Democrats who were unhinged (instead of Republicans). Oopsies.

Posted by: Anon | Sep 23, 2019 3:04:52 PM