Producing Wrong Data Without Doing Anything Obviously Wrong!

Related tools & artifacts:

Accuracy

Conference Paper: ASPLOS '09, March 2009

Todd Mytkowicz, Amer Diwan, Matthias Hauswirth, Peter Sweeney

http://dx.doi.org/10.1145/1508244.1508275

This paper presents a surprising result: changing a seemingly innocuous aspect of an experimental setup can result in a systems researcher drawing wrong conclusions from an experiment. What appears to be an innocuous aspect in the experimental setup may in fact introduce a significant bias in an evaluation. For example, consider an experiment to determine if idea I is beneficial for system S. If the systems researcher measures S and S+I in an experimental setup that is biased towards S+I, she may conclude that I is beneficial even when it is not. This phenomenon is called measurement bias in the natural and social sciences.

Our results demonstrate that measurement bias is significant and commonplace. By significant we mean that measurement bias can lead to an incorrect conclusion. By commonplace we mean that measurement bias occurs in all architectures that we tried (Pentium 4, Core 2, and m5 O3CPU), all compilers that we tried (gcc and Intel's C compiler), and all of the SPEC CPU2006 C programs. Thus, we cannot ignore measurement bias. Nevertheless, in a literature survey of 133 recent papers from ASPLOS, PACT, PLDI, and CGO, we determined that none of the papers with experimental results adequately consider measurement bias.

Inspired by similar problems and their solutions in other sciences, we describe and demonstrate two methods, one for detecting (causal analysis) and one for avoiding (setup randomization) measurement bias.
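As a rough illustration of the setup-randomization idea, the sketch below compares S and S+I across many randomly chosen setups instead of a single one, then summarizes the speedups with a mean and an approximate 95% confidence interval. Everything in it is an assumption made for illustration: `simulated_runtime` is a synthetic stand-in for a real measurement, the environment size is used only as an example of a seemingly innocuous setup factor, and the function names are hypothetical. In this toy model idea I has no real effect, yet a single setup can make it look like a 2% gain or a 1.5% loss.

```python
import math
import random
import statistics


def simulated_runtime(with_idea: bool, env_bytes: int) -> float:
    """Synthetic stand-in for measuring S (with_idea=False) or S+I (True).

    Hypothetical model: the setup factor (env_bytes) perturbs the two
    binaries differently, while idea I itself changes nothing.
    """
    base = 100.0
    slot = env_bytes // 64
    if with_idea:
        slot += 3  # S+I reacts to the same setup with a different phase
    bias = 3.0 * ((slot % 7) / 6.0)  # setup-induced perturbation, 0..3
    return base + bias


def speedup_in_setup(env_bytes: int) -> float:
    """Speedup of S+I over S measured in one fixed setup."""
    return simulated_runtime(False, env_bytes) / simulated_runtime(True, env_bytes)


def randomized_speedups(n_setups: int, rng: random.Random) -> list:
    """Measure the speedup in n_setups randomly drawn setups."""
    return [speedup_in_setup(rng.randrange(0, 4096)) for _ in range(n_setups)]


def mean_with_ci(samples, z: float = 1.96):
    """Sample mean and a normal-approximation confidence interval."""
    mean = statistics.fmean(samples)
    half_width = z * statistics.stdev(samples) / math.sqrt(len(samples))
    return mean, mean - half_width, mean + half_width


if __name__ == "__main__":
    # Two single-setup experiments that reach opposite conclusions.
    print("env=256 bytes:", round(speedup_in_setup(256), 4))
    print("env=0 bytes:  ", round(speedup_in_setup(0), 4))

    # The same comparison over many randomized setups.
    samples = randomized_speedups(200, random.Random(0))
    mean, lo, hi = mean_with_ci(samples)
    print("mean speedup:", round(mean, 4), "CI:", (round(lo, 4), round(hi, 4)))
```

In a real experiment, `simulated_runtime` would be replaced by actual timed runs, and the randomized factor would be whatever aspects of the setup are suspected of introducing bias. The point of the sketch is only that a conclusion should rest on the distribution of results over setups rather than on one arbitrary setup.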

@inproceedings{Mytkowicz09,
  author = {Mytkowicz, Todd and Diwan, Amer and Hauswirth, Matthias and Sweeney, Peter F.},
  title = {Producing wrong data without doing anything obviously wrong!},
  booktitle = {Proceeding of the 14th international conference on Architectural support for programming languages and operating systems},
  series = {ASPLOS '09},
  year = {2009},
  isbn = {978-1-60558-406-5},
  location = {Washington, DC, USA},
  pages = {265--276},
  numpages = {12},
  url = {http://doi.acm.org/10.1145/1508244.1508275},
  doi = {http://doi.acm.org/10.1145/1508244.1508275},
  acmid = {1508275},
  publisher = {ACM},
  address = {New York, NY, USA},
  keywords = {bias, measurement, performance},
}
