Why You Should Care About Quantile Regression

Why You Should Care About Quantile Regression
Related tools & artifacts:
Conference Paper: ASPLOS'13, March, 2013

Research has shown that correctly conducting and analysing computer performance experiments is difficult. This paper investigates what is necessary to conduct successful computer performance evaluation by attempting to repeat a prior experiment: the comparison between two Linux schedulers.

In our efforts, we found that exploring an experimental space through a series of incremental experiments can be inconclusive, and there may be no indication of how much experimentation will be enough. Analysis of variance (ANOVA), a traditional analysis method, is able to partly solve the problems with the previous approach, but we demonstrate that ANOVA can be insufficient for proper analysis due to the requirements it imposes on the data.

Finally, we demonstrate the successful application of quantile regression, a recent development in statistics, to computer performance experiments. Quantile regression can provide more insight into the experiment than ANOVA, with the additional benefit of being applicable to data from any distribution. This property makes it especially useful in our field, since non-normally distributed data is common in computer experiments.

@inproceedings{Oliveira13, author = {Born de Oliveira, Augusto and Fischmeister, Sebastian and Diwan, Amer and Hauswirth, Matthias and Sweeney, Peter F.}, title = {Why You Should Care About Quantile Regression}, booktitle = {Proceeding of the 18th international conference on Architectural support for programming languages and operating systems}, series = {ASPLOS '13}, year = {2013}, location = {Houston, TX, USA}, numpages = {11}, publisher = {ACM}, address = {New York, NY, USA}, keywords = {scheduling, latency, performance evaluation, ANOVA, Quantile Regression}, }