<  Back to the Polytechnique Montréal portal

Diagnosing performance variations by comparing multi-level execution traces

François Doray and Michel Dagenais

Article (2016)

Open Access document in PolyPublie
[img]
Preview
Open Access to the full text of this document
Accepted Version
Terms of Use: All rights reserved
Download (258kB)
Show abstract
Hide abstract

Abstract

Tracing allows the analysis of task interactions with each other and with the operating system. Locating performance problems in a trace is not trivial because of their large size. Furthermore, deep knowledge of all components of the observed system is required to decide whether observed behavior is normal. We introduce TraceCompare, a framework that automatically identifies differences between groups of executions of the same task at the user space and kernel levels. Many performance problems manifest themselves as variations that are easily identified by our framework. Our comparison algorithm takes into account all threads that affect the completion time of analyzed executions. Differences are correlated with application code to facilitate the correction of identified problems. Performance characteristics of task executions are represented by a new data structure called enhanced calling context tree (ECCT). We demonstrate the efficiency of our approach by presenting four case studies in which TraceCompare was used to uncover serious performance problems in enterprise and open source applications, without any prior knowledge of their codebase. We also show that the overhead of our tracing solution is between 0.2 and 9 percent depending on the type of application.

Uncontrolled Keywords

Performance analysis, tracing, software visualization,concurrency, operating systems

Subjects: 2700 Information technology > 2706 Software engineering
2700 Information technology > 2720 Computer systems software
Department: Department of Computer Engineering and Software Engineering
Funders: CRSNG/NSERC
Grant number: CRDPJ468687-14
PolyPublie URL: https://publications.polymtl.ca/2961/
Journal Title: IEEE Transactions on Parallel and Distributed Systems (vol. 28, no. 2)
Publisher: IEEE
DOI: 10.1109/tpds.2016.2567390
Official URL: https://doi.org/10.1109/tpds.2016.2567390
Date Deposited: 01 Feb 2018 11:00
Last Modified: 08 Apr 2024 13:24
Cite in APA 7: Doray, F., & Dagenais, M. (2016). Diagnosing performance variations by comparing multi-level execution traces. IEEE Transactions on Parallel and Distributed Systems, 28(2), 462-474. https://doi.org/10.1109/tpds.2016.2567390

Statistics

Total downloads

Downloads per month in the last year

Origin of downloads

Dimensions

Repository Staff Only

View Item View Item