Enhanced state history tree (eSHT): a stateful data structure for analysis of highly parallel system traces

Loïc Prieur-Drevon, Raphaël Beamonte, Naser Ezzati-Jivan and Michel Dagenais

Conference paper (2016)

Open access document in PolyPublie

Open access to the full text of this document
Final version before publication
Terms of use: All rights reserved

Abstract

The behavior of distributed systems with many cores and/or many threads is difficult to understand. This is why dynamic analysis tools such as tracers are useful to collect run-time data and help programmers debug and optimize complex programs. However, manually analyzing very large traces with billions of events is a difficult problem, which automated trace visualizers and analyzers aim to solve. Trace analysis and visualization software needs fast access to data, which it cannot achieve by searching through the entire trace for every query. A number of solutions have adopted stateful analysis, which rearranges events into more query-friendly structures after a single pass through the trace. In this paper, we look into current implementations and model the behavior of previous work, the State History Tree (SHT), on traces with many thread creations and deletions. This allows us to identify which properties of the SHT are responsible for inefficient disk usage and high memory consumption. We then propose a more efficient data structure, the enhanced State History Tree (eSHT), to store and query computed states, in order to limit disk usage and reduce the query time for any state. Next, we compare the use of SHT and eSHT on traces with many attributes. Finally, we verify the scalability of our new data structure with respect to trace size. As shown by our results, the proposed solution makes near-optimal use of disk space, reduces the algorithm's memory usage logarithmically for previously problematic cases, and speeds up queries on traces with many attributes by an order of magnitude. The proposed solution builds upon our previous work, enabling it to easily scale up to traces containing a million threads.

Department:	Département de génie informatique et génie logiciel
Funding agencies:	CRSNG/NSERC
Grant number:	CRDPJ468687-14
PolyPublie URL:	https://publications.polymtl.ca/2994/
Conference name:	IEEE International Congress on Big Data (BigData Congress 2016)
Conference location:	San Francisco, CA, USA
Conference date(s):	2016-06-27 - 2016-07-02
Publisher:	IEEE
DOI:	10.1109/bigdatacongress.2016.19
Official URL:	https://doi.org/10.1109/bigdatacongress.2016.19
Deposit date:	13 Feb. 2018 12:51
Last modified:	02 Feb. 2026 17:58

Cite in APA 7:	Prieur-Drevon, L., Beamonte, R., Ezzati-Jivan, N., & Dagenais, M. (2016, June). Enhanced state history tree (eSHT): a stateful data structure for analysis of highly parallel system traces [Paper presentation]. IEEE International Congress on Big Data (BigData Congress 2016), San Francisco, CA, USA (8 pages). https://doi.org/10.1109/bigdatacongress.2016.19
