I am interested in developing tools and methods that aim at debugging and optimizing HPC applications.
I have developed PARCOACH, a static/dynamic tool to detect collective errors in parallel applications. The static part identifies the reduced set of collective communications that may eventually lead to potential deadlock situations, and issues warnings. Using this analysis, a selective instrumentation of the code is then achieved, displaying an error, synchronously interrupting all processes, if the schedule leads to a deadlock situation.
PARCOACH is implemented as a LLVM pass and is still under development.
- Journées nationales du GDR GPL, Grenoble June 12-15, 2018 Vérification des applications MPI par une anayse statique/dynamique - Verification of MPI applications using a static/dynamic analysis (Talk in french)
- Journée LaHMA, Paris December 13, 2018 Analyse statique/dynamique pour la vérification des applications parallèles - Static/Dynamic Analysis for the verification of parallel applications (Talk in french)