Bibliography
Bell P, Suarez K, Fossum B, Chapp D, Bhowmick S, Taufer M. A Research-Based Course Module to Study Non-determinism in High Performance Applications.IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW). (2022). doi: 10.1109/IPDPSW55747.2022.00067.
Bell P, Suarez K, Chapp D, Tan N, Bhowmick S, Taufer M. ANACIN-X: A Software Framework for Studying Non-determinism in MPI Applications.Software impacts.9 Oct. 2021, doi: 10.1016/j.simpa.2021.100151. URL: Link
D. Chapp, N. Tan, S. Bhowmick and M. Taufer. Identifying Degree and Sources of Non-Determinism in MPI Applications Via Graph Kernels, IEEE Transactions on Parallel and Distributed Systems, vol. 32, no. 12, pp. 2936-2952, 1 Dec. 2021, doi:10.1109/TPDS.2021.3081530 Link
D. Chapp, K. Sato, D. Ahn, and M. Taufer. Record-and-Replay Techniques for HPC Systems: A survey. Journal of Supercomputing Frontiers and Innovations, 5(1):11-30,. (2018). Link
D. Chapp, D. Rorabaugh, K. Sato, D. Ahn, and M. Taufer. A Three-phase Workflow for General and Expressive Representations of Nondeterminism in HPC Applications. International Journal of High-Performance Computing Applications(IJHPCA), 1175-1184 (2019) Link
D. Chapp, T. Johnston, and M. Taufer. On the Need for Reproducible Numerical Accuracy through Intelligent Runtime Selection of Reduction Algorithms at the Extreme Scale. In Proceedings of IEEE Cluster Conference, pp. 166 – 175. Chicago, Illinois, USA. September 8 – 11, 2015. Link