Conference Proceedings


Proceedings title: High Performance Computing
Conference name: 37th International Conference on High Performance Computing (ISC High Performance 2022)
Chinese title: 《第三十七届国际高性能计算会议》
Conference dates: May 29 - June 2, 2022
Conference location: Hamburg, Germany
Publication year: 2022
Holding number: 343340


Title | Authors | Year
Accelerating MPI All-to-All Communication with Online Compression on Modern GPU Clusters | Qinghua Zhou; Pouya Kousha; Quentin Anthony; Kawthar Shafie Khorassani; Aamir Shafi; Hari Subramoni; Dhabaleswar K. Panda | 2022
NVIDIA's Quantum InfiniBand Network Congestion Control Technology and Its Impact on Application Performance | Yuval Shpigelman; Gilad Shainer; Richard Graham; Yong Qin; Gerardo Cisneros-Stoianowski; Craig Stunkel | 2022
LLM: Realizing Low-Latency Memory by Exploiting Embedded Silicon Photonics for Irregular Workloads | Marjan Fariborz; Mahyar Samani; Pouya Fotouhi; Roberto Proietti; Il-Min Yi; Venkatesh Akella; Jason Lowe-Power; Samuel Palermo; S. J. Ben Yoo | 2022
SU3_Bench on a Programmable Integrated Unified Memory Architecture (PIUMA) and How that Differs from Standard NUMA CPUs | Jesmin Jahan Tithi; Fabio Checconi; Douglas Doerfler; Fabrizio Petrini | 2022
"Hey CAI" - Conversational AI Enabled User Interface for HPC Tools | Pouya Kousha; Arpan Jain; Ayyappa Kolli; Prasanna Sainath; Hari Subramoni; Aamir Shafi; Dhabaleswar K. Panda | 2022
Hy-Fi: Hybrid Five-Dimensional Parallel DNN Training on High-Performance GPU Clusters | Arpan Jain; Aamir Shafi; Quentin Anthony; Pouya Kousha; Hari Subramoni; Dhabaleswar K. Panda | 2022
Efficient Application of Hanging-Node Constraints for Matrix-Free High-Order FEM Computations on CPU and GPU | Peter Munch; Karl Ljungkvist; Martin Kronbichler | 2022
Dynamic Task Fusion for a Block-Structured Finite Volume Solver over a Dynamically Adaptive Mesh with Local Time Stepping | Baojiu Li; Holger Schulz; Tobias Weinzierl; Han Zhang | 2022
Accelerating Simulated Quantum Annealing with GPU and Tensor Cores | Yi-Hua Chung; Cheng-Jhih Shih; Shih-Hao Hung | 2022
m-CUBES: An Efficient and Portable Implementation of Multi-dimensional Integration for GPUs | Ioannis Sakiotis; Kamesh Arumugam; Marc Paterno; Desh Ranjan; Balsa Terzic; Mohammad Zubair | 2022
Comparative Evaluation of Call Graph Generation by Profiling Tools | Onur Cankur; Abhinav Bhatele | 2022
MAPredict: Static Analysis Driven Memory Access Prediction Framework for Modern CPUs | Mohammad Alaul Haque Monil; Seyong Lee; Jeffrey S. Vetter; Allen D. Malony | 2022
Rapid Execution Time Estimation for Heterogeneous Memory Systems Through Differential Tracing | Nicolas Denoyelle; Swann Perarnau; Kamil Iskra; Balazs Gerofi | 2022
Understanding Distributed Deep Learning Performance by Correlating HPC and Machine Learning Measurements | Ana Luisa Veroneze Solorzano; Lucas Mello Schnorr | 2022
A Motivating Case Study on Code Variant Selection by Reinforcement Learning | Oliver Hacker; Matthias Korch; Johannes Seiferth | 2022
Remote OpenMP Offloading | Atmn Patel; Johannes Doerfert | 2022
Hybrid Parallel ILU Preconditioner in Linear Solver Library GaspiLS | Raju Ram; Daniel Grunewald; Nicolas R. Gauger | 2022
A Subset of the CERN Virtual Machine File System: Fast Delivering of Complex Software Stacks for Supercomputing Resources | Alexandre F. Boyer; Christophe Haen; Federico Stagni; David R. C. Hill | 2022