Paper 2024/1030

GRASP: Accelerating Hash-based PQC Performance on GPU Parallel Architecture

Yijing Ning, University of Science and Technology of China
Jiankuo Dong, Nanjing University of Posts and Telecommunications
Jingqiang Lin, University of Science and Technology of China
Fangyu Zheng, University of Chinese Academy of Sciences
Yu Fu, University of Science and Technology of China
Zhenjiang Dong, Nanjing University of Posts and Telecommunications
Fu Xiao, Nanjing University of Posts and Telecommunications
Abstract

$SPHINCS^+$, one of the Post-Quantum Cryptography Digital Signature Algorithms (PQC-DSA) selected by NIST in the third round, features very short public and private key lengths but faces significant performance challenges compared to other post-quantum cryptographic schemes, limiting its suitability for real-world applications. To address these challenges, we propose the GPU-based paRallel Accelerated $SPHINCS^+$ (GRASP), which leverages GPU technology to enhance the efficiency of $SPHINCS^+$ signing and verification processes. We propose an adaptable parallelization strategy for $SPHINCS^+$, analyzing its signing and verification processes to identify critical sections for efficient parallel execution. Utilizing CUDA, we perform bottom-up optimizations, focusing on memory access patterns and hypertree computation, to enhance GPU resource utilization. These efforts, combined with kernel fusion technology, result in significant improvements in throughput and overall performance. Extensive experimentation demonstrates that our optimized CUDA implementation of $SPHINCS^+$ achieves superior performance. Specifically, our GRASP scheme delivers throughput improvements ranging from 1.37× to 3.45× compared to state-of-the-art GPU-based solutions and surpasses the NIST reference implementation by over three orders of magnitude, highlighting a significant performance advantage.

Metadata
Available format(s)
PDF
Category
Implementation
Publication info
Preprint.
Keywords
PQChash-based digital signatureSPHINCS+GPUCUDA
Contact author(s)
truegeorge @ mail ustc edu cn
djiankuo @ njupt edu cn
linjq @ ustc edu cn
zhengfangyu @ ucas ac cn
fuyu22 @ mail ustc edu cn
dongzhenjiang @ njupt edu cn
xiaof @ njupt edu cn
History
2025-01-16: revised
2024-06-26: received
See all versions
Short URL
https://ia.cr/2024/1030
License
Creative Commons Attribution
CC BY

BibTeX

@misc{cryptoeprint:2024/1030,
      author = {Yijing Ning and Jiankuo Dong and Jingqiang Lin and Fangyu Zheng and Yu Fu and Zhenjiang Dong and Fu Xiao},
      title = {{GRASP}: Accelerating Hash-based {PQC} Performance on {GPU} Parallel Architecture},
      howpublished = {Cryptology {ePrint} Archive, Paper 2024/1030},
      year = {2024},
      url = {https://eprint.iacr.org/2024/1030}
}
Note: In order to protect the privacy of readers, eprint.iacr.org does not use cookies or embedded third party content.