Publications

(* denotes equal contributions, # denotes the corresponding author.)


SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference
Jintao Zhang, Chendong Xiang, Haofeng Huang, Haocheng Xi, Jia Wei, Jun Zhu, Jianfei Chen
Arxiv 2025
| paper | code |


SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-thread INT4 Quantization
Jintao Zhang, Haofeng Huang, Pengle Zhang, Jia Wei, Jun Zhu, Jianfei Chen
Arxiv 2024
| paper | code |


SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration
Jintao Zhang, Jia Wei, Pengle Zhang, Jun Zhu, Jianfei Chen
ICLR 2025 (TH-CPL-A, Research track, Full paper)
| paper | code |


SAGE: A Framework of Precise Retrieval for RAG
Jintao Zhang, Guoliang Li, Jinyang Su
ICDE 2025 (CCF-A, Research track, Full paper)
| paper | code |


PACE: Poisoning Attacks on Learned Cardinality Estimation
Jintao Zhang, Guoliang Li, Chao Zhang, Chengliang Chai
SIGMOD 2024 (CCF-A, Research track, Full paper)
| paper | code |


AutoCE: An Accurate and Efficient Model Advisor for Learned Cardinality Estimation
Jintao Zhang, Chao Zhang, Guoliang Li, Chengliang Chai
ICDE 2023 (CCF-A, Research track, Full paper)
| paper | code |


Learned Cardinality Estimation: A Design Space Exploration and A Comparative Evaluation
Ji Sun*, Jintao Zhang*, Zhaoyan Sun, Nan Tang, Guoliang Li
VLDB 2022 (CCF-A, Research track, Full paper)
| paper | code |


Accurate INT8 Training Through Dynamic Block-Level Fallback
Pengle Zhang, Jia Wei, Jintao Zhang, Jun Zhu, Jianfei Chen
Arxiv 2025
| paper | code |


Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
Haocheng Xi, Shuo Yang, Yilong Zhao, Chenfeng Xu, Muyang Li, Xiuyu Li, Yujun Lin, Han Cai, Jintao Zhang, Dacheng Li, Jianfei Chen, Ion Stoica, Kurt Keutzer, Song Han
Arxiv 2025
| paper | code |


Identifying Sensitive Weights via Post-quantization Integral
Yuezhou Hu, Weiyu Huang, Zichen Liang, Chang Chen, Jintao Zhang, Jun Zhu, Jianfei Chen
Arxiv 2025
| paper | code |


HTAP Databases: A Survey
Chao Zhang, Guoliang Li, Jintao Zhang, Xinning Zhang, Jianhua Feng
TKDE (CCF-A, Journal)
| paper | code |


Survey of Key Techniques of HTAP Databases.
Chao Zhang, Guoliang Li, Jianhua Feng, Jintao Zhang
Journal of Software
| paper | code |