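Classification and Summary
Improves the execution efficiency of scientific-computing or AI kernels through compilation, tuning, and program search.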
Evidence excerpts: the schedule search process. Instead of applying the complex learned cost model to all explored candidates, Pruner [...] || ∗Both authors contributed equally to this research. †Also with Laoshan Laboratory, Qingdao, China. ‡Corresponding author. || [...]rs for their constructive comments. This work was supported by the Strategic Priority Research Program of Chinese Academy of Sciences (Grant No. XDB0500102), Laoshan Laboratory (No. LSKJ202300305).
Citation
Qiao, Liang; Shi, Jun; Hao, Xiaoyu; Fang, Xi; Zhang, Sen; Zhao, Minfan; Zhu, Ziqi; Chen, Junshi; An, Hong; Tang, Xulong; Li, Bing; Yuan, Honghui; Wang, Xinyang. Pruner: A Draft-then-Verify Exploration Mechanism to Accelerate Tensor Program Tuning. In Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2 (ASPLOS ’25), March 30-April 3, 2025, Rotterdam, Netherlands. ACM, New York, NY, USA, 949-965. (CCF A)
@article{acsa2025_17,
title = {Pruner: A Draft-then-Verify Exploration Mechanism to Accelerate Tensor Program Tuning},
year = {2025},
doi = {10.1145/3676641.3716269}
}

| title | Pruner: A Draft-then-Verify Exploration Mechanism to Accelerate Tensor Program Tuning |
|---|---|
| title_zh | To be added |
| abstract | To be added |
| abstract_zh | To be added |
| keywords | Compiler, Degree Accreditation A, CCF A |
| year | 2025 |
| published_date | To be added |
| online_date | To be added |
| paper_type | Conference |
| publication_status | Published |
| volume | 2 |
| issue | To be added |
| pages | 949-965 |
| article_number | To be added |
| publisher | ACM |
| doi | 10.1145/3676641.3716269 |
| research_area | Compilers and Program Optimization |
| tags | Compiler, Degree Accreditation A, CCF A |
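| category | Compilers and Program Optimization |
| summary | Improves the execution efficiency of scientific-computing or AI kernels through compilation, tuning, and program search. |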
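| authors | Qiao, Liang; Shi, Jun; Hao, Xiaoyu; Fang, Xi; Zhang, Sen; Zhao, Minfan; Zhu, Ziqi; Chen, Junshi; An, Hong; Tang, Xulong; Li, Bing; Yuan, Honghui; Wang, Xinyang |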
| corresponding_authors | To be added |
| affiliations | To be added |
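| funding | Laoshan Laboratory project (No. LSKJ202300305); Strategic Priority Research Program of Chinese Academy of Sciences (Grant No. XDB0500102) |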