Hi! We've received your issue; please be patient while waiting for a response. We will arrange for technicians to answer your questions as soon as possible. Please make sure you have posted enough information to demonstrate your request. You may also check the API, FAQ, GitHub Issues, and AI community for an answer. Have a nice day!
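(This issue is a task issue for the PaddlePaddle Hackathon; see PaddlePaddle Hackathon for more details.)
【Task Description】
Task title: Use CUDA Graph to accelerate training in Paddle
Technical tags: deep learning framework, CUDA Graph
Difficulty: Hard
Detailed description: CUDA Graph speeds up kernel scheduling by predefining static subgraphs, which in turn speeds up model training. Standalone performance tests show a significant speedup in kernel scheduling. The goal of this task is to bring CUDA Graph into Paddle to speed up kernel scheduling and thereby model training. It can be split into two sub-goals:
(1) Define and implement a Python-level CUDA Graph API that lets users build and execute subgraphs, including but not limited to
(2) Make CUDA Graph usable automatically inside the framework in some form, so that the framework builds and executes subgraphs on its own.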
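To make the capture-then-replay idea behind sub-goal (1) concrete, here is a minimal pure-Python sketch. It is a toy model only: `ToyGraph`, `capture`, `launch`, and `replay` are hypothetical names invented for illustration, not Paddle or CUDA APIs, and plain Python callables stand in for GPU kernels. It assumes the usual CUDA Graph usage pattern, where work issued during capture is recorded rather than run, and inputs are updated in place in fixed buffers before each replay.

```python
from contextlib import contextmanager
from typing import Callable, Dict, List


class ToyGraph:
    """Hypothetical stand-in for a CUDA Graph handle (no GPU involved)."""

    def __init__(self) -> None:
        self._ops: List[Callable[[], None]] = []
        self._capturing = False

    @contextmanager
    def capture(self):
        # While capturing, launched ops are recorded instead of executed,
        # mirroring how stream capture defers kernels into a graph.
        self._ops.clear()
        self._capturing = True
        try:
            yield self
        finally:
            self._capturing = False

    def launch(self, op: Callable[[], None]) -> None:
        # Record the op during capture; otherwise run it eagerly.
        if self._capturing:
            self._ops.append(op)
        else:
            op()

    def replay(self) -> None:
        # Re-run the recorded sequence without re-issuing each launch.
        if self._capturing:
            raise RuntimeError("cannot replay while capture is in progress")
        for op in self._ops:
            op()


# Static buffers: the recorded ops always read and write these same slots.
buf: Dict[str, float] = {"x": 0.0, "y": 0.0}


def scale() -> None:
    buf["y"] = buf["x"] * 2.0


def shift() -> None:
    buf["y"] = buf["y"] + 1.0


graph = ToyGraph()
with graph.capture():
    graph.launch(scale)
    graph.launch(shift)

results = []
for x in (1.0, 2.0, 3.0):
    buf["x"] = x      # update the input in place, then replay the graph
    graph.replay()
    results.append(buf["y"])

print(results)
```

In a real implementation the recorded ops would be GPU kernels captured on a stream, and replay would be a single graph launch; the sketch only shows the shape a user-facing API might take.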
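【Deliverables】
Task proposal
Task PR to Paddle
Related technical documentation
Unit test files for the task
【Technical Requirements】
Proficiency in C++
Familiarity with the technical background of CUDA Graph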