8000 [Graph Partition] use pinned memory and foreach when moving cpu scalar tensor to gpu · Issue #155360 · pytorch/pytorch · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
[Graph Partition] use pinned memory and foreach when moving cpu scalar tensor to gpu #155360
Open
@BoyuanFeng

Description

@BoyuanFeng

Graph partition automatically moves cpu scalar tensors to gpu when possible (#154464). It's better to use pin memory and copy with non_blocking. This depends on #155121. More context in this issue.

cc @voznesenskym @penguinwu @EikanWang @jgong5 @Guobing-Chen @XiaobingSuper @zhuhaozhe @blzheng @wenzhe-nrv @jiayisunx @ipiszy @chenyang78 @kadeng @muchulee8 @amjames @chauhang @aakhundov

Metadata

Metadata

Assignees

Labels

module: inductoroncall: pt2triagedThis issue has been looked at a team member, and triaged and prioritized into an appropriate module

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions

    0