Tuple as device_type input to support Heterogenous Sharding of tables across different device_typestable #2600

faran928 · 2024-12-02T21:21:05Z

Summary: As we plan to support heterogenous sharding across different device types (cuda / cpu etc), we will pass device type per shard in the format of tuple for device_type_from_sharding_info where each index will represent the device_type for that particular shard

Differential Revision: D65933148

facebook-github-bot · 2024-12-02T21:21:32Z

This pull request was exported from Phabricator. Differential Revision: D65933148

… across different device_typestable (pytorch#2600) Summary: As we plan to support heterogenous sharding across different device types (cuda / cpu etc), we will pass device type per shard in the format of tuple for device_type_from_sharding_info where each index will represent the device_type for that particular shard Differential Revision: D65933148

facebook-github-bot · 2024-12-04T01:38:35Z

This pull request was exported from Phabricator. Differential Revision: D65933148

… across different device_typestable (pytorch#2600) Summary: As we plan to support heterogenous sharding across different device types (cuda / cpu etc), we will pass device type per shard in the format of tuple for device_type_from_sharding_info where each index will represent the device_type for that particular shard Differential Revision: D65933148

Summary: Unify InferRwSequenceEmbedding Modules for GPU / CPU. There does not seem to be much difference in the implementation for InferRwSequenceEmbedding and InferCPURwSequenceEmbedding. For heterogeneous sharding, we need to merge them together into one module. Also introduced the concept of device_type_from_sharding_info to propagate the correct device for output dist. Reviewed By: jiayisuse Differential Revision: D65859663

… across different device_typestable (pytorch#2600) Summary: As we plan to support heterogenous sharding across different device types (cuda / cpu etc), we will pass device type per shard in the format of tuple for device_type_from_sharding_info where each index will represent the device_type for that particular shard Differential Revision: D65933148

facebook-github-bot · 2024-12-04T02:16:01Z

This pull request was exported from Phabricator. Differential Revision: D65933148

… across different device_typestable (pytorch#2600) Summary: As we plan to support heterogenous sharding across different device types (cuda / cpu etc), we will pass device type per shard in the format of tuple for device_type_from_sharding_info where each index will represent the device_type for that particular shard Differential Revision: D65933148

facebook-github-bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 2, 2024

facebook-github-bot added the fb-exported label Dec 2, 2024

faran928 force-pushed the export-D65933148 branch from 2e3aa39 to 35d3c3a Compare December 4, 2024 01:38

Faran Ahmad added 2 commits December 3, 2024 18:14

faran928 force-pushed the export-D65933148 branch from 35d3c3a to 0bf4f59 Compare December 4, 2024 02:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Tuple as device_type input to support Heterogenous Sharding of tables across different device_typestable #2600

Tuple as device_type input to support Heterogenous Sharding of tables across different device_typestable #2600

faran928 commented Dec 2, 2024

facebook-github-bot commented Dec 2, 2024

facebook-github-bot commented Dec 4, 2024

facebook-github-bot commented Dec 4, 2024

Tuple as device_type input to support Heterogenous Sharding of tables across different device_typestable #2600

Are you sure you want to change the base?

Tuple as device_type input to support Heterogenous Sharding of tables across different device_typestable #2600

Conversation

faran928 commented Dec 2, 2024

facebook-github-bot commented Dec 2, 2024

facebook-github-bot commented Dec 4, 2024

facebook-github-bot commented Dec 4, 2024