
NPU Acceleration #505

Open
sikhness opened this issue May 31, 2024 · 4 comments
Labels
🔖 ADO (Has corresponding ADO item) · AI/ML (Artificial intelligence & machine learning development) · enhancement (New feature or request) · perf (Speed, efficiency, optimization concerns)

Comments

@sikhness

Currently, only a subset of devices can be passed through to Windows Containers, with GPUs being one of them (and even then limited to DirectX-based frameworks).
With the rise of NPUs/IPUs built into processors, it would be beneficial to provide NPU acceleration in Windows Containers so that we can containerize our AI/ML workloads.
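For context, here is a minimal sketch of how the existing GPU pass-through path looks today with process isolation, using the display-adapter device interface class GUID documented for Windows containers (the image name is only an example); an equivalent path for the NPU device class is essentially what this request is asking for:

```powershell
# Existing GPU pass-through: expose the DirectX display-adapter device interface
# class to a process-isolated Windows container. The GUID is
# GUID_DEVINTERFACE_DISPLAY_ADAPTER, as documented for Windows containers.
docker run --isolation process `
    --device "class/5B45201D-F2F2-4F3B-85BB-30FF1F953599" `
    mcr.microsoft.com/windows/servercore:ltsc2022 cmd
```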

@sikhness added the enhancement (New feature or request) and triage (New and needs attention) labels on May 31, 2024
@ntrappe-msft added the AI/ML (Artificial intelligence & machine learning development) and perf (Speed, efficiency, optimization concerns) labels on Jun 3, 2024
@fady-azmy-msft
Contributor

Hey @sikhness, similar question to your other issue. Can you help me understand what sort of workloads you are trying to run with NPU acceleration? Understanding this use case will help us better prioritize this request as we explore AI/ML workloads.

@sikhness
Author

sikhness commented Jun 5, 2024

Hey @fady-azmy-msft!
Similar to my other question, I listed a few AI-related workloads there that would benefit from GPU acceleration via vendor-specific graphics APIs.

Some of those same AI workloads can now also benefit from offloading work to the NPU. Ryzen AI is one example: it provides instructions on how to install, prepare, and run AI models on the NPU on Windows. It would be very beneficial to be able to containerize these applications for isolation and portability while still leveraging the hardware.

@fady-azmy-msft removed the triage (New and needs attention) label on Jun 19, 2024
@fady-azmy-msft
Contributor

Got it. Tagging @NAWhitehead to look into this. He's driving the Windows containers GPU scenarios, and this is related.

@doctorpangloss

I think you should get the class GUID for "Neural processors", try passing it as `--device class/<the GUID>`, copy the drivers from the DriverStore FileRepository into the container, and then see if the NPU works. Odds are low, but crazier things have been true.
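An untested sketch of that idea (the FriendlyName filter and the class GUID are placeholders to verify on the host, and the image name is hypothetical):

```powershell
# 1. On the host, find which device class the NPU is registered under
#    (e.g. "Neural processors"; names vary by vendor, so adjust the filter).
Get-PnpDevice -PresentOnly |
    Where-Object { $_.FriendlyName -match 'NPU|Neural' } |
    Select-Object FriendlyName, Class, ClassGuid

# 2. Pass that device class through to a process-isolated container, replacing
#    <npu-class-guid> with the GUID found above. The matching driver package from
#    C:\Windows\System32\DriverStore\FileRepository would also need to be copied
#    into the image for the device to be usable.
docker run --isolation process --device "class/<npu-class-guid>" <your-image>
```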

@ntrappe-msft added the 🔖 ADO (Has corresponding ADO item) label on Sep 9, 2024