8000 Initial support for Metal Performance Shaders (MPS) by dameikle · Pull Request #98 · eole-nlp/eole · GitHub

Initial support for Metal Performance Shaders (MPS) #98


Merged
merged 6 commits into from
Oct 2, 2024

Conversation

dameikle
Contributor
@dameikle dameikle commented Sep 7, 2024

Makes adjustments to add initial support for Apple Silicon Metal Performance Shaders (MPS), providing a base for further optimisation.

This has been tested on basic training and inference on both M1 and M3 Macs. It has also been used for inference of larger production grade models that have been trained on CUDA.
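For context, MPS support in PyTorch-based code usually comes down to device selection. A minimal sketch of the pattern (the helper name is illustrative, not eole's actual wiring):

```python
def pick_device() -> str:
    """Prefer MPS on Apple Silicon, then CUDA, then CPU (illustrative helper)."""
    try:
        import torch
    except ImportError:
        return "cpu"
    # torch.backends.mps exists from PyTorch 1.12 onwards; guard for older builds.
    mps = getattr(torch.backends, "mps", None)
    if mps is not None and mps.is_available():
        return "mps"
    if torch.cuda.is_available():
        return "cuda"
    return "cpu"
```

Models and tensors are then moved with `.to(pick_device())`; ops not yet implemented on MPS can fall back to CPU by setting the `PYTORCH_ENABLE_MPS_FALLBACK=1` environment variable.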


@francoishernandez
Member

Thanks a lot @dameikle ! Quite curious to see what we can achieve on Apple silicon chips.
I'll try and test this some time this week.

@francoishernandez francoishernandez added the enhancement New feature or request label Sep 16, 2024
@francoishernandez
Member

@dameikle, I managed to run this fine on an M3 Macbook Air 👌

A few questions:

  1. What kind of environment do you use on your mac(s)? I had a few issues setting things up (especially with some requirements not liking python3.12; I had to set up python3.10). If you have any tips to make things easier for other users, it might be good to create a new entry somewhere in the docs (e.g. in the FAQ sections).
  2. Did you have to make some adaptations to the recipe(s) you tested? It seems like some of the bash commands are not all supported on default macOS (or my shell is wonky, possibly).
  3. You mention "inference of larger production grade models". Could you provide some details on this? E.g. a minimal benchmark of model size / performance (speed, memory, etc.) would be very useful information, especially if we plan on "further optimisation"!

@dameikle
Contributor Author
dameikle commented Sep 23, 2024

Thanks for taking a look @francoishernandez.

What kind of environment do you use on your mac(s)? I had a few issues setting things up (especially with some requirements not liking python3.12; I had to set up python3.10). If you have any tips to make things easier for other users, it might be good to create a new entry somewhere in the docs (e.g. in the FAQ sections).

I normally use virtual envs created from a Homebrew-installed Python to avoid clashes with the macOS system version. I can put together some documentation.
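A sketch of that setup (the Homebrew package and Python version are assumptions; adjust to your machine):

```shell
# Optional on macOS: install a Homebrew Python so the system Python stays untouched.
# brew install python@3.10
python3 -m venv .venv          # create an isolated environment
. .venv/bin/activate           # activate it for this shell
python -m pip --version        # confirm pip now comes from the venv
```

From there, installing eole's requirements inside the venv keeps them isolated from anything Apple ships.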

Did you have to make some adaptations to the recipe(s) you tested? It seems like some of the bash commands are not all supported on default macOS (or my shell is wonky, possibly).

I was mainly doing things by hand, as well as using other models I'd trained elsewhere, so I wasn't trying the recipes out, to be honest. Can have a wee look at them and see if we can make them work across both.

You mention "inference of larger production grade models". Could you provide some details on this? E.g. a minimal benchmark of model size / performance (speed, memory, etc.) would be very useful information, especially if we plan on "further optimisation"!

Sounds like a plan. Will pull information together.
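A minimal wall-clock harness for such a benchmark might look like this (the function name is illustrative, not part of eole):

```python
import time


def mean_seconds(fn, warmup=2, iters=10):
    """Average wall-clock seconds per call of fn, after warmup runs (illustrative)."""
    for _ in range(warmup):
        fn()
    start = time.perf_counter()
    for _ in range(iters):
        fn()
    return (time.perf_counter() - start) / iters
```

One caveat when timing MPS (as with CUDA): kernels launch asynchronously, so call `torch.mps.synchronize()` inside the timed callable before reading the clock, otherwise the numbers undercount actual GPU time.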

@francoishernandez
Member

Thanks for the updates @dameikle!
Can we merge or do you have pending topics to be pushed?
(Docs can be updated separately when you'll find time to put it together.)

@dameikle
Contributor Author

@francoishernandez - I think it's good to merge now. I was starting to work on the docs and taking a look at the recipes last night, but like you say, I can do that in a separate PR.

@francoishernandez francoishernandez merged commit a8cfb0d into eole-nlp:main Oct 2, 2024
2 checks passed