BLAS options: OpenBLAS vs Accelerate · Issue #71712 · pytorch/pytorch · GitHub

BLAS options: OpenBLAS vs Accelerate #71712


Closed
ngam opened this issue Jan 24, 2022 · 11 comments
Labels
- module: performance (issues related to performance, either of kernel code or framework glue)
- module: third_party
- needs research (we need to decide whether or not this merits inclusion, based on research)
- triaged (this issue has been looked at by a team member, and triaged and prioritized into an appropriate module)

Comments

@ngam commented Jan 24, 2022

🚀 The feature, motivation and pitch

Are there any benchmarks or a preference among developers here? Assuming Intel macOS users should use MKL (MKL isn't available for Apple Silicon), is there any benefit to using OpenBLAS versus Accelerate? Any documentation or benchmarks? Thanks!

Alternatives

No response

Additional context

No response

cc @VitalyFedyunin @ngimel

@vadimkantorov (Contributor) commented Jan 24, 2022

also, BLIS might be an option... flame/blis#492 although this BLIS seems was not much used with pytorch...

@malfet added the module: performance, module: third_party, and needs research labels on Jan 24, 2022
@malfet (Contributor) commented Jan 24, 2022

OpenBLAS and Accelerate should have the same API, but I'm not aware of any good benchmark of one vs another. But one should be able to recompile with different BLAS frameworks

@ngam (Author) commented Jan 24, 2022

> OpenBLAS and Accelerate should have the same API, but I'm not aware of any good benchmark of one vs another. But one should be able to recompile with different BLAS frameworks

Yes, I can confirm this. For what it's worth, the default mechanism (i.e., no user BLAS preference flag) in CMake seems to go something like this: it looks for MKL first, then BLIS, then Accelerate, and finally, I think, OpenBLAS. I understand the importance of putting MKL first (it seems to outperform everything else in this context). However, I am slightly confused about OpenBLAS vis-à-vis Accelerate (and maybe BLIS) from a few comments I gathered around here, mainly this one: #68812 (comment). Perhaps @IvanYashchuk can weigh in?

Details of cmake compilation process:
  -- Trying to find preferred BLAS backend of choice: MKL
-- MKL_THREADING = OMP
-- Looking for sys/types.h
-- Looking for sys/types.h - found
-- Looking for stdint.h
-- Looking for stdint.h - found
-- Looking for stddef.h
-- Looking for stddef.h - found
-- Check size of void*
-- Check size of void* - done
-- MKL_THREADING = OMP
CMake Warning at cmake/Dependencies.cmake:177 (message):
  MKL could not be found.  Defaulting to Eigen
Call Stack (most recent call first):
  CMakeLists.txt:653 (include)


CMake Warning at cmake/Dependencies.cmake:205 (message):
  Preferred BLAS (MKL) cannot be found, now searching for a general BLAS
  library
Call Stack (most recent call first):
  CMakeLists.txt:653 (include)


-- MKL_THREADING = OMP
-- Checking for [mkl_intel_lp64 - mkl_intel_thread - mkl_core - iomp5 - pthread - m]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_intel_thread - mkl_core - iomp5 - pthread - m]
--   Library mkl_intel: not found
-- Checking for [mkl_intel_lp64 - mkl_intel_thread - mkl_core - guide - pthread - m]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_intel_thread - mkl_core - guide - pthread - m]
--   Library mkl_intel: not found
-- Checking for [mkl_intel_lp64 - mkl_intel_thread - mkl_core - pthread - m]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_intel_thread - mkl_core - pthread - m]
--   Library mkl_intel: not found
-- Checking for [mkl_intel_lp64 - mkl_sequential - mkl_core - m]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_sequential - mkl_core - m]
--   Library mkl_intel: not found
-- Checking for [mkl_intel_lp64 - mkl_core - iomp5 - pthread - m]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_core - iomp5 - pthread - m]
--   Library mkl_intel: not found
-- Checking for [mkl_intel_lp64 - mkl_core - guide - pthread - m]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_core - guide - pthread - m]
--   Library mkl_intel: not found
-- Checking for [mkl_intel_lp64 - mkl_core - pthread - m]
--   Library mkl_intel_lp64: not found
-- Checking for [mkl_intel - mkl_core - pthread - m]
--   Library mkl_intel: not found
-- Checking for [mkl - guide - pthread - m]
--   Library mkl: not found
-- MKL library not found
-- Checking for [blis]
--   Library blis: BLAS_blis_LIBRARY-NOTFOUND
-- Checking for [Accelerate]
--   Library Accelerate: /opt/MacOSX11.0.sdk/System/Library/Frameworks/Accelerate.framework
-- Looking for sgemm_
-- Looking for sgemm_ - found
-- Found a library with BLAS API (accelerate). Full path: (/opt/MacOSX11.0.sdk/System/Library/Frameworks/Accelerate.framework)

To expand: Because PyTorch necessarily bundles BLAS together with LAPACK, and the LAPACK routines included in Accelerate have been reported to be buggy/unreliable, the choice could make a difference. We could potentially try to unbundle BLAS and LAPACK here, i.e., select BLAS from Accelerate while taking LAPACK from another provider as available. This would obviously be more work for a very niche optimization; hence, I would like to see whether we can establish any firm benchmark showing that Accelerate is indeed worthwhile here, and then we could work on unbundling BLAS and LAPACK.

@ngam (Author) commented Jan 24, 2022

> also, BLIS might be an option (flame/blis#492), although BLIS does not seem to have seen much use with PyTorch...

Yes, thanks @vadimkantorov. The default mechanism appears to favor BLIS over Accelerate, as shown above.

@IvanYashchuk (Collaborator) commented

Here are the FindBLAS.cmake and FindLAPACK.cmake files that PyTorch uses. There's no way to specify the BLAS variant; CMake tries to find one in a specific order (specified in FindBLAS.cmake), but it's possible to compile PyTorch with Accelerate if CMake doesn't find anything with higher priority.

FindLAPACK.cmake disallows mixing BLAS & LAPACK from different providers because it's fragile.

@ngimel (Collaborator) commented Jan 24, 2022

cc @robieta

@ngam (Author) commented Jan 24, 2022

> Here's the FindBLAS.cmake and FindLAPACK.cmake files that PyTorch uses. There's no way to specify the BLAS variant and CMake tries to find it in a specific order (specified in FindBLAS.cmake), but it's possible to compile PyTorch with Accelerate if CMake doesn't find anything with higher priority

No! There is (which is actually good, so thank you for the flexibility!). You can specify BLAS=OpenBLAS to force it to compile with OpenBLAS (if available).

See more here: conda-forge/pytorch-cpu-feedstock#84 (comment)
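For reference, a minimal sketch of forcing the backend from a build script. The BLAS environment variable is the one PyTorch's CMake machinery consults; the exact build invocation (commented out below) depends on your checkout, so treat this as illustrative only:

```python
# Sketch: select PyTorch's BLAS backend at build time by setting
# the BLAS environment variable before invoking the source build.
# The build step itself is commented out; "OpenBLAS" could be
# swapped for another supported backend (e.g. MKL, Eigen).
import os
import subprocess

env = dict(os.environ, BLAS="OpenBLAS")
# subprocess.run(["python", "setup.py", "develop"], env=env, check=True)
print("building with BLAS =", env["BLAS"])
```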

> FindLAPACK.cmake disallows mixing BLAS & LAPACK from different providers because it's fragile

Good to know the reason, thanks!

@ngam (Author) commented Jan 24, 2022

Also, I am happy to run some benchmarks and tests if you can point me to meaningful ones for this particular case. I already have both OpenBLAS-based and Accelerate-based PyTorch builds ready (and reproducible; I can also add BLIS and/or other BLAS libraries to test). I'm also happy to help if there is interest in clarifying this further :)

Note: I believe this whole question is moot outside of Apple Silicon Macs at the moment. MKL BLAS/LAPACK should still be used whenever available, imo, but as far as I can tell it is not available on Apple Silicon Macs.
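To make the offer concrete, here is a minimal single-precision GEMM timing sketch. NumPy is used purely as a stand-in for whichever BLAS-backed build is under test (a torch matmul on float32 tensors would be the analogous PyTorch measurement); the matrix size and repetition count are arbitrary choices:

```python
# Sketch: time an n x n float32 matrix multiply (sgemm) and report
# best-of-reps throughput. Run the same script under builds linked
# against different BLAS libraries to compare them.
import time
import numpy as np  # stand-in for the BLAS-backed library under test

def bench_sgemm(n=1024, reps=5):
    """Return best-of-reps GFLOP/s for an n x n single-precision matmul."""
    rng = np.random.default_rng(0)
    a = rng.standard_normal((n, n), dtype=np.float32)
    b = rng.standard_normal((n, n), dtype=np.float32)
    best = float("inf")
    for _ in range(reps):
        t0 = time.perf_counter()
        a @ b
        best = min(best, time.perf_counter() - t0)
    return 2.0 * n**3 / best / 1e9  # a GEMM costs ~2*n^3 flops

if __name__ == "__main__":
    print(f"sgemm 1024x1024: {bench_sgemm():.1f} GFLOP/s")
```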

@IvanYashchuk (Collaborator) commented

> You can specify BLAS=OpenBLAS and you will force it to compile with OpenBLAS (if available)

Great that it was fixed; if I recall correctly, previously it only affected Caffe2 and not ATen (for example #60328).

> FindLAPACK.cmake disallows mixing BLAS & LAPACK from different providers because it's fragile

Reference LAPACK requires BLAS, and if LAPACK is built from source against the BLAS from Accelerate, then I guess there shouldn't be any problems.

@anjali411 added the triaged label on Jan 25, 2022
@ngam (Author) commented Feb 5, 2022

@robieta any thoughts?

@ngam (Author) commented Feb 9, 2022

From my limited testing, there is little value in choosing one over the other; they end up performing rather similarly. Closing this. Thanks everyone for engaging, and good luck :)

@ngam ngam closed this as completed Feb 9, 2022