[SYCL][FE][Driver] Implement floating point accuracy control #8280

zahiraam · 2023-02-09T19:25:51Z

This patch implements the accuracy controls for floating-point math functions in DPC++. Using the -ffp-accuracy command line option, the user can request an accuracy level for all math functions or for specific ones. Calls to fpbuiltin intrinsics llvm.fpbuilin.* are then generated.

Syntax:

Linux: -ffp-accuracy=[default|value][:funclist]
Windows: /Qfp-accuracy:[default|value][:funclist]

funclist is an optional comma separated list of math library functions.

-ffp-accuracy=[default|value]
default: Use the implementation defined accuracy for all math library functions.
This is equivalent to not using this option.
value: Use the defined standard accuracy for what each accuracy value
means for all math library functions.

-ffp-accuracy=[default|value][:funclist]

default: Use the implementation defined accuracy for the math library functions in funclist.
This is equivalent to not using this option.
value: Use the defined standard accuracy for what each accuracy value
means for the math library functions in funclist.

value is one of the following values denoting the library function accuracy.

high This is equivalent to max-error = 1.0.
medium This is equivalent to max-error = 4.
low This is equivalent to accuracy-bits = 11 for single-precision functions.
accuracy-bits = 26 for double-precision functions.
sycl Determined by the OpenCL specification for math function accuracy: https://registry.khronos.org/OpenCL/specs/3.0-unified/html/OpenCL_C.html#relative-error-as-ulps
cuda Determined by standard https://docs.nvidia.com/cuda/cuda-c-programming-guide/#mathematical-functions-appendix

This is a draft patch mostly to check that the use of TargetLibraryInfo can be used from the FE. Andy let me know if that's what you were thingking about?

clang/include/clang/Driver/Options.td

andykaylor · 2023-02-09T19:42:02Z

clang/lib/CodeGen/CGBuiltin.cpp

+    // on the function (IntrinsicID) and the command line accuracy (FPAccuracyIntrinsicID)
+    // FPAccuracyVal = TLI.getFPAccuracy(F->getName(), FPAccuracyIntrinsicID)
+    // auto *AccuracyMDS = MDString::get(CGF.Builder.getContext(), FPAccuracyVal);
+    return CGF.Builder.CreateCall(F, {Src0, AccuracyMD});


I know the mock-up I shared used a metadata argument with a string like you have hear, but I ended up switching that to represent the accuracy as a numeric value using a callsite attribute. See PR #8134.

Yes, I knew that was temporary. I was shooting for this IR:

call float @llvm.experimental.fpaccuracy.cos.f32(float %0, "float ulp-value")

but we need to have a way to calculate the ulp error. From you message in Team and comments below, it looks like the FE needs to compute this ulp error without going through the TargetLibraryInfoImp? So, it would be a new function depending on the library function name and the fp-accuracy option given in the command line (or FPA_default if no option is used)?

Do you mean fpbuiltin-max-error? This attribute still needs the ulp error value.

So, this is the IR you are expecting?
call float @llvm.experimental.fpaccuracy.cos.f32(float %0) #10
attributes #0 = { "fpbuiltin-max-error"="2.5" }

If yes, the FE needs to generate a new attribute to mark the function and calculate the ulp error value (the way I describe above).

I think the mapping still needs to come from the backend because we'll want other languages (like Fortran) to be able to use the same functionality. I'll think about this and try to come up with a proposal. If you want to put a placeholder function in the front end for now, that's fine.

andykaylor · 2023-02-09T19:44:28Z

clang/lib/CodeGen/CGBuiltin.cpp

+  if (FPAccuracyIntrinsicID != Intrinsic::not_intrinsic) {
+    LangOptions::FPAccuracyKind FPAccuracy = CGF.getLangOpts().getFPAccuracy();
+    // Enter this part of the condition only when TLI.isFPACCuracyAvailable is
+    // true, implying FPAccuary != LangOptions::FPA_Default.


I don't think this assumption is correct. It's possible that we'll have a back-end that supports multiple accuracy implementations but we want to use the default accuracy anyway. In fact, that's probably going to be the most common case.

clang/lib/CodeGen/CGBuiltin.cpp

…max-error attribute.

Added a map to LangOption that will map the function in the function list of command line to its accuracy.

mdtoguchi · 2023-04-18T14:40:59Z

clang/include/clang/Basic/DiagnosticDriverKinds.td

+def warn_function_fp_accuray_already_set : Warning <"FP accuracy value of '%0' has already "
+  "been assigned to 
8000
function '%1'">;
+def warn_all_fp_accuray_already_set : Warning <"FP accuracy value of '%0' has already "
+  "been assigned to all functions in the program">;


Should there be a way to disable these diagnostics?

Why would we want to disable them?

It is an informative diagnostic, but typically warnings have a way to be disabled.

I can create the warning in a new group:
def FPAccuracyWrongValue : DiagGroup<"wrong-value-of-fp-accuracy">;
and then use the -Wno-wrong-value-of-fp-accuracy on the command line to disable the warning. Is that what you mean?

Yup, that's what I mean. Maybe: "fp-accuracy-value" would be sufficient here for the name.

clang/lib/Driver/ToolChains/Clang.cpp

zahiraam · 2023-04-18T16:05:11Z

@mdtoguchi Thanks for taking a look at it.

clang/include/clang/Basic/DiagnosticDriverKinds.td

llvm/lib/IR/CMakeLists.txt

asudarsa · 2023-06-12T17:29:40Z

clang/lib/CodeGen/CGBuiltin.cpp

+  llvm::AttributeList AttrList;
+  // sincos() doesn't return a value, but it still has a type associated with
+  // it that corresponds to the operand type.
+  CGF.CGM.getFPAccuracyFuncAttributes(


Why do we need a AttrList here? Is there a scenario where more than one such attribute can be attached to a builtin call?

Thanks

But there is a call to getFPAccuracyFuncAttributes which in tun will call getDefaultFunctionFPAccuracyAttributes that can additional attributes to the function. So. I think a list is needed here.

Actually I think you are right that the AttrList can be a single attribute instead of a list. Added a TODO comment as we discussed offline. Thanks.

clang/lib/CodeGen/CGBuiltin.cpp

asudarsa

Minor typo. LGTM. Thanks for implementing this.

zahiraam · 2023-06-14T11:53:22Z

@intel/llvm-gatekeepers can you consider merging this please? Thanks.

dm-vodopyanov · 2023-06-14T11:55:59Z

@intel/llvm-gatekeepers can you consider merging this please? Thanks.

@zahiraam please update the caption of PR in accordance with https://github.com/intel/llvm/blob/sycl/sycl/doc/developer/ContributeToDPCPP.md#commit-message

dm-vodopyanov · 2023-06-14T12:09:42Z

Will be merged as soon as post-commit turns green.

dm-vodopyanov · 2023-06-14T13:56:16Z

Failures in post-commit:

Failed Tests (2):
  Clang :: CodeGen/fp-accuracy.c
  Clang :: Driver/fp-accuracy.c

https://github.com/intel/llvm/actions/runs/5267586087/jobs/9523035374

@zahiraam could you please address them?

) This patch implements the accuracy controls for floating-point math functions in DPC++. Using the -ffp-accuracy command line option, the user can request an accuracy level for all math functions or for specific ones. Calls to fpbuiltin intrinsics llvm.fpbuilin.* are then generated. Syntax: Linux: -ffp-accuracy=[default|value][:funclist] Windows: /Qfp-accuracy:[default|value][:funclist] funclist is an optional comma separated list of math library functions. -ffp-accuracy=[default|value] default: Use the implementation defined accuracy for all math library functions. This is equivalent to not using this option. value: Use the defined standard accuracy for what each accuracy value means for all math library functions. -ffp-accuracy=[default|value][:funclist] default: Use the implementation defined accuracy for the math library functions in funclist. This is equivalent to not using this option. value: Use the defined standard accuracy for what each accuracy value means for the math library functions in funclist. value is one of the following values denoting the library function accuracy. high This is equivalent to max-error = 1.0. medium This is equivalent to max-error = 4. low This is equivalent to accuracy-bits = 11 for single-precision functions. accuracy-bits = 26 for double-precision functions. sycl Determined by the OpenCL specification for math function accuracy: https://registry.khronos.org/OpenCL/specs/3.0-unified/html/OpenCL_C.html#relative-error-as-ulps cuda Determined by standard https://docs.nvidia.com/cuda/cuda-c-programming-guide/#mathematical-functions-appendix

aelovikov-intel · 2023-06-15T15:38:54Z

@zahiraam could you please address them?

Any updates on this? If the fix isn't ready, can we just disable the tests to fix post-commit?

zahiraam · 2023-06-15T16:17:52Z

@zahiraam could you please address them?

Any updates on this? If the fix isn't ready, can we just disable the tests to fix post-commit?

@aelovikov-intel I was able to reproduce the issue and working on it. Yes for now, we can diable the LIT tests and I will work on it. Thanks.

zahiraam · 2023-06-15T16:21:10Z

Also it looks like it's failing only on a release self-build. With the debug self-build the 2 tests are passing.

Implement floating point accuracy.

79792b3
This is a draft patch mostly to check that the use of TargetLibraryInfo can be used from the FE. Andy let me know if that's what you were thingking about?

zahiraam assigned andykaylor Feb 9, 2023

andykaylor reviewed Feb 9, 2023

View reviewed changes

zahiraam temporarily deployed to aws February 9, 2023 20:12 — with GitHub Actions Inactive

Responded to review comments and added more to the implementation.

c3029ee

zahiraam temporarily deployed to aws February 14, 2023 00:04 — with GitHub Actions Inactive

zahiraam added 2 commits February 14, 2023 16:20

Added option falt-math-library and fixed a few things with fpbuiltin-…

fa51800

…max-error attribute.

Added a cc1 option triggered by the fp-accuracy option.

b584e3d

Added a map to LangOption that will map the function in the function list of command line to its accuracy.

zahiraam temporarily deployed to aws March 2, 2023 03:44 — with GitHub Actions Inactive

zahiraam added 7 commits April 4, 2023 17:01

Merge remote-tracking branch 'remote/sycl' into FPAccuracy

ec2dfbc

Remove TargetInfoLibrary dependency.

be478d8

Fix a few things.

4c23010

Merge remote-tracking branch 'remote/sycl' into FPAccuracy

db8c415

Fix a few things.

2bcd2a8

Adding code to support the option.

bdf5dbe

Fix formatting issues and remove unused code.

d796b5f

zahiraam assigned hchilama and mdtoguchi Apr 18, 2023

Fix format issues.

6c005d8

zahiraam temporarily deployed to aws April 18, 2023 14:12 — with GitHub Actions Inactive

mdtoguchi reviewed Apr 18, 2023

View reviewed changes

clang/lib/Driver/ToolChains/Clang.cpp Outdated Show resolved Hide resolved

Respond to review.

b93d9ec

mdtoguchi reviewed Apr 18, 2023

View reviewed changes

clang/include/clang/Basic/DiagnosticDriverKinds.td Outdated Show resolved Hide resolved

clang/include/clang/Basic/DiagnosticDriverKinds.td Outdated Show resolved Hide resolved

zahiraam temporarily deployed to aws April 18, 2023 17:25 — with GitHub Actions Inactive

zahiraam added 2 commits April 18, 2023 14:05

Fix typo.

17eb30f

Add a group to the diag and fixed typo.

4ffd86f

zahiraam temporarily deployed to aws April 18, 2023 20:58 — with GitHub Actions Inactive

Fixed typo.

bd458b9

AlexeySachkov reviewed Jun 12, 2023

View reviewed changes

llvm/lib/IR/CMakeLists.txt Outdated Show resolved Hide resolved

asudarsa self-requested a review June 12, 2023 15:49

Responded to review for CmakeLists.txt.

d3d470e

asudarsa reviewed Jun 12, 2023

View reviewed changes

zahiraam temporarily deployed to aws June 12, 2023 17:41 — with GitHub Actions Inactive

Responded to review about the AttrList.

e08362e

asudarsa reviewed Jun 12, 2023

View reviewed changes

clang/lib/CodeGen/CGBuiltin.cpp Outdated Show resolved Hide resolved

asudarsa approved these changes Jun 12, 2023

View reviewed changes

Fixed typo.

6d5890d

zahiraam temporarily deployed to aws June 12, 2023 19:34 — with GitHub Actions Inactive

Merge remote-tracking branch 'remote/sycl' into FPAccuracy

7d3c976

zahiraam temporarily deployed to aws June 12, 2023 20:22 — with GitHub Actions Inactive

zahiraam temporarily deployed to aws June 13, 2023 12:05 — with GitHub Actions Inactive

Merge remote-tracking branch 'remote/sycl' into FPAccuracy

9c86522

zahiraam temporarily deployed to aws June 13, 2023 21:52 — with GitHub Actions Inactive

zahiraam temporarily deployed to aws June 13, 2023 22:31 — with GitHub Actions Inactive

zahiraam changed the title ~~Floating point accuracy control.~~ [SYCL] Floating point accuracy control. Jun 14, 2023

zahiraam changed the title ~~[SYCL] Floating point accuracy control.~~ [SYCL] Add support for floating point accuracy control. Jun 14, 2023

dm-vodopyanov changed the title ~~[SYCL] Add support for floating point accuracy control.~~ [SYCL][FE][Driver] Implement floating point accuracy control Jun 14, 2023

dm-vodopyanov merged commit 405778a into intel:sycl Jun 14, 2023

againull mentioned this pull request Jul 10, 2023

[SYCL] Split device images based on accuracy level provided in option #10140

Merged

jsji mentioned this pull request Sep 14, 2023

LLVM and SPIRV-LLVM-Translator pulldown (WW37) #11185

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SYCL][FE][Driver] Implement floating point accuracy control #8280

[SYCL][FE][Driver] Implement floating point accuracy control #8280

[SYCL][FE][Driver] Implement floating point accuracy control #8280

[SYCL][FE][Driver] Implement floating point accuracy control #8280

Conversation

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment