[FIX] limit double precision to 17 digits during write #4636

jpfeuffer · 2020-04-09T11:17:26Z

ghost · 2020-04-09T11:17:39Z

Congratulations 🎉. DeepCode analyzed your code in 1.169 seconds and we found no issues. Enjoy a moment of no bugs ☀️.

👉 View analysis in DeepCode’s Dashboard

timosachsenberg · 2020-04-09T11:22:11Z

src/openms/include/OpenMS/DATASTRUCTURES/StringUtils.h

@@ -60,7 +60,9 @@ namespace OpenMS
    class BK_PrecPolicy : public boost::spirit::karma::real_policies<T>
    {
    public:
-      static unsigned int precision(T) { return writtenDigits<T>(T()); }
+        static unsigned int precision(T n) {


bracket in new line

add comment why this is sensible to do

oliveralka · 2020-04-09T11:42:07Z

@jpfeuffer I probably test if that fixes the issue before merging or?

jpfeuffer · 2020-04-09T11:52:18Z

Sure. If you have workspace ready where the test data is, it should be quick.

I wonder how many tests fail or if FuzzyDiff always catches the mini differences.

timosachsenberg · 2020-04-09T13:10:36Z

in1: /home/travis/build/OpenMS/OpenMS/_build/src/tests/class_tests/openms/AccurateMassSearchEngine_test_319.tmp (line: 13, position/column: 14/36)
!
36310.1992

in2: /home/travis/build/OpenMS/OpenMS/src/tests/class_tests/openms/data/AccurateMassSearchEngine_output1.featureXML (line: 13, position/column: 14/36)
!
36310.199219

jpfeuffer · 2020-04-09T13:59:27Z

Yes, so the problem is, we used our writtenDigits() function before which returns std::numeric_limits::digits10, which is (in my opinion) not what you want if you want to read
back in serialized doubles and guarantee that they will be the same.
I am happy about input here. https://www.boost.org/doc/libs/1_72_0/libs/multiprecision/doc/html/boost_multiprecision/tut/limits/constants.html gives a good overview of those two values.

jpfeuffer · 2020-04-09T14:05:47Z

36310.1992

this is enough to represent the underlying single-precision float 36310.19921875 uniquely when reading it back in.

timosachsenberg · 2020-04-09T14:11:21Z

src/openms/include/OpenMS/DATASTRUCTURES/StringUtils.h

+          // -> if scientific, subtract one
+          return BK_PrecPolicy<T>::floatfield(n) ?
+            std::numeric_limits<T>::max_digits10 - (floor(log10(n)) + 1) : 
+            std::numeric_limits<T>::max_digits10 - 1;


why subtract one?
From the boost docu:
The extra two or three least-significant digits are 'noisy' and may be junk, but if you want to 'round-trip' - printing a value out as a decimal digit string and reading it back in - (most commonly during serialization and de-serialization) you must use os.precision(std::numeric_limits::max_digits10)

for scientifc notation only.

oliveralka · 2020-04-09T14:32:07Z

Let's see what the others think.

Tested locally in addition:

	 39 - DataValue_test (Failed)
	 62 - String_test (Failed)
	173 - MascotInfile_test (Failed)
	182 - MzMLFile_test (Failed)
	187 - MzTab_test (Failed)
	196 - ParamXMLFile_test (Failed)
	205 - SVOutStream_test (Failed)
	263 - DataFilters_test (Failed)
	388 - AccurateMassSearchEngine_test (Failed)
	442 - ItraqConstants_test (Failed)
	2221 - TOPP_OpenPepXL_1_out_2 (Failed)
	2226 - TOPP_OpenPepXLLF_1_out_2 (Failed)
	2286 - TOPP_FidoAdapter_1 (Failed)
	2287 - TOPP_FidoAdapter_1_out (Failed)
	2288 - TOPP_FidoAdapter_2 (Failed)
	2289 - TOPP_FidoAdapter_2_out (Failed)
	2290 - TOPP_FidoAdapter_3 (Failed)
	2291 - TOPP_FidoAdapter_3_out (Failed)
	2292 - TOPP_FidoAdapter_4 (Failed)
	2293 - TOPP_FidoAdapter_4_out (Failed)
	2294 - TOPP_FidoAdapter_5 (Failed)
	2295 - TOPP_FidoAdapter_5_out (Failed)
	2296 - TOPP_FidoAdapter_6 (Failed)
	2297 - TOPP_FidoAdapter_6_out (Failed)

jpfeuffer · 2020-04-09T15:13:09Z

So, me and @timosachsenberg figured that we:
a) need an absolute value of the number first
b) that this formula will lead to longer strings for small values that do not yet undergo scientific writing mode (e.g. 0.001). Here it would ADD 4 to account for the first significant digit starting at position 4. Leading to an output of size 21, making it unreadable by the Java library.

So the solution to b) would be to switch to scientific mode starting from < 0.1

timosachsenberg · 2020-04-09T15:26:04Z

if abs(N) < 0.1

… normalization instead

timosachsenberg · 2020-04-13T11:47:32Z

did I get it right that the scientific notation has a fixed number of digits? And that lower number of written digits now works with luciphor?

jpfeuffer · 2020-04-13T11:52:18Z

We set the precision to writtenDigits(), which is digits10 which is 15 for double.
Before, we allowed numbers from 0.0001 to 99999 to be written in fixed notation. Since boost karma
does not care about significant digits you still get an additional 15 digits after. 5+15 = 20 = too much.
If you switch to scientific starting at 10000 you get 1e05 plus the fractional part, so in total 16 digits.

[FIX] limit double precision to 17 digits during write

5b5df37

jpfeuffer requested a review from cbielow April 9, 2020 11:21

timosachsenberg reviewed Apr 9, 2020

View reviewed changes

little fix

b3c9772

jpfeuffer force-pushed the fix/doublePrecisionOutput branch from 2019e80 to b3c9772 Compare April 10, 2020 11:13

jpfeuffer added 2 commits April 11, 2020 09:03

revert only real solution due to boost bug and speed concerns, extend…

44c402d

… normalization instead

fix tests

cf019b9

timosachsenberg merged commit 366944c into develop Apr 13, 2020

timosachsenberg deleted the fix/doublePrecisionOutput branch April 13, 2020 12:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[FIX] limit double precision to 17 digits during write #4636

[FIX] limit double precision to 17 digits during write #4636

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

[FIX] limit double precision to 17 digits during write #4636

[FIX] limit double precision to 17 digits during write #4636

Uh oh!

Conversation

Uh oh!

Uh oh!

Uh oh!

👉 View analysis in DeepCode’s Dashboard

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!