refactor: change metric evaluator name adapt to eval_at #1570

JoanFM · 2020-12-30T19:08:06Z

No description provided.

github-actions · 2020-12-30T19:22:29Z

Latency summary

Current PR yields:

😶 index QPS at 1566, delta to last 3 avg.: +1%
😶 query QPS at 26, delta to last 3 avg.: -1%

Breakdown

Version	Index QPS	Query QPS
current	1566	26
`0.8.22`	1517	26
`0.8.21`	1555	26
`0.8.20`	1549	26

Backed by latency-tracking. Further commits will update this comment.

florian-hoenicke

line 87 in test_ranking_evaluate_driver has to be changed to
assert doc.evaluations[0].op_name == 'Precision@2'

Also for some reason the function test_ranking_evaluate_driver is there twice. We can remove one of them.

JoanFM · 2020-12-31T10:35:39Z

8000

line 87 in test_ranking_evaluate_driver has to be changed to
assert doc.evaluations[0].op_name == 'Precision@2'

Also for some reason the function test_ranking_evaluate_driver is there twice. We can remove one of them.

Is it exactly the same? if it is exactly the same yes we can remove it, otherwise we rename one.

I saw, I want to change to

assert doc.evaluations[0].op_name == executor.metric

Would u have time to do these couple of changes on this branch?

florian-hoenicke · 2020-12-31T10:36:18Z

sure - I do it

JoanFM · 2020-12-31T10:52:24Z

tests/unit/drivers/test_rankingevaluation_driver.py

                                 ground_truth_pairs):
    ruuningavg_rank_evaluate_driver.attach(executor=PrecisionEvaluator(eval_at=2), pea=None)
    ruuningavg_rank_evaluate_driver._apply_all(ground_truth_pairs)
    for pair in ground_truth_pairs:
        doc = pair.doc
        assert len(doc.evaluations) == 1
-        assert doc.evaluations[0].op_name == 'Precision@N'
+        assert doc.evaluations[0].op_name == 'Precision@2'


can u change this to have

driver.attach(executor=executor, pea=None) assert driver.op_name == executor.metric

Naming might be wrong, but like this the responsibility of testing the actual name is tested in the executor while the driver tests that the name is taken from executor

JoanFM

I like it

JoanFM · 2020-12-31T11:44:47Z

@florian-hoenicke if u can, can u try running jina hello-world and see if the evaluation results are properly shown in the html resulting?

codecov · 2020-12-31T12:07:44Z

Codecov Report

Merging #1570 (f7da519) into master (35139e9) will increase coverage by 0.89%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master    #1570      +/-   ##
==========================================
+ Coverage   83.44%   84.34%   +0.89%     
==========================================
  Files         128      128              
  Lines        6646     6705      +59     
==========================================
+ Hits         5546     5655     +109     
+ Misses       1100     1050      -50

Impacted Files	Coverage Δ
jina/executors/evaluators/rank/precision.py	`100.00% <ø> (ø)`
jina/executors/evaluators/rank/recall.py	`100.00% <ø> (ø)`
jina/drivers/evaluate.py	`98.27% <100.00%> (+0.06%)`	⬆️
jina/logging/sse.py	`91.42% <0.00%> (-0.76%)`	⬇️
jina/logging/profile.py	`69.84% <0.00%> (-0.56%)`	⬇️
jina/executors/decorators.py	`91.11% <0.00%> (-0.17%)`	⬇️
jina/drivers/craft.py	`100.00% <0.00%> (ø)`
jina/types/ndarray/generic.py	`100.00% <0.00%> (ø)`
jina/drivers/encode.py	`94.91% <0.00%> (+0.08%)`	⬆️
jina/enums.py	`96.59% <0.00%> (+0.09%)`	⬆️
... and 15 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 35139e9...ca7aa40. Read the comment docs.

florian-hoenicke · 2020-12-31T14:20:00Z

As a result I get:
Precision@50: 65.47% Recall@50: 0.55%
The precision looks really low to me.
From what I see in the results, it should be much higher.

JoanFM · 2020-12-31T14:21:11Z

As a result I get:
Precision@50: 65.47% Recall@50: 0.55%
The precision looks really low to me.
From what I see in the results, it should be much higher.

thats okey I just want to check the name

hanxiao

I removed this intentionally. To me it is redundant as we already have self.name for all executor. Can we just use self.name instead of creating a new property.

I suggest remove redundant metric property and use:

op.name = f'{self.exec.name}@{self.eval_at}'

JoanFM · 2020-12-31T14:37:49Z

I removed this intentionally. To me it is redundant as we already have self.name for all executor. Can we just use self.name instead of creating a new property.

I suggest remove redundant metric property and use:
op.name = f'{self.exec.name}@{self.eval_at}'

it was to mantain alignment with hub but yes we can do this. Just wanted to align all

JoanFM · 2020-12-31T14:49:23Z

Will do as @hanxiao suggests

florian-hoenicke · 2020-12-31T15:09:58Z

I created a new pr for the two tests having the same method name.
https://github.com/jina-ai/jina/pull/1573/files

…ric-name-eval

nan-wang

LGTM👍

nan-wang · 2021-01-05T07:21:24Z

The unit test failed
https://github.com/jina-ai/jina/pull/1570/checks?check_run_id=1645486983#step:6:2024

…-eval

JoanFM · 2021-01-05T07:31:48Z

The unit test failed
https://github.com/jina-ai/jina/pull/1570/checks?check_run_id=1645486983#step:6:2024

Thanks for pointing it out

nan-wang

LGTM👍

refactor: change metric evaluator name adapt to eval_at

1f54693

JoanFM requested a review from a team as a code owner December 30, 2020 19:08

JoanFM requested review from maximilianwerk and rutujasurve94 December 30, 2020 19:08

jina-bot added size/S area/core This issue/PR affects the core codebase area/testing This issue/PR affects testing component/executor labels Dec 30, 2020

florian-hoenicke requested changes Dec 31, 2020

View reviewed changes

fix: test rankingevaluation_driver

a528188

JoanFM commented Dec 31, 2020

View reviewed changes

fix: test metric name eval

0a0680a

JoanFM commented Dec 31, 2020

View reviewed changes

hanxiao requested changes Dec 31, 2020

View reviewed changes

JoanFM closed this Dec 31, 2020

Merge branch 'master' of https://github.com/jina-ai/jina into fix-met…

bc48ea9

…ric-name-eval

JoanFM reopened this Jan 2, 2021

jina-bot added the component/driver label Jan 2, 2021

JoanFM mentioned this pull request Jan 2, 2021

refactor: remove metric from evaluators jina-ai/jina-hub#1682

Merged

fix: remove metric from evaluators

426cba6

JoanFM force-pushed the fix-metric-name-eval branch from 95c50e2 to 426cba6 Compare January 2, 2021 17:25

JoanFM requested review from florian-hoenicke and hanxiao January 2, 2021 18:34

nan-wang previously approved these changes Jan 4, 2021

View reviewed changes

fix: remove metric from mocks

32100f4

JoanFM dismissed nan-wang’s stale review via 32100f4 January 4, 2021 07:10

Merge branch 'master' of github.com:jina-ai/jina into fix-metric-name…

ca7aa40

…-eval

JoanFM force-pushed the fix-metric-name-eval branch from 6db2f84 to ca7aa40 Compare January 5, 2021 07:30

nan-wang approved these changes Jan 6, 2021

View reviewed changes

nan-wang merged commit 7200c65 into master Jan 6, 2021

nan-wang deleted the fix-metric-name-eval branch January 6, 2021 03:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

refactor: change metric evaluator name adapt to eval_at #1570

refactor: change metric evaluator name adapt to eval_at #1570

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

refactor: change metric evaluator name adapt to eval_at #1570

refactor: change metric evaluator name adapt to eval_at #1570

Uh oh!

Conversation

Uh oh!

Uh oh!

Latency summary

Breakdown

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!