docs: indexers benchmark #4003

cristianmtr · 2021-11-26T16:11:56Z

TODO

- Find the right place in the docs, maybe on the same page as Indexers. Then wait for "docs: refactor indexers page" #4002 to be merged.
- Include a table with results.
- Include plots. As an iframe? Can we include HTML files somehow?

github-actions · 2021-11-26T16:20:08Z

Latency summary

Current PR yields:

🐢🐢 index QPS at 1195, delta to last 2 avg.: -11%
🐢🐢 query QPS at 22, delta to last 2 avg.: -14%
🐢🐢 dam extend QPS at 34515, delta to last 2 avg.: -13%
🐢🐢 avg flow time within 1.1538 seconds, delta to last 2 avg.: +1%
😶 import jina within 0.4128 seconds, delta to last 2 avg.: +3%

Breakdown

| Version | Index QPS | Query QPS | DAM Extend QPS | Avg Flow Time (s) | Import Time (s) |
|---|---|---|---|---|---|
| current | 1195 | 22 | 34515 | 1.1538 | 0.4128 |
| `2.5.0` | 1544 | 30 | 47720 | 1.0776 | 0.3579 |
| `2.4.10` | 1158 | 20 | 31807 | 1.2048 | 0.441 |

Backed by latency-tracking. Further commits will update this comment.
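
For readers who want to check the "delta to last 2 avg." figures, here is a minimal sketch that recomputes them from the Breakdown table. The latency-tracking script itself is not shown in this thread, so the formula (relative change against the mean of the two previous releases) and the name `delta_to_last_avg` are assumptions. Because the table shows rounded values, the recomputed numbers can differ slightly from the summary: for query QPS this gives about -12% rather than the reported -14%.

```python
from statistics import mean

# Values copied from the Breakdown table: current run, then 2.5.0 and 2.4.10.
history = {
    "index QPS": (1195, [1544, 1158]),
    "query QPS": (22, [30, 20]),
    "dam extend QPS": (34515, [47720, 31807]),
    "avg flow time (s)": (1.1538, [1.0776, 1.2048]),
    "import time (s)": (0.4128, [0.3579, 0.441]),
}


def delta_to_last_avg(current, previous):
    """Relative change of `current` against the mean of `previous`, in percent.

    Assumed formula; the real latency-tracking code may differ.
    """
    baseline = mean(previous)
    return (current - baseline) / baseline * 100


deltas = {
    name: delta_to_last_avg(current, previous)
    for name, (current, previous) in history.items()
}

for name, delta in deltas.items():
    print(f"{name}: {delta:+.1f}%")
```

For the QPS metrics a negative delta is a slowdown, while for the two timing metrics a positive delta is the slowdown.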

hanxiao · 2021-11-30T08:17:12Z

let me restructure the TOC a bit

hanxiao · 2021-11-30T08:21:16Z

nested

davidbp

When you talk about 1k query vectors, it is not clear whether this is done as a batch (one function call that processes 1k query vectors) or whether query vectors are processed one at a time. If I were a user, I would probably like to know:

- What's the expected time between calling `.search` and getting a result for a single query?
- What query throughput can I expect using X resources in a given time window?

The document seems to answer the first question, since it states: "Then we search with the respective search set, using a batch size of `1`, to mimic single query operations." The second question is not as relevant but is still interesting for users with really high traffic.
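
To make the distinction between the two questions concrete, here is a minimal sketch of how both numbers could be measured. It is not the benchmark code from this PR: `search` is a hypothetical stand-in (a pure-Python brute-force nearest-neighbour lookup), and the index size, dimensionality, and query count are arbitrary. Only the `batch_size` argument changes between the latency run and the throughput run.

```python
import random
import time


def nearest(index, query):
    """Position of the indexed vector closest to `query` (squared Euclidean distance)."""
    return min(
        range(len(index)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(index[i], query)),
    )


def search(index, queries):
    """Hypothetical stand-in for an indexer's search call: one result per query."""
    return [nearest(index, q) for q in queries]


def benchmark(index, queries, batch_size):
    """Return (mean seconds per search call, queries per second)."""
    call_times = []
    for start in range(0, len(queries), batch_size):
        batch = queries[start:start + batch_size]
        t0 = time.perf_counter()
        search(index, batch)
        call_times.append(time.perf_counter() - t0)
    total = sum(call_times)
    return total / len(call_times), len(queries) / total


rng = random.Random(0)
index = [[rng.random() for _ in range(8)] for _ in range(500)]
queries = [[rng.random() for _ in range(8)] for _ in range(100)]

# Batch size 1 mimics single-query operations (first question).
latency, _ = benchmark(index, queries, batch_size=1)
# One call with all queries measures bulk throughput (second question).
_, throughput = benchmark(index, queries, batch_size=len(queries))

print(f"single-query latency: {latency * 1e3:.3f} ms")
print(f"batched throughput:   {throughput:.0f} queries/s")
```

In this toy version batching gives no real speedup, since each query is still scanned independently; a real indexer may vectorize or parallelize a batch, which is why the two measurements are worth reporting separately.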

maximilianwerk

I like it so far. Obviously the results data is needed.

In the final version, I'd put the results at the top and the methodology after them. Most views will be for the results, so they should be at the top and the most easily accessible.

docs/advanced/experimental/indexers-benchmark.md

cristianmtr · 2021-11-30T11:25:14Z

@maximilianwerk moved results to top.

@hanxiao @davidbp added results now too. Can you check again?

cristianmtr · 2021-11-30T12:17:37Z

Not sure why the deployment doesn't include the subpage. Locally it is there.

davidbp

I see now the benchmarks.

cristianmtr · 2021-11-30T16:32:44Z

> I see now the benchmarks.

In the netlify deployment? I still don't. Tried a hard refresh but nothing.

maximilianwerk · 2021-12-01T07:59:42Z

> > I see now the benchmarks.
>
> In the netlify deployment? I still don't. Tried a hard refresh but nothing.

me neither. weird.

maximilianwerk

I find it confusing, that the x-axis has varying scales in all the plots. This makes it much harder to get fast conclusions. Having a changing y-axis is OK. Opinions?

cristianmtr · 2021-12-01T13:07:48Z

> I find it confusing, that the x-axis has varying scales in all the plots. This makes it much harder to get fast conclusions. Having a changing y-axis is OK. Opinions?

I'd need to recreate them and the code from @Hippopotamus0308 is not yet in the benchmarks repo. We can either wait and recreate them with fixed axes, or merge it as it is now.

Hippopotamus0308 · 2021-12-01T13:32:21Z

> > I find it confusing, that the x-axis has varying scales in all the plots. This makes it much harder to get fast conclusions. Having a changing y-axis is OK. Opinions?
>
> I'd need to recreate them and the code from @Hippopotamus0308 is not yet in the benchmarks repo. We can either wait and recreate them with fixed axes, or merge it as it is now.

@cristianmtr I've added it to the PR at https://github.com/jina-ai/benchmark_indexers/pull/33. Please check it.

cristianmtr · 2021-12-01T13:54:01Z

@maximilianwerk

I tried to render them with fixed x_range but they are not all readable

ex.

maximilianwerk · 2021-12-01T13:54:55Z

> @maximilianwerk
>
> I tried to render them with fixed x_range but they are not all readable
>
> ex.

OK, then don't do that and keep them as they are :)

docs/advanced/experimental/indexers-benchmark.md

maximilianwerk

apart from this and the precision/recall, I am good with merging.

docs/advanced/experimental/indexers-benchmark.md

maximilianwerk

LGTM

github-actions · 2021-12-01T14:21:26Z

📝 Docs are deployed on https://docs-indexers-benchmark--jina-docs.netlify.app 🎉

docs: indexers benchmark

ae5f060

github-actions bot added the size/S and area/docs labels Nov 26, 2021

cristianmtr requested a review from hanxiao November 26, 2021 16:12

cristianmtr assigned maximilianwerk and unassigned maximilianwerk Nov 26, 2021

cristianmtr requested review from maximilianwerk and tadejsv November 26, 2021 16:12

docs(indexer): move benchmark under indexer main chapter

c478f59

davidbp reviewed Nov 30, 2021

maximilianwerk reviewed Nov 30, 2021

docs: add plots and tables

6497a63

github-actions bot added the size/XL label Nov 30, 2021

cristianmtr requested review from davidbp and maximilianwerk November 30, 2021 11:23

docs: move bench results to top

cf3f7c4

cristianmtr marked this pull request as ready for review November 30, 2021 11:29

Merge branch 'master' into docs-indexers-benchmark

3df8dd5

davidbp previously approved these changes Nov 30, 2021

docs: fix ordering of sentences

eeef268

cristianmtr dismissed davidbp’s stale review via eeef268 December 1, 2021 08:46

maximilianwerk reviewed Dec 1, 2021

cristianmtr requested a review from maximilianwerk December 1, 2021 13:07

maximilianwerk reviewed Dec 1, 2021

maximilianwerk reviewed Dec 1, 2021

docs: remove precision and clarify ann

ff0239e

cristianmtr requested review from maximilianwerk and davidbp December 1, 2021 14:13

maximilianwerk approved these changes Dec 1, 2021

cristianmtr merged commit faaacf1 into master Dec 1, 2021

cristianmtr deleted the docs-indexers-benchmark branch December 1, 2021 14:21
