FEAT: Benchmark suit for serving by wxiwnd · Pull Request #1618 · xorbitsai/inference · GitHub

FEAT: Benchmark suit for serving #1618

Closed
@wxiwnd (Contributor) commented Jun 12, 2024

Adds a new benchmark suite for measuring online serving throughput.
Features:

  • Infinite benchmark with a single benchmark config
  • Queued benchmark for multiple benchmark configs
  • Single benchmark for one benchmark config
  • CLI
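
The three modes listed above could be structured roughly as follows. This is only a sketch and is not taken from the PR's code: the function names `run_single`, `run_queued`, and `run_infinite`, the `runner` callback, and the `max_rounds` cap are all hypothetical, and it assumes "queued" means running configs one after another and "infinite" means repeating one config until stopped.

```python
import itertools
from typing import Callable, Iterable, Iterator, Optional

# Hypothetical types: a config is a plain dict, and a runner executes one
# benchmark pass for a config and returns a dict of measurements.
Config = dict
Runner = Callable[[Config], dict]


def run_single(config: Config, runner: Runner) -> dict:
    """Single benchmark: one pass over one config."""
    return runner(config)


def run_queued(configs: Iterable[Config], runner: Runner) -> list:
    """Queued benchmark (assumed semantics): run each config in order."""
    return [runner(config) for config in configs]


def run_infinite(
    config: Config, runner: Runner, max_rounds: Optional[int] = None
) -> Iterator[dict]:
    """Infinite benchmark (assumed semantics): repeat one config until the
    caller stops iterating. max_rounds is an illustrative safety cap."""
    rounds = itertools.count() if max_rounds is None else range(max_rounds)
    for _ in rounds:
        yield runner(config)
```

Writing the infinite mode as a generator leaves the stopping decision to the caller, for example a CLI that stops on Ctrl-C.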

Configuration file example:

{
    "host": "127.0.0.1",
    "port": 9997,
    "dataset": "./benchmark/dataset/ShareGPT_V3_unfiltered_cleaned_split.json",
    "trust_remote_code": true,
    "seed": 1,
    "num_prompt": 400,
    "concurrency": 200
}
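
A config of this shape could be loaded and checked before a run along the following lines. This is a minimal sketch, not the PR's implementation: the `BenchmarkConfig` dataclass and `parse_config` function are hypothetical names, the field types are inferred from the example values, and the range checks are assumptions.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class BenchmarkConfig:
    """Hypothetical typed view of the JSON config shown above."""
    host: str
    port: int
    dataset: str
    trust_remote_code: bool
    seed: int
    num_prompt: int
    concurrency: int


def parse_config(text: str) -> BenchmarkConfig:
    """Parse JSON text into a BenchmarkConfig.

    Missing or unknown keys raise TypeError from the dataclass constructor.
    The range checks below are illustrative assumptions.
    """
    config = BenchmarkConfig(**json.loads(text))
    if not 0 < config.port < 65536:
        raise ValueError(f"port out of range: {config.port}")
    if config.num_prompt <= 0 or config.concurrency <= 0:
        raise ValueError("num_prompt and concurrency must be positive")
    return config
```

A caller would read the file contents and pass them in, e.g. `parse_config(open(path).read())`, where `path` is whatever the CLI receives.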

@XprobeBot XprobeBot added this to the v0.12.1 milestone Jun 12, 2024
@wxiwnd wxiwnd changed the title FEAT: Auto Benchmark for serving FEAT: Benchmark suit for serving Jun 13, 2024
@XprobeBot XprobeBot modified the milestones: v0.12.1, v0.12.2 Jun 14, 2024
@wxiwnd wxiwnd marked this pull request as draft June 19, 2024 08:00
@XprobeBot XprobeBot modified the milestones: v0.12.2, v0.12.4, v0.13.0, v0.13.1 Jun 28, 2024
@XprobeBot XprobeBot modified the milestones: v0.13.1, v0.13.2 Jul 12, 2024
@wxiwnd wxiwnd closed this Jul 19, 2024