Add streaming support for zero shot inference by arnavgarg1 · Pull Request #3878 · ludwig-ai/ludwig · GitHub
Add streaming support for zero shot inference #3878


Merged

@arnavgarg1 merged 5 commits into master from streaming_generation on Jan 11, 2024

Conversation

@arnavgarg1 (Contributor) commented Jan 11, 2024

This PR introduces a new boolean flag, streaming, to the LudwigModel.generate() API, which allows users to see output streamed back as it is generated when performing zero-shot inference on single or multiple samples.

Demo

[Video attachment: Screen.Recording.2024-01-11.at.7.43.05.AM.mov]

Config to reproduce the demo video

import yaml
import logging
from ludwig.api import LudwigModel

config = yaml.safe_load(
    """
model_type: llm
base_model: meta-llama/Llama-2-7b-chat-hf

quantization:
  bits: 4

input_features:
  - name: instruction
    type: text

output_features:
  - name: output
    type: text

generation:
  max_new_tokens: 64
  temperature: 0.1

trainer:
  type: none

backend:
  type: local
"""
)

model = LudwigModel(config, logging_level=logging.INFO)

# Single sample - Normal generation
output = model.generate("What is the meaning of life?", generation_config={"max_new_tokens": 32})

# Single sample - Streaming generation
output = model.generate("What is the meaning of life?", generation_config={"max_new_tokens": 32}, streaming=True)

# Multi sample - Normal generation
output = model.generate(["What is the meaning of life?", "What is the weather like in Germany around December?"], generation_config={"max_new_tokens": 64})

# Multi sample - Streaming generation
output = model.generate(["What is the meaning of life?", "What is the weather like in Germany around December?"], generation_config={"max_new_tokens": 64}, streaming=True)
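
For readers curious about the mechanism behind streaming generation, here is a minimal sketch using Hugging Face transformers' TextIteratorStreamer, the standard way to stream tokens out of a blocking generate() call. Whether Ludwig's streaming=True path wraps this exact class internally is an assumption here; the sketch is illustrative, not Ludwig's actual implementation.

```python
# Minimal sketch of token streaming with Hugging Face transformers.
# Assumption: Ludwig's streaming=True path wraps something like
# TextIteratorStreamer; this is illustrative, not Ludwig's code.
from threading import Thread

from transformers import AutoModelForCausalLM, AutoTokenizer, TextIteratorStreamer

model_name = "meta-llama/Llama-2-7b-chat-hf"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

inputs = tokenizer("What is the meaning of life?", return_tensors="pt")

# skip_prompt=True yields only newly generated tokens, not the echoed prompt.
streamer = TextIteratorStreamer(tokenizer, skip_prompt=True)

# generate() blocks until completion, so it runs on a worker thread while
# the main thread consumes decoded text chunks from the streamer iterator.
thread = Thread(
    target=model.generate,
    kwargs=dict(**inputs, streamer=streamer, max_new_tokens=32),
)
thread.start()

for text_chunk in streamer:
    print(text_chunk, end="", flush=True)
thread.join()
```

The key design point is the producer/consumer split: the model thread pushes decoded token text into the streamer's internal queue as each token is sampled, and the caller iterates over the queue, which is what makes output appear incrementally rather than all at once.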

@justinxzhao (Contributor) left a comment

Very cool!

github-actions bot commented Jan 11, 2024

Unit Test Results

6 files (±0) · 6 suites (±0) · 14m 10s ⏱️ (-5s)
12 tests (±0): 9 ✔️ passed (±0), 3 💤 skipped (±0), 0 failed (±0)
60 runs (±0): 42 ✔️ passed (±0), 18 💤 skipped (±0), 0 failed (±0)

Results for commit caa8039. ± Comparison against base commit 22024d7.

♻️ This comment has been updated with latest results.

@arnavgarg1 changed the title from "Add support for zero shot streaming generation" to "Add streaming support for zero shot inference" on Jan 11, 2024
@justinxzhao (Contributor) left a comment

LGTM

@alexsherstinsky (Collaborator) left a comment

@arnavgarg1 This is great -- I just made some minor comments (please let me know whether or not they will help). Thank you!

@arnavgarg1 (Contributor, Author) commented

@alexsherstinsky I think the interface I had defined was poor! Can you take a look now and see whether it makes sense and your comments are addressed?

@alexsherstinsky (Collaborator) left a comment

LGTM -- so nice!

@arnavgarg1 merged commit 926c37e into master on Jan 11, 2024
@arnavgarg1 deleted the streaming_generation branch on January 11, 2024 at 21:51
vijayi1 pushed a commit to vijayi1/ludwig that referenced this pull request Jan 23, 2024