streaming conversation api #8790
base: master
Conversation
@@ -511,7 +509,8 @@ replace (
 // Uncomment for local development for testing with changes in the components-contrib && kit repositories.
 // Don't commit with this uncommented!
 //
-// replace github.com/dapr/components-contrib => ../components-contrib
+replace github.com/dapr/components-contrib => ../components-contrib
Temporary while developing and while the components-contrib dependency is needed. I'll need to update the mod sum after that is merged.
Force-pushed from a0181aa to a6aa6d8
Force-pushed from a6aa6d8 to 2c93287
Signed-off-by: Filinto Duran <1373693+filintod@users.noreply.github.com>
- Changed `contextID` to `context_id` in `dapr.proto` and updated the JSON name mapping.
- Updated all references in the codebase to use the new field name `ContextId`.
- Added the `ConverseStreamAlpha1` endpoint to the gRPC mappings.
- Adjusted test cases to reflect the new naming convention for context ID.
- Changed the package name from `http` to `grpc` in the relevant integration tests.

Signed-off-by: Filinto Duran <1373693+filintod@users.noreply.github.com>
Another pass of review from me: please can you move the processing of the conversation messages out into a separate package?
@@ -220,6 +220,9 @@ service Dapr {

   // Converse with a LLM service
   rpc ConverseAlpha1(ConversationRequest) returns (ConversationResponse) {}
+
+  // Converse with a LLM service using streaming
+  rpc ConverseStreamAlpha1(ConversationRequest) returns (stream ConversationStreamResponse) {}
Why is this API not bi-directional, with the client being able to send more prompts during the same "conversation"?
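For illustration, a bi-directional variant would keep a single stream open so the client can send follow-up prompts without reconnecting. A minimal Go sketch of that shape, using plain interfaces as stand-ins for gRPC-generated code (all type and method names below are hypothetical; this PR only defines a server-streaming RPC):

```go
package main

import "fmt"

// Hypothetical stand-ins for gRPC-generated types. This PR defines a
// server-streaming RPC only, so the bidi surface below is an assumption.
type ConversationRequest struct{ Prompt string }

type ConversationStreamResponse struct {
	Chunk    string
	Complete bool
}

// BidiStream mirrors the Send/Recv surface a bidi gRPC stream would expose.
type BidiStream interface {
	Send(*ConversationRequest) error
	Recv() (*ConversationStreamResponse, error)
}

// converse sends each prompt on the same open stream, draining the replies
// for one prompt before sending the next. That is the property being asked
// about: no reconnect between turns of the same conversation.
func converse(s BidiStream, prompts ...string) error {
	for _, p := range prompts {
		if err := s.Send(&ConversationRequest{Prompt: p}); err != nil {
			return err
		}
		for {
			resp, err := s.Recv()
			if err != nil {
				return err
			}
			fmt.Print(resp.Chunk)
			if resp.Complete {
				break
			}
		}
	}
	return nil
}

func main() {} // placeholder so the sketch compiles standalone
```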
  optional int32 completion_tokens = 2 [json_name = "completionTokens"];
  // Total number of tokens used
  optional int32 total_tokens = 3 [json_name = "totalTokens"];
}
nit: missing newline at EOF
  optional int32 prompt_tokens = 1 [json_name = "promptTokens"];
  // Number of tokens in the completion
  optional int32 completion_tokens = 2 [json_name = "completionTokens"];
  // Total number of tokens used
  optional int32 total_tokens = 3 [json_name = "totalTokens"];
Any reason these are signed and 32-bit?
Suggested change:
-  optional int32 prompt_tokens = 1 [json_name = "promptTokens"];
+  optional uint64 prompt_tokens = 1 [json_name = "promptTokens"];
   // Number of tokens in the completion
-  optional int32 completion_tokens = 2 [json_name = "completionTokens"];
+  optional uint64 completion_tokens = 2 [json_name = "completionTokens"];
   // Total number of tokens used
-  optional int32 total_tokens = 3 [json_name = "totalTokens"];
+  optional uint64 total_tokens = 3 [json_name = "totalTokens"];
// ConversationStreamResponse is the streaming response for Conversation.
message ConversationStreamResponse {
  oneof response_type {
Will this message ever contain other fields outside of the `oneof`? Generally it's good practice to move the `oneof` definition to a separate message to preserve the field number sequence.
if input.GetScrubPII() {
	return true
}
Why is the scrubber enabled for all when only some inputs have it enabled?
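For illustration, the per-input alternative this question points at could look like the following self-contained sketch; `input` and the `scrub` func are simplified stand-ins for the runtime's actual types:

```go
package main

import (
	"fmt"
	"strings"
)

// Simplified stand-ins for the runtime's conversation input and scrubber.
type input struct {
	message  string
	scrubPII bool
}

// scrubInputs applies the scrubber per input, instead of turning it on for
// the whole request when any single input sets the flag.
func scrubInputs(inputs []input, scrub func(string) string) {
	for i := range inputs {
		if inputs[i].scrubPII { // per-input check, not a request-wide switch
			inputs[i].message = scrub(inputs[i].message)
		}
	}
}

func main() {
	msgs := []input{
		{message: "my email is a@b.com", scrubPII: true},
		{message: "leave this one untouched", scrubPII: false},
	}
	scrubInputs(msgs, func(s string) string {
		return strings.ReplaceAll(s, "a@b.com", "<EMAIL>")
	})
	fmt.Println(msgs) // [{my email is <EMAIL> true} {leave this one untouched false}]
}
```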
// Simulate streaming by sending the complete response as chunks
if resp != nil {
	contextID = resp.ConversationContext
	if len(resp.Outputs) > 0 {
Redundant.
Suggested change:
-	if len(resp.Outputs) > 0 {
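For context, `range` over a nil or empty slice simply runs zero iterations, which is why the length guard adds nothing:

```go
package main

import "fmt"

func main() {
	var outputs []string // nil slice
	for _, o := range outputs {
		fmt.Println(o) // never reached: ranging a nil/empty slice is a no-op
	}
	fmt.Println("loop skipped safely, no panic")
}
```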
for _, output := range resp.Outputs {
	// Break the result into chunks to simulate streaming
	content := output.Result
	chunkSize := 50 // Send in 50-character chunks
Why 50?

gRPC by default has a maximum message size of 4 MB (~4,000,000 bytes), so it's incredibly inefficient to be sending small payloads like this. I don't think we need to be chunking at all. But if we must, we should fetch the configured max message size and chunk on a slightly smaller number to account for headers and other fields in the message.
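A minimal sketch of what size-aware chunking could look like, assuming the configured max size is available to the caller; the headroom value is an assumed allowance for the message envelope:

```go
package main

import "fmt"

// chunk splits content based on the configured gRPC max message size rather
// than a hard-coded 50 bytes. maxMsgSize would come from the server/client
// config (gRPC defaults to 4 MB); headroom is an assumed allowance for
// headers and the other fields in the stream response message.
func chunk(content string, maxMsgSize int) []string {
	const headroom = 64 * 1024
	size := maxMsgSize - headroom
	if size <= 0 || len(content) <= size {
		return []string{content}
	}
	var chunks []string
	for len(content) > size {
		// Note: byte-based slicing; a real implementation should avoid
		// splitting a multi-byte UTF-8 rune across chunks.
		chunks = append(chunks, content[:size])
		content = content[size:]
	}
	return append(chunks, content)
}

func main() {
	out := chunk(string(make([]byte, 10*1024*1024)), 4*1024*1024)
	fmt.Println(len(out)) // 3 chunks for a 10 MiB payload
}
```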
// Add AI provider components if their API keys are available
for _, provider := range liveConversationAIProviders {
	if apiKey := os.Getenv(provider.envVar); apiKey != "" {
Move this to an end-to-end test. Integration tests are always self-contained and offline: use the `echo` conversation component type, or use a mock server.
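For the mock-server option, a self-contained offline stub could look like the sketch below; the JSON body shape is an assumption for illustration only:

```go
package conversation_test

import (
	"net/http"
	"net/http/httptest"
	"testing"
)

// TestWithMockLLMServer shows the mock-server option: an offline HTTP stub
// that a conversation component's endpoint could be pointed at, keeping the
// test self-contained with no live provider or API key.
func TestWithMockLLMServer(t *testing.T) {
	srv := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, _ *http.Request) {
		w.Header().Set("Content-Type", "application/json")
		_, _ = w.Write([]byte(`{"outputs":[{"result":"hello from mock"}]}`))
	}))
	t.Cleanup(srv.Close)

	// ...configure the component under test with srv.URL instead of a live
	// provider, so the test stays self-contained and offline.
	_ = srv.URL
}
```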
@@ -34,6 +34,14 @@ type basic struct {
 	daprd *daprd.Daprd
 }

+func getEchoEstimatedTokens(msg ...string) int {
Please do not put helper functions at the top of the file.
s.daprd.WaitUntilRunning(t, ctx)

client := s.daprd.GRPCClient(t, ctx)
Is the `oneof` order (completion last) being tested?
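If that property isn't covered yet, a sketch of the assertion, with a stand-in type instead of the generated `oneof`:

```go
package main

import "fmt"

// event stands in for the generated ConversationStreamResponse oneof: each
// stream message is either a content chunk or the final completion.
type event struct {
	chunk      string
	completion bool
}

// completionIsLast checks a drained stream: the completion message must
// appear only as the final element.
func completionIsLast(events []event) bool {
	for i, e := range events {
		if e.completion && i != len(events)-1 {
			return false // completion arrived before the end of the stream
		}
	}
	return len(events) > 0 && events[len(events)-1].completion
}

func main() {
	stream := []event{{chunk: "hel"}, {chunk: "lo"}, {completion: true}}
	fmt.Println(completionIsLast(stream)) // true
}
```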
Description
Add streaming support to the conversation API. Depends on dapr/components-contrib#3847
Issue reference
Please reference the issue this PR will close: #8813
Checklist
Please make sure you've completed the relevant tasks for this PR, out of the following list: