Move offloadKVCacheToGpu to llm.load #311

mattjcly · 2025-05-06T15:33:48Z

Since KV cache is very specific to llm

ryan-the-crayon · 2025-05-06T15:36:00Z

packages/lms-kv-config/src/schema.ts

@@ -428,7 +428,7 @@ export const llmSharedLoadConfigSchematics = llmLoadSchematics.sliced(
 const llamaLoadConfigSchematics = globalConfigSchematics.sliced("llama.load.*", "load.*");

 export const llmLlamaLoadConfigSchematics = llmSharedLoadConfigSchematics
-  .union(llmLoadSchematics.sliced("llama.*", "load.*"))
+  .union(llmLoadSchematics.sliced("llama.*", "load.*", "offloadKVCacheToGpu"))


I think offloadKVCacheToGpu is already covered by load.*?

That would be looking for llm.load.load I believe, since llmLoadSchematics already gets scoped into llm.load

export const llmLoadSchematics = globalConfigSchematics .scoped("llm.load") .union(globalConfigSchematics.sliced("envVars"));

But I could be wrong

Basically I observe that I cannot do:

const testConfig = llmLlamaLoadConfigSchematics.parse(loadConfig); const test = testConfig.get("offloadKVCacheToGpu");

unless I add this change

mattjcly added 2 commits May 6, 2025 11:24

Move offloadKVCacheToGpu to llm.load

9e99325

Fix llama load schematic

ed28278

mattjcly requested a review from ryan-the-crayon May 6, 2025 15:33

github-actions bot added the CLA signed Indicates if all contributors have signed the CLA label May 6, 2025

ryan-the-crayon reviewed May 6, 2025

View reviewed changes

ryan-the-crayon approved these changes May 6, 2025

View reviewed changes

mattjcly merged commit 041668e into main May 6, 2025
1 check passed

mattjcly deleted the matt/offloadkvtollmload branch May 6, 2025 15:48

github-actions bot locked and limited conversation to collaborators May 6, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Move offloadKVCacheToGpu to llm.load #311

Move offloadKVCacheToGpu to llm.load #311

Move offloadKVCacheToGpu to llm.load #311

Move offloadKVCacheToGpu to llm.load #311

Conversation

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment