Proposal: new chat_template_arg `enable_history_reasoning` for reusing prompt cache among querys within Agents .

#39

by Abioy - opened 2 days ago

base: refs/heads/main

←

from: refs/pr/39

Discussion Files changed

-1

Abioy

2 days ago

•

edited 2 days ago

Currently reasoning contents before the last user query msg will be ignored.
This might cause prompt cache miss, especially within agents (eg. Coding Agents / Deep Agents) that just calling tools many time before the last user query msg.
So, here I propose a new chat template arg enable_history_reasoning for (optionally) keep the history reasoning contents in the final prompt, and reusing prompt cache (better) in such cases.

Proposal: new chat_template_arg `enable_history_reasoning` for reusing prompt cache among querys within Agents .3f2adb65

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Ready to merge

This branch is ready to get merged automatically.

· Sign up or log in to comment