Skip to content

feat: Add optional prompt processing progress streaming #14731

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 1 commit into
base: master
Choose a base branch
from

Conversation

baonudesifeizhai
Copy link

  • Add include_prompt_progress parameter to slot_params (default: false)
  • Extend server_task_result_cmpl_partial with progress fields
  • Implement send_progress_response() function with 1% progress intervals
  • Add progress response in prompt processing loop
  • Update JSON response to include prompt_processing field when requested
  • Add comprehensive documentation to README.md
  • Ensure full backward compatibility with existing clients

Closes #14685

Make sure to read the contributing guidelines before submitting a PR

- Add include_prompt_progress parameter to slot_params (default: false)
- Extend server_task_result_cmpl_partial with progress fields
- Implement send_progress_response() function with 1% progress intervals
- Add progress response in prompt processing loop
- Update JSON response to include prompt_processing field when requested
- Add comprehensive documentation to README.md
- Ensure full backward compatibility with existing clients

Closes ggml-org#14685
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Feature Request: Server stream response for "prompt processing progress"
1 participant