Skip to content

Added support for plugins to receive request headers and to modify the headers that are finally forwarded #79

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 7 commits into from
Apr 29, 2025

Conversation

shmuelk
Copy link
Collaborator

@shmuelk shmuelk commented Apr 29, 2025

This PR adds support for the PreSchedule, Filter, Scorer, Picker, and PostSchedule plugins to receive all of the headers sent by the user in their request. This enables the writing of plugins that make use of additional headers.

In addition, this PR adds support for the above plugins to add and/or modify the headers in the user's request. This is needed in certain solutions of Disaggregated Prefill/Decode.

The unit tests added are very simplistic. A better unit test will be added later at the StreamingServer level

nirrozenbaum and others added 6 commits April 27, 2025 09:41
Signed-off-by: Nir Rozenbaum <nirro@il.ibm.com>
* removed unused noop plugin

Signed-off-by: Nir Rozenbaum <nirro@il.ibm.com>

* more scheduler refactoring

Signed-off-by: Nir Rozenbaum <nirro@il.ibm.com>

* more refactoring

Signed-off-by: Nir Rozenbaum <nirro@il.ibm.com>

* added weights to scorers and calculating weighted score

Signed-off-by: Nir Rozenbaum <nirro@il.ibm.com>

* addressed code review comments

Signed-off-by: Nir Rozenbaum <nirro@il.ibm.com>

---------

Signed-off-by: Nir Rozenbaum <nirro@il.ibm.com>
…est headers and to be able to add request headers
@shmuelk shmuelk requested review from mayabar and elevran April 29, 2025 08:47
@mayabar mayabar self-requested a review April 29, 2025 11:15
@shmuelk
Copy link
Collaborator Author

shmuelk commented Apr 29, 2025

Partially fixes issue #73

@shmuelk shmuelk merged commit c6adfc3 into neuralmagic:dev Apr 29, 2025
1 check passed
Copy link
Collaborator

@mayabar mayabar left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants