[Feature Request] Simplified API for Inference and HTTP Server Integration #565

First, I want to express my deep gratitude for this project. Thank you all so much!

I'm writing to inquire about potential improvements to the API for inference, as well as the possibility of integrating an HTTP server for serving text generation requests. Specifically, I'm interested in the following:

1. A simplified and more flexible method for inference that allows for easier integration with external applications. I'm looking to manage chat history in a separate application and would like to have a straightforward way to perform inference on user-provided text.

2. The ability to serve text generation requests over HTTP. I'm interested in implementing a client-server architecture and would like to know if there are plans to include an HTTP server in the repository.
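To make the two requests above concrete, here is a rough sketch of the shape I have in mind, using only the Python standard library. Everything in it is an assumption on my part rather than anything the project currently provides: `generate` is a hypothetical placeholder for whatever inference entry point the project would expose, and the `/generate` path and the `prompt`/`text` JSON fields are made up for illustration. The server is stateless, so chat history would live in the calling application.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer
from typing import Callable, Dict, Tuple


def generate(prompt: str) -> str:
    """Hypothetical stand-in for the project's inference call.

    It only echoes the prompt; a real integration would call the model here.
    """
    return "echo: " + prompt


def handle_request(body: bytes,
                   infer: Callable[[str], str] = generate) -> Tuple[int, Dict[str, str]]:
    """Turn a raw JSON request body into an (HTTP status, response payload) pair."""
    try:
        prompt = json.loads(body)["prompt"]
    except (ValueError, KeyError, TypeError):
        # Malformed JSON, missing field, or a non-object body.
        return 400, {"error": "expected a JSON object with a 'prompt' field"}
    if not isinstance(prompt, str):
        return 400, {"error": "'prompt' must be a string"}
    return 200, {"text": infer(prompt)}


class GenerateHandler(BaseHTTPRequestHandler):
    """Serves POST /generate (assumed endpoint name); holds no chat state."""

    def do_POST(self) -> None:
        if self.path != "/generate":
            self.send_error(404)
            return
        length = int(self.headers.get("Content-Length", 0))
        status, payload = handle_request(self.rfile.read(length))
        data = json.dumps(payload).encode("utf-8")
        self.send_response(status)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(data)))
        self.end_headers()
        self.wfile.write(data)


def serve(host: str = "127.0.0.1", port: int = 8080) -> None:
    """Run the server until interrupted."""
    HTTPServer((host, port), GenerateHandler).serve_forever()
```

Calling `serve()` would start the server, and a client could then POST `{"prompt": "..."}` to `/generate`. Keeping `handle_request` separate from the handler class means the request logic can be exercised without opening a socket, and the `infer` parameter is where a real inference function would be plugged in.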

I understand that the repository is rapidly evolving, and I'm excited to see the new features and improvements you have planned. I'm planning to hack together an HTTP server myself, but I'd like to know what your roadmap is so that I can avoid painful merges in the future. I'm open to contributing to the project if there's an opportunity for collaboration.

Could you please share any insights into the roadmap for these features, or let me know if there are any ongoing discussions or pull requests related to them?

