Add logprobs return in ChatCompletionResponse #1311

windspirit95 · 2024-03-29T06:17:51Z

Hi, this pull request is my small update/modification to add logprobs and top_logprobs arguments when creating chat completion request into llama cpp server (align with OpenAI API template).

Sample request:
POST https://host:port/v1/chat/completions
{
"model": "mixtral-8x7b",
"messages": [
{
"role": "system",
"content": "You are a helpful assistant."
},
{
"role": "user",
"content": "Hello! Could you help me plan for 3-day trip in Hanoi?"
}
],
"logprobs": true,
"top_logprobs": 1,
"max_tokens": 1024,
"temperature": 0.0001,
"stream": false
}

Could you help me to review it? Thanks ^^

abetlen · 2024-03-31T17:29:26Z

Hey @windspirit95 great contribution, I just changed the default to logprobs=False to match the OpenAI api and added an example to the OpenAPI schema.

windspirit95 and others added 6 commits March 29, 2024 15:04

Add logprobs return in ChatCompletionResponse

21124db

Merge branch 'main' into chat-completion-logprobs

385b956

Fix duplicate field

9c3d35f

Set default to false

f884796

Simplify check

f375181

Add server example

c536903

abetlen merged commit aa9f1ae into abetlen:main Mar 31, 2024

windspirit95 deleted the chat-completion-logprobs branch March 31, 2024 23:53

devcxl mentioned this pull request Apr 4, 2024

Bug: Due to errors caused by adding logprobs #1328

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add logprobs return in ChatCompletionResponse #1311

Add logprobs return in ChatCompletionResponse #1311

Uh oh!

windspirit95 commented Mar 29, 2024

Uh oh!

abetlen commented Mar 31, 2024

Uh oh!

Uh oh!

Add logprobs return in ChatCompletionResponse #1311

Add logprobs return in ChatCompletionResponse #1311

Uh oh!

Conversation

windspirit95 commented Mar 29, 2024

Uh oh!

abetlen commented Mar 31, 2024

Uh oh!

Uh oh!