Tags: duaneking/llama.cpp
Quantized dot products for CUDA mul mat vec (ggml-org#2067)
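To make the commit above concrete, here is a hypothetical pure-Python sketch of a block-quantized dot product in the spirit of ggml's Q8_0 format (blocks of 32 int8 quants, each carrying one float scale). The CUDA kernels in the commit do the analogous math on the quants directly inside the matrix-vector product, rather than dequantizing the whole matrix first; the function names and block layout below are illustrative, not the actual ggml code.

```python
BLOCK = 32  # quants per block, as in ggml's Q8_0 layout

def quantize_q8(xs):
    """Quantize a float vector into (scale, int8-quant) blocks."""
    blocks = []
    for i in range(0, len(xs), BLOCK):
        chunk = xs[i:i + BLOCK]
        amax = max(abs(v) for v in chunk) or 1.0
        scale = amax / 127.0          # map the largest magnitude to 127
        quants = [round(v / scale) for v in chunk]
        blocks.append((scale, quants))
    return blocks

def dot_q8(a_blocks, b_blocks):
    """Dot product on quantized blocks: one integer multiply-accumulate
    loop per block, then a single multiply by the two block scales."""
    total = 0.0
    for (sa, qa), (sb, qb) in zip(a_blocks, b_blocks):
        acc = sum(x * y for x, y in zip(qa, qb))  # integer accumulation
        total += sa * sb * acc
    return total
```

Keeping the accumulation in integers per block is what makes this shape attractive on GPUs with fast integer dot-product instructions.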
llama: Don't double count the sampling time (ggml-org#2107)
Add an API example using server.cpp similar to OAI. (ggml-org#2009)
* add api_like_OAI.py
* add evaluated token count to server
* add /v1/ endpoints binding
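The adapter in this commit sits between an OpenAI-style client and server.cpp. A minimal sketch of the kind of translation it performs is below; the llama.cpp-side field names (`prompt`, `n_predict`, `temperature`, `stop`) follow the server example, but the chat template and function name here are made-up illustrations, not what api_like_OAI.py actually uses.

```python
def oai_to_completion(oai_request):
    """Flatten OpenAI-style chat messages into a single prompt and
    rename sampling fields to server.cpp's /completion parameters."""
    prompt = ""
    for msg in oai_request["messages"]:
        prompt += f'{msg["role"]}: {msg["content"]}\n'
    prompt += "assistant:"  # hypothetical template; not the real one
    return {
        "prompt": prompt,
        "n_predict": oai_request.get("max_tokens", 128),
        "temperature": oai_request.get("temperature", 0.8),
        "stop": ["\nuser:"],  # stop before the next user turn
    }
```

The response would then be rewrapped in the OpenAI JSON shape on the way back out.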
ggml : sync latest (new ops, macros, refactoring) (ggml-org#2106)
- add ggml_argmax()
- add ggml_tanh()
- add ggml_elu()
- refactor ggml_conv_1d() and variants
- refactor ggml_conv_2d() and variants
- add helper macros to reduce code duplication in ggml.c
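For reference, the semantics of the new ops are simple: tanh and ELU apply element-wise, argmax is a reduction. A pure-Python sketch (assuming `alpha = 1.0` for ELU, the common definition `f(x) = x if x > 0 else alpha * (e^x - 1)`; ggml's exact conventions may differ):

```python
import math

def argmax(xs):
    """Index of the largest element (first one wins on ties)."""
    best = 0
    for i, v in enumerate(xs):
        if v > xs[best]:
            best = i
    return best

def elu(x, alpha=1.0):
    """Exponential Linear Unit: identity for x > 0, saturating below."""
    return x if x > 0 else alpha * (math.exp(x) - 1.0)

def tanh_(x):
    """Hyperbolic tangent, via the stdlib."""
    return math.tanh(x)
```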
Allow old Make to build server. (ggml-org#2098)
Also make server build by default. Tested with Make 3.82.
embd-input: Fix input embedding example unsigned int seed (ggml-org#2105)
Simple webchat for server (ggml-org#1998)
* expose simple web interface on root domain
* embed index and add --path for choosing static dir
* allow server to multithread: web browsers send a lot of garbage requests, so we want the server to multithread when serving 404s for favicons etc. To avoid blowing up llama we just take a mutex when it's invoked.
* let's try this with the xxd tool instead and see if msvc is happier with that
* enable server in Makefiles
* add /completion.js file to make it easy to use the server from js
* slightly nicer css
* rework state management into session, expose historyTemplate to settings
Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>
CI: make the brew update temporarily optional. (ggml-org#2092)
Until they decide to fix the brew installation in the macOS runners; see the open issues, e.g. actions/runner-images#7710.