Skip to content

Tags: code-monad/llama.cpp

Tags

master-f5a77a6

Toggle master-f5a77a6's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Remove temporary notice and update hot topics

master-d5850c5

Toggle master-d5850c5's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Add missing header for memcpy (ggml-org#386)

fixed: memcpy is not defined

master-ae44e23

Toggle master-ae44e23's commit message

Verified

This commit was signed with the committer’s verified signature.
ggerganov Georgi Gerganov
When seed <= 0 - use the clock to generate one

master-928480e

Toggle master-928480e's commit message

Verified

This commit was signed with the committer’s verified signature.
ggerganov Georgi Gerganov
When seed <= 0 - use the clock to generate one

master-4122dff

Toggle master-4122dff's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
cmake: make llama an actual library (ggml-org#392)

master-56e659a

Toggle master-56e659a's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
fix perplexity after c-api refactor (ggml-org#390)

* preallocate a buffer of fitting size for tokenization (utils.cpp)

* don't create a new std::string (especially here, where it's usually large)

master-e6c9e09

Toggle master-e6c9e09's commit message
Fix bin dir for win ci

master-8cf9f34

Toggle master-8cf9f34's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Adding missing features of CMakeLists.txt & Refactoring (ggml-org#131)

* Functionality addition CMakeLists.txt

Refactoring:
1. Simplify more options that are negation of negation.
LLAMA_NO_ACCELERATE -> LLAMA_ACCELERATE
2. Changed to an optional expression instead of forcing to enable AVX2 in MSVC.
3. Make CMAKE_CXX_STANDARD, which is different from Makefile, the same.
4. Use add_compile_options instead of adding options to CMAKE_C_FLAGS.
5. Make utils use target_link_libraries instead of directly referencing code.

Added features:
1. Added some options.
LLAMA_STATIC_LINK,LLAMA_NATIVE,LLAMA_LTO,LLAMA_GPROF,LLAMA_OPENBLAS

* Fix Accelerate link in CMake

* Windows build Fix

* C++11 to C++17

* Reflects C/C++ standard individually

* Change the version to 3.12

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

master-2e664f1

Toggle master-2e664f1's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Add initial AVX512 support for dot product on Linux (ggml-org#320)

 * Update Makefile to detect AVX512 support and add compiler flags if it's available
 * Based on existing AVX2 implementation, dot product on one 32-value block of 4-bit quantized ints at a time
 * Perform 8 bit -> 16 bit sign extension and multiply+add on 32 values at time instead of 16
 * Use built-in AVX512 horizontal reduce add to get sum at the end
 * Manual unrolling on inner dot product loop to reduce loop counter overhead

master-01a297b

Toggle master-01a297b's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
specify build type for ctest on windows (ggml-org#371)