Tags · code-monad/llama.cpp

master-f5a77a6

Remove temporary notice and update hot topics

Mar 22, 2023
56817b1
zip
tar.gz

master-d5850c5

Add missing header for memcpy (ggml-org#386)

fixed: memcpy is not defined

Mar 22, 2023
d5850c5
zip
tar.gz

master-ae44e23

When seed <= 0 - use the clock to generate one

Mar 22, 2023
ae44e23
zip
tar.gz

master-928480e

When seed <= 0 - use the clock to generate one

Mar 22, 2023
ae44e23
zip
tar.gz

master-4122dff

cmake: make llama an actual library (ggml-org#392)

Mar 22, 2023
4122dff
zip
tar.gz

master-56e659a

fix perplexity after c-api refactor (ggml-org#390)

* preallocate a buffer of fitting size for tokenization (utils.cpp)

* don't create a new std::string (especially here, where it's usually large)

Mar 22, 2023
56e659a
zip
tar.gz

master-e6c9e09

Fix bin dir for win ci

Mar 21, 2023
e6c9e09
zip
tar.gz

master-8cf9f34

Adding missing features of CMakeLists.txt & Refactoring (ggml-org#131)

* Functionality addition CMakeLists.txt

Refactoring:
1. Simplify more options that are negation of negation.
LLAMA_NO_ACCELERATE -> LLAMA_ACCELERATE
2. Changed to an optional expression instead of forcing to enable AVX2 in MSVC.
3. Make CMAKE_CXX_STANDARD, which is different from Makefile, the same.
4. Use add_compile_options instead of adding options to CMAKE_C_FLAGS.
5. Make utils use target_link_libraries instead of directly referencing code.

Added features:
1. Added some options.
LLAMA_STATIC_LINK,LLAMA_NATIVE,LLAMA_LTO,LLAMA_GPROF,LLAMA_OPENBLAS

* Fix Accelerate link in CMake

* Windows build Fix

* C++11 to C++17

* Reflects C/C++ standard individually

* Change the version to 3.12

---------

Co-authored-by: Georgi Gerganov <ggerganov@gmail.com>

Mar 21, 2023
8cf9f34
zip
tar.gz

master-2e664f1

Add initial AVX512 support for dot product on Linux (ggml-org#320)

 * Update Makefile to detect AVX512 support and add compiler flags if it's available
 * Based on existing AVX2 implementation, dot product on one 32-value block of 4-bit quantized ints at a time
 * Perform 8 bit -> 16 bit sign extension and multiply+add on 32 values at time instead of 16
 * Use built-in AVX512 horizontal reduce add to get sum at the end
 * Manual unrolling on inner dot product loop to reduce loop counter overhead

Mar 21, 2023
2e664f1
zip
tar.gz

master-01a297b

specify build type for ctest on windows (ggml-org#371)

Mar 21, 2023
01a297b
zip
tar.gz

PreviousNext

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

master-f5a77a6

master-d5850c5

master-ae44e23

master-928480e

master-4122dff

master-56e659a

master-e6c9e09

master-8cf9f34

master-2e664f1

master-01a297b

Tags: code-monad/llama.cpp