This repository was archived by the owner on Jul 4, 2025. It is now read-only.

feat: Self-Extend support #349

@hahuyhoang411

Problem
Self-Extend is a new technique that lets an LLM extend its context window to 8k or 16k tokens without further training.

Success Criteria
Self-Extend can also be used in Nitro, with settings equivalent to these llama.cpp flags:

--grp-attn-n 2
--grp-attn-w 128
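
For context, here is a minimal sketch of the position remapping behind Self-Extend, following my understanding of the method rather than llama.cpp's or Nitro's actual code. The function name and the parameters `group_size` and `neighbor_window` are hypothetical; they are assumed to play roles roughly analogous to `--grp-attn-n` and `--grp-attn-w`, and the real implementation may differ in its details.

```python
def self_extend_relative_position(
    q_pos: int, k_pos: int, group_size: int, neighbor_window: int
) -> int:
    """Relative position for a (query, key) pair under Self-Extend (sketch).

    Nearby tokens keep their exact distance; distant tokens share coarser,
    floor-divided positions so the largest relative position stays closer
    to the range seen during pretraining.
    """
    if k_pos > q_pos:
        raise ValueError("causal attention: key must not come after query")

    distance = q_pos - k_pos
    if distance < neighbor_window:
        # Normal attention inside the neighbor window.
        return distance

    # Grouped attention: floor-divide both positions by the group size and
    # shift the query so the two regimes meet at the window boundary.
    shift = neighbor_window - neighbor_window // group_size
    return q_pos // group_size + shift - k_pos // group_size


if __name__ == "__main__":
    # Assumed values mirroring the flags above (n=2, w=128).
    print(self_extend_relative_position(1000, 990, 2, 128))
    print(self_extend_relative_position(1000, 0, 2, 128))
```

With these assumed values, a token 1000 positions back is seen at a relative position of 564 instead of 1000, which is how the scheme keeps long contexts inside the positional range the model was trained on.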

Additional context
llama.cpp already supports it: ggml-org/llama.cpp#4815
