Support Cosyvoice2-0.5B by allowing Qwen2 architecture to have an optional bias tensor #14711

tempstudio · 2025-07-16T05:00:45Z

Cosyvoice2-0.5B (https://github.com/FunAudioLLM/CosyVoice/blob/main/cosyvoice/vllm/cosyvoice2.py#L93) is a TTS model fine-tuned from Qwen2-0.5B, with an extra bias tensor on the decoder head.

This change lets the Qwen2 architecture treat that bias tensor as optional, so it is loaded when the model provides it. Loading it gives better quality when running Cosyvoice2 in llama.cpp.
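To illustrate what an optional bias on the decoder head means numerically, here is a minimal sketch in plain Python. It is not the llama.cpp implementation: the function name `output_head` and its parameters are hypothetical, chosen only for this example. The assumption is that the head computes `logits = W · h`, and adds `b` only when a bias tensor was found in the checkpoint.

```python
from typing import Optional, Sequence, List


def output_head(
    hidden: Sequence[float],
    weight: Sequence[Sequence[float]],
    bias: Optional[Sequence[float]] = None,
) -> List[float]:
    """Project one hidden state onto vocabulary logits.

    `weight` has one row per vocabulary entry. `bias` is optional:
    a plain Qwen2 checkpoint would pass None, while a checkpoint with
    a biased decoder head (as described for Cosyvoice2) would pass a
    vector with one entry per vocabulary entry.
    All names here are hypothetical and for illustration only.
    """
    # Matrix-vector product: one dot product per vocabulary row.
    logits = [sum(w * h for w, h in zip(row, hidden)) for row in weight]

    # Add the bias only if the checkpoint actually provided one.
    if bias is not None:
        if len(bias) != len(logits):
            raise ValueError("bias length must match the number of weight rows")
        logits = [logit + b for logit, b in zip(logits, bias)]

    return logits


hidden = [1.0, 2.0]
weight = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

without_bias = output_head(hidden, weight)
with_bias = output_head(hidden, weight, bias=[0.5, -1.0, 0.0])
```

In this toy case the bias changes which entry has the highest logit relative to the others, which is why dropping it at load time can degrade the sampled tokens.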

hipudding · 2025-08-02T06:57:46Z

@tempstudio Hello, may I ask how I should load and run inference with the cosyvoice2 model? Thanks.

hipudding · 2025-08-05T08:48:35Z

@tempstudio I was thinking about it — this approach only allows llama.cpp to be used as a library, correct? It can’t yet run inference for CosyVoice directly, as preprocessing and postprocessing would still need to be handled by external code.

tempstudio · 2025-08-06T03:13:11Z

@hipudding Yes you are right, this is a replacement for the LLM part of Cosyvoice. It doesn't cover the FLOW and HIFIGAN parts of Cosyvoice.

hipudding · 2025-08-06T06:58:08Z

> @hipudding Yes you are right, this is a replacement for the LLM part of Cosyvoice. It doesn't cover the FLOW and HIFIGAN parts of Cosyvoice.

Thanks. Do you have plans to fully support CosyVoice?

qwaqrm added 2 commits July 15, 2025 23:51

support cosyvoice2 over qwen2

54dd8e2

Merge branch 'master' of https://github.com/tempstudio/llama.cpp

1095d56

ggerganov approved these changes Jul 16, 2025


ggerganov merged commit b0f0ecc into ggml-org:master Jul 16, 2025
45 of 48 checks passed
