llama-model.cpp the monolithic giant #15523

pwilkin · 2025-08-23T11:38:24Z

pwilkin
Aug 23, 2025

Maybe it's a stupid question or maybe it's already been answered before, but: is there any particular reason why llama-model.cpp is kept as such a monolithic beast instead of moving all the graph builders to their separate files in some models/ subdirectory?

Obviously such a huge file is rough for IDE parsers and stuff, but the place where it really gets bad is using LLM assistants. I sometimes try to use i.e. Gemini to compare new model implementations with their reference Python implementation, but even Gemini with its 1 mil context struggles with analyzing the entire llama-model.cpp.

Would it be possible to actually refactor the file and move all the builders out or is there a good reason for why it stays as it is?

ggerganov · 2025-08-26T08:05:16Z

ggerganov
Aug 26, 2025
Maintainer

Yes, this will probably the next major refactoring. I was planning to have already started working on this, but got sidetracked with a few Metal optimizations the last couple of days. It's next on the list now.

2 replies

pwilkin Aug 26, 2025
Author

Great to hear! Do you mind if I propose a PR to that end or do you want to do it entirely yourself?

ggerganov Aug 26, 2025
Maintainer

I'll try to start a draft PR in the next days - we already have had some discussions how to approach the refactoring so it would be easier if I initiate it. We can discuss when it is open.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

llama-model.cpp the monolithic giant #15523

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment 2 replies

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Select a reply

Uh oh!

llama-model.cpp the monolithic giant #15523

Uh oh!

pwilkin Aug 23, 2025

Replies: 1 comment · 2 replies

Uh oh!

ggerganov Aug 26, 2025 Maintainer

Uh oh!

pwilkin Aug 26, 2025 Author

Uh oh!

Uh oh!

ggerganov Aug 26, 2025 Maintainer

pwilkin
Aug 23, 2025

Replies: 1 comment 2 replies

ggerganov
Aug 26, 2025
Maintainer

pwilkin Aug 26, 2025
Author

ggerganov Aug 26, 2025
Maintainer