convert : refactor rope scaling handling #18013

CISC · 2025-12-14T02:56:53Z

Handle rope scaling in set_gguf_parameters to deduplicate code and support the new rope_parameters (where rope_theta also has moved) introduced in huggingface/transformers#39847

Obsoletes #18008

ggerganov

Confirming that gpt-oss converts correctly. Will take a look at the rest of the changes, but probably won't be able to give much additional feedback on the python code.

ngxson · 2025-12-14T10:58:31Z

convert_hf_to_gguf.py

        self.model_name = model_name
        self.dir_model_card = dir_model  # overridden in convert_lora_to_gguf.py

+        # Ensure "rope_theta" and "rope_type" is mirrored in rope_parameters


I think it probably better to extract this into a new method called load_rope_params

I considered it, but it got a little awkward as a static method and not much sense as a regular method.

convert_hf_to_gguf.py

refactor rope scaling handling

8a68748

github-actions bot added the python python script changes label Dec 14, 2025

ws--

468db7b

loci-dev mentioned this pull request Dec 14, 2025

UPSTREAM PR #18013: convert : refactor rope scaling handling auroralabs-loci/llama.cpp#560

Open

missed a couple

140d70b

CISC requested review from ggerganov and ngxson December 14, 2025 03:25

ggerganov mentioned this pull request Dec 14, 2025

convert : fix gpt-oss #18008

Closed

ggerganov approved these changes Dec 14, 2025

View reviewed changes

ngxson reviewed Dec 14, 2025

View reviewed changes

CISC mentioned this pull request Dec 14, 2025

Misc. bug: ERROR:hf-to-gguf:Model Ministral3ForCausalLM is not supported #17987

Closed

use find_hparam

b60e9a2

ngxson approved these changes Dec 14, 2025

View reviewed changes

CISC merged commit 5c8a717 into master Dec 14, 2025
8 of 9 checks passed

CISC deleted the cisc/convert-rope-parameters branch December 14, 2025 15:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

convert : refactor rope scaling handling #18013

convert : refactor rope scaling handling #18013

CISC commented Dec 14, 2025

Uh oh!

ggerganov left a comment

Uh oh!

ngxson Dec 14, 2025

Uh oh!

CISC Dec 14, 2025

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

convert : refactor rope scaling handling #18013

convert : refactor rope scaling handling #18013

Conversation

CISC commented Dec 14, 2025

Uh oh!

ggerganov left a comment

Choose a reason for hiding this comment

Uh oh!

ngxson Dec 14, 2025

Choose a reason for hiding this comment

Uh oh!

CISC Dec 14, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants