Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

Async DirectIO model loading on Linux
#18012 opened Dec 13, 2025 by JTischbein Loading…
webui: fix chat screen shadow width
#18010 opened Dec 13, 2025 by polydecay Loading…
model-conversion : cast logits to float32 examples python python script changes
#18009 opened Dec 13, 2025 by ggerganov Loading…
convert : fix gpt-oss python python script changes
#18008 opened Dec 13, 2025 by ggerganov Loading…
models : fix YaRN regression + consolidate logic
#18006 opened Dec 13, 2025 by ggerganov Loading…
CLI: fixed adding cli and completion into docker containers, improved docs devops improvements to build systems and github actions documentation Improvements or additions to documentation
#18003 opened Dec 13, 2025 by andrew-aladev Loading…
Clarify that steps also apply to linux documentation Improvements or additions to documentation
#18002 opened Dec 13, 2025 by alosslessdev Loading…
server: add /v1/metrics endpoint examples server
#18001 opened Dec 13, 2025 by Kritavya Loading…
mtmd: add GLM4V multimodal model with conversion support examples model Model specific python python script changes
#17998 opened Dec 13, 2025 by eelbaz Loading…
Optimization: Qwen3 next autoregressive pass model Model specific
#17996 opened Dec 13, 2025 by pwilkin Loading…
CLI: fixed dead links to tools/main for cli and completion, fixed code owners documentation Improvements or additions to documentation examples
#17993 opened Dec 13, 2025 by andrew-aladev Loading…
HIP: Refactor mma for RDNA and CDNA ggml changes relating to the ggml tensor library for machine learning Nvidia GPU Issues specific to Nvidia GPUs
#17990 opened Dec 13, 2025 by zhang-hui-yulo Draft
1 task
sync : ggml ggml changes relating to the ggml tensor library for machine learning script Script related
#17988 opened Dec 13, 2025 by ggerganov Loading…
kv-cache: Fix state restore fragmented cache testing Everything test related
#17982 opened Dec 13, 2025 by ssweens Loading…
mtmd: refactor audio preprocessing examples
#17978 opened Dec 12, 2025 by ngxson Loading…
ggml-hexagon: Implement true Q8_0 quantization on Hexagon NPU for more accurate mixed-precision matmul operations ggml changes relating to the ggml tensor library for machine learning
#17977 opened Dec 12, 2025 by ngdxzy Loading…
mtmd: (WIP) gemma3n vision support examples python python script changes
#17961 opened Dec 12, 2025 by ngxson Draft
vulkan: Add perf logger mode with concurrency ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#17944 opened Dec 11, 2025 by jeffbolznv Loading…
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.