[pull] master from ggml-org:master #206

pull · 2025-07-18T22:12:04Z

See Commits and Changes for more details.

Created by pull[bot] (v2.0.0-alpha.3)

Can you help keep this open source service alive? 💖 Please sponsor : )

ggml-ci

* Documentation: Rewrote and updated the "Without docker" portion of the Vulkan backend build documentation. * Documentation: Reorganize build.md's Vulkan section.

…) (#14707)

* imatrix : allow processing multiple chunks per batch * perplexity : simplify filling the batch * imatrix : fix segfault when using a single chunk per batch * imatrix : use GGUF to store imatrix data * imatrix : fix conversion problems * imatrix : use FMA and sort tensor names * py : add requirements for legacy imatrix convert script * perplexity : revert changes * py : include imatrix converter requirements in toplevel requirements * imatrix : avoid using designated initializers in C++ * imatrix : remove unused n_entries * imatrix : allow loading mis-ordered tensors Sums and counts tensors no longer need to be consecutive. * imatrix : more sanity checks when loading multiple imatrix files * imatrix : use ggml_format_name instead of std::string concatenation Co-authored-by: Xuan Son Nguyen <son@huggingface.co> * quantize : use unused imatrix chunk_size with LLAMA_TRACE * common : use GGUF for imatrix output by default * imatrix : two-way conversion between old format and GGUF * convert : remove imatrix to gguf python script * imatrix : use the function name in more error messages * imatrix : don't use FMA explicitly This should make comparisons between the formats easier because this matches the behavior of the previous version. * imatrix : avoid returning from void function save_imatrix * imatrix : support 3d tensors with MUL_MAT * quantize : fix dataset name loading from gguf imatrix * common : move string_remove_suffix from quantize and imatrix Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com> * imatrix : add warning when legacy format is written * imatrix : warn when writing partial data, to help guess dataset coverage Also make the legacy format store partial data by using neutral values for missing data. This matches what is done at read-time for the new format, and so should get the same quality in case the old format is still used. * imatrix : avoid loading model to convert or combine imatrix * imatrix : avoid using imatrix.dat in README --------- Co-authored-by: Xuan Son Nguyen <son@huggingface.co> Co-authored-by: Sigbjørn Skjæret <sigbjorn.skjaeret@scala.com>

metal : fuse add, mul + add tests (#14596)

bf9087f

ggml-ci

pull bot locked and limited conversation to collaborators Jul 18, 2025

pull bot added the ⤵️ pull label Jul 18, 2025

github-actions bot added ggml testing Apple Metal labels Jul 18, 2025

sync : ggml

b172309

github-actions bot added the script label Jul 19, 2025

Documentation: Update build.md's Vulkan section (#14736)

f0d4d17

* Documentation: Rewrote and updated the "Without docker" portion of the Vulkan backend build documentation. * Documentation: Reorganize build.md's Vulkan section.

github-actions bot added the documentation Improvements or additions to documentation label Jul 19, 2025

Vulkan: Fix fprintf format-security warning (#14770)

83f5872

github-actions bot added the Vulkan label Jul 19, 2025

Peter0x44 and others added 2 commits July 19, 2025 17:58

vulkan: Add logging for bf16 features to ggml_vk_print_gpu_info (#13274…

d4b91ea

…) (#14707)

github-actions bot added examples python labels Jul 19, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[pull] master from ggml-org:master #206

[pull] master from ggml-org:master #206

pull bot commented Jul 18, 2025 •

edited

Loading

Uh oh!

Uh oh!

[pull] master from ggml-org:master #206

Are you sure you want to change the base?

[pull] master from ggml-org:master #206

Conversation

pull bot commented Jul 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

pull bot commented Jul 18, 2025 •

edited

Loading