ggml : add SSE 4.2 and x64 base variant for CPUs without AVX #12871

slaren · 2025-04-10T10:00:21Z

corsairius · 2025-04-21T16:07:44Z

How long does it usually take for a bug fix to become certified? Is it necessary to pass all of the above tests to get it into the review queue?

Thanks!

acbits · 2025-04-21T18:26:08Z

Just noticed this fix.

I am curious why the user can't fix this issue by setting CFLAGS to -msse4.2 -mno-avx?
Is there any reason why we add more code to cmake as it is a maintenance burden?

P. S. A even better option is to set -march=<cpu arch> and -mtune=<cpu> as it enables the exact set of flags required for that architecture.

…g#12871) * ggml : add SSE 4.2 variant for CPUs without AVX * ggml : add x64 base ABI variant

slaren · 2025-04-21T19:12:15Z

@acbits see #10606 and #10626 for more details.

…g#12871) * ggml : add SSE 4.2 variant for CPUs without AVX * ggml : add x64 base ABI variant

ggml : add SSE 4.2 variant for CPUs without AVX

3c36f96

github-actions bot added the ggml changes relating to the ggml tensor library for machine learning label Apr 10, 2025

ggml : add x64 base ABI variant

625a7a7

slaren changed the title ~~ggml : add SSE 4.2 variant for CPUs without AVX~~ ggml : add SSE 4.2 and x64 base variant for CPUs without AVX Apr 10, 2025

slaren requested a review from ggerganov April 21, 2025 16:09

ggerganov approved these changes Apr 21, 2025

View reviewed changes

slaren merged commit 1d735c0 into master Apr 21, 2025
50 of 51 checks passed

slaren deleted the sl/no-avx-variant branch April 21, 2025 16:13

colout pushed a commit to colout/llama.cpp that referenced this pull request Apr 21, 2025

ggml : add SSE 4.2 and x64 base variant for CPUs without AVX (ggml-or…

dc5d894

…g#12871) * ggml : add SSE 4.2 variant for CPUs without AVX * ggml : add x64 base ABI variant

pockers21 pushed a commit to pockers21/llama.cpp that referenced this pull request Apr 28, 2025

ggml : add SSE 4.2 and x64 base variant for CPUs without AVX (ggml-or…

0190b62

…g#12871) * ggml : add SSE 4.2 variant for CPUs without AVX * ggml : add x64 base ABI variant

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ggml : add SSE 4.2 and x64 base variant for CPUs without AVX #12871

ggml : add SSE 4.2 and x64 base variant for CPUs without AVX #12871

slaren commented Apr 10, 2025

corsairius commented Apr 21, 2025

acbits commented Apr 21, 2025 •

edited

Loading

slaren commented Apr 21, 2025

ggml : add SSE 4.2 and x64 base variant for CPUs without AVX #12871

ggml : add SSE 4.2 and x64 base variant for CPUs without AVX #12871

Conversation

slaren commented Apr 10, 2025

corsairius commented Apr 21, 2025

acbits commented Apr 21, 2025 • edited Loading

slaren commented Apr 21, 2025

acbits commented Apr 21, 2025 •

edited

Loading