ggml-threading.cpp #7576

kunnis · 2024-05-27T23:50:32Z

Move all the threading related stuff to it's own file as discussed in the text of #6915

I picked cpp because I was thinking we'd use std::thread so we won't have to maintain two implementations of the thread operations. It'll also let us use std::mutex and std::atomic, and those primitives will likely be faster than our own implementations. However, this would just be the leading structural change. My goal with this PR is to not make any functional changes, and then we can do separate tests of the actual functional changes.

@slaren @ggerganov Do you think I'm headed in the right direction with the structural changes? I still need to do some more testing and self-reviews and I've only tested this really in windows so far, but I want to make sure I'm not headed in the wrong direction.

kunnis · 2024-05-28T21:22:50Z

I'll need to update for #7598

slaren · 2024-05-28T21:26:53Z

ggml-threading.h

+extern void atomic_store(atomic_int* ptr, LONG val);
+extern LONG atomic_load(atomic_int* ptr);
+extern LONG atomic_fetch_add(atomic_int* ptr, LONG inc);
+extern LONG atomic_fetch_sub(atomic_int* ptr, LONG dec);


I think it is very important that these functions are inlined, so they cannot be made extern. Additionally, we cannot export symbols with these names, because they are very likely to create conflicts. Same with the pthread and sched_yield stuff.

I see. Valid points.

That's the exact kind of feedback I was looking for. Thanks.

slaren · 2024-05-28T21:27:07Z

ggml-threading.h

+extern void set_numa_thread_affinity(int thread_n);
+extern void clear_numa_thread_affinity(void);


Same issue here, we cannot export functions with these names. At least they need a ggml_ prefix.

kunnis added 2 commits May 27, 2024 16:26

adding in x64 targets.

db0aba2

Moving all the threading operations to it's own file

b414599

github-actions bot added build Compilation issues ggml changes relating to the ggml tensor library for machine learning labels May 28, 2024

mofosyne added the Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level label May 28, 2024

slaren reviewed May 28, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ggml-threading.cpp #7576

ggml-threading.cpp #7576

kunnis commented May 27, 2024

kunnis commented May 28, 2024

slaren May 28, 2024

kunnis May 28, 2024

slaren May 28, 2024

		extern void set_numa_thread_affinity(int thread_n);
		extern void clear_numa_thread_affinity(void);

ggml-threading.cpp #7576

Are you sure you want to change the base?

ggml-threading.cpp #7576

Conversation

kunnis commented May 27, 2024

kunnis commented May 28, 2024

slaren May 28, 2024

Choose a reason for hiding this comment

kunnis May 28, 2024

Choose a reason for hiding this comment

slaren May 28, 2024

Choose a reason for hiding this comment