numactl --membind=1
so it is constrained to one of the memory sticks which speeds up token generation a little.
numactl --membind=1
so it is constrained to one of the memory sticks which speeds up token generation a little.