Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I also run a Qwen 3.6 moe A4B on old hardware. I set it up with

numactl --membind=1

so it is constrained to one of the memory sticks which speeds up token generation a little.

 help



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: