Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
trilogic
39 days ago
|
parent
|
context
|
favorite
| on:
Advanced Quantization Algorithm for LLMs
You can try it with this model here:
https://hugston.com/models/56tps-tested-autoround-qwen35-35b...
which is really well done and can run pretty fast with ctx up to 300k. Just 11.65 GB. Get the Mmproj also for vision/image processing.
khimaros
37 days ago
[–]
this link feels very spammy
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: