Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
danielhanchen
12 days ago
|
parent
|
context
|
favorite
| on:
Unsloth Dynamic 2.0 GGUFs
Oh I didn't expect this to be on HN haha - but yes for our new benchmarks for Qwen3.5, we devised a slightly different approach for quantization which we plan to roll out to all new models from now on!
help
nnx
12 days ago
|
next
[ā]
Can you describe what is this slightly different approach and why it should work on all models?
reply
hedora
12 days ago
|
prev
[ā]
Nice! Your stuff ran LLMs extremely well on < $500 boxes (24-32GB ram) with iGPUS before this update.
Iām eager to try it out, especially if 16GB is viable now.
reply
gundmc
11 days ago
|
parent
[ā]
The 5080 is 16GB VRAM, not system memory. I don't think you can get 24-32GB VRAM in a $500 box
reply
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: