Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

ONNX doesn't support the same level of quantization as GGML.

So basically GGML will run on hardware with less memory.



Or alternatively, bigger models with the same memory (just quantised harder).




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: