I reviewed the TensorRT-LLM commit history from the past few days and couldn't find any updates regarding Gemma 4 support. By contrast, here is the reference for MAX: https://github.com/modular/modular/commit/57728b23befed8f3b4...


If OP meant they have the fastest implementation of Gemma 4 on Blackwell at the moment, I guess that is technically true. I doubt that will hold up when TensorRT-LLM finishes their implementation though.


How is the sglang performance on Blackwell for this model?


Dunno, but there's a PR for it. Probably also more performant than Modular.



