Doesn't HuggingFace have dozens of freely available pretrained models like this (including GPT-2 in various sizes), and isn't the source available for most of them if you wanted to train them yourself?
All I see in the comments is praise for the author as a person, so just wondering what's unique about this that's not available elsewhere? 730 upvotes and counting, assuming I'm missing something...
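For reference, loading one of those pretrained checkpoints really is only a few lines with the transformers library. A rough sketch (model names and generation arguments here are illustrative and may vary by version):

```python
# Minimal sketch, assuming the `transformers` package: load a freely
# available pretrained GPT-2 checkpoint and sample from it.
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")   # also "gpt2-medium", "gpt2-large", "gpt2-xl"
model = GPT2LMHeadModel.from_pretrained("gpt2")

inputs = tokenizer("The meaning of life is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```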
True, but the use cases aren't the same. As he has done before for other models, he has a knack for distilling the code down to beautiful, self-contained examples of high didactic value.
It's an order of magnitude easier to grok the basics from this repo than from working through the (admittedly more ergonomic, performant, and production-ready) HuggingFace repos.
Additionally, as far as the streamlining nanoGPT purports to offer: HuggingFace's implementations play nicely with optimization toolchains such as ONNX/TensorRT, which will give you better performance than anything purely PyTorch-based, however minimal (a rough export sketch follows below).
That doesn't mean an ONNX-exported nanoGPT wouldn't be better still, but the field of optimized text generation isn't as new as people claim.
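As an illustration of that interoperability, here is a sketch using HuggingFace's optimum package to export GPT-2 to ONNX and run generation through ONNX Runtime. This assumes optimum is installed with the onnxruntime extra; the exact export argument name has changed across optimum versions:

```python
# Rough sketch, assuming `optimum[onnxruntime]` is installed: export a
# pretrained GPT-2 to ONNX and generate with ONNX Runtime instead of
# eager PyTorch.
from optimum.onnxruntime import ORTModelForCausalLM
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
# `export=True` converts the PyTorch checkpoint to ONNX on the fly
# (older optimum releases used `from_transformers=True` instead).
model = ORTModelForCausalLM.from_pretrained("gpt2", export=True)

inputs = tokenizer("Hello, my name is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```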
This is a didactic implementation. If you read the HuggingFace repo, it is much more abstracted because they implement many models in the same codebase. It's not fast or big, just easier to read and tweak.
minGPT prioritized being understandable above all else and was not very fast. This repo includes several optimizations, but it is still more understandable than probably any other open-source implementation.
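To give a flavor of the kind of optimization a small PyTorch codebase can pick up without hurting readability (a sketch assuming PyTorch 2.x, not the repo's actual code): wrapping a model in torch.compile fuses and JIT-compiles the forward pass in one line.

```python
# Sketch, assuming PyTorch 2.x: a one-line speedup that keeps the
# surrounding code fully readable. The model here is a stand-in, not
# the repo's GPT definition.
import torch
import torch.nn as nn

model = nn.TransformerEncoderLayer(d_model=768, nhead=12, batch_first=True)
model = torch.compile(model)  # JIT-compiles the forward pass on first call

x = torch.randn(8, 128, 768)
with torch.no_grad():
    y = model(x)
print(y.shape)
```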