If you want to understand current LLM craze, check out this talk by Andrej Karpathy:
[converting and quantizing a model in the background while watching the video]
If you want to understand current LLM craze, check out this talk by Andrej Karpathy:
[converting and quantizing a model in the background while watching the video]