LLM Fundamentals
LLM Fine-Tuning
PEFT
Llama
Infra
https://arxiv.org/pdf/2307.06435.pdf
https://github.com/LLMBook-zh/LLMBook-zh.github.io
KV Cache
Multi-Token Prediction (MTP)