大模型基础

大模型微调

PEFT

llama

Infra

https://arxiv.org/pdf/2307.06435.pdf

https://github.com/LLMBook-zh/LLMBook-zh.github.io

KV Cache

Multi-Token Prediction (MTP)