New LLM Algorithms - a BHbean Collection

BHbean 's Collections

LoRA

LLM Training Systems

Survey

MoE LLM Systems

LLM resource-constrained Inference

New LLM Algorithms

LLM Internal Mechanism

Prompt Engineering

KV Cache Compression

LLM reasoning systems

Speculative Decoding

New LLM Algorithms

updated Jul 8

Multi-Token Attention

Paper • 2504.00927 • Published Apr 1 • 55