Accelerating Retrieval-Augmented Language Model Serving with Speculation • Paper • 2401.14021 • Published Jan 25, 2024