Graph-Structured Decoding 1 💡 Accelerating LLMs by 2x with Graph-structured Speculative Decoding. 2024년 8월 3일