Our paper, "SirLLM: Streaming Infinite Retentive LLM", was accepted to ACL 2024. It presents a streaming framework that enables long-context retention with efficient state updates.