Continue.dev accumulates significant context across coding sessions. Adding a prompt compression layer could reduce LLM token costs by ~65% by scoring each line of context against the current query and dropping irrelevant lines before sending to the LLM.
SuperCompress is an open-source (MIT) learned compression engine that runs in ~60ms on CPU: https://supercompress.vercel.app
This would be especially valuable for users with long Hub sessions where context accumulates across many turns.
Continue.dev accumulates significant context across coding sessions. Adding a prompt compression layer could reduce LLM token costs by ~65% by scoring each line of context against the current query and dropping irrelevant lines before sending to the LLM.
SuperCompress is an open-source (MIT) learned compression engine that runs in ~60ms on CPU: https://supercompress.vercel.app
This would be especially valuable for users with long Hub sessions where context accumulates across many turns.