Every model has a context window. This is the amount of conversation, file content, and tool output the model can consider for one reply.
Babbily shows a context indicator near the composer so you can see how full the current chat is.
What the indicator means
The context indicator estimates how much of the active model’s window your chat is using.
- Used shows the percentage consumed.
- Usage shows the number of tokens used out of the model’s total window.
- Remaining shows the approximate space left for the next message and reply.
- Auto compact shows when Babbily may summarize older parts of the chat.
The indicator updates as you send messages, attach files, receive replies, and switch models.
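As a rough illustration of how the Used, Usage, and Remaining values relate to each other, here is a minimal sketch in Python. It is not Babbily's implementation: the `ContextUsage` class, its field names, and the token counts are all hypothetical, and the real indicator is an estimate that may be computed differently.

```python
from dataclasses import dataclass


@dataclass
class ContextUsage:
    """Hypothetical container for the numbers behind a context indicator."""

    used_tokens: int    # tokens the chat currently occupies (assumed value)
    window_tokens: int  # size of the active model's window (assumed value)

    @property
    def used_percent(self) -> float:
        # "Used": share of the window consumed, as a percentage.
        return round(100 * self.used_tokens / self.window_tokens, 1)

    @property
    def remaining_tokens(self) -> int:
        # "Remaining": approximate space left, never below zero.
        return max(self.window_tokens - self.used_tokens, 0)


usage = ContextUsage(used_tokens=50_000, window_tokens=200_000)
print(f"Used: {usage.used_percent}%")
print(f"Usage: {usage.used_tokens:,} / {usage.window_tokens:,} tokens")
print(f"Remaining: ~{usage.remaining_tokens:,} tokens")
```

With these example numbers, a quarter of the window is used and three quarters remain for the next message and reply.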
Automatic compaction
When a long chat approaches the model’s limit, Babbily can compact older conversation history into a shorter summary. Your visible chat history remains available, and the current conversation can keep going.
Compaction helps:
- Preserve recent messages in detail.
- Keep the thread within the active model’s limit.
- Continue long-running work without forcing a new chat immediately.
A compacted summary is still part of the model’s context. Your usage percentage may drop, but it will not reset to zero.
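The idea behind compaction can be sketched as follows. This is a simplified illustration under stated assumptions, not a description of how Babbily compacts a chat: `estimate_tokens`, `compact`, and `toy_summarize` are hypothetical helpers, the four-characters-per-token estimate is a rough rule of thumb, and a real summary would be written by a model rather than a placeholder string.

```python
from typing import Callable


def estimate_tokens(text: str) -> int:
    # Rough assumption: about four characters per token.
    return max(1, len(text) // 4)


def compact(
    messages: list[str],
    keep_recent: int,
    summarize: Callable[[list[str]], str],
) -> list[str]:
    """Replace all but the last `keep_recent` messages with one summary."""
    if keep_recent < 1:
        raise ValueError("keep_recent must be at least 1")
    if len(messages) <= keep_recent:
        return list(messages)  # nothing old enough to compact
    older, recent = messages[:-keep_recent], messages[-keep_recent:]
    return [summarize(older)] + recent


def toy_summarize(older: list[str]) -> str:
    # Placeholder summary; stands in for a model-written one.
    return f"Summary of {len(older)} earlier messages."


chat = [f"message {i}: " + "details " * 50 for i in range(10)]
before = sum(estimate_tokens(m) for m in chat)

compacted = compact(chat, keep_recent=3, summarize=toy_summarize)
after = sum(estimate_tokens(m) for m in compacted)

print(f"Tokens before: {before}, after: {after}")
```

The recent messages are kept word for word, and the summary still takes up some tokens, which is why usage falls after compaction but does not return to zero.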
Switching models
Different models support different context sizes. If you switch to a smaller model, Babbily recalculates the thread against that model’s limit. If needed, Babbily may compact the thread before the next reply.
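To see why the same thread can look much fuller after a switch, consider this small sketch. The model names, window sizes, and the 90% threshold are all hypothetical values chosen for illustration; Babbily's actual limits and compaction trigger are not stated here.

```python
# Hypothetical window sizes, in tokens, for two made-up models.
MODEL_WINDOWS = {"large-model": 200_000, "small-model": 32_000}

# Hypothetical fraction of the window at which compaction is considered.
COMPACT_THRESHOLD = 0.9


def recalculate(used_tokens: int, model: str) -> tuple[float, bool]:
    """Return the fraction of the window used and whether compaction may apply."""
    window = MODEL_WINDOWS[model]
    fraction = used_tokens / window
    return fraction, fraction >= COMPACT_THRESHOLD


thread_tokens = 30_000  # the thread itself does not change on a switch

fraction_large, compact_large = recalculate(thread_tokens, "large-model")
fraction_small, compact_small = recalculate(thread_tokens, "small-model")

print(f"large-model: {fraction_large:.0%} used, compact: {compact_large}")
print(f"small-model: {fraction_small:.0%} used, compact: {compact_small}")
```

In this example the thread uses a small share of the larger window but most of the smaller one, so only the switch to the smaller model crosses the assumed threshold.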
When a chat gets too full
If a thread reaches the limit, Babbily may ask you to start a new thread. Memory and connectors can still help you carry important context forward.
Long files, deep research output, and pasted code can fill context quickly. Start a new thread when your topic changes or when you no longer need earlier details.