Diffstat (limited to 'TODO.md')
-rw-r--r-- TODO.md 4
1 files changed, 4 insertions, 0 deletions
diff --git a/TODO.md b/TODO.md
index 20f78c1..51cd5d1 100644
--- a/TODO.md
+++ b/TODO.md
@@ -17,6 +17,10 @@ Updated tests accordingly.
Phase 2: Debounce completion requests: Introduce a configurable delay (e.g., 100–500 ms) before sending a completion request to the LLM. This prevents a flood of calls while typing.
+Status: Done — added `completion_debounce_ms` (default 200). Server waits until
+no recent input activity for at least this duration before LLM calls (both chat
+and provider-native paths). Added unit test `TestCompletionDebounce_WaitsUntilQuiet`.
+
Phase 3: Throttle on the server side: Beyond debouncing, implement request throttling to cap the maximum rate of LLM calls (e.g., one per 500 ms). This is especially useful when debounce alone isn’t enough under rapid editing.