path: root/hyperstack.fish
Age | Commit message | Author
2026-03-23 | Add vLLM watch dashboard, side-by-side layout, and insert-mode default | Paul Buetow
- hyperstack.rb: add VllmWatcher class and `watch` subcommand: a live terminal dashboard that polls all active VMs every 5 s via SSH and shows GPU util/VRAM/temp/power bars and vLLM throughput/requests/KV-cache/cache-hit bars aligned in a shared column layout
- draw(): render two or more VM panels side-by-side (horizontal) with a │ separator, padded to equal visible width; single VM falls back to vertical layout
- pi/agent/extensions/modal-editor: start in INSERT mode instead of NORMAL
- README: document watch command and update fish script rename

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
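The side-by-side layout described for draw() can be sketched roughly as follows. This is not the code from hyperstack.rb: the names `visible_width`, `pad_visible`, and `side_by_side` are hypothetical, and the sketch assumes "visible width" means the string length after stripping ANSI color escapes, with every character occupying one terminal column.

```ruby
# Illustrative sketch only; helper names are hypothetical, not from hyperstack.rb.

# Matches ANSI SGR (color/style) escape sequences such as "\e[32m".
ANSI_RE = /\e\[[0-9;]*m/

# Width of a string as it appears on screen, ignoring ANSI color codes.
# Assumes every remaining character is one column wide.
def visible_width(str)
  str.gsub(ANSI_RE, "").length
end

# Right-pad a string with spaces up to the given visible width.
def pad_visible(str, width)
  str + " " * [width - visible_width(str), 0].max
end

# Join panels (each an array of lines) horizontally with a separator.
# All panels are padded to the same visible width; a single panel is
# returned unchanged, which corresponds to the vertical fallback.
def side_by_side(panels, sep: " │ ")
  return [] if panels.empty?
  return panels.first.dup if panels.size == 1

  height = panels.map(&:size).max
  width  = panels.flatten.map { |line| visible_width(line) }.max || 0

  (0...height).map do |row|
    panels.map { |panel| pad_visible(panel[row] || "", width) }.join(sep)
  end
end

vm1 = ["vm1", "GPU \e[32m####\e[0m 80%"]
vm2 = ["vm2", "GPU ## 40%", "KV  #  10%"]
puts side_by_side([vm1, vm2])
```

Padding by visible width rather than raw string length keeps the │ separator in one column even when some lines carry color codes and others do not.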
2026-03-21 | Fix Nemotron OOM; add VM lifecycle fish abbrs; document automated setup | Paul Buetow
- hyperstack-vm1/vm2.toml: reduce nemotron-super max_model_len 262144→131072 and add --enforce-eager to disable CUDA graph capture (~3-4 GB overhead). Nemotron 120B weights (~60 GB) leave too little VRAM headroom for KV cache allocation and CUDA graph buffers at 262K context on a single A100 80GB. 131K context with eager mode is stable. README VRAM table updated to match.
- hyperstack.fish: add hyperstack-create/delete/test and hyperstack-create/delete-both abbreviations for VM lifecycle management alongside the existing pi-* aliases.
- README.md: add "Automated setup reference" section with single-VM and two-VM command flows before the manual vLLM Docker setup section.

End-to-end tested: single VM (GPT-OSS 120B), dual VM (Nemotron + Qwen3-Coder), pi queries on all three models; all passed.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
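The VRAM reasoning in this commit can be made concrete with a back-of-envelope calculation. The sketch below uses only the approximate figures quoted in the message (80 GB card, ~60 GB weights, up to ~4 GB for CUDA graphs). The constant and method names are hypothetical, and the model is deliberately crude: it ignores activations, allocator fragmentation, and how vLLM actually budgets GPU memory.

```ruby
# Rough VRAM budget in GB, using the approximate figures from the commit message.
# Names are hypothetical; this does not reproduce vLLM's real memory accounting.
TOTAL_VRAM_GB = 80   # single A100 80GB
WEIGHTS_GB    = 60   # approximate Nemotron 120B weights
CUDA_GRAPH_GB = 4    # upper end of the quoted ~3-4 GB capture overhead

# Headroom left for KV cache and everything else.
# With enforce_eager, CUDA graph capture is skipped, so its overhead is not paid.
def headroom_gb(enforce_eager:)
  overhead = enforce_eager ? 0 : CUDA_GRAPH_GB
  TOTAL_VRAM_GB - WEIGHTS_GB - overhead
end

# Ratio of the new maximum context length to the old one.
def context_ratio(new_len, old_len)
  new_len.to_f / old_len
end

puts "headroom with CUDA graphs: #{headroom_gb(enforce_eager: false)} GB"
puts "headroom in eager mode:    #{headroom_gb(enforce_eager: true)} GB"
puts "context ratio:             #{context_ratio(131_072, 262_144)}"
```

Under these assumptions, eager mode recovers about 4 GB of headroom and the new max_model_len is half the old one, which is consistent with the commit's observation that 131K context with eager mode is stable.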
2026-03-21 | Set single-VM default to GPT-OSS 120B; add hyperstack Pi provider and fish abbr | Paul Buetow
- hyperstack-vm.toml: switch [vllm] default from Qwen3-Coder-Next to openai/gpt-oss-120b (container_name, max_model_len=131072, tool_call_parser=''); labels already reflected gpt-oss-120b
- pi/agent/models.json: add 'hyperstack' provider pointing at hyperstack.wg1:11434/v1 with GPT-OSS 120B as primary model and all preset models registered (alongside hyperstack1/hyperstack2)
- hyperstack.fish: add pi-hyperstack abbreviation for single-VM GPT-OSS 120B
- README.md: update fish abbreviations table, provider table, VM config table, and Single-VM setup section to reflect the new defaults
2026-03-21 | add fish | Paul Buetow