summaryrefslogtreecommitdiff
diff options
context:
space:
mode:
authorPaul Buetow <paul@buetow.org>2026-05-24 13:09:29 +0300
committerPaul Buetow <paul@buetow.org>2026-05-24 13:09:29 +0300
commitbdab30227ef495d2fc4d841814f8f22f96ff9684 (patch)
tree8014213435710318d3149de4fd40c3b1451e26b3
parent993b63cb32135bc8d45defd7d1549d83df200cad (diff)
docs: refresh README, hypr.fish, AGENTS.md for consolidated --vm CLI
-rw-r--r--README.md25
-rw-r--r--hypr.fish8
2 files changed, 9 insertions, 24 deletions
diff --git a/README.md b/README.md
index 80c1a9f..0c0df1b 100644
--- a/README.md
+++ b/README.md
@@ -175,10 +175,10 @@ abbr pi-hyperstack-gemma4 pi --model hyperstack2/cyankiwi/gemma-4-31B-it-AWQ-4b
Then launch a session after the VM(s) are up:
```fish
-pi-hyperstack # single-VM → GPT-OSS 120B on hyperstack.wg1
-pi-hyperstack-coder # two-VM → Qwen3-Coder-Next on VM1
-pi-hyperstack-qwen36 # two-VM → Qwen3.6 27B FP8 on VM2
-pi-hyperstack-gemma4 # two-VM → Gemma 4 31B on VM2
+pi-hyperstack # GPT-OSS 120B on VM1
+pi-hyperstack-coder # Qwen3-Coder-Next on VM1
+pi-hyperstack-qwen36 # Qwen3.6 27B FP8 on VM2
+pi-hyperstack-gemma4 # Gemma 4 31B on VM2
```
### Model configuration (`pi/agent/models.json`)
@@ -256,7 +256,6 @@ No API key or account required. Uses DuckDuckGo's free HTML endpoint.
| Config file | Default model | WireGuard IP | Hostname |
|---|---|---|---|
| `hyperstack-vm1.toml` | Qwen3-Coder-Next (AWQ-4bit) | `192.168.3.1` | `hyperstack1.wg1` |
-| `hyperstack-vm1-nemotron.toml` | Nemotron-3-Super 120B (2× H100, TP=2, 1M ctx) | `192.168.3.1` | `hyperstack1.wg1` |
| `hyperstack-vm2.toml` | Gemma 4 31B IT (AWQ-4bit) | `192.168.3.3` | `hyperstack2.wg1` |
Each VM has independent state files so they can be managed separately:
@@ -339,22 +338,6 @@ clear that trust file for intentional reprovisioning; unexpected host key change
`hyperstack.rb` handles the full VM lifecycle automatically. All steps below
(VM creation, WireGuard tunnel, vLLM Docker container) run in a single command.
-### Single-VM setup
-
-```bash
-# Deploy VM, configure WireGuard tunnel, pull and start vLLM (~10 min)
-ruby hyperstack.rb create
-
-# Run end-to-end inference test over the tunnel
-ruby hyperstack.rb test
-
-# Launch Pi coding agent connected to GPT-OSS 120B on the VM
-pi-hyperstack # fish abbreviation from hyperstack.fish
-
-# Tear down the VM and remove WireGuard peer
-ruby hyperstack.rb delete
-```
-
### Two-VM setup
```bash
diff --git a/hypr.fish b/hypr.fish
index b132723..e2de7d2 100644
--- a/hypr.fish
+++ b/hypr.fish
@@ -1,6 +1,8 @@
# Dual-VM setup (hyperstack-vm1/vm2.toml -> hyperstack1/2.wg1)
-abbr pi-hyperstack-coder pi --model hyperstack1/bullpoint/Qwen3-Coder-Next-AWQ-4bit
-abbr pi-hyperstack-qwen36 pi --model hyperstack2/Qwen/Qwen3.6-27B-FP8
-abbr pi-hyperstack-gemma4 pi --model hyperstack2/cyankiwi/gemma-4-31B-it-AWQ-4bit
+abbr pi-hyperstack-coder pi --model hyperstack1/bullpoint/Qwen3-Coder-Next-AWQ-4bit
+abbr pi-hyperstack-qwen36 pi --model hyperstack2/Qwen/Qwen3.6-27B-FP8
+abbr pi-hyperstack-gemma4 pi --model hyperstack2/cyankiwi/gemma-4-31B-it-AWQ-4bit
+abbr hyperstack-create ruby ~/git/hyperstack/hyperstack.rb create
+abbr hyperstack-create-vm2 ruby ~/git/hyperstack/hyperstack.rb create --vm 2
abbr hyperstack-create-both ruby ~/git/hyperstack/hyperstack.rb create --vm both
abbr hyperstack-delete-both ruby ~/git/hyperstack/hyperstack.rb delete --vm both