conf - Configuration files for the automation of my personal infrastructure (servers, laptops, workstations, phones)!

Age	Commit message (Collapse)	Author
2026-04-08	frontends: add garage.f3s.buetow.org to @f3s_hosts (task 8)	Paul Buetow
	Include garage in f3s host list so DNS, TLS (acme), and httpd/relayd templates generate config for the new hostname. Made-with: Cursor
2026-04-07	dserver: replace broken newsyslog rotation with daily.local find cleanup	Paul Buetow
	Amp-Thread-ID: https://ampcode.com/threads/T-019d6727-d603-72c5-97a0-c1e419211767 Co-authored-by: Amp <amp@ampcode.com>
2026-04-06	immich: fix chart value structure - image tag under server/ml controllers, ↵	Paul Buetow
	remove duplicate controllers.server Amp-Thread-ID: https://ampcode.com/threads/T-019d6154-8fdf-74fe-b865-f796d8a4214a Co-authored-by: Amp <amp@ampcode.com>
2026-04-06	immich: fix ML config to use chart's machine-learning.controllers.main structure	Paul Buetow
	Amp-Thread-ID: https://ampcode.com/threads/T-019d6154-8fdf-74fe-b865-f796d8a4214a Co-authored-by: Amp <amp@ampcode.com>
2026-04-06	immich: tune ML throughput - add postgres anti-affinity, increase intra-op ↵	Paul Buetow
	threads, increase worker timeout Amp-Thread-ID: https://ampcode.com/threads/T-019d6154-8fdf-74fe-b865-f796d8a4214a Co-authored-by: Amp <amp@ampcode.com>
2026-04-05	immich: relax postgres probes and add resource limits	Paul Buetow
	- Increase liveness probe tolerance (60s delay, 30s period, 10s timeout, 6 failures) - Increase readiness probe tolerance (15s delay, 10s period, 5s timeout, 6 failures) - Add resource requests (100m CPU, 512Mi RAM) and limits (2Gi RAM) - Fixes crash loop caused by probe killing postgres during recovery Amp-Thread-ID: https://ampcode.com/threads/T-019d5f54-27f2-740c-ac41-0f980e7aecd3 Co-authored-by: Amp <amp@ampcode.com>
2026-04-04	fix(immich): use dual-style values for resources and affinity to ensure they ↵	Paul Buetow
	apply
2026-04-04	fix(immich): use correctly nested controllers structure for affinity and 4Gi ↵	Paul Buetow
	resources
2026-04-04	fix(immich): increase memory limits to 4Gi to avoid OOMKilled for ML	Paul Buetow

2026-04-04	feat(immich): add preferred anti-affinity and resources to balance load	Paul Buetow

2026-04-01	immich: separate PVs for videos RO/RW to avoid dual-PVC mount issue	Paul Buetow
	Amp-Thread-ID: https://ampcode.com/threads/T-019d47a3-2deb-75c3-8a75-b0f39006a35d Co-authored-by: Amp <amp@ampcode.com>
2026-04-01	immich: per-user external library mounts with RO/RW separation	Paul Buetow
	Amp-Thread-ID: https://ampcode.com/threads/T-019d47a3-2deb-75c3-8a75-b0f39006a35d Co-authored-by: Amp <amp@ampcode.com>
2026-04-01	immich: use bjw-s persistence for external library mount	Paul Buetow
	Amp-Thread-ID: https://ampcode.com/threads/T-019d47a3-2deb-75c3-8a75-b0f39006a35d Co-authored-by: Amp <amp@ampcode.com>
2026-04-01	immich: replace yoga videos with general external library mount	Paul Buetow
	Amp-Thread-ID: https://ampcode.com/threads/T-019d47a3-2deb-75c3-8a75-b0f39006a35d Co-authored-by: Amp <amp@ampcode.com>
2026-03-29	Add newsyslog rotation for dserver logs	Paul Buetow

2026-03-28	Add OpenBSD build VM and dtail package infrastructure	Paul Buetow
	Add a QEMU/KVM OpenBSD VM for native compilation of CGo packages (e.g. dtail with DataDog/zstd). The VM is fully automated via expect driving the serial console installer. - packages/buildvm/: setup, provision, start, stop scripts and expect installer - packages/scripts/pkg-dtail-openbsd.sh: multi-binary package with signify signing - packages/Makefile: build VM management and dtail-openbsd target using git archive - frontends/Rexfile: dtail_install task uses custom pkg repo, dtail task enabled Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-28	Move package build/upload scripts from gogios Magefile to conf/packages	Paul Buetow
	Packaging logic is now OS-agnostic shell scripts + Makefile, reusable for any Go project. Cross-compiles locally, SCPs to target host for native packaging, and uploads to the PV. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-28	Sign OpenBSD packages with signify, drop -D unsigned	Paul Buetow
	Packages are now signed via pkg_sign with the custom-pkg signify key on the OpenBSD build host. The public key at /etc/signify/custom-pkg.pub on each client allows pkg_add to verify without -D unsigned. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-28	frontends: install gogios from pkg repo, add pkgrepo_setup task	Paul Buetow
	Replace manual binary copy in gogios_install with pkg install (FreeBSD) and pkg_add (OpenBSD). Add pkgrepo_setup task that configures PKG_PATH in root's .profile on OpenBSD frontends. The gogios task now calls gogios_install automatically. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-28	pkgrepo: fix test package build scripts for FreeBSD and OpenBSD	Paul Buetow
	FreeBSD: use -p plist flag so files are actually included in the package. OpenBSD: use -D COMMENT flag and separate desc file as required by pkg_create, auto-detect OS version for repo path. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-28	pkgrepo: fix health probe path to /healthz	Paul Buetow
	The root path returns 404 by design, so probes need a dedicated /healthz endpoint that returns 200. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-28	pkgrepo: add FreeBSD/OpenBSD package repository service	Paul Buetow
	Serve custom-built FreeBSD and OpenBSD packages via nginx in the k3s cluster. Includes helm chart, ArgoCD app, test artifact build script, and DNS entry via frontends Rexfile. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
2026-03-22	fix: correct NFS sentinel filename in immich-postgres init container	Paul Buetow
	The wait-for-nfs init container was checking for nfs.DO_NOT_REMOVE but the actual file on disk is k3svolumes.DO_NOT_REMOVE. This caused every new pod from the rolling update to be permanently stuck in Init:0/1, leaving two postgres pods running indefinitely (old + stuck new).
2026-03-22	immich: add NFS mount check init container to postgres	Paul Buetow
	Amp-Thread-ID: https://ampcode.com/threads/T-019d14d5-4dbf-71a7-a619-d9c5afed3f7c Co-authored-by: Amp <amp@ampcode.com>
2026-03-21	Remove obsolete documentation snippets	Paul Buetow

2026-03-21	moved	Paul Buetow

2026-03-20	Parallelize delete-both VM teardown	Paul Buetow

2026-03-20	Add Pi VM launcher scripts	Paul Buetow

2026-03-20	Add project Pi VM model switching config	Paul Buetow

2026-03-20	fix wireguard setup ssh host pinning	Paul Buetow

2026-03-20	task 301: extract provisioning collaborators	Paul Buetow

2026-03-20	task 300: persist effective service mode	Paul Buetow

2026-03-20	task 299: clean up local state on delete	Paul Buetow

2026-03-20	task 298: pin SSH host keys per VM state	Paul Buetow

2026-03-20	task 297: lock down default ingress rules	Paul Buetow

2026-03-20	Remove peers by allowed IPs from local WireGuard config	Paul Buetow

2026-03-20	Initial commit: add hyperstack-vm1.toml, hyperstack-vm2.toml, update ↵	Paul Buetow
	hyperstack.rb and wg1-setup.sh for multi-VM WireGuard support
2026-03-18	vllm: skip docker pull on model switch, persist torch compile cache	Paul Buetow
	- model switch now passes pull_image: false to avoid surprise multi-GB image downloads when the upstream vLLM image was updated upstream; docker pull is still run on initial install (pull_image: true default) - mount /ephemeral/vllm_cache → /root/.cache/vllm so torch.compile artifacts survive container restarts; saves ~30-60 s on warm switches - add vllm_compile_cache_dir helper (sibling of hug_cache_dir) Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>
2026-03-18	gpt-oss-120b: revert to 131072 — hard architecture limit	Paul Buetow
	max_position_embeddings=131072 in model config.json; exceeding it causes NaN/CUDA OOB. 163840 was rejected by vLLM at startup. The 135K error requires starting a fresh opencode conversation instead. Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>
2026-03-18	gpt-oss-120b: raise max_model_len to 163840 (160K)	Paul Buetow
	131K was still too small — observed 135K token conversations in practice. Physical KV capacity is 168K blocks so 160K is safe without OOM. Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>
2026-03-18	gpt-oss-120b: raise max_model_len to 131072	Paul Buetow
	MXFP4 KV cache is compact enough that vLLM allocated 168K token blocks (10560×16) at 0.92 utilization — the 40K limit was too conservative and caused negative max_tokens errors in long Claude Code sessions. Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>
2026-03-18	fix: handle bundler self_manager.rb error with Errno::ENOENT	Paul Buetow

2026-03-18	fix: refactor CLI help to DRY up duplicated code	Paul Buetow

2026-03-18	cli: show help when called without arguments	Paul Buetow

2026-03-18	refactor: Split Config class per SRP	Paul Buetow
	- Created ConfigLoader for TOML loading and validation - Kept Config for configuration value access only - Reduced Config from 489 lines to ~200 lines - Fixed CLI to use ConfigLoader and pass @path to Config
2026-03-18	hyperstack status: display active vLLM model	Paul Buetow
	Show the currently loaded model (from state file, or config default) so it's immediately visible without running `model list`. Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>
2026-03-18	nemotron-super: set max_model_len=262144 (256K); document NoPE and OOM risk	Paul Buetow
	Tested 1M context (NoPE allows arbitrary max_position_embeddings without YaRN) — OOMs on A100 80GB due to insufficient VRAM after 60GB model weights. 256K (262144) is the practical ceiling on this hardware. Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>
2026-03-18	nemotron-super: use qwen3_xml tool call parser — same XML format, works	Paul Buetow
	Both Nemotron and Qwen3-XML use identical <tool_call><function=name> <parameter=p>value</parameter></function></tool_call> format. qwen3_xml correctly parses Nemotron's output; tool calling now works with opencode and other API clients. Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>
2026-03-18	nemotron-super: revert to no tool calling; add nemotron_v3 reasoning parser	Paul Buetow
	vLLM 0.17.1 has no tool call parser for Nemotron's custom XML format (<tool_call><function=...><parameter=...>). Setting llama3_json produced garbage output. Reverted to tool_call_parser="" with a clear comment. Added --reasoning-parser nemotron_v3 via extra_vllm_args so <think> tokens are properly exposed as reasoning_content in the API response. For agentic work requiring tool calls, switch to qwen3-coder-next or devstral. Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>
2026-03-18	Fix nemotron-super tool_call_parser; auto-clear WireGuard hostname from ↵	Paul Buetow
	known_hosts - hyperstack-vm.toml: set tool_call_parser=llama3_json for nemotron-super so vLLM accepts tool_choice requests from opencode; model won't spontaneously call tools so the vLLM 0.17.1 token_ids crash in llama3_json won't trigger - hyperstack.rb: wait_for_ssh now also removes the WireGuard hostname (hyperstack.wg1) from known_hosts alongside the IP, preventing StrictHostKeyChecking failures across VM recreates Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>