summaryrefslogtreecommitdiff
path: root/snippets/hyperstack/hyperstack.rb
diff options
context:
space:
mode:
authorPaul Buetow <paul@buetow.org>2026-03-18 17:42:00 +0200
committerPaul Buetow <paul@buetow.org>2026-03-18 17:42:00 +0200
commita7d3d2d4339815cf4a39b58873069b07a0ac1d47 (patch)
treea8271bd320e846965b36fd8d430b4da3130d422d /snippets/hyperstack/hyperstack.rb
parentbda86a3c91b307e25507e975927c3dde38f65a74 (diff)
nemotron-super: revert to no tool calling; add nemotron_v3 reasoning parser
vLLM 0.17.1 has no tool call parser for Nemotron's custom XML format (<tool_call><function=...><parameter=...>). Setting llama3_json produced garbage output. Reverted to tool_call_parser="" with a clear comment. Added --reasoning-parser nemotron_v3 via extra_vllm_args so <think> tokens are properly exposed as reasoning_content in the API response. For agentic work requiring tool calls, switch to qwen3-coder-next or devstral. Co-Authored-By: Claude Sonnet 4.6 (1M context) <noreply@anthropic.com>
Diffstat (limited to 'snippets/hyperstack/hyperstack.rb')
0 files changed, 0 insertions, 0 deletions