refactor(chat): unify ask-user flow on <question-form>, delete AskUserQuestion mechanism by elihahah666 · Pull Request #4114 · nexu-io/open-design

elihahah666 · 2026-06-10T17:33:54Z

Why

While building on Open Design I hit a broken UI: a chat message showed a
"Question" card whose text status badge ("Awaiting your answer") was stuffed
into the 24px .op-status icon badge and overflowed onto the title. Digging in,
the real problem was bigger than CSS — the repo carries two parallel
mechanisms for the same job of asking the user a clarifying question:

A — the AskUserQuestion SDK tool: an inline interactive card in the chat
stream (AskUserQuestionCard / StreamingAskUserQuestionCard), whose answer
is fed back into the still-open stream-json child via
POST /api/runs/:id/tool-result + submitChatRunToolResult.
B — the <question-form> artifact: a QuestionsBanner entry point in
chat, the form rendered in the right-hand Questions tab
(QuestionsPanel / QuestionFormView), answers returned as the next user
message.

Two visual languages and two interaction models for one concept, with A's card
visually broken, and an artificial split where A was pinned to mid-conversation
clarification and B to turn-1 discovery. This is the technical-debt / UX-fix that
the PR addresses: delete A, keep only B, and widen <question-form> to any
turn so mid-conversation clarification flows through the same
banner → Questions tab → next-user-message loop.

What users will see

The broken inline "Question" card is gone. Every clarifying question — whether
the first-turn discovery brief or a mid-conversation "you marked a region but
didn't say what to change" — now appears as the "Answer a few questions…"
banner in chat, opening the form in the right-hand Questions tab. One
consistent surface instead of two.
No behavior is gated behind a setting; this is the default ask-user flow.

Surface area

API / contract — removes the POST /api/runs/:id/tool-result endpoint
and the submitChatRunToolResult web caller (return-path deletion).
i18n keys — removes the tool.askQuestion* keys from types.ts and
all locales.
Default behavior change — clarifying questions render via the Questions
tab banner instead of an inline chat card.

(No new UI page/dialog is added — the surviving surface, QuestionsBanner +
Questions tab, already existed. The system-prompt change that widens
<question-form> to any turn is prompt-layer only; the front-end already
supported a question-form in any assistant message.)

Screenshots

Before: the broken inline card (status text overflowing the icon badge) — see
the originating report. After: clarifying questions surface as the existing
QuestionsBanner → Questions tab, validated live on an isolated worktree
runtime. No new UI chrome is introduced, so there is no new entry point to show
beyond the pre-existing banner.

Bug fix verification

This is a refactor that also removes a visual bug whose fix is structural
(deleting the broken card entirely), so the acceptance is the migrated test
suite plus human validation rather than a single red-spec:

apps/daemon/tests/chat-run-artifact-quiet-period.test.ts — the obsolete
pendingHostAnswers-pending assertion was removed and replaced with two specs
pinning the simplified stdin-close decision (closes on a non-tool_use clean
terminal turn; stays open on tool_use).
apps/daemon/tests/run-artifacts.test.ts — runAskedUserQuestion now asserts
detection of a <question-form> marker (incl. one split across text_delta
chunks), replacing the AUQ-tool-use detection.
Human-validated in the browser on an isolated worktree runtime: triggered a
mid-conversation clarification and confirmed the banner → Questions tab → reply
loop, and that the broken card no longer appears.

Validation

pnpm guard (54 pass), pnpm typecheck (all packages green).
pnpm --filter @open-design/web test, pnpm --filter @open-design/contracts test.
pnpm --filter @open-design/daemon test — 4351 pass / 4 skip / 4 todo,
346 files, exit 0.
mocks replay: pulled the Claude recording subset and replayed 04097377
(17-tool single turn) and 21747360 ([form answers — discovery]) through the
mock CLI — 0 parse failures, well-formed stream-json terminating on end_turn
(the clean-terminal path the simplified bookkeeping closes stdin on).

Adjacent / out of scope

docs/plans/2026-06-01-sandbox-orchestration-hardening-architecture.md item I6
proposes hardening POST /api/runs/:id/tool-result. That endpoint is deleted
by this PR, so I6 is now moot — left untouched here as a dated archival
plan; flagging it so a future contributor doesn't harden a removed endpoint.

…rQuestion mechanism There were two parallel "ask the user a clarifying question" mechanisms: A. The AskUserQuestion SDK tool — an inline interactive card in the chat stream (AskUserQuestionCard / StreamingAskUserQuestionCard), whose answer was fed back into the still-open stream-json child via POST /api/runs/:id/tool-result + submitChatRunToolResult. B. The <question-form> markdown artifact — a QuestionsBanner entry point in chat, the form rendered in the right-hand Questions tab (QuestionsPanel / QuestionFormView), answers returned as the next user message. Two visual languages for the same job, and mechanism A's card was visually broken (the text status "Awaiting your answer" was stuffed into the 24px .op-status icon badge and overflowed onto the title). Mechanism A was pinned to mid-conversation clarification while B was pinned to turn-1 discovery — an artificial split. This deletes mechanism A entirely and keeps only B, and widens <question-form> from "turn-1 discovery only" to ANY turn so mid-conversation clarification flows through the same banner → Questions tab → next user message loop. Web: - Delete apps/web/src/runtime/ask-user-question.ts (parser trio). - ToolCard.tsx: drop the AUQ card/streaming-card/dispatch/imports and the onAnswerToolUse / onSubmitForm props. - AssistantMessage.tsx: drop the live AUQ render path, suppressAskUserQuestionFallbackText, the AUQ name from SNAPSHOT_TOOL_NAMES, and the onSubmitForm bridge. Keep suppressDuplicateQuestionForms. - Delete the whole onSubmitForm / onAssistantFormSubmitStart return chain across AssistantMessage / ChatPane / ProjectView / SideChatTab, and submitChatRunToolResult in providers/daemon.ts. Daemon (conservative — keep the generic stream-json input skeleton): - applyClaudeStreamJsonRunBookkeeping: drop the AUQ detection branch and run.pendingHostAnswers; stdin now closes on any non-tool_use clean terminal turn. - Delete run-tool-results.ts, submitToolResultToRun, and the POST /api/runs/:id/tool-result endpoint. - run-artifacts.ts:runAskedUserQuestion now scans streamed text for a <question-form> marker (reassembled across text_delta chunks) instead of an AUQ tool_use, preserving the run_finished.asked_user_question analytics signal. - Keep --input-format stream-json / promptInputFormat: 'stream-json' as generic mid-turn input infra; refresh the comments that referenced AUQ. System prompt / contracts: - Replace the Claude-only AskUserQuestion clarification section with generic "emit <question-form> for mid-conversation clarification" guidance; relax the API-mode override wording; drop "Do not call AskUserQuestion" from the skip-discovery override. i18n / CSS: - Remove the tool.askQuestion* keys from types.ts + all locales. - Remove the op-ask-question* block and .op-status-awaiting from tools.css; keep .op-status (the icon badge) and the qf-* classes that mechanism B uses. Docs / tests: - AGENTS.md: rewrite the runtime-conventions + chat-UI sections; add an "Asking the user questions" section documenting the single mechanism. - Delete the AUQ-only tests; rewrite the quiet-period bookkeeping, run-artifacts, prompt, and plugin-folder tests to the new behavior. Validation: pnpm guard (54), full typecheck, web tests, daemon suite (4351 pass), contracts tests; replayed two Claude stream-json traces through the mock CLI (incl. the [form answers — discovery] trace) — 0 parse failures, clean end_turn termination.

github-actions · 2026-06-10T17:39:52Z

Visual regression review

Head: 0f1585a · Base: 4c8914d

11 changed · 24 unchanged · 0 missing baseline · 0 failed

Changed cases

Case	Main	PR	Diff
visual-home-plugin-use-staged
visual-home-plugin-use-with-query
visual-new-project-modal
visual-project-avatar-model-dropdown
visual-project-workspace
visual-settings-byok
visual-settings-byok-model-dropdown
visual-settings-byok-openai
visual-settings-execution
visual-settings-local-cli
visual-workspace-staged-contexts

Unchanged cases

Case	Main	PR	Diff
visual-avatar-local-agent-list
visual-avatar-menu
visual-design-system-detail
visual-design-systems
visual-home
visual-home-catalog
visual-home-context-picker
visual-home-plugin-filter
visual-home-staged-attachment
visual-integrations
visual-integrations-mcp
visual-integrations-use-everywhere
visual-onboarding-runtime
visual-plugin-details
visual-plugin-share-menu
visual-plugins
visual-projects
visual-projects-kanban
visual-settings-local-cli-model-dropdown
visual-tasks

Visual diff is advisory only and does not block merging.

Siri-Ray

@elihahah666 thanks for the thoughtful cleanup here. I reviewed the daemon stdin lifecycle changes, the question-form rendering path, the endpoint/caller deletion, and the prompt/analytics updates. I found one non-blocking consistency issue where the newly documented API/BYOK mirroring does not appear to be implemented yet; details inline.