Stabilize local agent auto model dump E2E#3712
Conversation
Bugbot couldn't run - usage limit reachedBugbot is counted against Cursor usage for this user or team, and this run hit a usage or spend limit. A user or team admin can review and increase usage limits in the Cursor dashboard. (requestId: serverGenReqId_4ed329e5-799a-48a8-91c8-338cf5d375a5) |
There was a problem hiding this comment.
Pull request overview
This PR deflakes the βlocal-agent - auto modelβ E2E by removing a race where request-dump snapshots could alternate between code_search and explore_code depending on code-explorer indexing readiness.
Changes:
- Pins
enableCodeExplorer: falseviaset-user-settingsin the auto-model dump E2E and waits for the persisted setting before sending[dump]. - Documents the same stabilization pattern in
rules/e2e-testing.mdfor other request-dump specs not explicitly testingexplore_code.
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.
| File | Description |
|---|---|
| rules/e2e-testing.md | Adds a new E2E gotcha entry describing how to keep request-dump snapshots stable by pinning enableCodeExplorer off and polling persisted settings. |
| e2e-tests/local_agent_auto.spec.ts | Disables code explorer and polls po.settings.recordSettings().enableCodeExplorer before issuing the dump prompt to prevent snapshot/tool-list flakiness. |
π‘ Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.
π Dyadbot Code Review SummaryVerdict: β
YES - Ready to merge Small, well-scoped test stabilization PR. Two files changed (10 additions, 0 deletions) β all test/docs only, zero production code paths affected. What it does: Pins Correctness: The IPC call uses the validated Security: No new IPC channels or renderer-side capabilities introduced. The test calls an existing, validated endpoint with a well-typed payload. Code Health: Follows the established convention for similar local-agent E2E specs. The documentation bullet is clear, actionable, and placed adjacent to the related UX: No user-facing changes. β No issues found by persona-based review. Generated by Dyadbot persona-based code review |
π Playwright Test Resultsβ All tests passed!
Total: 1070 tests passed (6 flaky) (340 skipped)
|
Summary
Verification
Failing run investigated: https://github.com/dyad-sh/dyad/actions/runs/28407790114
#skip-bugbot
Note
Low Risk
Test-only and documentation changes with no production code paths affected.
Overview
Stabilizes the local-agent auto model request-dump E2E by turning
enableCodeExploreroff viaset-user-settingsand polling persisted settings before the[dump]prompt, so the serialized request snapshot does not race betweencode_searchandexplore_codewhen code-explorer indexing finishes at different times on CI vs locally.Documents the same pattern in
rules/e2e-testing.mdfor other local-agent request-dump specs that are not explicitly testingexplore_code.Reviewed by Cursor Bugbot for commit 600a84f. Bugbot is set up for automated code reviews on this repo. Configure here.