RES โ TFW-38: Quality Enforcement (Iteration 2)
Date: 2026-04-14
Author: Researcher
Status: ๐ฌ RES โ Complete
Parent HL: HL-TFW-38
Predecessor: RES iteration 1
Mode: Pipeline
Research Context
Iteration 2 explores the user's direction that review should be structured with domain-specific stages (like research has gather/extract/challenge), not a single-pass checklist. Resolves all open threads from iteration 1: staged review design, mode system, workflow classification, and diagram indexing format.
Decisions
| # |
Decision |
Rationale |
| D6 |
4-stage review flow: Comprehend โ Verify โ Assess โ Synthesize |
ISO verification/validation model mapped to TFW. Natural stages already appear in the best REVIEW files (HD PhaseF, TFW-19, TFW-36). Each stage has a distinct cognitive mindset: passive understanding โ active audit โ analytical judgment โ decision-making. |
| D7 |
3 review modes: code, prose, spec |
Current 9-point checklist is 44% code-specific. Steps-framework is 60% non-code tasks. Mode describes output type, not domain. Configures Assess-stage checklist items. Same selection pattern as research modes (focused/deep). |
| D8 |
Stages are sections in REVIEW artifact, not separate files |
Review is single-pass (no OODA loops). Creating stage files would add overhead for no value. ONB proves sections-in-one-file CAN enforce cognitive transitions if workflow describes mindset per section. Key difference from research: stages flow sequentially without WAIT gates. |
| D9 |
REVIEW template must restructure to match 4-stage flow |
Iteration 1 root cause: template-workflow disconnect causes skipping. Template and workflow MUST change together. New template: ยง1 Comprehend โ ยง2 Verify โ ยง3 Assess โ ยง4 Verdict โ ยง5 Tech Debt โ ยง6 Traces โ ยง7 Fact Candidates. |
| D10 |
Diagram collection = indexing in KNOWLEDGE.md ยง2, not copying |
Confirmed from iteration 1 D3. Format: table with Description, Location (RF/RES path + section), Type (mermaid/ASCII), Date. Collection triggered in docs.md after APPROVE. |
Hypothesis Status
| # |
Hypothesis |
Iter 1 |
Iter 2 |
Evidence |
| H1 |
Explicit ยง6-8 enumeration stops skipping |
๐ข |
๐ข (no change) |
96-100% skip rate confirmed empirically |
| H2 |
Audit step changes reviewer behavior |
๐ข |
๐ข superseded by H3 |
Audit step is now stage 2 (Verify) in the 4-stage model |
| H3 |
Domain-specific review stages produce more reliable reviews |
โ |
๐ข supported |
4-stage model maps to ISO V&V. 3 modes handle task type diversity. 44% code-specific checklist items โ mode eliminates N/A friction. Stages-as-sections pragmatism (D8) prevents overhead. |
Staged Review Design Summary
Flow
โโโโโโโโโโโโโโโ โโโโโโโโโโโโ โโโโโโโโโโโโ โโโโโโโโโโโโโโโ
โ COMPREHEND โ โ โ VERIFY โ โ โ ASSESS โ โ โ SYNTHESIZE โ
โ "Understand โ โ "Check โ โ "Judge โ โ "Decide and โ
โ claims" โ โ evidence"โ โ quality" โ โ capture" โ
โโโโโโโโโโโโโโโ โโโโโโโโโโโโ โโโโโโโโโโโโ โโโโโโโโโโโโโโโ
Read RF/TS/HL Open files Checklist Verdict +
Build mental Spot-check (mode-aware) Tech Debt +
model Re-run tests Philosophy Fact Candidates
Check evidence Standards Traces
Mode Checklist Configuration
| # |
Check |
code |
prose |
spec |
| 1 |
DoD met? |
โ
|
โ
|
โ
|
| 2 |
Quality |
Code quality |
Content quality |
Analytical quality |
| 3 |
Coverage |
Test coverage |
Source verification |
Source attribution |
| 4 |
Philosophy aligned |
โ
|
โ
|
โ
|
| 5 |
Tech debt |
โ
|
โ
|
โ
|
| 6 |
Domain gate A |
Security |
NDA compliance |
โ |
| 7 |
Domain gate B |
Breaking changes |
โ |
โ |
| 8 |
Style & standards |
โ
|
โ
|
โ
|
| 9 |
Observations collected |
โ
|
โ
|
โ
|
| 10 |
RF completeness (ยง6-8) |
โ
|
โ
|
โ
|
Universal (all modes): 6 items (#1, #4, #5, #8, #9, #10)
Mode-specific: 2-4 items โ total per review: 8-10
Verify Stage Actions by Mode
| Mode |
Verify actions |
code |
Open 2-3 files from RF ยง1, verify changes exist. Check test file presence. Re-run 1 test/build if possible. Cross-reference AC checkmarks against TS DoD. |
prose |
Verify deliverable files exist. Check structure matches spec. Check key sources/citations. |
spec |
Verify deliverable files exist. Check source citations are traceable. Verify data claims against primary sources. |
Key Design Distinction from Research
|
Research |
Review |
| Stage files |
Separate files (gather.md, extract.md, ...) |
Sections in REVIEW artifact |
| OODA loops |
Multiple per stage |
None โ single pass |
| WAIT gates |
Between each stage |
None โ continuous flow |
| Purpose |
Accumulate knowledge iteratively |
Apply judgment once |
| Output |
RES artifact |
REVIEW artifact |
| Mode timing |
Per-iteration |
Per-review |
HL Update Recommendations
| # |
What to update |
Source |
| 1 |
Phase A scope: add REVIEW template restructure (ยง1-ยง7 renumbering) alongside review.md workflow changes |
D9 |
| 2 |
Phase A scope: add review/ mode files (code.md, prose.md, spec.md) to .tfw/workflows/review/ |
D7 |
| 3 |
Phase A scope: add Step 0 (mode selection) to review.md |
D7 |
| 4 |
Phase B D6: diagram indexing format decided โ table in KNOWLEDGE.md ยง2 |
D10 |
| 5 |
Rename Phase A from "Workflow & Template Enforcement" to "Review Restructure + Handoff Enforcement" |
D6-D9 broaden Phase A scope |
| 6 |
Add H3 to HL ยง10 |
H3 |
Fact Candidates
| # |
Category |
Candidate |
Source |
Confidence |
| F6 |
philosophy |
Review stages should be sections in a single artifact (not separate files) because review is single-pass, unlike research which iterates. The cognitive transition comes from the workflow's mindset descriptions, not from file boundaries. ONB is proof: sections-in-one-file enforce cognitive transitions when the workflow explicitly describes the mindset per section. |
D8, Challenge C1 |
High |
| F7 |
philosophy |
User's "like research but not research" direction means: review should have structured cognitive phases with different goals per phase, but without the OODA loops, WAIT gates, and iteration mechanism that research uses. The structural parallel is in forced mindset transitions, not in process mechanics. |
User, 2026-04-14 |
High |
| F8 |
convention |
Review mode names should describe the output type being reviewed (code/prose/spec), not the domain (business/education/transit). Output-type naming is finite and domain-agnostic; domain naming creates infinite modes. |
Challenge C4 |
High |
| F9 |
process |
Current REVIEW checklist is 44% code-specific (4 of 9 items). Non-code tasks generate 4 N/A entries per review. Mode-specific checklists eliminate this friction while keeping 6 universal items. |
Gather G3 |
High |
Iteration Status
- Iteration: 2 of 2 (min) / 3 (max)
- Hypotheses tested: H1 (๐ข), H2 (๐ข โ superseded by H3), H3 (๐ข)
- Hypotheses deferred: None
- Gaps discovered: None
- Superseded decisions: D2 (audit step) โ superseded by D6 (staged review with Verify as stage 2)
Open Threads
| # |
Thread |
Status |
| 1 |
Staged review workflow |
โ
Resolved โ D6, D7, D8, D9 |
| 2 |
Two classes of TFW workflows |
โ
Resolved โ review gets staged structure, handoff/docs stay procedural |
| 3 |
Diagram indexing format |
โ
Resolved โ D10 |
Recommendation
- [x] SUFFICIENT โ proceed to
/tfw-plan to update HL and write TS
- [ ] MORE NEEDED
All open threads from iteration 1 are resolved. The staged review design (D6-D9) is concrete enough for TS. Handoff enforcement (D1, D4) was ready since iteration 1. Diagram indexing (D10) format is defined.
Conclusion
Iteration 2 designed a 4-stage domain-agnostic review flow (Comprehend โ Verify โ Assess โ Synthesize) with 3 output-type modes (code, prose, spec). The design maintains TFW's domain independence by naming modes after output types, not domains. Stages live as sections within the existing REVIEW artifact (no separate files) โ the cognitive transition is enforced by the workflow's explicit mindset descriptions, not file boundaries. Combined with iteration 1's confirmed enforcement fixes (ยง6-8 enumeration in handoff.md, Findings Map mandate in research/base.md), TFW-38 has a complete research foundation for TS specification.
RES โ TFW-38: Quality Enforcement (Iteration 2) | 2026-04-14