fix: trim 3 SKILL.md files to comply with Anthropic 500-line limit

Per Anthropic docs: "Keep SKILL.md under 500 lines. Move detailed reference material to separate files."

- browser-automation: 564 → 266 lines (moved examples to references/)
- spec-driven-workflow: 586 → 333 lines (moved full spec example to references/)
- security-pen-testing: 850 → 306 lines (condensed OWASP/attack details, moved to references/)

No content deleted; all moved to existing reference files with pointers.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
2026-03-25 15:20:47 +01:00
parent 268061b0fd
commit f352e8cdd0
5 changed files with 298 additions and 1205 deletions
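
The 500-line limit quoted in the commit message can be checked mechanically before committing. The following is a minimal sketch, not part of this repository: the function name `find_oversized_skills` and the constant `LINE_LIMIT` are hypothetical, and it assumes every skill file is named `SKILL.md` somewhere under a single root directory.

```python
from pathlib import Path

# Limit quoted in the commit message ("Keep SKILL.md under 500 lines").
LINE_LIMIT = 500


def find_oversized_skills(root, limit=LINE_LIMIT):
    """Return (path, line_count) pairs for SKILL.md files at or over the limit.

    Hypothetical helper: walks `root` recursively and counts lines in every
    file named SKILL.md. "Under 500 lines" is read as strictly fewer than
    `limit`, so a file with exactly `limit` lines is reported.
    """
    oversized = []
    for path in sorted(Path(root).rglob("SKILL.md")):
        with path.open(encoding="utf-8") as handle:
            count = sum(1 for _ in handle)
        if count >= limit:
            oversized.append((str(path), count))
    return oversized


if __name__ == "__main__":
    # Report offenders for the current directory; exit non-zero if any exist.
    offenders = find_oversized_skills(".")
    for path, count in offenders:
        print(f"{path}: {count} lines (limit {LINE_LIMIT})")
    raise SystemExit(1 if offenders else 0)
```

Run from the repository root, a script like this could serve as a pre-commit or CI check, so that a skill file growing past the limit is caught before review.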
--- a/engineering/browser-automation/SKILL.md
+++ b/engineering/browser-automation/SKILL.md
@@ -33,9 +33,6 @@ The Browser Automation skill provides comprehensive tools and knowledge for buil

 ### 1. Web Scraping Patterns

-#### DOM Extraction with CSS Selectors
-CSS selectors are the primary tool for element targeting. Prefer them over XPath for readability and performance.
-
 **Selector priority (most to least reliable):**
 1. `data-testid`, `data-id`, or custom data attributes — stable across redesigns
 2. `#id` selectors — unique but may change between deploys
@@ -43,365 +40,70 @@ CSS selectors are the primary tool for element targeting. Prefer them over XPath
 4. Class-based: `.product-card`, `.price` — brittle if classes are generated (e.g., CSS modules)
 5. Positional: `nth-child()`, `nth-of-type()` — last resort, breaks on layout changes

-**Compound selectors for precision:**
-```python
-# Product cards within a specific container
-page.query_selector_all("div.search-results > article.product-card")
+Use XPath only when CSS cannot express the relationship (e.g., ancestor traversal, text-based selection).

-# Price inside a product card (scoped)
-card.query_selector("span[data-field='price']")
-
-# Links with specific text content
-page.locator("a", has_text="Next Page")
-```
-
-#### XPath for Complex Traversal
-Use XPath only when CSS cannot express the relationship:
-```python
-# Find element by text content (XPath strength)
-page.locator("//td[contains(text(), 'Total')]/following-sibling::td[1]")
-
-# Navigate up the DOM tree
-page.locator("//span[@class='price']/ancestor::div[@class='product']")
-```
-
-#### Pagination Patterns
- **Next-button pagination**: Click "Next" until disabled or absent
- **URL-based pagination**: Increment `?page=N` or `&offset=N` in URL
- **Infinite scroll**: Scroll to bottom, wait for new content, repeat until no change
- **Load-more button**: Click button, wait for DOM mutation, repeat
-
-#### Infinite Scroll Handling
-```python
-async def scroll_to_bottom(page, max_scrolls=50, pause_ms=1500):
-    previous_height = 0
-    for i in range(max_scrolls):
-        current_height = await page.evaluate("document.body.scrollHeight")
-        if current_height == previous_height:
-            break
-        await page.evaluate("window.scrollTo(0, document.body.scrollHeight)")
-        await page.wait_for_timeout(pause_ms)
-        previous_height = current_height
-    return i + 1  # number of scrolls performed
-```
+**Pagination strategies:** next-button, URL-based (`?page=N`), infinite scroll, load-more button. See [data_extraction_recipes.md](references/data_extraction_recipes.md) for complete pagination handlers and scroll patterns.

 ### 2. Form Filling & Multi-Step Workflows

-#### Login Flows
-```python
-async def login(page, url, username, password):
-    await page.goto(url)
-    await page.fill("input[name='username']", username)
-    await page.fill("input[name='password']", password)
-    await page.click("button[type='submit']")
-    # Wait for navigation to complete (post-login redirect)
-    await page.wait_for_url("**/dashboard**")
-```
+Break multi-step forms into discrete functions per step. Each function fills fields, clicks "Next"/"Continue", and waits for the next step to load (URL change or DOM element).

-#### Multi-Page Forms
-Break multi-step forms into discrete functions per step. Each function:
-1. Fills the fields for that step
-2. Clicks the "Next" or "Continue" button
-3. Waits for the next step to load (URL change or DOM element)
-
-```python
-async def fill_step_1(page, data):
-    await page.fill("#first-name", data["first_name"])
-    await page.fill("#last-name", data["last_name"])
-    await page.select_option("#country", data["country"])
-    await page.click("button:has-text('Continue')")
-    await page.wait_for_selector("#step-2-form")
-
-async def fill_step_2(page, data):
-    await page.fill("#address", data["address"])
-    await page.fill("#city", data["city"])
-    await page.click("button:has-text('Continue')")
-    await page.wait_for_selector("#step-3-form")
-```
-
-#### File Uploads
-```python
-# Single file
-await page.set_input_files("input[type='file']", "/path/to/file.pdf")
-
-# Multiple files
-await page.set_input_files("input[type='file']", [
-    "/path/to/file1.pdf",
-    "/path/to/file2.pdf"
-])
-
-# Drag-and-drop upload zones (no visible input element)
-async with page.expect_file_chooser() as fc_info:
-    await page.click("div.upload-zone")
-file_chooser = await fc_info.value
-await file_chooser.set_files("/path/to/file.pdf")
-```
-
-#### Dropdown and Select Handling
-```python
-# Native <select> element
-await page.select_option("#country", value="US")
-await page.select_option("#country", label="United States")
-
-# Custom dropdown (div-based)
-await page.click("div.dropdown-trigger")
-await page.click("div.dropdown-option:has-text('United States')")
-```
+Key patterns: login flows, multi-page forms, file uploads (including drag-and-drop zones), native and custom dropdown handling. See [playwright_browser_api.md](references/playwright_browser_api.md) for complete API reference on `fill()`, `select_option()`, `set_input_files()`, and `expect_file_chooser()`.

 ### 3. Screenshot & PDF Capture

-#### Screenshot Strategies
-```python
-# Full page (scrolls automatically)
-await page.screenshot(path="full-page.png", full_page=True)
+- **Full page:** `await page.screenshot(path="full.png", full_page=True)`
+- **Element:** `await page.locator("div.chart").screenshot(path="chart.png")`
+- **PDF (Chromium only):** `await page.pdf(path="out.pdf", format="A4", print_background=True)`
+- **Visual regression:** Take screenshots at known states, store baselines in version control with naming: `{page}_{viewport}_{state}.png`

-# Viewport only (what's visible)
-await page.screenshot(path="viewport.png")
-
-# Specific element
-element = page.locator("div.chart-container")
-await element.screenshot(path="chart.png")
-
-# With custom viewport for consistency
-context = await browser.new_context(viewport={"width": 1920, "height": 1080})
-```
-
-#### PDF Generation
-```python
-# Only works in Chromium
-await page.pdf(
-    path="output.pdf",
-    format="A4",
-    margin={"top": "1cm", "right": "1cm", "bottom": "1cm", "left": "1cm"},
-    print_background=True
-)
-```
-
-#### Visual Regression Baselines
-Take screenshots at known states and compare pixel-by-pixel. Store baselines in version control. Use naming conventions: `{page}_{viewport}_{state}.png`.
+See [playwright_browser_api.md](references/playwright_browser_api.md) for full screenshot/PDF options.

 ### 4. Structured Data Extraction

-#### Tables to JSON
-```python
-async def extract_table(page, selector):
-    headers = await page.eval_on_selector_all(
-        f"{selector} thead th",
-        "elements => elements.map(e => e.textContent.trim())"
-    )
-    rows = await page.eval_on_selector_all(
-        f"{selector} tbody tr",
-        """rows => rows.map(row => {
-            return Array.from(row.querySelectorAll('td'))
-                .map(cell => cell.textContent.trim())
-        })"""
-    )
-    return [dict(zip(headers, row)) for row in rows]
-```
+Core extraction patterns:
+- **Tables to JSON** — Extract `<thead>` headers and `<tbody>` rows into dictionaries
+- **Listings to arrays** — Map repeating card elements using a field-selector map (supports `::attr()` for attributes)
+- **Nested/threaded data** — Recursive extraction for comments with replies, category trees

-#### Listings to Arrays
-```python
-async def extract_listings(page, container_sel, field_map):
-    """
-    field_map example: {"title": "h3.title", "price": "span.price", "url": "a::attr(href)"}
-    """
-    items = []
-    cards = await page.query_selector_all(container_sel)
-    for card in cards:
-        item = {}
-        for field, sel in field_map.items():
-            if "::attr(" in sel:
-                attr_sel, attr_name = sel.split("::attr(")
-                attr_name = attr_name.rstrip(")")
-                el = await card.query_selector(attr_sel)
-                item[field] = await el.get_attribute(attr_name) if el else None
-            else:
-                el = await card.query_selector(sel)
-                item[field] = (await el.text_content()).strip() if el else None
-        items.append(item)
-    return items
-```
-
-#### Nested Data Extraction
-For threaded content (comments with replies), use recursive extraction:
-```python
-async def extract_comments(page, parent_selector):
-    comments = []
-    elements = await page.query_selector_all(f"{parent_selector} > .comment")
-    for el in elements:
-        text = await (await el.query_selector(".comment-body")).text_content()
-        author = await (await el.query_selector(".author")).text_content()
-        replies = await extract_comments(el, ".replies")
-        comments.append({
-            "author": author.strip(),
-            "text": text.strip(),
-            "replies": replies
-        })
-    return comments
-```
+See [data_extraction_recipes.md](references/data_extraction_recipes.md) for complete extraction functions, price parsing, data cleaning utilities, and output format helpers (JSON, CSV, JSONL).

 ### 5. Cookie & Session Management

-#### Save and Restore Sessions
-```python
-import json
+- **Save/restore cookies:** `context.cookies()` and `context.add_cookies()`
+- **Full storage state** (cookies + localStorage): `context.storage_state(path="state.json")` to save, `browser.new_context(storage_state="state.json")` to restore

-# Save cookies after login
-cookies = await context.cookies()
-with open("session.json", "w") as f:
-    json.dump(cookies, f)
-
-# Restore session in new context
-with open("session.json", "r") as f:
-    cookies = json.load(f)
-context = await browser.new_context()
-await context.add_cookies(cookies)
-```
-
-#### Storage State (Cookies + Local Storage)
-```python
-# Save full state (cookies + localStorage + sessionStorage)
-await context.storage_state(path="state.json")
-
-# Restore full state
-context = await browser.new_context(storage_state="state.json")
-```
-
-**Best practice:** Save state after login, reuse across scraping sessions. Check session validity before starting a long job — make a lightweight request to a protected page and verify you are not redirected to login.
+**Best practice:** Save state after login, reuse across scraping sessions. Check session validity before starting a long job — make a lightweight request to a protected page and verify you are not redirected to login. See [playwright_browser_api.md](references/playwright_browser_api.md) for cookie and storage state API details.

 ### 6. Anti-Detection Patterns

-Modern websites detect automation through multiple vectors. Address all of them:
+Modern websites detect automation through multiple vectors. Apply these in priority order:

-#### User Agent Rotation
-Never use the default Playwright user agent. Rotate through real browser user agents:
-```python
-USER_AGENTS = [
-    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36",
-    "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36",
-    "Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36",
-]
-```
+1. **WebDriver flag removal** — Remove `navigator.webdriver = true` via init script (critical)
+2. **Custom user agent** — Rotate through real browser UAs; never use the default headless UA
+3. **Realistic viewport** — Set 1920x1080 or similar real-world dimensions (default 800x600 is a red flag)
+4. **Request throttling** — Add `random.uniform()` delays between actions
+5. **Proxy support** — Per-browser or per-context proxy configuration

-#### Viewport and Screen Size
-Set realistic viewport dimensions. The default 800x600 is a red flag:
-```python
-context = await browser.new_context(
-    viewport={"width": 1920, "height": 1080},
-    screen={"width": 1920, "height": 1080},
-    user_agent=random.choice(USER_AGENTS),
-)
-```
-
-#### WebDriver Flag Removal
-Playwright sets `navigator.webdriver = true`. Remove it:
-```python
-await page.add_init_script("""
-    Object.defineProperty(navigator, 'webdriver', {get: () => undefined});
-""")
-```
-
-#### Request Throttling
-Add human-like delays between actions:
-```python
-import random
-
-async def human_delay(min_ms=500, max_ms=2000):
-    delay = random.randint(min_ms, max_ms)
-    await page.wait_for_timeout(delay)
-```
-
-#### Proxy Support
-```python
-browser = await playwright.chromium.launch(
-    proxy={"server": "http://proxy.example.com:8080"}
-)
-# Or per-context:
-context = await browser.new_context(
-    proxy={"server": "http://proxy.example.com:8080",
-           "username": "user", "password": "pass"}
-)
-```
+See [anti_detection_patterns.md](references/anti_detection_patterns.md) for the complete stealth stack: navigator property hardening, WebGL/canvas fingerprint evasion, behavioral simulation (mouse movement, typing speed, scroll patterns), proxy rotation strategies, and detection self-test URLs.

 ### 7. Dynamic Content Handling

-#### SPA Rendering
-SPAs render content client-side. Wait for the actual content, not the page load:
-```python
-await page.goto(url)
-# Wait for the data to render, not just the shell
-await page.wait_for_selector("div.product-list article", state="attached")
-```
+- **SPA rendering:** Wait for content selectors (`wait_for_selector`), not the page load event
+- **AJAX/Fetch waiting:** Use `page.expect_response("**/api/data*")` to intercept and wait for specific API calls
+- **Shadow DOM:** Playwright's CSS and text locators pierce open Shadow DOM by default; `>>` chains selectors: `page.locator("custom-element >> .inner-class")`
+- **Lazy-loaded images:** Scroll elements into view with `scroll_into_view_if_needed()` to trigger loading

-#### AJAX / Fetch Waiting
-Intercept and wait for specific API calls:
-```python
-async with page.expect_response("**/api/products*") as response_info:
-    await page.click("button.load-more")
-response = await response_info.value
-data = await response.json()  # You can use the API data directly
-```
-
-#### Shadow DOM Traversal
-```python
-# Playwright pierces open Shadow DOM automatically with >>
-await page.locator("custom-element >> .inner-class").click()
-```
-
-#### Lazy-Loaded Images
-Scroll elements into view to trigger lazy loading:
-```python
-images = await page.query_selector_all("img[data-src]")
-for img in images:
-    await img.scroll_into_view_if_needed()
-    await page.wait_for_timeout(200)
-```
+See [playwright_browser_api.md](references/playwright_browser_api.md) for wait strategies, network interception, and Shadow DOM details.

 ### 8. Error Handling & Retry Logic

-#### Retry Decorator Pattern
-```python
-import asyncio
+- **Retry with backoff:** Wrap page interactions in retry logic with exponential backoff (e.g., 1s, 2s, 4s)
+- **Fallback selectors:** On `TimeoutError`, try alternative selectors before failing
+- **Error-state screenshots:** Capture `page.screenshot(path="error-state.png")` on unexpected failures for debugging
+- **Rate limit detection:** Check for HTTP 429 responses and respect `Retry-After` headers

-async def with_retry(coro_factory, max_retries=3, backoff_base=2):
-    for attempt in range(max_retries):
-        try:
-            return await coro_factory()
-        except Exception as e:
-            if attempt == max_retries - 1:
-                raise
-            wait = backoff_base ** attempt
-            print(f"Attempt {attempt + 1} failed: {e}. Retrying in {wait}s...")
-            await asyncio.sleep(wait)
-```
-
-#### Handling Common Failures
-```python
-from playwright.async_api import TimeoutError as PlaywrightTimeout
-
-try:
-    await page.click("button.submit", timeout=5000)
-except PlaywrightTimeout:
-    # Element did not appear — page structure may have changed
-    # Try fallback selector
-    await page.click("[type='submit']", timeout=5000)
-except Exception as e:
-    # Network error, browser crash, etc.
-    await page.screenshot(path="error-state.png")
-    raise
-```
-
-#### Rate Limit Detection
-```python
-async def check_rate_limit(response):
-    if response.status == 429:
-        retry_after = response.headers.get("retry-after", "60")
-        wait_seconds = int(retry_after)
-        print(f"Rate limited. Waiting {wait_seconds}s...")
-        await asyncio.sleep(wait_seconds)
-        return True
-    return False
-```
+See [anti_detection_patterns.md](references/anti_detection_patterns.md) for the complete exponential backoff implementation and rate limiter class.

 ## Workflows

--- a/engineering/spec-driven-workflow/SKILL.md
+++ b/engineering/spec-driven-workflow/SKILL.md
@@ -34,185 +34,32 @@ If the spec is not written, reviewed, and approved, implementation does not begi

 Every spec follows this structure. No sections are optional — if a section does not apply, write "N/A — [reason]" so reviewers know it was considered, not forgotten.

-### 1. Title and Context
+### Mandatory Sections

-```markdown
-# Spec: [Feature Name]
+| # | Section | Key Rules |
+|---|---------|-----------|
+| 1 | **Title and Metadata** | Author, date, status (Draft/In Review/Approved/Superseded), reviewers |
+| 2 | **Context** | Why this feature exists. 2-4 paragraphs with evidence (metrics, tickets). |
+| 3 | **Functional Requirements** | RFC 2119 keywords (MUST/SHOULD/MAY). Numbered FR-N. Each is atomic and testable. |
+| 4 | **Non-Functional Requirements** | Performance, security, accessibility, scalability, reliability — all with measurable thresholds. |
+| 5 | **Acceptance Criteria** | Given/When/Then format. Every AC references at least one FR-* or NFR-*. |
+| 6 | **Edge Cases** | Numbered EC-N. Cover failure modes for every external dependency. |
+| 7 | **API Contracts** | TypeScript-style interfaces. Cover success and error responses. |
+| 8 | **Data Models** | Table format with field, type, constraints. Every entity from requirements must have a model. |
+| 9 | **Out of Scope** | Explicit exclusions with reasons. Prevents scope creep during implementation. |

-**Author:** [name]
-**Date:** [ISO 8601]
-**Status:** Draft | In Review | Approved | Superseded
-**Reviewers:** [list]
-**Related specs:** [links]
-
-## Context
-
-[Why does this feature exist? What problem does it solve? What is the business
-motivation? Include links to user research, support tickets, or metrics that
-justify this work. 2-4 paragraphs maximum.]
-```
-
-### 2. Functional Requirements (RFC 2119)
-
-Use RFC 2119 keywords precisely:
+### RFC 2119 Keywords

 | Keyword | Meaning |
 |---------|---------|
-| **MUST** | Absolute requirement. Failing this means the implementation is non-conformant. |
-| **MUST NOT** | Absolute prohibition. Doing this means the implementation is broken. |
-| **SHOULD** | Recommended. May be omitted with documented justification. |
-| **SHOULD NOT** | Discouraged. May be included with documented justification. |
-| **MAY** | Optional. Purely at the implementer's discretion. |
+| **MUST** | Absolute requirement. Non-conformant without it. |
+| **MUST NOT** | Absolute prohibition. |
+| **SHOULD** | Recommended. Omit only with documented justification. |
+| **MAY** | Optional. Implementer's discretion. |

-```markdown
-## Functional Requirements
+See [spec_format_guide.md](references/spec_format_guide.md) for the complete template with section-by-section examples, good/bad requirement patterns, and feature-type templates (CRUD, Integration, Migration).

- FR-1: The system MUST authenticate users via OAuth 2.0 PKCE flow.
- FR-2: The system MUST reject tokens older than 24 hours.
- FR-3: The system SHOULD support refresh token rotation.
- FR-4: The system MAY cache user profiles for up to 5 minutes.
- FR-5: The system MUST NOT store plaintext passwords under any circumstance.
-```
-
-Number every requirement. Use `FR-` prefix. Each requirement is a single, testable statement.
-
-### 3. Non-Functional Requirements
-
-```markdown
-## Non-Functional Requirements
-
-### Performance
- NFR-P1: Login flow MUST complete in < 500ms (p95) under normal load.
- NFR-P2: Token validation MUST complete in < 50ms (p99).
-
-### Security
- NFR-S1: All tokens MUST be transmitted over TLS 1.2+.
- NFR-S2: The system MUST rate-limit login attempts to 5/minute per IP.
-
-### Accessibility
- NFR-A1: Login form MUST meet WCAG 2.1 AA standards.
- NFR-A2: Error messages MUST be announced to screen readers.
-
-### Scalability
- NFR-SC1: The system SHOULD handle 10,000 concurrent sessions.
-
-### Reliability
- NFR-R1: The authentication service MUST maintain 99.9% uptime.
-```
-
-### 4. Acceptance Criteria (Given/When/Then)
-
-Every functional requirement maps to one or more acceptance criteria. Use Gherkin syntax:
-
-```markdown
-## Acceptance Criteria
-
-### AC-1: Successful login (FR-1)
-Given a user with valid credentials
-When they submit the login form with correct email and password
-Then they receive a valid access token
-And they are redirected to the dashboard
-And the login event is logged with timestamp and IP
-
-### AC-2: Expired token rejection (FR-2)
-Given a user with an access token issued 25 hours ago
-When they make an API request with that token
-Then they receive a 401 Unauthorized response
-And the response body contains error code "TOKEN_EXPIRED"
-And they are NOT redirected (API clients handle their own flow)
-
-### AC-3: Rate limiting (NFR-S2)
-Given an IP address that has made 5 failed login attempts in the last minute
-When a 6th login attempt arrives from that IP
-Then the request is rejected with 429 Too Many Requests
-And the response includes a Retry-After header
-```
-
-### 5. Edge Cases and Error Scenarios
-
-```markdown
-## Edge Cases
-
- EC-1: User submits login form with empty email → Show validation error, do not hit API.
- EC-2: OAuth provider is down → Show "Service temporarily unavailable", retry after 30s.
- EC-3: User has account but no password (social-only) → Redirect to social login.
- EC-4: Concurrent login from two devices → Both sessions are valid (no single-session enforcement).
- EC-5: Token expires mid-request → Complete the current request, return warning header.
-```
-
-### 6. API Contracts
-
-Define request/response shapes using TypeScript-style notation:
-
-```markdown
-## API Contracts
-
-### POST /api/auth/login
-Request:
-```typescript
-interface LoginRequest {
-  email: string;       // MUST be valid email format
-  password: string;    // MUST be 8-128 characters
-  rememberMe?: boolean; // Default: false
-}
-```
-
-Success Response (200):
-```typescript
-interface LoginResponse {
-  accessToken: string;   // JWT, expires in 24h
-  refreshToken: string;  // Opaque, expires in 30d
-  expiresIn: number;     // Seconds until access token expires
-  user: {
-    id: string;
-    email: string;
-    displayName: string;
-  };
-}
-```
-
-Error Response (401):
-```typescript
-interface AuthError {
-  error: "INVALID_CREDENTIALS" | "TOKEN_EXPIRED" | "ACCOUNT_LOCKED";
-  message: string;
-  retryAfter?: number; // Seconds, present for rate-limited responses
-}
-```
-```
-
-### 7. Data Models
-
-```markdown
-## Data Models
-
-### User
-| Field | Type | Constraints |
-|-------|------|-------------|
-| id | UUID | Primary key, auto-generated |
-| email | string | Unique, max 255 chars, valid email format |
-| passwordHash | string | bcrypt, never exposed via API |
-| createdAt | timestamp | UTC, immutable |
-| lastLoginAt | timestamp | UTC, updated on each login |
-| loginAttempts | integer | Reset to 0 on successful login |
-| lockedUntil | timestamp | Null if not locked |
-```
-
-### 8. Out of Scope
-
-Explicit exclusions prevent scope creep:
-
-```markdown
-## Out of Scope
-
- OS-1: Multi-factor authentication (separate spec: SPEC-042)
- OS-2: Social login providers beyond Google and GitHub
- OS-3: Admin impersonation of user accounts
- OS-4: Password complexity rules beyond minimum length (deferred to v2)
- OS-5: Session management UI (users cannot see/revoke active sessions yet)
-```
-
-If someone asks for an out-of-scope item during implementation, point them to this section. Do not build it.
+See [acceptance_criteria_patterns.md](references/acceptance_criteria_patterns.md) for a full pattern library of Given/When/Then criteria across authentication, CRUD, search, file upload, payment, notification, and accessibility scenarios.

 ---

@@ -405,107 +252,7 @@ Use `engineering/spec-driven-workflow` for:

 ## Examples

-### Full Spec: User Password Reset
-
-```markdown
-# Spec: Password Reset Flow
-
-**Author:** Engineering Team
-**Date:** 2026-03-25
-**Status:** Approved
-
-## Context
-
-Users who forget their passwords currently have no self-service recovery option.
-Support receives ~200 password reset requests per week, costing approximately
-8 hours of support time. This feature eliminates that burden entirely.
-
-## Functional Requirements
-
- FR-1: The system MUST allow users to request a password reset via email.
- FR-2: The system MUST send a reset link that expires after 1 hour.
- FR-3: The system MUST invalidate all previous reset links when a new one is requested.
- FR-4: The system MUST enforce minimum password length of 8 characters on reset.
- FR-5: The system MUST NOT reveal whether an email exists in the system.
- FR-6: The system SHOULD log all reset attempts for audit purposes.
-
-## Acceptance Criteria
-
-### AC-1: Request reset (FR-1, FR-5)
-Given a user on the password reset page
-When they enter any email address and submit
-Then they see "If an account exists, a reset link has been sent"
-And the response is identical whether the email exists or not
-
-### AC-2: Valid reset link (FR-2)
-Given a user who received a reset email 30 minutes ago
-When they click the reset link
-Then they see the password reset form
-
-### AC-3: Expired reset link (FR-2)
-Given a user who received a reset email 2 hours ago
-When they click the reset link
-Then they see "This link has expired. Please request a new one."
-
-### AC-4: Previous links invalidated (FR-3)
-Given a user who requested two reset emails
-When they click the link from the first email
-Then they see "This link is no longer valid."
-
-## Edge Cases
-
- EC-1: User submits reset for non-existent email → Same success message (FR-5).
- EC-2: User clicks reset link twice → Second click shows "already used" if password was changed.
- EC-3: Email delivery fails → Log error, do not retry automatically.
- EC-4: User requests reset while already logged in → Allow it, do not force logout.
-
-## Out of Scope
-
- OS-1: Security questions as alternative reset method.
- OS-2: SMS-based password reset.
- OS-3: Admin-initiated password reset (separate spec).
-```
-
-### Extracted Test Cases (from above spec)
-
-```python
-# Generated by test_extractor.py --framework pytest
-
-class TestPasswordReset:
-    def test_ac1_request_reset_existing_email(self):
-        """AC-1: Request reset with existing email shows generic message."""
-        # Given a user on the password reset page
-        # When they enter a registered email and submit
-        # Then they see "If an account exists, a reset link has been sent"
-        raise NotImplementedError("Implement this test")
-
-    def test_ac1_request_reset_nonexistent_email(self):
-        """AC-1: Request reset with unknown email shows same generic message."""
-        # Given a user on the password reset page
-        # When they enter an unregistered email and submit
-        # Then they see identical response to existing email case
-        raise NotImplementedError("Implement this test")
-
-    def test_ac2_valid_reset_link(self):
-        """AC-2: Reset link works within expiry window."""
-        raise NotImplementedError("Implement this test")
-
-    def test_ac3_expired_reset_link(self):
-        """AC-3: Reset link rejected after 1 hour."""
-        raise NotImplementedError("Implement this test")
-
-    def test_ac4_previous_links_invalidated(self):
-        """AC-4: Old reset links stop working when new one is requested."""
-        raise NotImplementedError("Implement this test")
-
-    def test_ec1_nonexistent_email_same_response(self):
-        """EC-1: Non-existent email produces identical response."""
-        raise NotImplementedError("Implement this test")
-
-    def test_ec2_reset_link_used_twice(self):
-        """EC-2: Already-used reset link shows appropriate message."""
-        raise NotImplementedError("Implement this test")
-```
+A complete worked example (Password Reset spec with extracted test cases) is available in [spec_format_guide.md](references/spec_format_guide.md#full-example-password-reset). It demonstrates all 9 sections, requirement numbering, acceptance criteria, edge cases, and the corresponding pytest stubs generated by `test_extractor.py`.

 ---

--- a/engineering/spec-driven-workflow/references/spec_format_guide.md
+++ b/engineering/spec-driven-workflow/references/spec_format_guide.md
@@ -421,3 +421,111 @@ Focus on: backward compatibility, rollback plan, data integrity, zero-downtime d
 - [ ] No placeholder text remains
 - [ ] Context includes evidence (metrics, tickets, research)
 - [ ] Status is "In Review" (not still "Draft")
+
+---
+
+## Full Example: Password Reset
+
+A complete spec demonstrating all sections, followed by extracted test stubs.
+
+### The Spec
+
+```markdown
+# Spec: Password Reset Flow
+
+**Author:** Engineering Team
+**Date:** 2026-03-25
+**Status:** Approved
+
+## Context
+
+Users who forget their passwords currently have no self-service recovery option.
+Support receives ~200 password reset requests per week, costing approximately
+8 hours of support time. This feature eliminates that burden entirely.
+
+## Functional Requirements
+
+- FR-1: The system MUST allow users to request a password reset via email.
+- FR-2: The system MUST send a reset link that expires after 1 hour.
+- FR-3: The system MUST invalidate all previous reset links when a new one is requested.
+- FR-4: The system MUST enforce minimum password length of 8 characters on reset.
+- FR-5: The system MUST NOT reveal whether an email exists in the system.
+- FR-6: The system SHOULD log all reset attempts for audit purposes.
+
+## Acceptance Criteria
+
+### AC-1: Request reset (FR-1, FR-5)
+Given a user on the password reset page
+When they enter any email address and submit
+Then they see "If an account exists, a reset link has been sent"
+And the response is identical whether the email exists or not
+
+### AC-2: Valid reset link (FR-2)
+Given a user who received a reset email 30 minutes ago
+When they click the reset link
+Then they see the password reset form
+
+### AC-3: Expired reset link (FR-2)
+Given a user who received a reset email 2 hours ago
+When they click the reset link
+Then they see "This link has expired. Please request a new one."
+
+### AC-4: Previous links invalidated (FR-3)
+Given a user who requested two reset emails
+When they click the link from the first email
+Then they see "This link is no longer valid."
+
+## Edge Cases
+
+- EC-1: User submits reset for non-existent email → Same success message (FR-5).
+- EC-2: User clicks reset link twice → Second click shows "already used" if password was changed.
+- EC-3: Email delivery fails → Log error, do not retry automatically.
+- EC-4: User requests reset while already logged in → Allow it, do not force logout.
+
+## Out of Scope
+
+- OS-1: Security questions as alternative reset method.
+- OS-2: SMS-based password reset.
+- OS-3: Admin-initiated password reset (separate spec).
+```
+
+### Extracted Test Cases
+
+Generated by `test_extractor.py --framework pytest`:
+
+```python
+class TestPasswordReset:
+    def test_ac1_request_reset_existing_email(self):
+        """AC-1: Request reset with existing email shows generic message."""
+        # Given a user on the password reset page
+        # When they enter a registered email and submit
+        # Then they see "If an account exists, a reset link has been sent"
+        raise NotImplementedError("Implement this test")
+
+    def test_ac1_request_reset_nonexistent_email(self):
+        """AC-1: Request reset with unknown email shows same generic message."""
+        # Given a user on the password reset page
+        # When they enter an unregistered email and submit
+        # Then they see identical response to existing email case
+        raise NotImplementedError("Implement this test")
+
+    def test_ac2_valid_reset_link(self):
+        """AC-2: Reset link works within expiry window."""
+        raise NotImplementedError("Implement this test")
+
+    def test_ac3_expired_reset_link(self):
+        """AC-3: Reset link rejected after 1 hour."""
+        raise NotImplementedError("Implement this test")
+
+    def test_ac4_previous_links_invalidated(self):
+        """AC-4: Old reset links stop working when new one is requested."""
+        raise NotImplementedError("Implement this test")
+
+    def test_ec1_nonexistent_email_same_response(self):
+        """EC-1: Non-existent email produces identical response."""
+        raise NotImplementedError("Implement this test")
+
+    def test_ec2_reset_link_used_twice(self):
+        """EC-2: Already-used reset link shows appropriate message."""
+        raise NotImplementedError("Implement this test")
+```