firefrost-gaming/claude-skills-reference


Reza Rezvani 97952ccbee feat(engineering): add browser-automation and spec-driven-workflow skills

browser-automation (564-line SKILL.md, 3 scripts, 3 references):
- Web scraping, form filling, screenshot capture, data extraction
- Anti-detection patterns, cookie/session management, dynamic content
- scraping_toolkit.py, form_automation_builder.py, anti_detection_checker.py
- NOT testing (that's playwright-pro) — this is automation & scraping

spec-driven-workflow (586-line SKILL.md, 3 scripts, 3 references):
- Spec-first development: write spec BEFORE code
- Bounded autonomy rules, 6-phase workflow, self-review checklist
- spec_generator.py, spec_validator.py, test_extractor.py
- Pairs with tdd-guide for red-green-refactor after spec

Updated engineering plugin.json (31 → 33 skills).
Added both to mkdocs.yml nav and generated docs pages.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

2026-03-25 12:57:18 +01:00


Spec Format Guide

Complete reference for writing feature specifications. Every section is explained with examples, rationale, and common mistakes.

The Spec Document Structure

A spec has 9 mandatory sections. If a section does not apply, write "N/A — [reason]" so reviewers know it was considered, not skipped.

1. Title and Metadata
2. Context
3. Functional Requirements
4. Non-Functional Requirements
5. Acceptance Criteria
6. Edge Cases and Error Scenarios
7. API Contracts
8. Data Models
9. Out of Scope
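
The section list above can be checked mechanically. The following is a minimal sketch, assuming the spec is a Markdown file that opens with a `# Spec: ...` title and uses `## ` headings named exactly as in the list; `missing_sections` is a hypothetical helper, not one of the skill's shipped scripts.

```python
import re

# Section names come from the list above; the "## " heading layout is an assumption.
REQUIRED_SECTIONS = [
    "Context",
    "Functional Requirements",
    "Non-Functional Requirements",
    "Acceptance Criteria",
    "Edge Cases and Error Scenarios",
    "API Contracts",
    "Data Models",
    "Out of Scope",
]


def missing_sections(spec_text: str) -> list:
    """Return the names of required sections that the spec text lacks."""
    missing = []
    # Section 1 is assumed to be the "# Spec: <name>" title line.
    if not re.search(r"^# Spec: \S", spec_text, flags=re.MULTILINE):
        missing.append("Title and Metadata")
    headings = {
        match.group(1).strip()
        for match in re.finditer(r"^## (.+)$", spec_text, flags=re.MULTILINE)
    }
    missing.extend(name for name in REQUIRED_SECTIONS if name not in headings)
    return missing


draft = "# Spec: Password Reset\n\n## Context\nUsers cannot reset passwords.\n"
print(missing_sections(draft))
```

A section marked N/A still counts as present here, because only the heading is checked, which matches the rule that an inapplicable section is stated rather than omitted.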

Section 1: Title and Metadata

# Spec: [Feature Name]

**Author:** Jane Doe
**Date:** 2026-03-25
**Status:** Draft | In Review | Approved | Superseded
**Reviewers:** John Smith, Alice Chen
**Related specs:** SPEC-018 (User Registration), SPEC-023 (Session Management)

Status Lifecycle

| Status | Meaning | Who Can Change |
|--------|---------|----------------|
| Draft | Author is still writing. Not ready for review. | Author |
| In Review | Ready for feedback. Implementation blocked. | Author |
| Approved | Reviewed and accepted. Implementation may begin. | Reviewer |
| Superseded | Replaced by a newer spec. Link to replacement. | Author |

Rule: Implementation MUST NOT begin until status is "Approved."
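
The rule above can be enforced as a gate before any implementation work starts. This is a sketch under assumptions: it expects the `**Status:**` metadata line from the template, and `read_status` and `implementation_may_begin` are hypothetical names.

```python
import re
from enum import Enum


class SpecStatus(Enum):
    DRAFT = "Draft"
    IN_REVIEW = "In Review"
    APPROVED = "Approved"
    SUPERSEDED = "Superseded"


def read_status(spec_text: str) -> SpecStatus:
    """Parse the '**Status:** <value>' metadata line; raise ValueError if absent or invalid."""
    match = re.search(r"^\*\*Status:\*\*\s*(.+?)\s*$", spec_text, flags=re.MULTILINE)
    if match is None:
        raise ValueError("spec has no **Status:** line")
    # Enum lookup by value raises ValueError for anything outside the four statuses,
    # including the unedited template line "Draft | In Review | Approved | Superseded".
    return SpecStatus(match.group(1))


def implementation_may_begin(status: SpecStatus) -> bool:
    """Only an Approved spec unblocks implementation."""
    return status is SpecStatus.APPROVED


spec = "**Author:** Jane Doe\n**Status:** In Review\n"
print(implementation_may_begin(read_status(spec)))
```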

Section 2: Context

The context section answers: Why does this feature exist?

What to Include

The problem being solved (with evidence: support tickets, metrics, user research)
The current state (what exists today and what is broken or missing)
The business justification (revenue impact, cost savings, user retention)
Constraints or dependencies (regulatory, technical, timeline)

What to Exclude

Implementation details (that is the engineer's job)
Solution proposals (the spec says WHAT, not HOW)
Lengthy background (2-4 paragraphs maximum)

Good Example

## Context

Users who forget their passwords currently have no self-service recovery.
Support handles ~200 password reset requests per week, consuming approximately
8 hours of agent time at $45/hour ($360/week, $18,720/year). Additionally,
12% of users who contact support for a reset never return.

This feature provides self-service password reset via email, eliminating
support burden and reducing user churn from the reset flow.

Bad Example

## Context

We need a password reset feature. Users forget their passwords sometimes
and need to reset them. We should build this.

Why it is bad: No evidence, no metrics, no business justification. "We should build this" is not a reason.

Section 3: Functional Requirements — RFC 2119

RFC 2119 Keywords

These keywords have precise meanings per RFC 2119. Do not use them casually.

| Keyword | Meaning | Testing Implication |
|---------|---------|---------------------|
| MUST | Absolute requirement. The implementation is non-conformant without this. | Must have a passing test. Failure = release blocker. |
| MUST NOT | Absolute prohibition. Doing this = broken implementation. | Must have a test proving this cannot happen. |
| SHOULD | Strongly recommended. Can be omitted only with documented justification. | Should have a test. Omission requires written rationale. |
| SHOULD NOT | Strongly discouraged. Can be done only with documented justification. | Should have a test confirming the behavior does not occur. |
| MAY | Truly optional. Implementer's discretion. | Test is optional. Document if implemented. |
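
The keyword table can drive test planning directly. Below is a minimal sketch that finds the keyword in a requirement and looks up its testing implication; `rfc2119_keyword` and `TEST_POLICY` are hypothetical names, and the policy strings are condensed from the table rather than quoted.

```python
import re
from typing import Optional

# Two-word keywords come first so "MUST NOT" is never read as plain "MUST".
KEYWORDS = ("MUST NOT", "SHOULD NOT", "MUST", "SHOULD", "MAY")

# Testing implication per keyword, condensed from the table above.
TEST_POLICY = {
    "MUST": "test required; failure blocks release",
    "MUST NOT": "test required; prove it cannot happen",
    "SHOULD": "test expected; omission needs written rationale",
    "SHOULD NOT": "test expected; confirm the behavior does not occur",
    "MAY": "test optional; document if implemented",
}


def rfc2119_keyword(requirement: str) -> Optional[str]:
    """Return the uppercase RFC 2119 keyword in the requirement, or None.

    Matching is case-sensitive, so a casual lowercase "must" is not a keyword.
    """
    for keyword in KEYWORDS:
        if re.search(rf"\b{keyword}\b", requirement):
            return keyword
    return None


requirement = "FR-4: The system MUST NOT include the password hash in any API response."
keyword = rfc2119_keyword(requirement)
print(keyword, "->", TEST_POLICY[keyword])
```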

Writing Good Requirements

Each requirement MUST be:

Atomic — One behavior per requirement. Not "The system MUST authenticate users and log them in."
Testable — You can write a test that proves it works or does not.
Numbered — Sequential FR-N format for traceability.
Specific — No ambiguous adjectives ("fast", "secure", "user-friendly").

Good Requirements

- FR-1: The system MUST accept login via email and password.
- FR-2: The system MUST reject passwords shorter than 8 characters.
- FR-3: The system MUST return a JWT access token on successful login.
- FR-4: The system MUST NOT include the password hash in any API response.
- FR-5: The system SHOULD support "remember me" with a 30-day refresh token.
- FR-6: The system MAY display last login time on the dashboard.

Bad Requirements

- FR-1: The login system must be fast and secure.
  (Untestable: what is "fast"? What is "secure"?)

- FR-2: The system must handle all edge cases.
  (Vague: which edge cases? This delegates the spec to the implementer.)

- FR-3: Users should be able to log in easily.
  (Subjective: "easily" is not measurable.)
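
Three of the four properties (numbered, keyword use, specific wording) can be linted automatically; atomicity still needs a human reviewer. This is a sketch only: `lint_requirement` is a hypothetical helper, and the ambiguous-term list is just the handful of words this guide calls out.

```python
import re

# Terms flagged in this guide; a real list would be longer (assumption).
AMBIGUOUS_TERMS = {"fast", "secure", "user-friendly", "easily"}
KEYWORD_RE = re.compile(r"\b(?:MUST NOT|SHOULD NOT|MUST|SHOULD|MAY)\b")
ID_RE = re.compile(r"^FR-\d+: ")


def lint_requirement(line: str) -> list:
    """Return a list of problems found in one requirement line (empty if clean)."""
    text = line.strip()
    if text.startswith("- "):
        text = text[2:]
    problems = []
    if not ID_RE.match(text):
        problems.append("missing FR-N identifier")
    if not KEYWORD_RE.search(text):
        problems.append("missing uppercase RFC 2119 keyword")
    words = set(re.findall(r"[a-z][a-z-]*", text.lower()))
    for term in sorted(AMBIGUOUS_TERMS & words):
        problems.append(f"ambiguous term: {term}")
    return problems


for line in [
    "- FR-3: The system MUST return a JWT access token on successful login.",
    "- FR-1: The login system must be fast and secure.",
]:
    print(line, "->", lint_requirement(line))
```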

Section 4: Non-Functional Requirements

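Non-functional requirements define quality attributes in five categories: performance, security, accessibility, scalability, and reliability. Every requirement needs a measurable threshold, for example "< 500ms" rather than "fast".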

Section 5: Acceptance Criteria — Given/When/Then

Acceptance criteria are the contract between the spec author and the implementer. They define "done."

The Given/When/Then Pattern

Given [precondition — the world is in this state]
When  [action — the user or system does this]
Then  [outcome — this observable result occurs]
And   [additional outcome — and also this]

Rules for Acceptance Criteria

Every AC MUST reference at least one FR-* or NFR-* requirement. Orphaned criteria indicate missing requirements.
Every AC MUST be testable by a machine. If you cannot write an automated test, rewrite the criterion.
No subjective language. Not "should look good" but "MUST render within the design-system grid."
One scenario per AC. If you have multiple Given/When/Then blocks, split into separate ACs.
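
The first rule is easy to automate. The sketch below assumes each criterion starts with a `### AC-N: title (refs)` heading; `orphaned_criteria` is a hypothetical helper that reports criteria whose heading names no FR or NFR.

```python
import re

# Assumed heading shape: "### AC-1: Successful login (FR-1, FR-3)".
AC_HEADING = re.compile(r"^### (AC-\d+): (.+)$", re.MULTILINE)
# Matches FR-7 as well as category-prefixed ids such as NFR-P1.
REQ_REF = re.compile(r"\b(?:FR|NFR)-[A-Z]*\d+\b")


def orphaned_criteria(spec_text: str) -> list:
    """Return the ids of acceptance criteria that reference no requirement."""
    orphans = []
    for match in AC_HEADING.finditer(spec_text):
        ac_id, title = match.group(1), match.group(2)
        if not REQ_REF.search(title):
            orphans.append(ac_id)
    return orphans


spec = (
    "### AC-1: Successful login (FR-1, FR-3)\n"
    "Given a registered user\n\n"
    "### AC-5: User sees dashboard\n"
    "Given a logged-in user\n"
)
print(orphaned_criteria(spec))
```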

Example: Authentication Feature

### AC-1: Successful login (FR-1, FR-3)
Given a registered user with email "user@example.com" and password "P@ssw0rd123"
When they POST /api/auth/login with those credentials
Then they receive a 200 response with a valid JWT token
And the token expires in 24 hours
And the response includes the user's display name

### AC-2: Invalid password (FR-1)
Given a registered user with email "user@example.com"
When they POST /api/auth/login with an incorrect password
Then they receive a 401 response
And the response body contains error "INVALID_CREDENTIALS"
And no token is issued

### AC-3: Short password rejected on registration (FR-2)
Given a new user attempting to register
When they submit a password with 7 characters
Then they receive a 400 response
And the response body contains error "PASSWORD_TOO_SHORT"
And the account is not created
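
Because each criterion is machine-testable, it translates almost line for line into a test. The following sketch turns AC-3 into one; `register` is a hypothetical in-memory stand-in for the real registration endpoint, and the 201 success status is an assumption that the criteria above do not state.

```python
# Hypothetical in-memory stand-in for the registration endpoint; a real suite
# would call the actual API instead.
REGISTERED_EMAILS = set()


def register(email: str, password: str) -> tuple:
    """Return (status, body) the way the endpoint is specified to respond."""
    if len(password) < 8:  # FR-2: reject passwords shorter than 8 characters
        return 400, {"error": "PASSWORD_TOO_SHORT"}
    REGISTERED_EMAILS.add(email)
    return 201, {"email": email}  # 201 is an assumed success status


def test_ac3_short_password_rejected():
    # Given a new user attempting to register
    email = "new.user@example.com"
    assert email not in REGISTERED_EMAILS
    # When they submit a password with 7 characters
    status, body = register(email, "Abc123!")
    # Then they receive a 400 response
    assert status == 400
    # And the response body contains error "PASSWORD_TOO_SHORT"
    assert body["error"] == "PASSWORD_TOO_SHORT"
    # And the account is not created
    assert email not in REGISTERED_EMAILS


test_ac3_short_password_rejected()
```

Keeping the Given/When/Then lines as comments preserves traceability from the test back to the criterion.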

Common Mistakes

| Mistake | Example | Fix |
|---------|---------|-----|
| Vague outcome | "Then the system works correctly" | "Then the response status is 200 and body contains {field: value}" |
| Missing precondition | "When user logs in, then token is issued" | "Given a registered user, when they POST valid credentials, then..." |
| Multiple scenarios | AC with 3 different When clauses | Split into 3 separate ACs |
| No FR reference | "AC-5: User sees dashboard" | "AC-5: User sees dashboard (FR-7)" |

Section 6: Edge Cases and Error Scenarios

What Counts as an Edge Case

Invalid or malformed input
External service failures (API down, timeout, rate-limited)
Concurrent operations (race conditions)
Boundary values (empty string, max length, zero, negative numbers)
State conflicts (already exists, already deleted, expired)

Format

- EC-1: Empty email field → Return 400 with error "EMAIL_REQUIRED". Do not call auth service.
- EC-2: Email exceeds 255 characters → Return 400 with error "EMAIL_TOO_LONG".
- EC-3: OAuth provider returns 503 → Return 503 with "Service temporarily unavailable". Retry after 30s.
- EC-4: Two users register same email simultaneously → First succeeds, second gets 409 Conflict.
- EC-5: User clicks reset link after password was already changed → Show "Link already used."

Coverage Rule

For every external dependency, specify at least one failure:

Database: connection lost, timeout, constraint violation
API: 4xx, 5xx, timeout, invalid response
File system: file not found, permission denied, disk full
User input: empty, too long, wrong type, injection attempt
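
The coverage rule reduces to a set comparison once each edge case is tagged with the dependency it exercises. The tagging scheme and the `uncovered_dependencies` helper below are assumptions for illustration; the EC format above does not itself carry such a tag.

```python
def uncovered_dependencies(dependencies: list, edge_cases: dict) -> list:
    """Return dependencies that no edge case covers.

    edge_cases maps an EC id to the dependency whose failure it describes.
    """
    covered = set(edge_cases.values())
    return [dep for dep in dependencies if dep not in covered]


dependencies = ["database", "oauth provider", "email service"]
edge_cases = {
    "EC-3": "oauth provider",  # provider returns 503
    "EC-4": "database",        # unique-constraint violation on concurrent signup
}
print(uncovered_dependencies(dependencies, edge_cases))
```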

Section 7: API Contracts

Notation

Use TypeScript-style interfaces. They are readable by both frontend and backend engineers.

interface CreateUserRequest {
  email: string;         // MUST be valid email, max 255 chars
  password: string;      // MUST be 8-128 chars
  displayName: string;   // MUST be 1-100 chars, no HTML
  role?: "user" | "admin"; // Default: "user"
}
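
The constraint comments in the interface are what a server-side validator has to enforce. This is a minimal sketch of such a validator; `validate_create_user` is a hypothetical name, and the email and HTML patterns are deliberately loose assumptions, not a recommended production check.

```python
import re

# Loose shape check only (assumption); real email validation is more involved.
EMAIL_RE = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")
# Anything that looks like a tag counts as HTML for this sketch.
HTML_TAG_RE = re.compile(r"<[^>]+>")


def validate_create_user(payload: dict) -> dict:
    """Return a field -> message map; an empty map means the payload is valid."""
    errors = {}

    email = payload.get("email")
    if not isinstance(email, str) or len(email) > 255 or not EMAIL_RE.match(email):
        errors["email"] = "must be a valid email of at most 255 characters"

    password = payload.get("password")
    if not isinstance(password, str) or not 8 <= len(password) <= 128:
        errors["password"] = "must be 8-128 characters"

    name = payload.get("displayName")
    if not isinstance(name, str) or not 1 <= len(name) <= 100 or HTML_TAG_RE.search(name):
        errors["displayName"] = "must be 1-100 characters with no HTML"

    # role is optional and defaults to "user"
    if payload.get("role", "user") not in ("user", "admin"):
        errors["role"] = 'must be "user" or "admin"'

    return errors


print(validate_create_user({"email": "user@example.com", "password": "short", "displayName": "Jane"}))
```

The returned map has the same shape as a field-level `details` object, so it can be passed straight into a validation error response.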

What to Define

For each endpoint:

HTTP method and path (e.g., POST /api/users)
Request body (fields, types, constraints, defaults)
Success response (status code, body shape)
Error responses (each error code with its status and body)
Headers (Authorization, Content-Type, custom headers)

Error Response Convention

interface ApiError {
  error: string;         // Machine-readable code: "INVALID_CREDENTIALS"
  message: string;       // Human-readable: "The email or password is incorrect."
  details?: Record<string, string>;  // Field-level errors for validation
}

Always include:

400 for validation errors
401 for authentication failures
403 for authorization failures
404 for not found
409 for conflicts
429 for rate limiting
500 for unexpected errors (keep it generic — do not leak internals)
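
One way to keep error responses consistent is to build them all through a single helper. The sketch below maps the error codes used in this guide's examples to their statuses and returns a generic body for unexpected failures; `api_error`, `unexpected_error`, and the `INTERNAL_ERROR` code are hypothetical.

```python
import logging
from typing import Optional

logger = logging.getLogger(__name__)

# Codes and statuses taken from the AC and EC examples in this guide.
STATUS_BY_ERROR = {
    "EMAIL_REQUIRED": 400,
    "EMAIL_TOO_LONG": 400,
    "PASSWORD_TOO_SHORT": 400,
    "INVALID_CREDENTIALS": 401,
}


def api_error(code: str, message: str, details: Optional[dict] = None) -> tuple:
    """Return (status, body) in the ApiError shape for a known error code."""
    body = {"error": code, "message": message}
    if details:
        body["details"] = details
    return STATUS_BY_ERROR[code], body


def unexpected_error(exc: Exception) -> tuple:
    """Log the real cause server-side and return a generic 500 body."""
    logger.error("unhandled error", exc_info=exc)
    # "INTERNAL_ERROR" is a hypothetical code; nothing from exc reaches the client.
    return 500, {"error": "INTERNAL_ERROR", "message": "An unexpected error occurred."}


print(api_error("INVALID_CREDENTIALS", "The email or password is incorrect."))
```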

Section 8: Data Models

Table Format

### User
| Field | Type | Constraints |
|-------|------|-------------|
| id | UUID | PK, auto-generated, immutable |
| email | varchar(255) | Unique, not null, valid email |
| passwordHash | varchar(60) | Not null, bcrypt, never in API responses |
| displayName | varchar(100) | Not null |
| role | enum('user','admin') | Default: 'user' |
| createdAt | timestamp | UTC, immutable, auto-set |
| updatedAt | timestamp | UTC, auto-updated |
| deletedAt | timestamp | Null unless soft-deleted |

Rules

Every entity in requirements MUST have a data model. If FR-1 mentions "users", there must be a User model.
Constraints MUST match requirements. If FR-2 says passwords >= 8 chars, the model must note that.
Include indexes. If NFR-P1 says < 500ms queries, note which fields need indexes.
Specify soft vs. hard delete. State it explicitly.
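
To show how model constraints carry over into code, here is a sketch of the User table as a Python dataclass. It uses snake_case field names in place of the camelCase column names, and `to_api` and `soft_delete` are hypothetical methods added to illustrate the "never in API responses" and soft-delete constraints.

```python
import uuid
from dataclasses import asdict, dataclass, field
from datetime import datetime, timezone
from typing import Optional


def _utc_now() -> datetime:
    return datetime.now(timezone.utc)


@dataclass
class User:
    email: str
    password_hash: str  # bcrypt hash; must never leave the server
    display_name: str
    role: str = "user"  # Default: 'user'
    id: str = field(default_factory=lambda: str(uuid.uuid4()))
    created_at: datetime = field(default_factory=_utc_now)
    updated_at: datetime = field(default_factory=_utc_now)
    deleted_at: Optional[datetime] = None  # None unless soft-deleted

    def soft_delete(self) -> None:
        """Mark the row deleted instead of removing it."""
        self.deleted_at = _utc_now()
        self.updated_at = self.deleted_at

    def to_api(self) -> dict:
        """Serialize for API responses, dropping the password hash."""
        data = asdict(self)
        data.pop("password_hash")
        return data


user = User(email="user@example.com", password_hash="<bcrypt hash>", display_name="Jane")
print(sorted(user.to_api()))
```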

Section 9: Out of Scope

Why This Section Matters

Out of Scope prevents scope creep during implementation. When someone says "while you're in there, could you also..." — point them to this section.

Format

- OS-1: Multi-factor authentication — Planned for Q3 (SPEC-045).
- OS-2: Social login beyond Google/GitHub — Insufficient user demand (< 2% of requests).
- OS-3: Admin impersonation — Security review pending. Separate spec required.
- OS-4: Password strength meter UI — Nice-to-have, deferred to design sprint 12.

Rules

Every feature discussed and rejected MUST be listed. This creates a paper trail.
Include the reason. "Not now" is not a reason. "Insufficient demand (< 2% of requests)" is.
Link to future specs when the exclusion is a deferral, not a rejection.

Feature-Type Templates

CRUD Feature

Focus on: all 4 operations, validation rules, authorization, pagination for list endpoints.

- FR-1: Users MUST be able to create a [resource] with [required fields].
- FR-2: Users MUST be able to read a [resource] by ID.
- FR-3: Users MUST be able to list [resources] with pagination (default: 20/page).
- FR-4: Users MUST be able to update [mutable fields] of their own [resources].
- FR-5: Users MUST be able to delete their own [resources] (soft delete).
- FR-6: Users MUST NOT be able to modify or delete other users' [resources].
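
Two of these requirements, pagination (FR-3) and ownership (FR-4 to FR-6), are sketched below with plain lists and dicts. `list_page`, `can_modify`, and the `owner_id` and `deleted_at` keys are hypothetical names; treating soft-deleted resources as unmodifiable is also an assumption.

```python
DEFAULT_PAGE_SIZE = 20  # FR-3 default


def list_page(items: list, page: int = 1, page_size: int = DEFAULT_PAGE_SIZE) -> list:
    """Return one 1-indexed page of items; an out-of-range page is empty."""
    if page < 1 or page_size < 1:
        raise ValueError("page and page_size must be >= 1")
    start = (page - 1) * page_size
    return items[start:start + page_size]


def can_modify(user_id: str, resource: dict) -> bool:
    """FR-4 to FR-6: only the owner may update or delete a live resource."""
    return resource["owner_id"] == user_id and resource.get("deleted_at") is None


notes = [{"id": n, "owner_id": "u1"} for n in range(45)]
print(len(list_page(notes)), len(list_page(notes, page=3)))
print(can_modify("u2", notes[0]))
```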

Integration Feature

Focus on: external API contract, retry/fallback behavior, data mapping, error propagation.

- FR-1: The system MUST call [external API] to [purpose].
- FR-2: The system MUST retry failed calls up to 3 times with exponential backoff.
- FR-3: The system MUST map [external field] to [internal field].
- FR-4: The system MUST NOT expose external API errors directly to users.
- EC-1: External API returns 5xx → Log error, return cached data if < 1h old, else 503.
- EC-2: External API response schema changes → Log warning, reject unmappable fields.
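
FR-2 in this template is worth pinning down, since "retry up to 3 times" means at most four calls in total. A minimal sketch follows; `call_with_retry` is a hypothetical helper, the 0.5 s base delay is an assumption, and a real client would retry only on transient errors rather than on every exception.

```python
import time


def call_with_retry(call, max_retries: int = 3, base_delay: float = 0.5, sleep=time.sleep):
    """Run call(); on failure retry up to max_retries times with doubling delays.

    With the defaults the waits are 0.5 s, 1 s, and 2 s. The sleep function is
    injectable so tests do not have to wait.
    """
    attempt = 0
    while True:
        try:
            return call()
        except Exception:
            if attempt == max_retries:
                raise  # retries exhausted: surface the last error to the caller
            sleep(base_delay * 2 ** attempt)
            attempt += 1


responses = iter([ConnectionError("timeout"), "ok"])


def flaky_call():
    result = next(responses)
    if isinstance(result, Exception):
        raise result
    return result


print(call_with_retry(flaky_call, sleep=lambda seconds: None))
```

The caller that catches the final exception is where FR-4 applies: translate it into the spec's own error response instead of passing the external error through.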

Migration Feature

Focus on: backward compatibility, rollback plan, data integrity, zero-downtime deployment.

- FR-1: The migration MUST transform [old schema] to [new schema].
- FR-2: The migration MUST be reversible (rollback script required).
- FR-3: The migration MUST NOT cause downtime exceeding 30 seconds.
- FR-4: The migration MUST validate data integrity post-run (row count, checksum).
- EC-1: Migration fails mid-way → Automatic rollback, alert ops team.
- EC-2: New schema has stricter constraints → Log invalid rows, quarantine for manual review.
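
The post-run integrity check in FR-4 can be sketched as a row count plus an order-independent checksum. The code below assumes rows fit in memory as tuples and that the expected result can be recomputed by applying the same transform to the old rows; `table_checksum` and `verify_migration` are hypothetical helpers.

```python
import hashlib


def table_checksum(rows) -> str:
    """Return a SHA-256 digest of the rows that ignores row order."""
    digest = hashlib.sha256()
    for line in sorted(repr(tuple(row)) for row in rows):
        digest.update(line.encode("utf-8"))
        digest.update(b"\n")
    return digest.hexdigest()


def verify_migration(old_rows, new_rows, transform) -> bool:
    """Check row count first, then compare checksums of expected vs. actual rows."""
    expected = [transform(row) for row in old_rows]
    if len(expected) != len(new_rows):
        return False
    return table_checksum(expected) == table_checksum(new_rows)


old = [(1, "A@Example.com"), (2, "b@example.com")]
new = [(2, "b@example.com"), (1, "a@example.com")]
print(verify_migration(old, new, lambda row: (row[0], row[1].lower())))
```

A False result is the trigger for the rollback described in EC-1.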

Checklist: Is This Spec Ready for Review?

Every section is filled (or marked N/A with reason)
All requirements use FR-N, NFR-N numbering
RFC 2119 keywords are UPPERCASE
Every AC references at least one requirement
Every AC uses Given/When/Then
Edge cases cover each external dependency failure
API contracts define success AND error responses
Data models include all entities from requirements
Out of Scope lists items discussed and rejected
No placeholder text remains
Context includes evidence (metrics, tickets, research)
Status is "In Review" (not still "Draft")
