Files
claude-skills-reference/AUDIT_REPORT.md
Leo f6f50f5282 Fix CI workflows and installation documentation
- Replace non-existent anthropics/claude-code-action@v1 with direct bash steps in smart-sync.yml and pr-issue-auto-close.yml
- Add missing checkout steps to both workflows for WORKFLOW_KILLSWITCH access
- Fix Issue #189: Replace broken 'npx ai-agent-skills install' with working 'npx agent-skills-cli add' command
- Update README.md and INSTALLATION.md with correct Agent Skills CLI commands and repository links
- Verified: agent-skills-cli detects all 53 skills and works with 42+ AI agents

Fixes: Two GitHub Actions workflows that broke on PR #191 merge
Closes: #189
2026-02-16 11:30:18 +00:00

317 lines
19 KiB
Markdown
Raw Permalink Blame History

This file contains ambiguous Unicode characters
This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.
# Skills Audit Report
**Date:** 2026-02-15
**Auditor:** Automated Skill Quality Audit
**Scope:** Recently added skills in business-growth/, finance/, marketing-skill/campaign-analytics/, project-management/
---
## Executive Summary
The recently added skills fall into two distinct tiers:
1. **Business-growth, Finance, and Campaign Analytics skills** — Genuinely impressive. Production-ready Python tooling, deep domain frameworks, real structured outputs. These would make a domain practitioner say "this actually knows what it's doing."
2. **Project Management skills** — A mixed bag. The Atlassian-specific skills (jira-expert, confluence-expert, atlassian-admin, atlassian-templates) have strong knowledge-base content. The scrum-master and senior-pm skills are thin and generic. None of the PM skills have scripts or assets — they're pure prompt-engineering skills, which is a fundamentally different (and weaker) category.
**Overall: 4 POWERFUL, 1 SOLID, 4 SOLID, 2 GENERIC, 1 WEAK**
---
## Detailed Skill Audits
---
### 1. business-growth/customer-success-manager
**Code Quality: EXCELLENT**
- 3 Python scripts (438 + 487 + 414 = 1,339 lines total)
- Well-structured: proper typing, argparse CLI, JSON/text dual output, error handling
- Zero external dependencies (stdlib only) — deliberate, documented design choice
- `health_score_calculator.py`: Multi-dimensional weighted scoring with segment-aware benchmarks (Enterprise/Mid-Market/SMB). Not placeholder math — real configurable thresholds, normalization logic, trend analysis
- `churn_risk_analyzer.py`: Behavioral signal detection with renewal urgency multipliers
- `expansion_opportunity_scorer.py`: Whitespace mapping and effort-vs-impact prioritization
- All scripts actually runnable with provided sample data
**Problem-Solving Quality: EXCELLENT**
- Health scoring framework reference (80+ lines) explains *why* each dimension is weighted as it is — genuinely pedagogical
- Real CS playbooks: not "be proactive" platitudes but specific intervention triggers (e.g., "if health score drops below yellow for 2 consecutive periods, escalate")
- QBR template is production-ready — has ROI summary tables, value-delivered sections, next-quarter planning
- Success plan template, onboarding checklist, executive business review — all structured with fill-in tables
- Uses real industry frameworks: DAU/MAU ratio, NPS scoring methodology, multi-threading depth
**Structure: STRONG**
- SKILL.md has proper frontmatter, TOC, input/output schemas, limitations section
- References are actually used by the scripts (health-scoring-framework.md maps directly to score calculation logic)
- Assets include sample data AND expected output JSON for validation
**Verdict: POWERFUL**
*Evidence: A CS leader could hand this to a team and they'd have a working health scoring system same day. The weighted scoring model with segment-aware thresholds is exactly how real CS platforms (Gainsight, Totango) work. The scripts produce structured JSON that could feed a dashboard.*
---
### 2. business-growth/revenue-operations
**Code Quality: EXCELLENT**
- 3 scripts (496 + 531 + 658 = 1,685 lines total) — the largest script set
- `pipeline_analyzer.py`: Coverage ratios, stage conversion rates, sales velocity formula (Opportunities × Avg Deal × Win Rate / Cycle), deal aging detection, concentration risk warnings
- `forecast_accuracy_tracker.py`: MAPE calculation, period-over-period accuracy trending
- `gtm_efficiency_calculator.py`: CAC, LTV, CAC payback period, magic number, burn multiple — these are real SaaS metrics, not made up
- Proper CLI args, dual output format, input validation
**Problem-Solving Quality: EXCELLENT**
- RevOps metrics guide references real benchmarks (3-4x pipeline coverage, magic number >0.75)
- Pipeline management framework covers qualification methodology
- GTM efficiency benchmarks are industry-standard (Bessemer, OpenView style)
- Templates: pipeline review, forecast report, GTM dashboard — all structured with metric tables
**Structure: STRONG**
- Consistent with customer-success-manager pattern
- Sample data files for all three scripts
- Expected output JSON for validation
**Verdict: POWERFUL**
*Evidence: The pipeline analyzer alone replaces basic Salesforce reporting. The GTM efficiency calculator uses the exact metrics VCs and board members ask for (magic number, burn multiple, CAC payback). A RevOps manager would find real utility here.*
---
### 3. business-growth/sales-engineer
**Code Quality: EXCELLENT**
- 3 scripts (557 + 525 + 765 = 1,847 lines total) — largest individual script set
- `rfp_response_analyzer.py`: Weighted coverage scoring (Full/Partial/Planned/Gap × Must-have/Should-have/Nice-to-have), automated bid/no-bid recommendation with configurable thresholds
- `competitive_matrix_builder.py`: Feature-by-feature comparison with differentiator/vulnerability identification
- `poc_planner.py`: Timeline generation, resource planning, success criteria definition, evaluation scorecards
- 765-line POC planner is genuinely comprehensive
**Problem-Solving Quality: EXCELLENT**
- 5-phase workflow (Discovery → Solution Design → Demo → POC → Close) maps to real SE methodology
- RFP analyzer produces structured gap analysis with mitigation strategies — not just "you have gaps"
- Competitive positioning framework includes feature-level comparison, not just "we're better"
- Demo script template and POC scorecard are practitioner-level artifacts
- Technical proposal template has architecture section
**Structure: STRONG**
- Same consistent pattern as other business-growth skills
- Rich asset set: demo script template, POC scorecard, technical proposal template, sample RFP data
- References cover competitive positioning, POC best practices, RFP response methodology
**Verdict: POWERFUL**
*Evidence: The RFP analyzer with weighted coverage scoring and bid/no-bid recommendation is something SEs actually need and usually do in spreadsheets. The POC planner at 765 lines is the most substantive single script in this batch. A pre-sales team could adopt this immediately.*
---
### 4. finance/financial-analyst
**Code Quality: EXCELLENT**
- 4 scripts (432 + 449 + 406 + 494 = 1,781 lines total)
- `ratio_calculator.py`: 20+ ratios across 5 categories (profitability, liquidity, leverage, efficiency, valuation) — ROE, ROA, DSCR, DSO, EV/EBITDA, PEG ratio
- `dcf_valuation.py`: Full DCF model with WACC via CAPM, 5-year projections, terminal value (perpetuity growth AND exit multiple methods), two-way sensitivity analysis, equity bridge
- `budget_variance_analyzer.py`: Favorable/unfavorable classification by department and category
- `forecast_builder.py`: Driver-based forecasting with scenario modeling (base/bull/bear)
- All stdlib only, handles edge cases (inf values in JSON serialization)
**Problem-Solving Quality: EXCELLENT**
- DCF model implements real finance: CAPM cost of equity, after-tax cost of debt, terminal value via both methods, sensitivity matrix — this is textbook corporate finance done correctly
- Ratio guide includes interpretation context (not just "here's the number" but "here's what it means")
- Valuation methodology reference explains when to use DCF vs. comparables vs. precedent transactions
- Forecasting best practices cover driver-based vs. trend-based approaches
- Variance report template is exactly what FP&A teams produce monthly
**Structure: STRONG**
- Consistent format with other skills
- 4 scripts (most of any skill) — comprehensive coverage of analyst workflow
- Sample data, expected output, 3 templates (DCF, forecast, variance)
**Verdict: POWERFUL**
*Evidence: The DCF valuation model alone is genuinely useful — it implements WACC calculation, cash flow projection, terminal value via two methods, and sensitivity analysis. A junior analyst could use this as a learning tool; a senior analyst could use it for quick-and-dirty valuations. The sensitivity table output is exactly what you'd see in an investment banking pitch book.*
---
### 5. marketing-skill/campaign-analytics
**Code Quality: VERY GOOD**
- 3 scripts (347 + 459 + 305 = 1,111 lines total) — smallest script set but still substantive
- `attribution_analyzer.py`: 5 attribution models (first-touch, last-touch, linear, time-decay, position-based) — these are the real standard models used in marketing analytics
- `campaign_roi_calculator.py`: ROI, ROAS, CPA, CPL, CAC with industry benchmarking
- `funnel_analyzer.py`: Stage-by-stage conversion rates, drop-off identification, bottleneck detection
- Time-decay model uses configurable half-life parameter — not just a label
**Problem-Solving Quality: VERY GOOD**
- Attribution models guide explains when to use each model (rare — most resources just list them)
- Funnel optimization framework covers real concepts (stage-specific interventions, not just "improve conversion")
- Campaign metrics benchmarks provide industry reference points
- A/B test template and channel comparison template are useful artifacts
**Structure: STRONG**
- Consistent with business-growth pattern
- References tied to script functionality
- Sample data with customer journeys for attribution testing
**Verdict: SOLID** (borderline POWERFUL)
*Evidence: The 5 attribution models are correctly implemented and genuinely useful for any marketing team not yet using a dedicated attribution platform. However, the funnel analyzer (305 lines) is thinner than the equivalent scripts in other skills, and the overall scope is narrower than the business-growth skills.*
---
### 6. project-management/jira-expert
**Code Quality: N/A** — No scripts
**Problem-Solving Quality: GOOD**
- JQL examples reference is genuinely useful — covers sprint queries, team workload, SLA tracking, change management queries
- Automation examples reference covers real Jira automation rules
- SKILL.md has comprehensive workflow descriptions for project creation, workflow design, JQL building
- Actually teaches JQL syntax with practical examples, not just theory
**Structure: ADEQUATE**
- No scripts, no assets, no sample data
- But the references are substantive (415 + 423 = 838 lines of reference material)
- Workflows reference other PM skills (Scrum Master, Confluence Expert) — good cross-linking
**Verdict: SOLID**
*Evidence: The JQL examples alone are a legitimate reference resource. The automation examples cover real-world rules. But without scripts or structured output tooling, this is fundamentally a knowledge-base skill, not a tool skill. It makes Claude better at Jira advice but doesn't produce artifacts.*
---
### 7. project-management/confluence-expert
**Code Quality: N/A** — No scripts
**Problem-Solving Quality: GOOD**
- Templates reference (725 lines) contains 10+ ready-to-use Confluence page templates: meeting notes, decision log, project status, runbook, postmortem, ADR, onboarding guide
- Space architecture guidance is practical and specific (max 3 levels deep, naming conventions)
- Macro usage examples are helpful for Confluence power users
**Structure: ADEQUATE**
- Strong reference content compensates for lack of scripts
- Templates are the actual artifact output — when Claude uses this skill, it produces Confluence pages
**Verdict: SOLID**
*Evidence: The templates reference is the real value here — it's a curated library of production-quality Confluence page templates. A team setting up Confluence from scratch would find this genuinely useful. The space architecture guidance reflects real organizational experience.*
---
### 8. project-management/atlassian-admin
**Code Quality: N/A** — No scripts
**Problem-Solving Quality: GOOD**
- SKILL.md is comprehensive at 414 lines covering user provisioning, deprovisioning, group management, permission schemes, security configuration
- Workflows are procedural and actionable (step-by-step with handoffs to other skills)
- Permission scheme design section is practical — distinguishes public/team/restricted/confidential project types
- SSO/SAML and security policy coverage is relevant
**Structure: ADEQUATE**
- No references, no assets — all content in SKILL.md
- Good cross-references to other PM skills (Jira Expert, Confluence Expert)
**Verdict: SOLID**
*Evidence: The user provisioning/deprovisioning workflows with audit steps reflect real admin concerns (content reassignment before account deletion). Permission scheme design is specific enough to be useful. But without reference docs or scripts, it's a well-written playbook rather than a tool.*
---
### 9. project-management/atlassian-templates
**Code Quality: N/A** — No scripts
**Problem-Solving Quality: GOOD**
- SKILL.md at 751 lines is the longest PM skill — contains actual template content inline
- Template creation process (10-step) and modification process (8-step) are well-structured
- Contains ready-to-use templates: meeting notes, decision log, sprint planning, retrospective, project charter
- Blueprint development workflow is practical
**Structure: ADEQUATE**
- All content in SKILL.md — no separate references or assets
- Templates are embedded directly rather than in a templates/ directory
**Verdict: SOLID**
*Evidence: The templates themselves are the deliverable, and they're decent. The template governance process (versioning, deprecation, migration) shows organizational maturity. However, significant overlap with confluence-expert/references/templates.md raises questions about redundancy.*
---
### 10. project-management/scrum-master
**Code Quality: N/A** — No scripts
**Problem-Solving Quality: MEDIOCRE**
- SKILL.md at 189 lines is thin — covers basic Scrum ceremonies at a surface level
- Nothing here goes beyond what's in the Scrum Guide
- No velocity tracking formulas, no capacity planning models, no sprint health metrics
- Retro formats reference (336 lines) is the saving grace — covers Start/Stop/Continue, Glad/Sad/Mad, 4Ls, Sailboat, DAKI formats with actual process steps
**Structure: WEAK**
- No assets, no sample data
- Single reference file
- Cross-references to Jira Expert and Confluence Expert add some value
**Verdict: GENERIC**
*Evidence: A certified Scrum Master would find nothing new here. The retro formats reference is genuinely useful but is the only substantive content. The SKILL.md reads like a job description, not a methodology. No metrics, no anti-patterns, no "when sprints go wrong" playbooks. Missing: burndown analysis tools, velocity prediction, capacity planning scripts.*
---
### 11. project-management/senior-pm
**Code Quality: N/A** — No scripts
**Problem-Solving Quality: WEAK**
- SKILL.md at 146 lines is the thinnest skill in the entire batch
- `references/api_reference.md` is literally a placeholder: "This is a placeholder for detailed reference documentation. Replace with actual reference content or delete if not needed."
- Content is generic PM advice: "develop product roadmaps aligned with business objectives," "identify and mitigate project risks"
- No frameworks, no decision models, no risk quantification methods
- No RACI template, no project charter template despite mentioning them
**Structure: WEAK**
- Placeholder reference file is a red flag
- No assets, no templates, no sample data
- Mentions creating artifacts (RACI matrix, project charter) but provides no templates
**Verdict: WEAK**
*Evidence: The placeholder reference file tells the whole story — this skill was scaffolded but never completed. A senior PM would find nothing actionable. Compare to the financial-analyst skill (1,781 lines of working code + templates) vs. this (146 lines of generic advice + a placeholder). This is "act as a Senior PM" prompting dressed up as a skill.*
---
## Comparative Analysis
| Skill | Scripts (LOC) | References | Assets/Templates | Verdict |
|-------|--------------|------------|-------------------|---------|
| customer-success-manager | 3 (1,339) | 3 deep | 5 templates + sample data | **POWERFUL** |
| revenue-operations | 3 (1,685) | 3 deep | 7 templates + sample data | **POWERFUL** |
| sales-engineer | 3 (1,847) | 3 deep | 5 templates + sample data | **POWERFUL** |
| financial-analyst | 4 (1,781) | 3 deep | 4 templates + sample data | **POWERFUL** |
| campaign-analytics | 3 (1,111) | 3 deep | 5 templates + sample data | **SOLID** |
| jira-expert | 0 | 2 substantive | 0 | **SOLID** |
| confluence-expert | 0 | 1 (725 lines) | 0 | **SOLID** |
| atlassian-admin | 0 | 0 | 0 | **SOLID** |
| atlassian-templates | 0 | 0 | 0 | **SOLID** |
| scrum-master | 0 | 1 (336 lines) | 0 | **GENERIC** |
| senior-pm | 0 | 1 (placeholder!) | 0 | **WEAK** |
## Key Observations
### What Works (business-growth, finance, campaign-analytics)
1. **Scripts that actually compute things** — Not wrappers, not boilerplate. Real algorithms with real business logic (DCF valuation, attribution modeling, health scoring)
2. **Zero external dependencies** — Deliberate stdlib-only design means they run anywhere, immediately
3. **Dual output format** — JSON for automation, text for humans. This is good engineering
4. **Sample data + expected output** — Enables validation and serves as documentation
5. **References that explain *why*** — The health scoring framework doesn't just list metrics; it explains why each dimension is weighted as it is
6. **Templates that are fill-in-ready** — QBR template, variance report, demo script — these save real time
### What Doesn't Work (parts of project-management)
1. **Senior PM is unfinished** — Placeholder reference file, no templates despite claiming to produce them
2. **Scrum Master is generic** — Doesn't exceed the Scrum Guide in depth or utility
3. **No scripts in any PM skill** — The business-growth skills prove that scripts add massive value. The PM skills could have had: sprint velocity calculator, capacity planner, risk matrix scorer, RACI generator
4. **Two-tier quality** — The gap between POWERFUL and WEAK skills in the same repo is jarring
### Recommendations
1. **Senior PM needs a complete rewrite or removal** — The placeholder reference is unacceptable. Either build it to the standard of financial-analyst (scripts + real frameworks) or don't ship it
2. **Scrum Master needs depth** — Add velocity tracking scripts, burndown analysis, capacity planning calculator, sprint health scorer
3. **PM skills should get scripts** — Even simple ones: RACI matrix generator, risk register scorer, project status report formatter
4. **Deduplicate PM templates** — atlassian-templates and confluence-expert overlap significantly
5. **Add expected_output.json to PM skills** — If they can't have scripts, at least define what "good output" looks like
---
*Report generated 2026-02-15. Skills assessed against the bar: "Would this make someone say 'holy shit, this actually knows what it's doing?'"*
*Business-growth and finance skills clear that bar. Campaign-analytics nearly does. PM skills mostly don't.*