* chore: update gitignore for audit reports and playwright cache

* fix: add YAML frontmatter (name + description) to all SKILL.md files

  - Added frontmatter to 34 skills that were missing it entirely (0% Tessl score)
  - Fixed name field format to kebab-case across all 169 skills
  - Resolves #284

* chore: sync codex skills symlinks [automated]

* fix: optimize 14 low-scoring skills via Tessl review (#290)

  Tessl optimization: 14 skills improved from ≤69% to 85%+. Closes #285, #286.

* chore: sync codex skills symlinks [automated]

* fix: optimize 18 skills via Tessl review + compliance fix (closes #287) (#291)

  Phase 1: 18 skills optimized via Tessl (avg 77% → 95%). Closes #287.

* feat: add scripts and references to 4 prompt-only skills + Tessl optimization (#292)

  Phase 2: 3 new scripts + 2 reference files for prompt-only skills. Tessl 45-55% → 94-100%.

* feat: add 6 agents + 5 slash commands for full coverage (v2.7.0) (#293)

  Phase 3: 6 new agents (all 9 categories covered) + 5 slash commands.

* fix: Phase 5 verification fixes + docs update (#294)

* chore: sync codex skills symlinks [automated]

* fix: marketplace audit — all 11 plugins validated by Claude Code (#295)

  Marketplace audit: all 11 plugins validated + installed + tested in Claude Code.

* fix: restore 7 removed plugins + revert playwright-pro name to pw

  Reverts two overly aggressive audit changes:

  - Restored content-creator, demand-gen, fullstack-engineer, aws-architect,
    product-manager, scrum-master, skill-security-auditor to marketplace
  - Reverted playwright-pro plugin.json name back to 'pw' (intentional short name)

* refactor: split 21 over-500-line skills into SKILL.md + references (#296)

* chore: sync codex skills symlinks [automated]

* docs: update all documentation with accurate counts and regenerated skill pages

  - Update skill count to 170, Python tools to 213, references to 314 across all docs
  - Regenerate all 170 skill doc pages from latest SKILL.md sources
  - Update CLAUDE.md with v2.1.1 highlights, accurate architecture tree, and roadmap
  - Update README.md badges and overview table
  - Update marketplace.json metadata description and version
  - Update mkdocs.yml, index.md, getting-started.md with correct numbers

* fix: add root-level SKILL.md and .codex/instructions.md to all domains (#301)

  Root cause: CLI tools (ai-agent-skills, agent-skills-cli) look for SKILL.md at the
  specified install path. 7 of 9 domain directories were missing this file, causing
  "Skill not found" errors for bundle installs like:

      npx ai-agent-skills install alirezarezvani/claude-skills/engineering-team

  Fix:

  - Add root-level SKILL.md with YAML frontmatter to 7 domains
  - Add .codex/instructions.md to 8 domains (for Codex CLI discovery)
  - Update INSTALLATION.md with accurate skill counts (53→170)
  - Add troubleshooting entry for the "Skill not found" error

  All 9 domains now have: SKILL.md + .codex/instructions.md + plugin.json

  Closes #301

  Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

* feat: add Gemini CLI + OpenClaw support, fix Codex missing 25 skills

  Gemini CLI:

  - Add GEMINI.md with activation instructions
  - Add scripts/gemini-install.sh setup script
  - Add scripts/sync-gemini-skills.py (194 skills indexed)
  - Add .gemini/skills/ with symlinks for all skills, agents, commands
  - Remove phantom medium-content-pro entries from sync script
  - Add top-level folder filter to prevent gitignored dirs from leaking

  Codex CLI:

  - Fix sync-codex-skills.py missing "engineering" domain (25 POWERFUL skills)
  - Regenerate .codex/skills-index.json: 124 → 149 skills
  - Add 25 new symlinks in .codex/skills/

  OpenClaw:

  - Add OpenClaw installation section to INSTALLATION.md
  - Add ClawHub install + manual install + YAML frontmatter docs

  Documentation:

  - Update INSTALLATION.md with all 4 platforms + accurate counts
  - Update README.md: "three platforms" → "four platforms" + Gemini quick start
  - Update CLAUDE.md with Gemini CLI support in v2.1.1 highlights
  - Update SKILL-AUTHORING-STANDARD.md + SKILL_PIPELINE.md with Gemini steps
  - Add OpenClaw + Gemini to installation locations reference table

  Marketplace: all 18 plugins validated — sources exist, SKILL.md present.

  Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

* feat(product,pm): world-class product & PM skills audit — 6 scripts, 5 agents, 7 commands, 23 references/assets

  Phase 1 — Agent & Command Foundation:

  - Rewrite cs-project-manager agent (55→515 lines, 4 workflows, 6 skill integrations)
  - Expand cs-product-manager agent (408→684 lines, orchestrates all 8 product skills)
  - Add 7 slash commands: /rice, /okr, /persona, /user-story, /sprint-health, /project-health, /retro

  Phase 2 — Script Gap Closure (2,779 lines):

  - jira-expert: jql_query_builder.py (22 patterns), workflow_validator.py
  - confluence-expert: space_structure_generator.py, content_audit_analyzer.py
  - atlassian-admin: permission_audit_tool.py
  - atlassian-templates: template_scaffolder.py (Confluence XHTML generation)

  Phase 3 — Reference & Asset Enrichment:

  - 9 product references (competitive-teardown, landing-page-generator, saas-scaffolder)
  - 6 PM references (confluence-expert, atlassian-admin, atlassian-templates)
  - 7 product assets (templates for PRD, RICE, sprint, stories, OKR, research, design system)
  - 1 PM asset (permission_scheme_template.json)

  Phase 4 — New Agents:

  - cs-agile-product-owner, cs-product-strategist, cs-ux-researcher

  Phase 5 — Integration & Polish:

  - Related Skills cross-references in 8 SKILL.md files
  - Updated product-team/CLAUDE.md (5→8 skills, 6→9 tools, 4 agents, 5 commands)
  - Updated project-management/CLAUDE.md (0→12 scripts, 3 commands)
  - Regenerated docs site (177 pages), updated homepage and getting-started

  Quality audit: 31 files reviewed, 29 PASS, 2 fixed (copy-frameworks.md, governance-framework.md).

  Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

* fix: audit and repair all plugins, agents, and commands

  - Fix 12 command files: correct CLI arg syntax, script paths, and usage docs
  - Fix 3 agents with broken script/reference paths (cs-content-creator,
    cs-demand-gen-specialist, cs-financial-analyst)
  - Add complete YAML frontmatter to 5 agents (cs-growth-strategist,
    cs-engineering-lead, cs-senior-engineer, cs-financial-analyst, cs-quality-regulatory)
  - Fix cs-ceo-advisor related agent path
  - Update marketplace.json metadata counts (224 tools, 341 refs, 14 agents, 12 commands)

  Verified: all 19 scripts pass --help, all 14 agent paths resolve, mkdocs builds clean.

  Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

* fix: repair 25 Python scripts failing --help across all domains

  - Fix Python 3.10+ syntax (float | None → Optional[float]) in 2 scripts
  - Add argparse CLI handling to 9 marketing scripts using raw sys.argv
  - Fix 10 scripts crashing at module level (wrap in __main__, add argparse)
  - Make yaml/prefect/mcp imports conditional with stdlib fallbacks (4 scripts)
  - Fix f-string backslash syntax in project_bootstrapper.py
  - Fix -h flag conflict in pr_analyzer.py
  - Fix tech-debt.md description (score → prioritize)

  All 237 scripts now pass python3 --help verification.

  Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

* fix(product-team): close 3 verified gaps in product skills

  - Fix competitive-teardown/SKILL.md: replace broken references
    DATA_COLLECTION.md → references/data-collection-guide.md and
    TEMPLATES.md → references/analysis-templates.md (workflow was broken
    at steps 2 and 4)
  - Upgrade landing_page_scaffolder.py: add TSX + Tailwind output format
    (--format tsx) matching the SKILL.md promise of Next.js/React components.
    4 design styles (dark-saas, clean-minimal, bold-startup, enterprise).
    TSX is now the default; HTML preserved via --format html
  - Rewrite README.md: fix stale counts (was 5 skills/15+ tools, now
    accurately shows 8 skills/9 tools), remove 7 ghost scripts that never
    existed (sprint_planner.py, velocity_tracker.py, etc.)
  - Fix tech-debt.md description (score → prioritize)

  Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

* release: v2.1.2 — landing page TSX output, brand voice integration, docs update

  - Landing page generator defaults to Next.js TSX + Tailwind CSS (4 design styles)
  - Brand voice analyzer integrated into landing page generation workflow
  - CHANGELOG, CLAUDE.md, README.md updated for v2.1.2
  - All 13 plugin.json + marketplace.json bumped to 2.1.2
  - Gemini/Codex skill indexes re-synced
  - Backward compatible: --format html preserved, no breaking changes

  Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

---------

Co-authored-by: alirezarezvani <5697919+alirezarezvani@users.noreply.github.com>
Co-authored-by: Leo <leo@openclaw.ai>
Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>

#!/usr/bin/env python3
"""
Content Audit Analyzer

Analyzes a Confluence page inventory for content health. Identifies stale pages,
low-engagement content, orphaned pages, oversized documents, and produces a
health score with actionable recommendations.

Usage:
    python content_audit_analyzer.py pages.json
    python content_audit_analyzer.py pages.json --format json
"""

import argparse
import json
import sys
from datetime import datetime
from typing import Any, Dict, List, Optional
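
# Illustrative input shape -- the field names match what the checks below read;
# the sample values are invented, not taken from a real Confluence export:
#
# {
#   "pages": [
#     {
#       "title": "Onboarding Guide",
#       "last_modified": "2024-01-15T09:30:00Z",
#       "view_count": 42,
#       "author": "jdoe",
#       "labels": ["onboarding", "hr"],
#       "word_count": 1200
#     }
#   ]
# }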


# ---------------------------------------------------------------------------
# Audit Configuration
# ---------------------------------------------------------------------------

STALE_THRESHOLD_DAYS = 90
OUTDATED_THRESHOLD_DAYS = 180
LOW_VIEW_THRESHOLD = 5
OVERSIZED_WORD_THRESHOLD = 5000
IDEAL_WORD_RANGE = (200, 3000)

HEALTH_WEIGHTS = {
    "freshness": 0.30,
    "engagement": 0.25,
    "organization": 0.20,
    "size_balance": 0.15,
    "completeness": 0.10,
}
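
# The weights sum to 1.0, so the overall health score is a weighted average of
# the five dimension scores. Worked example with invented dimension scores:
#   80*0.30 + 60*0.25 + 90*0.20 + 70*0.15 + 100*0.10 = 77.5  -> grade "good"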


# ---------------------------------------------------------------------------
# Audit Checks
# ---------------------------------------------------------------------------

def check_stale_pages(
    pages: List[Dict[str, Any]],
    reference_date: datetime,
) -> Dict[str, Any]:
    """Identify pages not updated within the stale threshold."""
    stale = []
    outdated = []

    for page in pages:
        last_modified = _parse_date(page.get("last_modified", ""))
        if not last_modified:
            continue

        days_since_update = (reference_date - last_modified).days

        if days_since_update > OUTDATED_THRESHOLD_DAYS:
            outdated.append({
                "title": page.get("title", "Untitled"),
                "days_since_update": days_since_update,
                "last_modified": page.get("last_modified", ""),
                "author": page.get("author", "unknown"),
            })
        elif days_since_update > STALE_THRESHOLD_DAYS:
            stale.append({
                "title": page.get("title", "Untitled"),
                "days_since_update": days_since_update,
                "last_modified": page.get("last_modified", ""),
                "author": page.get("author", "unknown"),
            })

    total = len(pages)
    stale_count = len(stale) + len(outdated)
    fresh_ratio = 1 - (stale_count / total) if total > 0 else 1
    score = max(0, fresh_ratio * 100)

    return {
        "score": score,
        "stale_pages": stale,
        "outdated_pages": outdated,
        "stale_count": len(stale),
        "outdated_count": len(outdated),
        "fresh_count": total - stale_count,
    }
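
# Classification example (relative to the reference date): a page last touched
# 120 days ago lands in `stale` (90 < 120 <= 180); one touched 200 days ago
# lands in `outdated` (200 > 180); pages with unparseable dates are skipped
# entirely and count as fresh.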


def check_engagement(pages: List[Dict[str, Any]]) -> Dict[str, Any]:
    """Identify low-engagement pages based on view counts."""
    low_engagement = []
    view_counts = []

    for page in pages:
        views = page.get("view_count", 0)
        view_counts.append(views)

        if views < LOW_VIEW_THRESHOLD:
            low_engagement.append({
                "title": page.get("title", "Untitled"),
                "view_count": views,
                "author": page.get("author", "unknown"),
            })

    total = len(pages)
    avg_views = sum(view_counts) / total if total > 0 else 0
    engaged_ratio = 1 - (len(low_engagement) / total) if total > 0 else 1
    score = max(0, engaged_ratio * 100)

    return {
        "score": score,
        "low_engagement_pages": low_engagement,
        "low_engagement_count": len(low_engagement),
        "average_views": round(avg_views, 1),
        "max_views": max(view_counts) if view_counts else 0,
        "min_views": min(view_counts) if view_counts else 0,
    }


def check_organization(pages: List[Dict[str, Any]]) -> Dict[str, Any]:
    """Identify orphaned pages with no labels."""
    orphaned = []

    for page in pages:
        labels = page.get("labels", [])
        if not labels:
            orphaned.append({
                "title": page.get("title", "Untitled"),
                "author": page.get("author", "unknown"),
            })

    total = len(pages)
    labeled_ratio = 1 - (len(orphaned) / total) if total > 0 else 1
    score = max(0, labeled_ratio * 100)

    # Collect label distribution (top 20 labels by page count)
    label_counts = {}
    for page in pages:
        for label in page.get("labels", []):
            label_counts[label] = label_counts.get(label, 0) + 1

    return {
        "score": score,
        "orphaned_pages": orphaned,
        "orphaned_count": len(orphaned),
        "labeled_count": total - len(orphaned),
        "label_distribution": dict(sorted(label_counts.items(), key=lambda x: -x[1])[:20]),
    }


def check_size_balance(pages: List[Dict[str, Any]]) -> Dict[str, Any]:
    """Check for oversized or undersized pages."""
    oversized = []
    undersized = []
    word_counts = []

    for page in pages:
        word_count = page.get("word_count", 0)
        word_counts.append(word_count)

        if word_count > OVERSIZED_WORD_THRESHOLD:
            oversized.append({
                "title": page.get("title", "Untitled"),
                "word_count": word_count,
                "recommendation": "Split into multiple focused pages",
            })
        elif 0 < word_count < 50:
            undersized.append({
                "title": page.get("title", "Untitled"),
                "word_count": word_count,
                "recommendation": "Expand content or merge with related page",
            })

    total = len(pages)
    well_sized = total - len(oversized) - len(undersized)
    balance_ratio = well_sized / total if total > 0 else 1
    score = max(0, balance_ratio * 100)
    avg_words = sum(word_counts) / total if total > 0 else 0

    return {
        "score": score,
        "oversized_pages": oversized,
        "undersized_pages": undersized,
        "oversized_count": len(oversized),
        "undersized_count": len(undersized),
        "average_word_count": round(avg_words),
    }


def check_completeness(pages: List[Dict[str, Any]]) -> Dict[str, Any]:
    """Check pages for required metadata completeness."""
    incomplete = []
    required_fields = ["title", "last_modified", "author"]

    for page in pages:
        missing = [f for f in required_fields if not page.get(f)]
        if missing:
            incomplete.append({
                "title": page.get("title", "Untitled"),
                "missing_fields": missing,
            })

    total = len(pages)
    complete_ratio = 1 - (len(incomplete) / total) if total > 0 else 1
    score = max(0, complete_ratio * 100)

    return {
        "score": score,
        "incomplete_pages": incomplete,
        "incomplete_count": len(incomplete),
        "complete_count": total - len(incomplete),
    }


# ---------------------------------------------------------------------------
# Main Analysis
# ---------------------------------------------------------------------------

def analyze_content_health(data: Dict[str, Any]) -> Dict[str, Any]:
    """Run full content audit analysis."""
    pages = data.get("pages", [])

    if not pages:
        return {
            "health_score": 0,
            "grade": "invalid",
            "error": "No pages found in input data",
            "dimensions": {},
            "action_items": [],
        }

    reference_date = datetime.now()

    # Run all checks
    dimensions = {
        "freshness": check_stale_pages(pages, reference_date),
        "engagement": check_engagement(pages),
        "organization": check_organization(pages),
        "size_balance": check_size_balance(pages),
        "completeness": check_completeness(pages),
    }

    # Calculate weighted health score
    weighted_scores = []
    for dim_name, dim_result in dimensions.items():
        weight = HEALTH_WEIGHTS.get(dim_name, 0.1)
        weighted_scores.append(dim_result["score"] * weight)

    health_score = sum(weighted_scores)

    if health_score >= 85:
        grade = "excellent"
    elif health_score >= 70:
        grade = "good"
    elif health_score >= 55:
        grade = "fair"
    else:
        grade = "poor"

    # Generate action items
    action_items = _generate_action_items(dimensions)

    return {
        "health_score": round(health_score, 1),
        "grade": grade,
        "total_pages": len(pages),
        "dimensions": dimensions,
        "action_items": action_items,
    }
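
# Illustrative call with a single invented page. Relative to "now" this page is
# outdated (>180 days), low-view, and unlabeled, so only the size and
# completeness dimensions score: 0.15*100 + 0.10*100 = 25.0, grade "poor".
#
#   analyze_content_health({"pages": [{
#       "title": "Roadmap", "last_modified": "2023-01-01",
#       "view_count": 2, "author": "pm", "labels": [], "word_count": 120,
#   }]})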


def _generate_action_items(dimensions: Dict[str, Any]) -> List[Dict[str, str]]:
    """Generate prioritized action items from audit findings."""
    items = []

    # Freshness actions
    freshness = dimensions.get("freshness", {})
    if freshness.get("outdated_count", 0) > 0:
        items.append({
            "priority": "high",
            "action": f"Review and update or archive {freshness['outdated_count']} outdated pages (>180 days old)",
            "category": "freshness",
        })
    if freshness.get("stale_count", 0) > 0:
        items.append({
            "priority": "medium",
            "action": f"Review {freshness['stale_count']} stale pages (90-180 days old) for relevance",
            "category": "freshness",
        })

    # Engagement actions
    engagement = dimensions.get("engagement", {})
    if engagement.get("low_engagement_count", 0) > 0:
        items.append({
            "priority": "medium",
            "action": f"Investigate {engagement['low_engagement_count']} low-engagement pages - consider improving discoverability or archiving",
            "category": "engagement",
        })

    # Organization actions
    organization = dimensions.get("organization", {})
    if organization.get("orphaned_count", 0) > 0:
        items.append({
            "priority": "medium",
            "action": f"Add labels to {organization['orphaned_count']} orphaned pages for better categorization",
            "category": "organization",
        })

    # Size actions
    size = dimensions.get("size_balance", {})
    if size.get("oversized_count", 0) > 0:
        items.append({
            "priority": "low",
            "action": f"Split {size['oversized_count']} oversized pages (>5000 words) into focused sub-pages",
            "category": "size",
        })

    # Completeness actions
    completeness = dimensions.get("completeness", {})
    if completeness.get("incomplete_count", 0) > 0:
        items.append({
            "priority": "low",
            "action": f"Fill in missing metadata for {completeness['incomplete_count']} incomplete pages",
            "category": "completeness",
        })

    return items


def _parse_date(date_str: str) -> Optional[datetime]:
    """Parse date string in common formats."""
    formats = [
        "%Y-%m-%d",
        "%Y-%m-%dT%H:%M:%S",
        "%Y-%m-%dT%H:%M:%SZ",
        "%Y-%m-%dT%H:%M:%S.%f",
        "%Y-%m-%dT%H:%M:%S.%fZ",
        "%d/%m/%Y",
        "%m/%d/%Y",
    ]
    for fmt in formats:
        try:
            return datetime.strptime(date_str, fmt)
        except ValueError:
            continue
    return None
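
# Example: _parse_date("2024-01-15T09:30:00Z") -> datetime(2024, 1, 15, 9, 30)
# and _parse_date("not a date") -> None; check_stale_pages skips pages whose
# dates cannot be parsed.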


# ---------------------------------------------------------------------------
# Output Formatting
# ---------------------------------------------------------------------------

def format_text_output(result: Dict[str, Any]) -> str:
    """Format results as readable text report."""
    lines = []
    lines.append("=" * 60)
    lines.append("CONTENT AUDIT REPORT")
    lines.append("=" * 60)
    lines.append("")

    if "error" in result:
        lines.append(f"ERROR: {result['error']}")
        return "\n".join(lines)

    lines.append("HEALTH SUMMARY")
    lines.append("-" * 30)
    lines.append(f"Health Score: {result['health_score']}/100")
    lines.append(f"Grade: {result['grade'].title()}")
    lines.append(f"Total Pages Analyzed: {result['total_pages']}")
    lines.append("")

    # Dimension scores
    lines.append("DIMENSION SCORES")
    lines.append("-" * 30)
    for dim_name, dim_data in result.get("dimensions", {}).items():
        weight = HEALTH_WEIGHTS.get(dim_name, 0)
        lines.append(f"{dim_name.replace('_', ' ').title()} (Weight: {weight:.0%})")
        lines.append(f"  Score: {dim_data['score']:.1f}/100")

        if dim_name == "freshness":
            lines.append(f"  Stale: {dim_data.get('stale_count', 0)}, Outdated: {dim_data.get('outdated_count', 0)}, Fresh: {dim_data.get('fresh_count', 0)}")
        elif dim_name == "engagement":
            lines.append(f"  Low Engagement: {dim_data.get('low_engagement_count', 0)}, Avg Views: {dim_data.get('average_views', 0)}")
        elif dim_name == "organization":
            lines.append(f"  Orphaned (no labels): {dim_data.get('orphaned_count', 0)}, Labeled: {dim_data.get('labeled_count', 0)}")
        elif dim_name == "size_balance":
            lines.append(f"  Oversized: {dim_data.get('oversized_count', 0)}, Undersized: {dim_data.get('undersized_count', 0)}, Avg Words: {dim_data.get('average_word_count', 0)}")
        elif dim_name == "completeness":
            lines.append(f"  Incomplete: {dim_data.get('incomplete_count', 0)}, Complete: {dim_data.get('complete_count', 0)}")
        lines.append("")

    # Action items
    action_items = result.get("action_items", [])
    if action_items:
        lines.append("ACTION ITEMS")
        lines.append("-" * 30)
        for i, item in enumerate(action_items, 1):
            priority = item["priority"].upper()
            lines.append(f"{i}. [{priority}] {item['action']}")
        lines.append("")

    return "\n".join(lines)


def format_json_output(result: Dict[str, Any]) -> Dict[str, Any]:
    """Format results as JSON."""
    return result


# ---------------------------------------------------------------------------
# CLI Interface
# ---------------------------------------------------------------------------

def main() -> int:
    """Main CLI entry point."""
    parser = argparse.ArgumentParser(
        description="Analyze Confluence page inventory for content health"
    )
    parser.add_argument(
        "pages_file",
        help="JSON file with page list (title, last_modified, view_count, author, labels, word_count)",
    )
    parser.add_argument(
        "--format",
        choices=["text", "json"],
        default="text",
        help="Output format (default: text)",
    )

    args = parser.parse_args()

    try:
        with open(args.pages_file, "r") as f:
            data = json.load(f)

        result = analyze_content_health(data)

        if args.format == "json":
            print(json.dumps(format_json_output(result), indent=2))
        else:
            print(format_text_output(result))

        return 0

    except FileNotFoundError:
        print(f"Error: File '{args.pages_file}' not found", file=sys.stderr)
        return 1
    except json.JSONDecodeError as e:
        print(f"Error: Invalid JSON in '{args.pages_file}': {e}", file=sys.stderr)
        return 1
    except Exception as e:
        print(f"Error: {e}", file=sys.stderr)
        return 1


if __name__ == "__main__":
    sys.exit(main())