skill-seekers-reference

firefrost-gaming/skill-seekers-reference

Author	SHA1	Message	Date
yusyus	c89f059712	feat(v2.7.0): Smart Rate Limit Management & Multi-Token Configuration Major Features: - Multi-profile GitHub token system with secure storage - Smart rate limit handler with 4 strategies (prompt/wait/switch/fail) - Interactive configuration wizard with browser integration - Configurable timeout (default 30 min) per profile - Automatic profile switching on rate limits - Live countdown timers with real-time progress - Non-interactive mode for CI/CD (--non-interactive flag) - Progress tracking and resume capability (skeleton) - Comprehensive test suite (16 tests, all passing) Solves: - Indefinite waiting on GitHub rate limits - Confusing GitHub token setup Files Added: - src/skill_seekers/cli/config_manager.py (~490 lines) - src/skill_seekers/cli/config_command.py (~400 lines) - src/skill_seekers/cli/rate_limit_handler.py (~450 lines) - src/skill_seekers/cli/resume_command.py (~150 lines) - tests/test_rate_limit_handler.py (16 tests) Files Modified: - src/skill_seekers/cli/github_fetcher.py (rate limit integration) - src/skill_seekers/cli/github_scraper.py (--non-interactive, --profile flags) - src/skill_seekers/cli/main.py (config, resume subcommands) - pyproject.toml (version 2.7.0) - CHANGELOG.md, README.md, CLAUDE.md (documentation) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-17 18:38:31 +03:00
yusyus	2019a02b51	docs: Update CLAUDE.md to v2.6.0 with complete C3.x suite Updates: - Version: v2.5.2 → v2.6.0 - Added complete C3.x feature documentation (C3.1-C3.8) - Updated Recent Achievements section with v2.6.0 release info - Expanded C3.x descriptions with all 8 features - Documented C3.8 Standalone Codebase Scraper C3.x Suite Now Complete: - C3.1: Design pattern detection (10 GoF patterns, 9 languages, 87% precision) - C3.2: Test example extraction (5 categories, AST-based) - C3.3: How-to guide generation with AI enhancement - C3.4: Configuration pattern extraction - C3.5: Architectural overview & router skill generation - C3.6: AI enhancement for patterns and tests (Claude API integration) - C3.7: Architectural pattern detection (8 patterns, framework-aware) - C3.8: Standalone codebase scraper (300+ line SKILL.md from code alone) Release History Updated: - v2.6.0 (Latest - January 14, 2026) - C3.x suite complete - v2.5.2 - UX improvements (opt-out flags) - v2.5.0 - Multi-platform support - v2.1.0 - Unified multi-source scraping - v1.0.0 - Production release Benefits: - Accurate version information for Claude Code - Complete C3.x feature documentation - Clear release history - Better developer onboarding 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-14 22:52:35 +03:00
yusyus	48b8544dea	docs: Consolidate roadmaps and refactor documentation structure MAJOR REFACTORING: Merge 3 roadmap files into single comprehensive ROADMAP.md Changes: - Merged ROADMAP.md + FLEXIBLE_ROADMAP.md + FUTURE_RELEASES.md → ROADMAP.md - Consolidated 1,008 lines across 3 files into 429 lines (single source of truth) - Removed duplicate/overlapping content - Cleaned up docs archive structure New ROADMAP.md Structure: - Current Status (v2.6.0) - Development Philosophy (task-based approach) - Task-Based Roadmap (136 tasks, 10 categories) - Release History (v1.0.0, v2.1.0, v2.6.0) - Release Planning (v2.7-v2.9) - Long-term Vision (v3.0+) - Metrics & Goals - Contribution guidelines Deleted Files: - FLEXIBLE_ROADMAP.md (merged into ROADMAP.md) - FUTURE_RELEASES.md (merged into ROADMAP.md) - docs/archive/temp/TERMINAL_SELECTION.md (temporary file) - docs/archive/temp/TESTING.md (temporary file) Moved Files: - docs/plans/*.md → docs/archive/plans/ (dated planning docs) Updated References: - CLAUDE.md: FLEXIBLE_ROADMAP.md → ROADMAP.md - docs/README.md: Removed duplicate roadmap references - CHANGELOG.md: Updated documentation references Benefits: - Single source of truth for roadmap - No duplicate maintenance - Cleaner repository structure - Better discoverability - Historical context preserved in archive/ 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-14 22:36:03 +03:00
yusyus	a99e22c639	feat: Multi-Source Synthesis Architecture - Rich Standalone Skills + Smart Combination BREAKING CHANGE: Major architectural improvements to multi-source skill generation This commit implements the complete "Multi-Source Synthesis Architecture" where each source (documentation, GitHub, PDF) generates a rich standalone SKILL.md file before being intelligently synthesized with source-specific formulas. ## 🎯 Core Architecture Changes ### 1. Rich Standalone SKILL.md Generation (Source Parity) Each source now generates comprehensive, production-quality SKILL.md files that can stand alone OR be synthesized with other sources. GitHub Scraper Enhancements (+263 lines): - Now generates 300+ line SKILL.md (was ~50 lines) - Integrates C3.x codebase analysis data: - C2.5: API Reference extraction - C3.1: Design pattern detection (27 high-confidence patterns) - C3.2: Test example extraction (215 examples) - C3.7: Architectural pattern analysis - Enhanced sections: - ⚡ Quick Reference with pattern summaries - 📝 Code Examples from real repository tests - 🔧 API Reference from codebase analysis - 🏗️ Architecture Overview with design patterns - ⚠️ Known Issues from GitHub issues - Location: src/skill_seekers/cli/github_scraper.py PDF Scraper Enhancements (+205 lines): - Now generates 200+ line SKILL.md (was ~50 lines) - Enhanced content extraction: - 📖 Chapter Overview (PDF structure breakdown) - 🔑 Key Concepts (extracted from headings) - ⚡ Quick Reference (pattern extraction) - 📝 Code Examples: Top 15 (was top 5), grouped by language - Quality scoring and intelligent truncation - Better formatting and organization - Location: src/skill_seekers/cli/pdf_scraper.py Result: All 3 sources (docs, GitHub, PDF) now have equal capability to generate rich, comprehensive standalone skills. ### 2. File Organization & Caching System Problem: output/ directory cluttered with intermediate files, data, and logs. Solution: New `.skillseeker-cache/` hidden directory for all intermediate files. New Structure: ``` .skillseeker-cache/{skill_name}/ ├── sources/ # Standalone SKILL.md from each source │ ├── httpx_docs/ │ ├── httpx_github/ │ └── httpx_pdf/ ├── data/ # Raw scraped data (JSON) ├── repos/ # Cloned GitHub repositories (cached for reuse) └── logs/ # Session logs with timestamps output/{skill_name}/ # CLEAN: Only final synthesized skill ├── SKILL.md └── references/ ``` Benefits: - ✅ Clean output/ directory (only final product) - ✅ Intermediate files preserved for debugging - ✅ Repository clones cached and reused (faster re-runs) - ✅ Timestamped logs for each scraping session - ✅ All cache dirs added to .gitignore Changes: - .gitignore: Added `.skillseeker-cache/` entry - unified_scraper.py: Complete reorganization (+238 lines) - Added cache directory structure - File logging with timestamps - Repository cloning with caching/reuse - Cleaner intermediate file management - Better subprocess logging and error handling ### 3. Config Repository Migration Moved to separate config repository: https://github.com/yusufkaraaslan/skill-seekers-configs Deleted from this repo (35 config files): - ansible-core.json, astro.json, claude-code.json - django.json, django_unified.json, fastapi.json, fastapi_unified.json - godot.json, godot_unified.json, godot_github.json, godot-large-example.json - react.json, react_unified.json, react_github.json, react_github_example.json - vue.json, kubernetes.json, laravel.json, tailwind.json, hono.json - svelte_cli_unified.json, steam-economy-complete.json - deck_deck_go_local.json, python-tutorial-test.json, example_pdf.json - test-manual.json, fastapi_unified_test.json, fastmcp_github_example.json - example-team/ directory (4 files) Kept as reference example: - configs/httpx_comprehensive.json (complete multi-source example) Rationale: - Cleaner repository (979+ lines added, 1680 deleted) - Configs managed separately with versioning - Official presets available via `fetch-config` command - Users can maintain private config repos ### 4. AI Enhancement Improvements enhance_skill.py (+125 lines): - Better integration with multi-source synthesis - Enhanced prompt generation for synthesized skills - Improved error handling and logging - Support for source metadata in enhancement ### 5. Documentation Updates CLAUDE.md (+252 lines): - Comprehensive project documentation - Architecture explanations - Development workflow guidelines - Testing requirements - Multi-source synthesis patterns SKILL_QUALITY_ANALYSIS.md (new): - Quality assessment framework - Before/after analysis of httpx skill - Grading rubric for skill quality - Metrics and benchmarks ### 6. Testing & Validation Scripts test_httpx_skill.sh (new): - Complete httpx skill generation test - Multi-source synthesis validation - Quality metrics verification test_httpx_quick.sh (new): - Quick validation script - Subset of features for rapid testing ## 📊 Quality Improvements \| Metric \| Before \| After \| Improvement \| \|--------\|--------\|-------\|-------------\| \| GitHub SKILL.md lines \| ~50 \| 300+ \| +500% \| \| PDF SKILL.md lines \| ~50 \| 200+ \| +300% \| \| GitHub C3.x integration \| ❌ No \| ✅ Yes \| New feature \| \| PDF pattern extraction \| ❌ No \| ✅ Yes \| New feature \| \| File organization \| Messy \| Clean cache \| Major improvement \| \| Repository cloning \| Always fresh \| Cached reuse \| Faster re-runs \| \| Logging \| Console only \| Timestamped files \| Better debugging \| \| Config management \| In-repo \| Separate repo \| Cleaner separation \| ## 🧪 Testing All existing tests pass: - test_c3_integration.py: Updated for new architecture - 700+ tests passing - Multi-source synthesis validated with httpx example ## 🔧 Technical Details Modified Core Files: 1. src/skill_seekers/cli/github_scraper.py (+263 lines) - _generate_skill_md(): Rich content with C3.x integration - _format_pattern_summary(): Design pattern summaries - _format_code_examples(): Test example formatting - _format_api_reference(): API reference from codebase - _format_architecture(): Architectural pattern analysis 2. src/skill_seekers/cli/pdf_scraper.py (+205 lines) - _generate_skill_md(): Enhanced with rich content - _format_key_concepts(): Extract concepts from headings - _format_patterns_from_content(): Pattern extraction - Code examples: Top 15, grouped by language, better quality scoring 3. src/skill_seekers/cli/unified_scraper.py (+238 lines) - __init__(): Cache directory structure - _setup_logging(): File logging with timestamps - _clone_github_repo(): Repository caching system - _scrape_documentation(): Move to cache, better logging - Better subprocess handling and error reporting 4. src/skill_seekers/cli/enhance_skill.py (+125 lines) - Multi-source synthesis awareness - Enhanced prompt generation - Better error handling Minor Updates: - src/skill_seekers/cli/codebase_scraper.py (+3 lines): Minor improvements - src/skill_seekers/cli/test_example_extractor.py: Quality scoring adjustments - tests/test_c3_integration.py: Test updates for new architecture ## 🚀 Migration Guide For users with existing configs: No action required - all existing configs continue to work. For users wanting official presets: ```bash # Fetch from official config repo skill-seekers fetch-config --name react --target unified # Or use existing local configs skill-seekers unified --config configs/httpx_comprehensive.json ``` Cache directory: New `.skillseeker-cache/` directory will be created automatically. Safe to delete - will be regenerated on next run. ## 📈 Next Steps This architecture enables: - ✅ Source parity: All sources generate rich standalone skills - ✅ Smart synthesis: Each combination has optimal formula - ✅ Better debugging: Cached files and logs preserved - ✅ Faster iteration: Repository caching, clean output - 🔄 Future: Multi-platform enhancement (Gemini, GPT-4) - planned - 🔄 Future: Conflict detection between sources - planned - 🔄 Future: Source prioritization rules - planned ## 🎓 Example: httpx Skill Quality Before: 186 lines, basic synthesis, missing data After: 640 lines with AI enhancement, A- (9/10) quality What changed: - All C3.x analysis data integrated (patterns, tests, API, architecture) - GitHub metadata included (stars, topics, languages) - PDF chapter structure visible - Professional formatting with emojis and clear sections - Real-world code examples from test suite - Design patterns explained with confidence scores - Known issues with impact assessment 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-11 23:01:07 +03:00
yusyus	48370a1963	docs: Update CLAUDE.md with streamlined developer guidance - Reduced from 1116 to 526 lines (53% reduction) - Focused on architecture and testing requirements - Removed redundant user-facing documentation - Added critical development notes and workflows 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-01 18:57:29 +03:00
yusyus	5e166c40b9	chore: Bump version to v2.5.1 - Critical PyPI Bug Fix Version Updates: - pyproject.toml: 2.5.0 → 2.5.1 - src/skill_seekers/__init__.py: 2.0.0 → 2.5.1 - src/skill_seekers/cli/__init__.py: 2.0.0 → 2.5.1 - src/skill_seekers/cli/main.py: 2.4.0 → 2.5.1 - src/skill_seekers/mcp/__init__.py: 2.4.0 → 2.5.1 - src/skill_seekers/mcp/tools/__init__.py: 2.4.0 → 2.5.1 CHANGELOG: - Added v2.5.1 release notes documenting PR #221 fix - Critical: Fixed missing skill_seekers.cli.adaptors package - Impact: Restores all multi-platform features for PyPI users Documentation: - Updated CLAUDE.md to v2.5.0 with multi-platform details - Added platform adaptor architecture documentation - Updated test architecture and environment variables Related: PR #221 (merged), Issue #222 (py.typed follow-up) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2025-12-30 23:22:30 +03:00
yusyus	72611af87d	feat(v2.3.0): Add multi-agent installation support Add automatic skill installation to 10+ AI coding agents with a single command. New Features: - New install-agent command for installing skills to any AI agent - Support for 10+ agents: Claude Code, Cursor, VS Code, Amp, Goose, OpenCode, Letta, Aide, Windsurf - Smart path resolution (global ~/.agent vs project-relative .agent/) - Fuzzy agent name matching with suggestions - --agent all flag to install to all agents at once - --force flag to overwrite existing installations - --dry-run flag to preview installations - Comprehensive error handling and user feedback Implementation: - Created install_agent.py (379 lines) with core installation logic - Updated main.py with install-agent subcommand - Updated pyproject.toml with entry point - Added 32 comprehensive tests (all passing, 603 total) - No regressions in existing functionality Documentation: - Updated README.md with multi-agent installation guide - Updated CLAUDE.md with install-agent examples - Updated CHANGELOG.md with v2.3.0 release notes - Added agent compatibility table Technical Details: - 100% own implementation (no external dependencies) - Pure Python using stdlib (shutil, pathlib, argparse) - Compatible with Agent Skills open standard (agentskills.io) - Works offline Closes #210 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2025-12-22 02:04:32 +03:00
yusyus	b7cd317efb	feat(A1.7): Add install_skill MCP tool for one-command workflow automation Implements complete end-to-end skill installation in a single command: fetch_config → scrape_docs → enhance_skill_local → package_skill → upload_skill Changes: - MCP Tool: Added install_skill_tool() to server.py (~300 lines) - Input validation (config_name XOR config_path) - 5-phase orchestration with error handling - Dry-run mode for workflow preview - Mandatory AI enhancement (30-60 sec, 3/10→9/10 quality boost) - Auto-upload to Claude (if ANTHROPIC_API_KEY set) - CLI Integration: New install command - Created install_skill.py CLI wrapper (~150 lines) - Updated main.py with install subcommand - Added entry point to pyproject.toml - Testing: Comprehensive test suite - Created test_install_skill.py with 13 tests - Tests cover validation, dry-run, orchestration, error handling - All tests passing (13/13) - Documentation: Updated all user-facing docs - CLAUDE.md: Added MCP tool (10 tools total) and CLI examples - README.md: Added prominent one-command workflow section - FLEXIBLE_ROADMAP.md: Marked A1.7 as complete Features: - Zero friction: One command instead of 5 separate steps - Quality guaranteed: Mandatory enhancement ensures 9/10 quality - Complete automation: From config to uploaded skill - Intelligent: Auto-detects config type (name vs path) - Flexible: Dry-run, unlimited, no-upload modes - Well-tested: 13 unit tests with mocking Usage: skill-seekers install --config react skill-seekers install --config configs/custom.json --no-upload skill-seekers install --config django --unlimited skill-seekers install --config react --dry-run Closes #204 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2025-12-21 20:17:59 +03:00
yusyus	3c8603e6b7	docs: Update test architecture and CLI details in CLAUDE.md	2025-12-21 14:17:12 +03:00
yusyus	cbacdb0e66	release: v2.1.1 - GitHub Repository Analysis Enhancements Major improvements: - Configurable directory exclusions (Issue #203) - Unlimited local repository analysis - Skip llms.txt option (PR #198) - 10+ bug fixes for GitHub scraper - Test suite expanded to 427 tests See CHANGELOG.md for full details.	2025-11-30 12:22:28 +03:00
yusyus	bd2b201aa5	docs: Update all documentation for v2.1.0 release Updates across all major documentation files to reflect v2.1.0 release status and recent completions. Changes: - CLAUDE.md: * Updated version from v2.0.0 to v2.1.0 * Updated date to November 29, 2025 * Updated test count from 391 to 427 * Moved completed PRs (#195, #198) and Issue #203 to "Completed" section * Updated "Next Up" priorities - README.md: * Updated version badge from 2.0.0 to 2.1.0 * Updated test badge from 379 to 427 passing - CHANGELOG.md: * Added Issue #203 (Configurable EXCLUDED_DIRS) to Unreleased section * Documented 19 comprehensive tests for exclude_dirs feature * Listed both extend and replace modes - FUTURE_RELEASES.md: * Marked v2.1.0 as "Released" (November 29, 2025) * Moved "Fix 12 unified tests" to completed * Updated release schedule table - FLEXIBLE_ROADMAP.md: * Updated current status from v1.0.0 to v2.1.0 * Added latest release date * Expanded "What Works" section with new features * Updated test count to 427 All documentation now accurately reflects: - v2.1.0 release status ✅ - 427 tests passing (up from 391) ✅ - Issue #203 completion ✅ - PR #195 and #198 merged status ✅ Related: #203	2025-11-30 01:06:21 +03:00
yusyus	ea289cebe1	feat: Make EXCLUDED_DIRS configurable for local repository analysis Closes #203 Adds configuration options to customize directory exclusions during local repository analysis, while maintaining backward compatibility with smart defaults. New Config Options: 1. `exclude_dirs_additional` - Extend defaults (most common) - Adds custom directories to default exclusions - Example: ["proprietary", "legacy", "third_party"] - Total exclusions = defaults + additional 2. `exclude_dirs` - Replace defaults (advanced users) - Completely overrides default exclusions - Example: ["node_modules", ".git", "custom_vendor"] - Gives full control over exclusions Implementation: - Modified GitHubScraper.__init__() to parse exclude_dirs config - Changed should_exclude_dir() to use instance variable instead of global - Added logging for custom exclusions (INFO for extend, WARNING for replace) - Maintains backward compatibility (no config = use defaults) Testing: - Added 12 comprehensive tests in test_excluded_dirs_config.py - 3 tests for defaults (backward compatibility) - 3 tests for extend mode - 3 tests for replace mode - 1 test for precedence - 2 tests for edge cases - All 12 new tests passing ✅ - All 22 existing github_scraper tests passing ✅ Documentation: - Updated CLAUDE.md config parameters section - Added detailed "Configurable Directory Exclusions" feature section - Included examples for both modes - Listed common use cases (monorepos, enterprise, legacy codebases) Use Cases: - Monorepos with custom directory structures - Enterprise projects with non-standard naming conventions - Including unusual directories for analysis - Minimal exclusions for small/simple projects Backward Compatibility: ✅ Fully backward compatible - existing configs work unchanged ✅ Smart defaults maintained when no config provided ✅ All existing tests pass Co-authored-by: jimmy058910 <jimmy058910@users.noreply.github.com>	2025-11-29 23:53:27 +03:00
yusyus	bd20b32470	Merge PR #198 : Skip llms.txt Config Option Merges feat/add-skip-llm-to-config by @sogoiii. This PR adds a valuable configuration option to explicitly skip llms.txt detection, useful when a site's llms.txt is incomplete, incorrect, or when specific HTML scraping is needed. Key features: - New 'skip_llms_txt' config option (default: false, backward compatible) - Boolean type validation with warning for invalid values - Support in both sync and async scraping modes - 17 comprehensive tests (15 feature tests + 2 config validation tests) All tests passing after fixing import paths to use proper package names. Test results: ✅ 17/17 tests passing Full test suite: ✅ 391 tests passing Co-authored-by: sogoiii <sogoiii@users.noreply.github.com>	2025-11-29 22:56:46 +03:00
yusyus	cf77f9e392	docs: Update test status - all 391 tests passing including unified tests All unified scraping tests are now passing! Updated documentation to reflect current status. Changes: 1. CLAUDE.md - Updated test status throughout - Changed "⚠️ 12 unified tests need fixes" to "✅ All 22 unified tests passing" - Updated test count from 379 to 391 tests - Marked unified configs as ✅ (all 5 working and tested) - Updated "Next Up" section with completed items - Updated last verification date to Nov 29, 2025 2. README.md - Updated test count - Changed "379 tests" to "391 tests" 3. docs/CLAUDE.md - Updated test documentation - Updated test counts throughout - Removed outdated warnings about failing tests Test Status: - ✅ tests/test_unified.py: 18/18 passing - ✅ tests/test_unified_mcp_integration.py: 4/4 passing - ✅ Total: 391 tests passing, 32 skipped Unified Scraping: - All 5 unified configs verified and working - Conflict detection fully tested - Rule-based and AI merge modes tested - Feature is production-ready Task 2.2 Complete - No code changes needed, tests were already passing!	2025-11-29 22:20:43 +03:00
sogoiii	91692db87c	📝 docs: add skip_llms_txt to config parameters documentation	2025-11-20 14:00:55 -08:00
yusyus	5ee07a2181	docs: Update CLAUDE.md for v2.0.0 PyPI release Major updates for v2.0.0: - Added PyPI publication status and installation instructions - Updated to reflect modern Python packaging (src/ layout, pyproject.toml) - Updated all commands to use 'skill-seekers' CLI instead of python3 cli/* - Updated file structure section for src/ layout - Updated key code locations with new paths - Added FUTURE_RELEASES.md to documentation list - Updated test count (379 passing, all CI checks green) - Updated date to November 11, 2025 - Added development workflow section - Reorganized Additional Documentation into categories All sections now reflect the post-PyPI publication state of the project.	2025-11-11 23:27:48 +03:00
yusyus	693294be8e	docs: Update CLAUDE.md with new unified CLI commands Updated all command examples to use new entry points: - skill-seekers scrape (was: python3 cli/doc_scraper.py) - skill-seekers unified (was: python3 cli/unified_scraper.py) - skill-seekers estimate (was: python3 cli/estimate_pages.py) - skill-seekers package (was: python3 cli/package_skill.py) - skill-seekers enhance (was: python3 cli/enhance_skill_local.py) - skill-seekers upload (was: python3 cli/upload_skill.py) All 44+ command examples now use modern entry point syntax. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-07 01:25:40 +03:00
yusyus	13b19c2b06	Update CLAUDE.md with current project status - Update date from October 26 to November 6, 2025 - Update test count: 390 tests total, 378 passing, 12 unified tests failing - Update configs inventory: 24 total configs (14 single-source, 5 unified, 5 test) - Add priority task: Fix 12 failing unified tests - Update status: Core functionality stable, unified tests need attention - Add detailed config breakdown by category - Update available configs section with complete categorization 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-11-06 23:23:12 +03:00
yusyus	1e277f80d2	Update documentation for unified multi-source scraping (v2.0.0) Major documentation update explaining the new unified scraping system that combines documentation + GitHub + PDF sources in a single skill with automatic conflict detection. ## Changes: README.md: - Update version badge to v2.0.0 - Add "Unified Multi-Source Scraping" to Key Features section - Add comprehensive Option 5 section showing: - Problem statement (documentation drift) - Solution with code example - Conflict detection types and severity levels - Transparent reporting with side-by-side comparison - List of advantages (identifies gaps, catches changes, single source of truth) - Available unified configs - Link to full guide (docs/UNIFIED_SCRAPING.md) CLAUDE.md: - Update Current Status to v2.0.0 - Add "Major Release: Unified Multi-Source Scraping" in Recent Updates - Update configs count from 11/11 to 15/15 (added 4 unified configs) - Add new "Unified Multi-Source Scraping" section under Core Commands - Include command examples and feature highlights - Explain what makes unified scraping special QUICKSTART.md: - Add Option D: Unified Multi-Source to Step 2 - Add unified configs to Available Presets section - Show react_unified, django_unified, fastapi_unified, godot_unified examples ## Value: This documentation update explains how unified scraping helps developers: - Mix documentation + code in one skill - Automatically detect conflicts (missing_in_docs, missing_in_code, signature_mismatch) - Get transparent side-by-side comparisons with ⚠️ warnings - Identify documentation gaps and outdated docs - Create a single source of truth combining both sources Related to: Phase 7-11 unified scraper implementation (commit `5d8c7e3`) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-26 16:41:58 +03:00
yusyus	319331f5a6	feat: Complete refactoring with async support, type safety, and package structure This comprehensive refactoring improves code quality, performance, and maintainability while maintaining 100% backwards compatibility. ## Major Features Added ### 🚀 Async/Await Support (2-3x Performance Boost) - Added `--async` flag for parallel scraping using asyncio - Implemented `scrape_page_async()` with httpx.AsyncClient - Implemented `scrape_all_async()` with asyncio.gather() - Connection pooling for better resource management - Performance: 18 pg/s → 55 pg/s (3x faster) - Memory: 120 MB → 40 MB (66% reduction) - Full documentation in ASYNC_SUPPORT.md ### 📦 Python Package Structure (Phase 0 Complete) - Created cli/__init__.py for clean imports - Created skill_seeker_mcp/__init__.py (renamed from mcp/) - Created skill_seeker_mcp/tools/__init__.py - Proper package imports: `from cli import constants` - Better IDE support and autocomplete ### ⚙️ Centralized Configuration - Created cli/constants.py with 18 configuration constants - DEFAULT_ASYNC_MODE, DEFAULT_RATE_LIMIT, DEFAULT_MAX_PAGES - Enhancement limits, categorization scores, file limits - All magic numbers now centralized and configurable ### 🔧 Code Quality Improvements - Converted 71 print() statements to proper logging - Added type hints to all DocToSkillConverter methods - Fixed all mypy type checking issues - Installed types-requests for better type safety - Code quality: 5.5/10 → 6.5/10 ## Testing - Test count: 207 → 299 tests (92 new tests) - 11 comprehensive async tests (all passing) - 16 constants tests (all passing) - Fixed test isolation issues - 100% pass rate maintained (299/299 passing) ## Documentation - Updated README.md with async examples and test count - Updated CLAUDE.md with async usage guide - Created ASYNC_SUPPORT.md (292 lines) - Updated CHANGELOG.md with all changes - Cleaned up temporary refactoring documents ## Cleanup - Removed temporary planning/status documents - Moved test_pr144_concerns.py to tests/ folder - Updated .gitignore for test artifacts - Better repository organization ## Breaking Changes None - all changes are backwards compatible. Async mode is opt-in via --async flag. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-26 13:05:39 +03:00
Preston Brown	de5344caf9	Add virtual environment setup and minimal dependencies (#149 ) ## Changes - Add virtual environment setup instructions to all docs - Create requirements.txt with minimal dependencies (13 packages) - Make anthropic optional (only needed for API enhancement) - Clarify path notation (~ = $HOME, /Users/yourname examples) - Add venv activation reminders throughout documentation ## Files Changed - README.md: Added venv setup section to CLI method - BULLETPROOF_QUICKSTART.md: Replaced Step 4 with venv setup - CLAUDE.md: Updated Prerequisites with venv instructions - requirements.txt: Created with minimal deps (requests, beautifulsoup4, pytest) ## Why - Prevents package conflicts and permission issues - Standard Python development practice - Enables proper pytest usage without pipx complications - Makes setup clearer for beginners	2025-10-22 21:54:05 +03:00
yusyus	ff148cf98f	Update documentation for new Ansible config Added ansible-core.json config to available presets list in: - README.md: Added to preset table and usage examples - CLAUDE.md: Added to production configs list with details Changes: - Total configs: 11 → 12 - New category: DevOps & Automation - Reorganized config list for better categorization Related: PR #147	2025-10-22 21:51:45 +03:00
yusyus	831ea67d58	Update task tracking and CLAUDE.md with latest progress Documentation Updates: ====================== TODO.md: -------- ✅ Added "Completed This Week" section: - H1.1: Issue #8 fixed (bulletproof docs + MCP setup) - H1.2: Issue #7 fixed (11/11 configs working) - H1.4: Issue #4 linked to roadmap - PR #5: Reviewed and approved ✅ Updated "Immediate Tasks" list: - Removed completed tasks - Added H1.3 (example project) as next priority ✅ Updated Progress Tracking: - 10 items completed this week - Clear visibility of accomplishments - Next steps clearly defined NEXT_TASKS.md: -------------- ✅ Marked completed tasks in Starter Pack: - H1.1 (Issue #8) - DONE - H1.2 (Issue #7) - DONE - H1.4 (Issue #4) - DONE - PR #5 Review - DONE ✅ Updated Current Sprint (Oct 20-27): - Monday/Tuesday: 4/4 tasks completed ✅ - Wednesday/Thursday: 3 tasks remaining - Progress: 4/10 tasks (40%) ✅ Added specific accomplishments: - Community engaged (3 issues) - All configs fixed (11/11) - PR security verified - Bulletproof documentation CLAUDE.md: ---------- ✅ Added "Current Status" section at top: - Version: v1.0.0 - Recent updates this week - Community response wins - Next priorities ✅ Added configs status: - 11/11 verified working (100%) - New Laravel config - All selectors tested ✅ Added roadmap reference: - 134 tasks in 22 groups - Project board link - Clear next steps ✅ Added Laravel to Quick Start examples ✅ Added "Available Production Configs" section: - All 11 configs listed with selectors - Content extraction stats - Organized by category - Verification date ✅ Updated Additional Documentation: - Added BULLETPROOF_QUICKSTART.md - Added TROUBLESHOOTING.md - Added FLEXIBLE_ROADMAP.md - Added NEXT_TASKS.md - Added TODO.md Impact: ------- - Clear visibility of progress (4 major items this week) - Updated guidance for Claude Code - Accurate config information (11 working configs) - Better onboarding with new docs - Transparent roadmap tracking Files modified: TODO.md, NEXT_TASKS.md, CLAUDE.md	2025-10-21 00:42:36 +03:00
yusyus	b83f276621	Update Python requirement to 3.10+ for MCP compatibility The MCP package requires Python 3.10 or higher. Updated: - GitHub Actions workflow to test Python 3.10, 3.11, 3.12 - README.md badge to Python 3.10+ - CLAUDE.md prerequisites - CONTRIBUTING.md prerequisites - docs/MCP_SETUP.md prerequisites This fixes the MCP installation error in CI: 'ERROR: No matching distribution found for mcp>=1.0.0' MCP package versions 0.9.1+ all require Python 3.10+.	2025-10-19 22:53:28 +03:00
yusyus	9ce78e9a16	Fix GitHub Actions workflow: Update Python version requirements - Update CI workflow to Python 3.9-3.12 (from 3.7-3.11) - Python 3.7 and 3.8 no longer available on ubuntu-latest (Ubuntu 24.04) - Add fail-fast: false to continue testing on failures - Update all documentation to reflect Python 3.9+ requirement Files updated: - .github/workflows/tests.yml - New Python versions - README.md - Badge updated to Python 3.9+ - CLAUDE.md - Prerequisites updated - CONTRIBUTING.md - Prerequisites updated - docs/MCP_SETUP.md - Prerequisites updated This fixes the failing GitHub Actions tests.	2025-10-19 22:49:14 +03:00
yusyus	d8cc92cd46	Add smart auto-upload feature with API key detection Features: - New upload_skill.py for automatic API-based upload - Smart detection: upload if API key available, helpful message if not - Enhanced package_skill.py with --upload flag - New MCP tool: upload_skill (9 total MCP tools now) - Enhanced MCP tool: package_skill with smart auto-upload - Cross-platform folder opening in utils.py - Graceful error handling throughout Fixes: - Fix missing import os in mcp/server.py - Fix package_skill.py exit code (now 0 when API key missing) - Improve UX with helpful messages instead of errors Tests: 14/14 passed (100%) - CLI tests: 8/8 passed - MCP tests: 6/6 passed Files: +4 new, 5 modified, ~600 lines added	2025-10-19 22:17:23 +03:00
yusyus	1c5801d121	Update documentation for MCP integration Comprehensive documentation updates reflecting MCP integration: README.md: - Add MCP Integration and Tests Passing badges - Enhance MCP section with "Tested and Working" status - Add links to both setup and testing guides docs/MCP_SETUP.md: - Update status to reflect production testing - Add integration testing verification notes - Confirm all 6 tools working with natural language CLAUDE.md: - Add prominent MCP Integration section at top - List all 6 available MCP tools with descriptions - Add setup instructions and production status docs/TEST_MCP_IN_CLAUDE_CODE.md (moved from root): - Relocate testing guide to docs/ for better organization - Provides step-by-step MCP integration testing workflow - Documents complete test suite for all 6 tools All documentation now accurately reflects the fully tested and working MCP integration verified in production Claude Code environment. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-19 19:44:47 +03:00
yusyus	b69f57b60a	Add comprehensive MCP setup guide and integration test template Documentation Added: - docs/MCP_SETUP.md: Complete 400+ line setup guide - Prerequisites and installation steps - Configuration examples for Claude Code - Verification and troubleshooting - 3 usage examples and advanced configuration - End-to-end workflow and quick reference - tests/mcp_integration_test.md: Comprehensive test template - 10 test cases covering all MCP tools - Performance metrics table - Issue tracking and environment setup - Setup and cleanup scripts - .claude/mcp_config.example.json: Example MCP configuration Documentation Updated: - STRUCTURE.md: Complete monorepo structure documentation - CLAUDE.md: All Python script paths updated to cli/ prefix - docs/USAGE.md: All command examples updated for monorepo - TODO.md: Current sprint status and completed tasks Summary: - Issues #2 and #3 handled (MCP setup guide + integration tests) - All documentation now reflects monorepo structure (cli/ + mcp/) - Tests: 71/71 passing (100%) - Ready for MCP server testing with Claude Code 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-19 17:01:37 +03:00
yusyus	9c1a133c51	Add page count estimator for fast config validation - Add estimate_pages.py script (~270 lines) - Fast estimation without downloading content (HEAD requests only) - Shows estimated total pages and recommended max_pages - Validates URL patterns work correctly - Estimates scraping time based on rate_limit - Update CLAUDE.md with estimator workflow and commands - Update README.md features section with estimation benefits - Usage: python3 estimate_pages.py configs/react.json - Time: 1-2 minutes vs 20-40 minutes for full scrape 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-19 02:44:50 +03:00
yusyus	f8c75a3b2d	Add comprehensive CLAUDE.md for Claude Code integration - Add root-level CLAUDE.md with complete guidance for Claude Code - Include Python 3.7+ requirement - Add first-time user workflow with all commands - Include CSS selector testing with BeautifulSoup examples - Add output quality verification commands - Document force re-scrape instructions - Fix package_skill.py path (remove hardcoded /mnt/skills reference) - Add complete config file structure with real examples - Include testing section for selector validation - Add performance metrics table - Document all key code locations with line numbers - Organize by: quick start → architecture → workflows → troubleshooting - Preserve existing docs/CLAUDE.md as detailed technical reference 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-19 01:43:02 +03:00

30 Commits