skill-seekers-reference

firefrost-gaming/skill-seekers-reference

Author	SHA1	Message	Date
yusyus	22bdd4f5f6	fix: sync CLI flags across analyze/pdf/unified commands and fix workflow JSON config Flag/option synchronization fixes: - analyze: add --dry-run, --api-key, and all workflow flags (--enhance-workflow, --enhance-stage, --var, --workflow-dry-run) via WORKFLOW_ARGUMENTS merge - pdf: add --api-key to PDF_ARGUMENTS; replace 5 hardcoded add_argument() calls in pdf_scraper.py:main() with add_pdf_arguments() to activate all defined args - unified: add --api-key and --enhance-level (global override) to UNIFIED_ARGUMENTS and standalone parser; wire enhance_level CLI override into run() per-source loop - codebase_scraper: fix --enhance-workflow to use action="append" (was type=str), enabling multiple workflow chaining instead of silently dropping all but last ConfigManager test isolation fix: - __init__ now reads self.CONFIG_DIR/CONFIG_FILE/PROGRESS_DIR class variables instead of calling _get_config_dir()/_get_progress_dir() directly, enabling monkeypatching in tests (fixes pre-existing test_add_and_retrieve_github_profile) Workflow JSON config support in unified_scraper: - Phase 5 now reads workflows/workflow_stages/workflow_vars from top-level JSON config and merges them with CLI args (CLI-first ordering); supports running workflows even when unified scraper is called without CLI args (args=None) Tests: 1,949 passed, 0 failed (added 18 new tests across 3 test files) Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-02-22 00:44:02 +03:00
yusyus	3fc4b54164	fix: Remove duplicate import os statements causing UnboundLocalError ## Critical Bugs Fixed ### 1. UnboundLocalError in AI Enhancement Modules (BLOCKING) Issue: Duplicate `import os` statements inside conditional blocks caused UnboundLocalError when accessing os.environ before the import was reached. Files Fixed: - src/skill_seekers/cli/guide_enhancer.py (lines 92, 112) - src/skill_seekers/cli/ai_enhancer.py (line 77) - src/skill_seekers/cli/config_enhancer.py (line 82) Root Cause: `os` was already imported at file top, but re-imported inside conditional blocks, creating a local variable scope issue. Solution: Removed duplicate import statements - os is already available from the top-level import. Impact: Fixed 30 failing guide_enhancer tests ### 2. PDF Scraper Test Expectations (BREAKING CHANGE) Issue: Tests expected old keyword-based categorization behavior, but PR introduced new single-file strategy for single PDF sources. Files Fixed: - tests/test_pdf_scraper.py (5 tests updated) Tests Updated: 1. test_categorize_by_keywords 2. test_build_skill_creates_reference_files 3. test_code_blocks_included_in_references 4. test_high_quality_code_preferred 5. test_image_references_in_markdown Solution: Updated test expectations to match new single-file strategy behavior (single PDF → single category named after PDF basename). Impact: Fixed 5 failing PDF scraper tests ## Test Results Before Fixes: 35 tests failing After Fixes: 130 tests passing, 5 skipped ✅ ### Tested Modules: - ✅ PDF scraper (18 tests) - ✅ Guide enhancer (30 tests) - ✅ All adaptors (82 tests) ## Verification ```bash pytest tests/test_pdf_scraper.py tests/test_guide_enhancer.py tests/test_adaptors/ -v # Result: 130 passed, 5 skipped in 1.11s ``` ## Notes The original PR features (GLM-4.7 support + PDF scraper improvements) are excellent and working correctly. These fixes only address the import scoping bug introduced during implementation and update tests for the new behavior. Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-27 21:11:04 +03:00
yusyus	ec3e0bf491	fix: Resolve 61 critical linting errors Fixed priority linting errors to improve code quality: Critical Fixes: - F821 (2 errors): Fixed undefined name 'original_result' in config_enhancer.py - UP035 (2 errors): Removed deprecated typing.Dict and typing.Type imports - F401 (27 errors): Removed unused imports and added noqa for availability checks - E722 (19 errors): Replaced bare 'except:' with 'except Exception:' Code Quality Improvements: - SIM201 (4 errors): Simplified 'not x == y' to 'x != y' - SIM118 (2 errors): Removed unnecessary .keys() in dict iterations - E741 (4 errors): Renamed ambiguous variable 'l' to 'line' - I001 (1 error): Sorted imports in test_bootstrap_skill.py All modified areas tested and passing: - test_scraper_features.py: 42 passed - test_integration.py: 51 passed - test_architecture_scenarios.py: 11 passed - test_real_world_fastmcp.py: 19 passed (1 skipped) Remaining linting errors: 249 (mostly code style suggestions like ARG002, F841, SIM102) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>	2026-01-17 22:54:40 +03:00
Pablo Estevez	c33c6f9073	change max lenght	2026-01-17 17:48:15 +00:00
Pablo Estevez	5ed767ff9a	run ruff	2026-01-17 17:29:21 +00:00
yusyus	50e0bfd19b	fix: Update test file imports to use proper package paths Fixed import errors in test_pdf_scraper.py and test_github_scraper.py: - Replaced absolute imports with proper package imports - Changed 'from pdf_scraper import' to 'from skill_seekers.cli.pdf_scraper import' - Changed 'from github_scraper import' to 'from skill_seekers.cli.github_scraper import' - Updated all @patch() decorators to use full module paths - Removed sys.path manipulation workarounds This completes the fix for import issues discovered during Task 1.2 (Issue #193). Test Results: - test_pdf_scraper.py: 18/18 passed ✅ - test_github_scraper.py: 22/22 passed ✅	2025-11-29 21:55:46 +03:00
yusyus	7cc3d8b175	Fix all tests: 297/297 passing, 0 skipped, 0 failed CHANGES: 1. Fixed 9 PDF Scraper Test Failures: - Added .get() safety for missing page keys (headings, text, code_blocks, images) - Supported both 'code_samples' and 'code_blocks' keys for compatibility - Fixed extract_pdf() to raise RuntimeError on failure (tests expect exception) - Added image saving functionality to _generate_reference_file() - Updated all test methods to override skill_dir with temp directory - Fixed categorization to handle pre-categorized test data 2. Fixed 25 MCP Test Skips: - Renamed mcp/ directory to skill_seeker_mcp/ to avoid shadowing external mcp package - Updated all imports in tests/test_mcp_server.py - Simplified skill_seeker_mcp/server.py import logic (no more shadowing workarounds) - Updated tests/test_package_structure.py to reference skill_seeker_mcp 3. Test Results: - ✅ 297 tests passing (100%) - ✅ 0 tests skipped - ✅ 0 tests failed - All test categories passing: * 23 package structure tests * 18 PDF scraper tests * 67 PDF extractor/advanced tests * 25 MCP server tests * 164 other core tests BREAKING CHANGE: MCP server directory renamed from `mcp/` to `skill_seeker_mcp/` 📦 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-26 00:51:18 +03:00
yusyus	394eab218e	Add PDF Advanced Features (v1.2.0) Priority 2 & 3 Features Implemented: - OCR support for scanned PDFs (pytesseract + Pillow) - Password-protected PDF support - Complex table extraction - Parallel page processing (3x faster) - Intelligent caching (50% faster re-runs) Testing: - New test file: test_pdf_advanced_features.py (26 tests) - Updated test_pdf_extractor.py (23 tests) - Updated test_pdf_scraper.py (18 tests) - Total: 49/49 PDF tests passing (100%) - Overall: 142/142 tests passing (100%) Documentation: - Added docs/PDF_ADVANCED_FEATURES.md (580 lines) - Updated CHANGELOG.md with v1.1.0 and v1.2.0 - Updated README.md version badges and features - Updated docs/TESTING.md with new test counts Dependencies: - Added Pillow==11.0.0 - Added pytesseract==0.3.13 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>	2025-10-23 21:43:05 +03:00

8 Commits