- Change _walk_directory to check relative paths instead of absolute paths
- Fixes issue where SKIP_DIRS containing 'tmp' was skipping all files under /tmp/
- This was causing test failures on Ubuntu (tests use tempfile.mkdtemp() which creates under /tmp)
- Now only skips directories that are within the search directory, not in the absolute path
Fixes test_config_extractor.py failures on Ubuntu
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
PROBLEM:
- Config extractor crashed on JSON files with arrays at root
- Error: "'list' object has no attribute 'items'"
- Example: save.json with [{"name": "item1"}, {"name": "item2"}]
- Only handled dict roots, not list roots
SOLUTION:
- Added type checking in _parse_json() and _parse_yaml()
- Handle three cases:
1. Dict at root: extract normally (existing behavior)
2. List at root: iterate and extract from each dict item
3. Primitive at root: skip with debug log
- List items are prefixed with [index] in nested path
CHANGES:
- config_extractor.py _parse_json(): Added isinstance checks
- config_extractor.py _parse_yaml(): Added list handling
EXAMPLE:
Before: WARNING: Error parsing save.json: 'list' object has no attribute 'items'
After: Extracts settings with paths like "[0].name", "[1].value"
IMPACT:
- No more crashes on valid JSON/YAML arrays
- Better coverage of config file variations
- Handles game save files, API responses, data arrays
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Completes the implementation for Unity/Unreal/Godot game engine support
and adds missing "local" source type validation.
Changes:
- Add "local" to VALID_SOURCE_TYPES in config_validator.py
- Add _validate_local_source() method with full validation
- Add Unity/Unreal/Godot to FRAMEWORK_MARKERS for priority detection
- Add game engine directory exclusions to all 3 scrapers:
* Unity: Library/, Temp/, Logs/, UserSettings/, etc.
* Unreal: Intermediate/, Saved/, DerivedDataCache/
* Godot: .godot/, .import/
- Prevents scanning massive build cache directories (saves GBs + hours)
This completes all features mentioned in PR #278:
✅ Unity/Unreal/Godot framework detection with priority
✅ Pattern enhancement performance fix (grouped approach)
✅ Game engine directory exclusions
✅ Phase 5 SKILL.md AI enhancement
✅ Local source references copying
✅ "local" source type validation
✅ Config field name compatibility
✅ C# test example extraction
Tested:
- All unified config tests pass (18/18)
- All config validation tests pass (28/28)
- Ready for Unity project testing
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Fixed incorrect variable names in list comprehensions that were causing
NameError in CI (Python 3.11/3.12):
Critical fixes:
- tests/test_markdown_parsing.py: 'l' → 'link' in list comprehension
- src/skill_seekers/cli/pdf_extractor_poc.py: 'l' → 'line' (2 occurrences)
Additional auto-lint fixes:
- Removed unused imports in llms_txt_downloader.py, llms_txt_parser.py
- Fixed comparison operators in config files
- Fixed list comprehension in other files
All tests now pass in CI.
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
Add comprehensive AI enhancement to C3.4 Configuration Pattern Extraction
similar to C3.3's dual-mode architecture (API + LOCAL).
NEW CAPABILITIES (What users can do now):
1. **AI-Powered Config Analysis** - Understand what configs do, not just extract them
- Explanations: What each configuration setting does
- Best Practices: Suggested improvements and better organization
- Security Analysis: Identifies hardcoded secrets, exposed credentials
- Migration Suggestions: Opportunities to consolidate configs
- Context: Explains detected patterns and when to use them
2. **Dual-Mode AI Support** (Same as C3.3):
- API Mode: Claude API analyzes configs (requires ANTHROPIC_API_KEY)
- LOCAL Mode: Claude Code CLI (FREE, no API key needed)
- AUTO Mode: Automatically detects best available mode
3. **Seamless Integration**:
- CLI: --enhance, --enhance-local, --ai-mode flags
- Codebase Scraper: Works with existing enhance_with_ai parameter
- MCP Tools: Enhanced extract_config_patterns with AI parameters
- Optional: Enhancement only runs when explicitly requested
Components Added:
- ConfigEnhancer class (~400 lines) - Dual-mode AI enhancement engine
- Enhanced CLI flags in config_extractor.py
- AI integration in codebase_scraper.py config extraction workflow
- MCP tool parameter expansion (enhance, enhance_local, ai_mode)
- FastMCP server tool signature updates
- Comprehensive documentation in CHANGELOG.md and README.md
Performance:
- Basic extraction: ~3 seconds for 100 config files
- With AI enhancement: +30-60 seconds (LOCAL mode, FREE)
- With AI enhancement: +20-40 seconds (API mode, ~$0.10-0.20)
Use Cases:
- Security audits: Find hardcoded secrets across all configs
- Migration planning: Identify consolidation opportunities
- Onboarding: Understand what each config file does
- Best practices: Get improvement suggestions for config organization
Technical Details:
- Structured JSON prompts for reliable AI responses
- 5 enhancement categories: explanations, best_practices, security, migration, context
- Graceful fallback if AI enhancement fails
- Security findings logged separately for visibility
- Results stored in JSON under 'ai_enhancements' key
Testing:
- 28 comprehensive tests in test_config_extractor.py
- Tests cover: file detection, parsing, pattern detection, enhancement modes
- All integrations tested: CLI, codebase_scraper, MCP tools
Documentation:
- CHANGELOG.md: Complete C3.4 feature description
- README.md: Updated C3.4 section with AI enhancement
- MCP tool descriptions: Added AI enhancement details
Related Issues: #74🤖 Generated with [Claude Code](https://claude.com/claude-code)
Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>