feat: Add unlimited local repository analysis and fix 10 critical bugs
Features: - Add local_repo_path config parameter for unlimited file analysis - Auto-exclude virtual environments and build artifacts (95% noise reduction) - Enable comprehensive codebase analysis (50 → 323 files, 546% increase) Bug Fixes: - Fix logger initialization error (Issue #190) - Fix NoneType subscriptable errors in release tag parsing (3 instances) - Fix relative import paths causing ModuleNotFoundError - Fix hardcoded 50-file analysis limit - Fix GitHub API file tree limitation (140 → 345 files discovered) - Fix AST parser 'not iterable' errors (95 → 0 parsing failures) - Fix virtual environment file pollution (23,341 → 1,109 file tree items) - Fix force_rescrape flag not checked before interactive prompt Impact: - Code coverage: 14% → 93.6% (+79.6pp) - Files analyzed: 50 → 323 (+546%) - Classes extracted: 55 → 585 (+964%) - Functions extracted: 512 → 2,784 (+444%) - AST errors: 95 → 0 (-100%) Tested on JMo Security repository with 345 Python files.
This commit is contained in:
@@ -187,7 +187,8 @@ class UnifiedScraper:
|
||||
'include_releases': source.get('include_releases', True),
|
||||
'include_code': source.get('include_code', True),
|
||||
'code_analysis_depth': source.get('code_analysis_depth', 'surface'),
|
||||
'file_patterns': source.get('file_patterns', [])
|
||||
'file_patterns': source.get('file_patterns', []),
|
||||
'local_repo_path': source.get('local_repo_path') # Pass local_repo_path from config
|
||||
}
|
||||
|
||||
# Scrape
|
||||
|
||||
Reference in New Issue
Block a user