fix: Resolve PDF processing (#267), How-To Guide (#242), Chinese README (#260) + code quality (#273)

Thanks @franklegolasyoung for the excellent work on the core fixes for issues #267, #242, and #260! 🙏

Your comprehensive approach to fixing PDF processing, expanding workflow detection, and improving the Chinese README documentation is much appreciated. I've added code quality fixes and comprehensive tests to ensure everything passes CI.

All 1266+ tests are now passing, and the issues are resolved! 🎉
yusyus
2026-01-31 21:30:00 +03:00
committed by GitHub
parent f726a9abc5
commit 91bd2184e5
19 changed files with 622 additions and 174 deletions


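@@ -67,8 +67,8 @@ Skill Seeker is an automated tool that converts documentation sites, GitHub repositories, and PD
- **Parallel processing** - 3x faster for large PDFs
- **Smart caching** - 50% faster on repeated runs
### 🐙 GitHub Repository Scraping (**v2.0.0**)
- **Deep code analysis** - AST parsing for Python, JavaScript, TypeScript, Java, C++, Go
### 🐙 GitHub Repository Analysis (**v2.0.0**)
- **Deep code analysis** - AST (abstract syntax tree) based parsing of Python, JavaScript, TypeScript, Java, C++, and Go code
- **API extraction** - Extracts functions, classes, and methods with their parameters and types
- **Repository metadata** - README, file tree, language breakdown, star/fork counts
- **GitHub Issues and PRs** - Fetches open/closed issues with labels and milestones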
@@ -977,6 +977,10 @@ skill-seekers scrape \
# Set your API key (one-time)
export ANTHROPIC_API_KEY=sk-ant-...
# Or use a Claude-compatible API endpoint (e.g. GLM-4.7 from Zhipu AI)
# export ANTHROPIC_API_KEY=your-api-key
# export ANTHROPIC_BASE_URL=https://your-compatible-endpoint.com/v1
# Package and upload automatically
skill-seekers package output/react/ --upload
@@ -1524,6 +1528,8 @@ skill-seekers scrape --config configs/largedocs.json --async --workers 8 --no-ra
# Option 1: During scraping (API-based, requires an API key)
pip3 install anthropic
export ANTHROPIC_API_KEY=sk-ant-...
# Or use a Claude-compatible API (e.g. GLM-4.7 from Zhipu AI)
# export ANTHROPIC_BASE_URL=https://your-endpoint.com/v1
skill-seekers scrape --config configs/react.json --enhance
# Option 2: During scraping (LOCAL, no API key needed - uses Claude Code Max)