# Skill Seekers v3.0.0: The Universal Documentation Preprocessor for AI Systems

![Skill Seekers v3.0.0 Banner](https://skillseekersweb.com/images/blog/v3-release-banner.png)

> 🚀 **One command converts any documentation into structured knowledge for any AI system.**

## TL;DR

- 🎯 **16 output formats** (was 4 in v2.x)
- 🛠️ **26 MCP tools** for AI agents
- ✅ **1,852 tests** passing
- ☁️ **Cloud storage** support (S3, GCS, Azure)
- 🔄 **CI/CD ready** with GitHub Action

```bash
pip install skill-seekers
skill-seekers scrape --config react.json
```

---

## The Problem We're All Solving

Raise your hand if you've written this code before:

```python
# The custom scraper we all write
import requests
from bs4 import BeautifulSoup

def scrape_docs(url):
    # Handle pagination
    # Extract clean text
    # Preserve code blocks
    # Add metadata
    # Chunk properly
    # Format for vector DB
    # ... 200 lines later
    pass
```

**Every AI project needs documentation preprocessing.**

- **RAG pipelines**: "Scrape these docs, chunk them, embed them..."
- **AI coding tools**: "I wish Cursor knew this framework..."
- **Claude skills**: "Convert this documentation into a skill"

We all rebuild the same infrastructure. **Stop rebuilding. Start using.**

---

## Meet Skill Seekers v3.0.0

One command → Any format → Production-ready

### For RAG Pipelines

```bash
# LangChain Documents
skill-seekers scrape --format langchain --config react.json

# LlamaIndex TextNodes
skill-seekers scrape --format llama-index --config vue.json

# Pinecone-ready markdown
skill-seekers scrape --target markdown --config django.json
```

**Then in Python:**

```python
from skill_seekers.cli.adaptors import get_adaptor

adaptor = get_adaptor('langchain')
documents = adaptor.load_documents("output/react/")

# Now use with any vector store
from langchain_chroma import Chroma
from langchain_openai import OpenAIEmbeddings

vectorstore = Chroma.from_documents(
    documents,
    OpenAIEmbeddings()
)
```

### For AI Coding Assistants

```bash
# Give Cursor framework knowledge
skill-seekers scrape --target claude --config react.json
cp output/react-claude/.cursorrules ./
```

**Result:** Cursor now knows React hooks, patterns, and best practices from the actual documentation.

### For Claude AI

```bash
# Complete workflow: fetch → scrape → enhance → package → upload
skill-seekers install --config react.json
```

---

## What's New in v3.0.0

### 16 Platform Adaptors

| Category | Platforms | Use Case |
|----------|-----------|----------|
| **RAG/Vectors** | LangChain, LlamaIndex, Chroma, FAISS, Haystack, Qdrant, Weaviate | Build production RAG pipelines |
| **AI Platforms** | Claude, Gemini, OpenAI | Create AI skills |
| **AI Coding** | Cursor, Windsurf, Cline, Continue.dev | Framework-specific AI assistance |
| **Generic** | Markdown | Any vector database |

### 26 MCP Tools

Your AI agent can now prepare its own knowledge:

```
🔧 Config: generate_config, list_configs, validate_config
🌐 Scraping: scrape_docs, scrape_github, scrape_pdf, scrape_codebase
📦 Packaging: package_skill, upload_skill, enhance_skill, install_skill
☁️ Cloud: upload to S3, GCS, Azure
🔗 Sources: fetch_config, add_config_source
✂️ Splitting: split_config, generate_router
🗄️ Vector DBs: export_to_weaviate, export_to_chroma, export_to_faiss, export_to_qdrant
```

### Cloud Storage

```bash
# Upload to AWS S3
skill-seekers cloud upload output/ --provider s3 --bucket my-bucket

# Or Google Cloud Storage
skill-seekers cloud upload output/ --provider gcs --bucket my-bucket

# Or Azure Blob Storage
skill-seekers cloud upload output/ --provider azure --container my-container
```

### CI/CD Ready

```yaml
# .github/workflows/update-docs.yml
- uses: skill-seekers/action@v1
  with:
    config: configs/react.json
    format: langchain
```

Auto-update your AI knowledge when documentation changes.

---

## Why This Matters

### Before Skill Seekers

```
Week 1: Build custom scraper
Week 2: Handle edge cases
Week 3: Format for your tool
Week 4: Maintain and debug
```

### After Skill Seekers

```
15 minutes: Install and run
Done: Production-ready output
```

---

## Real Example: React + LangChain + Chroma

```bash
# 1. Install
pip install skill-seekers langchain-chroma langchain-openai

# 2. Scrape React docs
skill-seekers scrape --format langchain --config configs/react.json

# 3. Create RAG pipeline
```

```python
from skill_seekers.cli.adaptors import get_adaptor
from langchain_chroma import Chroma
from langchain_openai import OpenAIEmbeddings, ChatOpenAI
from langchain.chains import RetrievalQA

# Load documents
adaptor = get_adaptor('langchain')
documents = adaptor.load_documents("output/react/")

# Create vector store
vectorstore = Chroma.from_documents(
    documents,
    OpenAIEmbeddings()
)

# Query
qa_chain = RetrievalQA.from_chain_type(
    llm=ChatOpenAI(),
    retriever=vectorstore.as_retriever()
)

result = qa_chain.invoke({"query": "What are React Hooks?"})
print(result["result"])
```

**That's it.** 15 minutes from docs to working RAG pipeline.

---

## Production Ready

- ✅ **1,852 tests** across 100 test files
- ✅ **58,512 lines** of Python code
- ✅ **CI/CD** on every commit
- ✅ **Docker** images available
- ✅ **Multi-platform** (Ubuntu, macOS)
- ✅ **Python 3.10-3.13** tested

---

## Get Started

```bash
# Install
pip install skill-seekers

# Try an example
skill-seekers scrape --config configs/react.json

# Or create your own config
skill-seekers config --wizard
```

---

## Links

- 🌐 **Website:** https://skillseekersweb.com
- 💻 **GitHub:** https://github.com/yusufkaraaslan/Skill_Seekers
- 📖 **Documentation:** https://skillseekersweb.com/docs
- 📦 **PyPI:** https://pypi.org/project/skill-seekers/

---

## What's Next?

- ⭐ Star us on GitHub if you hate writing scrapers
- 🐛 Report issues (1,852 tests but bugs happen)
- 💡 Suggest features (we're building in public)
- 🚀 Share your use case

---

*Skill Seekers v3.0.0 was released on February 10, 2026. This is our biggest release yet - transforming from a Claude skill generator into a universal documentation preprocessor for the entire AI ecosystem.*

---

## Tags

#python #ai #machinelearning #rag #langchain #llamaindex #opensource #developer_tools #cursor #claude #docker #cloud