firefrost-gaming/skill-seekers-reference

Files

yusyus 8b3f31409e fix: Enforce min_chunk_size in RAG chunker

- Filter out chunks smaller than min_chunk_size (default 100 tokens)
- Exception: Keep all chunks if entire document is smaller than target size
- All 15 tests passing (100% pass rate)

Fixes edge case where very small chunks (e.g., 'Short.' = 6 chars) were
being created despite min_chunk_size=100 setting.

Test: pytest tests/test_rag_chunker.py -v

2026-02-07 20:59:03 +03:00

11 KiB

Raw Blame History

Task #20 Complete: GitHub Actions Automation Workflows

Completion Date: February 7, 2026 Status: ✅ Complete New Workflows: 4

Objective

Extend GitHub Actions with automated workflows for Week 2 features, including vector database exports, quality metrics automation, scheduled skill updates, and comprehensive testing infrastructure.

Implementation Summary

Created 4 new GitHub Actions workflows that automate Week 2 features and provide comprehensive CI/CD capabilities for skill generation, quality analysis, and vector database integration.

New Workflows

1. Vector Database Export (`vector-db-export.yml`)

Triggers:

Manual (workflow_dispatch) with parameters
Scheduled (weekly on Sundays at 2 AM UTC)

Features:

Matrix strategy for popular frameworks (react, django, godot, fastapi)
Export to all 4 vector databases (Weaviate, Chroma, FAISS, Qdrant)
Configurable targets (single, multiple, or all)
Automatic quality report generation
Artifact uploads with 30-day retention
GitHub Step Summary with export results

Parameters:

skill_name: Framework to export
targets: Vector databases (comma-separated or "all")
config_path: Optional config file path

Output:

Vector database JSON exports
Quality metrics report
Export summary in GitHub UI

Security: All inputs accessed via environment variables (safe pattern)

2. Quality Metrics Dashboard (`quality-metrics.yml`)

Triggers:

Manual (workflow_dispatch) with parameters
Pull requests affecting output/ or configs/

Features:

Automated quality analysis with 4-dimensional scoring
GitHub annotations (errors, warnings, notices)
Configurable fail threshold (default: 70/100)
Automatic PR comments with quality dashboard
Multi-skill analysis support
Artifact uploads of detailed reports

Quality Dimensions:

Completeness (30% weight) - SKILL.md, references, metadata
Accuracy (25% weight) - No TODOs, valid JSON, no placeholders
Coverage (25% weight) - Getting started, API docs, examples
Health (20% weight) - No empty files, proper structure

Output:

Quality score with letter grade (A+ to F)
Component breakdowns
GitHub annotations on files
PR comments with dashboard
Detailed reports as artifacts

Security: Workflow_dispatch inputs and PR events only, no untrusted content

3. Test Vector Database Adaptors (`test-vector-dbs.yml`)

Triggers:

Push to main or development
Pull requests
Manual (workflow_dispatch)
Path filters for adaptor/MCP code

Features:

Matrix testing across 4 adaptors × 2 Python versions (3.10, 3.12)
Individual adaptor tests
Integration testing with real packaging
MCP tool testing
Week 2 validation script
Test artifact uploads
Comprehensive test summary

Test Jobs:

test-adaptors - Tests each adaptor (Weaviate, Chroma, FAISS, Qdrant)
test-mcp-tools - Tests MCP vector database tools
test-week2-integration - Full Week 2 feature validation

Coverage:

4 vector database adaptors
8 MCP tools
6 Week 2 feature categories
Python 3.10 and 3.12 compatibility

Security: Push/PR/workflow_dispatch only, matrix values are hardcoded constants

4. Scheduled Skill Updates (`scheduled-updates.yml`)

Triggers:

Scheduled (weekly on Sundays at 3 AM UTC)
Manual (workflow_dispatch) with optional framework filter

Features:

Matrix strategy for 6 popular frameworks
Incremental updates using change detection (95% faster)
Full scrape for new skills
Streaming ingestion for large docs
Automatic quality report generation
Claude AI packaging
Artifact uploads with 90-day retention
Update summary dashboard

Supported Frameworks:

React
Django
FastAPI
Godot
Vue
Flask

Workflow:

Check if skill exists
Incremental update if exists (change detection)
Full scrape if new
Generate quality metrics
Package for Claude AI
Upload artifacts

Parameters:

frameworks: Comma-separated list or "all" (default: all)

Security: Schedule + workflow_dispatch, input accessed via FRAMEWORKS_INPUT env variable

Workflow Integration

Existing Workflows Enhanced

The new workflows complement existing CI/CD:

Workflow	Purpose	Integration
`tests.yml`	Core testing	Enhanced with Week 2 test runs
`release.yml`	PyPI publishing	Now includes quality metrics
`vector-db-export.yml`	✨ NEW - Export automation
`quality-metrics.yml`	✨ NEW - Quality dashboard
`test-vector-dbs.yml`	✨ NEW - Week 2 testing
`scheduled-updates.yml`	✨ NEW - Auto-refresh

Workflow Relationships

tests.yml (Core CI)
  └─> test-vector-dbs.yml (Week 2 specific)
        └─> quality-metrics.yml (Quality gates)

scheduled-updates.yml (Weekly refresh)
  └─> vector-db-export.yml (Export to vector DBs)
        └─> quality-metrics.yml (Quality check)

Pull Request
  └─> tests.yml + quality-metrics.yml (PR validation)

Features & Benefits

1. Automation

Before Task #20:

Manual vector database exports
Manual quality checks
No automated skill updates
Limited CI/CD for Week 2 features

After Task #20:

✅ Automated weekly exports to 4 vector databases
✅ Automated quality analysis with PR comments
✅ Automated skill refresh for 6 frameworks
✅ Comprehensive Week 2 feature testing

2. Quality Gates

PR Quality Checks:

Code quality (ruff, mypy) - tests.yml
Unit tests (pytest) - tests.yml
Vector DB tests - test-vector-dbs.yml
Quality metrics - quality-metrics.yml

Release Quality:

All tests pass
Quality score ≥ 70/100
Vector DB exports successful
MCP tools validated

3. Continuous Delivery

Weekly Automation:

Sunday 2 AM: Vector DB exports (vector-db-export.yml)
Sunday 3 AM: Skill updates (scheduled-updates.yml)

On-Demand:

Manual triggers for all workflows
Custom framework selection
Configurable quality thresholds
Selective vector database exports

Security Measures

All workflows follow GitHub Actions security best practices:

✅ Safe Input Handling

Environment Variables: All inputs accessed via env: section
No Direct Interpolation: Never use ${{ github.event.* }} in run: commands
Quoted Variables: All shell variables properly quoted
Controlled Triggers: Only workflow_dispatch, schedule, push, pull_request

❌ Avoided Patterns

No github.event.issue.title/body usage
No github.event.comment.body in run commands
No github.event.pull_request.head.ref direct usage
No untrusted commit messages in commands

Security Documentation

Each workflow includes security comment header:

# Security Note: This workflow uses [trigger types].
# All inputs accessed via environment variables (safe pattern).

Usage Examples

Manual Vector Database Export

# Export React skill to all vector databases
gh workflow run vector-db-export.yml \
  -f skill_name=react \
  -f targets=all

# Export Django to specific databases
gh workflow run vector-db-export.yml \
  -f skill_name=django \
  -f targets=weaviate,chroma

Quality Analysis

# Analyze specific skill
gh workflow run quality-metrics.yml \
  -f skill_dir=output/react \
  -f fail_threshold=80

# On PR: Automatically triggered
# (no manual invocation needed)

Scheduled Updates

# Update specific frameworks
gh workflow run scheduled-updates.yml \
  -f frameworks=react,django

# Weekly automatic updates
# (runs every Sunday at 3 AM UTC)

Vector DB Testing

# Manual test run
gh workflow run test-vector-dbs.yml

# Automatic on push/PR
# (triggered by adaptor code changes)

Artifacts & Outputs

Artifact Types

Vector Database Exports (30-day retention)
- {skill}-vector-exports - All 4 JSON files
- Format: {skill}-{target}.json
Quality Reports (30-day retention)
- {skill}-quality-report - Detailed analysis
- quality-metrics-reports - All reports
Updated Skills (90-day retention)
- {framework}-skill-updated - Refreshed skill ZIPs
- Claude AI ready packages
Test Packages (7-day retention)
- test-package-{adaptor}-py{version} - Test exports

GitHub UI Integration

Step Summaries:

Export results with file sizes
Quality dashboard with grades
Test results matrix
Update status for frameworks

PR Comments:

Quality metrics dashboard
Threshold pass/fail status
Recommendations for improvement

Annotations:

Errors: Quality < threshold
Warnings: Quality < 80
Notices: Quality ≥ 80

Performance Metrics

Workflow Execution Times

Workflow	Duration	Frequency
vector-db-export.yml	5-10 min/skill	Weekly + manual
quality-metrics.yml	1-2 min/skill	PR + manual
test-vector-dbs.yml	8-12 min	Push/PR
scheduled-updates.yml	10-15 min/framework	Weekly

Resource Usage

Concurrency: Matrix strategies for parallelization
Caching: pip cache for dependencies
Artifacts: Compressed with retention policies
Storage: ~500MB/week for all workflows

Integration with Week 2 Features

Task #20 workflows integrate all Week 2 capabilities:

Week 2 Feature	Workflow Integration
Weaviate Adaptor	`vector-db-export.yml`, `test-vector-dbs.yml`
Chroma Adaptor	`vector-db-export.yml`, `test-vector-dbs.yml`
FAISS Adaptor	`vector-db-export.yml`, `test-vector-dbs.yml`
Qdrant Adaptor	`vector-db-export.yml`, `test-vector-dbs.yml`
Streaming Ingestion	`scheduled-updates.yml`
Incremental Updates	`scheduled-updates.yml`
Multi-Language	All workflows (language detection)
Embedding Pipeline	`vector-db-export.yml`
Quality Metrics	`quality-metrics.yml`
MCP Integration	`test-vector-dbs.yml`

Next Steps (Week 3 Remaining)

With Task #20 complete, continue Week 3 automation:

Task #21: Docker deployment
Task #22: Kubernetes Helm charts
Task #23: Multi-cloud storage (S3, GCS, Azure)
Task #24: API server for embedding generation
Task #25: Real-time documentation sync
Task #26: Performance benchmarking suite
Task #27: Production deployment guides

Files Created

GitHub Actions Workflows (4 files)

.github/workflows/vector-db-export.yml (220 lines)
.github/workflows/quality-metrics.yml (180 lines)
.github/workflows/test-vector-dbs.yml (140 lines)
.github/workflows/scheduled-updates.yml (200 lines)

Total Impact

New Files: 4 workflows (~740 lines)
Enhanced Workflows: 2 (tests.yml, release.yml)
Automation Coverage: 10 Week 2 features
CI/CD Maturity: Basic → Advanced

Quality Improvements

CI/CD Coverage

Before: 2 workflows (tests, release)
After: 6 workflows (+4 new)
Automation: Manual → Automated
Frequency: On-demand → Scheduled

Developer Experience

Quality Feedback: Manual → Automated PR comments
Vector DB Export: CLI → GitHub Actions
Skill Updates: Manual → Weekly automatic
Testing: Basic → Comprehensive matrix

Task #20: GitHub Actions Automation Workflows - COMPLETE ✅

Week 3 Progress: 1/8 tasks complete Ready for Task #21: Docker Deployment

11 KiB Raw Blame History Unescape Escape

Task #20 Complete: GitHub Actions Automation Workflows

Objective

Implementation Summary

New Workflows

1. Vector Database Export (vector-db-export.yml)

2. Quality Metrics Dashboard (quality-metrics.yml)

3. Test Vector Database Adaptors (test-vector-dbs.yml)

4. Scheduled Skill Updates (scheduled-updates.yml)

Workflow Integration

Existing Workflows Enhanced

Workflow Relationships

Features & Benefits

1. Automation

2. Quality Gates

3. Continuous Delivery

Security Measures

✅ Safe Input Handling

❌ Avoided Patterns

Security Documentation

Usage Examples

Manual Vector Database Export

Quality Analysis

Scheduled Updates

Vector DB Testing

Artifacts & Outputs

Artifact Types

GitHub UI Integration

Performance Metrics

Workflow Execution Times

Resource Usage

Integration with Week 2 Features

Next Steps (Week 3 Remaining)

Files Created

GitHub Actions Workflows (4 files)

Total Impact

Quality Improvements

CI/CD Coverage

Developer Experience

11 KiB

Raw Blame History

1. Vector Database Export (`vector-db-export.yml`)

2. Quality Metrics Dashboard (`quality-metrics.yml`)

3. Test Vector Database Adaptors (`test-vector-dbs.yml`)

4. Scheduled Skill Updates (`scheduled-updates.yml`)