docs: Add documentation for 7 new programming languages

Update documentation for PR #275 extended language detection:
- CHANGELOG.md: Add comprehensive section for new languages
- language_detector.py: Update docstrings from 20+ to 27+ languages

New languages:
- Dart (Flutter framework)
- Scala (pattern matching, case classes)
- SCSS/SASS (CSS preprocessors)
- Elixir (functional, pipe operator)
- Lua (game scripting)
- Perl (text processing)

70 regex patterns with confidence scoring (0.6-0.8+ thresholds)
7 new tests, 30/30 passing (100%)

Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
This commit is contained in:
yusyus
2026-02-04 21:01:40 +03:00
parent 0abb01f3dd
commit 8f99ed0003
2 changed files with 20 additions and 2 deletions

View File

@@ -9,6 +9,24 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
### Added
#### Extended Language Detection (NEW)
- **7 New Programming Languages**: Dart, Scala, SCSS, SASS, Elixir, Lua, Perl
- Pattern-based detection with confidence scoring (0.6-0.8+ thresholds)
- **70 regex patterns** prioritizing unique identifiers (weight 5)
- Framework-specific patterns:
- **Dart**: Flutter widgets (`StatelessWidget`, `StatefulWidget`, `Widget build()`)
- **Scala**: Pattern matching (`case class`, `trait`, `match {}`)
- **SCSS**: Preprocessor features (`$variables`, `@mixin`, `@include`, `@extend`)
- **SASS**: Indented syntax (`=mixin`, `+include`, `$variables`)
- **Elixir**: Functional patterns (`defmodule`, `def ... do`, pipe operator `|>`)
- **Lua**: Game scripting (`local`, `repeat...until`, `~=`, `elseif`)
- **Perl**: Text processing (`my $`, `use strict`, `sub`, `chomp`, regex `=~`)
- **Comprehensive test coverage**: 7 new tests, 30/30 passing (100%)
- **False positive prevention**: Unique identifiers (weight 5) + confidence thresholds
- **No regressions**: All existing language detection tests still pass
- **Total language support**: Now 27+ programming languages
- **Credit**: Contributed by @PaawanBarach via PR #275
#### Multi-Agent Support for Local Enhancement (NEW)
- **Multiple Coding Agent Support**: Choose your preferred local coding agent for SKILL.md enhancement
- **Claude Code** (default): Claude Code CLI with `--dangerously-skip-permissions`

View File

@@ -3,7 +3,7 @@
Unified Language Detection for Code Blocks
Provides confidence-based language detection for documentation scrapers.
Supports 20+ programming languages with weighted pattern matching.
Supports 27+ programming languages with weighted pattern matching.
Author: Skill Seekers Project
"""
@@ -505,7 +505,7 @@ class LanguageDetector:
"""
Unified confidence-based language detection for code blocks.
Supports 20+ programming languages with weighted pattern matching.
Supports 27+ programming languages with weighted pattern matching.
Uses two-stage detection:
1. CSS class extraction (high confidence = 1.0)
2. Pattern-based heuristics with confidence scoring (0.0-1.0)