Release v1.8.0: Add transcript-fixer skill
## New Skill: transcript-fixer v1.0.0 Correct speech-to-text (ASR/STT) transcription errors through dictionary-based rules and AI-powered corrections with automatic pattern learning. **Features:** - Two-stage correction pipeline (dictionary + AI) - Automatic pattern detection and learning - Domain-specific dictionaries (general, embodied_ai, finance, medical) - SQLite-based correction repository - Team collaboration with import/export - GLM API integration for AI corrections - Cost optimization through dictionary promotion **Use cases:** - Correcting meeting notes, lecture recordings, or interview transcripts - Fixing Chinese/English homophone errors and technical terminology - Building domain-specific correction dictionaries - Improving transcript accuracy through iterative learning **Documentation:** - Complete workflow guides in references/ - SQL query templates - Troubleshooting guide - Team collaboration patterns - API setup instructions **Marketplace updates:** - Updated marketplace to v1.8.0 - Added transcript-fixer plugin (category: productivity) - Updated README.md with skill description and use cases - Updated CLAUDE.md with skill listing and counts 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
This commit is contained in:
188
transcript-fixer/references/sql_queries.md
Normal file
188
transcript-fixer/references/sql_queries.md
Normal file
@@ -0,0 +1,188 @@
|
||||
# SQL Query Reference
|
||||
|
||||
Database location: `~/.transcript-fixer/corrections.db`
|
||||
|
||||
## Basic Operations
|
||||
|
||||
### Add Corrections
|
||||
|
||||
```sql
|
||||
-- Add a correction
|
||||
INSERT INTO corrections (from_text, to_text, domain, source)
|
||||
VALUES ('巨升智能', '具身智能', 'embodied_ai', 'manual');
|
||||
|
||||
INSERT INTO corrections (from_text, to_text, domain, source)
|
||||
VALUES ('奇迹创坛', '奇绩创坛', 'general', 'manual');
|
||||
```
|
||||
|
||||
### View Corrections
|
||||
|
||||
```sql
|
||||
-- View all active corrections
|
||||
SELECT from_text, to_text, domain, source, usage_count
|
||||
FROM active_corrections
|
||||
ORDER BY domain, from_text;
|
||||
|
||||
-- View corrections for specific domain
|
||||
SELECT from_text, to_text, usage_count, added_at
|
||||
FROM active_corrections
|
||||
WHERE domain = 'embodied_ai';
|
||||
```
|
||||
|
||||
## Context Rules
|
||||
|
||||
### Add Context-Aware Rules
|
||||
|
||||
```sql
|
||||
-- Add regex-based context rule
|
||||
INSERT INTO context_rules (pattern, replacement, description, priority)
|
||||
VALUES ('巨升方向', '具身方向', '巨升→具身', 10);
|
||||
|
||||
INSERT INTO context_rules (pattern, replacement, description, priority)
|
||||
VALUES ('近距离的去看', '近距离地去看', '的→地 副词修饰', 5);
|
||||
```
|
||||
|
||||
### View Rules
|
||||
|
||||
```sql
|
||||
-- View all active context rules (ordered by priority)
|
||||
SELECT pattern, replacement, description, priority
|
||||
FROM context_rules
|
||||
WHERE is_active = 1
|
||||
ORDER BY priority DESC;
|
||||
```
|
||||
|
||||
## Statistics
|
||||
|
||||
```sql
|
||||
-- View correction statistics by domain
|
||||
SELECT * FROM correction_statistics;
|
||||
|
||||
-- Count corrections by source
|
||||
SELECT source, COUNT(*) as count, SUM(usage_count) as total_usage
|
||||
FROM corrections
|
||||
WHERE is_active = 1
|
||||
GROUP BY source;
|
||||
|
||||
-- Most frequently used corrections
|
||||
SELECT from_text, to_text, domain, usage_count, last_used
|
||||
FROM corrections
|
||||
WHERE is_active = 1 AND usage_count > 0
|
||||
ORDER BY usage_count DESC
|
||||
LIMIT 10;
|
||||
```
|
||||
|
||||
## Learning and Suggestions
|
||||
|
||||
### View Suggestions
|
||||
|
||||
```sql
|
||||
-- View pending suggestions
|
||||
SELECT * FROM pending_suggestions;
|
||||
|
||||
-- View high-confidence suggestions
|
||||
SELECT from_text, to_text, domain, frequency, confidence
|
||||
FROM learned_suggestions
|
||||
WHERE status = 'pending' AND confidence >= 0.8
|
||||
ORDER BY confidence DESC, frequency DESC;
|
||||
```
|
||||
|
||||
### Approve Suggestions
|
||||
|
||||
```sql
|
||||
-- Insert into corrections
|
||||
INSERT INTO corrections (from_text, to_text, domain, source, confidence)
|
||||
SELECT from_text, to_text, domain, 'learned', confidence
|
||||
FROM learned_suggestions
|
||||
WHERE id = 1;
|
||||
|
||||
-- Mark as approved
|
||||
UPDATE learned_suggestions
|
||||
SET status = 'approved', reviewed_at = CURRENT_TIMESTAMP
|
||||
WHERE id = 1;
|
||||
```
|
||||
|
||||
## History and Audit
|
||||
|
||||
```sql
|
||||
-- View recent correction runs
|
||||
SELECT filename, domain, stage1_changes, stage2_changes, run_timestamp
|
||||
FROM correction_history
|
||||
ORDER BY run_timestamp DESC
|
||||
LIMIT 10;
|
||||
|
||||
-- View detailed changes for a specific run
|
||||
SELECT ch.line_number, ch.from_text, ch.to_text, ch.rule_type
|
||||
FROM correction_changes ch
|
||||
JOIN correction_history h ON ch.history_id = h.id
|
||||
WHERE h.filename = 'meeting.md'
|
||||
ORDER BY ch.line_number;
|
||||
|
||||
-- Calculate success rate
|
||||
SELECT
|
||||
COUNT(*) as total_runs,
|
||||
SUM(CASE WHEN success = 1 THEN 1 ELSE 0 END) as successful,
|
||||
ROUND(100.0 * SUM(CASE WHEN success = 1 THEN 1 ELSE 0 END) / COUNT(*), 2) as success_rate
|
||||
FROM correction_history;
|
||||
```
|
||||
|
||||
## Maintenance
|
||||
|
||||
```sql
|
||||
-- Deactivate (soft delete) a correction
|
||||
UPDATE corrections
|
||||
SET is_active = 0
|
||||
WHERE from_text = '错误词' AND domain = 'general';
|
||||
|
||||
-- Reactivate a correction
|
||||
UPDATE corrections
|
||||
SET is_active = 1
|
||||
WHERE from_text = '错误词' AND domain = 'general';
|
||||
|
||||
-- Update correction confidence
|
||||
UPDATE corrections
|
||||
SET confidence = 0.95
|
||||
WHERE from_text = '巨升' AND to_text = '具身';
|
||||
|
||||
-- Delete old history (older than 90 days)
|
||||
DELETE FROM correction_history
|
||||
WHERE run_timestamp < datetime('now', '-90 days');
|
||||
|
||||
-- Reclaim space
|
||||
VACUUM;
|
||||
```
|
||||
|
||||
## System Configuration
|
||||
|
||||
```sql
|
||||
-- View system configuration
|
||||
SELECT key, value, description FROM system_config;
|
||||
|
||||
-- Update configuration
|
||||
UPDATE system_config
|
||||
SET value = '5'
|
||||
WHERE key = 'learning_frequency_threshold';
|
||||
|
||||
-- Check schema version
|
||||
SELECT value FROM system_config WHERE key = 'schema_version';
|
||||
```
|
||||
|
||||
## Export
|
||||
|
||||
```sql
|
||||
-- Export corrections as CSV
|
||||
.mode csv
|
||||
.headers on
|
||||
.output corrections_export.csv
|
||||
SELECT from_text, to_text, domain, source, confidence, usage_count, added_at
|
||||
FROM active_corrections;
|
||||
.output stdout
|
||||
```
|
||||
|
||||
For JSON export, use Python script with `service.export_corrections()` instead.
|
||||
|
||||
## See Also
|
||||
|
||||
- `references/file_formats.md` - Complete database schema documentation
|
||||
- `references/quick_reference.md` - CLI command quick reference
|
||||
- `SKILL.md` - Main user documentation
|
||||
Reference in New Issue
Block a user