yusyus
8b3f31409e
fix: Enforce min_chunk_size in RAG chunker
...
- Filter out chunks smaller than min_chunk_size (default 100 tokens)
- Exception: Keep all chunks if entire document is smaller than target size
- All 15 tests passing (100% pass rate)
Fixes edge case where very small chunks (e.g., 'Short.' = 6 chars) were
being created despite min_chunk_size=100 setting.
Test: pytest tests/test_rag_chunker.py -v
2026-02-07 20:59:03 +03:00
..
2025-10-29 23:19:32 +03:00
2026-01-17 17:48:15 +00:00
2025-10-19 02:08:58 +03:00
2026-01-17 23:02:11 +03:00
2025-10-19 17:01:37 +03:00
2026-02-03 21:37:54 +03:00
2026-01-31 21:30:00 +03:00
2026-01-17 17:48:15 +00:00
2026-01-18 00:01:30 +03:00
2026-01-18 00:01:30 +03:00
2026-02-07 20:59:03 +03:00
2026-01-17 23:02:11 +03:00
2026-01-29 22:56:33 +03:00
2026-01-17 23:25:12 +03:00
2026-02-03 21:00:34 +03:00
2026-02-07 20:59:03 +03:00
2026-01-17 17:29:21 +00:00
2026-02-05 21:27:41 +03:00
2026-01-17 23:02:11 +03:00
2026-01-31 21:30:00 +03:00
2026-01-17 17:48:15 +00:00
2026-01-17 17:48:15 +00:00
2026-01-17 22:54:40 +03:00
2026-01-17 23:02:11 +03:00
2026-02-07 13:48:05 +03:00
2026-02-07 20:59:03 +03:00
2026-02-04 10:14:20 +01:00
2026-01-17 17:48:15 +00:00
2026-01-17 23:02:11 +03:00
2026-02-05 22:02:06 +03:00
2026-01-17 17:48:15 +00:00
2026-01-17 23:02:11 +03:00
2026-01-17 23:02:11 +03:00
2026-01-17 23:02:11 +03:00
2026-01-18 00:01:30 +03:00
2026-01-18 00:01:30 +03:00
2026-01-31 21:30:00 +03:00
2026-02-07 13:42:14 +03:00
2026-01-18 00:01:30 +03:00
2026-01-17 17:48:15 +00:00
2026-01-17 23:02:11 +03:00
2026-01-17 17:48:15 +00:00
2026-01-31 14:58:09 +03:00
2026-01-18 12:11:01 +03:00
2026-02-04 21:20:23 +03:00
2026-01-17 17:29:21 +00:00
2026-01-17 23:02:11 +03:00
2026-01-18 00:01:30 +03:00
2026-01-17 17:29:21 +00:00
2026-01-17 23:33:34 +03:00
2026-01-17 23:02:11 +03:00
2026-01-17 23:02:11 +03:00
2026-01-17 23:02:11 +03:00
2026-02-07 20:59:03 +03:00
2026-01-17 17:48:15 +00:00
2026-01-17 17:48:15 +00:00
2026-02-07 13:45:01 +03:00
2026-01-17 17:48:15 +00:00
2026-02-03 21:00:34 +03:00
2026-01-17 17:48:15 +00:00
2026-01-17 17:48:15 +00:00
2026-01-17 22:54:40 +03:00
2026-02-04 21:00:49 +03:00
2026-01-27 21:11:04 +03:00
2026-01-17 17:29:21 +00:00
2026-01-17 17:48:15 +00:00
2026-02-07 13:54:44 +03:00
2026-02-07 20:53:44 +03:00
2026-01-17 17:48:15 +00:00
2026-01-17 23:02:11 +03:00
2026-01-17 17:48:15 +00:00
2026-01-17 23:25:12 +03:00
2026-01-18 13:48:37 +03:00
2026-01-18 00:01:30 +03:00
2026-01-17 23:02:11 +03:00
2026-01-17 23:02:11 +03:00
2026-02-07 13:39:43 +03:00
2026-01-17 23:02:11 +03:00
2026-01-17 23:02:11 +03:00
2026-02-02 23:08:25 +03:00
2026-01-17 23:02:11 +03:00
2026-01-17 22:54:40 +03:00
2026-01-17 23:25:12 +03:00
2026-01-17 17:48:15 +00:00
2026-02-04 21:16:13 +03:00
2026-01-17 17:29:21 +00:00