Moltbot/src at fb84e18bc3061e803b19be581b5354494ba060f9 - Moltbot - Gitea: Jon's Git

admin/Moltbot

Files

History

Rodrigo Uroz 7f1712c1ba (fix): enforce embedding model token limit to prevent overflow (#13455 )

* fix: enforce embedding model token limit to prevent 8192 overflow

- Replace EMBEDDING_APPROX_CHARS_PER_TOKEN=1 with UTF-8 byte length
  estimation (safe upper bound for tokenizer output)
- Add EMBEDDING_MODEL_MAX_TOKENS=8192 hard cap
- Add splitChunkToTokenLimit() that binary-searches for the largest
  safe split point, with surrogate pair handling
- Add enforceChunkTokenLimit() wrapper called in indexFile() after
  chunkMarkdown(), before any embedding API call
- Fixes: session files with large JSONL entries could produce chunks
  exceeding text-embedding-3-small's 8192 token limit

Tests: 2 new colocated tests in manager.embedding-token-limit.test.ts
- Verifies oversized ASCII chunks are split to <=8192 bytes each
- Verifies multibyte (emoji) content batching respects byte limits

* fix: make embedding token limit provider-aware

- Add optional maxInputTokens to EmbeddingProvider interface
- Each provider (openai, gemini, voyage) reports its own limit
- Known-limits map as fallback: openai 8192, gemini 2048, voyage 32K
- Resolution: provider field > known map > default 8192
- Backward compatible: local/llama uses fallback

* fix: enforce embedding input size limits (#13455) (thanks @rodrigouroz)

---------

Co-authored-by: Tak Hoffman <781889+Takhoffman@users.noreply.github.com>

2026-02-10 20:10:17 -06:00

..

chore: Enable "experimentalSortImports" in Oxfmt and reformat all imorts.

2026-02-01 10:03:47 +09:00

feat(gateway): stream thinking events and decouple tool events from verbose level (#10568 )

2026-02-10 19:17:21 -06:00

fix(auto-reply): prevent sender spoofing in group prompts

2026-02-10 00:44:38 -06:00

fix: prevent act:evaluate hangs from getting browser tool stuck/killed (#13498 )

2026-02-11 07:54:48 +08:00

fix: use STATE_DIR instead of hardcoded ~/.openclaw for identity and canvas (#4824 )

2026-02-07 22:16:59 -05:00

feat: IRC — add first-class channel support

2026-02-10 17:33:57 -06:00

fix: unify session maintenance and cron run pruning (#13083 )

2026-02-09 20:42:35 -08:00

Update Together default model to together/moonshotai/Kimi-K2.5 (#13324 )

2026-02-11 08:39:15 +09:00

refactor: rename to openclaw

2026-01-30 03:16:21 +01:00

feat(hooks): add agentId support to webhook mappings (#13672 )

2026-02-10 19:23:58 -05:00

Heartbeat: inject cron-style current time into prompts (#13733 )

2026-02-10 18:58:45 -06:00

fix(runtime): bump minimum Node.js version to 22.12.0 (#5370 )

2026-02-05 13:42:52 -08:00

fix(memory/qmd): scope query to managed collections (#11645 )

2026-02-09 23:35:27 -08:00

Docs: landing page revamp (#8885 )

2026-02-04 10:37:14 -05:00

feat(gateway): stream thinking events and decouple tool events from verbose level (#10568 )

2026-02-10 19:17:21 -06:00

fix: use STATE_DIR instead of hardcoded ~/.openclaw for identity and canvas (#4824 )

2026-02-07 22:16:59 -05:00

fix(auto-reply): prevent sender spoofing in group prompts

2026-02-10 00:44:38 -06:00

Heartbeat: inject cron-style current time into prompts (#13733 )

2026-02-10 18:58:45 -06:00

fix(auto-reply): prevent sender spoofing in group prompts

2026-02-10 00:44:38 -06:00

link-understanding

chore: Enable "experimentalSortImports" in Oxfmt and reformat all imorts.

2026-02-01 10:03:47 +09:00

fix: guard resolveUserPath against undefined input (#10176 )

2026-02-06 13:16:58 -05:00

chore: Enable "curly" rule to avoid single-statement if confusion/errors.

2026-01-31 16:19:20 +09:00

chore: Enable "experimentalSortImports" in Oxfmt and reformat all imorts.

2026-02-01 10:03:47 +09:00

refactor: consolidate PNG encoder and safeParseJson utilities (#12457 )

2026-02-09 00:21:54 -08:00

media-understanding

refactor: consolidate fetchWithTimeout into shared utility

2026-02-09 20:34:56 -08:00

(fix): enforce embedding model token limit to prevent overflow (#13455 )

2026-02-10 20:10:17 -06:00

fix: prevent act:evaluate hangs from getting browser tool stuck/killed (#13498 )

2026-02-11 07:54:48 +08:00

fix(pairing): use actual code in pairing approval text

2026-02-10 19:48:02 -05:00

Update contributing, deduplicate more functions

2026-02-09 19:21:33 -08:00

fix(auto-reply): prevent sender spoofing in group prompts

2026-02-10 00:44:38 -06:00

fix: skip extension append if command already has one

2026-01-31 20:39:33 -06:00

chore: Fix failing test.

2026-02-09 09:58:58 +09:00

refactor: unify peer kind to ChatType, rename dm to direct (#11881 )

2026-02-09 09:20:52 +09:00

chore: Enable "experimentalSortImports" in Oxfmt and reformat all imorts.

2026-02-01 10:03:47 +09:00

refactor(security,config): split oversized files (#13182 )

2026-02-09 22:22:29 -08:00

fix: unify session maintenance and cron run pruning (#13083 )

2026-02-09 20:42:35 -08:00

chore: Enable "curly" rule to avoid single-statement if confusion/errors.

2026-01-31 16:19:20 +09:00

fix(auto-reply): prevent sender spoofing in group prompts

2026-02-10 00:44:38 -06:00

fix(auto-reply): prevent sender spoofing in group prompts

2026-02-10 00:44:38 -06:00

fix(pairing): use actual code in pairing approval text

2026-02-10 19:48:02 -05:00

fix: error handling in restore failure reporting

2026-02-03 06:22:51 +00:00

fix: use STATE_DIR instead of hardcoded ~/.openclaw for identity and canvas (#4824 )

2026-02-07 22:16:59 -05:00

chore: Enable "experimentalSortImports" in Oxfmt and reformat all imorts.

2026-02-01 10:03:47 +09:00

chore: Enable "experimentalSortImports" in Oxfmt and reformat all imorts.

2026-02-01 10:03:47 +09:00

Centralize date/time formatting utilities (#11831 )

2026-02-08 04:53:31 -08:00

fix: update pi packages to 0.51.0, remove bogus type augmentation

2026-02-02 01:52:33 +01:00

refactor: consolidate fetchWithTimeout into shared utility

2026-02-09 20:34:56 -08:00

Heartbeat: inject cron-style current time into prompts (#13733 )

2026-02-10 18:58:45 -06:00

chore: Enable "experimentalSortImports" in Oxfmt and reformat all imorts.

2026-02-01 10:03:47 +09:00

feat(onboard): add custom/local API configuration flow (#11106 )

2026-02-10 07:31:02 -05:00

channel-web.barrel.test.ts

chore: Enable "experimentalSortImports" in Oxfmt and reformat all imorts.

2026-02-01 10:03:47 +09:00

channel-web.ts

…

docker-setup.test.ts

test(docker): make bash 3.2 compatibility check portable

2026-02-10 18:04:48 -05:00

entry.ts

Centralize date/time formatting utilities (#11831 )

2026-02-08 04:53:31 -08:00

extensionAPI.ts

chore: Migrate to tsdown, speed up JS bundling by ~10x (thanks @hyf0).

2026-02-03 20:18:16 +09:00

globals.test.ts

…

globals.ts

chore: Enable "curly" rule to avoid single-statement if confusion/errors.

2026-01-31 16:19:20 +09:00

index.test.ts

…

index.ts

chore: Enable "experimentalSortImports" in Oxfmt and reformat all imorts.

2026-02-01 10:03:47 +09:00

logger.test.ts

chore: Enable "experimentalSortImports" in Oxfmt and reformat all imorts.

2026-02-01 10:03:47 +09:00

logger.ts

chore: Enable "curly" rule to avoid single-statement if confusion/errors.

2026-01-31 16:19:20 +09:00

logging.ts

chore: Enable "experimentalSortImports" in Oxfmt and reformat all imorts.

2026-02-01 10:03:47 +09:00

polls.test.ts

chore: Enable "experimentalSortImports" in Oxfmt and reformat all imorts.

2026-02-01 10:03:47 +09:00

polls.ts

…

runtime.ts

CLI: restore terminal state on exit

2026-02-03 06:10:19 +00:00

utils.test.ts

fix(paths): structurally resolve home dir to prevent Windows path bugs (#12125 )

2026-02-08 20:06:29 -05:00

utils.ts

Deduplicate more

2026-02-09 18:56:58 -08:00

version.test.ts

fix: CLI harden update restart imports and fix nested bundle version resolution

2026-02-06 00:09:48 -05:00

version.ts

fix: CLI harden update restart imports and fix nested bundle version resolution

2026-02-06 00:09:48 -05:00