picoclaw

mirror of https://github.com/sipeed/picoclaw.git synced 2026-06-12 18:08:54 +00:00

Author	SHA1	Message	Date
xiaoen	c63c6449b4	fix(agent): forceCompression recovers from single oversized Turn When the entire session history is a single Turn (e.g. one user message followed by a massive tool response), findSafeBoundary returns 0 and forceCompression previously did nothing — leaving the agent stuck in a context-exceeded retry loop. Now falls back to keeping only the most recent user message when no safe Turn boundary exists. This breaks Turn atomicity as a last resort but guarantees the agent can recover. Also updates docs/agent-refactor/context.md to document this behavior. Ref #1490	2026-03-17 10:23:16 +08:00
xiaoen	b768dab822	test(agent): use realistic session data in context retry test Session history only stores user/assistant/tool messages — the system prompt is built dynamically by BuildMessages. Remove the incorrect system message from TestAgentLoop_ContextExhaustionRetry test data to match the real data model that forceCompression operates on.	2026-03-16 14:48:35 +08:00
xiaoen	7c1a1c2c1a	style(agent): fix gci comment alignment in test	2026-03-16 14:48:35 +08:00
xiaoen	edbdc3bcf1	fix(agent): findSafeBoundary returns 0 for single-Turn history When the entire history is a single Turn (one user message followed by tool calls and responses, no subsequent user message), the only Turn boundary is at index 0. Previously the fallback returned targetIndex, which could land on a tool or assistant message — splitting the Turn. Return 0 instead, so callers (forceCompression, summarizeSession) see mid <= 0 and skip compression rather than cutting inside the Turn.	2026-03-16 14:48:35 +08:00
xiaoen	8034ee7be1	fix(agent): correct media token arithmetic and tool call double-counting Two estimation bugs fixed: 1. Media tokens were added to the chars accumulator before the chars2/5 conversion, resulting in 2562/5=102 tokens per item instead of 256. Fix: add media tokens directly to the final token count, bypassing the character-based heuristic. 2. estimateMessageTokens counted both tc.Name and tc.Function.Name for tool calls, but providers only send one (OpenAI-compat uses function.name, Anthropic uses tc.Name). Fix: count tc.Function.Name when Function is present, fall back to tc.Name only otherwise. Also fix i18n hint text: "auto-detect" was misleading — the backend uses a 4x max_tokens heuristic, not actual model detection.	2026-03-16 14:48:34 +08:00
xiaoen	639739cb85	refactor(agent): use Turn as the atomic unit for compression cut-off Introduce parseTurnBoundaries() which identifies each Turn start index in the session history. A Turn is a complete "user input → LLM iterations → final response" cycle (as defined in the agent refactor design #1316). findSafeBoundary now uses Turn boundaries instead of raw role-scanning, making the intent explicit: "find the nearest Turn boundary." forceCompression drops the oldest half of Turns (not arbitrary messages), which is simpler and more intuitive. The Turn-based approach naturally prevents splitting tool-call sequences since each Turn is atomic.	2026-03-16 14:48:34 +08:00
xiaoen	efd403242e	fix(agent): preallocate messages slice in budget test Fixes prealloc lint warning by using make() with capacity hint.	2026-03-16 14:48:34 +08:00
xiaoen	b7f1c2b5fc	test(agent): add realistic session-shaped tests for context budget Add tests that reflect actual session data shape: history starts with user messages (no system prompt), includes chained tool-call sequences, reasoning content, and media items. Exercises the proactive budget check path with BuildMessages-style assembled messages.	2026-03-16 14:48:34 +08:00
xiaoen	d5fdd5ebd2	fix(agent): include ReasoningContent and Media in token estimation estimateMessageTokens now counts ReasoningContent (extended thinking / chain-of-thought) which can be substantial and is persisted in session history. Media items get a fixed per-item overhead (256 tokens) since actual cost depends on provider-specific image tokenization.	2026-03-16 14:48:33 +08:00
xiaoen	9c65d78b07	fix(agent): forceCompression must not assume history[0] is system prompt Session history (GetHistory) contains only user/assistant/tool messages. The system prompt is built dynamically by BuildMessages and is never stored in session. The previous code incorrectly treated history[0] as a system prompt, skipping the first user message and appending a compression note to it. Fix: operate on the full history slice, and record the compression note in the session summary (which BuildMessages already injects into the system prompt) rather than modifying any history message.	2026-03-16 14:48:33 +08:00
xiaoen	9c82b0baa2	refactor(agent): context boundary detection, proactive budget check, and safe compression Separate context_window from max_tokens — they serve different purposes (input capacity vs output generation limit). The previous conflation caused premature summarization or missed compression triggers. Changes: - Add context_window field to AgentDefaults config (default: 4x max_tokens) - Extract boundary-safe truncation helpers (isSafeBoundary, findSafeBoundary) into context_budget.go — pure functions with no AgentLoop dependency - forceCompression: align split to safe boundary so tool-call sequences (assistant+ToolCalls → tool results) are never torn apart - summarizeSession: use findSafeBoundary instead of hardcoded keep-last-4 - estimateTokens: count ToolCalls arguments and ToolCallID metadata, not just Content — fixes systematic undercounting in tool-heavy sessions - Add proactive context budget check before LLM call in runAgentLoop, preventing 400 context-length errors instead of reacting to them - Add estimateToolDefsTokens for tool definition token cost Closes #556, closes #665 Ref #1439	2026-03-16 14:48:32 +08:00
Mauro	021aa7d6d5	feat(agent): steering (#1517 ) * feat(agent): steering * fix loop * fix lint * fix lint	2026-03-16 00:08:16 +08:00
dataCenter430	0c5d7500e8	feat: expose local file paths for non-image media to enable agent file tools (#1516 ) * feat: expose local file paths for non-image media to enable agent file tools * fix: Golang Lint error	2026-03-14 12:09:11 +08:00
Alix-007	c68b4f3903	fix(qq): populate account bindings metadata (#1456 ) Co-authored-by: XYSK-lilong007 <267018309+XYSK-lilong007@users.noreply.github.com>	2026-03-13 23:08:55 +08:00
Hakancan	6b72326be1	fix: safety guard incorrectly blocks commands with URLs (#1254 ) * fix: safety guard incorrectly blocks commands with URLs The absolutePathPattern regex was matching URL path components like //github.com as file system paths, causing commands containing URLs to be incorrectly blocked by the workspace restriction safety guard. For example, 'agent-browser open https://github.com' would be blocked because //github.com was treated as an absolute file path outside the working directory. The fix adds a check to skip any path match that starts with '//', as these are URL path components, not file system paths. Fixes #1203 * fix: handle file:// URIs correctly in safety guard The previous fix skipped all paths starting with '//', which incorrectly also skipped file:// URIs that could escape the workspace sandbox. Changes: - Only skip '//' paths when preceded by web URL schemes (http:, https:, ftp:, etc.) - file:// URIs are now properly checked against workspace boundaries - Added TestShellTool_FileURISandboxing to verify the fix Fixes security issue raised by @alexhoshina in PR #1254 * style: fix gofumpt formatting * fix(safety-guard): use exact match position to prevent URL exemption bypass Using strings.Index(cmd, raw) always returned the first occurrence of the matched substring, allowing a bypass where the same //path appeared both inside a URL and as a standalone shell path (e.g. echo https://etc/passwd && cat //etc/passwd would skip the second match). Switch to FindAllStringIndex so each match is evaluated at its actual position in the command string. Adds TestShellTool_URLBypassPrevented to cover the exploit scenario.	2026-03-13 17:16:05 +08:00
lxowalle	9530883d2c	Fix/Add warning tips for MCP initialization when no valid servers configured (#1497 ) * add tips for mcp * fix test issue	2026-03-13 16:43:00 +08:00
美電球	4ccea5eb93	fix(identity): prevent allowlist ID entries from matching usernames (#1406 )	2026-03-13 15:41:18 +08:00
Cytown	9676e51e89	make gateway aware of config.json change (#1187 ) * make gateway aware of config.json change * fix according to code review * fix lint * fix review comment * fix for review * refactor to fix review * fix for review * fix for review	2026-03-13 14:27:46 +08:00
Cytown	dfa36f39cb	add model command to set default model (#1250 ) * add model command to set default model * fix for ci * fix test for model * fix active agent not recognized * implement test for model command * fix local-model can not set as default issue * fix review comment * fix for comment	2026-03-13 14:10:11 +08:00
Zane Tung	9fed4ec136	feat: add anthropic-messages protocol for native Anthropic Messages API support Fixes #269 (#1284 ) * feat: add anthropic-messages protocol support Add native Anthropic Messages API format support to enable compatibility with custom endpoints that only support Anthropic's native message format (not OpenAI-compatible format). Changes: - Add new pkg/providers/anthropic_messages package with HTTP-based provider - Implement Anthropic Messages API request/response format conversion - Add anthropic-messages protocol support in factory_provider.go - Include comprehensive unit tests (64.2% coverage) Features: - Support for system, user, assistant, and tool messages - Support for tool calls (tool_use blocks) - Proper header handling (x-api-key, anthropic-version) - Configurable max_tokens and temperature - Automatic base URL normalization Configuration example: model: "anthropic-messages/claude-opus-4-6" api_base: "https://api.anthropic.com" api_key: "sk-..." Tested with actual API endpoint, verified compatibility with Anthropic Messages API specification. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * docs: add anthropic-messages protocol examples to README and config Add configuration examples and documentation for the new anthropic-messages protocol: - config.example.json: Add claude-opus-4.6 example with anthropic-messages - README.md: Add "Anthropic Messages API (native format)" section - README.zh.md: Add Chinese version of the documentation This helps users understand when to use anthropic-messages vs anthropic protocol and fixes issue #269. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: format code with gofmt -s - Align constant definitions in provider.go - Align struct fields in test cases - Fix gofmt formatting issues reported in review 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: address linter errors - Fix HTTP header canonical form: "x-api-key" → "X-API-Key" - Fix HTTP header canonical form: "anthropic-version" → "Anthropic-Version" - Format imports with gci (standard, default, localmodule order) - Format code with golines (max line length 120) 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com> * fix: resolve golangci-lint errors in anthropic-messages provider - add nolint comment for canonicalheader rule on X-API-Key header (Anthropic API requires exact casing) - fix golines formatting issues in provider_test.go (split long lines under 120 chars) - fix long comment line in factory_provider.go (split into two lines) Resolves CI linter failures for the anthropic-messages protocol implementation. * fix(providers): address review comments in anthropic-messages provider - fix normalizeBaseURL edge case that incorrectly appends /v1 to URLs already containing /v1 path (e.g., https://api.example.com/v1/proxy) - remove dead code for apiBase empty check as normalizeBaseURL() always provides a default value - update test to use proper constructor instead of direct struct initialization - add detailed comments explaining the URL normalization logic Resolves review comments on PR #1284 * fix(providers): remove hardcoded max_tokens in anthropic-messages provider - remove hardcoded max_tokens value (4096) from buildRequestBody - read max_tokens directly from options parameter - add error handling when max_tokens is missing from options - update test cases to include max_tokens in options This fix ensures the provider respects the config default value (32768) or system fallback (8192) instead of always using the hardcoded 4096. * fix(providers): improve error handling and add edge case tests - fix ToolCalls nil vs empty slice issue to ensure consistent JSON serialization - add detailed HTTP error handling for common status codes (401, 429, 400, 404, 500, 503) - add edge case tests for buildRequestBody and parseResponseBody - clarify anthropic vs anthropic-messages protocol differences in docs --------- Co-authored-by: Claude <noreply@anthropic.com>	2026-03-13 14:09:40 +08:00
leamon	0fb92b21b6	enhance skill installer (#1252 ) * enhance skill installer * enhance install skills v2 * go file formate * fix:use proxy download skills;many chunck download;simple code * add default config to config.example.json, download skill from github use proxy and token --------- Co-authored-by: FantasticCode2019 <1443996278@qq.com>	2026-03-13 14:04:02 +08:00
dataCenter430	b811e9186c	feat(provider): add ModelScope as OpenAI-compatible provider (#1486 ) * feat(provider): add ModelScope as OpenAI-compatible provider * test(provider): add ModelScope provider and migration tests * docs: add ModelScope to README provider tables and free tier sections * chore: add ModelScope to example config and env template	2026-03-13 14:02:23 +08:00
Cytown	83e24e8ceb	fix 3rd party logger not correct output (#1482 )	2026-03-13 11:20:17 +08:00
don	19835b2f60	fix(line): limit webhook request body size to prevent DoS (#1413 ) * fix(line): limit webhook request body size to prevent DoS Add io.LimitReader with 1 MB cap on the LINE webhook handler to prevent unauthenticated memory exhaustion via oversized POST requests. Follows the same pattern used in the WeCom channel (io.LimitReader). Requests exceeding the limit are rejected with 413 Request Entity Too Large. Fixes #1407 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * refactor(line): hoist body size const, add boundary tests - Move maxWebhookBodySize to package-level const - Add TestWebhookAcceptsMaxBodySize (exact limit → 403, not 413) - Add TestWebhookRejectsOversizedBodyBeforeSignatureCheck - Use const in test instead of magic number Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 23:55:40 +08:00
Horsley Lee	8f49af99f9	fix(matrix): stream inbound media downloads to disk (#1436 )	2026-03-12 23:48:26 +08:00
Guoguo	1e024321c0	refactor: update model name and add VolcEngine coding plan (#1412 ) * docs: swap header logo to webp, move meme logo to bottom Replace header logo with assets/logo.webp across all 6 README language variants and move the original meme logo (logo.jpg) to the bottom of each file. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * docs: update GPT model names to gpt-5.4 and refine provider descriptions Update all 6 language README variants: - Correct GPT model references from gpt-5.2/gpt4 to gpt-5.4 - Refine provider descriptions in API Key comparison tables Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * chore: update default model to gpt-5.4, codex to gpt-5.3-codex Update OpenAI default model references from gpt-5.2 to gpt-5.4 across source code, config examples, tests, and docs. Set Codex default model to gpt-5.3-codex. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 16:10:29 +08:00
Alix-007	3bcbfd99b9	fix(channels): stop stale typing loops on overwrite (#1392 ) Co-authored-by: XYSK-lilong007 <267018309+XYSK-lilong007@users.noreply.github.com>	2026-03-12 14:31:00 +08:00
Mahendra Teja	8cac29d9bb	docs: remove stale TOOLS.md references (#1388 ) TOOLS.md was intentionally removed in `21d60f6` and #771, as tools are now provided to the LLM via JSON schema through ToProviderDefs(). These references were missed during that cleanup. Suggested by @yinwm in #1355. Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-12 09:47:01 +08:00
Mahendra Teja	6612ca099a	fix(openai_compat): improve prompt_cache_key host matching (#1387 ) LGTM! The changes improve the robustness of prompt_cache_key host matching and add Azure OpenAI support. Thanks for the contribution!	2026-03-12 03:24:31 +08:00
amagi	49204df678	fix(openai_compat): accept object tool call arguments (#1292 )	2026-03-12 02:47:22 +08:00
Cytown	d920b78b41	refactor logger to zerolog (#1239 ) * refactor logger to zerolog * modify dingtalk and discord logger * fix for lint * fix for review * fix for file leak * fix for review	2026-03-12 02:35:37 +08:00
LeaderOnePro	9222351871	feat(providers): add LongCat model provider support (#1317 ) * feat(providers): add LongCat model provider support Add LongCat as an OpenAI-compatible provider with base URL https://api.longcat.chat/openai and default model LongCat-Flash-Thinking. Includes provider config, migration, factory routing, example config, tests, and README entries for all 6 locales. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * fix(providers): address LongCat review feedback - Add dedicated factory routing test for LongCat provider - Add longcat to DefaultAPIBase test coverage - Set default api_base in example config providers section Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> * test(providers): add ResolveProviderSelection tests for LongCat Add two test cases to TestResolveProviderSelection: - Explicit provider selection with api_base default and proxy wiring - Fallback inference from model name with api_base default --------- Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-12 02:34:42 +08:00
Darren.Zeng	8431fa3e04	fix(config): support Chinese comma separator in allow_from environment variables (#1301 ) Add UnmarshalText method to FlexibleStringSlice to support both English (,) and Chinese (，) comma separators in environment variables. Includes comprehensive unit tests covering: - English commas, Chinese commas, mixed commas - Single values, whitespace trimming - Empty strings, edge cases Fixes #1280	2026-03-12 02:33:33 +08:00
Dimitrij Denissenko	39a451d312	Enable rich-text messages in matrix channel (#1370 ) * Enable rich-text messages in matrix channel * Fix lint	2026-03-12 01:25:28 +08:00
Mahendra Teja	4a80c6f58c	fix(openai_compat): only send prompt_cache_key to OpenAI endpoints (#1353 ) Non-OpenAI providers (Mistral, DeepSeek, Groq, etc.) reject unknown request fields with 422 errors. The previous blocklist only excluded Google/Gemini, but the comment already noted this feature is OpenAI-only. Flip to an allowlist so only api.openai.com receives the field. Fixes #1333	2026-03-12 01:21:54 +08:00
Congregalis	9b0a48ac6d	fix(agent): initialize MCP in direct agent mode (#1361 )	2026-03-12 01:06:48 +08:00
wenjie	8949a2575b	Add exec allow_remote config support in web settings (#1363 ) - default tools.exec.allow_remote to true when omitted in config loading - preserve allow_remote in OpenClaw config migration and API updates - expose allow_remote in the web config form with i18n strings - add backend and config tests covering the new default behavior	2026-03-11 19:57:59 +08:00
wenjie	8c2a9332c6	fix(security): harden unauthenticated tool-exec paths (#1360 ) * fix(security): harden unauthenticated tool-exec paths (GHSA-pv8c-p6jf-3fpp) - Exec tool: channel-based access control (default deny remote) - Cron tool: command scheduling restricted to internal channels - Web fetch: SSRF defense-in-depth (pre-flight + dial-time + redirect checks) - File permissions: session/state dirs 0700, files 0600 - Registry: inject __channel/__chat_id into tool args (replaces racy SetContext) 28 new security regression tests. (cherry picked from commit 191446ae19021604d3d5b0d9376b9655ab749105) * fix(exec): revalidate working_dir before command start * test(web): allow local oversized payload fixture --------- Co-authored-by: xj <gh-xj@users.noreply.github.com>	2026-03-11 19:22:20 +08:00
nayihz	8a398988d7	refactor skills loader markdown metadata parsing (#1354 )	2026-03-11 18:08:00 +08:00
Mauro	30584f04cb	Merge pull request #1214 from afjcjsbx/feat/echo-voice-audio-transcription feat(channel): echo voice audio transcription feedback	2026-03-11 08:45:25 +01:00
wenjie	e74820cf69	fix: skip meta json files during session migration (#1340 )	2026-03-11 14:29:42 +08:00
Cage	d5cbf198b2	fix: resolve gateway binary path, pass --config flag, and clarify empty model error (#1337 )	2026-03-11 12:54:08 +08:00
美電球	755fa32336	Merge pull request #1330 from statxc/fix/session-key-sanitize-slash fix(session): sanitize '/' and '\' in session keys so forum topic key…	2026-03-11 12:18:54 +08:00
afjcjsbx	08cc09e091	resolve conflicts	2026-03-11 00:17:10 +01:00
afjcjsbx	87d458f519	Merge remote-tracking branch 'origin/main' into feat/echo-voice-audio-transcription # Conflicts: # pkg/channels/telegram/telegram.go # pkg/config/config.go # pkg/config/defaults.go	2026-03-11 00:06:37 +01:00
Mauro	9cd2d21800	Merge pull request #1207 from afjcjsbx/feat/debug-mode-no-truncate feat: no-truncate param for debug	2026-03-10 17:13:44 +01:00
statxc	2e3e6788ab	fix(session): sanitize '/' and '\' in session keys so forum topic keys don't create invalid paths	2026-03-10 16:11:34 +00:00
美電球	54f0680add	Merge pull request #1291 from statxc/feat/telegram-forum-topics feat(telegram): support forum topics with per-topic session isolation	2026-03-10 21:38:40 +08:00
statxc	320fcd1f02	fix: Add IsForum check so only forum topic threads get session isolation, not regular group reply threads	2026-03-10 13:25:14 +00:00
lxowalle	680e845d61	feat:Modify the location where version is obtained, and insert version information into the context (#1300 ) * feat:migrate version info from internal package to pkg/config * * fix lint issue	2026-03-10 17:42:05 +08:00

1 2 3 4 5 ...

785 Commits