picoclaw

mirror of https://github.com/sipeed/picoclaw.git synced 2026-06-12 18:08:54 +00:00

Author	SHA1	Message	Date
Liu Yuan	7eba27c3c4	feat: add ContextManager abstraction for pluggable context management (#2203) - Define ContextManager interface with Assemble/Compact/Ingest methods - Implement legacyContextManager wrapping existing summarization logic - Wire Assemble (before BuildMessages), Compact (post-turn + overflow), and Ingest (after message persistence) into agent loop - Add ContextManager config field and factory registry with config passthrough - Remove old maybeSummarize/summarizeSession/summarizeBatch/etc from loop.go - All existing tests pass with default (legacy) config Co-authored-by: Liu Yuan <namei.unix@gmail.com>	2026-04-02 00:08:15 +08:00
Cytown	9ac21c5908	add missing recover panic in subturn.go (#2253)	2026-04-01 23:44:41 +08:00
Cytown	e2a9bb97c7	unify all panic event to panic log file (#2250)	2026-04-01 23:26:49 +08:00
reusu	31afad6e87	feat: add load_image tool for local file vision (#2116) * feat: add load_image tool for local file vision * fix: address load_image PR review feedback - Exclude load_image from sub-agent tools via Unregister after Clone, since RunToolLoop does not call resolveMediaRefs - Add ToolRegistry.Unregister() method - Fix scope collision: use channel:chatID instead of filename - Add channel/chatID context resolution matching send_file pattern - Add comment explaining iteration > 1 guard on resolveMediaRefs - Remove emoji from ForUser for consistency with send_file - Add load_image_test.go * feat: enable load_image for subagents via MediaResolver in RunToolLoop Instead of removing load_image from sub-agent tools (`28f69e71`), inject a MediaResolver into the legacy RunToolLoop fallback path so media:// refs are resolved to base64 before each LLM call — matching the main agent loop behavior. - Add MediaResolver field to ToolLoopConfig and call it on iteration > 1 - Add SubagentManager.SetMediaResolver() and wire it through runTask - Remove ToolRegistry.Unregister() (no longer needed) - Restore load_image in sub-agent tool set (revert Clone+Unregister) - Add TestSubagentManager_SetMediaResolver_StoresResolver * refactor(load_image): remove prompt parameter from tool schema * test(tools): add success-path test for LoadImageTool Add TestLoadImage_SuccessPath that creates a real PNG file with valid magic bytes, calls Execute with WithToolContext, and verifies: - result.IsError == false - ToolResult.Media contains a media:// ref - ToolResult.ForLLM contains the [image: marker - media ref is resolvable in the store Add explanatory comment in loop.go for why Media and ArtifactTags coexist on non-ResponseHandled tool results (e.g. load_image). * fix: preallocate slice in tests and add ResponseHandled guard in toolloop Fix prealloc linter failure in load_image_test.go. Prevent double-resolving media by checking ResponseHandled in toolloop.go. * Register TTS tool if provider is available --------- Co-authored-by: Reusu <admin@yumao.name> Co-authored-by: 美電球 <hoshina@evaz.org>	2026-04-01 21:32:10 +08:00
Hua Audio	0f395ce110	Refactor/asr tts (#1939) * refactor: update ASR and TTS implementations * fix lint * Integrating asr/tts models w/ new security config * update documents * add arbitrary whisper transcriptor support * update documents * fix lint * add mimo tts	2026-04-01 12:21:21 +08:00
Badgerbees	1a44752dc5	fix(agent): prevent double-counting system message tokens in estimator Treat SystemParts as an alternative representation of message Content rather than an additive one. This prevents systematic overestimation of system message tokens which could trigger premature context pruning or summarization. - Picks the maximum of Content vs. SystemParts to stay conservative. - Adds a per-part overhead (20 chars) to account for JSON metadata. - Streamlines the ReasoningContent counting logic. Fixes a deficiency where structured blocks for cache-aware adapters caused overestimated budgets or hidden overflows.	2026-03-31 17:09:01 +07:00
Badgerbees	93f391a6bf	fix(agent): include SystemParts in token estimation and add reasoning guards	2026-03-31 16:33:24 +07:00
DimonB	6c0798ca3f	feat(channels): make Channel.Send return delivered message IDs (#2190) * feat(channels): Channel.Send and MediaSender.SendMedia return delivered message IDs Change Channel.Send signature from (ctx, msg) error to (ctx, msg) ([]string, error) and MediaSender.SendMedia similarly, so callers can capture platform message IDs for threading, reactions, and history annotation. Adapters that return real IDs: Telegram (per-chunk MessageID), Discord (Message.ID), Slack Send (ts), QQ (sentMsg.ID), Matrix (EventID). Slack SendMedia returns nil because UploadFileV2 does not expose the posted message timestamp in its response. All other adapters return nil IDs. preSend and sendWithRetry in manager.go updated to propagate ([]string, bool). README examples updated for both English and Chinese docs. * style: apply golangci-lint fixes (golines) * docs: fix Send migration guide — restore old error-only signature in before/after example	2026-03-31 11:07:32 +08:00
Cytown	50b8d9bf83	Merge branch 'main' into t3	2026-03-30 18:01:07 +08:00
Alix-007	e88df4ff9c	feat(tools): add reaction tool and reply-aware message sends (#2156) - Add `reaction` tool that reacts to a message (defaults to current inbound message via context) - Extend `message` tool with optional `reply_to_message_id` parameter - Introduce `WithToolInboundContext` to inject inbound message IDs into tool execution context - Surface `MessageID` and `ReplyToMessageID` in `processOptions` for tool-surface consumption Refs #2137	2026-03-30 16:31:34 +08:00
Cytown	9c28870e80	Merge branch 'main' into t3	2026-03-29 16:48:56 +08:00
沈青川	e414b82ac3	fix(cron): publish agent response to outbound bus for cron-triggered jobs (#2100) * fix(cron): publish agent response to outbound bus for cron-triggered jobs When a cron job triggers agent execution via ProcessDirectWithChannel, the agent response was silently discarded — the code assumed AgentLoop would auto-publish it, but SendResponse is false on this path. Delegate to PublishResponseIfNeeded (exported from AgentLoop) so the response reaches the originating channel (e.g. Telegram) only when the message tool did not already deliver content in the same round. Also adds a "directive" message type to CronPayload, allowing cron jobs to instruct the agent to execute a task rather than echo static text. * fix(cron): add type validation and directive test coverage Address reviewer blocking feedback: 1. Server-side whitelist for `type` parameter — the `enum` in Parameters() is only an LLM schema hint; any string was persisted. Now `addJob` rejects values other than "message" and "directive". 2. Comprehensive test coverage for the directive code path: - directive adds prompt prefix to ProcessDirectWithChannel - deliver=true + directive routes through agent (not direct publish) - directive prompt content, sessionKey, channel, chatID are correct - invalid type is rejected; valid types ("", "message", "directive") pass - deliver=true message type goes directly to bus (regression) - agent error path does not trigger publish (regression) Also merge the two UpdateJob calls in addJob into one to avoid redundant disk I/O (non-blocking suggestion from review). * fix(cron): remove omitempty from CronPayload.Type for consistent JSON Empty string and "message" are semantically equivalent defaults; always serializing the field avoids asymmetric JSON output. * test(cron): remove redundant test, strengthen error path coverage - Remove ExecuteJobDirectivePassesCorrectContent: its assertions on sessionKey/channel/chatID duplicate ExecuteJobPublishesAgentResponse; its prompt check duplicates DirectiveAddsPromptPrefix. - Strengthen DirectiveAddsPromptPrefix with exact prompt match and publish response assertion. - Fix ReturnsErrorWithoutPublish: set non-empty stub response so the test verifies the error branch early-return, not the response=="" guard. * fix(ci): satisfy golines and gosmopolitan in cron code	2026-03-29 13:47:28 +08:00
Cytown	475d377af1	Merge branch 'main' into t3	2026-03-29 01:25:20 +08:00
Cytown	0bb561548f	add pid file for gateway running and auth token for /reload and pico channel	2026-03-29 01:14:39 +08:00
Guoguo	62d40a02d4	fix: resolve typecheck errors in loop_test.go and dingtalk_test.go (#2122) - loop_test.go: replace undefined WithSecurity/SecurityConfig/ModelSecurityEntry with direct APIKeys field using SimpleSecureStrings() - dingtalk_test.go: use ClientSecret.String() and ClientSecret.Set() instead of non-existent ClientSecret() and SetClientSecret() methods Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-28 18:58:09 +08:00
Mauro	230942d234	fix(loop): polling (#2103)	2026-03-28 16:36:06 +08:00
xiwuqi	e011284d8f	fix(agent): use light provider for routed model calls (#2038)	2026-03-28 15:25:23 +08:00
Cytown	f1cb7cc8f5	fix gateway reload will cause pico stop working issue (#2082) * fix gateway reload will cause pico stop working issue * fix for review	2026-03-28 11:30:31 +08:00
Mauro	60d7ec20a5	feat(log): prompt tokens (#2047)	2026-03-28 02:00:12 +08:00
Cytown	b646d3b8fe	refactor config and security to simplified the structure (#2068)	2026-03-28 00:03:34 +08:00
Mauro	1dff5e6903	Merge pull request #2016 from badgerbees/fix/context-overflow-errors fix(providers): improve context overflow detection and classification	2026-03-25 21:58:53 +01:00
Badgerbees	ae94893605	adding test units	2026-03-26 03:03:19 +07:00
Badgerbees	97dec16769	fix(providers): improve context overflow detection and classification	2026-03-26 01:07:56 +07:00
柚子	ed618e14aa	feat(channels): support multi-message sending via split marker (#2008) * Add multi-message sending via split marker * Add marker and length split integration tests Tests that SplitByMarker and SplitMessage work together correctly, and that code block boundaries are preserved during marker splitting. * Simplify message chunking logic in channel worker Extract splitByLength helper function and remove goto-based control flow. The logic now flows more naturally - try marker splitting first, then fall back to length-based splitting. * Update multi-message output instructions in agent context * Add split_on_marker to config defaults * Add split_on_marker config option * Rename 'Multi-Message Sending' setting to 'Chatty Mode' * Add SplitOnMarker config option	2026-03-26 01:33:49 +08:00
Liu Yuan	3f1ac297d4	feat(tools): add exec tool enhancement with background execution and PTY support (#1752) - Unified exec tool with actions: run/list/poll/read/write/send-keys/kill - PTY support using creack/pty library - Process session management with background execution - Process group kill for cleaning up child processes - Session cleanup: 30-minute TTL for old sessions - Output buffer: 100MB limit with truncation Actions: - run: execute command (sync or background) - list: list all sessions - poll: check session status - read: read session output - write: send input to session stdin - send-keys: send special keys (up, down, ctrl-c, enter, etc.) - kill: terminate session Tests: - PTY: allowed commands, write/read, poll, kill, process group kill - Non-PTY: background execution, list, read, write, poll, kill, process group kill - Session management: add/get/remove/list/cleanup	2026-03-25 21:02:49 +08:00
xiwuqi	85dfb341a8	fix(agent): suppress heartbeat tool feedback (#1937)	2026-03-25 14:22:41 +08:00
daming大铭	1b9445b806	Merge pull request #1955 from alexhoshina/refactor/wecom Refactor/wecom	2026-03-24 23:37:35 +08:00
Mauro	2a0efb6e52	Merge pull request #1889 from afjcjsbx/fix/binary-tool-output-handling fix(tool): route binary outputs through the media pipeline	2026-03-24 15:37:06 +01:00
Hoshina	a1f95f02bc	refactor(wecom): rebuild ai bot channel	2026-03-24 20:23:29 +08:00
美電球	f2f6987f00	test(agent): allow mock custom tool args (#1965)	2026-03-24 19:27:29 +08:00
Orkun Manap	dd9adf8a04	feat: add ElevenLabs Scribe STT transcriber and Telegram SendVoice support (#1905) * feat: add ElevenLabs Scribe STT transcriber and Telegram SendVoice support Add ElevenLabsTranscriber as an alternative speech-to-text provider using the ElevenLabs Scribe API (scribe_v1). This enables voice message transcription for users who already have an ElevenLabs API key, without requiring a separate Groq account. Changes: - Add ElevenLabsTranscriber implementing the Transcriber interface - Update DetectTranscriber to check providers.elevenlabs.api_key first, falling back to Groq for backward compatibility - Add ElevenLabs to ProvidersConfig - Add "voice" media type for OGG files with "voice" in filename - Add SendVoice support in Telegram channel for voice bubble messages - Add comprehensive tests for ElevenLabs transcriber Configuration: "providers": { "elevenlabs": { "api_key": "sk_your_key_here" } } Closes #1503 (partial) * fix: move voice-bubble detection into Telegram channel to avoid regression in other channels Address review feedback: keep inferMediaType returning "audio" for all OGG files. Voice-bubble detection (SendVoice vs SendAudio) is now done inside the Telegram channel based on filename, so other channels that map "audio" explicitly are unaffected. * fix: align VoiceConfig struct tags to pass golines formatter Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(agent): use ModelName in loop test added by upstream Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 22:11:10 +01:00
daming大铭	2c48cd3461	Merge pull request #1907 from xiwuqi/wuxi/fix-reasoning-channel-content fix(agent): route reasoning_content to reasoning channel	2026-03-24 01:24:14 +08:00
afjcjsbx	5d5536a1a6	fix delivery and steering	2026-03-23 14:09:52 +01:00
uiyzzi	16d23d8cdc	feat(security): add sensitive data filtering for tool results sent to LLM Prevent LLM from seeing its own credentials (API keys, tokens, secrets) by filtering sensitive values from tool call results before sending to the model. Values are collected from .security.yml and replaced with [FILTERED] using an efficient strings.Replacer (O(n+m)). - Add FilterSensitiveData and FilterMinLength to ToolsConfig - Implement SensitiveDataReplacer() with sync.Once caching in SecurityConfig - Use reflection to collect all sensitive values (Model API keys, channel tokens, web tool API keys, skills tokens) - Apply filtering in agent loop at 4 tool result locations - Add comprehensive tests covering all token types	2026-03-23 20:55:41 +08:00
afjcjsbx	8ed171dbe6	resolved conflicts	2026-03-23 13:43:02 +01:00
afjcjsbx	fddfd56b50	Merge branch 'main' into fix/binary-tool-output-handling # Conflicts: # pkg/agent/loop.go # pkg/agent/loop_test.go # pkg/commands/builtin_test.go # pkg/tools/send_file_test.go	2026-03-23 13:16:23 +01:00
Liqiang Liu	f81b44bf19	fix(provider): deduplicate tool results and merge consecutive tool_result blocks for Anthropic API (#1793) Anthropic API returns 400 when multiple tool_result blocks share the same tool_use_id, or when consecutive tool results are sent as separate user messages. This fix: 1. Adds ToolCallID deduplication in sanitizeHistoryForProvider (context.go) to drop duplicate tool results before sending to any provider. 2. Merges consecutive tool result messages into a single user message with multiple tool_result content blocks in Anthropic's buildRequestBody, for both "user" (with ToolCallID) and "tool" role messages. 3. Adds tests for both behaviors. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-23 17:24:46 +08:00
uiyzzi	d1d2155edb	Use ModelName instead of Model in test config structs	2026-03-23 16:47:13 +08:00
Mauro	054b55fdfc	Merge pull request #1893 from afjcjsbx/feat/skill-channel-commands feat(skills): add channel commands to list and force installed skills	2026-03-23 09:04:06 +01:00
Cytown	5a8aab8143	Merge branch 'main' into version	2026-03-23 11:41:36 +08:00
Cytown	7bf4831059	Merge branch 'main' into version	2026-03-23 10:54:08 +08:00
xiwuqi	336d5d4c07	fix(agent): route reasoning_content to reasoning channel	2026-03-22 19:57:47 -05:00
afjcjsbx	1e98f86fa9	fix Ooutboundmedia	2026-03-23 00:08:43 +01:00
afjcjsbx	f735b0551c	fix	2026-03-22 23:46:10 +01:00
afjcjsbx	388505d7e0	fix lint	2026-03-22 23:39:33 +01:00
afjcjsbx	b90c5007f6	resolve conflicts	2026-03-22 23:36:25 +01:00
afjcjsbx	14a4983af3	Merge branch 'main' into fix/binary-tool-output-handling # Conflicts: # pkg/agent/loop.go # pkg/tools/result.go	2026-03-22 23:08:27 +01:00
afjcjsbx	be59133ce9	resolve conflicts	2026-03-22 20:58:46 +01:00
afjcjsbx	d3ba40090b	Merge branch 'main' into feat/skill-channel-commands # Conflicts: # pkg/agent/loop.go	2026-03-22 20:51:16 +01:00
BeaconCat	60a7098fd3	feat(search): add Baidu Qianfan AI Search provider with i18n docs - Add BaiduSearchConfig struct and register in WebToolsConfig/defaults - Insert Baidu Search in priority chain: DuckDuckGo > Baidu > GLM Search - Use perplexityTimeout (30s) — Qianfan is LLM-based - Fix response parsing: use references[] field per API spec - Add baidu_search block to config.example.json docs: sync configuration.md and README Documentation table across all languages - Complete truncated configuration.md for fr/ja/pt-br/vi/zh: add Spawn async flow diagram, Providers table, Model Configuration (all vendors, examples, load balancing, migration), Provider Architecture, Scheduled Tasks, and Advanced Topics links - Add Hooks/Steering/SubTurn entries to Documentation table in all 8 READMEs (en/zh/fr/id/it/ja/pt-br/vi), ordered before Troubleshooting - Add Baidu Search row to web search table in all 8 READMEs and tools_configuration.md (en + 5 i18n); zh README reorders search engines with China-friendly options first - Add Matrix channel docs translations (fr/ja/pt-br/vi) - Add Weixin channel to chat-apps.md and all README Channels tables Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-23 00:51:27 +08:00
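
Commit 16d23d8cdc above describes filtering sensitive values out of tool results with a `strings.Replacer` and replacing them with `[FILTERED]`. The following is a minimal sketch of that idea, not the project's actual code: the helper name `buildReplacer`, the hard-coded secret list, and the reading of `FilterMinLength` as "skip values shorter than this" are all assumptions made for illustration.

```go
package main

import (
	"fmt"
	"strings"
)

// buildReplacer is a hypothetical helper. It builds one strings.Replacer that
// maps every sufficiently long secret to the literal "[FILTERED]".
// Assumption: values shorter than minLen are skipped so that short, common
// substrings are not blanked out by accident.
func buildReplacer(secrets []string, minLen int) *strings.Replacer {
	pairs := make([]string, 0, len(secrets)*2)
	for _, s := range secrets {
		if len(s) < minLen {
			continue
		}
		pairs = append(pairs, s, "[FILTERED]")
	}
	// NewReplacer accepts old/new pairs; an empty list yields a no-op replacer.
	return strings.NewReplacer(pairs...)
}

func main() {
	// Made-up values; the commit says the real ones are collected from .security.yml.
	r := buildReplacer([]string{"sk_live_abcdef123456", "abc"}, 8)
	fmt.Println(r.Replace("token=sk_live_abcdef123456 abc"))
}
```

Building the replacer once and reusing it matches the caching the commit message mentions (`sync.Once`); the sketch leaves caching out to stay short.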


346 Commits