picoclaw

mirror of https://github.com/sipeed/picoclaw.git synced 2026-06-12 18:08:54 +00:00

Author	SHA1	Message	Date
Orkun Manap	dd9adf8a04	feat: add ElevenLabs Scribe STT transcriber and Telegram SendVoice support (#1905 ) * feat: add ElevenLabs Scribe STT transcriber and Telegram SendVoice support Add ElevenLabsTranscriber as an alternative speech-to-text provider using the ElevenLabs Scribe API (scribe_v1). This enables voice message transcription for users who already have an ElevenLabs API key, without requiring a separate Groq account. Changes: - Add ElevenLabsTranscriber implementing the Transcriber interface - Update DetectTranscriber to check providers.elevenlabs.api_key first, falling back to Groq for backward compatibility - Add ElevenLabs to ProvidersConfig - Add "voice" media type for OGG files with "voice" in filename - Add SendVoice support in Telegram channel for voice bubble messages - Add comprehensive tests for ElevenLabs transcriber Configuration: "providers": { "elevenlabs": { "api_key": "sk_your_key_here" } } Closes #1503 (partial) * fix: move voice-bubble detection into Telegram channel to avoid regression in other channels Address review feedback: keep inferMediaType returning "audio" for all OGG files. Voice-bubble detection (SendVoice vs SendAudio) is now done inside the Telegram channel based on filename, so other channels that map "audio" explicitly are unaffected. * fix: align VoiceConfig struct tags to pass golines formatter Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> * fix(agent): use ModelName in loop test added by upstream Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-23 22:11:10 +01:00
daming大铭	2c48cd3461	Merge pull request #1907 from xiwuqi/wuxi/fix-reasoning-channel-content fix(agent): route reasoning_content to reasoning channel	2026-03-24 01:24:14 +08:00
uiyzzi	16d23d8cdc	feat(security): add sensitive data filtering for tool results sent to LLM Prevent LLM from seeing its own credentials (API keys, tokens, secrets) by filtering sensitive values from tool call results before sending to the model. Values are collected from .security.yml and replaced with [FILTERED] using an efficient strings.Replacer (O(n+m)). - Add FilterSensitiveData and FilterMinLength to ToolsConfig - Implement SensitiveDataReplacer() with sync.Once caching in SecurityConfig - Use reflection to collect all sensitive values (Model API keys, channel tokens, web tool API keys, skills tokens) - Apply filtering in agent loop at 4 tool result locations - Add comprehensive tests covering all token types	2026-03-23 20:55:41 +08:00
Liqiang Liu	f81b44bf19	fix(provider): deduplicate tool results and merge consecutive tool_result blocks for Anthropic API (#1793 ) Anthropic API returns 400 when multiple tool_result blocks share the same tool_use_id, or when consecutive tool results are sent as separate user messages. This fix: 1. Adds ToolCallID deduplication in sanitizeHistoryForProvider (context.go) to drop duplicate tool results before sending to any provider. 2. Merges consecutive tool result messages into a single user message with multiple tool_result content blocks in Anthropic's buildRequestBody, for both "user" (with ToolCallID) and "tool" role messages. 3. Adds tests for both behaviors. Co-authored-by: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-03-23 17:24:46 +08:00
uiyzzi	d1d2155edb	Use ModelName instead of Model in test config structs	2026-03-23 16:47:13 +08:00
Mauro	054b55fdfc	Merge pull request #1893 from afjcjsbx/feat/skill-channel-commands feat(skills): add channel commands to list and force installed skills	2026-03-23 09:04:06 +01:00
Cytown	5a8aab8143	Merge branch 'main' into version	2026-03-23 11:41:36 +08:00
Cytown	7bf4831059	Merge branch 'main' into version	2026-03-23 10:54:08 +08:00
xiwuqi	336d5d4c07	fix(agent): route reasoning_content to reasoning channel	2026-03-22 19:57:47 -05:00
afjcjsbx	be59133ce9	resolve conflicts	2026-03-22 20:58:46 +01:00
afjcjsbx	d3ba40090b	Merge branch 'main' into feat/skill-channel-commands # Conflicts: # pkg/agent/loop.go	2026-03-22 20:51:16 +01:00
BeaconCat	60a7098fd3	feat(search): add Baidu Qianfan AI Search provider with i18n docs - Add BaiduSearchConfig struct and register in WebToolsConfig/defaults - Insert Baidu Search in priority chain: DuckDuckGo > Baidu > GLM Search - Use perplexityTimeout (30s) — Qianfan is LLM-based - Fix response parsing: use references[] field per API spec - Add baidu_search block to config.example.json docs: sync configuration.md and README Documentation table across all languages - Complete truncated configuration.md for fr/ja/pt-br/vi/zh: add Spawn async flow diagram, Providers table, Model Configuration (all vendors, examples, load balancing, migration), Provider Architecture, Scheduled Tasks, and Advanced Topics links - Add Hooks/Steering/SubTurn entries to Documentation table in all 8 READMEs (en/zh/fr/id/it/ja/pt-br/vi), ordered before Troubleshooting - Add Baidu Search row to web search table in all 8 READMEs and tools_configuration.md (en + 5 i18n); zh README reorders search engines with China-friendly options first - Add Matrix channel docs translations (fr/ja/pt-br/vi) - Add Weixin channel to chat-apps.md and all README Channels tables Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-23 00:51:27 +08:00
afjcjsbx	d7d2bf69bf	feat(skills): add channel commands to list and force installed skills	2026-03-22 15:33:25 +01:00
Administrator	7868c5811a	fix(agent): fix subturn panic result, hard abort rollback, and drain bus exit - spawnSubTurn: set result=nil on panic instead of constructing a non-nil ToolResult - HardAbort: roll back session history to initialHistoryLength after Finish() - drainBusToSteering: switch to non-blocking reads after first message so function returns promptly when the inbound channel is empty - remove obsolete documentation files	2026-03-22 20:35:14 +08:00
Administrator	7ba8682ac5	Merge branch 'refactor/agent' into feat/subturn-poc	2026-03-22 19:51:43 +08:00
Administrator	f7f27e237a	merge: resolve conflicts between refactor/agent and main	2026-03-22 19:21:58 +08:00
daming大铭	0432facffc	Merge pull request #1863 from alexhoshina/feat/hook-manager Feat/hook manager	2026-03-22 14:36:07 +08:00
Administrator	88d754b172	merge main	2026-03-22 13:47:14 +08:00
Cytown	7c854fe6d7	Merge branch 'main' into version	2026-03-22 02:53:55 +08:00
Cytown	e455eb5e67	refactor: seperate security.yml for store keys	2026-03-22 01:55:00 +08:00
daming大铭	ebcd5645f1	Revert "feat(tools): add exec tool enhancement with background execution and …" This reverts commit `f901af8cbc`.	2026-03-22 00:39:47 +08:00
Administrator	24d6cb5272	Merge branch 'upstream-main' into feat/subturn-poc	2026-03-21 23:42:25 +08:00
Liu Yuan	f901af8cbc	feat(tools): add exec tool enhancement with background execution and PTY support (#1752 ) - Unified exec tool with actions: run/list/poll/read/write/send-keys/kill - PTY support using creack/pty library - Process session management with background execution - Process group kill for cleaning up child processes - Session cleanup: 30-minute TTL for old sessions - Output buffer: 100MB limit with truncation Actions: - run: execute command (sync or background) - list: list all sessions - poll: check session status - read: read session output - write: send input to session stdin - send-keys: send special keys (up, down, ctrl-c, enter, etc.) - kill: terminate session Tests: - PTY: allowed commands, write/read, poll, kill, process group kill - Non-PTY: background execution, list, read, write, poll, kill, process group kill - Session management: add/get/remove/list/cleanup	2026-03-21 22:38:03 +08:00
Hoshina	337e43e5a5	feat(agent): add configurable hook mounting	2026-03-21 19:46:16 +08:00
Hoshina	cf68c91eca	feat(agent): add hook manager foundation	2026-03-21 19:15:10 +08:00
Administrator	670b433f1a	refactor: replace interface{} with any for improved type clarity	2026-03-21 18:24:56 +08:00
Administrator	1bd144ac13	Merge branch 'upstream-main' into feat/subturn-poc	2026-03-21 17:13:26 +08:00
Administrator	087e8519c5	refactor: improve code readability and consistency across multiple files	2026-03-21 17:12:45 +08:00
Mauro	100720bb74	Merge pull request #1818 from Alix-007/fix/issue-1815-empty-response-message fix(agent): separate empty-response and tool-limit fallbacks	2026-03-20 23:23:48 +01:00
afjcjsbx	9e344594a2	fix logic	2026-03-20 21:07:07 +01:00
afjcjsbx	827449aff3	fix lint	2026-03-20 20:12:55 +01:00
afjcjsbx	1c6586681d	fix(agent) scope steering	2026-03-20 19:44:00 +01:00
Amir Mamaghani	71134babb9	feat(telegram): stream LLM responses via sendMessageDraft (#1101 ) * feat(telegram): stream LLM responses in real-time via sendMessageDraft Implements real-time token streaming to Telegram using the sendMessageDraft API (telego v1.6.0). Instead of showing only a "Thinking..." placeholder until the full response arrives, users now see partial LLM output appear in the chat as it's generated. The streaming pipeline threads through all layers: - StreamingProvider interface (providers/types.go): opt-in ChatStream() method that receives an onChunk callback with accumulated text - OpenAI-compatible SSE streaming (openai_compat/provider.go): parses SSE events with stream:true, handles text deltas and tool call assembly - Anthropic native streaming (anthropic/provider.go): uses SDK's NewStreaming() for direct Anthropic API connections - HTTPProvider delegation (http_provider.go): delegates ChatStream to the underlying openai_compat provider - StreamingCapable + Streamer interfaces (channels/interfaces.go): opt-in channel capability like TypingCapable/PlaceholderCapable - Telegram streamer (telegram/telegram.go): BeginStream returns a telegramStreamer that throttles sendMessageDraft calls (3s/200 chars) with graceful degradation on API errors - StreamDelegate bridge (bus/bus.go): decouples agent loop from channel manager without tight imports - Manager integration (manager.go): implements StreamDelegate, tracks streamActive state, coordinates with placeholder editing - Agent loop (loop.go): uses ChatStream when both provider and channel support streaming, cancels stream on tool calls, skips PublishOutbound when Finalize already delivered the message Graceful degradation: - Bots without forum/topics mode: first sendMessageDraft error sets failed=true, subsequent Updates become no-ops, Finalize still delivers via SendMessage. User sees normal non-streaming behavior. - Non-streaming providers: type assertion fails, falls back to Chat() - Config opt-out: streaming.enabled (default true) in telegram config Closes #1098 * fix(telegram): delete placeholder message when streaming delivers response When streaming was active, the "Thinking..." placeholder message stayed in the chat because preSend only deleted the tracking entry without removing the actual Telegram message. Now preSend deletes the placeholder via the new MessageDeleter interface when streamActive is set. * refactor(streaming): remove dead code and simplify streaming wiring - Delete unused Anthropic ChatStream/parseStream (-131 lines) — factory creates HTTPProvider for all OpenAI-compat providers including OpenRouter - Simplify runLLMIteration from 4 to 3 return values (remove unused streamed bool) - Replace managerStreamer struct with finalizeHookStreamer using embedding (Update/Cancel promoted, only Finalize overridden) * fix(streaming): skip streamer acquisition when SendResponse is false Heartbeat messages set SendResponse=false but the streaming path was unconditionally acquiring a streamer, causing HEARTBEAT_OK to leak to Telegram via streamer.Finalize(). * fix(streaming): guard streamer for non-sendable messages, add streaming config Skip streamer acquisition for heartbeat (NoHistory=true), preventing HEARTBEAT_OK from leaking to Telegram via streamer.Finalize(). Add streaming.enabled to Telegram defaults and example config. * feat(telegram): stream LLM responses in real-time via sendMessageDraft Implements real-time token streaming to Telegram using the sendMessageDraft API (telego v1.6.0). Instead of showing only a "Thinking..." placeholder until the full response arrives, users now see partial LLM output appear in the chat as it's generated. The streaming pipeline threads through all layers: - StreamingProvider interface (providers/types.go): opt-in ChatStream() method that receives an onChunk callback with accumulated text - OpenAI-compatible SSE streaming (openai_compat/provider.go): parses SSE events with stream:true, handles text deltas and tool call assembly - Anthropic native streaming (anthropic/provider.go): uses SDK's NewStreaming() for direct Anthropic API connections - HTTPProvider delegation (http_provider.go): delegates ChatStream to the underlying openai_compat provider - StreamingCapable + Streamer interfaces (channels/interfaces.go): opt-in channel capability like TypingCapable/PlaceholderCapable - Telegram streamer (telegram/telegram.go): BeginStream returns a telegramStreamer that throttles sendMessageDraft calls (3s/200 chars) with graceful degradation on API errors - StreamDelegate bridge (bus/bus.go): decouples agent loop from channel manager without tight imports - Manager integration (manager.go): implements StreamDelegate, tracks streamActive state, coordinates with placeholder editing - Agent loop (loop.go): uses ChatStream when both provider and channel support streaming, cancels stream on tool calls, skips PublishOutbound when Finalize already delivered the message Graceful degradation: - Bots without forum/topics mode: first sendMessageDraft error sets failed=true, subsequent Updates become no-ops, Finalize still delivers via SendMessage. User sees normal non-streaming behavior. - Non-streaming providers: type assertion fails, falls back to Chat() - Config opt-out: streaming.enabled (default true) in telegram config Closes #1098 * fix(telegram): delete placeholder message when streaming delivers response When streaming was active, the "Thinking..." placeholder message stayed in the chat because preSend only deleted the tracking entry without removing the actual Telegram message. Now preSend deletes the placeholder via the new MessageDeleter interface when streamActive is set. * refactor(streaming): remove dead code and simplify streaming wiring - Delete unused Anthropic ChatStream/parseStream (-131 lines) — factory creates HTTPProvider for all OpenAI-compat providers including OpenRouter - Simplify runLLMIteration from 4 to 3 return values (remove unused streamed bool) - Replace managerStreamer struct with finalizeHookStreamer using embedding (Update/Cancel promoted, only Finalize overridden) * fix(streaming): skip streamer acquisition when SendResponse is false Heartbeat messages set SendResponse=false but the streaming path was unconditionally acquiring a streamer, causing HEARTBEAT_OK to leak to Telegram via streamer.Finalize(). * fix(streaming): guard streamer for non-sendable messages, add streaming config Skip streamer acquisition for heartbeat (NoHistory=true), preventing HEARTBEAT_OK from leaking to Telegram via streamer.Finalize(). Add streaming.enabled to Telegram defaults and example config. * fix(picoclaw): add missing closing brace for StreamingProvider interface Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: resolve golangci-lint formatting issues Fix gci import ordering in telegram and anthropic provider, and break long function signature in openai_compat provider to satisfy golines. * fix: address code review feedback on streaming PR - Deduplicate Streamer interface: alias channels.Streamer to bus.Streamer to prevent type drift across packages - Increase SSE scanner buffer to 10MB max to handle large single-line responses that exceed bufio.Scanner's 64KB default - Switch draftID generation from math/rand to crypto/rand for collision-resistant random IDs - Add context cancellation check in SSE parsing loop so cancelled streams stop processing immediately - Log Finalize failures with chat_id and content length for debugging silent message delivery failures * feat: make streaming throttle interval and min growth configurable Move hardcoded streamThrottleInterval (3s) and streamMinGrowth (200) into StreamingConfig so they can be tuned per deployment via config or environment variables. * fix(telegram): use parseTelegramChatID in DeleteMessage and BeginStream These two functions called undefined parseChatID. Use parseTelegramChatID with _ for the unused threadID instead of adding a wrapper function. Fixes all three CI checks. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(streaming): set streamActive only after successful Finalize Move onFinalize hook to run after Streamer.Finalize succeeds, so that if Finalize fails the streamActive flag stays false and the regular placeholder fallback path remains available. Addresses review feedback from @alexhoshina. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-20 21:04:14 +08:00
Hoshina	2b3c95b1f1	fix: lint err	2026-03-20 17:46:31 +08:00
Hoshina	0e075f7300	feat(agent): centralize turn lifecycle and continue queued steering Refactor agent loop execution around runTurn, add explicit turn state and interrupt semantics, and automatically continue queued steering that misses the current turn boundary.	2026-03-20 17:28:12 +08:00
Hoshina	a65e0e95d6	fix: lint err	2026-03-20 15:45:27 +08:00
Hoshina	57cde73b36	feat(agent): expand event bus coverage	2026-03-20 15:29:52 +08:00
Hoshina	50cc7100ce	feat(agent): make event logs show event kind clearly	2026-03-20 15:06:43 +08:00
Hoshina	af61d0bca7	feat(agent): add event bus foundation	2026-03-20 14:53:22 +08:00
Alix-007	82d574eb7b	fix(agent): separate empty-response and tool-limit fallbacks	2026-03-20 14:37:47 +08:00
Administrator	4f646ef2b8	Merge branch 'main' into feat/subturn-poc	2026-03-20 11:51:25 +08:00
Administrator	e71ef3764d	fix(test): reduce blank identifiers to comply with dogsled linter Changed newTestAgentLoop calls from using 3 blank identifiers to 2 by assigning the unused provider parameter and explicitly marking it as unused with `_ = provider`. This fixes the dogsled linter violations that were causing CI failures. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-03-20 11:12:47 +08:00
Mauro	bd4317f1f4	Merge pull request #1390 from kiannidev/fix/1323-telegram-endless-typing fix(telegram): stop typing indicator when LLM fails or hangs	2026-03-19 21:52:10 +01:00
Administrator	c18d8a2ecc	Merge branch 'upstream-main' into feat/subturn-poc	2026-03-19 22:12:51 +08:00
Alix-007	276a0cb92c	fix(agent): rebind provider after /switch model to (#1769 ) * fix(agent): rebind provider after model switch * test(agent): deduplicate switch model mock servers --------- Co-authored-by: Alix-007 <267018309+Alix-007@users.noreply.github.com>	2026-03-19 21:44:01 +08:00
SakoroYou	844a4eefc7	fix(agent): avoid process exit on exec init failure and add regression test (#1784 ) * fix(agent): make exec tool init failure non-fatal * test(agent): add regression test for invalid exec config fallback	2026-03-19 21:11:36 +08:00
Administrator	583c586db6	Merge branch 'main' into feat/subturn-poc	2026-03-19 20:20:31 +08:00
Cytown	94fcb25039	Merge branch 'main' into version	2026-03-19 18:16:15 +08:00
Mauro	7673b626b3	feat(tool): debug tool usage via channels (#1332 ) * feat(tool): debug usage via channel * set defaults * fix conflicts	2026-03-19 18:08:50 +08:00
Cytown	cfd3a1b441	Merge branch 'main' into version	2026-03-19 18:04:58 +08:00

1 2 3 4 5 ...

307 Commits