picoclaw

mirror of https://github.com/sipeed/picoclaw.git synced 2026-06-12 18:08:54 +00:00

Author	SHA1	Message	Date
LC	38a498e202	feat(provider): support custom headers injection for HTTP providers (#2402 ) * feat(provider): support custom headers injection for HTTP providers * fix(provider): resolve lint problem * fix(provider): align stream user-agent and header precedence docs	2026-04-07 16:05:21 +08:00
Cytown	2c446e1e07	feat: add userAgent config for ModelConfig (#2242 ) * feat: add userAgent config for ModelConfig * update docs for ModelConfig.userAgent * make defaut userAgent to PicoClaw and add test case	2026-04-02 11:44:13 +08:00
LC	bbcfeaa361	feat(provider): add Venice AI support and update related documentation (#2238 ) * feat(provider): add Venice AI support and update related documentation * revert(asr): restore asr files to previous commit * feat(config): add Venice API base URL and local LM Studio configuration * fix(config): update Venice API base URL to correct endpoint	2026-04-01 23:50:29 +08:00
LC	ee02e30992	feat(provider): add lmstudio and align local provider default auth/base handling (#2193 ) * feat(provider): add lmstudio vendor and local no-key behavior * refactor(provider): consolidate protocol metadata and local tests * fix(provider): sync lmstudio probing and model normalization * test(web): format lmstudio model status cases for golines	2026-03-31 14:48:18 +08:00
uiyzzi	f2985b8bee	feat(providers): add extra_body config to inject custom fields into request body Allow configuring provider-specific fields like reasoning_split for minimax via the model config's extra_body map. These fields are merged into the request body last, giving them precedence over default values. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-23 16:39:42 +08:00
Amir Mamaghani	71134babb9	feat(telegram): stream LLM responses via sendMessageDraft (#1101 ) * feat(telegram): stream LLM responses in real-time via sendMessageDraft Implements real-time token streaming to Telegram using the sendMessageDraft API (telego v1.6.0). Instead of showing only a "Thinking..." placeholder until the full response arrives, users now see partial LLM output appear in the chat as it's generated. The streaming pipeline threads through all layers: - StreamingProvider interface (providers/types.go): opt-in ChatStream() method that receives an onChunk callback with accumulated text - OpenAI-compatible SSE streaming (openai_compat/provider.go): parses SSE events with stream:true, handles text deltas and tool call assembly - Anthropic native streaming (anthropic/provider.go): uses SDK's NewStreaming() for direct Anthropic API connections - HTTPProvider delegation (http_provider.go): delegates ChatStream to the underlying openai_compat provider - StreamingCapable + Streamer interfaces (channels/interfaces.go): opt-in channel capability like TypingCapable/PlaceholderCapable - Telegram streamer (telegram/telegram.go): BeginStream returns a telegramStreamer that throttles sendMessageDraft calls (3s/200 chars) with graceful degradation on API errors - StreamDelegate bridge (bus/bus.go): decouples agent loop from channel manager without tight imports - Manager integration (manager.go): implements StreamDelegate, tracks streamActive state, coordinates with placeholder editing - Agent loop (loop.go): uses ChatStream when both provider and channel support streaming, cancels stream on tool calls, skips PublishOutbound when Finalize already delivered the message Graceful degradation: - Bots without forum/topics mode: first sendMessageDraft error sets failed=true, subsequent Updates become no-ops, Finalize still delivers via SendMessage. User sees normal non-streaming behavior. - Non-streaming providers: type assertion fails, falls back to Chat() - Config opt-out: streaming.enabled (default true) in telegram config Closes #1098 * fix(telegram): delete placeholder message when streaming delivers response When streaming was active, the "Thinking..." placeholder message stayed in the chat because preSend only deleted the tracking entry without removing the actual Telegram message. Now preSend deletes the placeholder via the new MessageDeleter interface when streamActive is set. * refactor(streaming): remove dead code and simplify streaming wiring - Delete unused Anthropic ChatStream/parseStream (-131 lines) — factory creates HTTPProvider for all OpenAI-compat providers including OpenRouter - Simplify runLLMIteration from 4 to 3 return values (remove unused streamed bool) - Replace managerStreamer struct with finalizeHookStreamer using embedding (Update/Cancel promoted, only Finalize overridden) * fix(streaming): skip streamer acquisition when SendResponse is false Heartbeat messages set SendResponse=false but the streaming path was unconditionally acquiring a streamer, causing HEARTBEAT_OK to leak to Telegram via streamer.Finalize(). * fix(streaming): guard streamer for non-sendable messages, add streaming config Skip streamer acquisition for heartbeat (NoHistory=true), preventing HEARTBEAT_OK from leaking to Telegram via streamer.Finalize(). Add streaming.enabled to Telegram defaults and example config. * feat(telegram): stream LLM responses in real-time via sendMessageDraft Implements real-time token streaming to Telegram using the sendMessageDraft API (telego v1.6.0). Instead of showing only a "Thinking..." placeholder until the full response arrives, users now see partial LLM output appear in the chat as it's generated. The streaming pipeline threads through all layers: - StreamingProvider interface (providers/types.go): opt-in ChatStream() method that receives an onChunk callback with accumulated text - OpenAI-compatible SSE streaming (openai_compat/provider.go): parses SSE events with stream:true, handles text deltas and tool call assembly - Anthropic native streaming (anthropic/provider.go): uses SDK's NewStreaming() for direct Anthropic API connections - HTTPProvider delegation (http_provider.go): delegates ChatStream to the underlying openai_compat provider - StreamingCapable + Streamer interfaces (channels/interfaces.go): opt-in channel capability like TypingCapable/PlaceholderCapable - Telegram streamer (telegram/telegram.go): BeginStream returns a telegramStreamer that throttles sendMessageDraft calls (3s/200 chars) with graceful degradation on API errors - StreamDelegate bridge (bus/bus.go): decouples agent loop from channel manager without tight imports - Manager integration (manager.go): implements StreamDelegate, tracks streamActive state, coordinates with placeholder editing - Agent loop (loop.go): uses ChatStream when both provider and channel support streaming, cancels stream on tool calls, skips PublishOutbound when Finalize already delivered the message Graceful degradation: - Bots without forum/topics mode: first sendMessageDraft error sets failed=true, subsequent Updates become no-ops, Finalize still delivers via SendMessage. User sees normal non-streaming behavior. - Non-streaming providers: type assertion fails, falls back to Chat() - Config opt-out: streaming.enabled (default true) in telegram config Closes #1098 * fix(telegram): delete placeholder message when streaming delivers response When streaming was active, the "Thinking..." placeholder message stayed in the chat because preSend only deleted the tracking entry without removing the actual Telegram message. Now preSend deletes the placeholder via the new MessageDeleter interface when streamActive is set. * refactor(streaming): remove dead code and simplify streaming wiring - Delete unused Anthropic ChatStream/parseStream (-131 lines) — factory creates HTTPProvider for all OpenAI-compat providers including OpenRouter - Simplify runLLMIteration from 4 to 3 return values (remove unused streamed bool) - Replace managerStreamer struct with finalizeHookStreamer using embedding (Update/Cancel promoted, only Finalize overridden) * fix(streaming): skip streamer acquisition when SendResponse is false Heartbeat messages set SendResponse=false but the streaming path was unconditionally acquiring a streamer, causing HEARTBEAT_OK to leak to Telegram via streamer.Finalize(). * fix(streaming): guard streamer for non-sendable messages, add streaming config Skip streamer acquisition for heartbeat (NoHistory=true), preventing HEARTBEAT_OK from leaking to Telegram via streamer.Finalize(). Add streaming.enabled to Telegram defaults and example config. * fix(picoclaw): add missing closing brace for StreamingProvider interface Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: resolve golangci-lint formatting issues Fix gci import ordering in telegram and anthropic provider, and break long function signature in openai_compat provider to satisfy golines. * fix: address code review feedback on streaming PR - Deduplicate Streamer interface: alias channels.Streamer to bus.Streamer to prevent type drift across packages - Increase SSE scanner buffer to 10MB max to handle large single-line responses that exceed bufio.Scanner's 64KB default - Switch draftID generation from math/rand to crypto/rand for collision-resistant random IDs - Add context cancellation check in SSE parsing loop so cancelled streams stop processing immediately - Log Finalize failures with chat_id and content length for debugging silent message delivery failures * feat: make streaming throttle interval and min growth configurable Move hardcoded streamThrottleInterval (3s) and streamMinGrowth (200) into StreamingConfig so they can be tuned per deployment via config or environment variables. * fix(telegram): use parseTelegramChatID in DeleteMessage and BeginStream These two functions called undefined parseChatID. Use parseTelegramChatID with _ for the unused threadID instead of adding a wrapper function. Fixes all three CI checks. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix(streaming): set streamActive only after successful Finalize Move onFinalize hook to run after Streamer.Finalize succeeds, so that if Finalize fails the streamActive flag stays false and the regular placeholder fallback path remains available. Addresses review feedback from @alexhoshina. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-20 21:04:14 +08:00
Alex	578f90855e	feat: Add Novita provider support (#1677 ) * Add Novita provider support - Add 'novita' prefix to normalizeModel switch in openai_compat provider - Add Novita provider to all_supported_vendors table in README.md - Add test cases for Novita model prefix stripping Novita endpoint: https://api.novita.ai/openai Default models: deepseek/deepseek-v3.2, zai-org/glm-5, minimax/minimax-m2.5 * feat: complete Novita provider integration * chore: drop README changes from Novita PR * fix: remove duplicate function declarations in openai_compat provider The functions buildToolsList, SupportsNativeSearch, and isNativeSearchHost were declared twice, causing compilation failures in all CI checks. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> * fix: break long line in novita test to satisfy golines linter Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-18 18:29:27 +08:00
dataCenter430	f79469c19d	Add model-native search (prefer_native) for OpenAI/Codex (#1618 ) * config: add prefer_native and NativeSearchCapable for model-native search * providers: implement native web search for OpenAI and Codex * agent: use provider-native search when prefer_native and supported * tests: add coverage for model-native search * fix: Golang lint errors * fix: update the code based on the review * fix: update codex_provider_test	2026-03-18 11:55:30 +08:00
Kunal Karmakar	5fb4b3bedf	feat(provider): add support for azure openai provider (#1422 ) * Add support for azure openai provider * Add checks for deployment model name * Apply suggestion from @Copilot Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * Addressing @Copilot suggestion to remove the init() function which seemed redundant * Fix readme * Fix linting checks --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2026-03-14 22:52:34 +08:00
Mahendra Teja	6612ca099a	fix(openai_compat): improve prompt_cache_key host matching (#1387 ) LGTM! The changes improve the robustness of prompt_cache_key host matching and add Azure OpenAI support. Thanks for the contribution!	2026-03-12 03:24:31 +08:00
amagi	49204df678	fix(openai_compat): accept object tool call arguments (#1292 )	2026-03-12 02:47:22 +08:00
Mahendra Teja	4a80c6f58c	fix(openai_compat): only send prompt_cache_key to OpenAI endpoints (#1353 ) Non-OpenAI providers (Mistral, DeepSeek, Groq, etc.) reject unknown request fields with 422 errors. The previous blocklist only excluded Google/Gemini, but the comment already noted this feature is OpenAI-only. Flip to an allowlist so only api.openai.com receives the field. Fixes #1333	2026-03-12 01:21:54 +08:00
lxowalle	abafa3c2aa	* add minimax provider (#1273 )	2026-03-09 18:43:58 +08:00
Meng Zhuo	81dfdf5f45	Merge pull request #1100 from zihan987/main feat: add Vivgrid provider support	2026-03-08 11:03:13 +08:00
Mauro	440d665baa	Merge pull request #1075 from qs3c/fix/1068-html-response-error fix(openai_compat): clarify HTML response parse errors	2026-03-07 09:56:50 +01:00
amagi	53cba73283	fix: resolve openai compat lint issues	2026-03-07 16:12:23 +08:00
amagi	6eaa49f7ab	fix: improve openai compat HTML response handling	2026-03-07 15:50:08 +08:00
zihan987	e6f5467711	Fix golines for vivgrid case	2026-03-06 04:20:22 -08:00
zihan987	7308f0621b	Merge upstream main	2026-03-05 23:58:59 -08:00
amagi	c1a3876f7d	fix: improve error handling for non-JSON responses by checking content type and using a streaming JSON parser.	2026-03-06 01:51:24 +08:00
qs3c	9216cd14b5	fix(openai_compat): handle html error bodies and reduce allocations	2026-03-05 19:42:58 +08:00
zihan987	d1cf680657	Resolve merge conflicts	2026-03-04 22:53:17 -08:00
afjcjsbx	b9ee9b33f5	prevent audio as image url	2026-03-04 19:34:08 +01:00
zihan987	0c17c075da	Merge remote-tracking branch 'origin_picoclaw/main'	2026-03-04 09:58:20 -08:00
zihan987	ea0b634b3b	add Vivgrid config	2026-03-04 09:19:03 -08:00
amagi	a305c0a479	fix(openai_compat): avoid predeclared identifier in preview	2026-03-04 23:57:26 +08:00
qs3c	4946a8b449	fix(openai_compat): clarify HTML response errors	2026-03-04 17:54:04 +08:00
shikihane	6ccb68c63e	fix: resolve linter issues (gci import grouping, gofumpt, govet shadow) - Separate third-party imports from local module imports (gci) - Fix byte slice literal formatting (gofumpt) - Rename shadowed err variable to ftErr (govet) - Remove trailing blank lines in test files Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-03 17:04:54 +08:00
shikihane	03f7ae494f	feat(openai_compat): implement serializeMessages with multipart media support Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-03 16:28:20 +08:00
Guoguo	407707a7cc	Revert "feat(agent): add vision/image support to agent pipeline"	2026-03-03 11:38:32 +08:00
Orgmar	12d4570a36	Merge pull request #990 from shikihane/feat/agent-vision-pipeline feat(agent): add vision/image support to agent pipeline	2026-03-03 11:32:07 +08:00
shikihane	8ebeefc59f	fix(agent,openai_compat): address review feedback on vision pipeline - serializeMessages: preserve ToolCallID/ToolCalls when Media is present - resolveMediaRefs: add 20MB file size limit to prevent OOM - mimeFromExtension: return empty string for unknown extensions - Add 11 unit tests for serializeMessages, resolveMediaRefs, mimeFromExtension Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-03 11:13:22 +08:00
Alfonso	946af6b53d	feat: add LiteLLM provider alias support (#930 )	2026-03-03 08:23:55 +11:00
shikihane	a4e5c391bd	fix(openai_compat): preserve reasoning_content in serializeMessages The serializeMessages() function was not preserving the reasoning_content field when serializing messages for vision API calls. This caused the TestProviderChat_PreservesReasoningContentInHistory test to fail. This fix ensures reasoning_content is included in both text-only messages and vision messages with media attachments. Co-authored-by: Zachary Guerrero <zack.grrr@gmail.com>	2026-03-02 17:38:08 +08:00
Zachary Guerrero	3d54a77c40	feat: add Media field to Message struct and implement serializeMessages for vision API support - Add Media []string field to Message struct for image/media URLs - Implement serializeMessages() to format messages with image_url content parts - Enables OpenAI-compatible vision APIs to receive image attachments	2026-03-02 17:18:04 +08:00
winterfx	9efdde25ad	fix: preserve reasoning_content in multi-turn conversation history The openaiMessage struct and stripSystemParts() were not carrying over the ReasoningContent field when serializing conversation history for API requests. This caused thinking models (e.g. kimi-k2.5) to receive incomplete assistant messages on subsequent turns, resulting in 400 errors from the Moonshot API. Add the ReasoningContent field to openaiMessage and copy it in stripSystemParts(). Also add a test to verify reasoning_content is preserved when sending conversation history. Fixes #588 Related: #876 Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-01 16:23:05 +08:00
Petrichor	62bdece7f5	chore: resolve conflicts with upstream/main	2026-02-28 12:21:54 +08:00
Avisek	9f95aad5f3	feat: Introduce LLM reasoning fields to LLM responses and enable routing reasoning output to dedicated channels.	2026-02-27 16:58:42 +08:00
Petrichor	222d1a3086	refactor(modernize): apply safe modernize fixes	2026-02-27 16:35:07 +08:00
Yiliu	438f764c7a	fix(providers): support per-model request_timeout in model_list (#733 ) * fix(providers): support per-model request_timeout in model_list * fix(lint): format provider constructors for golines * refactor(providers): adopt functional options and preserve timeout migration * docs(readme): sync request_timeout guidance across translated docs --------- Co-authored-by: Yiliu <yiliu@affiliate-guide.com>	2026-02-26 19:08:19 +11:00
임창욱	ea902429f2	fix: exclude prompt_cache_key for Gemini API requests Gemini's OpenAI-compat endpoint rejects unknown fields. Only send prompt_cache_key to OpenAI-native endpoints.	2026-02-25 14:48:51 -08:00
Zhaoyikaiii	1f7cbd9164	fix: cache system prompt with mtime-based auto-invalidation (#607 ) Avoid rebuilding the entire system prompt on every BuildMessages() call by caching the static portion (identity, bootstrap, skills summary, memory) and only recomputing it when workspace source files change. Key changes: - ContextBuilder caches the static prompt behind an RWMutex with double-checked locking. Source file changes are detected via cheap os.Stat mtime checks so no explicit invalidation is needed. - Track file existence at cache time (existedAtCache map) so that newly created or deleted bootstrap/memory files also trigger a rebuild — the old modifiedSince() silently returned false on os.IsNotExist. - Walk the skills directory recursively with filepath.WalkDir to catch content-only edits at any nesting depth; directory mtime alone misses in-place file modifications on most filesystems. - ToolRegistry.sortedToolNames() sorts tool names before iteration, ensuring deterministic tool definition order across calls — a prerequisite for LLM-side prefix/KV cache reuse. - Merge all context (static + dynamic + summary) into a single system message for provider compatibility: the Anthropic adapter extracts messages[0] as the top-level system parameter, and Codex reads only the first system message as instructions. - Fix a data race in BuildMessages() where cachedSystemPrompt was read without holding the lock in a debug log statement. - Add tests: single system message invariant, mtime auto-invalidation, new-file creation detection, skill file content change, explicit InvalidateCache, cache stability, concurrent access (20 goroutines x 50 iterations, passes go test -race), and a benchmark.	2026-02-25 15:27:45 +08:00
daming大铭	b6e965e549	Merge pull request #604 from winterfx/fix/reasoning-content-missing fix: preserve reasoning_content for OpenAI-compatible reasoning models	2026-02-24 14:20:36 +08:00
Edouard CLAUDE	65422a16a4	feat: add native Mistral AI provider support Add Mistral as a first-class provider alongside the 17 existing ones. Mistral uses the OpenAI-compatible API at https://api.mistral.ai/v1 with provider-specific model prefix stripping (mistral/model → model). Changes: - Add Mistral to ProvidersConfig, IsEmpty(), HasProvidersConfig() - Add mistral entry in default model_list (defaults.go) - Add mistral protocol in factory_provider.go and getDefaultAPIBase() - Add mistral prefix stripping in openai_compat normalizeModel() - Add mistral case in legacy factory.go resolveProviderSelection() - Add mistral migration entry in ConvertProvidersToModelList() - Add mistral to supported providers in migrate/config.go - Add mistral section in config.example.json - Update AllProviders test (17 → 18 providers) Tested end-to-end with mistral-small-latest model.	2026-02-22 11:40:21 +04:00
winterfx	d224397f40	fix: preserve reasoning_content for OpenAI-compatible reasoning models Models like Moonshot kimi-k2.5 and DeepSeek-R1 return a reasoning_content field in assistant messages. When thinking is enabled, the API requires this field to be echoed back in subsequent requests. PicoClaw was silently dropping it, causing 400 errors on tool-call round-trips. - Add ReasoningContent to Message and LLMResponse types - Parse reasoning_content in openai_compat parseResponse() - Carry reasoning_content through assistant tool-call messages - Add unit test for reasoning_content parsing Fixes #588	2026-02-21 23:29:40 +08:00
Artem Yadelskyi	0675ce7c38	feat(fmt): Fix formatting	2026-02-20 20:03:11 +02:00
Artem Yadelskyi	ad8c2d48c8	Merge branch 'main' into fix-formatting # Conflicts: # cmd/picoclaw/main.go # pkg/agent/context.go # pkg/agent/loop.go # pkg/channels/dingtalk.go # pkg/channels/feishu_64.go # pkg/channels/line.go # pkg/channels/manager.go # pkg/config/config.go # pkg/migrate/migrate_test.go # pkg/providers/anthropic/provider_test.go # pkg/providers/claude_provider_test.go # pkg/providers/http_provider.go # pkg/providers/openai_compat/provider.go # pkg/providers/protocoltypes/types.go # pkg/providers/types.go	2026-02-20 20:02:53 +02:00
Artem Yadelskyi	a896831903	feat(fmt): Fix formatting	2026-02-19 22:05:15 +02:00
Artem Yadelskyi	2038f04d0d	Merge branch 'main' into fix-formatting # Conflicts: # pkg/agent/loop.go # pkg/agent/loop_test.go # pkg/channels/discord.go # pkg/channels/onebot.go # pkg/config/config.go # pkg/tools/subagent_tool_test.go	2026-02-19 22:04:48 +02:00
yinwm	7f241647be	feat(providers): add thought_signature support for gemini Add support for persisting thought_signature metadata from Google/Gemini 3 models. This introduces ExtraContent and GoogleExtra types to handle provider-specific metadata, and ensures thought signatures are properly preserved through the tool call lifecycle.	2026-02-20 00:36:31 +08:00

1 2

56 Commits