studio/sol - sol - Gitea: Git with a cup of tea

studio/sol

Author	SHA1	Message	Date
Sienna Meridian Satterwhite	9e5f7e61be	feat(orchestrator): Phase 2 engine + tokenizer + tool dispatch Orchestrator engine: - engine.rs: unified Mistral Conversations API tool loop that emits OrchestratorEvent instead of calling Matrix/gRPC directly - tool_dispatch.rs: ToolSide routing (client vs server tools) - Memory loading stubbed (migrates in Phase 4) Server-side tokenizer: - tokenizer.rs: HuggingFace tokenizers-rs with Mistral's BPE tokenizer - count_tokens() for accurate usage metrics - Loads from local tokenizer.json or falls back to bundled vocab - Config: mistral.tokenizer_path (optional) No behavior change — engine is wired but not yet called from sync.rs or session.rs (Phase 2 continuation).	2026-03-23 17:40:25 +00:00
Sienna Meridian Satterwhite	ec4fde7b97	feat(orchestrator): Phase 1 — event types + broadcast channel foundation Introduces the orchestrator module with: - OrchestratorEvent enum: 11 event variants covering lifecycle, tools, progress, and side effects - RequestId (UUID per generation), ResponseMode (Chat/Code), ToolSide - ChatRequest/CodeRequest structs for transport-agnostic request input - Orchestrator struct with tokio::broadcast channel (capacity 256) - subscribe() for transport bridges, emit() for the engine - Client-side tool dispatch: pending_client_tools map with oneshot channels - submit_tool_result() to unblock engine from gRPC client responses Additive only — no behavior change. Existing responder + gRPC session paths are untouched. Phase 2 will migrate the Conversations API path.	2026-03-23 17:30:36 +00:00
Sienna Meridian Satterwhite	b8b76687a5	feat(grpc): dev mode, agent prefix, system prompt, error UX - gRPC dev_mode config: disables JWT auth, uses fixed dev identity - Agent prefix (agents.agent_prefix): dev agents use "dev-sol-orchestrator" to avoid colliding with production on shared Mistral accounts - Coding sessions use instructions (system prompt + coding addendum) with mistral-medium-latest for personality adherence - Conversations API: don't send both model + agent_id (422 fix) - GrpcState carries system_prompt + orchestrator_agent_id - Session.end() keeps session active for reuse (not "ended") - User messages posted as m.notice, assistant as m.text (role detection) - History loaded from Matrix room on session resume - Docker Compose local dev stack: OpenSearch 3 + Tuwunel + SearXNG - Dev config: localhost URLs, dev_mode, opensearch-init.sh for ML setup	2026-03-23 17:07:50 +00:00
Sienna Meridian Satterwhite	71392cef9c	feat(code): wire gRPC server into Sol startup spawns gRPC server alongside Matrix sync loop when [grpc] config is present. shares ToolRegistry, Store, MistralClient, and Matrix client with the gRPC CodeSession handler.	2026-03-23 13:01:36 +00:00
Sienna Meridian Satterwhite	abfad337c5	feat(code): CodeSession + agent loop + Matrix room bridge phase 2 server core: - CodeSession: create/resume sessions, Matrix room per project, Mistral conversation lifecycle, tool dispatch loop - agent loop: user input → Mistral → tool calls → route (client via gRPC / server via ToolRegistry) → collect results → respond - Matrix bridge: all messages posted to project room, accessible from any Matrix client - code_sessions SQLite table (Postgres-compatible schema) - coding mode context injection (project path, git info, prompt.md)	2026-03-23 11:46:22 +00:00
Sienna Meridian Satterwhite	35b6246fa7	feat(code): gRPC server with JWT auth + tool routing tonic 0.14 gRPC server for sunbeam code sessions: - bidirectional streaming Session RPC - JWT interceptor validates tokens against Hydra JWKS - tool router classifies calls as client-side (file_read, bash, grep, etc.) or server-side (gitea, identity, search, etc.) - service stub with session lifecycle (start, chat, tool results, end) - coding_model config (default: devstral-small-2506) - grpc config section (listen_addr, jwks_url) - 182 tests (5 new: JWT claims, tool routing) phase 2 TODOs: Matrix room bridge, Mistral agent loop, streaming	2026-03-23 11:35:37 +00:00
Sienna Meridian Satterwhite	2a1d7a003d	auto-recover corrupted conversations on API error when append_conversation fails (422, 404, etc.), the stale mapping is deleted and a fresh conversation is created automatically. prevents Sol from being permanently stuck after a hung research session or Mistral API error. v1.1.0	2026-03-23 09:53:29 +00:00
Sienna Meridian Satterwhite	1ba4e016ba	add self-hosted web search via SearXNG new search_web tool calls SearXNG (cluster-internal, free, no tracking) instead of Mistral's built-in web_search ($0.03/query + rate limits). returns structured results from DuckDuckGo, Wikipedia, StackOverflow, GitHub, arXiv, and Brave. no API keys, no cost, no rate limits. removed Mistral AgentTool::web_search() from orchestrator — replaced by the custom tool which goes through Sol's normal tool dispatch.	2026-03-23 09:52:56 +00:00
Sienna Meridian Satterwhite	567d4c1171	fix research agent hang: per-agent timeout + startup cleanup research agents now have a 2-minute timeout via tokio::time::timeout. a hung Mistral API call can no longer block Sol's entire sync loop. timed-out agents return partial results instead of hanging forever. on startup, Sol detects research sessions with status='running' from previous crashes and marks them as failed. 6 new tests covering the full research session lifecycle: create, append findings, complete, fail, hung cleanup, and partial findings survival.	2026-03-23 09:03:03 +00:00
Sienna Meridian Satterwhite	447bead0b7	wire up identity agent, research tool, silence state main.rs: create KratosClient, pass mistral+store to ToolRegistry, build active_agents list for dynamic delegation. conversations.rs: context_hint for new conversations, reset_all. sdk/mod.rs: added kratos module.	2026-03-23 01:43:51 +00:00
Sienna Meridian Satterwhite	de33ddfe33	multi-agent research: parallel LLM-powered investigation new research tool spawns 3-25 micro-agents (ministral-3b) in parallel via futures::join_all. each agent gets its own Mistral conversation with full tool access. recursive spawning up to depth 4 — agents can spawn sub-agents. research sessions persisted in SQLite (survive reboots). thread UX: 🔍 reaction, per-agent progress posts, ✅ when done. cost: ~$0.03 per research task (20 micro-agents on ministral-3b).	2026-03-23 01:42:40 +00:00
Sienna Meridian Satterwhite	7dbc8a3121	room overlap access control for cross-room search search_archive, get_room_context, and sol.search() (in run_script) enforce a configurable member overlap threshold. results from a room are only visible if >=25% of that room's members are also in the requesting room. system-level filter applied at the opensearch query layer — sol never sees results from excluded rooms.	2026-03-23 01:42:20 +00:00
Sienna Meridian Satterwhite	7324c10d25	proper Matrix threading + concise tool call formatting agent_ux: uses Relation::Thread (not Reply) for tool call details. format_tool_call extracts key params instead of dumping raw JSON. format_tool_result truncated to 200 chars. matrix_utils: added make_thread_reply() for threaded responses. sync.rs routes ThreadReply engagement to threaded messages.	2026-03-23 01:42:08 +00:00
Sienna Meridian Satterwhite	3b62d86c45	evaluator redesign: response types, silence, structural suppression new engagement types: Respond (inline), ThreadReply (threaded), React, Ignore. LLM returns response_type to decide HOW to engage. silence mechanic: "shut up"/"be quiet" sets a 30min per-room timer. only direct @mention breaks through. structural suppression (A+B): - reply to non-Sol human → capped at React - 3+ human messages since Sol → forced passive mode threads have a lower relevance threshold (70% of spontaneous). time context injected into evaluator prompt.	2026-03-23 01:41:57 +00:00
Sienna Meridian Satterwhite	1058afb635	add TimeContext: 25 pre-computed time values for the model midnight-based day boundaries (today, yesterday, 2 days ago), week/month boundaries, rolling offsets (1h to 30d). injected into system prompt via {time_block} and per-message via compact time line. models no longer need to compute epoch timestamps.	2026-03-23 01:41:44 +00:00
Sienna Meridian Satterwhite	84278fc1f5	add identity agent: 7 kratos admin API tools list_users, get_user, create_user, recover_user, disable_user, enable_user, list_sessions. all via kratos admin API (cluster- internal, no auth needed). email-to-UUID resolution with fallback search. delete_user and set_password excluded (CLI-only).	2026-03-23 01:41:25 +00:00
Sienna Meridian Satterwhite	8e7c572381	expand gitea tools: 17 new operations (24 total) repos: create, edit, fork, list org repos issues: edit, list comments, create comment PRs: get, create, merge branches: list, create, delete orgs: list user orgs, get org notifications: list added write:repository and read:notification PAT scopes. new response types: Comment, Branch, Organization, Notification. authed_patch and authed_delete helpers with 401 retry. URL-encoded query params throughout.	2026-03-23 01:41:08 +00:00
Sienna Meridian Satterwhite	822a597a87	sol 1.0.0 — relicense to AGPL-3.0, dual-license model relicensed from MIT to AGPL-3.0-or-later. commercial license available for organizations that need private modifications (does not permit redistribution). sol 1.0.0 ships with: - multi-agent architecture with mistral conversations API - user impersonation via vault-backed PAT provisioning - gitea integration (first domain agent: devtools) - per-user memory system with automatic extraction - full-context evaluator with system prompt awareness - agent recreation on prompt changes with conversation reset - web search, sandboxed deno runtime, archive search v1.0.0	2026-03-22 15:24:02 +00:00
Sienna Meridian Satterwhite	cccdb7b502	update documentation for sdk, vault, gitea integration	2026-03-22 15:00:51 +00:00
Sienna Meridian Satterwhite	7bf9e25361	per-message context headers, memory notes, conversation continuity conversations API path now injects per-message context headers with live timestamps, room name, and memory notes. this replaces the template variables in agent instructions which were frozen at creation time. memory notes (topical + recent backfill) loaded before each response in the conversations path — was previously only in the legacy path. context hint seeds new conversations with recent room history after resets, so sol doesn't lose conversational continuity on sneeze. tool call results now logged with preview + length for debugging. reset_all() clears both in-memory and sqlite conversation state.	2026-03-22 15:00:43 +00:00
Sienna Meridian Satterwhite	904ffa2d4d	agent recreation on prompt changes, deterministic hash, dynamic delegation orchestrator instructions hash (FNV-1a, stable across rust versions) is stored alongside agent ID. on startup, hash mismatch triggers delete of old agent + creation of new one + conversation reset + sneeze. delegation section is now dynamic — only lists domain agents that are actually registered, preventing the model from hallucinating capabilities for agents that don't exist yet. web_search added as a built-in tool on the orchestrator.	2026-03-22 15:00:23 +00:00
Sienna Meridian Satterwhite	c9d4f7400d	add devtools agent tools, fix search field mapping 7 gitea tools: list_repos, get_repo, list_issues, get_issue, create_issue, list_pulls, get_file. all operate as the requesting user via PAT impersonation. tool registry conditionally includes gitea tools when configured. dispatch uses prefix matching (gitea_*) for clean extension. fixed search bug: room_name and sender_name filters used .keyword subfield which doesn't exist on keyword-typed fields — queries silently returned zero results for all room/sender-filtered searches.	2026-03-22 15:00:06 +00:00
Sienna Meridian Satterwhite	f479235a63	add sdk layer: vault client, token store, gitea API vault.rs — OpenBao client with kubernetes auth, KV v2 operations, automatic token refresh on 403. proper error handling on all paths. tokens.rs — vault-backed token storage with expiry validation. get_valid returns Result<Option> to distinguish vault errors from missing tokens. username mappings stay in sqlite (not secrets). gitea.rs — typed gitea API v1 wrapper with per-user PAT auto-provisioning via admin API. username discovery by direct match or email search. URL-encoded query params. handles 400 and 422 token name conflicts with delete+retry.	2026-03-22 14:59:25 +00:00
Sienna Meridian Satterwhite	14022aa7c0	add token infrastructure: service_users table, localpart helper new SQLite table service_users maps OIDC identities (matrix localpart) to service-specific usernames, handling auth boundary mismatches. localpart() extracts the username from a matrix user ID. delete_all_conversations() added for bulk reset after agent recreation. all delete_* methods now log failures instead of silently discarding. removed dead user_tokens table (tokens now live in vault).	2026-03-22 14:58:28 +00:00
Sienna Meridian Satterwhite	2333dda904	enhance evaluator with full system prompt context the evaluator now receives sol's entire system prompt as a system message, giving ministral-3b deep context on sol's personality when scoring relevance. evaluation context window bumped from 25 to 200 messages, room/dm context windows unified at 200. pre-computed timestamp variables ({ts_yesterday}, {ts_1h_ago}, {ts_last_week}) added to personality template for accurate time references without LLM math.	2026-03-22 14:58:11 +00:00
Sienna Meridian Satterwhite	cf0f640c66	add .gitignore for rust build artifacts	2026-03-22 14:57:41 +00:00
Sienna Meridian Satterwhite	7580c10dda	feat: multi-agent architecture with Conversations API and persistent state Mistral Agents + Conversations API integration: - Orchestrator agent created on startup with Sol's personality + tools - ConversationRegistry routes messages through persistent conversations - Per-room conversation state (room_id → conversation_id + token counts) - Function call handling within conversation responses - Configurable via [agents] section in sol.toml (use_conversations_api flag) Multimodal support: - m.image detection and Matrix media download (mxc:// → base64 data URI) - ContentPart-based messages sent to Mistral vision models - Archive stores media_urls for image messages System prompt rewrite: - 687 → 150 lines — dense, few-shot examples, hard rules - {room_context_rules} placeholder for group vs DM behavior - Sender prefixing (<@user:server>) for multi-user turns in group rooms SQLite persistence (/data/sol.db): - Conversation mappings and agent IDs survive reboots - WAL mode for concurrent reads - Falls back to in-memory on failure (sneezes into all rooms to signal) - PVC already mounted at /data alongside Matrix SDK state store New modules: - src/persistence.rs — SQLite state store - src/conversations.rs — ConversationRegistry + message merging - src/agents/{mod,definitions,registry}.rs — agent lifecycle - src/agent_ux.rs — reaction + thread progress UX - src/tools/bridge.rs — tool dispatch for domain agents 102 tests passing.	2026-03-21 22:21:14 +00:00
Sienna Meridian Satterwhite	5e2186f324	add README, MIT license, and package metadata	2026-03-21 15:53:49 +00:00
Sienna Meridian Satterwhite	4949e70ecc	feat: per-user auto-memory with ResponseContext Three memory channels: hidden tool (sol.memory.set/get in scripts), pre-response injection (relevant memories loaded into system prompt), and post-response extraction (ministral-3b extracts facts after each response). User isolation enforced at Rust level — user_id derived from Matrix sender, never from script arguments. New modules: context (ResponseContext), memory (schema, store, extractor). ResponseContext threaded through responder → tools → script runtime. OpenSearch index sol_user_memory created on startup alongside archive.	2026-03-21 15:51:31 +00:00
Sienna Meridian Satterwhite	4dc20bee23	feat: initial Sol virtual librarian implementation Matrix bot with E2EE (matrix-sdk 0.9) that passively archives all messages to OpenSearch and responds to queries via Mistral AI with function calling tools. Core systems: - Archive: bulk OpenSearch indexer with batch/flush, edit/redaction handling, embedding pipeline passthrough - Brain: rule-based engagement evaluator (mentions, DMs, name invocations), LLM-powered spontaneous engagement, per-room conversation context windows, response delay simulation - Tools: search_archive, get_room_context, list_rooms, get_room_members registered as Mistral function calling tools with iterative tool loop - Personality: templated system prompt with Sol's librarian persona 47 unit tests covering config, evaluator, conversation windowing, personality templates, schema serialization, and search query building.	2026-03-20 21:40:13 +00:00

30 Commits