Based on a request from our European partners, introduce new languages for the
transcription feature. Dutch and German are now supported, which is a great
addition.
Closes #837.
WhisperX is expected to support both languages.
Fix MinIO client configuration: I was incorrectly passing a full URL instead of
an endpoint, which caused errors in staging. Local development values did not
reflect the staging setup and were also out of sync with the backend.
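A minimal sketch of the distinction, assuming the `minio` Python SDK, whose client expects a bare `host:port` endpoint plus a `secure` flag rather than a full URL (the helper name is illustrative):

```python
from urllib.parse import urlparse

def to_minio_endpoint(url_or_endpoint: str) -> tuple[str, bool]:
    """Normalize a value into the (endpoint, secure) pair the MinIO
    client expects: 'https://minio.example.com:9000' becomes
    ('minio.example.com:9000', True)."""
    parsed = urlparse(url_or_endpoint)
    if parsed.scheme in ("http", "https"):
        return parsed.netloc, parsed.scheme == "https"
    # Already a bare endpoint; assume TLS unless configured otherwise.
    return url_or_endpoint, True
```

Normalizing once at settings load time keeps local, staging, and backend values in sync regardless of which form is configured.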
Link the transcription document to its related recording by adding a short
header explaining that users can download the audio file via a dedicated link.
This was a highly requested feature, as many users need to keep their audio
files.
As part of a small refactor, remove the argument length check in the metadata
analytics class. The hardcoded argument count made the code harder to evolve and
was easy to forget to update. Argument unwrapping remains fragile and should be
redesigned later to be more robust.
The backend is responsible for generating the download link to ensure
consistency and reliability.
I tried adding a divider, but the Markdown-to-Yjs conversion is very lossy and
almost never handles it correctly. Only about one out of ten conversions works
as expected.
Video files are heavy recording files, sometimes several hours long.
Previously, recordings were naively submitted to the Whisper API without
chunking, resulting in very large requests that could take a long time
to process. Video files are much larger than audio-only files, which
could cause performance issues during upload.
Introduce an extra step to extract the audio component from MP4 files,
producing a lighter audio-only file (to be confirmed). No re-encoding
is done, just a minimal FFmpeg extraction based on community guidance,
since I’m not an FFmpeg expert.
This feature is experimental and may introduce regressions, especially
if audio quality or sampling is impacted, which could reduce Whisper’s
accuracy. Early tests with the ASR model worked, but it has not been
tested on long recordings (e.g., 3-hour meetings),
which some users have.
Screen recordings are MP4 files containing video.
The current approach is suboptimal: the microservice will later be updated to
extract the audio track from videos itself, since full video files are heavy to
send to the Whisper service.
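The extraction step can be sketched as follows; the helper name is hypothetical, but the flags match the community guidance mentioned above: `-vn` drops the video stream and `-acodec copy` copies the audio track without re-encoding:

```python
def build_audio_extract_cmd(src: str, dst: str) -> list[str]:
    # -vn drops the video stream; -acodec copy remuxes the existing
    # audio track without re-encoding, keeping the operation fast and
    # lossless (so Whisper's accuracy should not be affected).
    return ["ffmpeg", "-i", src, "-vn", "-acodec", "copy", dst]
```

Note the destination container must be compatible with the source's audio codec (e.g. `.m4a` for the AAC audio typically found in MP4s).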
This implementation is straightforward, but the notification service is now
handling many responsibilities through conditional logic. A refactor with a
more configurable approach (mapping attributes to processing steps via
settings) would be cleaner and easier to maintain.
For now, this works; further improvements can come later.
I followed the KISS principle and tried to implement this new feature with the
least possible impact on the codebase. This isn't perfect.
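The configurable approach could look roughly like this (attribute and step names are hypothetical):

```python
# Hypothetical settings-driven mapping from recording attributes to
# processing steps, replacing the chain of conditionals in the
# notification service.
ATTRIBUTE_TO_STEP = {
    "is_video": "extract_audio",
    "summary_enabled": "generate_summary",
}

def steps_for(recording: dict) -> list[str]:
    """Return the processing steps triggered by a recording's attributes."""
    return [
        step
        for attribute, step in ATTRIBUTE_TO_STEP.items()
        if recording.get(attribute)
    ]
```

New behaviors would then be added by extending the mapping in settings rather than editing the service's branching logic.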
The previous code lacked proper encapsulation, resulting in an overly complex
worker. While the initial naive approach was great for bootstrapping the
feature, the refactor introduces more maturity with dedicated service classes
that have clear, single responsibilities.
During the extraction to services, several minor issues were fixed:
1) Properly closing the MinIO response.
2) Enhanced validation of object filenames and extensions to ensure
correct file handling.
3) Introduced a context manager to automatically clean up temporary
local files, removing the reliance on developers remembering to delete them.
4) Slightly improved logging and naming for clarity.
5) Dynamic temporary file extension handling; the extension was previously
hardcoded to .ogg even when the file was not an OGG.
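Point 3's context manager could be sketched like this (a minimal version, not the project's actual code), which also covers point 5's dynamic extension:

```python
import os
import tempfile
from contextlib import contextmanager

@contextmanager
def temporary_local_file(extension: str):
    """Yield a temp file path and delete it afterwards, even on errors,
    so developers no longer have to remember the cleanup themselves."""
    # The extension is passed in dynamically rather than hardcoded to .ogg.
    fd, path = tempfile.mkstemp(suffix=extension)
    os.close(fd)
    try:
        yield path
    finally:
        if os.path.exists(path):
            os.remove(path)
```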
Pass the recording options' language to the summary service, allowing users to
specify the recording's language.
This is important because automatic language detection often fails, causing
empty transcriptions or 5xx errors from the Whisper API. Users then do not
receive their transcriptions, which leads to frustration. For most of our
userbase, meetings are in French, and automatic detection is unreliable.
Support for language parameterization in the Whisper API has existed for some
time; only the frontend and backend integration were missing.
I did not force French as the default, since a minority of users hold English or
other European meetings. A proper settings tab to configure this value will be
introduced later.
Implement Langfuse tracing integration for LLM service calls to capture
prompts, responses, latency, token usage, and errors, enabling
comprehensive monitoring and debugging of AI model interactions
for performance analysis and cost optimization.
Replace plain string fields with Pydantic SecretStr class for all
sensitive configuration values in FastAPI settings to prevent accidental
exposure in logs, error messages, or debugging output, following
security best practices for credential handling.
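A minimal illustration of the behavior (the token value is made up): `SecretStr` masks the value in `str()`/`repr()`, and the real value must be requested explicitly:

```python
from pydantic import SecretStr

# Masks credentials in logs, error messages, and debugging output;
# the underlying value is only reachable via get_secret_value().
api_key = SecretStr("super-secret-token")

masked = str(api_key)              # '**********'
real = api_key.get_secret_value()  # 'super-secret-token'
```

Since FastAPI settings classes build on Pydantic, switching a field's type from `str` to `SecretStr` is usually enough; only the call sites that actually need the value add `get_secret_value()`.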
Install Langfuse observability client in summary service
to enable LLM tracing, monitoring, and debugging capabilities
for AI-powered summarization workflows,
improving visibility into model performance and behavior.
Consolidate scattered transcript formatting functions into single
cohesive class encapsulating all transcript processing logic
for better maintainability and clearer separation of concerns.
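As a rough sketch of the consolidation (class and method names are illustrative, not the project's):

```python
class TranscriptFormatter:
    """Single home for the transcript processing logic that was
    previously scattered across standalone functions."""

    def __init__(self, segments: list[dict]):
        self.segments = segments

    def as_text(self) -> str:
        return "\n".join(s["text"].strip() for s in self.segments)

    def with_speakers(self) -> str:
        return "\n".join(
            f"{s.get('speaker', 'Unknown')}: {s['text'].strip()}"
            for s in self.segments
        )
```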
Add transcript cleaning step to remove spurious recognition artifacts
like randomly predicted "Vap'n'Roll Thierry" phrases that appear
without corresponding audio, improving transcript quality
by filtering model hallucinations.
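The cleaning step can be sketched as a simple pattern filter (in practice the pattern list would be configurable; this minimal version hardcodes the one phrase mentioned above):

```python
import re

# Known spurious phrases the model sometimes predicts with no
# corresponding audio; matched case-insensitively and removed.
SPURIOUS_PATTERNS = [
    re.compile(r"Vap'n'Roll Thierry", re.IGNORECASE),
]

def clean_transcript(text: str) -> str:
    for pattern in SPURIOUS_PATTERNS:
        text = pattern.sub("", text)
    return " ".join(text.split())  # collapse whitespace left behind
```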
- add admin action to retry a recording notification to external services
- log more Celery tasks' parameters
- add multilingual support for real-time subtitles
- update backend dependencies
Add detailed logging for owner ID, recording metadata, and
processing context in transcription tasks to improve debugging
capabilities.
It was especially important to log the created document id,
so that when running into trouble with the docs API, I could share
the impacted newly created documents with that team.
Restore the correct task_args ordering in the metadata manager after commit
f0939b6f added a sender argument to Celery signals for transcription task
scoping, unexpectedly shifting positional arguments and breaking metadata
creation. The issue went undetected because analytics is not deployed on
staging: production observability on the microservice was silently lost without
blocking transcription job execution, highlighting the need to activate
analytics on staging.
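One way to harden the handler against this class of regression, hypothetically, is to bind the Celery signal payload by keyword instead of by position, so a new argument added upstream cannot shift the others:

```python
def on_task_prerun(sender=None, task_id=None, args=None, **extra):
    # Keyword binding: a newly added signal argument (like the sender
    # introduced in f0939b6f) lands in `extra` instead of silently
    # shifting task_id/args into the wrong parameters.
    return {"sender": sender, "task_id": task_id, "args": args}
```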
Add the ability to use response_format in the call function in order to
get better results with the albert-large model.
Use response_format for next-steps and plan generation.
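A sketch of passing the parameter through, assuming an OpenAI-compatible chat completions API (the helper name is made up):

```python
def build_completion_kwargs(messages, response_format=None):
    # response_format is forwarded as-is; {"type": "json_object"}
    # asks the model for structured JSON output, which is useful for
    # next-steps and plan generation with albert-large.
    kwargs = {"model": "albert-large", "messages": messages}
    if response_format is not None:
        kwargs["response_format"] = response_format
    return kwargs
```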
Restrict metadata manager signal triggers to transcription-specific Celery
tasks to prevent exceptions when new summary worker executes tasks
not designed for metadata operations, reducing false-positive Sentry errors.
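A minimal version of the guard (the task-name prefix is hypothetical):

```python
TRANSCRIPTION_TASK_PREFIX = "transcription."  # hypothetical naming scheme

def should_handle_metadata(sender_name: str) -> bool:
    # Only transcription tasks carry the payload the metadata manager
    # expects; summary-worker tasks are skipped to avoid false-positive
    # Sentry errors.
    return sender_name.startswith(TRANSCRIPTION_TASK_PREFIX)
```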
Make WhisperX language detection configurable through FastAPI settings
to handle empty audio start scenarios where automatic detection fails and
incorrectly defaults to English despite 99% French usage.
Quick fix acknowledging long-term solution should allow dynamic
per-recording language selection configured by users through web
interface rather than global server settings.
Sadly, we used the user's db id as the PostHog distinct id
for identified users, not the sub.
Before this commit, we were only passing the sub to the
summary microservice.
Add the owner's id. Please note we introduce a different
naming behavior by prefixing the id with "owner"; we didn't
for the sub and the email.
We cannot align sub and email with this new naming approach,
because external contributors have already started building
their own microservice.
Ensure transcribe jobs are properly assigned to their specific queue
instead of using default queue. This prevents job routing issues and
ensures proper task distribution across workers.
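In Celery this is typically a `task_routes` entry; the queue and task names below are illustrative:

```python
# Route transcription tasks to their own queue instead of the default
# one, so workers can be scaled and isolated per task type.
task_routes = {
    "transcription.*": {"queue": "transcription"},
}
```

This dict would be assigned to `app.conf.task_routes` on the Celery app, and the matching workers started with `--queues transcription`.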
Introduce FastAPI settings configuration option to completely disable
the summary feature. This improves developer experience by allowing
developers to skip summary-related setup when not needed for their
workflow.
Implement summarization functionality that processes completed meeting
transcripts to generate concise summaries.
First draft based on a simple recursive agentic scenario.
Observability and evaluation will be added in the next PRs.
Name the Celery queue used by transcription worker to prepare for
dedicated summarization queue separation, enabling faster transcript
delivery while isolating new agentic logic in separate worker processes.
Rename incorrectly named OpenAI configuration settings since
they're used to instantiate WhisperX client which is not OpenAI
compatible, preventing confusion about actual service dependencies.
Consolidate summary service into main development stack to centralize
development environment management and simplify service orchestration
with shared infrastructure like MinIO storage.
Sync ruff's target Python version to match Docker image version
used for summary component to ensure runtime consistency.
Prevents syntax/feature mismatches, catches version-specific issues
before deployment, and ensures linting targets the actual runtime
environment for better deployment safety.