Allow any user, anonymous or authenticated, to start subtitling in a room only if they are an active participant of it. Subtitling a room consists of starting the multi-user transcriber agent. This agent forwards all participants' audio to an STT server and returns transcription segments for any active voice to the room. User roles in the backend room system cannot be used to determine subtitle permissions. The transcriber agent can be triggered multiple times but will only join a room once. Unicity is managed by the agent itself. Any user with a valid LiveKit token can initiate subtitles. Feature flag logic is implemented on the frontend. The frontend ensures the "start subtitle" action is only available to users who should see it. The backend does not enforce feature flags in this version. Authentication in our system does not imply access to a room. The only valid proof of access is the LiveKit API token issued by the backend. Security consideration: A LiveKit API token is valid for 6 hours and cannot be revoked at the end of a meeting. It is important to verify that the token was issued for the correct room. Calls to the agent dispatch endpoint must be server-initiated. The backend proxies these calls, as clients cannot securely contact the agent dispatch endpoint directly (per LiveKit documentation). Room ID is passed as a query parameter. There is currently no validation ensuring that the room exists prior to agent dispatch. TODO: implement validation or error handling for non-existent rooms. The backend does not forward LiveKit tokens to the agent. Default API rate limiting is applied to prevent abuse.
48 lines
1.4 KiB
Python
48 lines
1.4 KiB
Python
"""Service for managing subtitle agents in LiveKit rooms."""
|
|
|
|
from logging import getLogger
|
|
|
|
from django.conf import settings
|
|
|
|
from asgiref.sync import async_to_sync
|
|
from livekit.protocol.agent_dispatch import CreateAgentDispatchRequest
|
|
|
|
from core import utils
|
|
|
|
logger = getLogger(__name__)
|
|
|
|
|
|
class SubtitleException(Exception):
|
|
"""Exception raised when subtitle operations fail."""
|
|
|
|
|
|
class SubtitleService:
|
|
"""Service for managing subtitle agents in LiveKit rooms."""
|
|
|
|
@async_to_sync
|
|
async def start_subtitle(self, room):
|
|
"""Start subtitle agent for the specified room."""
|
|
|
|
lkapi = utils.create_livekit_client()
|
|
|
|
try:
|
|
# Transcriber agent prevents duplicate subtitle agents per room
|
|
# No error is raised if agent already exists
|
|
await lkapi.agent_dispatch.create_dispatch(
|
|
CreateAgentDispatchRequest(
|
|
agent_name=settings.ROOM_SUBTITLE_AGENT_NAME, room=str(room.id)
|
|
)
|
|
)
|
|
except Exception as e:
|
|
logger.exception("Failed to create agent dispatch for room %s", room.id)
|
|
raise SubtitleException("Failed to create subtitle agent") from e
|
|
|
|
finally:
|
|
await lkapi.aclose()
|
|
|
|
@async_to_sync
|
|
async def stop_subtitle(self, room) -> None:
|
|
"""Stop subtitle agent for the specified room."""
|
|
|
|
raise NotImplementedError("Subtitle agent stopping not yet implemented")
|