For long-running inference where you want progress as it happens, rather than waiting for the full response, Lyceum Cloud exposes a Server-Sent Events (SSE) endpoint per inference request.
Endpoint
The stream emits a queued event immediately, then either a finished or a failed event when the inference resolves.
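The event sequence described here can be handled with a small loop that skips queued and stops at the first terminal event. The following is a minimal sketch, not Lyceum's client code: the helper name wait_for_terminal is hypothetical, the events are assumed to arrive as (event_name, data) pairs, and the payloads in the simulated stream are made up for illustration.

```python
from typing import Iterable, Tuple

# Event names come from the description above; the payload format is an assumption.
TERMINAL_EVENTS = {"finished", "failed"}


def wait_for_terminal(events: Iterable[Tuple[str, str]]) -> Tuple[str, str]:
    """Consume (event_name, data) pairs until the inference resolves."""
    for name, data in events:
        if name in TERMINAL_EVENTS:
            return name, data
        # "queued" and any unrecognised event names are skipped, so an
        # additional event type would not break this loop.
    raise ConnectionError("stream ended before a finished or failed event arrived")


# Simulated stream; the JSON payloads are invented for this example.
simulated = [("queued", "{}"), ("finished", '{"output": "hello"}')]
name, data = wait_for_terminal(simulated)
print(name, data)
```

Raising when the stream closes without a terminal event is one possible design choice; it lets the caller distinguish a dropped connection from a failed inference.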
When to use this
- You want to display progress to a user during a long inference
- You’re building a chat UI where partial responses should appear as they’re generated
- You’re integrating with a frontend that already speaks SSE (most do)
Consuming the stream
Use any SSE client: EventSource in browsers, sseclient-py in Python, etc.
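If you would rather not add a dependency, the SSE wire format is simple enough to parse by hand. The sketch below uses only the Python standard library and follows the generic SSE framing (event: and data: fields, blank line as event terminator, lines starting with a colon as comments). The function names, the stream URL, and the request headers are placeholders and assumptions; this page does not specify the endpoint path, authentication, or payload shape.

```python
import urllib.request
from typing import Dict, Iterable, Iterator, Optional


def parse_sse(lines: Iterable[str]) -> Iterator[Dict[str, str]]:
    """Yield one {"event", "data"} dict per SSE event from decoded text lines."""
    event, data = "message", []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line ends the current event; dispatch only if data arrived.
            if data:
                yield {"event": event, "data": "\n".join(data)}
            event, data = "message", []
            continue
        if line.startswith(":"):
            continue  # comment line, often used as a keep-alive
        field, _, value = line.partition(":")
        if value.startswith(" "):
            value = value[1:]  # a single leading space after the colon is dropped
        if field == "event":
            event = value
        elif field == "data":
            data.append(value)


def stream_events(url: str, headers: Optional[Dict[str, str]] = None) -> Iterator[Dict[str, str]]:
    """Open the stream at `url` (placeholder) and stop after a terminal event."""
    request_headers = {"Accept": "text/event-stream", **(headers or {})}
    req = urllib.request.Request(url, headers=request_headers)
    with urllib.request.urlopen(req) as resp:
        for ev in parse_sse(chunk.decode("utf-8") for chunk in resp):
            yield ev
            if ev["event"] in ("finished", "failed"):
                return


# Offline demonstration with a hand-written stream; payloads are made up.
sample = [
    "event: queued\n", "data: {}\n", "\n",
    ": keep-alive\n",
    "event: finished\n", 'data: {"output": "hi"}\n', "\n",
]
for ev in parse_sse(sample):
    print(ev["event"], ev["data"])
```

In real use you would call stream_events with the SSE URL for your inference request and whatever authentication headers your account requires, then iterate over the yielded events.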
