Hello!
Is it possible to retrieve an agent's response in real-time (word by word) during generation?
I've seen the documentation about "streaming" but I can't get it working in Python.
Does anyone have a working Python code example?
Thanks!
Hi Cridland, yes. You just need to set "blocking": false (instead of true) in your API call (when creating a conversation, for example), and you will then receive the events.
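In case it helps, here is a minimal Python sketch of reading such events as they arrive. It is not taken from the official docs. It assumes the events are delivered as a server-sent-event stream in which each `data:` line carries a JSON object. The helper names (`iter_sse_events`, `stream_events`), the Bearer `Authorization` header, and the `text` field used in the usage note are assumptions for illustration, so please check them against the API reference.

```python
import json
import urllib.request
from typing import Iterable, Iterator


def iter_sse_events(lines: Iterable[bytes]) -> Iterator[dict]:
    """Yield the decoded JSON payload of every `data:` line in an SSE stream."""
    for raw in lines:
        line = raw.decode("utf-8").strip()
        if not line.startswith("data:"):
            continue  # blank separators, comments and other SSE fields
        payload = line[len("data:"):].strip()
        try:
            yield json.loads(payload)
        except json.JSONDecodeError:
            continue  # ignore payloads that are not JSON


def stream_events(url: str, api_key: str) -> Iterator[dict]:
    """Open the events URL and yield events while the response is still open.

    Assumption: the endpoint accepts a Bearer token and replies with
    `text/event-stream`. `url` is whatever events endpoint the API
    documents for your conversation; it is not filled in here.
    """
    request = urllib.request.Request(
        url,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Accept": "text/event-stream",
        },
    )
    with urllib.request.urlopen(request) as response:
        # The response object is iterable line by line, so events are
        # yielded as soon as the server sends them.
        yield from iter_sse_events(response)
```

Usage would look something like `for event in stream_events(events_url, api_key): print(event.get("text", ""), end="", flush=True)`, where `events_url` is a placeholder and `text` is an assumed field name for the generated tokens. Printing with `flush=True` is what makes the output appear piece by piece instead of all at once.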
Real-Time Streaming of Agent Responses in Python: Help Needed! | Dust Community