How to stop token generation #298
Unanswered
federicotesta1703
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Is there a way to stop the generation while using mlx_vlm.generate()? Something like the "stop_sequences" parameter? I want to avoid using mlx_vlm.stream_generate() and capture the stop tokens manually. Any suggestions?
Beta Was this translation helpful? Give feedback.
All reactions