I'd like to get your thoughts on how to cache the various outputs from the different stages of the pipeline, primarily the output from the text encoder. That step seems to be a substantial amount of time for a generation. But from a users view, I can imagine a scenario where they'd want to regenerate with the same prompt.
I haven't given the implementation part much thought, but probably default to some kind of interface and in memory storage.
I'd like to get your thoughts on how to cache the various outputs from the different stages of the pipeline, primarily the output from the text encoder. That step seems to be a substantial amount of time for a generation. But from a users view, I can imagine a scenario where they'd want to regenerate with the same prompt.
I haven't given the implementation part much thought, but probably default to some kind of interface and in memory storage.