Can the text encoder in LTX2.3 be replaced by another model?
r/StableDiffusion thread on LTX-2.x: Can the text encoder in LTX2.3 be replaced by another model?
LTX2.3 uses gemma3 12b it as it’s text encoder, I was wondering if it could be swapped with some qwen3.5 variant or something else to potentially get better results, or is the model built around that specific LLM?