Conversation
```python
        return hidden_states


class LTX2PerturbedAttnProcessor:
```
I think this is just a guider https://github.com/huggingface/diffusers/blob/main/src/diffusers/guiders/skip_layer_guidance.py
Thanks! Looking at the code, it's unclear to me whether SkipLayerGuidance currently works for LTX-2.3 for the following reasons:
- Not attention backend agnostic: if I understand correctly, STG is implemented through `AttentionProcessorSkipHook`, which uses `AttentionScoreSkipFunctionMode` to intercept calls to `torch.nn.functional.scaled_dot_product_attention` and simply return the `value`. But I think other attention backends like `flash-attn` won't call that function and thus will not work with `SkipLayerGuidance`.
- LTX-2.3 does additional computation on the values: LTX-2.3 additionally processes the `value` tensor using learned per-head gates before sending it to the attention output projection `to_out`. This is not supported by the current `SkipLayerGuidance` implementation.
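To make the backend concern concrete, here is a minimal sketch (not the actual diffusers implementation) of intercepting `scaled_dot_product_attention` with a `TorchFunctionMode` so it returns its `value` input directly; any attention backend that never routes through this function would bypass the mode entirely:

```python
import torch
import torch.nn.functional as F
from torch.overrides import TorchFunctionMode


class SkipSDPA(TorchFunctionMode):
    """Sketch of the interception approach: while this mode is active,
    calls to F.scaled_dot_product_attention return the value tensor
    instead of computing attention."""

    def __torch_function__(self, func, types, args=(), kwargs=None):
        kwargs = kwargs or {}
        if func is F.scaled_dot_product_attention:
            # args are (query, key, value, ...); skip attention entirely.
            return args[2]
        return func(*args, **kwargs)


q = k = torch.randn(1, 2, 4, 8)
v = torch.randn(1, 2, 4, 8)
with SkipSDPA():
    out = F.scaled_dot_product_attention(q, k, v)
# Inside the mode, `out` is just `v`; a backend such as flash-attn that
# never calls this function would be unaffected by the mode.
```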
I'm not sure whether these issues can be resolved with changes to the SkipLayerGuidance implementation or whether something like a new attention processor would make more sense here.
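If the attention-processor route were taken, the skip step for LTX-2.3 would also need to apply the per-head gates to the `value` tensor before `to_out`. A hypothetical sketch of that gating step (function name and shapes are assumptions for illustration, not taken from this PR):

```python
import torch


def skip_with_per_head_gates(value: torch.Tensor, gates: torch.Tensor) -> torch.Tensor:
    """Hypothetical helper: instead of computing attention, return the
    value tensor scaled by learned per-head gates, mirroring the extra
    processing LTX-2.3 is described to apply before the output
    projection (to_out).

    value: (batch, heads, seq_len, head_dim); gates: (heads,)
    """
    # Broadcast each head's gate over the batch, sequence, and head_dim axes.
    return value * gates.view(1, -1, 1, 1)


batch, heads, seq_len, head_dim = 2, 4, 8, 16
value = torch.randn(batch, heads, seq_len, head_dim)
gates = torch.sigmoid(torch.randn(heads))  # stand-in for learned gates
skipped = skip_with_per_head_gates(value, gates)
```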
I have opened a PR with a possible modification to SkipLayerGuidance to allow it to better support LTX-2.3 at #13220.
This is a good callout! From my understanding, the guider as a component doesn't change much across models; LTX-2 is probably an exception. If more models start to do their own form of SLG, we could think about giving them their own guider classes / attention processors. But for now, I think modifications to the existing SLG class make more sense.
What does this PR do?
This PR adds support for LTX-2.3 (official code, model weights), a new model in the LTX-2.X family of audio-video models. LTX-2.3 has improved audio and visual quality and prompt adherence as compared to LTX-2.0.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.
@yiyixuxu
@sayakpaul