Question about positional embeddings #7

@luzhoulz

if image_rotary_emb is not None:  # for self attn, extend the sequence length according to the cross_view_attn param
    query = apply_rotary_emb(query, image_rotary_emb)
    key = apply_rotary_emb(key, image_rotary_emb)
    if cross_view_attn:
        query = rearrange(query, '(b v) l c -> b (v l) c', v=n_view)
        key = rearrange(key, '(b v) l c -> b (v l) c', v=n_view)
        value = rearrange(value, '(b v) l c -> b (v l) c', v=n_view)
else:  # for cross attn, extend the sequence length
    query = rearrange(query, '(b v) l c -> b (v l) c', v=n_view)

I noticed that the rotary positional embeddings (image_rotary_emb) are applied to the query and key in self-attention, but not in cross-attention. Is this a mistake, or is it intentional?
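
For context, here is a minimal shape sketch of the cross-view rearrange as I understand it (dummy sizes, not the repo's real config): RoPE is applied while the views are still folded into the batch dimension, so every view sees positions 0..l-1, and only afterwards are the per-view sequences concatenated into one long sequence.

    # Minimal shape sketch with dummy sizes (my own illustration, not the repo's code).
    import torch
    from einops import rearrange

    b, n_view, l, c = 2, 3, 16, 64          # batch, views, tokens per view, channels
    query = torch.randn(b * n_view, l, c)   # views are folded into the batch dim

    # In the snippet above, apply_rotary_emb runs here, i.e. while the tensor is
    # still (b*v, l, c), so RoPE positions repeat per view before the merge.
    query = rearrange(query, '(b v) l c -> b (v l) c', v=n_view)
    print(query.shape)  # torch.Size([2, 48, 64]) -> one sequence of v*l tokens
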
