transformer_lens.model_bridge.generalized_components.t5gemma_decoder_block module

T5Gemma-specific decoder block bridge.

T5GemmaDecoderLayer uses Gemma-style flat attribute access (not T5’s .layer[] indexing). It has: self-attention + cross-attention + MLP, each with pre/post norms. This bridge monkey-patches the layer forward to insert intermediate hook points.

class transformer_lens.model_bridge.generalized_components.t5gemma_decoder_block.T5GemmaDecoderBlockBridge(name: str, config: Any | None = None, submodules: Dict[str, GeneralizedComponent] | None = None)

Bases: GeneralizedComponent

Bridge for T5Gemma decoder layers.

Inserts hook points between the three sub-components of each decoder layer: - hook_in (hook_resid_pre): residual before self-attention pre-norm - hook_resid_mid: residual after self-attention + residual add, before cross-attn pre-norm - hook_resid_mid2: residual after cross-attention + residual add, before MLP pre-norm - hook_out (hook_resid_post): residual after MLP + residual add

forward(*args: Any, **kwargs: Any) Any

Generic forward pass for bridge components with input/output hooks.

get_expected_parameter_names(prefix: str = '') list[str]
get_list_size() int
hook_aliases: Dict[str, str | List[str]] = {'hook_resid_post': 'hook_out', 'hook_resid_pre': 'hook_in'}
is_list_item: bool = True
real_components: Dict[str, tuple]
set_original_component(component: Module) None

Set the original component that this bridge wraps.

Parameters:

original_component – The original transformer component to wrap

training: bool