transformer_lens.conversion_utils package
Subpackages
- transformer_lens.conversion_utils.conversion_steps package
- Submodules
- transformer_lens.conversion_utils.conversion_steps.arithmetic_tensor_conversion module
- transformer_lens.conversion_utils.conversion_steps.attention_auto_conversion module
- transformer_lens.conversion_utils.conversion_steps.base_tensor_conversion module
- transformer_lens.conversion_utils.conversion_steps.callable_tensor_conversion module
- transformer_lens.conversion_utils.conversion_steps.chain_tensor_conversion module
- transformer_lens.conversion_utils.conversion_steps.rearrange_tensor_conversion module
- transformer_lens.conversion_utils.conversion_steps.repeat_tensor_conversion module
- transformer_lens.conversion_utils.conversion_steps.split_tensor_conversion module
- transformer_lens.conversion_utils.conversion_steps.tensor_conversion_set module
- transformer_lens.conversion_utils.conversion_steps.ternary_tensor_conversion module
- transformer_lens.conversion_utils.conversion_steps.transpose_tensor_conversion module
- transformer_lens.conversion_utils.conversion_steps.zeros_like_conversion module
- Module contents
- Submodules
- transformer_lens.conversion_utils.helpers package
Submodules
Module contents
Model bridge conversion utilities.
This module contains utilities for converting between different model architectures.
- class transformer_lens.conversion_utils.TensorConversionSet(fields: dict[str, Any])
Bases: BaseTensorConversion
- get_component(model: Any, name: str) -> Any
Get a component from the model using the field mapping.
- Parameters:
model – The model to get the component from.
name – The name of the component to get.
- Returns:
The requested component.
- get_conversion_action(field: str) -> BaseTensorConversion
- handle_conversion(input_value: Any, *full_context: Any) -> dict[str, Any]
- process_conversion(input_value: Any, remote_field: str, conversion: BaseTensorConversion, *full_context: Any) -> Any
- process_conversion_action(input_value: Any, conversion_details: Any, *full_context: Any) -> Any
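The field-mapping lookup described for get_component can be illustrated with a small stand-in. This is only a sketch and does not use or reproduce the transformer_lens implementation: the class name FieldMappingSketch is hypothetical, and it assumes that each entry in fields maps a component name to a dotted attribute path on the model. The real TensorConversionSet takes fields: dict[str, Any], so its values may be structured differently.

```python
from types import SimpleNamespace
from typing import Any


class FieldMappingSketch:
    """Hypothetical stand-in; NOT the transformer_lens TensorConversionSet."""

    def __init__(self, fields: dict[str, str]) -> None:
        # Assumption: each value is a dotted attribute path on the model.
        self.fields = fields

    def get_component(self, model: Any, name: str) -> Any:
        """Resolve `name` through the field mapping and return the component."""
        if name not in self.fields:
            raise KeyError(f"no field mapping for {name!r}")
        component = model
        # Walk the dotted path one attribute at a time.
        for part in self.fields[name].split("."):
            component = getattr(component, part)
        return component


# A toy "model" built from nested namespaces instead of a real network.
model = SimpleNamespace(transformer=SimpleNamespace(wte="token-embedding"))
mapping = FieldMappingSketch({"embed": "transformer.wte"})
resolved = mapping.get_component(model, "embed")
```

Here resolved is the object stored at model.transformer.wte. Asking for a name that has no entry in the mapping raises KeyError in this sketch; how the library itself reports a missing field is not stated in this reference.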