transformer_lens.benchmarks.weight_processing module¶
Weight processing benchmarks for TransformerBridge.
- transformer_lens.benchmarks.weight_processing.benchmark_attention_output_centering(bridge: TransformerBridge, test_text: str, reference_model: HookedTransformer | None = None) BenchmarkResult¶
Benchmark attention output centering - W_O should have mean ≈ 0.
- Parameters:
bridge – TransformerBridge model to test
test_text – Input text for testing
reference_model – Optional HookedTransformer reference model (not used)
- Returns:
BenchmarkResult with attention output centering verification details
- transformer_lens.benchmarks.weight_processing.benchmark_layer_norm_folding(bridge: TransformerBridge, test_text: str, reference_model: HookedTransformer | None = None) BenchmarkResult¶
Benchmark layer norm folding - norm weights should be identity after folding.
- Parameters:
bridge – TransformerBridge model to test
test_text – Input text for testing
reference_model – Optional HookedTransformer reference model (not used)
- Returns:
BenchmarkResult with layer norm folding verification details
- transformer_lens.benchmarks.weight_processing.benchmark_mlp_output_centering(bridge: TransformerBridge, test_text: str, reference_model: HookedTransformer | None = None) BenchmarkResult¶
Benchmark MLP output centering - MLP output weights should have mean ≈ 0.
- Parameters:
bridge – TransformerBridge model to test
test_text – Input text for testing
reference_model – Optional HookedTransformer reference model (not used)
- Returns:
BenchmarkResult with MLP output centering verification details
- transformer_lens.benchmarks.weight_processing.benchmark_no_nan_inf(bridge: TransformerBridge, test_text: str, reference_model: HookedTransformer | None = None) BenchmarkResult¶
Benchmark that weights contain no NaN or Inf values.
- Parameters:
bridge – TransformerBridge model to test
test_text – Input text for testing
reference_model – Optional HookedTransformer reference model (not used)
- Returns:
BenchmarkResult with NaN/Inf verification details
- transformer_lens.benchmarks.weight_processing.benchmark_unembed_centering(bridge: TransformerBridge, test_text: str, reference_model: HookedTransformer | None = None) BenchmarkResult¶
Benchmark unembed centering - unembed matrix should have mean ≈ 0.
- Parameters:
bridge – TransformerBridge model to test
test_text – Input text for testing
reference_model – Optional HookedTransformer reference model (not used)
- Returns:
BenchmarkResult with unembed centering verification details
- transformer_lens.benchmarks.weight_processing.benchmark_value_bias_folding(bridge: TransformerBridge, test_text: str, reference_model: HookedTransformer | None = None) BenchmarkResult¶
Benchmark value bias folding - b_V should be zero after folding.
- Parameters:
bridge – TransformerBridge model to test
test_text – Input text for testing
reference_model – Optional HookedTransformer reference model (not used)
- Returns:
BenchmarkResult with value bias folding verification details
- transformer_lens.benchmarks.weight_processing.benchmark_weight_magnitudes(bridge: TransformerBridge, test_text: str, reference_model: HookedTransformer | None = None) BenchmarkResult¶
Benchmark that weight magnitudes are in reasonable ranges.
- Parameters:
bridge – TransformerBridge model to test
test_text – Input text for testing
reference_model – Optional HookedTransformer reference model (not used)
- Returns:
BenchmarkResult with weight magnitude verification details
- transformer_lens.benchmarks.weight_processing.benchmark_weight_modification(bridge: TransformerBridge, test_text: str, reference_model: HookedTransformer | None = None) BenchmarkResult¶
Benchmark that weight modifications propagate correctly.
- Parameters:
bridge – TransformerBridge model to test
test_text – Input text for testing
reference_model – Optional HookedTransformer reference model (not used)
- Returns:
BenchmarkResult with weight modification verification details
- transformer_lens.benchmarks.weight_processing.benchmark_weight_processing(bridge: TransformerBridge, test_text: str, reference_model: HookedTransformer | None = None) BenchmarkResult¶
Benchmark weight processing (folding, centering) application.
- Parameters:
bridge – TransformerBridge model to test
test_text – Input text for testing
reference_model – Optional HookedTransformer reference model
- Returns:
BenchmarkResult with weight processing verification details
- transformer_lens.benchmarks.weight_processing.benchmark_weight_sharing(bridge: TransformerBridge, test_text: str, reference_model: HookedTransformer | None = None, atol: float = 0.001) BenchmarkResult¶
Benchmark weight sharing and modification effects.
- Parameters:
bridge – TransformerBridge model to test
test_text – Input text for testing
reference_model – Optional HookedTransformer reference model
atol – Absolute tolerance for effect comparison
- Returns:
BenchmarkResult with weight sharing verification details