transformer_lens.config package¶
Submodules¶
- transformer_lens.config.hooked_transformer_config module
HookedTransformerConfigHookedTransformerConfig.NTK_by_parts_factorHookedTransformerConfig.NTK_by_parts_high_freq_factorHookedTransformerConfig.NTK_by_parts_low_freq_factorHookedTransformerConfig.NTK_original_ctx_lenHookedTransformerConfig.act_fnHookedTransformerConfig.attention_dirHookedTransformerConfig.attn_onlyHookedTransformerConfig.attn_scaleHookedTransformerConfig.attn_scores_soft_capHookedTransformerConfig.attn_typesHookedTransformerConfig.checkpoint_indexHookedTransformerConfig.checkpoint_label_typeHookedTransformerConfig.checkpoint_valueHookedTransformerConfig.d_vocab_outHookedTransformerConfig.decoder_start_token_idHookedTransformerConfig.dtypeHookedTransformerConfig.epsHookedTransformerConfig.experts_per_tokenHookedTransformerConfig.final_rmsHookedTransformerConfig.from_checkpointHookedTransformerConfig.from_dict()HookedTransformerConfig.gated_mlpHookedTransformerConfig.init_modeHookedTransformerConfig.init_weightsHookedTransformerConfig.initializer_rangeHookedTransformerConfig.is_layer_norm_activation()HookedTransformerConfig.load_in_4bitHookedTransformerConfig.model_nameHookedTransformerConfig.n_devicesHookedTransformerConfig.n_paramsHookedTransformerConfig.norm_topk_probHookedTransformerConfig.normalization_typeHookedTransformerConfig.num_expertsHookedTransformerConfig.original_architectureHookedTransformerConfig.output_logits_soft_capHookedTransformerConfig.parallel_attn_mlpHookedTransformerConfig.post_embedding_lnHookedTransformerConfig.relative_attention_max_distanceHookedTransformerConfig.relative_attention_num_bucketsHookedTransformerConfig.rotary_adjacent_pairsHookedTransformerConfig.rotary_baseHookedTransformerConfig.rotary_base_localHookedTransformerConfig.rotary_dimHookedTransformerConfig.rotary_scaling_factorHookedTransformerConfig.scale_attn_by_inverse_layer_idxHookedTransformerConfig.seedHookedTransformerConfig.set_seed_everywhere()HookedTransformerConfig.tie_word_embeddingsHookedTransformerConfig.to_dict()HookedTransformerConfig.tokenizer_nameHookedTransformerConfig.tokenizer_prepends_bosHookedTransformerConfig.trust_remote_codeHookedTransformerConfig.ungroup_grouped_query_attentionHookedTransformerConfig.unwrap()HookedTransformerConfig.use_NTK_by_parts_ropeHookedTransformerConfig.use_attn_inHookedTransformerConfig.use_attn_scaleHookedTransformerConfig.use_hook_mlp_inHookedTransformerConfig.use_hook_tokensHookedTransformerConfig.use_local_attnHookedTransformerConfig.use_normalization_before_and_afterHookedTransformerConfig.use_qk_normHookedTransformerConfig.use_yarn_ropeHookedTransformerConfig.window_sizeHookedTransformerConfig.yarn_attention_factorHookedTransformerConfig.yarn_beta_fastHookedTransformerConfig.yarn_beta_slowHookedTransformerConfig.yarn_factorHookedTransformerConfig.yarn_original_max_position_embeddings
- transformer_lens.config.transformer_bridge_config module
- transformer_lens.config.transformer_lens_config module
TransformerLensConfigTransformerLensConfig.act_fnTransformerLensConfig.attn_onlyTransformerLensConfig.d_headTransformerLensConfig.d_mlpTransformerLensConfig.d_modelTransformerLensConfig.d_vocabTransformerLensConfig.default_prepend_bosTransformerLensConfig.deviceTransformerLensConfig.dtypeTransformerLensConfig.epsTransformerLensConfig.experts_per_tokenTransformerLensConfig.final_rmsTransformerLensConfig.from_dict()TransformerLensConfig.gated_mlpTransformerLensConfig.layer_norm_foldingTransformerLensConfig.n_ctxTransformerLensConfig.n_headsTransformerLensConfig.n_key_value_headsTransformerLensConfig.n_layersTransformerLensConfig.normalization_typeTransformerLensConfig.num_expertsTransformerLensConfig.positional_embedding_typeTransformerLensConfig.to_dict()TransformerLensConfig.unwrap()TransformerLensConfig.use_attn_resultTransformerLensConfig.use_split_qkv_inputTransformerLensConfig.uses_rms_norm
Module contents¶
Configuration classes for TransformerLens.