Contributing¶
Warning
HookedTransformer is deprecated as of TransformerLens 3.0 and will be removed in the next major version. New code should use TransformerBridge instead. Existing HookedTransformer code continues to work through the 3.x branch via a compatibility layer. See the migration guide for conversion recipes.
Setup¶
DevContainer¶
For a one-click setup of your development environment, this project includes a DevContainer. It can be used locally with VS Code or with GitHub Codespaces.
Manual Setup¶
As of TransformerLens 3.0, this project uses UV for package and environment management (it previously used Poetry). Install UV first, then run:
# resolves and installs dependencies into .venv
uv sync
# activate the virtual environment
source .venv/bin/activate
Dependency groups are defined in pyproject.toml under [dependency-groups]. The project sets default-groups = ["dev", "docs", "jupyter"], so uv sync installs all three out of the box; you do not need to pass --group flags for the standard contributor setup.
Standard contributor setup (recommended default):
uv sync
Include the optional quantization group (bitsandbytes, optimum-quanto):
uv sync --all-groups
You can also add individual groups with uv sync --group <name>, or install without optional groups using uv sync --no-default-groups.
Requires Python 3.10 or higher.
Testing¶
If adding a feature, please add unit tests for it. If you need a model, please use one of the ones
that are cached by GitHub Actions (so that it runs quickly in CI). These are gpt2,
attn-only-1l, attn-only-2l, attn-only-3l, attn-only-4l, tiny-stories-1M. Note that gpt2 is
quite slow (as we only have CPU actions), so the smaller models like attn-only-1l and
tiny-stories-1M are preferred if possible.
Running the tests¶
Unit tests only via
make unit-test
Acceptance tests only via
make acceptance-test
Docstring tests only via
make docstring-test
Notebook tests only via
make notebook-test
Run all of the test suites above via
make test
Formatting¶
This project uses pycln, isort and black for formatting. Pull requests are checked in GitHub
Actions.
Format all files via
make format
Only check the formatting via
make check-format
Note that black line length is set to 100 in pyproject.toml (instead of the default 88).
Documentation¶
Please make sure to add thorough documentation for any features you add. You should do this directly
in the docstring, and this will then automatically generate the API docs when merged into main.
They will also be automatically checked with pytest (via
doctest).
If you want to view your documentation changes, run uv run docs-hot-reload. This will give you
hot-reloading docs (they change in real time as you edit docstrings).
For documentation generation to work, the docs dependency group must be installed. A plain uv sync includes it by default; uv sync --group docs adds it explicitly.
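A docstring example can also be checked locally before pushing. The sketch below is an illustration under stated assumptions: `scale` and `count_doctest_failures` are hypothetical names, and it calls the standard library's doctest module directly, whereas the project itself runs these checks through pytest (make docstring-test).

```python
import doctest


def scale(values, factor):
    """Scale A List Of Numbers.

    Examples:
        >>> scale([1, 2, 3], 2)
        [2, 4, 6]
    """
    return [value * factor for value in values]


def count_doctest_failures(obj):
    """Run the doctest examples found in obj's docstring and return the number that failed."""
    finder = doctest.DocTestFinder()
    runner = doctest.DocTestRunner(verbose=False)
    # Give the examples a namespace that contains the object under test.
    for test in finder.find(obj, name=obj.__name__, globs={obj.__name__: obj}):
        runner.run(test)
    return runner.summarize(verbose=False).failed


if __name__ == "__main__":
    print(count_doctest_failures(scale))
```

A result of zero means every example in the docstring produced exactly the output written beneath it.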
Docstring Style Guide¶
We follow the Google Python Docstring Style for writing docstrings, with some added features from reStructuredText (reST).
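As a concrete illustration, here is a minimal sketch of a Google-style docstring on a small function. The function `normalize` is hypothetical and not part of the codebase; it only shows how the sections look when filled in.

```python
def normalize(values):
    """Normalize A List Of Numbers.

    Divides every value by the sum of all values, so that the result sums to one.

    Warning:
        The values must not sum to zero.

    Examples:
        >>> normalize([1, 1, 2])
        [0.25, 0.25, 0.5]

    Args:
        values:
            The numbers to normalize.

    Returns:
        A new list with each value divided by the total.

    Raises:
        ValueError: If the values sum to zero.
    """
    total = sum(values)
    if total == 0:
        raise ValueError("values must not sum to zero")
    return [value / total for value in values]
```

The example under Examples doubles as a test, since docstring examples are checked with doctest.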
Sections and Order¶
You should follow this order:
"""Title In Title Case.

A description of what the function/class does, including as much detail as is necessary to fully understand it.

Warning:
    Any warnings to the user (e.g. common pitfalls).

Examples:
    Include any examples here. They will be checked with doctest.

    >>> print(1 + 2)
    3

Args:
    param_without_type_signature:
        Each description should be indented once more.
    param_2:
        Another example parameter.

Returns:
    Returns description without type signature.

Raises:
    Information about the error it may raise (if any).
"""
Supported Sphinx Properties¶
References to Other Functions/Classes¶
You can reference other parts of the codebase using cross-referencing (noting that you can omit the full path if it is in the same file).
:mod:`transformer_lens` # Module (use :func: for a function)
:const:`transformer_lens.loading_from_pretrained.OFFICIAL_MODEL_NAMES`
:class:`transformer_lens.HookedTransformer`
:meth:`transformer_lens.HookedTransformer.from_pretrained`
:attr:`transformer_lens.HookedTransformer.cfg`
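Inside a docstring, these roles sit inline with the prose. The following is a small sketch only: `is_official_model` is a hypothetical helper, not a function in the codebase, and it takes the list of names as an argument so that it runs without TransformerLens installed.

```python
def is_official_model(name, official_names):
    """Check Whether A Model Name Is Officially Supported.

    In the real codebase the list of names would be
    :const:`transformer_lens.loading_from_pretrained.OFFICIAL_MODEL_NAMES`, and a
    supported name can be passed to
    :meth:`transformer_lens.HookedTransformer.from_pretrained`.

    Examples:
        >>> is_official_model("gpt2", ["gpt2", "attn-only-1l"])
        True

    Args:
        name:
            The model name to look up.
        official_names:
            The collection of supported model names.

    Returns:
        True if the name is in the collection, otherwise False.
    """
    return name in official_names
```

When Sphinx builds the API docs, each role becomes a link to the referenced object's page.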
Maths¶
You can use LaTeX, but note that because you are placing it in Python strings, each backslash (\)
must be doubled (i.e. \\). You can write LaTeX inline or in “display mode”.
:math:`(a + b)^2 = a^2 + 2ab + b^2`
.. math::
    :nowrap:

    \\begin{eqnarray}
        y    & = & ax^2 + bx + c \\\\
        f(x) & = & x^2 + 2xy + y^2
    \\end{eqnarray}
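The effect of the doubling rule can be checked on a real docstring. In this sketch the function `quadratic` is hypothetical; the final assertion shows that the doubled backslash written in the source reaches Sphinx as a single backslash.

```python
def quadratic(a, b, c, x):
    """Evaluate A Quadratic Polynomial.

    Computes :math:`y = ax^2 + bx + c`. The vertex lies at
    :math:`x = \\frac{-b}{2a}`, written with a doubled backslash.

    Examples:
        >>> quadratic(1, 2, 3, 2)
        11

    Args:
        a:
            Coefficient of the squared term.
        b:
            Coefficient of the linear term.
        c:
            Constant term.
        x:
            Point at which to evaluate the polynomial.

    Returns:
        The value of the polynomial at x.
    """
    return a * x**2 + b * x + c


# Python processes the escape before Sphinx reads the docstring,
# so the stored text contains a single backslash before "frac".
assert r"\frac{-b}{2a}" in quadratic.__doc__
```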
Markup¶
Italics - *text*
Bold - **text**
Code - ``code``
List items - * item
Numbered items - 1. Item
Quotes - indent one level
External links - `Link text <https://domain.invalid/>`_