attribution-graphs

Here are 3 public repositories matching this topic...

LLM-Interp / CLT-Forge

A Mechanistic Interpretability Toolkit for Cross-Layer Transcoder Training and Attribution-Graph Visualization

transcoder visual-interface mechanistic-interpretability ai-interpretability attribution-graphs auto-interpretability cross-layer-transcoder transformer-circuits

Updated Apr 16, 2026
Python

peppinob-ol / attribution-graph-probing

Star

Automates attribution-graph analysis via probe prompting: circuit-trace a prompt, auto-generate concept probes, profile feature activations, cluster supernodes.

graph-analysis sparse-autoencoders mechanistic-interpretability llm-interpretability research-tooling circuit-tracing attribution-graphs probe-prompting prompt-probing neuronpedia feature-activation supernodes cross-layer-transcoder

Updated Jun 5, 2026
Jupyter Notebook

middesurya / daily-webapp-2026-07-01-circuittracelab

Star

Interactive lab for attribution graphs & circuit tracing — replace polysemantic neurons with monosemantic features, trace causal paths, and patch a real two-hop reasoning circuit. MIT Tech Review 2026 Breakthrough tech.

transcoder interpretability mechanistic-interpretability claude-code circuit-tracing attribution-graphs ai-built daily-webapp

Updated Jul 1, 2026
HTML

Improve this page

Add a description, image, and links to the attribution-graphs topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the attribution-graphs topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly