Mar 10, 2025

Latent Knowledge Analysis via Feature-Based Causal Tracing

Letlotlo

Details

Details

Arrow
Arrow
Arrow

Summary

This project explores how factual knowledge is stored in large language models using Goodfire’s Ember API. By identifying and manipulating internal features related to specific facts, it shows how facts are encoded and how model behavior changes when those features are amplified or erased.

Cite this work:

@misc {

title={

Latent Knowledge Analysis via Feature-Based Causal Tracing

},

author={

Letlotlo

},

date={

3/10/25

},

organization={Apart Research},

note={Research submission to the research sprint hosted by Apart.},

howpublished={https://apartresearch.com}

}

Review

Review

Arrow
Arrow
Arrow

Reviewer's Comments

Reviewer's Comments

Arrow
Arrow
Arrow

No reviews are available yet

This work was done during one weekend by research workshop participants and does not represent the work of Apart Research.
This work was done during one weekend by research workshop participants and does not represent the work of Apart Research.