TomL's picture

4

TomL

Aric

AI & ML interests

None yet

Recent Activity

new activity 14 days ago

google/gemma-scope-9b-pt-res:Removing SAEs with LR != 7e-5

updated a model 14 days ago

google/gemma-scope-9b-pt-res

new activity 14 days ago

google/gemma-scope-9b-pt-res:Delete layer_20/width_16k/average_l0_427

View all activity

Organizations

None yet

Aric's activity

New activity in google/gemma-scope-9b-pt-res 14 days ago

Removing SAEs with LR != 7e-5

#7 opened 14 days ago by

updated a model 14 days ago

google/gemma-scope-9b-pt-res

Updated 14 days ago • 4

New activity in google/gemma-scope-9b-pt-res 14 days ago

Delete layer_20/width_16k/average_l0_427

#6 opened 14 days ago by

New activity in google/gemma-scope-9b-pt-res 4 months ago

add experimental embedding SAEs

#4 opened 4 months ago by

New activity in google/gemma-scope-2b-pt-res 4 months ago

add experimental embedding SAEs

#7 opened 4 months ago by

authored 2 papers 6 months ago

Jumping Ahead: Improving Reconstruction Fidelity with JumpReLU Sparse Autoencoders

Paper • 2407.14435 • Published Jul 19, 2024 • 7

Progress measures for grokking via mechanistic interpretability

Paper • 2301.05217 • Published Jan 12, 2023

authored a paper over 1 year ago

Does Circuit Analysis Interpretability Scale? Evidence from Multiple Choice Capabilities in Chinchilla

Paper • 2307.09458 • Published Jul 18, 2023 • 10