Scaling Monosemanticity: Extracting Interpretable Features from Claude 3 Sonnet May 21, 2024 by Comments