deep-learning 5
- Noob's guide to mechanistic interpretability
- Removing the Refusal Direction: How I Turned Param-1 Into an Uncensored Model Without Fine-Tuning
- Executing Toxicity: Mechanistic Localization of Toxic Behavior in a Fine-Tuned Transformer
- Building a Semantic Search Engine for 1M+ arXiv Papers (<10ms Query Time)
- Perceptron and MLPs