
hetong.wang809[at]gmail.com
👋 Hi! I'm Hetong Wang.
My name is 王鹤童 in Chinese. Feel free to call me Hetong
or Erika
alternatively!
I play with data, structure, and model behavior concurrently to explore mechanistic interpretability. Recently, I have been particularly excited about the following questions:
- Sparsity: How is information clustered and dispersed among the neurons of models? How do different training paradigms steer this distribution, and how do the circuits formed between neurons contribute to the generalisation performance?
- Training Dynamics: How is knowledge acquired, and how does it evolve under varying data mixtures and learning curricula?
I believe that addressing these questions can lead to greater data and parameter efficiency while allowing us to reclaim more control from the black-box nature of neural networks.
About
I obtained my master in Artificial Intelligence at The Univerisity of Edinburgh, where I am fortunately advised by Prof. Edoardo Ponti and Prof. Pasquale Minervini. Before that, I obtained my bachelor in Computer Science at The University of Liverpool, with a research focus on Reinforcement Learning.
NEWS
May 15, 2024 | 🎉 Our paper Probing the Emergence of Cross-lingual Alignment during LLM Training is accepted at ACL 2024! |
---|---|
May 01, 2024 | 👩🏻🎓 I am actively looking for PhD position start by 25 spring/fall! |
Jan 12, 2024 | 🗣️ I gave a talk on AI self-alignment at THUNLP&OpenBMB Lab. Check the recording here (in Manderin Chinese). |
Dec 20, 2023 | 📍 I start my research internship on-site at Tsinghua University, Bejing. |