Work

Bio

Current 

Research Fellow @ Center on Long-term Risk

Researcher @ Cadenza Labs


Previous

Visiting researcher @ Constellation Institute

Red-teamer @ Trajectory Labs

Research Fellow @ Pivotal Research

Research Fellow @ Apart Research

Organising & Research Management @ AI Safety Vietnam



My Resume

Research Interests

I want to know and meet models. Current interests include functional introspection, preferences, identity, and selfhood.


In worlds where alignment is hard, I also care about reliable monitoring and oversight.

Papers

Improving Latent Introspection Elicits LLM Self-Reports — TBD.


Activation Space Monitors Can Amplify LLM Oversight — TBD.


Google Scholar

Contact