2026  1

February  1

Interpreting and Steering LLM Agents for Social Simulations

February 15, 2026 · Me

2025  3

December  2

Sybil-Resilient Preference Aggregation for RLHF

December 20, 2025 · Me

Probing Circuit Robustness: How Syntactic Form Shapes Neural Circuit Activation in LLMs

December 15, 2025 · Me

September  1

Theorizing with LLMs

September 6, 2025 · Me