-
Sparks of Cooperative Reasoning: LLMs as Strategic Hanabi Agents
Mahesh Ramesh, Kaousheik Jayakumar, Aswinkumar Ramkumar, Pavan Thodima, Aniket Rege, Emmanouil-Vasileios Vlatakis-Gkaragkounis
ICML 2026
Introduced a multi-turn benchmark to evaluate state-tracking and cooperation in frontier models. RL-trained a 4B model on curated data, outperforming all non-reasoning baselines. Released 1500+ (~90K data points) game trajectories for SFT and move-level ratings for RLVR.
-
CuRe: Cultural Gaps in the Long Tail of Text-to-Image Models
Aniket Rege, Zinnia Nie, Mahesh Ramesh, Unmesh Raskar, Zhuoran Yu, Aditya Kusupati, Yong Jae Lee, Ramya Korlakai Vinayak
ICCV 2025 · CVPR DemoDiv Workshop (Oral)
Curated 300 artifacts across 6 cultural axes and 64 countries. Developed Marginal Information Attribution metrics achieving 2x Spearman correlation improvement over prior scorers. Conducted 2,700 artifact-level surveys with culturally-aligned annotators.
-
MABViT: Modified Attention Block Enhances Vision Transformers
Mahesh Ramesh, Aswinkumar Ramkumar
Deployable AI Workshop, AAAI 2024 (Oral)
Integrated Gated Linear Units into the attention module for parallel MLP-attention computation. Achieved 0.6% accuracy gain over ViT-S/16, surpassed ViT-B/16 with half the parameters, and 17% faster training convergence.