OmniSapiens: A Foundation Model for Social Behavior Processing via Heterogeneity-Aware Relative Policy Optimization

Ao Qu; Chanakya Ekbote; Ehsan Hoque; Erik Cambria; Gianmarco Mengaldo; Jingyao Wu; Keane Ong; Luwei Xiao; Paul Pu Liang; Rui Mao

arxiv: 2602.10635 · v3 · pith:KEIFX3THnew · submitted 2026-02-11 · 💻 cs.AI · cs.LG

OmniSapiens: A Foundation Model for Social Behavior Processing via Heterogeneity-Aware Relative Policy Optimization

Keane Ong , Sabri Boughorbel , Luwei Xiao , Chanakya Ekbote , Wei Dai , Ao Qu , Jingyao Wu , Rui Mao

show 4 more authors

Ehsan Hoque Erik Cambria Gianmarco Mengaldo Paul Pu Liang

This is my paper

classification 💻 cs.AI cs.LG

keywords behavioralacrossbehaviorlearningmodelpolicysocialbest

0 comments

read the original abstract

Socially intelligent AI systems must reason across diverse human behavioral tasks and generalize to new social contexts. However, behavioral data is inherently heterogeneous, comprising diverse modalities and prediction targets that produce uneven training signals across samples, creating imbalanced learning dynamics that challenge existing AI models. To address this, we develop Omnisapiens-7B 2.0, a foundation model for social behavior processing that explicitly addresses learning from heterogeneous behavioral data. This is enabled through Heterogeneity-Aware Relative Policy Optimization, a new RL method that rebalances learning signals across samples by approximating each sample's contribution to the policy update and using these estimates to drive geometrically centered, inertially smoothed advantage modulation for stable training. Omnisapiens-7B 2.0 achieves the best and most consistent performance across 10 behavioral tasks, while also attaining the best performance on all five held-out benchmarks, with gains of up to +12.02% and +9.37% respectively. Furthermore, it demonstrates more consistent and interpretable reasoning traces, supporting reliable real-world behavioral applications. Our model is available at https://github.com/MIT-MI/human_behavior_atlas.

This paper has not been read by Pith yet.

OmniSapiens: A Foundation Model for Social Behavior Processing via Heterogeneity-Aware Relative Policy Optimization

discussion (0)