Mason Wang

Adaptive Preference Aggregation

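Classical Bradley-Terry model - each option has an underlying scalar score (a logit), and a human prefers option i over option j with probability sigmoid(s_i - s_j). Equivalently, the human compares the two scores after each is perturbed by some noise.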
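
A minimal sketch of the Bradley-Terry preference probability, to make the note above concrete. The function names (bt_prob, sample_preference) and the example scores are hypothetical and chosen only for illustration; nothing here is taken from the paper.

```python
import math
import random


def bt_prob(score_i, score_j):
    """Probability that option i is preferred over option j.

    Scores are on the logit scale, so the preference probability is the
    logistic sigmoid of the score difference.
    """
    return 1.0 / (1.0 + math.exp(-(score_i - score_j)))


def sample_preference(score_i, score_j, rng):
    """Draw one noisy human comparison: True if i is preferred over j."""
    return rng.random() < bt_prob(score_i, score_j)


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed, for reproducibility only
    p = bt_prob(1.0, 0.0)   # approximately 0.731
    wins = sum(sample_preference(1.0, 0.0, rng) for _ in range(10_000))
    print(p, wins / 10_000)
```

Equal scores give a probability of 0.5, and only the difference between the two scores matters, so adding the same constant to every score leaves all preference probabilities unchanged.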

What they do: