Adaptive Preference Aggregation

Classical Bradley-Terry model - humans have underlying score, which is a logit with some noise.

What they do:

solve circular agreement, use a tool from game theory - crowd preference/voting.
put N copies in candidate, each time sample, replace.
randomly
random nash equilibrium - choose 1/3.
very highly subjective/noisy samples.