Select a task to view its ELO; the default is the average across tasks. For a detailed description of the tasks, see the Transuasion Dataset.
Model | Training | ELO (AVG) |
---|---|---|
@article{singh2024measuring,
title={Measuring and Improving Persuasiveness of Large Language Models},
author={Somesh Singh and Yaman K Singla and Harini SI and Balaji Krishnamurthy},
year={2024},
journal={arXiv preprint arXiv:2410.02653}
}
Get in touch with us at behavior-in-the-wild@googlegroups.com
We thank Adobe for their generous sponsorship.
This website is adapted from Nerfies, licensed under a Creative Commons Attribution-ShareAlike 4.0 International License. We thank the LLaMA team for giving us access to their models, as well as open-source projects including Vicuna.