Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Manya Wadhwa

Using Natural Language Explanations to Rescale Human Judgments

May 24, 2023

Manya Wadhwa, Jifan Chen, Junyi Jessy Li, Greg Durrett

Figure 1 for Using Natural Language Explanations to Rescale Human Judgments

Figure 2 for Using Natural Language Explanations to Rescale Human Judgments

Figure 3 for Using Natural Language Explanations to Rescale Human Judgments

Figure 4 for Using Natural Language Explanations to Rescale Human Judgments

The rise of large language models (LLMs) has brought a critical need for high-quality human-labeled data, particularly for processes like human feedback and evaluation. A common practice is to label data via consensus annotation over the judgments of multiple crowdworkers. However, different annotators may have different interpretations of labeling schemes unless given extensive training, and for subjective NLP tasks, even trained expert annotators can diverge heavily. We show that these nuances can be captured by high quality natural language explanations, and propose a method to rescale ordinal annotation in the presence of disagreement using LLMs. Specifically, we feed Likert ratings and corresponding natural language explanations into an LLM and prompt it to produce a numeric score. This score should reflect the underlying assessment of the example by the annotator. The presence of explanations allows the LLM to homogenize ratings across annotators in spite of scale usage differences. We explore our technique in the context of a document-grounded question answering task on which large language models achieve near-human performance. Among questions where annotators identify incompleteness in the answers, our rescaling improves correlation between nearly all annotator pairs, improving pairwise correlation on these examples by an average of 0.2 Kendall's tau.

* Data available at https://github.com/ManyaWadhwa/explanation_based_rescaling

Via

Access Paper or Ask Questions

Group Affect Prediction Using Multimodal Distributions

Mar 12, 2018

Saqib Shamsi, Bhanu Pratap Singh Rawat, Manya Wadhwa

Figure 1 for Group Affect Prediction Using Multimodal Distributions

Figure 2 for Group Affect Prediction Using Multimodal Distributions

Figure 3 for Group Affect Prediction Using Multimodal Distributions

Figure 4 for Group Affect Prediction Using Multimodal Distributions

We describe our approach towards building an efficient predictive model to detect emotions for a group of people in an image. We have proposed that training a Convolutional Neural Network (CNN) model on the emotion heatmaps extracted from the image, outperforms a CNN model trained entirely on the raw images. The comparison of the models have been done on a recently published dataset of Emotion Recognition in the Wild (EmotiW) challenge, 2017. The proposed method achieved validation accuracy of 55.23% which is 2.44% above the baseline accuracy, provided by the EmotiW organizers.

* This research paper has been accepted at Workshop on Computer Vision for Active and Assisted Living, WACV 2018

Via

Access Paper or Ask Questions