arxiv:2309.11925

Scaling up COMETKIWI: Unbabel-IST 2023 Submission for the Quality Estimation Shared Task

Published on Sep 21, 2023

Upvote

Authors:

Ricardo Rei ,

Nuno M. Guerreiro ,

José G. C. de Souza ,

Abstract

Multilingual approaches built on COMETKIWI-22 achieve state-of-the-art performance in sentence- and word-level quality estimation and fine-grained error span detection at WMT 2023 QE shared task.

Generated by Qwen/Qwen2.5-Coder-32B-Instruct

We present the joint contribution of Unbabel and Instituto Superior T\'ecnico to the WMT 2023 Shared Task on Quality Estimation (QE). Our team participated on all tasks: sentence- and word-level quality prediction (task 1) and fine-grained error span detection (task 2). For all tasks, we build on the COMETKIWI-22 model (Rei et al., 2022b). Our multilingual approaches are ranked first for all tasks, reaching state-of-the-art performance for quality estimation at word-, span- and sentence-level granularity. Compared to the previous state-of-the-art COMETKIWI-22, we show large improvements in correlation with human judgements (up to 10 Spearman points). Moreover, we surpass the second-best multilingual submission to the shared-task with up to 3.8 absolute points.

View arXiv page View PDF Add to collection

Community

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Get this paper in your agent:

hf papers read 2309.11925

Don't have the latest CLI?

curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 2

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2309.11925 in a dataset README.md to link it from this page.

Scaling up COMETKIWI: Unbabel-IST 2023 Submission for the Quality Estimation Shared Task

Abstract

Community

Models citing this paper 2

Datasets citing this paper 0

Spaces citing this paper 3

Collections including this paper 2