We're afraid language models aren't modeling ambiguity

[Submitted on 27 Apr 2023]

Download PDF

Abstract: Ambiguity is an intrinsic feature of natural language. Managing ambiguity is
a key part of human language understanding, allowing us to anticipate
misunderstanding as communicators and revise our interpretations as listeners.
As language models (LMs) are increasingly employed as dialogue interfaces and
writing aids, handling ambiguous language is critical to their success. We
characterize ambiguity in a sentence by its effect on entailment relations with
another sentence, and collect AmbiEnt, a linguist-annotated benchmark of 1,645
examples with diverse kinds of ambiguity. We design a suite of tests based on
AmbiEnt, presenting the first evaluation of pretrained LMs to recognize
ambiguity and disentangle possible meanings. We find that the task remains
extremely challenging, including for the recent GPT-4, whose generated
disambiguations are considered correct only 32% of the time in human
evaluation, compared to 90% for disambiguations in our dataset. Finally, to
illustrate the value of ambiguity-sensitive tools, we show that a multilabel
NLI model can flag political claims in the wild that are misleading due to
ambiguity. We encourage the field to rediscover the importance of ambiguity for
NLP.

Submission history

From: Alisa Liu [view email]

[v1]
Thu, 27 Apr 2023 17:57:58 UTC (7,649 KB)

Read More
Zonia Klemp

Latest

BLXCKIE Previews New Song “Uphi Usomnyama”

MusicBLXCKIE Previews New Song “Uphi Usomnyama.” The SA...

Newsletter

Don't miss

BLXCKIE Previews New Song “Uphi Usomnyama”

MusicBLXCKIE Previews New Song “Uphi Usomnyama.” The SA...

How this Brisbane band remains strangely relevant, 30 years on

Music It’s a bit like naming a bridge after...

WD sees sustainability as key business driver in an ‘AI economy’

Hard drive company WD promoted long-term operations and sustainability executive Jackie Jung to become its first chief sustainability officer in February, as it steps up sales to companies building AI data centers. Her vision: Turn sustainability into a “brand” for WD, a strategy that reduces risk for the $6 billion company (formerly known as Western

5 Business Ideas Worth Starting in 2026

If there is one thing Nigerians understand well, it is how to spot opportunity inside hardship. In 2026, that mindset will matter more than ever. The economy is tough, competition is rising, and many people are looking for smarter ways to earn, build, and survive. But even in a difficult environment, some businesses still stand

Getting a business loan now comes with a frequent flyer upside

Australian fintech Prospa has partnered with Qantas Business Rewards, letting eligible SMEs earn up to 500,000 points per loan. What’s happening: Australian fintech lender Prospa has partnered with Qantas Business Rewards to allow eligible small and medium business owners to earn up to 500,000 Qantas Points per loan when taking out a Prospa Small Business