Is ChatGPT a General-Purpose Natural Language Processing Task Solver?

Chengwei Qin; Aston Zhang; Zhuosheng Zhang; Jiaao Chen; Michihiro Yasunaga; Diyi Yang

Released as an article.

2023

Abstract

Spurred by advancements in scale, large language models (LLMs) have demonstrated the ability to perform a variety of natural language processing (NLP) tasks zero-shot, i.e., without adaptation on downstream data. Recently, the debut of ChatGPT has drawn a great deal of attention from the NLP community because it can generate high-quality responses to human input and self-correct previous mistakes based on subsequent conversations. However, it is not yet known whether ChatGPT can serve as a generalist model that can perform many NLP tasks zero-shot. In this work, we empirically analyze the zero-shot learning ability of ChatGPT by evaluating it on 20 popular NLP datasets covering 7 representative task categories. With extensive empirical studies, we demonstrate both the effectiveness and limitations of the current version of ChatGPT. We find that ChatGPT performs well on many tasks favoring reasoning capabilities (e.g., arithmetic reasoning) while it still faces challenges when solving specific tasks such as sequence tagging. We additionally provide in-depth analysis through qualitative case studies.

Type article
Stage submitted
Date 2023-11-19
Version v3
Language en
arXiv 2302.06476v3

