Evaluating GPT-4’s Ability to Identify Additional Context (Student Abstract)

Victoria Armstrong , Prof. Christian Muise

Abstract

We evaluate the strengths and weaknesses of the state-of-the-art in LLMs when identifying additional information supplied by users in dialogue agents. While GPT-4 can accurately identify additional information in some sentences, it fails to identify additional context more than 22% of the time. By understanding these limitations, we can harness LLMs within the scope of their abilities and compare to other approaches.

Publication

Canadian Conference on Electrical and Computer Engineering (CCECE): Student Poster

Date

August, 2024

Links

PDF