We are pleased to announce an upcoming talk by Swarnadeep Bhar from IRIT, who will be presenting on October 23rd at 3 pm.
Abstract:
With the release of ChatGPT, we have seen significant interest in language models. While the central theory behind these models has been around for quite some time, large language models show that scaling with large amounts of data and aligning the models with human responses can unlock jumps in performance previously unseen. They achieve impressive performance on a range of benchmarks that were previously thought “impossible” for machine learning techniques to solve. However, a range of new problems arises: “hallucinated” responses and the tendency of these models to give affirmative responses to any instruction limit their blind deployment in critical scenarios. In this talk, we’ll explore the core principles behind these models, their impressive capabilities, and the challenges they pose, such as hallucinations and overly affirmative responses. We’ll also discuss key considerations to keep in mind when deploying these models for personal or critical use cases.