Claire Boyer (IMO, Université Paris Saclay)

by Antoine Marchina · Published December 13, 2024 · Updated December 9, 2024

Single location regression and attention models

When

December 13, 2024

9:30 - 10:30 a.m.

Where

Salle du Conseil, Espace Turing
45 rue des Saints-Pères, Paris, 75006

Event type

Statistics Seminar

Single location regression and attention models

We will begin by defining a new regression task that is easily illustrated in a natural language processing (NLP) context, for example sentiment analysis of texts. We will then propose a predictor for this task that can be interpreted as a very simplified Transformer architecture (the T in ChatGPT). We will discuss some of its asymptotic statistical properties and show that the optimal parameters of the problem can be learned by projected gradient descent, despite the non-convexity of the problem.
