Bandit Optimal Transport

Lorenzo Croissant

Pré-Publication, Document De Travail Année : 2025

Bandit Optimal Transport

(1, 2, 3)

1
2
3

Lorenzo Croissant

Fonction : Auteur
PersonId : 1504562

Centre de Recherche en Économie et Statistique

IA coopérative : équité, vie privée, incitations

Ecole Nationale de la Statistique et de l'Analyse Economique

Résumé

Despite the impressive progress in statistical Optimal Transport (OT) in recent years, there has been little interest in the study of the \emph{sequential learning} of OT. Surprisingly so, as this problem is both practically motivated and a challenging extension of existing settings such as linear bandits. This article considers (for the first time) the stochastic bandit problem of learning to solve generic Kantorovich and entropic OT problems from repeated interactions when the marginals are known but the cost is unknown. We provide $\tilde{\mathcal O}(\sqrt{T})$ regret algorithms for both problems by extending linear bandits on Hilbert spaces. These results provide a reduction to infinite-dimensional linear bandits. To deal with the dimension, we provide a method to exploit the intrinsic regularity of the cost to learn, yielding corresponding regret bounds which interpolate between $\tilde{\mathcal O}(\sqrt{T})$ and $\tilde{\mathcal O}(T)$.

Mots clés

Bandit Algorithms Optimal transport

Domaines

Machine Learning [stat.ML]

Fichier principal

Arxiv.pdf (477)

Origine	Fichiers produits par l'(les) auteur(s)

Lorenzo Croissant : Connectez-vous pour contacter le contributeur

https://hal.science/hal-04938170

Soumis le : lundi 10 février 2025-13:23:47

Dernière modification le : mercredi 12 février 2025-03:26:37

Dates et versions

hal-04938170 , version 1 (10-02-2025)

Licence

Paternité - Partage selon les Conditions Initiales

Identifiants

HAL Id : hal-04938170 , version 1

Citer

Lorenzo Croissant. Bandit Optimal Transport. 2025. ⟨hal-04938170⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

X GENES CNRS INRIA ENSAE CREST ENSAI INRIA2 X-CREST IP_PARIS

0 Consultations

0 Téléchargements

Bandit Optimal Transport

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Partager