Publications

Fast and Large-Scale Unbalanced Optimal Transport via its Semi-Dual and Adaptive Gradient Methods

COLT 2026 ยท 2026 F. Genans ๐Ÿ“„ Paper

In Unbalanced Optimal Transport (UOT), we investigate the entropically regularized semi-dual objective. We prove the convergence rates of SGD and ASGD in both stochastic and semi-discrete settings. For the discrete case, we introduce a smoothness-adaptive Nesterov accelerated gradient descent scheme. We establish its global convergence rate and demonstrate an enhanced local convergence of O(log(1/ฮด)/โˆšฮต) to achieve ฮด-accuracy.

Figure for Fast and Large-Scale Unbalanced Optimal Transport via its Semi-Dual and Adaptive Gradient Methods

Decreasing Entropic Regularization Averaged Gradient for Semi-Discrete Optimal Transport

NeurIPS 2025 ยท 2025 F. Genans, A. Godichon-Baggioni, F.-X. Vialard, O. Wintenberger ๐Ÿ“„ Paper

We introduce DRAG, a stochastic algorithm for semi-discrete Optimal Transport that decreases entropic regularization during training. This yields unbiased estimates and faster convergence than fixed-regularization methods, with both theory and experiments confirming its efficiency.

Figure for Decreasing Entropic Regularization Averaged Gradient for Semi-Discrete Optimal Transport

Stochastic Optimization in Semi-Discrete Optimal Transport: Convergence Analysis and Minimax Rate

NeurIPS 2025 โ€” Spotlight ยท 2025 F. Genans, A. Godichon-Baggioni, F.-X. Vialard, O. Wintenberger ๐Ÿ“„ Paper

We prove that Stochastic Gradient Descent can efficiently approximate the OT map in the semi-discrete setting, even in an online fashion, establishing the first minimax convergence guarantees for a broad class of cost functions and non-compact measures in this setting.

Figure for Stochastic Optimization in Semi-Discrete Optimal Transport: Convergence Analysis and Minimax Rate
Preprints

Increasing Missingness to Reduce Bias: Richardson-SGD with Missing Data

Preprint โ€” under review at NeurIPS 2026 ยท 2026 F. Genans, E. Scornet ๐Ÿ“„ Paper

A Richardson-extrapolation-based SGD scheme for learning with missing data, which paradoxically increases missingness during training to reduce the bias of the final estimator.

Figure for Increasing Missingness to Reduce Bias: Richardson-SGD with Missing Data

Geometry-Aware Optimal Transport: Fast Intrinsic Dimension and Wasserstein Distance Estimation

Preprint โ€” under review at NeurIPS 2026 ยท 2026 F. Genans, O. Wintenberger ๐Ÿ“„ Paper

We introduce a fast estimator for the one-sample Wasserstein discretization error that avoids the need for an OT solver. Building on this, we propose a resolution-dependent intrinsic dimension estimator for probability measures. Finally, we derive a new OT Richardson extrapolation estimator for the OT distance utilizing this intrinsic dimension.

Figure for Geometry-Aware Optimal Transport: Fast Intrinsic Dimension and Wasserstein Distance Estimation