Aquileo | Paper page - FlowBender: Feedback-Aware Training for Self-Correcting Conditional Flows
Papers
arxiv:2606.20404

FlowBender: Feedback-Aware Training for Self-Correcting Conditional Flows

Published on Jun 18
· Submitted by
Daniel Gilo
on Jun 19
Authors:
,
,
,

Abstract

FlowBender is a closed-loop framework that addresses constraint satisfaction in diffusion and flow models by training networks to correct alignment errors using inference-time feedback, outperforming traditional supervised and guidance-based approaches across multiple tasks.

Conditional diffusion and flow models routinely fail to satisfy the very constraints that define their task. For instance, a depth-conditioned model often produces images whose re-extracted depth disagrees with the input, even though the forward operator--the depth predictor defining the constraint--is available during both training and inference. Existing approaches generally fall into two categories: supervised models that treat the conditioning signal as a static cue and ignore alignment information at inference, and guidance-based methods that consult it through hand-tuned linear updates, typically trading fidelity to the condition against the plausibility of the generated sample. We argue that the fundamental gap in both paradigms is that the model is never trained to utilize its own alignment error. We introduce FlowBender, a closed-loop framework that treats this error as a first-class input, training the network to learn a correction policy conditioned on inference-time feedback. At each step, an unguided look-ahead pass estimates the clean signal, a task-specific deviation is computed via the forward operator, and a refinement pass consumes this signal to produce a corrected velocity. We propose several variants of FlowBender, including a gradient-based formulation for differentiable operators and a zero-order variant for non-differentiable settings such as JPEG compression. For efficient sampling, we introduce a prior-step shortcut that enables closed-loop correction at a minimal additional computational cost. Across image-to-image translation, restoration, and 3D mesh texturing, FlowBender consistently outperforms standard supervised baselines, alignment-loss-augmented training, and state-of-the-art inference-time guidance, improving fidelity and plausibility simultaneously rather than trading them against each other. Project page: https://flow-bender.github.io/

Community

Paper submitter

FlowBender enables closed-loop flow matching by learning to correct its own errors, surpassing standard supervised and guidance methods in 2D and 3D conditional generation tasks.
Project page: https://flow-bender.github.io/

This is a clever take on the conditioning problem. It always feels like a missed opportunity when models ignore the forward operator during training, so teaching the network to treat alignment error as an explicit input makes a lot of sense.

I am curious about the overhead here. Since the method performs an unguided look-ahead pass followed by a refinement pass, how does the inference speed compare to standard guidance methods when you aren't using the prior-step shortcut?

I made a podcast on it with ResearchPod, it makes it easy to get the key concepts on the go:
https://researchpod.app/episode/406c70d8-e8b3-4e2e-a51a-e0d1adc7616e

Sign up or log in to comment

Get this paper in your agent:

hf papers read 2606.20404
Don't have the latest CLI?
curl -LsSf https://hf.co/cli/install.sh | bash

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2606.20404 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2606.20404 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2606.20404 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.