Aquileo | [RLlib] Fix DQN RLModule forward methods such that they can handle dict spaces by ArturNiederfahrenhorst · Pull Request #60451 · ray-project/ray

ArturNiederfahrenhorst · 2026-01-23T11:01:28Z

Description

We don't natively build encoders for dict spaces and so we don't account for them in the forward method of the DQN rlm.
This is an issue because users may still want to use encoder configs for dictionaries or they may want to override DQNRLModule.build_encoder etc.

This PR makes a fix and introduces testing for different types of forward passes, observations spaces and configurations for the DQN RL Module.

gemini-code-assist

Code Review

This pull request addresses an issue in the DQN RLModule where dict spaces were not properly handled in the forward method. The changes include modifications to the forward_train method in default_dqn_torch_rl_module.py to correctly process dict observation spaces and the addition of a new test file test_dqn_rl_module.py to verify the fix and ensure compatibility with different observation spaces and configurations.

pseudo-rnd-thoughts

Overall looks good, minor changes and a question about working with Discrete observations

pseudo-rnd-thoughts · 2026-01-23T12:58:52Z

+class DictFlattenEncoder(nn.Module):
+    def __init__(self, obs_space, output_dim=64):
+        super().__init__()
+        total_dim = sum(


For nested composite spaces this won't work, Gymnasium has flatdim for this but I'm worried that we don't use Gymnasium's flatten function so the resulting spaces might be mismatched.
Therefore, this is more a note for the future

Yea. DictFlattenEncoder is just for testing here. Flattening all elements of the dict can be counterproductive if these are, for example, images. I think for any more involved obs space we should not just autogenerate some model under the hood.

I agree that for models that are too complex, we shouldn't allow autogeneration

Makes sense. Let's not do it for this test file though.

pseudo-rnd-thoughts · 2026-01-23T12:59:46Z

+    def forward(self, inputs):
+        obs = inputs[Columns.OBS]
+        flat_obs = torch.cat(
+            [obs[k].reshape(obs[k].shape[0], -1) for k in sorted(obs.keys())],


Does this work for discrete inputs?

Why not? Should be 1 hot encoded, mhh?

Yes, thats a good point that rationally for discrete inputs they should be one-hotted and we wouldn't get this problem but what happens if a discrete is used? Can we provide a helpful error?

Signed-off-by: Artur Niederfahrenhorst <attaismyname@googlemail.com>

pseudo-rnd-thoughts

LGTM

…ct spaces (ray-project#60451) ## Description We don't natively build encoders for dict spaces and so we don't account for them in the forward method of the DQN rlm. This is an issue because users may still want to use encoder configs for dictionaries or they may want to override DQNRLModule.build_encoder etc. This PR makes a fix and introduces testing for different types of forward passes, observations spaces and configurations for the DQN RL Module. --------- Signed-off-by: Artur Niederfahrenhorst <attaismyname@googlemail.com> Signed-off-by: Adel Nour <ans9868@nyu.edu>

initial

4cc5578

ArturNiederfahrenhorst requested a review from a team as a code owner January 23, 2026 11:01

Reintroduce state handling properly

3acb090

gemini-code-assist Bot reviewed Jan 23, 2026

View reviewed changes

ArturNiederfahrenhorst added 2 commits January 23, 2026 12:04

use torch.cat instead of concat

8f61e4d

test dqn rl module to BUILD

6e93976

pseudo-rnd-thoughts requested changes Jan 23, 2026

View reviewed changes

ray-gardener Bot added the rllib RLlib related issues label Jan 23, 2026

Mark's comments

92f46f9

Signed-off-by: Artur Niederfahrenhorst <attaismyname@googlemail.com>

pseudo-rnd-thoughts approved these changes Jan 26, 2026

View reviewed changes

pseudo-rnd-thoughts added rllib-models An issue related to RLlib (default or custom) Models. go add ONLY when ready to merge, run all tests labels Jan 26, 2026

ArturNiederfahrenhorst merged commit d406677 into ray-project:master Jan 27, 2026
7 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[RLlib] Fix DQN RLModule forward methods such that they can handle dict spaces#60451

[RLlib] Fix DQN RLModule forward methods such that they can handle dict spaces#60451
ArturNiederfahrenhorst merged 5 commits into
ray-project:masterfrom
ArturNiederfahrenhorst:fixdqntorchrlm

ArturNiederfahrenhorst commented Jan 23, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

pseudo-rnd-thoughts left a comment •
edited

Loading

Uh oh!

pseudo-rnd-thoughts Jan 23, 2026

Uh oh!

ArturNiederfahrenhorst Jan 25, 2026 •
edited

Loading

Uh oh!

pseudo-rnd-thoughts Jan 26, 2026

Uh oh!

ArturNiederfahrenhorst Jan 27, 2026

Uh oh!

pseudo-rnd-thoughts Jan 23, 2026

Uh oh!

ArturNiederfahrenhorst Jan 25, 2026

Uh oh!

pseudo-rnd-thoughts Jan 26, 2026

Uh oh!

Uh oh!

Uh oh!

pseudo-rnd-thoughts left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

ArturNiederfahrenhorst commented Jan 23, 2026

Description

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

pseudo-rnd-thoughts left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

pseudo-rnd-thoughts Jan 23, 2026

Choose a reason for hiding this comment

Uh oh!

ArturNiederfahrenhorst Jan 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

pseudo-rnd-thoughts Jan 26, 2026

Choose a reason for hiding this comment

Uh oh!

ArturNiederfahrenhorst Jan 27, 2026

Choose a reason for hiding this comment

Uh oh!

pseudo-rnd-thoughts Jan 23, 2026

Choose a reason for hiding this comment

Uh oh!

ArturNiederfahrenhorst Jan 25, 2026

Choose a reason for hiding this comment

Uh oh!

pseudo-rnd-thoughts Jan 26, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

pseudo-rnd-thoughts left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pseudo-rnd-thoughts left a comment •
edited

Loading

ArturNiederfahrenhorst Jan 25, 2026 •
edited

Loading