Aquileo | [Dashboard] Add Logical Memory Usage panel by yuhuan130 · Pull Request #60772 · ray-project/ray

yuhuan130 · 2026-02-05T07:59:29Z

Description

This PR adds a Logical Memory Usage panel to the Ray Default Dashboard.
It's positioned in the "Ray Resources by Node" section, right after the "Logical GPUs Usage" panel.

Related issues

Fixes #60715

Screenshot:

Testing:
✅ Tested locally with Prometheus + Grafana
✅ Dashboard generates correctly with the new panel
✅ Metrics display properly in Grafana

Signed-off-by: “Alex <alexchien130@gmail.com>

gemini-code-assist

Code Review

This pull request introduces a 'Logical Memory Usage' panel to the Ray dashboard. However, the implementation uses metrics for physical memory (ray_node_mem_used, ray_node_mem_total) instead of logical memory. This is inconsistent with the panel's title and the implementation of other logical resource panels like CPU and GPU. My review provides a correction to use the ray_resources metric with Name="memory" to accurately reflect logical memory usage and maintain consistency across the dashboard.

gemini-code-assist · 2026-02-05T08:00:33Z

+        targets=[
+            Target(
+                expr='sum(ray_node_mem_used{{instance=~"$Instance",{global_filters}}}) by (instance)',
+                legend="Memory Used: {{instance}}",
+            ),
+            Target(
+                expr='sum(ray_node_mem_total{{instance=~"$Instance",{global_filters}}})',
+                legend="MAX",
+            ),
+        ],


The metrics used here (ray_node_mem_used and ray_node_mem_total) represent physical node memory, not the logical memory allocated to tasks and actors. This is inconsistent with the panel's title ('Logical Memory Usage') and how other logical resource panels (CPU, GPU) are implemented, which use the ray_resources metric.

To accurately reflect logical memory usage and ensure consistency, you should use ray_resources{Name="memory"}. This will align the panel with the 'Logical CPUs Usage' and 'Logical GPUs Usage' panels.

For further consistency, you might also consider adding a 'MAX + PENDING' target, similar to the CPU and GPU panels, to show memory that will become available from pending nodes.

Suggested change

targets=[
Target(
expr='sum(ray_node_mem_used{{instance=~"$Instance",{global_filters}}}) by (instance)',
legend="Memory Used: {{instance}}",
),
Target(
expr='sum(ray_node_mem_total{{instance=~"$Instance",{global_filters}}})',
legend="MAX",
),
],
targets=[
Target(
expr='sum(ray_resources{{Name="memory",State="USED",instance=~"$Instance",{global_filters}}}) by (instance)',
legend="Memory Usage: {{instance}}",
),
Target(
expr='sum(ray_resources{{Name="memory",instance=~"$Instance",{global_filters}}})',
legend="MAX",
),
],

corrected in 7bb4c73

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

Signed-off-by: “Alex <alexchien130@gmail.com>

yuhuan130 · 2026-02-05T23:33:36Z

@bveeramani PTAL! Thank u.

bveeramani · 2026-02-10T01:33:55Z

@yuhuan130 as a sanity check, could run this pipeline and verify that the logical memory line is at 2 GiB?

import ray


def sleep(row):
    import time
    time.sleep(1)
    return row


ray.data.range(256, override_num_blocks=256).map(sleep, memory=2 * 1024**3).materialize()

yuhuan130 · 2026-02-10T11:19:19Z

@yuhuan130 as a sanity check, could run this pipeline and verify that the logical memory line is at 2 GiB?
import ray


def sleep(row):
    import time
    time.sleep(1)
    return row


ray.data.range(256, override_num_blocks=256).map(sleep, memory=2 * 1024**3).materialize()

Hey, I just ran the sanity check and this is the result! Got three cores running and each was distributed with 2GB. Looks good to me.

- ReadRange: Tasks: 5 [backpressured:tasks]; Actors: 0; Queued blocks: 250 (0.0B); Resources: 5.0 CPRunning Dataset: dataset_6_0. Active & requested resources: 3/8 CPU, 384.0MiB/1.0GiB object store: :Running Dataset: dataset_6_0. Active & requested resources: 3/8 CPU, 384.0MiB/1.0GiB object store: : 0.00 row [00:01, ? row/s]

Running Dataset: dataset_6_0. Active & requested resources: 6/8 CPU, 272.0B/1.0GiB object store: : 0Running Dataset: dataset_6_0. Active & requested resources: 6/8 CPU, 272.0B/1.0GiB object store:   0Running Dataset: dataset_6_0. Active & requested resources: 6/8 CPU, 272.0B/1.0GiB object store:   0

Running Dataset: dataset_6_0. Active & requested resources: 3/8 CPU, 48.0B/1.0GiB object store:  99%2026-02-10 03:01:52,981AINFO streaming_executor.py:304 -- ✔️  Dataset dataset_6_0 execution finished in 88.51 secondssks: 3; Actors: 0; Queued blocks: 0 (0.0B); Resources: 3.0 CPU, 24.0B object store: 
✔️  Dataset dataset_6_0 execution finished in 88.51 seconds: 100%|█| 256/256 [01:28<00:00, 2.90 row/ 
- ReadRange: Tasks: 0; Actors: 0; Queued blocks: 0 (0.0B); Resources: 0.0 CPU, 0.0B object store: 10
- Map(sleep): Tasks: 0; Actors: 0; Queued blocks: 0 (0.0B); Resources: 0.0 CPU, 0.0B object store: 1

## Description This PR adds a **Logical Memory Usage** panel to the Ray Default Dashboard. It's positioned in the "Ray Resources by Node" section, right after the "Logical GPUs Usage" panel. ## Related issues Fixes ray-project#60715 **Screenshot:** <img width="1440" height="780" alt="Screenshot 2026-02-05 at 00 11 16" src="https://github.com/user-attachments/assets/56d9962c-b6f3-49eb-a8e2-5374c367fc03" /> <img width="1440" height="775" alt="Screenshot 2026-02-05 at 00 10 43" src="https://github.com/user-attachments/assets/3c12c9f7-2935-43f0-b6ee-3b12d24ac964" /> **Testing:** ✅ Tested locally with Prometheus + Grafana ✅ Dashboard generates correctly with the new panel ✅ Metrics display properly in Grafana --------- Signed-off-by: “Alex <alexchien130@gmail.com> Co-authored-by: Balaji Veeramani <balaji@anyscale.com> Signed-off-by: Adel Nour <ans9868@nyu.edu>

## Description This PR adds a **Logical Memory Usage** panel to the Ray Default Dashboard. It's positioned in the "Ray Resources by Node" section, right after the "Logical GPUs Usage" panel. ## Related issues Fixes ray-project#60715 **Screenshot:** <img width="1440" height="780" alt="Screenshot 2026-02-05 at 00 11 16" src="https://github.com/user-attachments/assets/56d9962c-b6f3-49eb-a8e2-5374c367fc03" /> <img width="1440" height="775" alt="Screenshot 2026-02-05 at 00 10 43" src="https://github.com/user-attachments/assets/3c12c9f7-2935-43f0-b6ee-3b12d24ac964" /> **Testing:** ✅ Tested locally with Prometheus + Grafana ✅ Dashboard generates correctly with the new panel ✅ Metrics display properly in Grafana --------- Signed-off-by: “Alex <alexchien130@gmail.com> Co-authored-by: Balaji Veeramani <balaji@anyscale.com>

added logical memory usage panel to ray default dashboard

17d9d54

Signed-off-by: “Alex <alexchien130@gmail.com>

yuhuan130 requested a review from a team as a code owner February 5, 2026 07:59

gemini-code-assist Bot reviewed Feb 5, 2026

View reviewed changes

cursor Bot reviewed Feb 5, 2026

View reviewed changes

Comment thread python/ray/dashboard/modules/metrics/dashboards/default_dashboard_panels.py

yuhuan130 added 4 commits February 5, 2026 00:14

change to ray_resources(...)

7bb4c73

Signed-off-by: “Alex <alexchien130@gmail.com>

added pending mem

0e50ff7

Signed-off-by: “Alex <alexchien130@gmail.com>

lint

33b0b82

Signed-off-by: “Alex <alexchien130@gmail.com>

lint

122a709

Signed-off-by: “Alex <alexchien130@gmail.com>

ray-gardener Bot added the community-contribution Contributed by the community label Feb 5, 2026

Merge branch 'master' into add-memory-panel-clean

5fd984b

bveeramani approved these changes Feb 10, 2026

View reviewed changes

alanwguo approved these changes Feb 10, 2026

View reviewed changes

Yicheng-Lu-llll approved these changes Feb 10, 2026

View reviewed changes

bveeramani enabled auto-merge (squash) February 10, 2026 18:37

github-actions Bot added the go add ONLY when ready to merge, run all tests label Feb 10, 2026

MengjinYan approved these changes Feb 10, 2026

View reviewed changes

bveeramani merged commit 0eecdde into ray-project:master Feb 10, 2026
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Dashboard] Add Logical Memory Usage panel#60772

[Dashboard] Add Logical Memory Usage panel#60772
bveeramani merged 6 commits into
ray-project:masterfrom
yuhuan130:add-memory-panel-clean

yuhuan130 commented Feb 5, 2026 •
edited

Loading

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

gemini-code-assist Bot Feb 5, 2026

Uh oh!

yuhuan130 Feb 5, 2026

Uh oh!

cursor Bot left a comment

Uh oh!

Uh oh!

yuhuan130 commented Feb 5, 2026

Uh oh!

bveeramani commented Feb 10, 2026

Uh oh!

yuhuan130 commented Feb 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

yuhuan130 commented Feb 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Related issues

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

gemini-code-assist Bot Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

yuhuan130 Feb 5, 2026

Choose a reason for hiding this comment

Uh oh!

cursor Bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

yuhuan130 commented Feb 5, 2026

Uh oh!

bveeramani commented Feb 10, 2026

Uh oh!

yuhuan130 commented Feb 10, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

yuhuan130 commented Feb 5, 2026 •
edited

Loading