add config files for async inf template#59926
Conversation
harshit-anyscale
commented
Jan 7, 2026
- adding anyscale template configs for async inf template
Signed-off-by: harshit <harshit@anyscale.com>
There was a problem hiding this comment.
Code Review
This pull request adds Anyscale template configurations for AWS and GCE for the asynchronous inference tutorial. The changes are straightforward. My review includes suggestions to explicitly define CPU resources for the head node in both configuration files. This improves clarity and ensures robust resource allocation, making the configurations more aligned with best practices and other similar files in the repository.
| head_node_type: | ||
| name: head | ||
| instance_type: m5.2xlarge |
There was a problem hiding this comment.
For clarity and to ensure correct resource allocation for the head node, it's good practice to explicitly define its resources. While Ray can sometimes infer resources from the instance type, specifying them explicitly makes the configuration more robust and easier to understand. The m5.2xlarge instance has 8 vCPUs.
head_node_type:
name: head
instance_type: m5.2xlarge
resources:
cpu: 8| head_node_type: | ||
| name: head | ||
| instance_type: n1-standard-8 |
There was a problem hiding this comment.
For clarity and to ensure correct resource allocation for the head node, it's good practice to explicitly define its resources. While Ray can sometimes infer resources from the instance type, specifying them explicitly makes the configuration more robust and easier to understand. The n1-standard-8 instance has 8 vCPUs.
head_node_type:
name: head
instance_type: n1-standard-8
resources:
cpu: 8- adding anyscale template configs for async inf template Signed-off-by: harshit <harshit@anyscale.com> Signed-off-by: jasonwrwang <jasonwrwang@tencent.com>