Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support probe timeout configuratoins #952

Open
ivelichkovich opened this issue May 22, 2024 · 4 comments
Open

Support probe timeout configuratoins #952

ivelichkovich opened this issue May 22, 2024 · 4 comments
Labels
enhancement New feature or request

Comments

@ivelichkovich
Copy link

What would you like to be added:

Support for timeouts on probe configurations, specifically for ofed-driver

Why is this needed:

lsmod can take longer than a second depending on what you have running on the host

@ivelichkovich ivelichkovich added the enhancement New feature or request label May 22, 2024
@rollandf
Copy link
Member

The MOFED probe timeouts are already available to be customized via the NicClusterPolicy CR.
See here:
https://github.com/Mellanox/network-operator/blob/master/api/v1alpha1/nicclusterpolicy_types.go#L90-L94

Please tell me if that meets your requirement.

@rollandf
Copy link
Member

@ivelichkovich BTW, I saw you have interest in IPAM solution with slicing per node.
You can check https://github.com/Mellanox/nvidia-k8s-ipam that implements the same.

@ivelichkovich
Copy link
Author

@ivelichkovich BTW, I saw you have interest in IPAM solution with slicing per node. You can check https://github.com/Mellanox/nvidia-k8s-ipam that implements the same.

Oh that's awesome, I'll explore that repo!

@ivelichkovich
Copy link
Author

ivelichkovich commented May 23, 2024

The MOFED probe timeouts are already available to be customized via the NicClusterPolicy CR. See here: https://github.com/Mellanox/network-operator/blob/master/api/v1alpha1/nicclusterpolicy_types.go#L90-L94

Please tell me if that meets your requirement.

so these do allow you to define the probes however PodProbeSpec only exposes these fields https://github.com/Mellanox/network-operator/blob/master/api/v1alpha1/nicclusterpolicy_types.go#L74 so having access to failureThreshold and timeoutSeconds would be nice to have. You could maybe just replace PodProbeSpec with the upstream probe object and then it would have 1:1 parity without needing to keep a copy of the struct in code and convert back/forth.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants