Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[CNIFailure] Health check for AWS IPVLAN found CNIFailure due to address reserved. #109

Open
3 tasks
sunya-ch opened this issue May 22, 2023 · 0 comments
Open
3 tasks
Labels
aws bug Something isn't working
Milestone

Comments

@sunya-ch
Copy link
Collaborator

sunya-ch commented May 22, 2023

Describe the bug
A clear and concise description of what the bug is.

Health check reports the following CNI Failure for aws-ipvlan (v1.1.0).
However, with CNI mechanism, it should keep reassigning the next ip address until the one that is not reserved.

    {
      "HostName": "<hostname>",
      "Connectivity": {
        "10.0.144.0/20": false
      },
      "Allocability": 0,
      "StatusCode": 602,
      "Status": "CNIFailure",
      "Message": "Failed to AssignIP: InvalidParameterValue: Address 10.0.144.1 is in subnet's reserved range.\n\tstatus code: 400, request id: xxx"
    }
# pod creation events (got 3 retires before success)

  Warning  FailedCreatePodSandBox  39s   kubelet  Failed to create pod sandbox: rpc error: code = Unknown desc = failed to create pod network sandbox k8s_multi-nic-iperf3-client_default_6483a267-5dfd-4ca0-896d-707b8c8869ae_0(860bd3e40b8fcfcb1de4d2b47483362a6b237ba748bf89bb4222ae01d6633a19): error adding pod default_multi-nic-iperf3-client to CNI network "multus-cni-network": plugin type="multus" name="multus-cni-network" failed (add): [default/multi-nic-iperf3-client/6483a267-5dfd-4ca0-896d-707b8c8869ae:multinic-aws-ipvlan]: error adding container to network "multinic-aws-ipvlan": Failed to AssignIP: InvalidParameterValue: Address 10.0.144.1 is in subnet's reserved range.
           status code: 400, request id: 751fb907-59f7-4a17-9031-a888d3f822f0
  Normal   AddedInterface          39s  multus   Add eth0 [10.128.2.177/23] from ovn-kubernetes
  Warning  FailedCreatePodSandBox  37s  kubelet  Failed to create pod sandbox: rpc error: code = Unknown desc = failed to create pod network sandbox k8s_multi-nic-iperf3-client_default_6483a267-5dfd-4ca0-896d-707b8c8869ae_0(2de014d5c95871357d29e29ab4b73baea2584506bbf02511ef797d0c56acef76): error adding pod default_multi-nic-iperf3-client to CNI network "multus-cni-network": plugin type="multus" name="multus-cni-network" failed (add): [default/multi-nic-iperf3-client/6483a267-5dfd-4ca0-896d-707b8c8869ae:multinic-aws-ipvlan]: error adding container to network "multinic-aws-ipvlan": Failed to AssignIP: InvalidParameterValue: Address 10.0.144.2 is in subnet's reserved range.
           status code: 400, request id: 3f34b056-6956-4b5a-9d32-cd7bdf939b20
  Normal   AddedInterface          24s  multus   Add eth0 [10.128.2.177/23] from ovn-kubernetes
  Warning  FailedCreatePodSandBox  22s  kubelet  Failed to create pod sandbox: rpc error: code = Unknown desc = failed to create pod network sandbox k8s_multi-nic-iperf3-client_default_6483a267-5dfd-4ca0-896d-707b8c8869ae_0(132f9fea6589753b4c0d44882c5343e6fb35af4df7a3073da6b9303feaf84d40): error adding pod default_multi-nic-iperf3-client to CNI network "multus-cni-network": plugin type="multus" name="multus-cni-network" failed (add): [default/multi-nic-iperf3-client/6483a267-5dfd-4ca0-896d-707b8c8869ae:multinic-aws-ipvlan]: error adding container to network "multinic-aws-ipvlan": Failed to AssignIP: InvalidParameterValue: Address 10.0.144.3 is in subnet's reserved range.
           status code: 400, request id: b14f8fea-1807-4b2c-8121-c680aac9c409
  Normal   AddedInterface  8s  multus   Add eth0 [10.128.2.177/23] from ovn-kubernetes
  Normal   AddedInterface  7s  multus   Add net1 [10.0.144.4/20] from default/multinic-aws-ipvlan

Should fix the aws-ipvlan itself to list up the reserved IP in advanced or/and let the health-check available for some max retry for CNIFailure.

To Reproduce
Steps to reproduce the behavior:
1.
2.
3.
4.

Expected behavior
A clear and concise description of what you expected to happen.

Screenshots
If applicable, add screenshots to help explain your problem.

  • manager container of controller and multi-nicd DS status:
  • multinicnetwork CR:
  • hostinterface list/CR:
  • cidr CR (multiNICIPAM: true):
  • ippools CR (multiNICIPAM: true):
  • log of manager container:
  • log of failed multi-nicd pod:

Environment (please complete the following information):

  • platform: [e.g. self-managed k8s, self-managed OpenShift, EKS, IKS, AKS]
  • node profile:
  • operator version :
  • cluster scale (number of nodes, pods, interfaces):

Additional context
Add any other context about the problem here.

@sunya-ch sunya-ch added the bug Something isn't working label May 22, 2023
@sunya-ch sunya-ch added this to the v1.1.0 milestone May 22, 2023
@sunya-ch sunya-ch added the aws label Jun 23, 2023
@sunya-ch sunya-ch modified the milestones: v1.1.0, v1.2.0, v1.2.1 Aug 22, 2023
@sunya-ch sunya-ch modified the milestones: v1.2.1, v1.2.2 Oct 25, 2023
@sunya-ch sunya-ch modified the milestones: v1.3.0, v2.0.0 Aug 29, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
aws bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant