Update kubernetes_deployment.md #398

Open
wants to merge 3 commits into
base: main

Conversation

Rajakavitha1
Contributor

@Rajakavitha1 Rajakavitha1 commented Sep 23, 2024

fixes: #383

Proposed Changes

fixes: kserve#383

Signed-off-by: Rajakavitha Kodhandapani <[email protected]>

netlify bot commented Sep 23, 2024

Deploy Preview for elastic-nobel-0aef7a ready!

Name Link
🔨 Latest commit bf1d4f9
🔍 Latest deploy log https://app.netlify.com/sites/elastic-nobel-0aef7a/deploys/66f102b1a4da0c0008e79c6d
😎 Deploy Preview https://deploy-preview-398--elastic-nobel-0aef7a.netlify.app

fixed formatting

Signed-off-by: Rajakavitha Kodhandapani <[email protected]>
format

Signed-off-by: Rajakavitha Kodhandapani <[email protected]>
Member

@terrytangyuan terrytangyuan left a comment

Thanks for your contribution!

"disableIngressCreation": true,
"ingressDomain": "svc.cluster.local",
"ingressGateway" : "disabled",
"ingressService" : "disabled"

@spolti spolti Sep 23, 2024

Here we also need to set the default ingressClass; otherwise you will have to do everything manually later.
If the reason is to decouple from Istio, most k8s clusters have a default ingress class, and we need to give this information to the user (e.g. minikube uses nginx).
Instead of disabling the ingress, it would be better to explain both options.
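
A minimal sketch of the two options mentioned above, written as the JSON value that would sit under the `ingress` key of the KServe configuration. The class name `nginx` is an assumption taken from the minikube example in this comment, and the sketch assumes `ingressClassName` is the key that selects the class; check both against your KServe version before using them.

```python
import json

# Option 1: keep Ingress creation enabled and name the cluster's ingress class.
# "nginx" is an assumed value (the minikube example); substitute the class
# your cluster actually provides.
ingress_with_class = {
    "ingressClassName": "nginx",
}

# Option 2: skip Ingress creation entirely, as in the snippet under review.
ingress_disabled = {
    "disableIngressCreation": True,
    "ingressDomain": "svc.cluster.local",
}


def render_ingress_value(config: dict) -> str:
    """Serialize one option as the JSON text stored under the `ingress` key."""
    return json.dumps(config, indent=4)


for label, config in (
    ("with an ingress class", ingress_with_class),
    ("ingress creation disabled", ingress_disabled),
):
    print(f"# {label}")
    print(render_ingress_value(config))
```

Documenting both variants side by side would let readers pick one without having to infer which keys belong together.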

Contributor Author

Please suggest what change needs to be made to the yaml file.

Member

@spolti Can you provide some suggested verbiage here for @Rajakavitha1?

It is described a few lines below, in one of the removed lines.

By the way @terrytangyuan, shouldn't we have a specific section for raw?
Instead of deleting the Istio part, we should document both ways.
Istio is required for serverless, and it is the default method.

You could restructure the beginning of the doc to keep the requirements for the Serverless mode and then, later, explain what RawDeployment is (as is done now) and describe the steps needed to install it.
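
As a rough illustration of what the RawDeployment install steps might include, the sketch below builds the patch body that switches the default deployment mode. It assumes a standard installation where the settings live in a ConfigMap named `inferenceservice-config` in the `kserve` namespace; the helper name `build_deploy_patch` is hypothetical. The script only prints a command for review and does not contact a cluster.

```python
import json
import shlex


def build_deploy_patch(mode: str = "RawDeployment") -> str:
    """Return a strategic-merge patch body for the `deploy` key.

    ConfigMap values are strings, so the inner settings are serialized
    to JSON first and then embedded as a string in the outer patch.
    """
    inner = json.dumps({"defaultDeploymentMode": mode})
    return json.dumps({"data": {"deploy": inner}})


patch = build_deploy_patch()

# Print a command line for review; nothing is executed here.
print(
    "kubectl patch configmap/inferenceservice-config -n kserve "
    "--type=strategic -p " + shlex.quote(patch)
)
```

Generating the payload this way avoids hand-escaping the nested quotes, which is an easy place to make mistakes when copying the command into a doc.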

Contributor

                "disableIstioVirtualHost": true,
                "disableIngressCreation": true,
                "ingressGateway" : "disabled",
                "ingressService" : "disabled"

I have a few questions regarding this configuration.

Isn't it true that the controller already handles these aspects? In other words, when using rawDeployment, these related calls may not even be triggered. Should we really be making changes in this area?

                "ingressDomain": "svc.cluster.local",

this would be needed.

"defaultDeploymentMode": "RawDeployment"

Additionally, I understand that this setting makes rawDeployment the default, but it doesn't mean that only rawDeployment will be used. I think this configuration might still be necessary when setting the serverlessMode in isvc.

Lastly, shouldn't `disableIngressCreation` be set to false instead? Does the ingress not need to be created?
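
To make the "default, not exclusive" point concrete, here is a hedged sketch of an `InferenceService` that overrides the cluster default for itself through the `serving.kserve.io/deploymentMode` annotation. The service name and storage URI are placeholders, and the helper `inference_service` is hypothetical; only the manifest shape is the point.

```python
import json


def inference_service(name: str, storage_uri: str, mode: str) -> dict:
    """Build an InferenceService manifest that pins its own deployment mode."""
    return {
        "apiVersion": "serving.kserve.io/v1beta1",
        "kind": "InferenceService",
        "metadata": {
            "name": name,
            # Per-service override of the ConfigMap's defaultDeploymentMode.
            "annotations": {"serving.kserve.io/deploymentMode": mode},
        },
        "spec": {
            "predictor": {
                "model": {
                    "modelFormat": {"name": "sklearn"},
                    "storageUri": storage_uri,
                }
            }
        },
    }


# Placeholder name and bucket; replace with real values before applying.
manifest = inference_service("sklearn-iris", "gs://example-bucket/model", "Serverless")
print(json.dumps(manifest, indent=2))
```

If a service like this can still request Serverless mode while the default is RawDeployment, the ingress-related settings would continue to matter for it, which is the concern raised above.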

@@ -1,52 +1,55 @@
# Kubernetes Deployment Installation Guide
KServe supports `RawDeployment` mode to enable `InferenceService` deployment with Kubernetes resources [`Deployment`](https://kubernetes.io/docs/concepts/workloads/controllers/deployment), [`Service`](https://kubernetes.io/docs/concepts/services-networking/service), [`Ingress`](https://kubernetes.io/docs/concepts/services-networking/ingress) and [`Horizontal Pod Autoscaler`](https://kubernetes.io/docs/tasks/run-application/horizontal-pod-autoscale). Comparing to serverless deployment it unlocks Knative limitations such as mounting multiple volumes, on the other hand `Scale down and from Zero` is not supported in `RawDeployment` mode.

Kubernetes 1.22 is the minimally required version and please check the following recommended Istio versions for the corresponding

not sure if we need to remove it.

Contributor Author

@Rajakavitha1 Rajakavitha1 Sep 24, 2024

My understanding was that Istio was not required. The current installation guide for raw deployment mode is not accurate: for example, Istio is not needed if disableIngressCreation is enabled. See https://github.com/kserve/kserve/pull/3436

Yes, raw does not require it, but for the serverless mode the docs advise using it, IIRC.


Successfully merging this pull request may close these issues.

Improve raw deployment documentation
4 participants