How to deploy Triton on Vertex AI? #7584
Unanswered
btseitlinatwork
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hello
I am excited to use Triton with Google Cloud Vertex AI, but I can't seem to find a way.
I followed these tutorials:
None of them produced a working deployment. I am able to build the container (or rather rename the NGC container and upload it to google artifact registry). But deployment of the model to a Vertex endpoint fails with a non-descriptive error:
It seems like the container is receiving some unexpected command. But I don't understand what it was: there are no logs.
Does anyone have an up-to-date guide on how to deploy to Vertex?
Also, I see that all questions and discussions of Triton on Vertex are from 2022 and older. Should I look into deploying on GKE or another solution altogether?
Beta Was this translation helpful? Give feedback.
All reactions