Exam Certified Generative AI Engineer Associate topic 1 question 75 discussion

Actual exam question from Databricks's Certified Generative AI Engineer Associate

Question #: 75
Topic #: 1

[All Certified Generative AI Engineer Associate Questions]

A Generative AI Engineer developed an LLM application using the pay-per-token Foundation Model API. Now that the application is ready to be deployed, they would like to ensure the model endpoint can serve high incoming volumes of requests in production.

What should the Generative AI Engineer consider?

A. Switch to using External Models instead
B. Throttle the incoming batch of requests manually to avoid rate limiting issues
C. Change to a model with a fewer number of parameters in order to reduce hardware constraint issues
D. Deploy the model using provisioned throughput as it comes with performance guarantees

Show Suggested Answer

Suggested Answer: D 🗳️

by Duke_CT at June 18, 2025, 1:09 p.m.

Comments

Submit Cancel

seaun

2 weeks, 2 days ago

Selected Answer: D

Answer should be D

upvoted 1 times

...

Duke_CT

2 weeks, 3 days ago

Selected Answer: A

I believe it's A.

upvoted 1 times

...