Ideal parameters when serving only one client? #1268
Unanswered
andysalerno
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi,
I'm using TGI to serve up an assistant, for which I'm the only user.
TGI might be a bit overkill for this, since it is geared toward efficiently serving many clients. But it's so easy to set up, and so stable, that it's working beautifully for this scenario :)
So my question is, if I know I am only serving one request at a time, what would be the ideal parameters?
I'm guessing these:
but I'm not sure if there are any other considerations or any other things I could do.
Beta Was this translation helpful? Give feedback.
All reactions