Skip to content

Commit d22fd03

Browse files
committed
Resolve comments
1 parent 4f2e217 commit d22fd03

File tree

3 files changed

+8
-11
lines changed

3 files changed

+8
-11
lines changed

README.md

Lines changed: 5 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -110,10 +110,11 @@ In this case, please install vLLM first. You can do so by running
110110
container with the following commands:
111111

112112
```
113-
mkdir -p /opt/tritonserver/backends/vllm
114-
git clone https://github.com/triton-inference-server/vllm_backend.git /opt/tritonserver/backends/vllm/vllm_backend
115-
cp -r /opt/tritonserver/backends/vllm/vllm_backend/src/* /opt/tritonserver/backends/vllm
116-
rm -rf /opt/tritonserver/backends/vllm/vllm_backend
113+
vllm_tmp_dir=/tmp/backends/vllm_backend
114+
mkdir -p /opt/tritonserver/backends/vllm $vllm_tmp_dir
115+
git clone https://github.com/triton-inference-server/vllm_backend.git $vllm_tmp_dir
116+
cp -r $vllm_tmp_dir/src/* /opt/tritonserver/backends/vllm
117+
rm -rf $vllm_tmp_dir
117118
```
118119

119120
## Using the vLLM Backend

src/model.py

Lines changed: 2 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -159,19 +159,15 @@ def init_engine(self):
159159
"model": self.args["model_name"],
160160
"version": self.args["model_version"],
161161
}
162-
self.metrics = VllmStatLogger(labels=labels)
162+
# Add vLLM custom metrics
163+
self.llm_engine.add_logger("triton", VllmStatLogger(labels=labels))
163164
except pb_utils.TritonModelException as e:
164165
if "metrics not supported" in str(e):
165166
# Metrics are disabled at the server
166-
self.metrics = None
167167
self.logger.log_info("[vllm] Metrics not supported")
168168
else:
169169
raise e
170170

171-
# Add vLLM custom metrics
172-
if self.metrics:
173-
self.llm_engine.add_logger("triton", self.metrics)
174-
175171
def setup_lora(self):
176172
self.enable_lora = False
177173

src/utils/metrics.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -67,7 +67,7 @@ def __init__(self, labels: Dict, local_interval: float = 0) -> None:
6767
self.metrics = TritonMetrics(labels=labels)
6868

6969
def info(self, type: str, obj: SupportsMetricsInfo) -> None:
70-
raise NotImplementedError
70+
pass
7171

7272
def _log_counter(self, counter, data: Union[int, float]) -> None:
7373
"""Convenience function for logging to counter.

0 commit comments

Comments
 (0)