Update README.md for vLLM 0.4.2 args by copasseron · Pull Request #102 · triton-inference-server/tutorials

copasseron · 2024-07-04T14:00:25Z

Triton's vLLM backend is based on vLLM 0.4.2 that propose more argument to the one in the documentation of the tutorial.

oandreeva-nv · 2024-08-21T19:19:03Z

Quick_Deploy/vLLM/README.md


 This file can be modified to provide further settings to the vLLM engine. See vLLM
-[AsyncEngineArgs](https://github.com/vllm-project/vllm/blob/32b6816e556f69f1672085a6267e8516bcb8e622/vllm/engine/arg_utils.py#L165)
+[AsyncEngineArgs](https://github.com/vllm-project/vllm/blob/c7f2cf2b7f67bce5842fedfdba508440fe257375/vllm/engine/arg_utils.py#L615)


Good point! How about linking docs instead? potentially could be more sustainable.

Suggested change

[AsyncEngineArgs](https://github.com/vllm-project/vllm/blob/c7f2cf2b7f67bce5842fedfdba508440fe257375/vllm/engine/arg_utils.py#L615)

[AsyncEngineArgs](https://docs.vllm.ai/en/latest/dev/engine/async_llm_engine.html)

oandreeva-nv · 2024-08-21T19:20:00Z

Quick_Deploy/vLLM/README.md

+[AsyncEngineArgs](https://github.com/vllm-project/vllm/blob/c7f2cf2b7f67bce5842fedfdba508440fe257375/vllm/engine/arg_utils.py#L615)
 and
-[EngineArgs](https://github.com/vllm-project/vllm/blob/32b6816e556f69f1672085a6267e8516bcb8e622/vllm/engine/arg_utils.py#L11)
+[EngineArgs](https://github.com/vllm-project/vllm/blob/c7f2cf2b7f67bce5842fedfdba508440fe257375/vllm/engine/arg_utils.py#L21)


ditto

Suggested change

[EngineArgs](https://github.com/vllm-project/vllm/blob/c7f2cf2b7f67bce5842fedfdba508440fe257375/vllm/engine/arg_utils.py#L21)

[EngineArgs](https://docs.vllm.ai/en/latest/models/engine_args.html#engine-args)

oandreeva-nv · 2024-08-21T19:20:28Z

Thanks, @copasseron for this PR. Left some comments.

Update README.md for vLLM 0.4.2 args

ea9d5e0

Triton's vLLM backend is based on vLLM 0.4.2 that propose more argument to the one in the documentation of the tutorial.

oandreeva-nv reviewed Aug 21, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update README.md for vLLM 0.4.2 args#102

Update README.md for vLLM 0.4.2 args#102
copasseron wants to merge 1 commit intotriton-inference-server:mainfrom
copasseron:patch-2

copasseron commented Jul 4, 2024

Uh oh!

oandreeva-nv Aug 21, 2024

Uh oh!

oandreeva-nv Aug 21, 2024

Uh oh!

oandreeva-nv commented Aug 21, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	[AsyncEngineArgs](https://github.com/vllm-project/vllm/blob/c7f2cf2b7f67bce5842fedfdba508440fe257375/vllm/engine/arg_utils.py#L615)
	[AsyncEngineArgs](https://docs.vllm.ai/en/latest/dev/engine/async_llm_engine.html)

	[EngineArgs](https://github.com/vllm-project/vllm/blob/c7f2cf2b7f67bce5842fedfdba508440fe257375/vllm/engine/arg_utils.py#L21)
	[EngineArgs](https://docs.vllm.ai/en/latest/models/engine_args.html#engine-args)

Conversation

copasseron commented Jul 4, 2024

Uh oh!

oandreeva-nv Aug 21, 2024

Choose a reason for hiding this comment

Uh oh!

oandreeva-nv Aug 21, 2024

Choose a reason for hiding this comment

Uh oh!

oandreeva-nv commented Aug 21, 2024

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants