config the vLLM engineArgs in config.pbtxt #63
activezhao wants to merge 17 commits into triton-inference-server:main
Conversation
* Initial Commit
* Mount model repo so changes reflect, parameter tweaking, README file
* Image name error
* Incorporating review comments. Separate docker and model repo builds, add README, restructure repo
* Tutorial restructuring. Using static model configurations
* Bump triton container and update README
* Remove client script
* Incorporating review comments
* Modify WIP line in vLLM tutorial
* Remove trust_remote_code parameter from falcon model
* Removing Mistral
* Incorporating Feedback
* Change input/output names
* Pre-commit format
* Different perf_analyzer example, config file format fixes
* Deep dive changes to Triton tools section
* Remove unused variable
Added Llama2 tutorial for TensorRT-LLM backend
Updated vLLM tutorial's README to use vllm container (triton-inference-server#65)
Co-authored-by: dyastremsky <58150256+dyastremsky@users.noreply.github.com>
Hi @activezhao, may I kindly ask you to rebase your PR on top of the main branch and send us a CLA: https://github.com/triton-inference-server/server/blob/main/CONTRIBUTING.md#contributor-license-agreement-cla
Hi @oandreeva-nv OK, I will do it. But I find that the structure of Quick_Deploy/vLLM has changed a lot; will this PR still be OK?
# Conflicts:
#   Quick_Deploy/vLLM/config.pbtxt
Hi @oandreeva-nv Because the structure of Quick_Deploy/vLLM has changed a lot, I have created a new PR. Could you please close this PR and do CR in the new PR? Thanks.
Closing this PR per @activezhao's request
Previously, we got the vLLM EngineArgs from vllm_engine_args.json; now we can get them from config.pbtxt.
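For illustration, here is a minimal sketch of how engine arguments might be declared as `parameters` entries in config.pbtxt. The parameter names (`model`, `gpu_memory_utilization`, `disable_log_requests`) mirror vLLM's engine arguments, but the exact set and defaults used by this PR may differ:

```
backend: "python"

parameters: {
  key: "model"
  value: { string_value: "facebook/opt-125m" }
}
parameters: {
  key: "gpu_memory_utilization"
  value: { string_value: "0.5" }
}
parameters: {
  key: "disable_log_requests"
  value: { string_value: "true" }
}
```

On the Python side, the backend's `model.py` could then build the engine from these parameters instead of reading vllm_engine_args.json. The sketch below assumes each value arrives as a `string_value` and must be converted to its proper type; the conversion logic here is illustrative, not the PR's exact code:

```python
import json

from vllm.engine.arg_utils import AsyncEngineArgs
from vllm.engine.async_llm_engine import AsyncLLMEngine


class TritonPythonModel:
    def initialize(self, args):
        # Triton passes the parsed config.pbtxt to the Python backend
        # as a JSON string under args["model_config"].
        model_config = json.loads(args["model_config"])

        # Each config.pbtxt parameter arrives as {"string_value": "..."},
        # so every value must be converted from its string form.
        raw_params = {
            key: value["string_value"]
            for key, value in model_config.get("parameters", {}).items()
        }

        # Illustrative type conversions; the real PR may handle more args.
        engine_args = AsyncEngineArgs(
            model=raw_params["model"],
            gpu_memory_utilization=float(
                raw_params.get("gpu_memory_utilization", "0.9")
            ),
            disable_log_requests=raw_params.get(
                "disable_log_requests", "false"
            ).lower() == "true",
        )
        self.llm_engine = AsyncLLMEngine.from_engine_args(engine_args)
```

One advantage of this approach is that all model settings live in a single file that Triton already validates and reports, rather than splitting configuration between config.pbtxt and a separate JSON file.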