
❓ [Question] Is there any way to deploy on a single machine with multiple GPUs? #3092

Open · SZ-ing opened this issue Aug 16, 2024 · 1 comment
Labels: question (Further information is requested)

Comments

SZ-ing commented Aug 16, 2024

❓ Question

What you have already tried

Environment

Build information about Torch-TensorRT can be found by turning on debug messages

  • PyTorch Version (e.g., 1.0):
  • CPU Architecture:
  • OS (e.g., Linux):
  • How you installed PyTorch (conda, pip, libtorch, source):
  • Build command you used (if compiling from source):
  • Are you using local sources or building from archives:
  • Python version:
  • CUDA version:
  • GPU models and configuration:
  • Any other relevant information:

Additional context

As the title says, I have a machine with multiple GPUs, and I would like to evenly distribute the model across these GPUs. Is there any way to achieve this?

narendasan (Collaborator) commented:
Take a look at these tutorials:

There are many tools that can help automate converting a model into one that runs on multiple GPUs. One example is DeepSpeed's automatic tensor parallelism (see the sketch below): https://www.deepspeed.ai/tutorials/automatic-tensor-parallelism/
Another is Hugging Face Accelerate: https://huggingface.co/docs/accelerate/basic_tutorials/launch
