
How can I use TRT in TorchRec? #2307

Open
yjjinjie opened this issue Aug 16, 2024 · 2 comments

Comments

@yjjinjie

I see the example: https://github.com/pytorch/torchrec/blob/v0.2.0/examples/inference/dlrm_predict_single_gpu.py

but how can I split the model into its embedding and dense parts, run the dense part with TRT, combine them again (EBC + TRT dense model), and export the result as a TorchScript model for C++ inference?

@PaulZhang12
Contributor

@yjjinjie that inference solution is legacy and will need to be cleaned up. Check out this file for exporting a model to TorchScript for C++ serving: https://github.com/pytorch/torchrec/blob/main/torchrec/inference/dlrm_predict.py

You can apply lowering to the dense part of the model individually and then still TorchScript it like the example above.
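
A minimal sketch of what lowering only the dense part could look like, assuming Torch-TensorRT's TorchScript path (`ir="ts"`), a fixed batch size, and the `dense_arch` attribute of `torchrec.models.dlrm.DLRM`; the helper name `lower_dense_arch_to_trt` is illustrative, not a TorchRec API:

```python
import torch
import torch_tensorrt
from torchrec.models.dlrm import DLRM


def lower_dense_arch_to_trt(model: DLRM, batch_size: int, num_dense_features: int) -> DLRM:
    """Illustrative helper: lower only the dense MLP (DenseArch) to TensorRT."""
    # DenseArch.forward takes a plain (batch_size, num_dense_features) float
    # tensor, so it can be traced and lowered in isolation from the EBC.
    dense_arch = model.dense_arch.eval().cuda()
    example_input = torch.randn(batch_size, num_dense_features, device="cuda")
    traced_dense = torch.jit.trace(dense_arch, example_input)

    trt_dense = torch_tensorrt.compile(
        traced_dense,
        ir="ts",  # stay on the TorchScript path so the result remains scriptable
        inputs=[torch_tensorrt.Input((batch_size, num_dense_features))],
        enabled_precisions={torch.float16},
    )

    # Swap the lowered module back in; the SparseArch / EmbeddingBagCollection
    # is left untouched and still runs as ordinary TorchScript.
    model.dense_arch = trt_dense
    return model
```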

@yjjinjie
Author

yjjinjie commented Aug 20, 2024

Can you show an example of how to lower the dense part of the model individually?

Just lower the DenseArch?

https://github.com/pytorch/torchrec/blob/main/torchrec/models/dlrm.py#L115
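
Continuing the sketch above, the exported artifact for C++ serving could then be produced roughly like this, assuming the combined model scripts cleanly (in practice dlrm_predict.py wraps the model in a predict module and quantizes the embeddings first); sizes and file names are placeholders:

```python
import torch

# Reuse the illustrative helper from the earlier sketch.
model = lower_dense_arch_to_trt(model, batch_size=512, num_dense_features=13)

# Script the full model: the EBC stays as regular TorchScript while the dense
# MLP dispatches into the embedded TensorRT engine, then save for C++ serving.
scripted = torch.jit.script(model)
torch.jit.save(scripted, "dlrm_with_trt_dense.pt")
```

On the C++ side the saved file should be loadable with `torch::jit::load`, provided the Torch-TensorRT runtime library is linked so the embedded TRT engine can execute.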
