
Why is the time cost of inferring resnet50-v2-7 with tvm-cuda much smaller than using TensorRT on x86? #15276

Closed
Dsqds opened this issue Jul 10, 2023 · 1 comment
Labels: needs-triage, type: bug

Comments

Dsqds commented Jul 10, 2023

It took 0.11 ms when I inferred resnet50-v2-7 with tvm-cuda on x86.
Then I used TensorRT to infer the same model with fp16, and it took 0.4 ms.
Is it normal that tvm-cuda is so much faster than TensorRT on x86?
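
For context, implausibly small GPU timings often come from measuring only the asynchronous kernel launch rather than the completed execution. A minimal sketch of a synchronized measurement with TVM's `time_evaluator`, which syncs the device so the reported mean covers the full run; it assumes a module `lib` already built with `relay.build(..., target="cuda")`, and the `data` input name and 1x3x224x224 shape from the ONNX model zoo's resnet50-v2-7:

```python
import numpy as np
import tvm
from tvm.contrib import graph_executor

dev = tvm.cuda(0)
# `lib` is assumed to have been produced earlier by
# relay.build(mod, target="cuda", params=params)
m = graph_executor.GraphModule(lib["default"](dev))

# Input name/shape assumed from the ONNX model zoo resnet50-v2-7 definition
m.set_input("data", tvm.nd.array(
    np.random.rand(1, 3, 224, 224).astype("float32"), dev))

# time_evaluator runs "run" repeatedly and synchronizes the GPU, so the
# reported mean is wall-clock latency per inference, not just launch time
timer = m.module.time_evaluator("run", dev, number=100, repeat=3)
print("mean latency: %.3f ms" % (timer().mean * 1e3))
```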

masahi (Member) commented Jul 10, 2023

Please use https://discuss.tvm.apache.org/

masahi closed this as completed Jul 10, 2023