White Papers

33 CheXNet Inference with Nvidia T4 on Dell EMC PowerEdge R7425
Figure 12. CheXNet Inference TF-TRT 5.0 Integration in INT8int8 Precision Mode
Command line to execute the benchmark:
python3 tensorrt_chest.py
--savedmodel_dir=/home/dell/chest-x-ray/chexnet_saved_model/1541777429/ \
--image_file=image.jpg \
--int8 \
--output_dir=/home/dell/chest-x-ray/output_tensorrt_chexnet_1541777429/
--batch_size=1
Docker image for TensorFlow-GPU: nvcr.io/nvidia/tensorflow:18.10-py3
Where: --int8: Benchmark the model with TensorRT™ using int8 precision
Script Output sample:
==========================
network: tftrt_int8_frozen_graph.pb, batchsize 1, steps 100
fps median: 282.2, mean: 315.2, uncertainty: 6.8, jitter: 5.6
latency median: 0.00354, mean: 0.00329, 99th_p: 0.00371, 99th_uncertainty: 0.00008
==========================