Doc/3839/update sample int8 readme (#3845)

spadacesco · Francesco Spadafora · web-flow · commit 9d2613d353ca · 2024-05-15T11:15:17.000-07:00
* Replaced non-existent call setInt8Mode

INT8 ANN quantization is enabled by setting the builder flag BuilderFlag::kINT8 through the builder's setFlag call

Signed-off-by: Francesco Spadafora <sf-fs@windowslive.com>

* Fix wrong naming in example

Replaced name with tensor_name in example

Signed-off-by: Francesco Spadafora <sf-fs@windowslive.com>

* setStrictTypeConstraints updated to BuilderFlag::kINT8

The documentation regarding debugging of the INT8 mode is updated to match the builder flags

Signed-off-by: Francesco Spadafora <sf-fs@windowslive.com>

* typo fix

Signed-off-by: Francesco Spadafora <sf-fs@windowslive.com>

---------

Signed-off-by: Francesco Spadafora <sf-fs@windowslive.com>
Co-authored-by: Francesco Spadafora <francesco.spadafora@advertima.com>
diff --git a/samples/sampleINT8API/README.md b/samples/sampleINT8API/README.md
@@ -47,7 +47,7 @@ Specifically, this sample performs the following steps:
 	`if (!builder->platformHasFastInt8()) return false;`
 
 2.  Enable INT8 mode by setting the builder flag:
-	`builder->setInt8Mode(true);`
+	`builder->setFlag(BuilderFlag::kINT8);`
 
 	You can choose not to provide the INT8 calibrator.
 	`builder->setInt8Calibrator(nullptr);`
@@ -56,10 +56,10 @@ Specifically, this sample performs the following steps:
 
 3.  Optionally and for debugging purposes, the following flag configures the builder to choose type conforming layer implementation, if one exists.
 
-	For example, in the case of `DataType::kINT8`, types are requested by `setInt8Mode(true)`. Setting this flag ensures that only the conformant layer implementation (with `kINT8` input and output types), are chosen even if a high performance non-conformat implementation is available. If no conformant layer exists, TensorRT will choose a non-conformant layer if available regardless of the setting for this flag.
-
 	`builder->setStrictTypeConstraints(true);`
 
+	Setting `setStrictTypeConstraints(true)` together with the builder flag `setFlag(BuilderFlag::kINT8)` ensures that only the conformant layer implementation (with `kINT8` input and output types) is chosen even if a high performance non-conformant implementation is available. If no conformant layer exists, TensorRT will choose a non-conformant layer if available regardless of the setting for this flag.
+
 ### Configuring the network to use custom dynamic ranges and set per-layer precision
 
 1.  Iterate through the network to set the per activation tensor dynamic range.
@@ -75,7 +75,8 @@ Specifically, this sample performs the following steps:
 
 3.  Set the dynamic range for per layer tensors:
 	```
-	string tensor_name = network->getLayer(i)->getOutput(j)->getName(); network->getLayer(i)->getOutput(j)->setDynamicRange(-tensorMap.at(name), tensorMap.at(name));
+	string tensor_name = network->getLayer(i)->getOutput(j)->getName(); 
+	network->getLayer(i)->getOutput(j)->setDynamicRange(-tensorMap.at(tensor_name), tensorMap.at(tensor_name));
 	```
 
 4.  Optional: This sample also showcases using layer precision APIs. Using these APIs, you can selectively choose to run the layer with user configurable precision and type constraints. It may not result in optimal inference performance, but can be helpful while debugging mixed precision inference.
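
Reviewer note: the renamed `tensor_name` lookup in the last hunk can be checked without a TensorRT install. The following is a minimal sketch only. `FakeTensor` and `applyDynamicRanges` are hypothetical stand-ins invented for illustration and are not TensorRT types or functions; only the names `getName`, `setDynamicRange`, and `tensorMap` mirror the README snippet. It assumes `tensorMap` holds one non-negative max-absolute value per tensor name.

```cpp
#include <cassert>
#include <string>
#include <unordered_map>
#include <vector>

// Hypothetical stand-in for a network tensor; NOT the TensorRT tensor class.
struct FakeTensor
{
    std::string name;
    float rangeMin = 0.0F;
    float rangeMax = 0.0F;
    bool hasRange = false;

    const std::string& getName() const { return name; }

    // Mirrors the shape of the setDynamicRange(min, max) call in the README.
    void setDynamicRange(float minVal, float maxVal)
    {
        rangeMin = minVal;
        rangeMax = maxVal;
        hasRange = true;
    }
};

// Applies a symmetric range [-m, m] to every tensor that has an entry in
// tensorMap and returns how many tensors were updated (hypothetical helper).
inline int applyDynamicRanges(std::vector<FakeTensor>& tensors,
                              const std::unordered_map<std::string, float>& tensorMap)
{
    int updated = 0;
    for (FakeTensor& tensor : tensors)
    {
        const std::string tensor_name = tensor.getName();
        const auto it = tensorMap.find(tensor_name);
        if (it == tensorMap.end())
        {
            continue; // No range recorded for this tensor; leave it untouched.
        }
        assert(it->second >= 0.0F); // Assumption: values are max-absolute magnitudes.
        tensor.setDynamicRange(-it->second, it->second);
        ++updated;
    }
    return updated;
}
```

Unlike the README snippet, which calls `tensorMap.at(tensor_name)` and would throw on a missing key, this sketch skips tensors without an entry. That is a choice made for the sketch, not a claim about how the sample behaves.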