        # Optionally you can specify additional keyword arguments for the model, e.g. temperature and headers.
        temperature=0.1,
        headers={"route": "v1/chat/completions"},  # default header for chat models
    )

Completion Models
=================

Completion models take a text string as input and return a string with completions. To use completion models, your model should be deployed with the completion endpoint (``/v1/completions``).
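
The following example shows a minimal completion setup. The ``OCIModelDeploymentLLM`` class and placeholder endpoint below are representative; substitute the class and endpoint matching your own deployment.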

.. code-block:: python3

    from ads.llm import OCIModelDeploymentLLM  # assumed generic completion class

    llm = OCIModelDeploymentLLM(
        model="odsc-llm",
        endpoint=f"<oci_model_deployment_url>/predict",
        # Optionally you can specify additional keyword arguments for the model.
        max_tokens=32,
        headers={"route": "v1/completions"},  # default header for completion models
    )

    # Invoke the LLM. The completion will be a string.
    response = llm.invoke("Who is the first president of United States?")

Chat Models
===========

Chat models take `chat messages <https://python.langchain.com/docs/concepts/#messages>`_ as inputs and return an additional chat message (usually `AIMessage <https://python.langchain.com/docs/concepts/#aimessage>`_) as output. To use chat models, your models must be deployed with the chat completion endpoint (``/v1/chat/completions``).

.. code-block:: python3

    from langchain_core.messages import HumanMessage, SystemMessage
    from ads.llm import ChatOCIModelDeployment

    llm = ChatOCIModelDeployment(
        model="odsc-llm",
        endpoint=f"<oci_model_deployment_url>/predict",
        # Optionally you can specify additional keyword arguments for the model.
        max_tokens=32,
        headers={"route": "v1/chat/completions"},  # default header for chat models
    )
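
Once constructed, the chat model is invoked with a list of messages and returns an ``AIMessage``. A minimal usage sketch, where the prompt text is illustrative:

.. code-block:: python3

    messages = [
        SystemMessage(content="You are a helpful assistant."),
        HumanMessage(content="Who is the first president of United States?"),
    ]

    # Invoke the chat model. The response is an AIMessage.
    response = llm.invoke(messages)
    print(response.content)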