ads/feature_store/docs/source/feature_group.rst
16 additions & 13 deletions
@@ -128,19 +128,18 @@ Materialise Stream
You can call the ``materialise_stream() -> FeatureGroupJob`` method of the ``FeatureGroup`` instance to load streaming data into the feature group. To persist the feature group and save its data along with the metadata in the feature store, call the ``materialise_stream()`` method.
The ``.materialise_stream()`` method takes the following parameters:
-- ``input_dataframe``: Features in Streaming Dataframe to be saved.
-- ``query_name``: It is possible to optionally specify a name for the query to make it easier to recognise in the Spark UI. Defaults to ``None``.
-- ``ingestion_mode``: Specifies how data of a streaming DataFrame/Dataset is written to a streaming sink.
--  - ``"append"``: Only the new rows in the streaming DataFrame/Dataset will be written to the sink. If the query doesn’t contain aggregations, it will be equivalent to
--    append mode. Defaults to ``"append"``.
--  - ``"complete"``: All the rows in the streaming DataFrame/Dataset will be written to the sink every time there is some update.
--  - ``"update"``: only the rows that were updated in the streaming DataFrame/Dataset will be written to the sink every time there are some updates.
-- ``await_termination``: Waits for the termination of this query, either by query.stop() or by an exception. If the query has terminated with an exception, then the exception will be thrown. If timeout is set, it returns whether the query has terminated or not within the timeout seconds. Defaults to ``False``.
-- ``timeout``: Only relevant in combination with ``await_termination=True``.
--  Defaults to ``None``.
-- ``checkpoint_dir``: Checkpoint directory location. This will be used to as a reference to from where to resume the streaming job. If ``None`` then hsfs will construct as "insert_stream_" + online_topic_name. Defaults to ``None``.
-- ``write_options``: Additional write options for Spark as key-value pairs.
--  Defaults to ``{}``.
+- ``input_dataframe``: Features in a streaming DataFrame to be saved.
+- ``query_name``: Optionally specify a name for the query to make it easier to recognise in the Spark UI. Defaults to ``None``.
+- ``ingestion_mode``: Specifies how data of a streaming DataFrame/Dataset is written to the streaming sink. Defaults to ``"append"``.
+
+  - ``append``: Only the new rows in the streaming DataFrame/Dataset are written to the sink.
+  - ``complete``: All the rows in the streaming DataFrame/Dataset are written to the sink every time there is an update.
+  - ``update``: Only the rows that were updated in the streaming DataFrame/Dataset are written to the sink every time there are updates. If the query doesn’t contain aggregations, this is equivalent to ``append`` mode.
+
+- ``await_termination``: Waits for the termination of the query, either by ``query.stop()`` or by an exception. If the query terminated with an exception, the exception is thrown. If ``timeout`` is set, it returns whether the query terminated within ``timeout`` seconds. Defaults to ``False``.
+- ``timeout``: Only relevant in combination with ``await_termination=True``. Defaults to ``None``.
+- ``checkpoint_dir``: Checkpoint directory location, used as the reference from which to resume the streaming job. Defaults to ``None``.
+- ``write_options``: Additional write options for Spark as key-value pairs. Defaults to ``{}``.
.. seealso::
    :ref:`Feature Group Job`
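
As a minimal usage sketch (an editor's illustration, not part of this diff), the parameters documented above might be combined as follows; ``feature_group``, ``streaming_df``, the query name, and the checkpoint path are placeholders, with ``streaming_df`` standing in for a Spark structured-streaming DataFrame:

.. code-block:: python3

    # Illustrative sketch only; all names and paths are placeholders.
    # `feature_group` is an existing FeatureGroup instance and
    # `streaming_df` a Spark structured-streaming DataFrame of features.
    feature_group_job = feature_group.materialise_stream(
        input_dataframe=streaming_df,
        query_name="transactions_stream",  # label shown in the Spark UI
        ingestion_mode="append",           # write only new rows to the sink
        await_termination=True,            # block until the query stops
        timeout=600,                       # stop waiting after 600 seconds
        checkpoint_dir="oci://bucket@namespace/checkpoints/transactions",
        write_options={},
    )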
@@ -200,6 +199,9 @@ With a ``FeatureGroup`` instance, You can save the expectation details using ``w
.. image:: figures/validation.png
.. code-block:: python3
+    from great_expectations.core import ExpectationSuite, ExpectationConfiguration
+    from ads.feature_store.common.enums import TransformationMode, ExpectationType
+    from ads.feature_store.feature_group import FeatureGroup

    expectation_suite = ExpectationSuite(
        expectation_suite_name="expectation_suite_name"
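
As a hedged sketch of how the truncated snippet above might continue: ``add_expectation`` and ``ExpectationConfiguration`` are standard Great Expectations APIs, while attaching the suite via ``with_expectation_suite`` with an ``ExpectationType`` is an assumption based on the imports and the truncated hunk header below.

.. code-block:: python3

    # Hypothetical continuation of the snippet above: add one expectation
    # to the suite (the column name is a placeholder).
    expectation_suite.add_expectation(
        ExpectationConfiguration(
            expectation_type="expect_column_values_to_not_be_null",
            kwargs={"column": "user_id"},
        )
    )

    # Assumed attachment step: STRICT would fail ingestion when
    # validation fails; the exact method name may differ.
    feature_group.with_expectation_suite(
        expectation_suite, expectation_type=ExpectationType.STRICT
    )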
@@ -248,6 +250,7 @@ feature group or it can be updated later as well.
.. code-block:: python3
    # Define statistics configuration for selected features
+    from ads.feature_store.statistics_config import StatisticsConfig
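
A brief sketch of how the imported ``StatisticsConfig`` might be used; the keyword arguments, column names, and builder method are assumptions for illustration, not confirmed by this diff:

.. code-block:: python3

    # Hypothetical: enable statistics collection for two placeholder columns.
    stats_config = StatisticsConfig(is_enabled=True, columns=["age", "income"])

    # Assumed builder step on the feature group definition.
    feature_group.with_statistics_config(stats_config)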