Add doc on customizing decoding

Conchylicultor · copybara-github · commit 51a66d4c3791 · 2019-07-17T12:55:53.000-07:00
PiperOrigin-RevId: 258624307
diff --git a/docs/_book.yaml b/docs/_book.yaml
@@ -23,6 +23,8 @@ upper_tabs:
         path: /datasets/splits
       - title: Add a dataset
         path: /datasets/add_dataset
+      - title: Feature decoding
+        path: /datasets/decode
       - title: Add huge datasets
         path: /datasets/beam_datasets
       - title: Store your dataset on GCS
diff --git a/docs/decode.md b/docs/decode.md
@@ -0,0 +1,156 @@
+# Customizing feature decoding
+
+*   [Usage examples](#usage-examples)
+    *   [Skipping the image decoding](#skipping-the-image-decoding)
+    *   [Filter/shuffle dataset before images get decoded](#filtershuffle-dataset-before-images-get-decoded)
+    *   [Cropping and decoding at the same time](#cropping-and-decoding-at-the-same-time)
+    *   [Customizing video decoding](#customizing-video-decoding)
+
+The `tfds.decode` API allows you override the default feature decoding. The main
+use case is to skip the image decoding for better performance.
+
+Warning: This API gives you access to the low-level `tf.train.Example` format on
+disk (as defined by the `FeatureConnector`). This API is targeted towards
+advanced users who want better read performance with images.
+
+## Usage examples
+
+### Skipping the image decoding
+
+To keep full control over the decoding pipeline, or to apply a filter before the
+images get decoded (for better performance), you can skip the image decoding
+entirely. This works with both `tfds.features.Image` and `tfds.features.Video`.
+
+```python
+ds = tfds.load('imagenet2012', split='train', decoders={
+    'image': tfds.decode.SkipDecoding(),
+})
+
+for example in ds.take(1):
+  assert example['image'].dtype == tf.string  # Images are not decoded
+```
+
+### Filter/shuffle dataset before images get decoded
+
+Similarly to the previous example, you can use `tfds.decode.SkipDecoding()` to
+insert additional `tf.data` pipeline customization before decoding the image.
+That way the filtered images won't be decoded and you can use a bigger shuffle
+buffer.
+
+```python
+# Load the base dataset without decoding
+ds, ds_info = tfds.load(
+    'imagenet2012',
+    split='train',
+    decoders={
+        'image': tfds.decode.SkipDecoding(),  # Image won't be decoded here
+    },
+    as_supervised=True,
+    with_info=True,
+)
+# Apply filter and shuffle
+ds = ds.filter(lambda image, label: label != 10)
+ds = ds.shuffle(10000)
+# Then decode with ds_info.features['image']
+ds = ds.map(
+    lambda image, label: ds_info.features['image'].decode_example(image), label)
+
+```
+
+### Cropping and decoding at the same time
+
+To override the default `tf.io.decode_image` operation, you can create a new
+`tfds.decode.Decoder` object using the `tfds.decode.make_decoder()` decorator.
+
+```python
+@tfds.decode.make_decoder()
+def decode_example(serialized_image, feature):
+  crop_y, crop_x, crop_height, crop_width = 10, 10, 64, 64
+  return tf.image.decode_and_crop_jpeg(
+      serialized_image,
+      [crop_y, crop_x, crop_height, crop_width],
+      channels=feature.feature.shape[-1],
+  )
+
+ds = tfds.load('imagenet2012', split='train', decoders={
+    # With video, decoders are applied to individual frames
+    'image': decode_example(),
+})
+```
+
+Which is equivalent to:
+
+```python
+def decode_example(serialized_image, feature):
+  crop_y, crop_x, crop_height, crop_width = 10, 10, 64, 64
+  return tf.image.decode_and_crop_jpeg(
+      serialized_image,
+      [crop_y, crop_x, crop_height, crop_width],
+      channels=feature.shape[-1],
+  )
+
+ds, ds_info = tfds.load(
+    'imagenet2012',
+    split='train',
+    with_info=True,
+    decoders={
+        'image': tfds.decode.SkipDecoding(),  # Skip frame decoding
+    },
+)
+ds = ds.map(functools.partial(decode_example, feature=ds_info.features['image']))
+```
+
+### Customizing video decoding
+
+Video are `Sequence(Image())`. When applying custom decoders, they will be
+applied to individual frames. This mean decoders for images are automatically
+compatible with video.
+
+```python
+@tfds.decode.make_decoder()
+def decode_example(serialized_image, feature):
+  crop_y, crop_x, crop_height, crop_width = 10, 10, 64, 64
+  return tf.image.decode_and_crop_jpeg(
+      serialized_image,
+      [crop_y, crop_x, crop_height, crop_width],
+      channels=feature.feature.shape[-1],
+  )
+
+ds = tfds.load('ucf101', split='train', decoders={
+    # With video, decoders are applied to individual frames
+    'video': decode_example(),
+})
+```
+
+Which is equivalent to:
+
+```python
+def decode_frame(serialized_image):
+  """Decodes a single frame."""
+  crop_y, crop_x, crop_height, crop_width = 10, 10, 64, 64
+  return tf.image.decode_and_crop_jpeg(
+      serialized_image,
+      [crop_y, crop_x, crop_height, crop_width],
+      channels=ds_info.features['video'].shape[-1],
+  )
+
+
+def decode_video(example):
+  """Decodes all individual frames of the video."""
+  video = example['video']
+  video = tf.map_fn(
+      decode_frame,
+      video,
+      dtype=ds_info.features['video'].dtype,
+      parallel_iterations=10,
+      back_prop=False,
+  )
+  example['video'] = video
+  return example
+
+
+ds, ds_info = tfds.load('ucf101', split='train', with_info=True, decoders={
+    'video': tfds.decode.SkipDecoding(),  # Skip frame decoding
+})
+ds = ds.map(decode_video)  # Decode the video
+```
diff --git a/docs/release_notes.md b/docs/release_notes.md
@@ -16,3 +16,5 @@
 *   It is now possible to add arbitrary metadata to `tfds.core.DatasetInfo`
     which will be stored/restored with the dataset. See `tfds.core.Metadata`.
 *   Better proxy support, possibility to add certificate
+*   Add `decoders` kwargs to override the default feature decoding
+    ([guide](https://github.com/tensorflow/datasets/tree/master/docs/decode.md)).
diff --git a/tensorflow_datasets/core/dataset_builder.py b/tensorflow_datasets/core/dataset_builder.py
@@ -374,7 +374,9 @@ def as_dataset(self,
         Defaults to `True` if `split == tfds.Split.TRAIN` and `False` otherwise.
       decoders: Nested dict of `Decoder` objects which allow to customize the
         decoding. The structure should match the feature structure, but only
-        customized feature keys need to be present.
+        customized feature keys need to be present. See
+        [the guide](https://github.com/tensorflow/datasets/tree/master/docs/decode.md)
+        for more info.
       as_supervised: `bool`, if `True`, the returned `tf.data.Dataset`
         will have a 2-tuple structure `(input, label)` according to
         `builder.info.supervised_keys`. If `False`, the default,
diff --git a/tensorflow_datasets/core/decode/base.py b/tensorflow_datasets/core/decode/base.py
@@ -159,7 +159,7 @@ def no_op_decoder(example, feature):
     \"\"\"Decoder simply decoding feature normally.\"\"\"
     return feature.decode_example(example)
 
-  tfds.load('mnist', split='train', decoder: {
+  tfds.load('mnist', split='train', decoders: {
       'image': no_op_decoder(),
   })
   ```
diff --git a/tensorflow_datasets/core/features/top_level_feature.py b/tensorflow_datasets/core/features/top_level_feature.py
@@ -54,7 +54,9 @@ def decode_example(self, serialized_example, decoders=None):
       serialized_example: Nested `dict` of `tf.Tensor`
       decoders: Nested dict of `Decoder` objects which allow to customize the
         decoding. The structure should match the feature structure, but only
-        customized feature keys need to be present.
+        customized feature keys need to be present. See
+        [the guide](https://github.com/tensorflow/datasets/tree/master/docs/decode.md)
+        for more info.
 
     Returns:
       example: Nested `dict` containing the decoded nested examples.
diff --git a/tensorflow_datasets/core/registered.py b/tensorflow_datasets/core/registered.py
@@ -250,7 +250,9 @@ def load(name,
       features.
     decoders: Nested dict of `Decoder` objects which allow to customize the
       decoding. The structure should match the feature structure, but only
-      customized feature keys need to be present.
+      customized feature keys need to be present. See
+      [the guide](https://github.com/tensorflow/datasets/tree/master/docs/decode.md)
+      for more info.
     with_info: `bool`, if True, tfds.load will return the tuple
       (tf.data.Dataset, tfds.core.DatasetInfo) containing the info associated
       with the builder.