Add a binary mask to jax.Array #18517

vcharraut · 2023-11-14T12:03:10Z

vcharraut
Nov 14, 2023

Hello!

Description

I'm unsure if this question has been already answered or not but I lack of answers regarding my issue.

Basically, I try to apply a mask to an array to filter values. I know it is already possible to do jnp.where operation to apply a filter of zeros values for example, but in my case, I need to add the result to a buffer, so it is annoying to add fakes values to this buffer.

In fact I want to add transitions into a replay buffer in reinforcement learning, but some of my transitions are invalid and the invalid ones are defined by a mask vector of boolean.
And I don't really know how to properly deal with dynamic/static shape in this matter

Code

import jax
import jax.numpy as jnp
from functools import partial
import flax

num_envs = 3

@flax.struct.dataclass
class ReplayBufferState:
    """Contains data related to a replay buffer."""

    data: jnp.ndarray
    insert_position: int
    sample_position: int

def init_replay_buffer_state(data_shape):
    data = jnp.zeros(data_shape, dtype=jnp.float32)
    return ReplayBufferState(data, 0, 0)

@partial(jax.jit)
def insert_in_replay_state(buffer_state: ReplayBufferState, samples: jax.Array, mask: jax.Array) -> ReplayBufferState:
    mask_size = jnp.sum(mask)
    indices_mask = jnp.nonzero(mask, size=mask_size)[0]

    # Apply the mask to the samples to keep the valid ones
    new_samples = jnp.take(samples, indices_mask, axis=0)
    samples_size = len(new_samples)

    # Current buffer state
    data = buffer_state.data
    insert_idx = buffer_state.insert_position
    size_buffer = buffer_state.sample_position

    # Insert the new samples in the buffer
    data = jax.lax.dynamic_update_slice_in_dim(data, new_samples, insert_idx, axis=0)
    insert_idx = (insert_idx + samples_size) % size_buffer
    sample_idx = jnp.minimum(buffer_state.sample_position + samples_size, size_buffer)

    return buffer_state.replace(
        data=data,
        insert_position=insert_idx,
        sample_position=sample_idx,
    )

# Create a buffer state
buffer_state = init_replay_buffer_state((1000, 10))

# Dummy samples
samples = jnp.ones((num_envs, 10))

# Mask to insert only the first two samples
mask = jnp.array([True, True, False])

buffer_state = insert_in_replay_state(buffer_state, samples, mask)

Output

@partial(jax.jit)
def insert_in_replay_state(buffer_state: ReplayBufferState, samples: jax.Array, mask: jax.Array) -> ReplayBufferState:
         mask_size = jnp.sum(mask)
--->     indices_mask = jnp.nonzero(mask, size=mask_size)[0]
         # Apply the mask to the samples to keep the valid ones
         new_samples = jnp.take(samples, indices_mask, axis=0)

TracerIntegerConversionError: The __index__() method was called on traced array with shape int32[].

I know one solution could be to specify the mask_size value as static in the function args

@partial(jax.jit, static_argnames="mask_size")
def insert_in_replay_state(buffer_state: ReplayBufferState, samples: jax.Array, mask: jax.Array, mask_size: int) -> ReplayBufferState:
    indices_mask = jnp.nonzero(mask, size=mask_size)[0]

But in my code I already call the insert function in a jitted function, so I can't have access to any python value

Answered by jakevdp

Nov 14, 2023

The problem is that your approach attempts to construct dynamically-sized arrays: mask_size depends on the contents of the traced array mask, and so it is a dynamic value. Because of this, it cannot be used in the static size argument of jnp.nonzero(mask, size=mask_size). For more on this, see JAX Sharp Bits: Dynamic Shapes.

So what you need to do here is express the update you have in mind without constructing any dynamically-shaped arrays. Here's an example of the kind of approach you might use:

@partial(jax.jit)
def insert_in_replay_state(buffer_state: ReplayBufferState, samples: jax.Array, mask: jax.Array) -> ReplayBufferState:
    # Padded indices of the mask elements
    samples_size =

View full answer

jakevdp · 2023-11-14T13:43:59Z

jakevdp
Nov 14, 2023
Maintainer

Hi - thanks for the question! The code you pasted calls mask_to_indices with missing arguments, and refers to an undefined function update_by_slice_in_dim. I'm having trouble inferring its intent: can you take a look and edit the question so that the code reproduces the problem you're asking about?

1 reply

vcharraut Nov 14, 2023
Author

Hey! Sure, I added more context to the code to make it more understandable, thanks for the feedback

The only suboptimal code that I suppose would work is to replace the masked samples with samples from the replay buffer, but it would mean duplications samples

jakevdp · 2023-11-14T17:00:58Z

jakevdp
Nov 14, 2023
Maintainer

The problem is that your approach attempts to construct dynamically-sized arrays: mask_size depends on the contents of the traced array mask, and so it is a dynamic value. Because of this, it cannot be used in the static size argument of jnp.nonzero(mask, size=mask_size). For more on this, see JAX Sharp Bits: Dynamic Shapes.

So what you need to do here is express the update you have in mind without constructing any dynamically-shaped arrays. Here's an example of the kind of approach you might use:

@partial(jax.jit)
def insert_in_replay_state(buffer_state: ReplayBufferState, samples: jax.Array, mask: jax.Array) -> ReplayBufferState:
    # Padded indices of the mask elements
    samples_size = jnp.sum(mask)
    mask_indices = jnp.where(mask, size=len(mask), fill_value=len(mask))

    # Current buffer state
    data = buffer_state.data
    insert_idx = buffer_state.insert_position
    size_buffer = buffer_state.sample_position

    # Create a copy of the buffer with samples inserted at insert_idx
    data_indices = insert_idx + jnp.arange(len(mask))
    update_mask = jnp.arange(len(mask))[:, None] < samples_size
    data = data.at[data_indices].set(jnp.where(update_mask, samples[mask_indices], data[data_indices]))
    insert_idx = (insert_idx + samples_size) % size_buffer
    sample_idx = jnp.minimum(buffer_state.sample_position + samples_size, size_buffer)

    return buffer_state.replace(
        data=data,
        insert_position=insert_idx,
        sample_position=sample_idx,
    )

1 reply

vcharraut Nov 15, 2023
Author

Thanks a lot for your time, it exactly answers my problem, I will continue to check the documentation to understand better. Have a good day!

hr0nix · 2023-11-15T17:58:20Z

hr0nix
Nov 15, 2023

I've played a little with the idea of replay buffers in JAX here, you might find it interesting

0 replies

Add a binary mask to jax.Array #18517

Uh oh!

Uh oh!

vcharraut Nov 14, 2023

Description

Code

Output

Replies: 3 comments · 2 replies

Uh oh!

Uh oh!

jakevdp Nov 14, 2023 Maintainer

Uh oh!

Uh oh!

vcharraut Nov 14, 2023 Author

Uh oh!

Uh oh!

jakevdp Nov 14, 2023 Maintainer

Uh oh!

vcharraut Nov 15, 2023 Author

Uh oh!

hr0nix Nov 15, 2023

vcharraut
Nov 14, 2023

Replies: 3 comments 2 replies

jakevdp
Nov 14, 2023
Maintainer

vcharraut Nov 14, 2023
Author

jakevdp
Nov 14, 2023
Maintainer

vcharraut Nov 15, 2023
Author

hr0nix
Nov 15, 2023