I compared results for some computations with JAX on both TPU and CPU, with the default matmul precision set to HIGHEST:

```python
import jax
jax.config.update('jax_default_matmul_precision', jax.lax.Precision.HIGHEST)
import einops as op
import jax.numpy as jnp
import jax.random as jrand
batch_size, input_size, len_, hidden_size = 10, 512, 20, 1024
key = jrand.key(42)
key, subkey = jrand.split(key)
x = jrand.normal(subkey, (batch_size, input_size, len_))
key, subkey = jrand.split(key)
w = jrand.normal(subkey, (batch_size, input_size, hidden_size))
out_einsum = op.einsum(x, w, 'b i l, b i h-> b l h')
out_tanh = jnp.tanh(x)
out_sigmoid = jax.nn.sigmoid(x)
out_elemal = x * x
cpu_device = jax.devices('cpu')[0]
with jax.default_device(cpu_device):
    out_einsum_cpu = op.einsum(x, w, 'b i l, b i h-> b l h')
    out_tanh_cpu = jnp.tanh(x)
    out_sigmoid_cpu = jax.nn.sigmoid(x)
    out_elemal_cpu = x * x
print(jnp.allclose(out_einsum, out_einsum_cpu, atol=1e-5))   # False
print(jnp.allclose(out_tanh, out_tanh_cpu, atol=1e-5))       # False
print(jnp.allclose(out_sigmoid, out_sigmoid_cpu, atol=1e-5)) # True
print(jnp.allclose(out_elemal, out_elemal_cpu, atol=1e-5))   # True
```

Given the README description of precision on TPU, I think both TPU and CPU use 32-bit values, and I expected identical results. Why are there differences in the outcomes of einsum and tanh?
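To make the size of the discrepancies concrete, here is a minimal sketch (assuming the snippet above has already run) that prints the largest absolute TPU/CPU difference for the two failing cases, and how many float32 machine epsilons that corresponds to:

```python
# Illustrative follow-up: how large are the TPU/CPU discrepancies?
import numpy as np

for name, a, b in [('einsum', out_einsum, out_einsum_cpu),
                   ('tanh', out_tanh, out_tanh_cpu)]:
    diff = np.abs(np.asarray(a) - np.asarray(b))
    print(name, diff.max(), diff.max() / np.finfo(np.float32).eps)
```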
Hi - thanks for the question!
In general, you'll find that operations on TPU will be less accurate than operations on CPU; the reason for this comes down to the backend-dependent implementations of various ops. In broad strokes, TPU operations tend to trade accuracy for speed, and so things like `tanh` will not be computed to full 32-bit precision. The reason for this is that TPUs are purpose-built for running bfloat16 neural networks, so in most cases it's wasteful to spend cycles computing activation functions to full precision when those extra decimals will be truncated in the next matmul.
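As a rough sanity check of this for the tanh case (a sketch, assuming the question's snippet has already run, using NumPy's float64 tanh as the reference):

```python
# Sketch: compare both backends' float32 tanh against a float64 NumPy reference.
import numpy as np

ref = np.tanh(np.asarray(x, dtype=np.float64))
err_tpu = np.max(np.abs(np.asarray(out_tanh, dtype=np.float64) - ref))
err_cpu = np.max(np.abs(np.asarray(out_tanh_cpu, dtype=np.float64) - ref))
print(err_tpu, err_cpu)  # if the TPU approximation is coarser, err_tpu should be larger
```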
Regarding `jax_default_matmul_precision`, keep in mind that this will only affect matmul-like operations, and even at the highest setting it does not guarantee bit-identical results across backends. We should probably update the docs you refer to in order to make these features more clear. Does that answer your question?
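For completeness, a small sketch (reusing x and w from the question) of the per-call and scoped ways to request float32 matmul precision; these apply only to dot-like operations such as einsum and matmul, not to element-wise functions like tanh:

```python
# Sketch: requesting full float32 matmul precision per call or per scope.
import jax
import jax.numpy as jnp

# Per-call: the precision argument applies to this contraction only.
y1 = jnp.einsum('bil,bih->blh', x, w, precision=jax.lax.Precision.HIGHEST)

# Scoped: 'float32' corresponds to Precision.HIGHEST for matmul-like ops
# traced inside this block.
with jax.default_matmul_precision('float32'):
    y2 = jnp.einsum('bil,bih->blh', x, w)
```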