Add Bfloat16: alternative 16 bit floating point precision to half #1825

yhmtsai · 2025-04-11T16:54:57Z

This PR adds bfloat16 precision but user can only select one of bfloat16 or half in ginkgo now.
There will be another pr to enable the possibility of having both at the same time, which should mostly contain the convert function/precision chain and some improvement to reduce the copy-paste stuff.

This PR currently use gko::float16 for either half or bfloat16. I think float16 is confusing to people because it is the same term used by half.

summary for different vendor bfloat16,

CUDA:
- they use __nv_bfloat16
- some ptx requires sm_80 to support (not have the guard in this pr). normal operations require at least sm_80 or cuda 12.2 similar to half
- only use b16 in .reg, which is unlike f16 for half precision
HIP:
- they have two bfloat16 format (hip_bfloat16 and __hip_bfloat16), hip_bfloat16 is quite early support but likely use float for arithmetic operation internally. __hip_bfloat16 has more native operation on bfloat16. __hip_bfloat16 supports from 5.6.0 but we need at least 6.2.0 to get enough implementation for the operation overload and conversion. before 5.4.0, it does not contain operator=(float).
- more trouble on finding sqrt. hip tries to use system sqrt only in lambda function in the kernel (works in __global__). I need to add __device__ I guess it limits the searching space because we only provide the sqrt(bfloat16) in device.
SYCL:
- it is in the experimental namespace
- not proper implementation or they have some default implementation for std::numeric_limits on bfloat16 because I do not get any compilation issue. provide device_numeric_limits now
- unary operation - on rvalue will gives float before 2025.0.1 because it only accepted non-const reference before 2025.0.1

MarcelKoch

I have mostly smaller comments for this, rest looks good.

core/base/mtx_io.cpp

core/test/base/bfloat16.cpp

dpcpp/matrix/coo_kernels.dp.cpp

dpcpp/matrix/csr_kernels.dp.cpp

dpcpp/preconditioner/batch_block_jacobi.hpp

hip/base/types.hip.hpp

reference/test/solver/batch_bicgstab_kernels.cpp

…d implicit conversion

…loat16 (like rocm4.5/5.1.4)

sonarqubecloud · 2025-05-07T01:15:09Z

Quality Gate passed

Issues
32 New issues
0 Accepted issues

Measures
0 Security Hotspots
73.2% Coverage on New Code
14.7% Duplication on New Code

See analysis details on SonarQube Cloud

yhmtsai self-assigned this Apr 11, 2025

yhmtsai requested review from a team April 11, 2025 16:55

yhmtsai added the 1:ST:ready-for-review This PR is ready for review label Apr 11, 2025

yhmtsai mentioned this pull request Apr 15, 2025

Allow to enable half and bfloat16 at the same time #1827

Merged

3 tasks

MarcelKoch requested changes Apr 15, 2025

View reviewed changes

yhmtsai requested a review from MarcelKoch April 17, 2025 09:13

MarcelKoch requested changes Apr 17, 2025

View reviewed changes

reference/test/solver/batch_bicgstab_kernels.cpp Outdated Show resolved Hide resolved

yhmtsai force-pushed the add_bfloat16 branch from 4d1bfdf to 7846b40 Compare April 22, 2025 09:21

yhmtsai requested a review from MarcelKoch April 22, 2025 12:58

MarcelKoch approved these changes Apr 22, 2025

View reviewed changes

yhmtsai force-pushed the add_bfloat16 branch from 7846b40 to 9d0f785 Compare April 22, 2025 15:54

yhmtsai added 1:ST:ready-to-merge This PR is ready to merge. and removed 1:ST:ready-for-review This PR is ready for review labels May 2, 2025

yhmtsai added 3 commits May 2, 2025 15:59

add bfloat16 type

f5bc26c

add type mapping

94a4a08

fix bfloat16 epsilon

1287fd3

yhmtsai force-pushed the add_bfloat16 branch from 9d0f785 to f6fa83d Compare May 2, 2025 13:59

yhmtsai added 2 commits May 2, 2025 16:08

type-related component

512c215

add bfloat16 memory operation

608be2f

yhmtsai added 4 commits May 2, 2025 16:08

use float16 as bfloat16 and fix the test

70a62ae

fix the header and add cuda_arch require for bfloat16

f7d57c1

fix header

f4348dc

add GINKGO_ENABLE_BFLOAT16 flag

3e63cbe

yhmtsai force-pushed the add_bfloat16 branch from f6fa83d to a71f07a Compare May 2, 2025 14:08

adapt hip two different bfloat16

b46d890

yhmtsai force-pushed the add_bfloat16 branch from 5bd2204 to 1418329 Compare May 5, 2025 15:30

MarcelKoch added this to the Ginkgo 1.10.0 milestone May 6, 2025

yhmtsai force-pushed the add_bfloat16 branch from 9e7c027 to cf681ad Compare May 6, 2025 09:24

yhmtsai added 10 commits May 6, 2025 12:07

adapt the hip_bfloat16 does not have constexpr contructor from int an…

a244e3e

…d implicit conversion

fix numeric_limits, reduction, unary- issue on bfloat16 from sycl

8888988

change condition based on 16 bit not type

a0a10ee

add bf16_alias.hpp

4d34a77

SKIP the test for bfloat16 when needing quite relaxed condition

9f201f3

add the CI job to check bfloat16

e5f62db

fix atomic arch requirement

eb12082

fix nv_bfloat162 atomicAdd condition

9445d09

fix intel/llvm bit_cast not self contained

984b265

also use implicit_explicit_conversion in operator= because old hip_bf…

29df96b

…loat16 (like rocm4.5/5.1.4)

yhmtsai force-pushed the add_bfloat16 branch from cf681ad to 29df96b Compare May 6, 2025 10:07

yhmtsai merged commit 93df224 into develop May 6, 2025
11 of 13 checks passed

yhmtsai deleted the add_bfloat16 branch May 6, 2025 20:16

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Bfloat16: alternative 16 bit floating point precision to half #1825

Add Bfloat16: alternative 16 bit floating point precision to half #1825

Uh oh!

yhmtsai commented Apr 11, 2025 •

edited

Loading

Uh oh!

MarcelKoch left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sonarqubecloud bot commented May 7, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Add Bfloat16: alternative 16 bit floating point precision to half #1825

Add Bfloat16: alternative 16 bit floating point precision to half #1825

Uh oh!

Conversation

yhmtsai commented Apr 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MarcelKoch left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

sonarqubecloud bot commented May 7, 2025

Quality Gate passed

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

yhmtsai commented Apr 11, 2025 •

edited

Loading