 - Option 1: Download and install the [oneAPI DPC++ Compiler](https://github.com/intel/llvm/releases?q=oneAPI+DPC%2B%2B+Compiler) and [oneTBB](https://github.com/oneapi-src/oneTBB/releases) with:
 ```bash
-export OCLV="2024.18.6.0.02_rel"
-export TBBV="2021.13.0"
+export OCLV="2024.18.10.0.08_rel"
+export TBBV="2022.0.0"
 sudo apt update && sudo apt upgrade -y
 sudo apt install -y g++ git make ocl-icd-libopencl1 ocl-icd-opencl-dev
README.md (+21 -2: 21 additions & 2 deletions)
@@ -1,9 +1,9 @@
 # FluidX3D
 
-The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs via [OpenCL](https://github.com/ProjectPhysX/OpenCL-Wrapper "OpenCL-Wrapper"). Free for non-commercial use.
+The fastest and most memory efficient lattice Boltzmann CFD software, running on all GPUs and CPUs via [OpenCL](https://github.com/ProjectPhysX/OpenCL-Wrapper "OpenCL-Wrapper"). Free for non-commercial use.
+- reduced memory footprint on CPUs and iGPU from 72 to 55 Bytes/cell (fused OpenCL host+device buffers for `rho`/`u`/`flags`), allowing 31% higher resolution in the same RAM capacity
+- faster hardware-supported and faster fallback emulation atomic floating-point addition for `PARTICLES` extension
+- hardened `calculate_f_eq()` against bad user input for `D2Q9`
+- fixed velocity voxelization for overlapping geometry with different velocity
+- fixed Remaining Time printout during paused simulation
+- fixed CPU/GPU memory printout for CPU/iGPU simulations
 
 </details>
 
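The 31% figure in the changelog entry above follows from the ratio of the two per-cell footprints: 72/55 ≈ 1.309, so the same amount of RAM holds roughly 31% more cells. The sketch below only reproduces that arithmetic. It reads "resolution" as total cell count, which is an assumption, and the helper names `max_cells` and `capacity_gain_percent` are hypothetical and not part of FluidX3D.

```cpp
#include <cstdint>

// Hypothetical helper (not from FluidX3D): how many lattice cells fit into
// a memory budget at a given per-cell footprint, rounded down.
constexpr std::uint64_t max_cells(std::uint64_t bytes_available, std::uint64_t bytes_per_cell) {
	return bytes_available / bytes_per_cell;
}

// Hypothetical helper: relative gain in cell count (in percent) when the
// per-cell footprint shrinks from old_bytes_per_cell to new_bytes_per_cell.
constexpr double capacity_gain_percent(double old_bytes_per_cell, double new_bytes_per_cell) {
	return 100.0 * (old_bytes_per_cell / new_bytes_per_cell - 1.0);
}

// Compile-time sanity checks with small, easy-to-verify numbers.
static_assert(max_cells(720u, 72u) == 10u, "720 bytes hold 10 cells at 72 Bytes/cell");
static_assert(max_cells(720u, 55u) == 13u, "720 bytes hold 13 cells at 55 Bytes/cell");
```

Calling `capacity_gain_percent(72.0, 55.0)` gives a value just under 31, which matches the rounded figure quoted in the changelog.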
@@ -759,6 +766,8 @@ section Orange Pi 5 Mali-G610 MP4
src/lbm.cpp (+2 -2: 2 additions & 2 deletions)
@@ -1259,8 +1259,8 @@ void LBM_Domain::allocate_transfer(Device& device) { // allocate all memory for
 if(Dy>1u) Amax = max(Amax, (ulong)Nz*(ulong)Nx); // Ay
 if(Dz>1u) Amax = max(Amax, (ulong)Nx*(ulong)Ny); // Az
 
-transfer_buffer_p = Memory<char>(device, Amax, max(transfers*(uint)sizeof(fpxx), 17u)); // only allocate one set of transfer buffers in plus/minus directions, for all x/y/z transfers
+transfer_buffer_p = Memory<char>(device, Amax, max(transfers*(uint)sizeof(fpxx), 17u), true, true, 0, false); // only allocate one set of transfer buffers in plus/minus directions, for all x/y/z transfers
+transfer_buffer_m = Memory<char>(device, Amax, max(transfers*(uint)sizeof(fpxx), 17u), true, true, 0, false); // these transfer buffers must not be zero-copy!
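For readers skimming this hunk, the buffer sizing can be sketched in isolation. The visible lines pick the largest domain face that has a neighbor (`Amax`) and allocate `max(transfers*sizeof(fpxx), 17)` bytes per face cell. The standalone version below is an illustration under assumptions: the `Dx` branch is inferred by symmetry with the `Dy`/`Dz` lines shown in the diff, the total byte count assumes `Memory<char>` allocates element count times dimensions, and the function names are hypothetical. Judging only from the diff comment, the trailing `false` in the new calls appears to disable zero-copy; that is not modeled here.

```cpp
#include <algorithm>
#include <cstdint>

// Hypothetical helper: largest face area (in cells) that must be exchanged
// between neighboring domains. A face only counts if the grid is split
// along the axis normal to it (D > 1).
inline std::uint64_t max_transfer_area(std::uint64_t Nx, std::uint64_t Ny, std::uint64_t Nz,
                                       unsigned Dx, unsigned Dy, unsigned Dz) {
	std::uint64_t Amax = 0u;
	if(Dx>1u) Amax = std::max(Amax, Ny*Nz); // Ax (assumed by symmetry, not shown in the diff)
	if(Dy>1u) Amax = std::max(Amax, Nz*Nx); // Ay
	if(Dz>1u) Amax = std::max(Amax, Nx*Ny); // Az
	return Amax;
}

// Hypothetical helper: bytes in one transfer buffer, assuming the buffer
// holds Amax elements of max(transfers*sizeof_fpxx, 17) chars each.
inline std::uint64_t transfer_buffer_bytes(std::uint64_t Amax, unsigned transfers, unsigned sizeof_fpxx) {
	const unsigned bytes_per_face_cell = std::max(transfers*sizeof_fpxx, 17u); // 17-byte floor as in the diff
	return Amax*(std::uint64_t)bytes_per_face_cell;
}
```

As an example, a 256x128x64 domain split along y and z has `Amax = 256*128 = 32768`. With 5 transfers of 4-byte values that is 20 bytes per face cell, and with 2-byte values the 17-byte floor applies instead.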