Skip to content

Commit df81b15

Browse files
authored
Merge pull request #4774 from rgommers/improve-docs
Improve documention content, formatting, and html theme
2 parents acf0c3c + c1c0dbf commit df81b15

File tree

7 files changed

+382
-126
lines changed

7 files changed

+382
-126
lines changed

docs/about.md

Lines changed: 32 additions & 12 deletions
Original file line numberDiff line numberDiff line change
@@ -2,25 +2,45 @@
22

33
We have a [GitHub discussions](https://github.com/OpenMathLib/OpenBLAS/discussions/) forum to discuss usage and development of OpenBLAS. We also have a [Google group for *users*](https://groups.google.com/forum/#!forum/openblas-users) and a [Google group for *development of*](https://groups.google.com/forum/#!forum/openblas-dev) OpenBLAS.
44

5-
## Donations
6-
7-
You can read OpenBLAS statement of receipts and disbursement and cash balance on [google doc](https://docs.google.com/spreadsheet/ccc?key=0AghkTjXe2lDndE1UZml0dGpaUzJmZGhvenBZd1F2R1E&usp=sharing). A backer list is available [on GitHub](https://github.com/OpenMathLib/OpenBLAS/blob/develop/BACKERS.md).
8-
9-
We welcome the hardware donation, including the latest CPU and boards.
10-
115
## Acknowledgements
126

13-
This work is partially supported by
7+
This work was or is partially supported by the following grants, contracts and institutions:
8+
149
* Research and Development of Compiler System and Toolchain for Domestic CPU, National S&T Major Projects: Core Electronic Devices, High-end General Chips and Fundamental Software (No.2009ZX01036-001-002)
1510
* National High-tech R&D Program of China (Grant No.2012AA010903)
11+
* [PerfXLab](http://www.perfxlab.com/)
12+
* Chan Zuckerberg Initiative's Essential Open Source Software for Science program:
13+
* Cycle 1 grant: [Strengthening NumPy's foundations - growing beyond code](https://figshare.com/articles/journal_contribution/Proposal_NumPy_OpenBLAS_for_Chan_Zuckerberg_Initiative_EOSS_2019_round_1/10302167) (2019-2020)
14+
* Cycle 3 grant: [Improving usability and sustainability for NumPy and OpenBLAS](https://chanzuckerberg.com/eoss/proposals/improving-usability-and-sustainability-for-numpy-and-openblas/) (2020-2021)
15+
* Sovereign Tech Fund funding: [Keeping high performance linear algebra computation accessible and open for all](https://www.sovereigntechfund.de/tech/openblas) (2023-2024)
16+
17+
Over the course of OpenBLAS development, a number of donations were received.
18+
You can read OpenBLAS's statement of receipts and disbursement and cash balance in
19+
[this Google doc](https://docs.google.com/spreadsheet/ccc?key=0AghkTjXe2lDndE1UZml0dGpaUzJmZGhvenBZd1F2R1E&usp=sharing) (covers 2013-2016).
20+
A list of backers is available [in BACKERS.md](https://github.com/OpenMathLib/OpenBLAS/blob/develop/BACKERS.md) in the main repo.
21+
22+
### Donations
23+
24+
We welcome hardware donations, including the latest CPUs and motherboards.
25+
26+
27+
## Open source users of OpenBLAS
28+
29+
Prominent open source users of OpenBLAS include:
30+
31+
* [Julia](https://julialang.org) - a high-level, high-performance dynamic programming language for technical computing
32+
* [NumPy](https://numpy.org) - the fundamental package for scientific computing with Python
33+
* [SciPy](https://scipy.org) - fundamental algorithms for scientific computing in Python
34+
* [R](https://www.r-project.org/) - a free software environment for statistical computing and graphics
35+
* [OpenCV](https://opencv.org/) - the world's biggest computer vision library
1636

17-
## Users of OpenBLAS
37+
OpenBLAS is packaged in most major Linux distros, as well as general and
38+
numerical computing-focused packaging ecosystems like Nix, Homebrew, Spack and
39+
conda-forge.
1840

19-
* <a href='http://julialang.org/'>Julia - a high-level, high-performance dynamic programming language for technical computing</a><br />
20-
* Ceemple v1.0.3 (C++ technical computing environment), including OpenBLAS, Qt, Boost, OpenCV and others. The only solution with immediate-recompilation of C++ code. Available from <a href='http://www.ceemple.com'>Ceemple C++ Technical Computing</a>.
21-
* [netlib-java](https://github.com/fommil/netlib-java) and various upstream libraries, allowing OpenBLAS to be used from languages on the Java Virtual Machine.
41+
OpenBLAS is used directly by libraries written in C, C++ and Fortran (and
42+
probably other languages), and directly by end users in those languages.
2243

23-
<!-- TODO: academia users, industry users, hpc centers deployed openblas, etc. -->
2444

2545
## Publications
2646

docs/build_system.md

Lines changed: 23 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -1,3 +1,8 @@
1+
This page describes the Make-based build, which is the default/authoritative
2+
build method. Note that the OpenBLAS repository also supports building with
3+
CMake (not described here) - that generally works and is tested, however there
4+
may be small differences between the Make and CMake builds.
5+
16
!!! warning
27
This page is made by someone who is not the developer and should not be considered as an official documentation of the build system. For getting the full picture, it is best to read the Makefiles and understand them yourself.
38

@@ -95,10 +100,21 @@ NUM_PARALLEL - define this to the number of OpenMP instances that your code m
95100
```
96101

97102

98-
OpenBLAS uses a fixed set of memory buffers internally, used for communicating and compiling partial results from individual threads.
99-
For efficiency, the management array structure for these buffers is sized at build time - this makes it necessary to know in advance how
100-
many threads need to be supported on the target system(s).
101-
With OpenMP, there is an additional level of complexity as there may be calls originating from a parallel region in the calling program. If OpenBLAS gets called from a single parallel region, it runs single-threaded automatically to avoid overloading the system by fanning out its own set of threads.
102-
In the case that an OpenMP program makes multiple calls from independent regions or instances in parallel, this default serialization is not
103-
sufficient as the additional caller(s) would compete for the original set of buffers already in use by the first call.
104-
So if multiple OpenMP runtimes call into OpenBLAS at the same time, then only one of them will be able to make progress while all the rest of them spin-wait for the one available buffer. Setting NUM_PARALLEL to the upper bound on the number of OpenMP runtimes that you can have in a process ensures that there are a sufficient number of buffer sets available
103+
OpenBLAS uses a fixed set of memory buffers internally, used for communicating
104+
and compiling partial results from individual threads. For efficiency, the
105+
management array structure for these buffers is sized at build time - this
106+
makes it necessary to know in advance how many threads need to be supported on
107+
the target system(s).
108+
109+
With OpenMP, there is an additional level of complexity as there may be calls
110+
originating from a parallel region in the calling program. If OpenBLAS gets
111+
called from a single parallel region, it runs single-threaded automatically to
112+
avoid overloading the system by fanning out its own set of threads. In the case
113+
that an OpenMP program makes multiple calls from independent regions or
114+
instances in parallel, this default serialization is not sufficient as the
115+
additional caller(s) would compete for the original set of buffers already in
116+
use by the first call. So if multiple OpenMP runtimes call into OpenBLAS at the
117+
same time, then only one of them will be able to make progress while all the
118+
rest of them spin-wait for the one available buffer. Setting `NUM_PARALLEL` to
119+
the upper bound on the number of OpenMP runtimes that you can have in a process
120+
ensures that there are a sufficient number of buffer sets available.

0 commit comments

Comments
 (0)