You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* Rank 0 runs on node aa, bound to logical socket 1, cores 0-2.
1115
1116
* Rank 1 runs on node bb, bound to logical socket 0, cores 0 and 1.
1116
-
* Rank 2 runs on node cc, bound to logical cores 1 and 2.
1117
+
* Rank 2 runs on node cc, bound to logical cores 2 and 3.
1117
1118
1118
-
Rankfiles can alternatively be used to specify physical processor
1119
-
locations. In this case, the syntax is somewhat different. Sockets are
1120
-
no longer recognized, and the slot number given must be the number of
1121
-
the physical PU as most OS's do not assign a unique physical
1122
-
identifier to each core in the node. Thus, a proper physical rankfile
1123
-
looks something like the following:
1119
+
Note that only logicical processor locations are supported. By default, the values specifed are assumed to be cores. If you intend to specify specific hardware threads then you must add the ``:hwtcpus`` qualifier to the ``--map-by`` command line option (e.g., ``--map-by rankfile:file=myrankfile:hwtcpus``).
1124
1120
1125
-
.. code::
1126
-
1127
-
shell$ cat myphysicalrankfile
1128
-
rank 0=aa slot=1
1129
-
rank 1=bb slot=8
1130
-
rank 2=cc slot=6
1131
-
1132
-
This means that
1133
-
1134
-
* Rank 0 will run on node aa, bound to the core that contains physical
1135
-
PU 1
1136
-
* Rank 1 will run on node bb, bound to the core that contains physical
1137
-
PU 8
1138
-
* Rank 2 will run on node cc, bound to the core that contains physical
1139
-
PU 6
1140
-
1141
-
Rankfiles are treated as logical by default, and the MCA parameter
1142
-
``rmaps_rank_file_physical`` must be set to 1 to indicate that the
1143
-
rankfile is to be considered as physical.
1121
+
If the binding specification overlaps between any two ranks then an error occurs. If you intend to allow processes to share the same logical processing unit then you must pass the ``--bind-to :overload-allowed`` command line option to tell the runtime to ignore this check.
1144
1122
1145
1123
The hostnames listed above are "absolute," meaning that actual
1146
1124
resolveable hostnames are specified. However, hostnames can also be
@@ -1157,12 +1135,12 @@ hostnames, indexed from 0. For example:
All socket/core slot locations are specified as logical indexes.
1164
1142
1165
-
.. note:: The Open MPI v1.6 series used physical indexes.
1143
+
.. note:: The Open MPI v1.6 series used physical indexes. Starting in Open MPI v5.0 only logicial indexes are supported and the ``rmaps_rank_file_physical`` MCA parameter is no longer recognized.
1166
1144
1167
1145
You can use tools such as Hwloc's `lstopo(1)` to find the logical
0 commit comments