You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: readme.md
+1-8Lines changed: 1 addition & 8 deletions
Original file line number
Diff line number
Diff line change
@@ -241,14 +241,7 @@ The `MoDInfiniTransformer` module takes the following arguments:
241
241
-`sampling_factor`: A numeric value in the interval (1, `segment_len`) that determines the number of tokens to select from each segment during the top-k selection. A larger value of `sampling_factor` results in fewer tokens being selected.
242
242
-`update`: The type of update to use for the memory matrix. Can be "linear" or "delta". (Default is "linear".)
243
243
-`causal`: Whether to use causal attention in SDP calculations (where each position can only attend to previous positions). (Default is False.)
244
-
-`positional_embedder`: The type of positional embeddings to apply. The following embedding methods are supported:
0 commit comments