@@ -15,7 +15,7 @@ Build the `llm-tool` scheme in Xcode.
To run this in Xcode, press cmd-opt-r to set the scheme arguments. For example:

```
- --model mlx-community/Mistral-7B-v0.1-hf-4bit-mlx
+ --model mlx-community/Mistral-7B-Instruct-v0.3-4bit
--prompt "swift programming language"
--max-tokens 50
```
@@ -27,7 +27,7 @@ the Hugging Face HubApi stores the downloaded files.

The model should be a path in the Hugging Face repository, e.g.:

- - `mlx-community/Mistral-7B-v0.1-hf-4bit-mlx`
+ - `mlx-community/Mistral-7B-Instruct-v0.3-4bit`
- `mlx-community/phi-2-hf-4bit-mlx`

See [LLM](../../Libraries/MLXLLM/README.md) for more info.
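
For example, either listed id can be dropped into the scheme arguments from the previous section; a quick sketch reusing the earlier prompt with the phi-2 model:

```
--model mlx-community/phi-2-hf-4bit-mlx
--prompt "swift programming language"
--max-tokens 50
```
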
@@ -40,12 +40,15 @@ Use the `mlx-run` script to run the command line tools:
./mlx-run llm-tool --prompt "swift programming language"
```

+ Note: `mlx-run` is a shell script that uses Xcode command line tools to
+ locate the built binaries. It is equivalent to running from Xcode itself.
+
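For the curious, here is a minimal sketch of the kind of lookup such a script can perform; this is an illustration under assumptions (scheme named `llm-tool`, space-free paths), not the actual contents of `mlx-run`:

```
# Illustrative sketch only -- not the real mlx-run script.
# Ask xcodebuild for the llm-tool scheme's build settings, pull out the
# directory where built products are placed, then run the tool from there.
PRODUCTS=$(xcodebuild -scheme llm-tool -configuration Release -showBuildSettings 2>/dev/null |
  awk '$1 == "BUILT_PRODUCTS_DIR" { print $3; exit }')
"$PRODUCTS/llm-tool" --prompt "swift programming language"
```
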
By default this will find and run the tools built in _Release_ configuration. Specify `--debug`
to find and run the tool built in _Debug_ configuration.
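For example (assuming, as the wording above suggests, that `--debug` is an option of `mlx-run` itself and is given before the tool name):

```
./mlx-run --debug llm-tool --prompt "swift programming language"
```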

See also:

- - [MLX troubleshooting](https://ml-explore.github.io/mlx-swift/MLX/documentation/mlx/troubleshooting)
+ - [MLX troubleshooting](https://swiftpackageindex.com/ml-explore/mlx-swift/main/documentation/mlx/troubleshooting)

### Troubleshooting

@@ -126,7 +129,7 @@ Here is an example run using adapters on the last 4 layers of the model:
giving output like this:

```
- Model: mlx-community/Mistral-7B-v0.1-hf-4bit-mlx
+ Model: mlx-community/Mistral-7B-Instruct-v0.3-4bit
Total parameters: 1,242M
Trainable parameters: 0.426M
Iteration 1: validation loss 2.443872, validation time 3.330629s

@@ -163,7 +166,7 @@ You can test the LoRA adapted model against the `test` dataset using this command:

```
./mlx-run llm-tool lora test \
- --model mlx-community/Mistral-7B-v0.1-hf-4bit-mlx \
+ --model mlx-community/Mistral-7B-Instruct-v0.3-4bit \
--data Data/lora \
--adapter /tmp/lora-layers-4.safetensors \
--batch-size 1 --lora-layers 4 \

@@ -173,7 +176,7 @@ You can test the LoRA adapted model against the `test` dataset using this command:
This will run all the items (100 in the example data we are using) in the test set and compute the loss:

```
- Model: mlx-community/Mistral-7B-v0.1-hf-4bit-mlx
+ Model: mlx-community/Mistral-7B-Instruct-v0.3-4bit
Total parameters: 1,242M
Trainable parameters: 0.426M
Test loss 1.327623, ppl 3.772065
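
As a sanity check on these numbers: the reported ppl appears to be the exponentiated test loss, and indeed e^1.327623 ≈ 3.772065.
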
@@ -192,7 +195,7 @@ Given that format you might issue a command like this:

```
./mlx-run llm-tool lora eval \
- --model mlx-community/Mistral-7B-v0.1-hf-4bit-mlx \
+ --model mlx-community/Mistral-7B-Instruct-v0.3-4bit \
--adapter /tmp/lora-layers-4.safetensors \
--lora-layers 4 \
--prompt "table: 1-10015132-16

@@ -206,7 +209,7 @@
You might be treated to a response like this:

```
- Model: mlx-community/Mistral-7B-v0.1-hf-4bit-mlx
+ Model: mlx-community/Mistral-7B-Instruct-v0.3-4bit
Total parameters: 1,242M
Trainable parameters: 0.426M
Starting generation ...
@@ -223,7 +226,7 @@ have the adapter weights merged in:

```
./mlx-run llm-tool lora fuse \
- --model mlx-community/Mistral-7B-v0.1-hf-4bit-mlx \
+ --model mlx-community/Mistral-7B-Instruct-v0.3-4bit \
--adapter /tmp/lora-layers-4.safetensors \
--output mlx-community/mistral-lora
```
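
A fused model should then be usable like any other. A hypothetical follow-up, assuming `--output` writes a local directory that `--model` accepts:

```
./mlx-run llm-tool --model mlx-community/mistral-lora --prompt "swift programming language"
```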