`swift-transformers` is a collection of utilities to help adopt language models in Swift apps.

It tries to follow the Python `transformers` API and abstractions whenever possible, but it also aims to provide an idiomatic Swift interface and does not assume prior familiarity with [`transformers`](https://github.com/huggingface/transformers) or [`tokenizers`](https://github.com/huggingface/tokenizers).
Check out [our announcement post](https://huggingface.co/blog/swift-coreml-llm).
## Modules

- `Tokenizers`: Utilities to convert text to tokens and back, with support for Chat Templates and Tools. Follows the abstractions in [`tokenizers`](https://github.com/huggingface/tokenizers). Usage example:
```swift
import Tokenizers

func testTokenizer() async throws {
    let tokenizer = try await AutoTokenizer.from(pretrained: "deepseek-ai/DeepSeek-R1-Distill-Qwen-7B")
    let messages = [["role": "user", "content": "Describe the Swift programming language."]]
    let encoded = try tokenizer.applyChatTemplate(messages: messages)
    let decoded = tokenizer.decode(tokens: encoded)
}
```
However, you don't usually need to tokenize the input text yourself - the [`Generation` code](https://github.com/huggingface/swift-transformers/blob/17d4bfae3598482fc7ecf1a621aa77ab586d379a/Sources/Generation/Generation.swift#L82) will take care of it.
- `Hub`: Utilities for interacting with the Hugging Face Hub! Download models, tokenizers, and other config files. Usage example:
```swift
import Hub

func testHub() async throws {
    let repo = Hub.Repo(id: "mlx-community/Qwen2.5-0.5B-Instruct-2bit-mlx")
    let filesToDownload = ["config.json", "*.safetensors"]
    let modelDirectory = try await Hub.snapshot(from: repo, matching: filesToDownload)
    print("Files downloaded to: \(modelDirectory.path)")
}
```
- `Generation`: Algorithms for text generation. Handles tokenization internally. Currently supported: greedy search, top-k sampling, and top-p sampling.
- `Models`: Language model abstraction over a Core ML package.
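As a sketch of how the `Models` and `Generation` modules fit together, the snippet below loads a compiled Core ML model and generates text. The model path is hypothetical, and the `loadCompiled`, `defaultGenerationConfig`, and `generate` names reflect the package's API as best understood at the time of writing; check the current sources before relying on them.

```swift
import Foundation
import Models
import Generation

// Hypothetical path: point this at a compiled Core ML language model on disk.
let modelURL = URL(fileURLWithPath: "/path/to/model.mlmodelc")
let model = try LanguageModel.loadCompiled(url: modelURL)

// Start from the model's default generation settings and cap the output length.
var config = model.defaultGenerationConfig
config.maxNewTokens = 32

// Tokenization of the prompt is handled internally by the Generation code.
let text = try await model.generate(config: config, prompt: "The quick brown fox")
print(text)
```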
## Usage via SwiftPM
To use `swift-transformers` with SwiftPM, you can add this to your `Package.swift`:
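For example, a minimal manifest might look like the following. The version requirement and the target name are illustrative assumptions; pin whichever `swift-transformers` release you actually need.

```swift
// Package.swift
// swift-tools-version:5.8
import PackageDescription

let package = Package(
    name: "MyApp",
    dependencies: [
        // Version is an assumption; use the latest swift-transformers release.
        .package(url: "https://github.com/huggingface/swift-transformers", from: "0.1.0")
    ],
    targets: [
        .executableTarget(
            name: "MyApp",
            dependencies: [.product(name: "Transformers", package: "swift-transformers")]
        )
    ]
)
```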
## Projects that use swift-transformers

- [WhisperKit](https://github.com/argmaxinc/WhisperKit): A Swift package for state-of-the-art speech-to-text systems from [Argmax](https://github.com/argmaxinc).
- [MLX Swift Examples](https://github.com/ml-explore/mlx-swift-examples): A Swift package for integrating MLX models in Swift apps.

Using `swift-transformers` in your project? Let us know and we'll add you to the list!
## Supported Models
You can run inference on Core ML models with `swift-transformers`. Note that Core ML is not required to use the `Tokenizers` or `Hub` modules.
This package has been tested with autoregressive language models such as:
- GPT, GPT-Neox, GPT-J.
- Falcon.
- Llama 2.
Encoder-decoder models such as T5 and Flan are currently _not supported_.
## Other Tools
- [`swift-chat`](https://github.com/huggingface/swift-chat), a simple app demonstrating how to use this package.
- [`exporters`](https://github.com/huggingface/exporters), a Core ML conversion package for transformers models, based on Apple's [`coremltools`](https://github.com/apple/coremltools).
- [`transformers-to-coreml`](https://huggingface.co/spaces/coreml-projects/transformers-to-coreml), a no-code Core ML conversion tool built on `exporters`.
## Contributing

Swift Transformers is a community project and we welcome contributions. Please check out [Issues](https://github.com/huggingface/swift-transformers/issues) tagged with `good first issue` if you are looking for a place to start!
```swift
/// - Parameters:
///   - request: The URLRequest for the file to download
///   - resumeSize: The number of bytes already downloaded. If set to 0 (default), the whole file is downloaded. If set to a positive number, the download will resume at the given position.
///   - numRetries: The number of retry attempts remaining for failed downloads
///   - expectedSize: The expected size of the file to download. If set, the download will raise an error if the size of the received content differs from the expected one.
/// - Throws: `DownloadError.unexpectedError` if the response is invalid or a file size mismatch occurs;
///   `URLError` if the download fails after all retries are exhausted
private func httpGet(
    request: URLRequest,
    tempFile: FileHandle,
    resumeSize: Int,
    numRetries: Int,
    expectedSize: Int?
) async throws {
    guard let session = self.urlSession else {
        throw DownloadError.unexpectedError
    }

    // Create a new request with Range header for resuming
```
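The resume step referenced by the trailing comment can be sketched as follows. This is a minimal illustration of the HTTP `Range` header, not the package's exact code; the URL and offset are placeholders.

```swift
import Foundation

// Sketch: resume a download by requesting only the remaining bytes.
// The offset would normally come from the size of the partially written temp file.
let resumeSize = 1024
var resumeRequest = URLRequest(url: URL(string: "https://example.com/model.safetensors")!)
resumeRequest.setValue("bytes=\(resumeSize)-", forHTTPHeaderField: "Range")

// A server that supports range requests replies with HTTP 206 (Partial Content)
// and sends only the bytes from the given offset onward.
```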