🧠 OtosakuStreamingASR-iOS

OtosakuStreamingASR is a lightweight, on-device streaming speech recognition engine for iOS. It processes audio in real time using a Conformer-based architecture with CTC decoding.

🚀 Features

✅ Fully offline
🎙 Real-time streaming speech recognition
🛠 Modular architecture (feature extractor, encoder, decoder)

🎥 Demo

Watch the model running live on iPhone 13:

📆 Installation

Add the Swift Package to your Xcode project:

https://github.com/Otosaku/OtosakuStreamingASR-iOS
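
If you declare dependencies in a `Package.swift` manifest rather than through the Xcode UI, the entry might look like the sketch below. Several details are assumptions, not taken from this README: the `MyApp` target name is hypothetical, the `main` branch and the iOS deployment target are guesses, and the product name `OtosakuStreamingASR` is inferred from the `import` statement in the usage example. Check the package's own manifest before relying on any of them.

```swift
// swift-tools-version:5.9
import PackageDescription

let package = Package(
    name: "MyApp", // hypothetical name for your own package
    platforms: [
        .iOS(.v15) // assumption: match whatever the ASR package itself requires
    ],
    dependencies: [
        // Assumption: the default branch is "main"; pin a release tag if you prefer.
        .package(url: "https://github.com/Otosaku/OtosakuStreamingASR-iOS", branch: "main")
    ],
    targets: [
        .target(
            name: "MyApp",
            dependencies: [
                // Assumption: the library product is named after the imported module.
                .product(name: "OtosakuStreamingASR", package: "OtosakuStreamingASR-iOS")
            ]
        )
    ]
)
```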

🧰 Usage Example

import OtosakuStreamingASR

let asr = OtosakuStreamingASR()

try asr.prepareModel(from: modelURL)

asr.subscribe { text in
    print("🗣 Recognized: \(text)")
}

// Raw audio chunk: [Double] in range [-1.0, 1.0], exactly 2559 samples per chunk (about 160 ms at 16 kHz)
try asr.predictChunk(rawChunk: yourRawAudioChunk)

try asr.stop() // Finalize and decode remaining buffer

asr.reset() // Reset internal model state
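
Audio sources such as microphone callbacks rarely deliver buffers of exactly 2559 samples, so some buffering is usually needed before calling `predictChunk(rawChunk:)`. The following is a minimal sketch of one way to do that. `ChunkAccumulator` is a hypothetical helper, not part of the package, and it makes no assumption about how the library treats leftover samples when `stop()` is called.

```swift
/// Hypothetical helper (not part of OtosakuStreamingASR): collects samples of
/// arbitrary length and emits fixed-size chunks.
struct ChunkAccumulator {
    let chunkSize: Int
    private var pending: [Double] = []

    init(chunkSize: Int = 2559) {
        precondition(chunkSize > 0, "chunkSize must be positive")
        self.chunkSize = chunkSize
    }

    /// Number of samples still waiting for the next full chunk.
    var pendingCount: Int { pending.count }

    /// Appends new samples and returns every complete chunk now available.
    mutating func append(_ samples: [Double]) -> [[Double]] {
        pending.append(contentsOf: samples)
        var chunks: [[Double]] = []
        var start = 0
        while pending.count - start >= chunkSize {
            chunks.append(Array(pending[start..<start + chunkSize]))
            start += chunkSize
        }
        // Drop the samples that were emitted; keep the remainder for later.
        pending.removeFirst(start)
        return chunks
    }
}
```

Each chunk returned by `append(_:)` could then be passed to `asr.predictChunk(rawChunk:)` in order.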

🧠 Model Details

Architecture: Fast Conformer (Cache-Aware Streaming)
Language: 🇷🇺 Russian (fine-tuned from English)
Training: 250 hours of Russian speech (30 epochs)
WER (Word Error Rate):
- Russian (fine-tuned): 11%
- English (before fine-tuning): 6.5% on LibriSpeech test-other
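
For readers unfamiliar with the metric, WER is the word-level edit distance between the reference transcript and the recognized text (substitutions + deletions + insertions), divided by the number of reference words. The sketch below illustrates that standard definition only. It is not the evaluation script behind the figures above, and real evaluations usually normalize case and punctuation first, which this version does not do.

```swift
/// Illustrative WER: word-level Levenshtein distance divided by the
/// number of words in the reference. Words are split on spaces only.
func wordErrorRate(reference: String, hypothesis: String) -> Double {
    let ref = reference.split(separator: " ").map(String.init)
    let hyp = hypothesis.split(separator: " ").map(String.init)
    precondition(!ref.isEmpty, "reference must contain at least one word")

    // previous[j] = edit distance between the processed reference prefix
    // and the first j hypothesis words.
    var previous = Array(0...hyp.count)
    for (i, refWord) in ref.enumerated() {
        var current = [i + 1] + Array(repeating: 0, count: hyp.count)
        for (j, hypWord) in hyp.enumerated() {
            let substitution = previous[j] + (refWord == hypWord ? 0 : 1)
            let deletion = previous[j + 1] + 1
            let insertion = current[j] + 1
            current[j + 1] = min(substitution, deletion, insertion)
        }
        previous = current
    }
    return Double(previous[hyp.count]) / Double(ref.count)
}
```

With a six-word reference and a hypothesis that drops one word, this yields 1/6, or roughly 17%.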

🔗 Download Russian model: Link to model

For other languages or custom domains, contact me:

📧 otosaku.dsp@gmail.com

🧵 OtosakuStreamingASR API

| Method | Description |
| --- | --- |
| `prepareModel(from:)` | Load model from directory |
| `predictChunk(rawChunk:)` | Submit audio frame (`[Double]`) |
| `stop()` | Finalize and decode remaining buffer |
| `reset()` | Reset model state |
| `subscribe { String in }` | Receive transcribed text in real time |

⚠️ Input audio must be sampled at 16 kHz and normalized to [-1.0, 1.0], with exactly 2559 samples per chunk (about 160 ms at 16 kHz).
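
Many capture paths produce 16-bit integer PCM rather than normalized floating point. As a rough sketch of meeting the requirements above, the hypothetical helpers below (neither is part of the package) scale `Int16` samples into the expected range and check a chunk before it is submitted. Resampling to 16 kHz is assumed to happen elsewhere.

```swift
/// Hypothetical helper: converts 16-bit signed PCM samples to Double values.
/// Dividing by 32768 maps Int16.min to exactly -1.0 and Int16.max to just under 1.0.
func normalizePCM16(_ samples: [Int16]) -> [Double] {
    samples.map { Double($0) / 32768.0 }
}

/// Hypothetical helper: true when the chunk has the expected length and every
/// sample lies in [-1.0, 1.0]. NaN values fail the range check.
func isValidChunk(_ chunk: [Double], expectedCount: Int = 2559) -> Bool {
    chunk.count == expectedCount && chunk.allSatisfy { (-1.0...1.0).contains($0) }
}
```

Validating before calling `predictChunk(rawChunk:)` lets your own code report a malformed chunk directly.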

🔒 Privacy First

This package is designed with privacy in mind:

Runs entirely on-device
No cloud calls or external dependencies

📩 Contact

If you have any questions, suggestions, or are interested in adapting the model to another language or domain:

Email: otosaku.dsp@gmail.com

📄 License

MIT License
