Voxii - VR Language Learning App 🌐🕶️

Voxii is an immersive virtual reality language learning application designed for Windows desktop. Step into lifelike scenarios—like ordering coffee in a bustling café or navigating a vibrant marketplace—and engage in realistic, conversational interactions that make learning a new language both effective and enjoyable.

🚀 Features

🎭 Life-like Scenes: Immerse yourself in diverse environments to practice speaking naturally.
🗣️ Real-Life Scenarios: Engage in common situations to enhance your English conversational skills.
🎤 Speech Recognition & Generation: Speak directly into your microphone and receive real-time responses from the Voxii avatar.
✅ Avatar Corrections: Receive polite and constructive feedback on any mistakes during conversations.
📊 Score & Corrections: Access immediate feedback, improvement tips, and track your conversation scores with a single button click.

🛠️ Getting Started

📋 Prerequisites

Before you begin, ensure you have the following:

Operating System: Windows 10 or higher
Hardware: Meta Quest 3 headset and controllers
Software:
- Unity 2022.3.39f1
- .NET Framework
API Key: Obtain an API key from Groq

🏁 Installation Steps

Clone the Repository:

git clone https://github.com/YourUsername/Voxii.git
cd Voxii

Configure the API Key:
- Create a .env file in the root directory of the project.
- Add your Groq API key in the following format:
```
GROQ_API_KEY=<YOUR_GROQ_API_KEY_HERE>
```
Install Dependencies:
- Open the project in Unity.
- Let Unity import all necessary packages and dependencies.
Launch the Application:
- Press the Play button in Unity to start the application.
- Controls:
  - Primary Button (X or A): Hold to start speaking.
  - Secondary Button (Y or B): Press to receive feedback on your conversation.

❓ Troubleshooting

API Key Issues:
- Ensure the .env file is correctly placed in the root directory.
- Verify that the API key is valid and has not expired.
Controller Not Responding:
- Check the connection between your VR controllers and the desktop.
- Ensure the controllers are properly paired and recognized by the system.
Speech Recognition Errors:
- Confirm that your microphone is functioning and properly configured.
- Test your microphone with other applications to ensure it's working correctly.

⚡ Quick Start

Ready to dive in? Follow these simple steps to start your language learning journey with Voxii:

Set Up Your Environment:
- Ensure all prerequisites are met.
- Configure your .env file with the Groq API key.
Launch Voxii:
- Open the project in Unity.
- Click the Play button to start the application.
Begin Learning:
- Hold the Primary Button (X or A) on your controller to initiate conversation.
- Engage with the Voxii avatar in various scenarios.
- Press the Secondary Button (Y or B) to receive feedback and track your progress.
Monitor Your Progress:
- View your conversation scores and error corrections in real-time.
- Use the feedback to improve your language skills continuously.

Demo

Description	Demo
👉 Click the image to watch the demo video
👉 Click the image to view the pitch deck

Documentation

Note: Only public methods are included here.

📂 Folder Structure within `/Assets/Scripts`

AI 🤖: Contains the AI avatar's logic and the Grammar Checker AI logic
LevelManagement 🎮: Contains the logic for managing the various scenes from the main menu to the post-level screen
Utility 🛠️: Contains utility scripts like the SendWavFile for saving audio clips as .wav files

📚 Classes in Each Folder

/Assets/AI

AIVoice.cs (static class) 🎤

Description:
Text to speech for the AI avatar

🛠️ Methods

static async Task Speak(string msg)
- Asynchronously outputs AI speech for the given message using Piper TTS
static async Task SpeakRepeat()
- Outputs the "Can you repeat that" message audio
static async Task SpeakInitialMsg()
- Outputs the initial message for the specific avatar

ChatLoop.cs 🔄

Description:
The user-avatar conversation loop

⚙️ Public Variables

GroqApiClient groqApi
bool isResponding: Tracks if the AI is responding
bool userSpeaking: Tracks if the user is speaking
string chatLogFilePath: File path for chat logs
GameObject loadingSymbol
int msgsSent
TextMeshProUGUI messagesRemainingValue
LevelManagement levelManagement

🛠️ Methods

async Task SendUserMessage(string msg)
- Sends a message to the AI avatar, adds it to logs and chat history
void setIsResponding(bool value)
- Setter for isResponding
void setUserSpeaking(bool value)
- Setter for userSpeaking

GroqApiClient.cs 📡

Description:
Handles requests sent to Groq's llama3

🛠️ Methods

GroqApiClient(string apiKey = "")
- Initializes the Groq API client with an API key or from a .env file
async Task<JObject?> CreateChatCompletionAsync(JObject request)
- Sends a request to the AI, including parameters and chat history

Scorer.cs 📝

Description:
Contains the AI scoring logic for the user's English

⚙️ Public Variables

GroqApiClient groqApi

🛠️ Methods

Scorer(string chatLogFilePath)
- Initializes the scorer with the chat logs of the conversation so far
async Task<string> GetScore()
- Gets the score on the user's English of the conversation
async Task<string> GetResponseOutput(JArray msgs)
- Overload the GetResponseOutput method to accept a JArray parameter
- Useful for testing the scoring system with different chat logs
ScoreResult ParseScoreNumbers(string scoreString)
- Parses the score string to extract the number of errors and accuracy
SentimentResult ParseSentiment(string scoreString)
- Parses the sentiment from the AI response
public List<ErrorExample> ParseErrorExamples(string scoreString)
- Parses the error examples from the AI response
float CalculateResponseTime(string chatLogFilePath)
- Calculates the time taken for the user to respond to the AI
static int CalculatePoints(ScoreResult scoreResult, float responseTime)
- Calculates the points based on the score result
async Task<int> CalculatePointsAsync()
- Step-by-step calculation of points
async Task<(ScoreResult, float, List<ErrorExample>)> GetResultsAndResponseTimeAsync
- Gets ScoreResult object
class ScoreResult
- Holds data needed for score result (NumberOfErrors, Accuracy)
class SentimentResult
- Holds data needed for sentiment result (Sentiment)
class ErrorExample
- Holds data needed for error examples (Category, Incorrect, Corrected, Reasoning)

WhisperTranscriber.cs 🗣️

Description:
Contains the logic for transcribing the user's speech to text

🛠️ Components

TextMeshProUGUI displayText

🛠️ Methods

async Task<string> TranscribeRecording()
- Retrieves the string for the user's speech stored in recording.wav in the application's persistent data path

SentimentAnalyzer.cs 😊

Description:
Analyzes the sentiment of the AI response each time the AI responds
Used to update the animation of the AI avatar

🛠️ Methods

public async Task<bool> IsPositiveOrNeutralSentiment(string message)
- Sends the AI avatar's response to the Groq API to analyze the sentiment
- Returns true if the sentiment is positive or neutral, false otherwise

/Assets/LevelManagement

OnboardingData.cs 📄

Description:
Data container class to hold onboarding data instances

⚙️ Public Variables

string PersonName { get; set; }
string LanguageProficiency { get; set; }
string LanguageToLearn { get; set; }
List<string> PhrasesToWorkOn { get; set; }
string Scene { get; set; }
Dictionary<string, string> SceneToRole { get; set; }

FeedbackScreenController.cs 💬

Description:
Controls the feedback screen

⚙️ Public Variables

GameObject screen1
GameObject screen2
Button nextButton
Button doneButton
TextMeshProUGUI pointValue
TextMeshProUGUI grammarErrorValue
TextMeshProUGUI responseTimeValue
TextMeshProUGUI relevanceValue
TextMeshProUGUI feedbackCategory
TextMeshProUGUI feedbackIncorrect
TextMeshProUGUI feedbackCorrected
TextMeshProUGUI feedbackReasoning

🛠️ Methods

void ShowSecondScreen()
void ChangeToMenuScene()

LevelManagement.cs

↔️

Description:
Switches between the main menu and the post-level screen displays based on the scene

⚙️ Public Variables

GameObject display1
GameObject display2

🛠️ Methods

void goToMainMenu()
void goToPostLevel()
void switchDisplays()

MenuData.cs 🗂️ (static class - WavUtility)

Description:
Handles menu data storage and retrieval

⚙️ Public Variables

static List<bool> OptionsSelected
static string SceneSelection
static float LanguageProficiency
static float AvatarHostility
static string filePath

🛠️ Methods

static void SetFilePath(string appDataPath)
- Needed because Application.persistentDataPath can't be accessed by a static non-MonoBehavior class
static void SaveDataToJson()
static void LoadDataFromJson()
static string getRole()
- Retrieves the role based on the scene
class MenuDataModel
- Stores necessary data (OptionsSelected, SceneSelection, LanguageProficiency, AvatarHostility)

ResultsData.cs 📊

Description:
Class to store result data

⚙️ Public Variables

static int points
static int errors
static int relevanceScore
static int responseTime
static string feedbackCategory
static string feedbackIncorrect
static string feedbackCorrected
static string feedbackReasoning

/Assets/Utility

XRInputActions.inputactions 🎮

Description:
Contains the mappings which map from the keyboard to the controller's controls

🛠️ Methods

void StartLoading()
- Initializes the loading icon logic

LoadingSymbolController.cs ⏳

Description:
Handles the loading symbol logic

🛠️ Components

GameObject loadingSymbol

🛠️ Methods

void ShowLoadingSymbol()
- Displays the loading symbol
void HideLoadingSymbol()
- Hides the loading symbol

MicRecorder.cs 🎙️

Description:
Handles recording through the microphone

🛠️ Components

GameObject loadingSymbol

🛠️ Methods

void StartRecording()
- Starts recording the microphone
void StopRecording()
- Stops recording the microphone
void SaveRecording()
- Saves the recording in the persistent data path as recording.wav
void PlayRecording()
- Plays the recording

PrimaryBtnHold.cs

▶️

Description:
Logic for actions when the primary button is held

🛠️ Components

InputActionAsset inputActionAsset
MicRecorder micRecorder
WhisperTranscriber whisperTranscriber
ChatLoop chatLoop

⚙️ Public Variables

bool isRecording: Tracks whether the application is recording

SecondaryBtnPress.cs 🔁

Description:
Logic for actions when the secondary button is pressed

🛠️ Components

InputActionAsset inputActionAsset
ChatLoop chatLoop
PrimaryBtnHold primaryBtnHold

SaveWavFile.cs 💾 (static class - WavUtility)

Description:
Contains logic to save an audio clip in .wav format

🛠️ Methods

static byte[] FromAudioClip(AudioClip clip)
- Converts an audio clip into byte[]
static void SaveWav(string filePath, AudioClip clip)
- Saves the audio clip as a .wav file to the specified path

BackgroundMusicController.cs 🎵

Description:
Controls the background music

⚙️ Public Variables

Button muteButton
GameObject symbol: Symbol to show the mute button
Sprite unmutedImage
Sprite mutedImage

🛠️ Methods

void ToggleMute()
void UpdateButtonImage()
- Changes the button image based on the mute state

Name		Name	Last commit message	Last commit date
Latest commit History 176 Commits
.vscode		.vscode
Assets		Assets
Packages		Packages
ProjectSettings		ProjectSettings
.DS_Store		.DS_Store
.gitattributes		.gitattributes
.gitignore		.gitignore
.vsconfig		.vsconfig
README.md		README.md

KalebCole/Voxii

Folders and files

Latest commit

History

Repository files navigation

Voxii - VR Language Learning App 🌐🕶️

🚀 Features

🛠️ Getting Started

📋 Prerequisites

🏁 Installation Steps

❓ Troubleshooting

⚡ Quick Start

Demo

Documentation

📑 Table of Contents

📂 Folder Structure within /Assets/Scripts

📚 Classes in Each Folder

/Assets/AI

🛠️ Methods

⚙️ Public Variables

🛠️ Methods

🛠️ Methods

⚙️ Public Variables

🛠️ Methods

🛠️ Components

🛠️ Methods

🛠️ Methods

/Assets/LevelManagement

⚙️ Public Variables

⚙️ Public Variables

🛠️ Methods

⚙️ Public Variables

🛠️ Methods

⚙️ Public Variables

🛠️ Methods

⚙️ Public Variables

/Assets/Utility

🛠️ Methods

🛠️ Components

🛠️ Methods

🛠️ Components

🛠️ Methods

🛠️ Components

⚙️ Public Variables

🛠️ Components

🛠️ Methods

⚙️ Public Variables

🛠️ Methods

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Uh oh!

Languages

📂 Folder Structure within `/Assets/Scripts`

Packages