Iris Chrome Extension

A Chrome extension that integrates AI capabilities directly into your browser experience. The primary goal of this project is to demonstrate that advanced AI capabilities can be seamlessly integrated directly into the Chrome browser without requiring external applications or services.

🎭 Use Cases

Iris Chrome Extension can enhance your browsing experience in numerous ways. Here are some examples of what it can do:

Professional Communication

Transform casual emails into polished professional correspondence with a single prompt.

▶️ Click to watch demo video

Social Media Content Creation

Create engaging social media posts tailored to specific platforms like LinkedIn.

▶️ Click to watch demo video

Information Extraction & Analysis

Extract structured data from visual content like graphs and convert it into organized tables.

▶️ Click to watch demo video

These are just a few examples of how Iris can assist with your browsing tasks. The extension is designed to be versatile and can help with a wide range of web-based activities.

⚠️ Known Limitations

Click Handlers: Browser interactions like clicking elements may be inconsistent. As a workaround, you can use keyboard shortcuts for more reliable navigation.
Browser Compatibility: Currently optimized for Chrome and Chromium-based browsers.
Extension Sandbox Limitations:
- Limited access to certain browser APIs due to Chrome's extension sandbox security model
- Cannot access privileged browser pages (chrome:// URLs)
- Cross-origin restrictions limit interactions with iframes from different domains
- Content scripts have restricted access to the webpage's JavaScript environment
- Local file access is restricted for security reasons
- API rate limits may affect performance with rapid or complex interactions

✨ Features

AI-powered browser assistance via sidepanel
Webpage interaction capabilities
Screenshot and visual analysis
Navigation controls for browsing assistance
Contextual understanding of web content

🚀 Quick Start

To get started with the Iris Chrome Extension, you can bring your own API keys for AI model access or use OpenRouter to access a variety of models. This extension has been tested with Google Gemma 27B and Nova ACT, both of which perform exceptionally well with the extension's features. This flexibility allows you to tailor the extension's capabilities to your specific needs.

Additionally, the extension requires access to a coordinate mapping server that generates the layout of elements on the screen. We provide our own servers for this purpose, which are the same servers we use for our computer use agents. You can obtain your API keys and access our servers at https://api.tryiris.dev.

All configuration options, including API key management and model selection, can be accessed through the extension's settings tab. Simply click on the extension icon and navigate to the settings to customize your experience.

🛠️ Installation

Developer Mode

Clone this repository:

git clone https://github.com/pokemonlabs/iris_chrome_extension.git
cd iris_chrome_extension

Install dependencies:
```
bun install
```
Build the extension:
```
bun run build
```
Load the extension in Chrome:
- Open Chrome and go to chrome://extensions/
- Enable "Developer mode" in the top-right corner
- Click "Load unpacked" and select the dist folder in the project directory

🔧 Development

Scripts

Build: bun run build
Watch for changes: bun run watch
Build icons: bun run build:icons
Build manifest: bun run build:manifest

Project Structure

src/ - TypeScript source files
- background.ts - Service worker background script
- content-script.ts - Content script for webpage interaction
- sidepanel.ts - Sidepanel UI logic
- manifest.ts - Extension manifest configuration
public/ - Static assets
dist/ - Build output directory

⌨️ Keyboard Shortcuts

For more reliable navigation when click handlers aren't working:

Tab: Move focus to the next clickable element
Enter: Click the focused element
Escape: Cancel current operation

🔍 Our Focus

This extension serves as a proof-of-concept that advanced AI capabilities can be delivered directly through a browser extension, providing a seamless user experience without requiring external tools. By integrating AI directly into Chrome, we're demonstrating the potential for browser-native AI assistants that can understand and interact with web content in sophisticated ways.

While this browser extension is a significant step forward, our main focus continues to be on developing even more capable computer use agents that can perform complex tasks across entire operating systems. These agents are designed to understand and interact with computer interfaces in a more comprehensive way. Check out our other exciting projects at https://github.com/pokemonlabs.

📝 License

MIT License

🗺️ Roadmap

Here are our planned enhancements for future releases:

UI TARS Integration: Integrate with UI TARS for improved visual understanding and interaction capabilities
Local LLM Support: Add support for running models locally to enhance privacy and reduce latency
Cross-browser Compatibility: Extend support to Firefox, Safari, and other browsers
Enhanced DOM Navigation: Improved element selection and manipulation capabilities
Workflow Automation: Enable creation and execution of multi-step browser workflows
Offline Mode: Core functionality that works without internet connection
Accessibility Features: Improved screen reader compatibility and accessibility options

🤝 Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Fork the repository
Create your feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add some amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
public		public
scripts		scripts
src		src
.gitignore		.gitignore
README.md		README.md
bun.lock		bun.lock
package.json		package.json
postcss.config.js		postcss.config.js
tailwind.config.js		tailwind.config.js
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Iris Chrome Extension

🎭 Use Cases

Professional Communication

Social Media Content Creation

Information Extraction & Analysis

⚠️ Known Limitations

✨ Features

🚀 Quick Start

🛠️ Installation

Developer Mode

🔧 Development

Scripts

Project Structure

⌨️ Keyboard Shortcuts

🔍 Our Focus

📝 License

🗺️ Roadmap

🤝 Contributing

About

Uh oh!

Releases 1

Packages

Uh oh!

Languages

iris-networks/iris_chrome_extension

Folders and files

Latest commit

History

Repository files navigation

Iris Chrome Extension

🎭 Use Cases

Professional Communication

Social Media Content Creation

Information Extraction & Analysis

⚠️ Known Limitations

✨ Features

🚀 Quick Start

🛠️ Installation

Developer Mode

🔧 Development

Scripts

Project Structure

⌨️ Keyboard Shortcuts

🔍 Our Focus

📝 License

🗺️ Roadmap

🤝 Contributing

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Languages

Packages