A powerful AI-driven application that transforms images into engaging stories using Hugging Face models. The interface is built using three methods AWS SageMaker for inference,Direct inference API and local call for the model. The backend models used are image to text, llm and text to speech generation.The application generate simple funny immersive story.
- 📷 Image Processing: Upload an image via an HTML page and let AI analyze it.
- 📝 Story Generation: Uses an LLM to craft compelling narratives based on image content.
- 🎙 Speech Synthesis: Converts the generated text into speech for an immersive storytelling experience.
- ☁️ Cloud Deployment: Scalable inference using AWS SageMaker.
- Hugging Face: For deep learning models and image-to-text conversion.
- AWS SageMaker: For cloud-based model inference.
- Backend LLM App: Manages prompt engineering and story generation.
- Speech Generator: Converts text into lifelike narration.
- Flask/FastAPI: Backend API for processing requests.
- HTML & JavaScript: Frontend for image upload and user interaction.
- Clone the repository:
git clone https://github.com/manjushree08/image-to-storyteller.git cd image-to-storyteller
- Create and activate a virtual environment:
python -m venv venv source venv/bin/activate # On Windows use: venv\Scripts\activate
- Install dependencies:
pip install -r requirements.txt
- Run the backend service:
python app.py
- Open
index.html
in a web browser and upload an image. - The backend will process the image, generate a story, and return the narrated version.
To deploy the model on AWS SageMaker:
- Train and save the model:
deployment.py
Contributions are welcome! To contribute:
- Fork the repo.
- Create a new branch (
git checkout -b feature-branch
). - Commit your changes (
git commit -m 'Add new feature'
). - Push to GitHub (
git push origin feature-branch
). - Submit a pull request.
This project is licensed for Personal Use Only. Redistribution or commercial use is not permitted.
- Author: Manjushree
- GitHub: manjushree08
- LinkedIn: Manju Shree