🐍🤖 Apoco AI Code Challenge

Your mission is to build and evaluate a smart, LLM-powered Q&A Agent with inline citations. Show us your practical skills, theoretical understanding, and communication style!

🛠️ Tech Stack

Python 3.x
Lightweight Open-source LLM (e.g., Llama 3.1:8B, Granite 3.3:8B, or similar)
Any open-source libraries (HuggingFace, PyTorch, inference provider SDKs, etc.)

🎯 Objectives

Important: You must implement the agent yourself; using ready-made agent solutions from frameworks is not allowed.

1. Build a RAG-based Q&A Agent

Retrieve relevant context for each user question from an external source (e.g., Wikipedia).
Generate answers with inline citations referencing the retrieved sources.
Minimize hallucinations and handle irrelevant or malicious inputs robustly.
Clearly indicate when an answer cannot be provided based on available sources.

2. Evaluation & Analysis

Design and implement an evaluation pipeline (automated or manual) to assess your agent’s performance.
- Use relevant metrics (e.g., factual accuracy, citation correctness, robustness to adversarial input).
- Include a small set of test questions and report the results.
Error analysis: Identify the cases in which the agent fails and explain why those failures happen.
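As a rough illustration of what an automated evaluation pass might look like, the sketch below scores answers on three simple checks. Everything in it is an assumption for the example: the test-case fields (`answer`, `num_sources`, `expected_keywords`, `should_refuse`), the `REFUSAL_PREFIX` string, and the choice of metrics. Note that the citation check only verifies that each `[n]` marker points at an existing source; it does not verify that the source actually supports the claim, which would need manual review or a stronger method.

```python
import re

# Hypothetical prefix the agent uses when it declines to answer.
REFUSAL_PREFIX = "I cannot answer"


def cited_indices(answer: str) -> set[int]:
    """Extract the set of numeric citation markers such as [1] or [2]."""
    return {int(n) for n in re.findall(r"\[(\d+)\]", answer)}


def score_case(case: dict) -> dict[str, bool]:
    """Score one test case on refusal behaviour, citation validity, and keywords."""
    answer = case["answer"]
    refused = answer.strip().startswith(REFUSAL_PREFIX)
    cites = cited_indices(answer)
    # A non-refusal answer needs at least one citation, all within range.
    in_range = bool(cites) and all(1 <= c <= case["num_sources"] for c in cites)
    keywords_ok = all(k.lower() in answer.lower() for k in case["expected_keywords"])
    return {
        "refusal_correct": refused == case["should_refuse"],
        "citations_valid": refused or in_range,
        "keywords_found": case["should_refuse"] or keywords_ok,
    }


def evaluate(cases: list[dict]) -> dict[str, float]:
    """Return the fraction of cases passing each check."""
    scores = [score_case(c) for c in cases]
    return {
        metric: sum(s[metric] for s in scores) / len(scores)
        for metric in scores[0]
    }
```

Cases that fail a check are natural starting points for the error analysis, since each failure can be traced back to retrieval, prompting, or generation.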

3. Documentation & Reflection

Code documentation: Comment your project and document the complex parts.
Design decisions: Briefly explain your choices (model, retrieval method, prompt design, etc.).
Ethical & safety considerations: Discuss how you addressed hallucination, bias, and user safety.
Improvement ideas: Suggest concrete ways to further improve your agent.

🌟 Bonus Tasks (Optional)

Dockerize your application.
BeeAI Integration: Wrap your agent in a BeeAI Platform-compatible server so it can be registered and used locally (UI/CLI).
Configurability: Allow switching between models, data sources, etc.
Experiment with fine-tuning.
Your creative idea! 💡

🧑‍⚖️ Assessment Criteria

Technical correctness: Does the agent work as specified? Are citations accurate?
Code quality: Is the code clean, modular, and well-documented?
Evaluation rigor: Are the evaluation and error analysis meaningful?
Communication: Are design decisions and ethical considerations clearly explained?
Creativity & initiative: Are there thoughtful improvements or extra features?

📝 Submission Guidelines

Create a private GitHub repository and invite @Tomas2D.

❓ Need Clarification?

Feel free to ask questions (tomas.dvorak@apoco.com). We value both technical skill and critical thinking.

ApocoHQ/python-ai-code-challenge