This repository focuses on the cutting-edge features of Llama 3.2, including multimodal capabilities, advanced tokenization, and tool calling for building next-gen AI applications. It highlights Llama's enhanced image reasoning, multilingual support, and the Llama Stack API for seamless customization and orchestration.

Welcome to the "Introducing Multimodal Llama 3.2" course! 🚀 This course covers the latest advancements in the Llama model family, including multimodality, custom tool calling, and the new Llama Stack.

📘 Course Summary

This course explores the new capabilities of Llama 3.2, focusing on custom tool calling, multimodal prompting, and the Llama Stack for orchestration. Learn how the Llama family of open models, ranging from 1B to 405B parameters, drives AI innovation by letting developers customize and fine-tune the models or build entirely new applications on top of them.

What You’ll Learn:

  1. 🧠 Llama 3.2 Features: Learn about the new models, their training, key features, and how they integrate into the Llama family.
  2. 🖼️ Multimodal Prompting: Explore advanced image reasoning use cases such as understanding car dashboard errors, adding up receipts, grading math homework, and more (see the first sketch after this list).
  3. 🎯 Role-based Prompting: Understand how Llama 3.1 and 3.2 use the system, user, assistant, and ipython roles, and the prompt format that identifies these roles (sketched below).
  4. 🔢 Tokenization: Learn how Llama uses a tiktoken-based tokenizer with an expanded 128K-token vocabulary that improves encoding efficiency and supports seven non-English languages (sketched below).
  5. 🔧 Tool Calling: Learn how to prompt Llama to call both built-in and custom tools, with examples for web search and solving math equations (sketched below).
  6. 🛠️ Llama Stack API: Discover the Llama Stack API, a standardized interface for toolchain components such as fine-tuning and synthetic data generation, which lets you customize Llama models and build agentic applications (sketched below).

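As a taste of the multimodal prompting covered in the course, here is a minimal sketch that sends an image plus a question to a Llama 3.2 Vision model through an OpenAI-compatible chat endpoint. The base URL, API key, model id, and image path are placeholders (assumptions for illustration), not values from the course materials.

```python
import base64
from openai import OpenAI

# Placeholder endpoint and credentials -- substitute your own provider.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="not-needed")

def encode_image(path: str) -> str:
    """Read a local image file and return it as a base64 data URL."""
    with open(path, "rb") as f:
        data = base64.b64encode(f.read()).decode("utf-8")
    return f"data:image/jpeg;base64,{data}"

response = client.chat.completions.create(
    model="Llama-3.2-11B-Vision-Instruct",  # placeholder model id
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "What does this warning light on my car dashboard mean?"},
                {"type": "image_url", "image_url": {"url": encode_image("dashboard.jpg")}},
            ],
        }
    ],
)
print(response.choices[0].message.content)
```
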
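The role-based prompt format is easiest to see in the raw string that a chat template produces: Llama 3.1/3.2 instruct models delimit each turn with special header tokens. The helper below is a sketch; in practice a serving library or tokenizer chat template usually assembles this for you.

```python
# Roles recognized by Llama 3.1/3.2: system, user, assistant, and ipython
# (ipython carries tool or code-execution output back to the model).
def build_prompt(system: str, user: str) -> str:
    """Assemble a raw Llama 3.x chat prompt using the role header tokens."""
    return (
        "<|begin_of_text|>"
        "<|start_header_id|>system<|end_header_id|>\n\n"
        f"{system}<|eot_id|>"
        "<|start_header_id|>user<|end_header_id|>\n\n"
        f"{user}<|eot_id|>"
        "<|start_header_id|>assistant<|end_header_id|>\n\n"
    )

print(build_prompt("You are a helpful assistant.", "Explain tokenization in one sentence."))
```
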
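To see the expanded vocabulary in action, you can inspect the tokenizer through Hugging Face transformers. This assumes you have accepted the Llama 3.2 license and have access to the gated checkpoint named below.

```python
from transformers import AutoTokenizer

# Gated repo: requires a Hugging Face token with access to the Llama 3.2 weights.
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-3.2-1B-Instruct")

texts = ["Hello, world!", "Bonjour le monde !", "नमस्ते दुनिया"]
for text in texts:
    ids = tokenizer.encode(text, add_special_tokens=False)
    print(f"{text!r} -> {len(ids)} tokens: {ids}")

# The ~128K-entry vocabulary keeps token counts low for non-English text as well.
print("vocabulary size:", len(tokenizer))
```
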
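Custom tool calling follows a simple loop: the model is prompted with a function definition, it replies with a JSON object naming the function and its arguments, and the host program parses that JSON, runs the function, and returns the result to the model in an ipython turn. The self-contained sketch below illustrates the parsing-and-dispatch step; the solve_quadratic tool and the example model output are illustrative, not taken from the course.

```python
import json

# A custom tool we want Llama to call (hypothetical example function).
def solve_quadratic(a: float, b: float, c: float) -> list[float]:
    """Return the real roots of a*x^2 + b*x + c = 0."""
    disc = b * b - 4 * a * c
    if disc < 0:
        return []
    return [(-b + disc ** 0.5) / (2 * a), (-b - disc ** 0.5) / (2 * a)]

TOOLS = {"solve_quadratic": solve_quadratic}

# When prompted with a custom tool definition, the model typically answers with
# a JSON object naming the function and its parameters, e.g.:
model_output = '{"name": "solve_quadratic", "parameters": {"a": 1, "b": -3, "c": 2}}'

call = json.loads(model_output)
result = TOOLS[call["name"]](**call["parameters"])
print("tool result:", result)  # -> [2.0, 1.0]
# The result is then sent back to the model in an ipython role message.
```
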
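The Llama Stack is usually accessed through the llama-stack-client package against a running Llama Stack server. The sketch below uses the inference API surface from early client releases; the port, model id, and method and field names are assumptions and may differ in newer versions.

```python
from llama_stack_client import LlamaStackClient

# Assumes a Llama Stack server is already running locally; URL is a placeholder.
client = LlamaStackClient(base_url="http://localhost:8321")

response = client.inference.chat_completion(
    model_id="meta-llama/Llama-3.2-3B-Instruct",  # placeholder model id
    messages=[{"role": "user", "content": "List three uses of the Llama Stack."}],
)
print(response.completion_message.content)
```
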
🔑 Key Points

  • 🖼️ Multimodal Capabilities: Leverage the image classification, vision reasoning, and tool use capabilities of Llama 3.2.
  • 🧩 Advanced Prompting Techniques: Learn the details of prompting, tokenization, and tool calling in Llama 3.2.
  • 🛠️ Llama Stack: Gain knowledge of the Llama Stack, a standardized interface for building advanced AI applications on top of the Llama models.

👨‍🏫 About the Instructor

  • 👨‍💻 Amit Sangani: Senior Director of AI Partner Engineering at Meta. Amit is a key contributor to Llama model development and will guide you through the advanced capabilities of Llama 3.2.

🔗 To enroll in the course or for more information, visit 📚 deeplearning.ai.
