Overview of Llama 3.2

A one-pager on the Llama models so far:

Llama 3.1 models

What’s new in Llama 3.1 and 3.2?

Tokenizer → a new tokenizer with a vocabulary of 128K tokens.
Context window → a larger context window of 128K tokens.
Languages → native support for 8 languages: English, German, French, Italian, Portuguese, Hindi, Spanish, and Thai.
Tool calling → native support for tool calling.
Llama Stack → a set of APIs and a CLI covering the entire Llama lifecycle, together with API providers and distributions.
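
To make the tool-calling item above more concrete, here is a minimal sketch of the application side: the model emits a structured call, and the host program parses it and runs the matching function. The JSON shape ("name" plus "parameters"), the get_weather function, and the TOOLS registry are assumptions made for illustration, not a specification of the Llama output format. The model response is simulated as a string, so no inference API is involved.

```python
import json

# Hypothetical tool; the name and behavior are illustrative only.
def get_weather(city: str) -> str:
    return f"Sunny in {city}"

# Hypothetical registry mapping tool names to Python callables.
TOOLS = {"get_weather": get_weather}

def dispatch_tool_call(model_output: str) -> str:
    """Parse a JSON tool call (assumed format) and run the matching tool."""
    call = json.loads(model_output)
    name = call["name"]
    if name not in TOOLS:
        raise ValueError(f"Unknown tool: {name}")
    return TOOLS[name](**call["parameters"])

# Simulated model output; a real response would come from an inference API.
simulated_output = '{"name": "get_weather", "parameters": {"city": "Paris"}}'
print(dispatch_tool_call(simulated_output))
```

In a real application the tool result would typically be sent back to the model in a follow-up message so it can compose the final answer.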

What’s new in Llama 3.2?

Multimodal input in the 11B and 90B models
- Image (objects, scenes, drawings) and OCR understanding
- Captioning and QA
- Visual reasoning (equations, charts, documents)
Smaller sizes: 1B and 3B text-only models
- New use cases for small language models (SLMs):
  - On-device summarization
  - Writing, translation
  - QA in multiple languages

Vision