
LLaMA 3.2 90B Vision Environmental Impact


Large open-source multimodal model rivalling GPT-4V

Architecture: Vision-Language Transformer (decoder-only)
Parameters: 90B
Context: 128,000 tokens
Provider: Meta
Energy per query: 1.5 Wh
CO₂ per query: 0.60 g
Water per query: 3 mL
vs Google search: 5x more

Energy per query

1.5 Wh

5x more than a Google search (0.3 Wh)

CO₂ per query

0.60 g

Global average grid (475 g CO₂/kWh); see the worked calculation below

Water per query

3 mL

~357 queries (at ~2.8 mL each) to fill 1 litre

Processing location

Self-hosted (varies, requires multi-GPU)

Provider

Meta

Category

Multimodal

Grid carbon intensity

475 g CO₂/kWh (27% renewable)
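The carbon figure is essentially per-query energy multiplied by grid carbon intensity. A minimal Python sketch of that arithmetic, using only the figures quoted above:

```python
# Back-of-envelope CO2 arithmetic using the page's own estimates.
ENERGY_WH = 1.5          # estimated energy per query (Wh)
GRID_G_PER_KWH = 475     # global-average grid intensity (g CO2/kWh)
GOOGLE_SEARCH_WH = 0.3   # commonly cited figure for one Google search

co2_g = (ENERGY_WH / 1000) * GRID_G_PER_KWH
print(f"CO2 per query: {co2_g:.2f} g")                                  # ~0.71 g
print(f"Energy vs Google search: {ENERGY_WH / GOOGLE_SEARCH_WH:.0f}x")  # 5x

# Note: 1.5 Wh x 475 g/kWh gives ~0.71 g, slightly above the 0.60 g
# quoted above; the gap suggests the published figure uses a lower
# effective intensity or extra rounding we can't observe.
```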

How does LLaMA 3.2 90B Vision compare?

Ranked #75 of 152 models by energy per query

[Chart: energy per query for Phi-4 Multimodal, LLaMA 3.2 11B Vision, LLaMA 3.2 90B Vision, and Qwen 3.5 Omni, against a Google search baseline (0.3 Wh).]

Detailed Breakdown

Energy Consumption

LLaMA 3.2 90B Vision is the largest open-source multimodal model, consuming approximately 1.5 Wh per query. It approaches GPT-4V-class capabilities for image understanding while being fully open-weight, and its size requires a multi-GPU setup for inference.
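Since Meta has not published official energy data (see the note under "About" below), figures like this are typically derived from hardware specifications: GPU count times average power draw times per-query latency, scaled by a data-centre overhead factor (PUE). Every number in this sketch is an illustrative assumption, not a measured value:

```python
# Hypothetical first-principles energy estimate for a multi-GPU deployment.
NUM_GPUS = 4         # assumed GPUs serving one query (illustrative)
AVG_POWER_W = 350    # assumed average draw per GPU under load, not TDP
LATENCY_S = 3.0      # assumed end-to-end latency for one query
PUE = 1.2            # assumed data-centre power usage effectiveness

energy_wh = NUM_GPUS * AVG_POWER_W * LATENCY_S / 3600 * PUE
print(f"Estimated energy per query: {energy_wh:.2f} Wh")  # ~1.40 Wh
```

Real deployments batch many requests per GPU, so effective per-query energy can be well below a naive single-request estimate like this one.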

Power Source & Carbon

Due to its size, the model requires GPU clusters for inference, whether self-hosted or accessed through major cloud providers and specialised inference platforms. The carbon figure above assumes a global-average grid (475 g CO₂/kWh); a cleaner local grid would lower it.

Water Usage

At roughly 2.8 mL per query (rounded to 3 mL in the summary above), the 90B vision model has a moderate water footprint.
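Water figures like this are usually derived by scaling energy with a water usage effectiveness (WUE) factor in litres per kWh. The ~1.9 L/kWh value below is an assumed, broadly typical data-centre figure, not one published for this model:

```python
# Water footprint sketch: energy x assumed WUE (litres per kWh).
ENERGY_KWH = 1.5 / 1000   # 1.5 Wh per query
WUE_L_PER_KWH = 1.87      # assumed cooling-water intensity (illustrative)

water_ml = ENERGY_KWH * WUE_L_PER_KWH * 1000
print(f"Water per query: {water_ml:.1f} mL")        # ~2.8 mL
print(f"Queries per litre: {1000 / water_ml:.0f}")  # ~357
```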

About LLaMA 3.2 90B Vision

LLaMA 3.2 90B Vision, released by Meta on 25 September 2024, is a large open-source multimodal model rivalling GPT-4V. Each query uses an estimated 1.5 Wh of energy and produces 0.60 g of CO₂. That's 5x the energy of a Google search, reflecting the computational demands of multimodal inference.

These figures are estimates derived from hardware specifications and API benchmarks — Meta has not published official energy data for LLaMA 3.2 90B Vision. Actual consumption may vary significantly depending on batching, quantisation, and infrastructure optimisations that we cannot observe from outside.

LLaMA 3.2 90B Vision in Context

92% potential savings

The efficiency alternative

Phi-4 Multimodal performs the same type of task using just 0.12 Wh per query — 92% less energy than LLaMA 3.2 90B Vision. For a user sending 25 queries per day, switching would save 12.6 kWh per year.
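A quick check of that savings claim under the stated assumptions:

```python
# Verifying the annual-savings figure quoted above.
QUERIES_PER_DAY = 25
LLAMA_WH = 1.5       # LLaMA 3.2 90B Vision, per query
PHI_WH = 0.12        # Phi-4 Multimodal, per query

saved_kwh = QUERIES_PER_DAY * (LLAMA_WH - PHI_WH) * 365 / 1000
print(f"Energy saved per year: {saved_kwh:.1f} kWh")    # ~12.6 kWh
print(f"Relative saving: {1 - PHI_WH / LLAMA_WH:.0%}")  # 92%
```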

Key Insights

Open-source weights — can be self-hosted on infrastructure you control

What does your LLaMA 3.2 90B Vision usage cost the planet?

Use our calculator to estimate your personal environmental footprint based on how often you use LLaMA 3.2 90B Vision.

Calculate My Compute
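As a rough stand-in for the calculator, the per-query figures scale linearly with usage. A minimal sketch (the annual_footprint helper is illustrative, not the site's actual calculator):

```python
# Hypothetical personal-footprint helper using the per-query figures above.
def annual_footprint(queries_per_day: int) -> dict:
    """Scale per-query estimates for LLaMA 3.2 90B Vision to a year."""
    queries = queries_per_day * 365
    return {
        "energy_kwh": queries * 1.5 / 1000,   # 1.5 Wh per query
        "co2_kg": queries * 0.60 / 1000,      # 0.60 g per query
        "water_l": queries * 3 / 1000,        # 3 mL per query
    }

print(annual_footprint(10))
# {'energy_kwh': 5.475, 'co2_kg': 2.19, 'water_l': 10.95}
```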

Frequently Asked Questions

How much energy does LLaMA 3.2 90B Vision use per query?

Each LLaMA 3.2 90B Vision query consumes approximately 1.5 Wh of energy. This is 5x more than a traditional Google search (~0.3 Wh).

What is LLaMA 3.2 90B Vision's carbon footprint?

Based on a global-average grid carbon intensity of 475 g CO₂/kWh (27% renewable energy), each query produces approximately 0.60 g of CO₂. Because the model is typically self-hosted, the actual figure depends on the local grid.

How much water does LLaMA 3.2 90B Vision use?

Each query consumes approximately 3 mL of water, used primarily for cooling the data centres that process the request.

How does LLaMA 3.2 90B Vision compare to a Google search?

A LLaMA 3.2 90B Vision query uses 5x more energy than a Google search: approximately 0.3 Wh for the search versus 1.5 Wh for the model.

Technical Details

Architecture

Vision-Language Transformer (decoder-only)

Parameters

90B

Context window

128,000 tokens

Release date

2024-09-25

Open source

Yes

Training data cutoff

2024-08