NVIDIA: Llama Nemotron Embed VL 1B V2 (free)

1. Get your API key

Create an API key from your OpenRouter dashboard and set it as an environment variable:
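The page does not show the variable name at this step. The Authorization line in the endpoint details uses `$OPENROUTER_API_KEY`, so the sketch below assumes that name. The helper `load_api_key` is hypothetical and not part of any OpenRouter SDK; it only illustrates reading the key once and failing early if it is missing.

```python
import os


def load_api_key(env=None):
    """Return the OpenRouter API key from the environment, or raise if unset.

    `env` defaults to os.environ; passing a plain dict makes the helper easy to test.
    The variable name OPENROUTER_API_KEY is an assumption taken from the
    Authorization header shown in the endpoint details.
    """
    env = os.environ if env is None else env
    key = env.get("OPENROUTER_API_KEY", "").strip()
    if not key:
        raise RuntimeError("Set OPENROUTER_API_KEY before making requests")
    return key
```

Reading the key from the environment keeps it out of source control; calling `load_api_key()` at startup surfaces a missing key before any request is attempted.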

2. Make your first request

Use nvidia/llama-nemotron-embed-vl-1b-v2:free with the OpenRouter API:

OpenRouter supports image input embeddings for models that can generate embeddings from both text and images. Pass multimodal content using the content array format with text and image_url components. Learn more about image embeddings.
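As a rough illustration of the content array format described above, the following sketch builds a request body that pairs a caption with an image. The exact nesting is an assumption modeled on OpenAI-style content parts: each input item is wrapped in a `content` list, and `image_url` is an object with a `url` key. Passing a base64 data URL is likewise assumed. The helper names are hypothetical; check the image embeddings documentation before relying on this shape.

```python
import base64

MODEL = "nvidia/llama-nemotron-embed-vl-1b-v2:free"


def image_data_url(image_bytes, mime_type="image/png"):
    """Encode raw image bytes as a base64 data URL (assumed to be accepted as image_url)."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return f"data:{mime_type};base64,{encoded}"


def multimodal_input(text, image_url):
    """Build one input item with a text part and an image_url part.

    Assumption: the API expects {"content": [...]} items with typed parts.
    """
    return {
        "content": [
            {"type": "text", "text": text},
            {"type": "image_url", "image_url": {"url": image_url}},
        ]
    }


def build_payload(inputs, model=MODEL):
    """Wrap a list of input items in the JSON body for the embeddings endpoint."""
    return {"model": model, "input": list(inputs)}
```

For example, `build_payload([multimodal_input("a red bicycle", "https://example.com/bike.png")])` yields a body with a single input item; the URL here is a placeholder.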

In the examples below, the OpenRouter-specific headers are optional. Setting them allows your app to appear on the OpenRouter leaderboards.
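A minimal text-only request might look like the sketch below, which uses only the Python standard library. The URL, model ID, and header names come from the endpoint details on this page. The function names are hypothetical, and the response shape used in the usage note (`data[0]["embedding"]`) is an assumption based on OpenAI-compatible embeddings APIs.

```python
import json
import urllib.request

API_URL = "https://openrouter.ai/api/v1/embeddings"
MODEL = "nvidia/llama-nemotron-embed-vl-1b-v2:free"


def build_headers(api_key, site_url=None, site_name=None):
    """Return request headers; the two OpenRouter-specific headers are optional."""
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    if site_url:
        headers["HTTP-Referer"] = site_url  # optional: your site URL, for rankings
    if site_name:
        headers["X-Title"] = site_name  # optional: your site name, for rankings
    return headers


def build_request(api_key, texts, site_url=None, site_name=None):
    """Build (but do not send) a POST request that embeds a list of strings."""
    body = json.dumps({"model": MODEL, "input": list(texts)}).encode("utf-8")
    headers = build_headers(api_key, site_url, site_name)
    return urllib.request.Request(API_URL, data=body, headers=headers, method="POST")


def embed(api_key, texts, site_url=None, site_name=None):
    """Send the request and return the parsed JSON response."""
    request = build_request(api_key, texts, site_url, site_name)
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Calling `embed(api_key, ["hello world"])` returns the parsed response; under the assumed response shape, the vector would be at `result["data"][0]["embedding"]`. Leaving out `site_url` and `site_name` omits the optional headers.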

Using third-party SDKs

For information about using third-party SDKs and frameworks with OpenRouter, please see our frameworks documentation.

Endpoint

POST https://openrouter.ai/api/v1/embeddings

Authorization: Bearer $OPENROUTER_API_KEY

Content-Type: application/json

HTTP-Referer: optional, your site URL (used for rankings)

X-Title: optional, your site name (used for rankings)

Model: nvidia/llama-nemotron-embed-vl-1b-v2:free

Parameters

Name	Type	Default	Description
`temperature`	float	`1`	Controls sampling randomness: lower values make the model's responses more predictable, higher values make them more varied.
`max_tokens`	integer	—	This sets the upper limit for the number of tokens the model can generate in response.
`seed`	integer	—	If specified, the inferencing will sample deterministically, such that repeated requests with the same seed and parameters should return the same result.
`top_p`	float	`1`	This setting limits the model's choices to a percentage of likely tokens: only the top tokens whose probabilities add up to P.
