Exploring Nomic-Embed-Text-v1.5 on Fireworks AI: A Multimodal Embedding Model with Extended Context Length
The release of nomic-embed-text-v1.5 on Fireworks AI marks a significant advancement in the text-embedding landscape, offering a range of features designed to improve the performance and flexibility of text embeddings. This blog post delves into the model's key aspects, its technical capabilities, and practical applications.
Unveiling Key Features
The nomic-embed-text-v1.5 model introduces multimodal capabilities, aligning seamlessly with the nomic-embed-vision-v1 embedding space. This enables the generation of multimodal embeddings, a feature that is particularly advantageous for complex applications requiring both text and visual data processing.
A standout feature of this model is its support for a context length of up to 8192 tokens, which surpasses the capabilities of other models such as OpenAI's text-embedding-ada-002. This extended context length enables strong performance on both short- and long-context tasks, giving users greater flexibility.
Matryoshka Representation Learning
Another innovative feature is the use of Matryoshka Representation Learning. This technique allows developers to adjust the embedding size across various dimensions (768, 512, 256, 128, 64) with minimal impact on performance. This adaptability makes the model a versatile tool for developers dealing with varying computational resources.
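In practice, Matryoshka-style embeddings can be shrunk after the fact by keeping only the leading dimensions and re-normalizing. The sketch below illustrates the idea on a simulated vector (the random vector is a stand-in for real model output, not an actual embedding):

```python
import numpy as np

def truncate_embedding(embedding, dim):
    """Truncate a Matryoshka-style embedding to its `dim` leading
    dimensions and re-normalize to unit length."""
    truncated = np.asarray(embedding, dtype=np.float64)[:dim]
    norm = np.linalg.norm(truncated)
    return truncated / norm if norm > 0 else truncated

# Simulated 768-dimensional embedding (stand-in for real model output).
rng = np.random.default_rng(0)
full = rng.normal(size=768)

# The supported sizes from the post; each truncation stays unit-length.
for dim in (768, 512, 256, 128, 64):
    reduced = truncate_embedding(full, dim)
    print(dim, reduced.shape)
```

Because cosine similarity only depends on direction, re-normalizing after truncation keeps the smaller embeddings directly comparable to one another.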
Practical Usage and Integration
When implementing the nomic-embed-text-v1.5 model, it is essential to include a task instruction prefix in the text prompt. This prefix specifies the task being performed, for example classification: <text here> or search_query: <text here>.
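Prepending the prefix can be wrapped in a small helper. This is a hypothetical convenience function, not part of any official client; the prefix names shown are the task types used elsewhere in this post:

```python
# Hypothetical helper: prepend the task instruction prefix that
# nomic-embed-text-v1.5 expects before each input text.
VALID_PREFIXES = {"search_query", "search_document", "classification", "clustering"}

def with_task_prefix(texts, task):
    """Return the input texts with the given task prefix prepended."""
    if task not in VALID_PREFIXES:
        raise ValueError(f"unknown task prefix: {task}")
    return [f"{task}: {t}" for t in texts]

print(with_task_prefix(["what is a Matryoshka embedding?"], "search_query"))
# → ['search_query: what is a Matryoshka embedding?']
```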
The model is readily accessible through multiple APIs, including the Nomic Embedding API and Fireworks AI API. It is also compatible with libraries like sentence_transformers and langchain_fireworks, which facilitate seamless integration into existing workflows.
For example, using the Nomic Python client, you can generate embeddings with the following code snippet:
from nomic import embed

output = embed.text(
    texts=['Nomic Embedding API', '#keepAIOpen'],
    model='nomic-embed-text-v1.5',
    task_type='search_document',
    dimensionality=256,
)
print(output)
Similarly, integration with Fireworks AI is demonstrated using the FireworksEmbeddings class:
from langchain_fireworks import FireworksEmbeddings

# Instantiate the embedding model served by Fireworks AI.
embedding = FireworksEmbeddings(model="nomic-ai/nomic-embed-text-v1.5")

# Embed a single query string and a batch of documents.
res_query = embedding.embed_query("The test information")
res_document = embedding.embed_documents(["test1", "another test"])

# Inspect the first five dimensions of each embedding.
print(res_query[:5])
print(res_document[1][:5])
Technical Insights
The model was trained with a multi-stage contrastive learning pipeline, starting from a BERT initialization whose context length was extended to 2048 tokens in the style of MosaicBERT. This approach keeps the model's performance on par with comparable BERT models while taking advantage of the longer context window.
Authentication and Limitations
To ensure secure access, API requests use Bearer authentication headers. Users should also be aware that the input text must not exceed 8192 tokens and cannot be an empty string.
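The constraints above can be enforced before a request is sent. The sketch below assembles the Bearer header and payload and validates the input; the endpoint URL is an assumption based on Fireworks AI's OpenAI-compatible API, and the function deliberately stops short of performing the HTTP call:

```python
import json

# Assumed endpoint (Fireworks AI exposes an OpenAI-compatible API).
API_URL = "https://api.fireworks.ai/inference/v1/embeddings"

def build_embedding_request(text, api_key):
    """Validate the input and assemble headers/payload for a
    Bearer-authenticated embedding request (sketch; no HTTP call).
    Note: the 8192-token limit also applies, but checking it requires
    the model's tokenizer, so it is not enforced here."""
    if not text:
        raise ValueError("input text must not be an empty string")
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    payload = {"model": "nomic-ai/nomic-embed-text-v1.5", "input": text}
    return headers, payload

headers, payload = build_embedding_request("search_query: hello", "MY_KEY")
print(headers["Authorization"])  # → Bearer MY_KEY
print(json.dumps(payload))
```

The returned headers and payload could then be passed to any HTTP client (e.g. requests.post(API_URL, headers=headers, json=payload)) once a real API key is available.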
In conclusion, Fireworks AI's Nomic-Embed-Text-v1.5 offers a robust and flexible solution for applications that require advanced text embeddings. Its multimodal capabilities, extended context length, and adjustable dimensionality make it an invaluable asset for developers across various industries.