Introducing Together AI's New Embedding Models: From 151M to 350M Parameters

Introducing Together AI's New Embedding Models: From 151M to 350M Parameters

Together AI continues to push the boundaries of AI technology with their latest offering: the Together AI/together-ai-embedding-151m-to-350m models. These new language models are designed to provide efficient and cost-effective solutions for embedding tasks, making them ideal for a wide range of applications.

One of the standout features of these models is their pricing structure. With an input price of just $0.016 per 1M tokens and an output price of $0, they offer an affordable solution for businesses of all sizes. Additionally, these models support a maximum number of tokens, further enhancing their versatility.

Together Inference Engine 2.0

The introduction of the Together Inference Engine 2.0 marks a significant leap in AI capabilities. This upgraded engine boasts a decoding throughput that is 4x faster than open-source vLLM and outperforms many commercial solutions by 1.3x to 2.5x. The engine includes the Together Turbo and Together Lite endpoints, allowing users to choose between high performance and cost-efficiency. Together Turbo matches the quality of full-precision FP16 models, while Together Lite is optimized for speed and cost-effectiveness.

Together Enterprise Platform

The Together Enterprise Platform empowers businesses to train, fine-tune, and run inference on any model, whether in the cloud or on-premise. This platform offers:

  • 2-3x faster inference and up to 50% lower operational costs.
  • Continuous model optimization techniques such as auto fine-tuning and adaptive speculators.
  • Flexibility in deployment, ensuring data remains secure within the organization's firewall.

Model Support and Optimization

Supporting over 200 models, including leading families like Llama and custom models, the platform employs advanced optimization techniques like speculative decoding and optimized kernels (e.g., FlashAttention-3). These innovations ensure that the models deliver superior performance and efficiency.

Data Privacy and Model Ownership

Data privacy and model ownership are paramount in the Together Enterprise Platform. Organizations retain complete control over their models and proprietary data, meeting the strictest privacy and compliance policies. This focus on security makes the platform a trusted choice for enterprises.

While the specific Together AI/together-ai-embedding-151m-to-350m models are part of a broader strategy to enhance performance, cost-efficiency, and data privacy, they exemplify Together AI's commitment to providing cutting-edge AI solutions.

Read more