Introducing Together AI's New 41.1B-80B Parameter LLMs

Introducing Together AI's New 41.1B-80B Parameter LLMs

Together AI has expanded its suite of Large Language Models (LLMs) with the introduction of models in the 41.1B to 80B parameter range. These models are designed to cater to a variety of needs, including chat, language, and code processing, all at a competitive price point.

Key Features:

  • Model Sizes and Pricing: Together AI offers models between 41.1B and 80B parameters at a rate of $0.90 per 1 million tokens. This pricing applies equally to input and output tokens.
  • Model Types: The $0.90 per 1 million tokens rate covers Chat, Language, and Code models, providing flexibility for different applications.
  • Specific Models: Notable models within this range include LLAMA 3 and LLAMA 3.1. For example, the 70B model is available in LITE ($0.54 per 1M tokens), TURBO ($0.88 per 1M tokens), and REFERENCE ($0.90 per 1M tokens) versions.
  • Usage and Billing: Together AI employs a serverless endpoint model, ensuring you only pay for what you use. Costs are calculated based on the total number of tokens processed, simplifying budgeting and cost management.

With these new offerings, Together AI continues to provide advanced, cost-effective solutions for businesses and developers looking to leverage powerful LLMs for their projects. Whether you're developing chatbots, natural language processing tools, or code assistants, these models offer robust performance at an accessible price.

Read more

Introducing Perplexity's Sonar Reasoning Pro: Advanced Reasoning and Real-Time Web Integration for Complex AI Tasks

Introducing Perplexity's Sonar Reasoning Pro: Advanced Reasoning and Real-Time Web Integration for Complex AI Tasks

Artificial Intelligence continues to evolve rapidly, and Perplexity's latest offering, Sonar Reasoning Pro, exemplifies this advancement. Designed to tackle complex tasks with enhanced reasoning and real-time web search capabilities, Sonar Reasoning Pro presents substantial improvements for enterprise-level applications, research, and customer service. Key Capabilities of Sonar Reasoning Pro

Introducing nscale/DeepSeek-R1-Distill-Qwen-7B: A Compact Powerhouse for Advanced Reasoning Tasks

Introducing nscale/DeepSeek-R1-Distill-Qwen-7B: A Compact Powerhouse for Advanced Reasoning Tasks

As the AI landscape continues to evolve, developers and enterprises increasingly seek powerful yet computationally efficient language models. The newly released nscale/DeepSeek-R1-Distill-Qwen-7B provides an intriguing solution, combining advanced reasoning capabilities with a compact 7-billion parameter footprint. This distillation from the powerful DeepSeek R1 into the Qwen 2.5-Math-7B base