Close Menu
AsiaTokenFundAsiaTokenFund
  • Home
  • Crypto News
    • Bitcoin
    • Altcoin
  • Web3
    • Blockchain
  • Trading
  • Regulations
    • Scams
  • Submit Article
  • Contact Us
  • Terms of Use
    • Privacy Policy
    • DMCA
What's Hot

Top 14 Altcoins Set to Explode in MAY 2025 as Bitcoin Dominance Drops

May 10, 2025

Metaplanet Will Outperform MicroStrategy in Bitcoin Returns, Says Adam Back

May 10, 2025

Tech Expert Predicts $1 Million Bitcoin — ‘Only One More 10x Left’

May 10, 2025
Facebook X (Twitter) Instagram
Facebook X (Twitter) YouTube LinkedIn
AsiaTokenFundAsiaTokenFund
ATF Capital
  • Home
  • Crypto News
    • Bitcoin
    • Altcoin
  • Web3
    • Blockchain
  • Trading
  • Regulations
    • Scams
  • Submit Article
  • Contact Us
  • Terms of Use
    • Privacy Policy
    • DMCA
AsiaTokenFundAsiaTokenFund

Together AI Expands DeepSeek-R1 Deployment with Enhanced Serverless APIs and Reasoning Clusters

0
By Aggregated - see source on February 13, 2025 Blockchain
Share
Facebook Twitter LinkedIn Pinterest Email


Felix Pinkston
Feb 13, 2025 11:11

Together AI enhances DeepSeek-R1 deployment with new serverless APIs and reasoning clusters, offering high-speed and scalable solutions for large-scale reasoning model applications.





Together AI has announced significant advancements in the deployment of its DeepSeek-R1 reasoning model, introducing enhanced serverless APIs and dedicated reasoning clusters. This move is aimed at supporting the increasing demand from companies integrating sophisticated reasoning models into their production applications.

Enhanced Serverless APIs

The new Together Serverless API for DeepSeek-R1 is reportedly twice as fast as any other API currently available in the market, enabling low-latency, production-grade inference with seamless scalability. This API is designed to offer companies fast, responsive user experiences and efficient multi-step workflows, crucial for modern applications relying on reasoning models.

Key features of the serverless API include instant scalability without infrastructure management, flexible pay-as-you-go pricing, and enhanced security with hosting in Together AI’s data centers. The OpenAI-compatible APIs further facilitate easy integration into existing applications, offering high rate limits of up to 9000 requests per minute on the scale tier.

Introduction of Together Reasoning Clusters

To complement the serverless solution, Together AI has launched Together Reasoning Clusters, which provide dedicated GPU infrastructure optimized for high-throughput, low-latency inference. These clusters are particularly suited for handling variable, token-heavy reasoning workloads, achieving decoding speeds of up to 110 tokens per second.

The clusters leverage the proprietary Together Inference Engine, which is reported to be 2.5 times faster than open-source engines like SGLang. This efficiency allows for the same throughput with significantly fewer GPUs, reducing infrastructure costs while maintaining high performance.

Scalability and Cost Efficiency

Together AI offers a range of cluster sizes to match different workload demands, with contract-based pricing models ensuring predictable costs. This setup is particularly beneficial for enterprises with high-volume workloads, providing a cost-effective alternative to token-based pricing.

Additionally, the dedicated infrastructure ensures secure, isolated environments within North American data centers, meeting privacy and compliance requirements. With enterprise support and service level agreements guaranteeing 99.9% uptime, Together AI ensures reliable performance for mission-critical applications.

For more information, visit Together AI.

Image source: Shutterstock


Credit: Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email

Related Posts

Coinbase Unleashes 24/7 U.S. BTC & ETH Futures Post Deribit

May 9, 2025

Germany Seizes $38M from eXch in Laundering Crackdown

May 9, 2025

Meta Explores Adding Stablecoins, Potentially to Instagram – Report

May 9, 2025
Leave A Reply Cancel Reply

What's New Here!

Top 14 Altcoins Set to Explode in MAY 2025 as Bitcoin Dominance Drops

May 10, 2025

Metaplanet Will Outperform MicroStrategy in Bitcoin Returns, Says Adam Back

May 10, 2025

Tech Expert Predicts $1 Million Bitcoin — ‘Only One More 10x Left’

May 10, 2025

Solana Price Analysis: SOL Price Approaches Major Resistance Around $181 Amid Altcoin FOMO

May 9, 2025
AsiaTokenFund
Facebook X (Twitter) LinkedIn YouTube
  • Home
  • Crypto News
    • Bitcoin
    • Altcoin
  • Web3
    • Blockchain
  • Trading
  • Regulations
    • Scams
  • Submit Article
  • Contact Us
  • Terms of Use
    • Privacy Policy
    • DMCA
© 2025 asiatokenfund.com - All Rights Reserved!

Type above and press Enter to search. Press Esc to cancel.

Ad Blocker Enabled!
Ad Blocker Enabled!
Our website is made possible by displaying online advertisements to our visitors. Please support us by disabling your Ad Blocker.