
    Nvidia claims 10x cost savings with open-source inference models

    By admin, February 13, 2026

    Nvidia noted that cost per token went from 20 cents on the older Hopper platform to 10 cents on Blackwell. Moving to Blackwell’s native low-precision NVFP4 format cut the cost further to just 5 cents, so the upgrade delivered a 4x overall improvement in cost per token while maintaining the accuracy that customers expect.
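
The arithmetic behind those figures can be checked with a short sketch. The three costs are the ones quoted above; the names `COSTS_CENTS` and `improvement_factor` are made up for this illustration and do not come from Nvidia.

```python
# Cost figures quoted in the article, in cents.
# The dictionary and function names are illustrative assumptions.
COSTS_CENTS = {
    "Hopper": 20,
    "Blackwell": 10,
    "Blackwell + NVFP4": 5,
}


def improvement_factor(old_cost, new_cost):
    """Return how many times cheaper new_cost is than old_cost."""
    return old_cost / new_cost


# Compare every platform against the Hopper baseline.
baseline = COSTS_CENTS["Hopper"]
factors = {
    name: improvement_factor(baseline, cost)
    for name, cost in COSTS_CENTS.items()
}

for name, factor in factors.items():
    print(f"{name}: {factor:g}x cheaper than Hopper")
```

Each step halves the cost, so the two steps together give the 4x figure.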

    Nvidia outlined four industry deployments in a blog post showing how this combination of Blackwell infrastructure, NVFP4, optimized software stacks and open-source models delivers significant cost reductions. They break down like this:

    • Healthcare: Tedious, time-consuming tasks like medical coding, documentation and managing insurance forms cut into the time doctors can spend with patients. Sully.ai tackles this problem with AI agents that handle these routine tasks.

    The problem is that Sully.ai’s proprietary, closed-source models didn’t scale well. So Sully.ai used Baseten’s open-source Model API on Blackwell GPUs with the NVFP4 data format, the TensorRT-LLM library and the Dynamo inference framework. The result was a 90% drop in inference costs, a 10x reduction compared with the prior closed-source implementation, while response times improved by 65% for critical workflows like generating medical notes.
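
A 90% drop and a 10x reduction describe the same change, as a minimal sketch shows. The helper name `reduction_factor` is hypothetical and used only for illustration.

```python
def reduction_factor(percent_drop):
    """Convert a percentage cost drop into an 'N times cheaper' factor.

    A drop of p percent leaves (100 - p) percent of the original cost,
    so the original cost is 100 / (100 - p) times the new cost.
    """
    if not 0 <= percent_drop < 100:
        raise ValueError("percent_drop must be in the range [0, 100)")
    return 100 / (100 - percent_drop)


print(reduction_factor(90))  # 10.0
```

The conversion is nonlinear: a 50% drop is only a 2x reduction, while a 90% drop is 10x.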
