NewsBytes
    Hindi Tamil Telugu
    More
    In the news
    Narendra Modi
    Amit Shah
    Box Office Collection
    Bharatiya Janata Party (BJP)
    OTT releases
    Hindi Tamil Telugu
    NewsBytes
    User Placeholder

    Hi,

    Logout

    India
    Business
    World
    Politics
    Sports
    Technology
    Entertainment
    Auto
    Lifestyle
    Inspirational
    Career
    Bengaluru
    Delhi
    Mumbai

    Download Android App

    Follow us on
    • Facebook
    • Twitter
    • Linkedin
    Home / News / Technology News / DeepSeek reveals new AI reasoning technique amid next-gen model anticipation
    Next Article
    DeepSeek reveals new AI reasoning technique amid next-gen model anticipation
    The new technique was developed with Tsinghua University

    DeepSeek reveals new AI reasoning technique amid next-gen model anticipation

    By Akash Pandey
    Apr 06, 2025
    05:50 pm

    What's the story

    Chinese AI start-up, DeepSeek, has unveiled a novel way to improve the reasoning capabilities of large language models (LLMs). The announcement comes ahead of the debut of the company's next-generation model.

    The novel technique, developed with Tsinghua University researchers, combines generative reward modeling (GRM) with self-principled critique tuning.

    According to a paper published on Friday, the dual approach seeks to make LLMs respond to general queries better and faster.

    Performance

    DeepSeek-GRM models outperform existing methods

    The newly developed DeepSeek-GRM models have achieved competitive performance compared to existing methods.

    The researchers said that these models have "achieved competitive performance" with robust public reward models.

    Reward modeling is a technique that guides an LLM toward human preferences, and its successful application in the DeepSeek-GRM models contributes to improved AI reasoning capabilities.

    Open-source

    Plans to make GRM models open source

    DeepSeek has said that it plans to open source the GRM models, but there is no timeline yet.

    The announcement was made in an academic paper released on arXiv, an online scientific paper repository.

    The release of this research comes amid speculation of DeepSeek's next move after the global attention its V3 foundation model and R1 reasoning model received.

    Anticipation

    DeepSeek-R2 release anticipated

    According to Reuters, DeepSeek-R2, the successor of the R1 model, could be released as early as this month.

    The speculation comes as the company looks to capitalize on its growing reputation in the tech industry.

    The launch of DeepSeek-R1 generated a lot of interest with its cost-effective performance matching leading models.

    However, DeepSeek has neither confirmed nor denied these reports of an R2 release.

    Facebook
    Whatsapp
    Twitter
    Linkedin
    Related News
    Latest
    DeepSeek
    Artificial Intelligence and Machine Learning

    Latest

    Man, woman break into Salman's home separately; both arrested Bollywood
    Man uses ChatGPT as lawyer—wins ₹2L refund for canceled flight ChatGPT
    Watch: Massive explosion at SpaceX base during Starship engine test SpaceX
    IPL 2025: Tim Seifert replaces Jacob Bethell at RCB camp Indian Premier League (IPL)

    DeepSeek

    Is DeepSeek's AI sending user data to Chinese servers? China
    Energy stocks see sell-off as DeepSeek's power efficiency breaks assumptions China
    Alibaba claims its new AI model can beat DeepSeek, ChatGPT Artificial Intelligence and Machine Learning
    DeepSeek's AI app tops global downloads, India leads user base Artificial Intelligence and Machine Learning

    Artificial Intelligence and Machine Learning

    Microsoft's new AI agents will fight cybercrimes: Here's how Cybersecurity
    NVIDIA's AI assistant—that makes you a better gamer—now available NVIDIA
    Google releases its most intelligent AI model with thinking built-in Google
    Alibaba just made developing AI agents way cheaper Alibaba Group
    Indian Premier League (IPL) Celebrity Hollywood Bollywood UEFA Champions League Tennis Football Smartphones Cryptocurrency Upcoming Movies Premier League Cricket News Latest automobiles Latest Cars Upcoming Cars Latest Bikes Upcoming Tablets
    About Us Privacy Policy Terms & Conditions Contact Us Ethical Conduct Grievance Redressal News News Archive Topics Archive Download DevBytes Find Cricket Statistics
    Follow us on
    Facebook Twitter Linkedin
    All rights reserved © NewsBytes 2025