Follow Slashdot blog updates by subscribing to our blog RSS feed

 



Forgot your password?
typodupeerror
AI Microsoft

Microsoft Announces Phi-4 AI Model Optimized for Accuracy and Complex Reasoning (computerworld.com) 31

An anonymous reader shared this report from Computerworld: Microsoft has announced Phi-4 — a new AI model with 14 billion parameters — designed for complex reasoning tasks, including mathematics. Phi-4 excels in areas such as STEM question-answering and advanced problem-solving, surpassing similar models in performance. Phi-4, part of the Phi small language models (SLMs), is currently available on Azure AI Foundry under the Microsoft Research License Agreement and will launch on Hugging Face [this] week, the company said in a blog post.

The company emphasized that Phi-4's design focuses on improving accuracy through enhanced training and data curation.... "Phi-4 outperforms comparable and even larger models on tasks like mathematical reasoning, thanks to a training process that combines synthetic datasets, curated organic data, and innovative post-training techniques," Microsoft said in its announcement. The model leverages a new training approach that integrates multi-agent prompting workflows and data-driven innovations to enhance its reasoning efficiency. The accompanying report highlights that Phi-4 balances size and performance, challenging the industry norm of prioritizing larger models... Phi-4 achieved a score of 80.4 on the MATH benchmark and has surpassed other systems in problem-solving and reasoning evaluations, according to the technical report accompanying the release. This makes it particularly appealing for domain-specific applications requiring precision, like scientific computation or advanced STEM problem-solving.

Microsoft emphasized its commitment to ethical AI development, integrating advanced safety measures into Phi-4. The model benefits from Azure AI Content Safety features such as prompt shields, protected material detection, and real-time application monitoring. These features, Microsoft explained, help users address risks like adversarial prompts and data security threats during AI deployment. The company also reiterated that Azure AI Foundry, the platform hosting Phi-4, offers tools to measure and mitigate AI risks. Developers using the platform can evaluate and improve their models through built-in metrics and custom safety evaluations, Microsoft added... With Phi-4, Microsoft continues to evolve its AI offerings while promoting responsible use through robust safeguards. Industry watchers will observe how this approach shapes adoption in critical fields where reasoning and security are paramount.

This discussion has been archived. No new comments can be posted.

Microsoft Announces Phi-4 AI Model Optimized for Accuracy and Complex Reasoning

Comments Filter:
  • A lot of suggestive but vacuous buzzwords.
    And again, claims of "reasoning".

    • Worse, I've seen increasing anthropomorphizing about consciousness and self-awareness. This should basically be illegal. These are and always will be machines, not sentient beings. Let's not redefine terms that are vague enough already after thousands of years of use.
      • by Tablizer ( 95088 )

        "Sentience" hasn't been easy to define. That's not an excuse to butcher the term yet more, though. But marketers will be marketers.

        • Digital marketing in 2025: Certainly, [incorrect name reference]. Here’s a paragraph packed with digital marketing and AI buzzwords: Unlock the power of synergistic omnichannel strategies by leveraging AI-driven predictive analytics to optimize your customer journey mapping and achieve unparalleled ROI through hyper-personalized content. With cutting-edge machine learning algorithms, you can harness the potential of big data, natural language processing (NLP), and computer vision to deliver seamless
          • by cstacy ( 534252 )

            synergistic omnichannel strategies...
            seamless customer experience across touchpoints...
            end-to-end martech stack...
            conversion rate optimization...
            ensuring actionable insights...
            growth hacking and drive viral campaigns...
            growth-focused
            AI-enabled
            automation workflows...
            enhance lead scoring...
            demand generation funnel
            dominate SEO, social media engagement...
            disruptive innovation,
            blockchain-integrated marketing,
            and deep learning-powered
            brand storytelling

            That's great but you left out "Paradigm" "Quantum" and "3D-printed". I'll be expecting a revised brochure in a carbon fiber folder on Tuesday.

            • by commodore73 ( 967172 ) on Monday December 16, 2024 @06:40AM (#65016295)
              Stick it up your TPS report gramps.
            • by clovis ( 4684 )

              I miss the good old days when it would have been enough to say it was turbo-charged.

              • I miss my "turbo" button. I'll ask an LLM what that was supposed to do, because it never seemed to do anything for me.
                • I will insult myself by responding to my own comment, but the content was written by something else. If true, this is even worse than I had imagined.

                  On many older IBM PC compatibles—particularly those from the 1980s and early 1990s—the "turbo" button was a hardware feature designed to toggle the computer’s operating speed between two predefined clock rates. Despite the name, pressing the turbo button didn’t typically make the machine faster. Instead, it often reverted the system f
      • by gweihir ( 88907 )

        Indeed. What they do is essentially a "Big Lie" (https://en.wikipedia.org/wiki/Big_lie) to make AI seem to be something it very much is not. Essentially fraud and endangering their users.

    • Comment removed based on user account deletion
  • now math education will suffer too.

    • It was already suffering badly before any application of linear regression or SVD became "AI".

      • by narcc ( 412956 )

        Nothing changed. It's been like that since the term "AI" was coined. You can cry all you want about how you wish the term was restricted to science fiction robots, but it isn't going to do you any good. The time to register your complaint was 1962. A few people did, as it happens, but it didn't make any difference then either. It's long-past time to get over it.

  • LLMs cannot "reason" at all. And they now claim "complex reasoning"? Are they going for a "Big Lie" (https://en.wikipedia.org/wiki/Big_lie) approach?

  • Fage. Phi-4 (it look like an A). I'll see myself out.

  • Can someone explain to me what "mathematical reasoning" is? Is it in any way related to, I don't know, computation? Because I thought that was what computers have been designed and built to do since the beginning.

    • I know you are kind of joking, but I would regard mathematical reasoning as a general ability to analyse mathematical problem statements, descriptions, claims, proofs and so on, identify errors or potential solutions.

      Generating methods or instructions for computation by a traditional program (or compiler) would probably fit into this broad category. But actually following those instructions, to perform the actual computation, would not..

Statistics are no substitute for judgement. -- Henry Clay

Working...