Generation AI

Blog archive

Microsoft Goes Multi-Modal for AI

Some time back, I noted that the relationship between Microsoft and OpenAI was starting to fray, which could lead to Microsoft embracing other large language models for its AI offerings.

Well, you hardly needed to be Nostradamus to see this coming. Microsoft has indeed ended its exclusive offering of ChatGPT and added new services. At the end of March, Microsoft introduced new features in Copilot to allow users to utilize multiple AI ​models simultaneously within the same workflow.

In a new feature called "Critique," Copilot's Researcher agent will examine outputs from both ChatGPT and Anthropic's Claude models for every ​response. While ChatGPT generates the response, Claude ​will review the output for accuracy and quality before presenting it ⁠to the user.

Microsoft plans to make that workflow examination bi-directional in ​the future, allowing ChatGPT to review Claude's output as well.

Additionally, Microsoft unveiled Copilot Cowork, a tool based on Anthropic's Claude Cowork product. It is currently in testing mode.

"Having various different models from ​different vendors in Copilot is highly attractive -- but we're taking this to the next level, where customers actually get the benefits of the models working together," Nicole Herskowitz, corporate vice president of Microsoft ​365 and Copilot, said in an interview with Reuters.

The multi-model approach will help speed ​up user workflow, keep in check AI hallucinations - where systems generate false information -- and produce more reliable ‌outputs, ⁠boosting productivity and quality for customers, Herskowitz added.

I understand and appreciate what Microsoft is trying to do, but this is rife with potential trouble. I can’t I tell you how many times I have run queries on multiple chat bots using the exact same prompt wording and gotten back different results.

At that point, you wondering who's right? If ChatGPT says one thing and Google Gemini says another, which is the right answer? And oftentimes I would go to a third chatbot, usually Perplexity, and gotten a third answer.

Plus, the popular chat bots out there all have different areas of expertise where they really excel. So one size does not necessarily fit all or address all subjects. Copilot is considered the best chatbot for best for enterprise productivity, owing to its integration with Microsoft products Like Office 365.

ChatGPT is viewed as best all-around, while Claude from Anthropic is considered best for long-form writing, legal/academic analysis, and nuanced reasoning. Google Gemini is best at real-time data access and Google Workspace integration and Perplexity AI is best for search and research.

ChatBot makers are not deaf to these criticisms of inaccuracy and have added links to sources, the easiest and most obvious solution to the problem period but they don't always provide links and sometimes you have to ask for sources. Even then you don't get anything useful.

But at least Microsoft is diversifying off of just being dependent on ChatGPT, which I often find to be the most inaccurate of the chat bots. I much prefer Perplexity, for both accuracy and providing me with sources.

To address this problem, Microsoft is also launching a feature called Council that will allow users to compare responses from different AI models side-by-side. The upgrades come as Microsoft makes its new Copilot Cowork agentic AI tool more widely available to members in ​its 'Frontier' program, which provides ​customers with early ⁠access to some of its latest AI features.

Posted by Andy Patrizio on 04/08/2026


Featured

comments powered by Disqus

Subscribe on YouTube