Microsoft Goes Multi-Modal for AI -- Redmondmag.com

Generation AI

Microsoft Goes Multi-Modal for AI

Some time back, I noted that the relationship between Microsoft and OpenAI was starting to fray, which could lead to Microsoft embracing other large language models for its AI offerings.

Well, you hardly needed to be Nostradamus to see this coming. Microsoft has indeed ended its exclusive offering of ChatGPT and added new services. At the end of March, Microsoft introduced new features in Copilot to allow users to utilize multiple AI models simultaneously within the same workflow.

In a new feature called "Critique," Copilot's Researcher agent will examine outputs from both ChatGPT and Anthropic's Claude models for every response. While ChatGPT generates the response, Claude will review the output for accuracy and quality before presenting it ⁠to the user.

Microsoft plans to make that workflow examination bi-directional in the future, allowing ChatGPT to review Claude's output as well.

Additionally, Microsoft unveiled Copilot Cowork, a tool based on Anthropic's Claude Cowork product. It is currently in testing mode.

"Having various different models from different vendors in Copilot is highly attractive -- but we're taking this to the next level, where customers actually get the benefits of the models working together," Nicole Herskowitz, corporate vice president of Microsoft 365 and Copilot, said in an interview with Reuters.

The multi-model approach will help speed up user workflow, keep in check AI hallucinations - where systems generate false information -- and produce more reliable ‌outputs, ⁠boosting productivity and quality for customers, Herskowitz added.

I understand and appreciate what Microsoft is trying to do, but this is rife with potential trouble. I can’t I tell you how many times I have run queries on multiple chat bots using the exact same prompt wording and gotten back different results.

At that point, you wondering who's right? If ChatGPT says one thing and Google Gemini says another, which is the right answer? And oftentimes I would go to a third chatbot, usually Perplexity, and gotten a third answer.

Plus, the popular chat bots out there all have different areas of expertise where they really excel. So one size does not necessarily fit all or address all subjects. Copilot is considered the best chatbot for best for enterprise productivity, owing to its integration with Microsoft products Like Office 365.

ChatGPT is viewed as best all-around, while Claude from Anthropic is considered best for long-form writing, legal/academic analysis, and nuanced reasoning. Google Gemini is best at real-time data access and Google Workspace integration and Perplexity AI is best for search and research.

ChatBot makers are not deaf to these criticisms of inaccuracy and have added links to sources, the easiest and most obvious solution to the problem period but they don't always provide links and sometimes you have to ask for sources. Even then you don't get anything useful.

But at least Microsoft is diversifying off of just being dependent on ChatGPT, which I often find to be the most inaccurate of the chat bots. I much prefer Perplexity, for both accuracy and providing me with sources.

To address this problem, Microsoft is also launching a feature called Council that will allow users to compare responses from different AI models side-by-side. The upgrades come as Microsoft makes its new Copilot Cowork agentic AI tool more widely available to members in its 'Frontier' program, which provides customers with early ⁠access to some of its latest AI features.

Posted by Andy Patrizio on 04/08/2026

Featured

Anthropic Claude Goes GA in Microsoft Foundry

Anthropic's Claude family of AI models is now generally available in Microsoft Foundry on Azure, giving enterprise developers another frontier model they can deploy, manage and govern through Microsoft's cloud AI platform.
Taking Hyper-V Health Monitoring to the Next Level, Part 2

Let's launch a PowerShell script that goes beyond replication health to evaluate Hyper-V failover readiness by checking storage, networking, memory, VM configuration and other key conditions that could determine whether a virtual machine can successfully fail over.
Microsoft Moves Up Quantum Safe Security Timeline

Microsoft is accelerating its quantum-safe security timeline, saying advances in quantum computing and new federal requirements have pushed post-quantum cryptography from a future planning issue into an immediate engineering priority.
Microsoft: AI Builders Are Growing More Confident in Enterprise Agents

Artificial intelligence agents are moving beyond experimentation and into production, according to Microsoft's inaugural 2026 Agent Confidence Index.
Taking Hyper-V Health Monitoring to the Next Level, Part 1

A new PowerShell-based Hyper-V health tool goes beyond replication status to estimate whether a VM is actually ready for a successful failover.