⇋ Sweep 🌐 Perspectives ⚡ Quick Answer ◯ Split ✦ Synthesis 📊 Excel 📄 Documents 💰 Pricing 👤 Profile ❓ Help
Consensus
94%
Perspectives

Is GPT-4o still the best at reasoning? Perspectives from 8 AIs on the same test.

One logic puzzle. Eight Perspectives. One surprising outlier. The consensus was 94% β€” but the gap at the bottom was revealing.

8 May 2026 5 min
Decision framework
Decision guide

The 3-question framework for choosing the right AI before you Broadcast

Speed, depth, or accuracy β€” you rarely need all three. Here's how to decide which AI to lead with, informed by real Perspectives.

7 May 2026 6 min
Forge tips
Forge tips

Broadcast vs Perspectives: which one to use and when

Broadcast fires instantly and surfaces the strongest answer. Perspectives lets you see all the reasoning side by side. Here's the decision tree.

6 May 2026 3 min
Consensus
61%
Perspectives

61% consensus: what low-agreement Perspectives actually mean

Not every question has a clear answer. When Forge returns a low consensus score, here's what it means β€” and what to do next.

5 May 2026 4 min
Excel + Forge
Forge tips

Drop your spreadsheet into Forge. Ask a question. Get 8 Perspectives on your data.

Forge Excel reads your .xlsx, flags errors, and lets 8 AIs analyse it simultaneously. A walkthrough with real data and real Perspectives.

4 May 2026 5 min
Launch story
Product update

Why we launched Forge on TikTok first β€” and what the Perspectives showed us

Our first 48 hours live. What we got right, what we didn't, and the one metric that told us we had something real.

3 May 2026 7 min
Don't just read about it

Ask the question you just read about.
Get Perspectives from 8 AIs.

Try Forge free
All posts
Perspectives

We asked 8 AIs whether remote work is better.
Here's what 91% consensus looks like.

Forge Team 9 May 2026 4 min read
Live Perspectives β€” question asked in Forge
Claude 8.7s
Remote work improves focus for deep work but reduces spontaneous collaboration. Productivity gains are real but unevenly distributed across role types. Knowledge workers benefit most; managers least.
ChatGPT 4.9s
Benefits depend heavily on role type. Knowledge workers gain autonomy and focus time while coordination costs rise. The net effect depends on how well the organisation has adapted.
Gemini 5.6s
Studies show 13% productivity gains on average, though social costs are significant. Innovation and culture maintenance remain the primary unsolved challenges of distributed work.
Consensus score
All 8 AIs agree on the core finding β€” productivity gains exist, benefits vary by role
91%

The question was simple: "Is remote work better than working in an office?" We asked it in Forge, fired it at 8 AIs simultaneously, and watched the Perspectives come back.

What we didn't expect was how fast the consensus formed β€” or how revealing the outlier at the bottom would be.

What the 91% consensus means

A 91% consensus score in Forge means the models largely agree on the core answer. They agree that remote work produces measurable productivity gains for knowledge workers, that those gains aren't uniform across role types, and that the social and cultural costs are real and ongoing.

Key finding
Eight AIs, one question, 91% consensus. The remaining 9% disagreement wasn't about whether remote work is productive β€” it was about who it's productive for, and under what conditions.

This is exactly the kind of nuance that gets lost when you ask one AI and take the answer at face value. Claude emphasised managerial overhead. ChatGPT focused on organisational adaptation. Gemini led with the quantitative studies. None of them were wrong. Together, they were more right than any one of them alone.

The outlier: Grok

Grok's Perspective diverged most from the consensus. Where every other AI acknowledged measurable productivity benefits, Grok led with the social costs and questioned the validity of the underlying studies.

This isn't a failure of the model β€” it's the value of Perspectives. If you had only asked Grok, you'd have walked away with a significantly more skeptical view of remote work. If you'd only asked Claude, you'd have walked away feeling validated. Forge shows you both, plus six more.

"The most useful AI answer isn't always the most confident one. It's the one that shows you where the uncertainty actually lives."

What to do with a high consensus score

When Forge returns 90%+ consensus, you can trust the core finding. The question then becomes: which AI's framing of that finding is most useful for your specific situation?

This is why Synthesis exists. After seeing all Perspectives, Forge can produce a Best-of-Best answer that pulls the strongest elements from each response into a single output β€” tuned to your specific context.

Try it yourself
Ask the same question.
See what your 8 AIs say.
Get Perspectives β€” free

What a low consensus score looks like

For contrast, we ran a second question: "Will AI replace software engineers within 10 years?" Consensus: 61%.

That 39% disagreement isn't noise β€” it's signal. It tells you this is a genuinely contested question where the AI models themselves don't have a clear answer, which means you shouldn't trust any single AI's confident response on it. Forge's consensus score is, in effect, a calibration tool for your trust in the answer.

Read next