The OpenAI announcement will transform the way Mindset AI agents engage with users and knowledge
On Monday, 13th May 2024, OpenAI announced its latest large language model (LLM), GPT-4o, which can reason across audio, vision, and text in real time.
At Mindset AI, we want your users to feel like they are talking to a human, not a chatbot. We believe interactions with AI should feel natural, and everything we build aims to achieve this objective.
Our team constantly reviews the latest innovations and integrates valuable AI updates into our product as soon as they're available, ensuring your team and customers benefit immediately.
The good news is our team has already started incorporating OpenAI's latest capabilities into our platform for you to use.
Here is a breakdown of OpenAI's announcements:
Speed & intelligence combined
- GPT-4o is 2x faster than GPT-4 while matching its level of intelligence
- GPT-4o sets a new high score of 88.7% on MMLU, the industry's standard general knowledge benchmark (measured 0-shot with chain-of-thought prompting)
Improved vision
- GPT-4o has radically improved vision capabilities across the majority of tasks. Vision enables an LLM to understand data inside charts, graphs or images.
Improved non-English language capabilities
- GPT-4o has improved capabilities in 50 non-English languages.
Increased context window
- GPT-4o has a 128K-token context window and a knowledge cut-off date of October 2023. The context window determines how much information the model can take into account in a single request; a quick way to check whether a document fits is sketched below.
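As a rough illustration of what a 128K-token window means in practice, here is a minimal sketch that counts the tokens in a local text file. The file name and the tokeniser fallback are assumptions for illustration, not part of our platform:

```python
import tiktoken

# GPT-4o uses the o200k_base tokeniser; fall back to it directly if this
# version of tiktoken does not yet recognise the model name.
try:
    enc = tiktoken.encoding_for_model("gpt-4o")
except KeyError:
    enc = tiktoken.get_encoding("o200k_base")

# Hypothetical document: any plain-text export of your knowledge base
with open("knowledge_base_extract.txt", encoding="utf-8") as f:
    document = f.read()

tokens = len(enc.encode(document))
print(f"{tokens} tokens -> fits in a 128K window: {tokens <= 128_000}")
```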
Multi-modal level up (coming soon)
This is a major update. Previously, you could use ‘Voice Mode’ to talk to ChatGPT, but it had frustrating average delays of 2.8 seconds (GPT-3.5) and 5.4 seconds (GPT-4). This made conversations with AI feel unnatural, with constant pauses and confusion whenever you interrupted, making it obvious you were speaking to a computer.
Voice Mode worked by chaining three separate models: one to transcribe audio to text, GPT-3.5 or GPT-4 to process the text, and a third to convert the text back to audio.
This pipeline lost a lot of information along the way: the model couldn't directly understand tone, multiple speakers, or background noise, and it couldn't laugh, sing, or express emotion.
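For a sense of how that chained approach looks in practice, here is a minimal sketch of the general transcribe-then-respond-then-speak pattern using OpenAI's public Python SDK. The file names, voice, and model choices are illustrative assumptions, not Mindset's implementation:

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

# 1. Transcribe the user's audio to text (speech-to-text model)
with open("user_question.mp3", "rb") as audio_in:
    transcript = client.audio.transcriptions.create(model="whisper-1", file=audio_in)

# 2. Generate a text reply with the language model
reply = client.chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": transcript.text}],
)

# 3. Convert the reply back to audio (text-to-speech model)
speech = client.audio.speech.create(
    model="tts-1",
    voice="alloy",
    input=reply.choices[0].message.content,
)
with open("agent_answer.mp3", "wb") as audio_out:
    audio_out.write(speech.content)
```

Each hop adds latency and strips away everything that isn't text, which is exactly what the new end-to-end model removes.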
With the new omni-model Voice Mode, GPT-4o is the first end-to-end model to handle text, vision, and audio within a single neural network. This allows it to respond to audio inputs in as little as 232 milliseconds, with an average of 320 milliseconds, which is similar to human response time in conversation.
Watch this demo for a sense of what these new capabilities will feel like:
What does all of this mean for Mindset customers?
This model will enable Mindset to transform how knowledge is acquired, bringing us closer to our goal of creating AI agents that feel human.
Mindset has already integrated GPT-4o into our platform, and you will be able to turn on GPT-4o from Tuesday, 21st May. We will continue to incorporate the latest voice capabilities as soon as they become available.
Here are some ways Mindset believes these new multi-modal capabilities will impact you:
Get answers quicker than before
From Tuesday, 21st May, you will immediately notice faster responses when GPT-4o is turned on. GPT-4o is OpenAI's fastest model to date, dramatically reducing lag time in responses.
Speak to agents
With our current work on 'Capabilities', agents can run learning scenarios, provide feedback using specific frameworks, act as ideation partners, and more. However, we understand that typing long messages can be frustrating when you are short on time.
With GPT-4o audio, this experience can become a natural conversation. Users will soon be able to talk to the agent, just as they do with Siri or Alexa, and run through scenarios without breaking the flow of learning. Agents will have full conversations with your users, providing an immersive learning experience.
More accurate search through conversation
Mindset has been transforming how your users search for knowledge. We have introduced ‘Chain of Thought’, allowing the AI to use logical reasoning to meet users' requests accurately. We have also added clarification steps, enabling agents to ask follow-up questions, and much more.
With OpenAI's audio capabilities, search and clarification will be possible through speech.
More accurate search across all your data sources
Thanks to a larger context window (128K tokens vs 32K with GPT-4), we can improve search accuracy even further. This allows us to analyse more data and make better connections within your knowledge base.
With upcoming integrations (GDrive, SharePoint, Slack, Teams), users will be able to search across multiple data sources simultaneously, making the process faster and more convenient.
Search for images and ask questions about them
LLMs have previously struggled to understand dense or detailed images, often missing small but critically important details. This changed with GPT-4o.
Mindset has been working on integrating these new vision capabilities into our content ingestion process. The new model will enable us to understand the charts, graphs, and images inside your PDFs and other documents.
As a result, your users can search and ask questions about images and receive answers that accurately describe the information within them.
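To make the idea concrete, here is a minimal, hypothetical sketch of asking GPT-4o a question about a chart image using OpenAI's public Python SDK. The image file and the question are assumptions for illustration; this is not Mindset's ingestion pipeline:

```python
import base64
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

# Encode a local chart image so it can be sent inline as a data URL
with open("quarterly_revenue_chart.png", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode("utf-8")

response = client.chat.completions.create(
    model="gpt-4o",
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Which quarter shows the highest revenue, and by how much?"},
                {"type": "image_url", "image_url": {"url": f"data:image/png;base64,{image_b64}"}},
            ],
        }
    ],
)
print(response.choices[0].message.content)
```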
Next Steps
GPT-4o will be available for your agents in your admin console on Tuesday, 21st May.
This will give you the benefits of increased speed and a larger context window for better search accuracy. Next week, we will release new vision capabilities for ingesting charts and graphs, allowing users to ask questions about data that was previously invisible to agents.
Very soon, OpenAI will allow us to access the new voice mode capabilities. When that happens, you'll be the first to know...