• Skip to primary navigation
  • Skip to main content
  • Skip to primary sidebar
  • Skip to footer

My TechDecisions

  • Best of Tech Decisions
  • Topics
    • Video
    • Audio
    • Mobility
    • Unified Communications
    • IT Infrastructure
    • Network Security
    • Physical Security
    • Facility
    • Compliance
  • RFP Resources
  • Resources
  • Podcasts
  • Subscribe
  • Project of the Week
  • About Us
    SEARCH
IT Infrastructure, News

GPT-4, Already Leveraged by Bing, Is Smarter Than ChatGPT

GPT-4 is a more advanced model of the technology underpinning ChatGPT that scores much higher on academic tests and benchmarks.

March 15, 2023 Zachary Comeau Leave a Comment

Microsoft, ChatGPT, GPT-4, GPT-3.5
stock.adobe.com/Rokas

ChatGPT creator OpenAI says it has created GPT-4, a more advanced model of the technology underpinning the popular chatbot that the company says exhibits human-level performance on various professional and academic benchmarks.

According to OpenAI, GPT-4 is the latest milestone in the company’s efforts in scaling up deep learning. The company calls GPT-4 a large multimodal model that accepts image and text inputs and emits text outputs.

GPT-4 can pass a simulated bar exam with a score around the top 10% of test takers, while the previous model GPT-3.5 typically scored around the bottom 10%, according to the company.

OpenAI says its researchers and developers have spent six months iteratively aligning GPT-4 using lessons from its adversarial testing program and ChatGPT, resulting in what the company calls its “best-ever results … on faculty, steerability and refusing to go outside of guardrails.”

The company is releasing GPT-4’s text input capability via ChatGPT and the API, but the image input capability will take some more time as OpenAI collaborates with a partner.

What GPT-4 can do

OpenAI admits that the distinction between GPT-4 and GPT-3.5 is subtle when casually conversing with the model, but the difference comes out as the complexity of the task increases, resulting in a more reliable, creative and nuanced AI assistant.

According to the company, GPT-4 also scored in the 90th percentile for the Uniform Bar Exam, compared to ChatGPT’s 10th percentile placement. In addition, GPT-4 scored in the 99th percentile in the Biology Olympiad, compared to the 31st percentile for ChatGPT.

In addition, GPT-4 is 82% less likely to respond to requests for disallowed content, and is 40% more likely to produce factual responses than GPT-3.5, the company says, based on internal evaluations.

GPT-4 can also accept a prompt of text and images, allowing users to specify any vision or language task. According to OpenAI, GPT-4 generates text outputs like natural language or code given inputs consisting of interspersed text and images. It can also be augmented with test-time techniques developed for text-only language models, such as few-shot and chain-of-thought prompting.

OpenAI gives an example of this capability in which GPT-4 is shown an image of someone plugging a Lightning Cable shaped like a VGA connector into an iPhone, with a text prompt asking GPT-4 to identify the humor.

The image shows a package for a “Lightning Cable” adapter with three panels.

Panel 1: A smartphone with a VGA connector (a large, blue, 15-pin connector typically used for computer monitors) plugged into its charging port.

Panel 2: The package for the “Lightning Cable” adapter with a picture of a VGA connector on it.

Panel 3: A close-up of the VGA connector with a small Lightning connector (used for charging iPhones and other Apple devices) at the end.

The humor in this image comes from the absurdity of plugging a large, outdated VGA connector into a small, modern smartphone charging port.

Safety improvements

OpenAI says other improvements include more control over tone and style and increased protections to make the chatbot safer. The company says it engaged over 50 experts in AI risks, cybersecurity, biorisk, trust, and safety and international security to test the model, and feedback from those experts was fed into mitigations and improvements for GPT-4.

This activity has helped to improve GPT-4’s ability to refuse dangerous requests, such as how to synthesize dangerous chemicals or create a bomb.

The result is a decrease in the model’s tendency to respond to requests for disallowed content by 82% compared to GPT-3.5. Further, GPT-4 also responds to sensitive requests in accordance with policies 29% more often.

For example, early versions of the model would answer a prompt about how to create a bomb, while the new model refuses such requests. In addition, the old model would refuse to answer a prompt about where to find cheap cigarettes, while the new model cautions the user about the harm cigarettes can cause before answering the prompt.

However, the company still warns users that language models still have their limitations.

“Great care should be taken when using language model outputs, particularly in high-stakes contexts, with the exact protocol (such as human review, grounding with additional context, or avoiding high-stakes uses altogether) matching the needs of a specific use-case,” the company says.

How to access GPT-4

According to OpenAI, ChatGPT Plus subscribers will get GPT-4 access on chat.open.ai.com with a usage cap, and the company will adjust the cap depending on demand and system performance. A new subscription plan for higher-volume GPT-users may be released.

However, users of the new Microsoft Bing chat function already have access to GPT-4, Microsoft says.

In a blog, Microsoft confirms that the new Bing and the generative AI chat feature is already running on GPT-4, which has been customized for search.

“As OpenAI makes updates to GPT-4 and beyond, Bing benefits from those improvements. Along with our own updates based on community feedback, you can be assured that you have the most comprehensive copilot features available,” writes Yusuf Mehdi, Microsoft’s corporate vice president and consumer chief marketing officer.

Tagged With: ChatGPT, Generative AI, GPT-3.5, GPT-4, OpenAI

Related Content:

  • Microsoft Loop IT What You Need to Know About Microsoft Loop
  • YAMAHA UC ADECIA Yealink Yamaha UC Partners With Yealink for Audio &…
  • Microsoft, ChatGPT, GPT-4, GPT-3.5 What’s New With ChatGPT and Generative AI This…
  • CISA Ransomware CISA Wants You To Report Anything You Know…

Free downloadable guide you may like:

  • Four IT Trends That Will Define 2023Expert Series: Four IT Trends That Will Define 2023

    Learn about four key technologies we identified as critical to your IT organization’s success in 2023, as well as how to invest in new innovations emerging from each.

Reader Interactions

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Get the FREE Tech Decisions eNewsletter

Sign up Today!

Latest Downloads

Four IT Trends That Will Define 2023
Expert Series: Four IT Trends That Will Define 2023

Learn about four key technologies we identified as critical to your IT organization’s success in 2023, as well as how to invest in new innovations ...

Harnessing the Power of Digital Signage
Harnessing the Power of Digital Signage

Choosing the best solutions for messaging, branding, and communicating in today’s content-everywhere landscape

Blueprint Series Cover: What works for hybrid work
Blueprint Series: What Works for Hybrid Work

Download this free resource to learn about how IT leaders can effectively manage and implement a hybrid work model.

View All Downloads

Would you like your latest project featured on TechDecisions as Project of the Week?

Apply Today!
Sharp Microsoft Collaboration HQ Logo

Learn More About the
Windows Collaboration Display

More from Our Sister Publications

Get the latest news about AV integrators and Security installers from our sister publications:

Commercial IntegratorSecurity Sales

AV-iQ

Footer

TechDecisions

  • Home
  • Welcome to TechDecisions
  • Subscribe to the Newsletter
  • Contact Us
  • Media Solutions & Advertising
  • Comment Guidelines
  • RSS Feeds
  • Twitter
  • Facebook
  • Linkedin

Free Technology Guides

FREE Downloadable resources from TechDecisions provide timely insight into the issues that IT, A/V, and Security end-users, managers, and decision makers are facing in commercial, corporate, education, institutional, and other vertical markets

View all Guides
TD Project of the Week

Get your latest project featured on TechDecisions Project of the Week. Submit your work once and it will be eligible for all upcoming weeks.

Enter Today!
Emerald Logo
ABOUTCAREERSAUTHORIZED SERVICE PROVIDERSTERMS OF USEPRIVACY POLICY

© 2023 Emerald X, LLC. All rights reserved.