OpenAI Assures No Intellectual Drop in AI
OpenAI Assures No Intellectual Drop in AI
If you’ve read Twitter or Reddit’s ChatGPT subreddit, you’ll see one question asked more than any other: is ChatGPT getting dumber?Is the performance of the world’s leading generative AI chatbot decreasing as time passes, or are ChatGPT’s millions of users collectively hallucinating quality issues?
MUO VIDEO OF THE DAY
SCROLL TO CONTINUE WITH CONTENT
Disclaimer: This post includes affiliate links
If you click on a link and make a purchase, I may receive a commission at no extra cost to you.
Is ChatGPT Getting Worse?
It’s something many ChatGPT users have wondered: is ChatGPT getting dumber? OpenAI releases frequent updates to ChatGPT designed to tweak its responses, safety, and more, using user feedback , prompts, and user data to inform its direction.
But where ChatGPT felt like a genius solution to almost any problem when it launched, more users than ever report issues with its responses and output. Particularly of note are ChatGPTs reasoning, coding, and mathematical skills, though others note that it struggles with creative tasks, too.
The easiest way for most ChatGPT users to check how its responses have changed over time is to repeat a previously used prompt (preferably from the earlier days of ChatGPT) and analyze the two outputs.
Responses requiring specific outputs, like those involving coding and math, are likely easiest to compare directly.
Stanford Study Suggests ChatGPT Drop Off
A combined Stanford University and UC Berkley research group believes those feelings that ChatGPT is changing could be right. Lingjiao Chen, Matei Zaharia, and James Zou’s paper How Is ChatGPT’s Behavior Changing over Time? [PDF] is one of the first in-depth studies into ChatGPT’s changing capabilities.
The report summary explains:
We find that the performance and behavior of both GPT-3.5 and GPT-4 can vary greatly over time. For example, GPT-4 (March 2023) was very good at identifying prime numbers (accuracy 97.6%) but GPT-4 (June 2023) was very poor on these same questions (accuracy 2.4%). Interestingly, GPT-3.5 (June 2023) was much better than GPT-3.5 (March 2023) in this task. GPT-4 was less willing to answer sensitive questions in June than in March, and both GPT-4 and GPT-3.5 had more formatting mistakes in code generation in June than in March.
When presented with math problems ChatGPT could previously solve earlier in 2023, responses later in the year were wildly inaccurate. Furthermore, ChatGPT explained in detail why the answer was correct despite being wrong. Instances of AI hallucination are nothing new, but the figures in the charts below suggest a significant change in overall reasoning.
Image Credit: Stanford/Berkeley
Image Credit: Stanford/Berkeley
The charts suggest ChatGPT’s responses are drifting, further emphasized by the report.
GPT-4’s accuracy dropped from 97.6% in March to 2.4% in June, and there was a large improvement of GPT-3.5’s accuracy, from 7.4% to 86.8%. In addition, GPT-4’s response became much more compact: its average verbosity (number of generated characters) decreased from 821.2 in March to 3.8 in June. On the other hand, there was about 40% growth in GPT-3.5’s response length. The answer overlap between their March and June versions was also small for both services.
The report explains that ChatGPT’s large language model chain of thought “did not work” when presented with the questions in June. Conversation drift has always been a notable issue with LLMs, but the extreme variance in responses suggests performance issues and changes to ChatGPT.
Is ChatGPT Getting Worse? OpenAI Says No
Is it just a coincidence that both casual and prolific ChatGPT users are noticing ChatGPT’s changing quality?
The research paper would suggest not, but OpenAI VP for Product, Peter Welinder, suggests otherwise.
Furthermore, Welinder later pointed to OpenAI’s releases for ChatGPT and the constant stream of updates the company has delivered throughout 2023.
Still, that didn’t stop numerous responses to his tweet detailing how users have found ChatGPT’s responses wanting, with many taking the time to annotate prompts and responses.
Can OpenAI Restore ChatGPT to Its Original State?
The early days of ChatGPT seem long in the distance now; November 2022 is a hazy memory, and the world of AI moves fast.
For many, the Stanford/Berkeley study perfectly illustrates the issues and frustrations of using ChatGPT. Others claim the tweaks and changes made to ChatGPT to make it a safer, more inclusive tool have also directly altered its ability to reason properly, nerfing its knowledge and overall capabilities to the point it’s unusable.
It seems there is little doubt that ChatGPT has changed. Whether ChatGPT will regain its original prowess is another question entirely.
SCROLL TO CONTINUE WITH CONTENT
Also read:
- [New] In 2024, Make a Video From Photo with Music
- [Updated] In 2024, Key Techniques for Capturing YouTube Streaming Content
- [Updated] In 2024, Leveraging Snap Camera for Better Online Interaction on Google Meet
- [Updated] Update Complete Shorts Visibility Restored
- Blastro Music Video Downloads in Multiple Formats: MP4, WMV, AVI - Compatible with Mac & Windows
- Complete Code of Conduct for YouTube Creators for 2024
- Elevate Your Speaking Power with These Top 7 Tools
- Experts' Choice for Premium Stylus Tablets of the Year 2Eighty-Four: In-Depth Analysis and Ratings | PCMag
- How Can I Recover Corrupted Excel File 2000 | Stellar
- In 2024, Locked Out of iPhone 12? 5 Ways to get into a Locked iPhone 12
- Starting Out: Professional Guide to Prompt Engineering
- The Intersection of Websites and Artificial Intelligence
- Transform Your Exercise Routine: Discover the Power of the New AirPods Pro's Fitness Tracking and Heart Rate Sensor Features
- Transform Your YouTube Venture From Free to Earnings at the 500 Mark for 2024
- Title: OpenAI Assures No Intellectual Drop in AI
- Author: Brian
- Created at : 2024-12-25 16:51:17
- Updated at : 2024-12-27 17:13:39
- Link: https://tech-savvy.techidaily.com/openai-assures-no-intellectual-drop-in-ai/
- License: This work is licensed under CC BY-NC-SA 4.0.