More Testing of GPT5 and Comparing Against Other Models

There are various Youtubers on AI who are giving their opinions of OpenAI GPT5. Theo is crowning GPT5 as the best model but others feel it is a good and fast model but are not blown away by it.

GPT5 is the number one model per the LMArena leaderboard with a 1481 score and 1460 for Gemini and 1429 for Grok 4. There are other OpenAI models on the board but the other OpenAI models were deprecated.

There were a variety of tests that Grok 4 did well like simulating smoke or finding Waldo which GPT5 failed.

There in an assessment of what OpenAI GPT5 means for work, research and for coding. GTP5 seems to be prioritizing performance in Canvas. It did a tourist planning task in Canvas using GPT5 but failed for Luvable using GPT5. Checkpoint your coding because applets created by GPT5 can work but are also fragile.

Prompting matters for GPT5 performance. If correctness matters needs think hard in the prompt or think hard button. It then achieves better performance. GPT5 vanilla and pro underperformed even old OpenAI models and other models.

It is a step forward and is progress. It is jump on the coding side and reliability but there is still a lot of work to be done for worldchanging capability for AI.
It is advancing the models jaggedly.
It is advancing what the other models did but a bit better in several cases.

Wes Roth shows significant GPT5 capabilities like a very good Minecraft one shot.

Read More

Latest

What Did FDA Vaccine Advisors Decide This Week?

You don't have permission to access "http://www.medpagetoday.com/quizzes/news-quiz/121502" on this server. Reference #18.b1382f17.1780175539.dc13dc https://errors.edgesuite.net/18.b1382f17.1780175539.dc13dc

How AI Made 2026 the Hardest Year to Get Into Medical School

You don't have permission to access "http://www.medpagetoday.com/popmedicine/popmedicine/121477" on this server. Reference #18.b1382f17.1780175544.dc1ddb https://errors.edgesuite.net/18.b1382f17.1780175544.dc1ddb

Newsletter

Don't miss

What Did FDA Vaccine Advisors Decide This Week?

You don't have permission to access "http://www.medpagetoday.com/quizzes/news-quiz/121502" on this server. Reference #18.b1382f17.1780175539.dc13dc https://errors.edgesuite.net/18.b1382f17.1780175539.dc13dc

How AI Made 2026 the Hardest Year to Get Into Medical School

You don't have permission to access "http://www.medpagetoday.com/popmedicine/popmedicine/121477" on this server. Reference #18.b1382f17.1780175544.dc1ddb https://errors.edgesuite.net/18.b1382f17.1780175544.dc1ddb

Brittany Mahomes Rocks Corset and Barely-There Lace Shorts at Stagecoach

Music Brittany Mahomes just delivered a lesson in festival...

US Business Leaders Optimistic About China Cooperation, Emphasize Importance of Chinese Market

© 2026 China Money Network. All Rights Reserved. Disclaimer: The views, opinions, forecasts, and statements made by our hosts and guests are the personal views of those respective individuals and may or may not be either endorsed or accepted by China Money Network Limited or the companies with which these individuals are employed.

Tesla’s Business Has Become Much More Diversified in Just the Past Five Years. Does That Make Its Stock a Better Buy Today?

Key Points Tesla's energy generation and storage segment generated 27% revenue growth last year. The company's non-automotive segments were able to help offset a double-digit decline in auto revenue in 2025. These 10 stocks could mint the next wave of millionaires › Tesla (NASDAQ: TSLA) is known for its electric vehicles (EVs), and while they

WD sees sustainability as key business driver in an ‘AI economy’

Hard drive company WD promoted long-term operations and sustainability executive Jackie Jung to become its first chief sustainability officer in February, as it steps up sales to companies building AI data centers. Her vision: Turn sustainability into a “brand” for WD, a strategy that reduces risk for the $6 billion company (formerly known as Western