{"id":869583,"date":"2025-09-01T01:13:57","date_gmt":"2025-09-01T06:13:57","guid":{"rendered":"https:\/\/newsycanuse.com\/index.php\/2025\/09\/01\/more-testing-of-gpt5-and-comparing-against-other-models\/"},"modified":"2025-09-01T01:13:57","modified_gmt":"2025-09-01T06:13:57","slug":"more-testing-of-gpt5-and-comparing-against-other-models","status":"publish","type":"post","link":"https:\/\/newsycanuse.com\/index.php\/2025\/09\/01\/more-testing-of-gpt5-and-comparing-against-other-models\/","title":{"rendered":"More Testing of GPT5 and Comparing Against Other Models"},"content":{"rendered":"<div id=\"page\">\n\t\t<main id=\"main\"><\/p>\n<article id=\"post-204698\" itemtype=\"https:\/\/schema.org\/CreativeWork\" itemscope>\n<div itemprop=\"text\">\n<p>Several AI-focused YouTubers are giving their opinions on OpenAI's GPT5. Theo is crowning GPT5 as the best model, while others consider it a good, fast model but are not blown away by it.<\/p>\n<p>GPT5 is the number one model on the LMArena leaderboard with a score of 1481, compared with 1460 for Gemini and 1429 for Grok 4. 
Other OpenAI models appear on the board, but they have been deprecated.<br \/>\n<img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/nextbigfuture.s3.amazonaws.com\/uploads\/2025\/08\/Screenshot-2025-08-08-at-10.16.02-AM-1024x660.jpg\" alt width=\"1024\" height=\"660\"  ><\/p>\n<p><iframe loading=\"lazy\" title=\"YouTube video player\" src=\"https:\/\/www.youtube.com\/embed\/NiURKoONLVY?si=1lne7eoQZwpVS4R9\" width=\"560\" height=\"315\" frameborder=\"0\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n<p>Grok 4 did well on a variety of tests that GPT5 failed, such as simulating smoke or finding Waldo.<br \/>\n<iframe loading=\"lazy\" title=\"YouTube video player\" src=\"https:\/\/www.youtube.com\/embed\/8pYYDL0yMwg?si=4IyJ_Jf7jyj40xqo\" width=\"560\" height=\"315\" frameborder=\"0\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n<p>There is an assessment of what OpenAI GPT5 means for work, research and coding. GPT5 seems to prioritize performance in Canvas. A tourist planning task succeeded in Canvas using GPT5 but failed in Lovable using GPT5. Checkpoint your code, because applets created by GPT5 can work but are also fragile.<\/p>\n<p>Prompting matters for GPT5 performance. If correctness matters, ask it to think hard in the prompt or use the think hard button. It then achieves better performance. Vanilla GPT5 and GPT5 pro underperformed even older OpenAI models as well as other models.<\/p>\n<p>It is a step forward and represents progress. 
It is a jump in coding and reliability, but there is still a lot of work to be done before AI has world-changing capability.<br \/>\nIt advances the models jaggedly.<br \/>\nIt does what the other models did, but a bit better in several cases.<br \/>\n<iframe loading=\"lazy\" title=\"YouTube video player\" src=\"https:\/\/www.youtube.com\/embed\/DbX_0_0LGag?si=DDoWJBrFgogu0-mb\" width=\"560\" height=\"315\" frameborder=\"0\" allowfullscreen=\"allowfullscreen\"><\/iframe><\/p>\n<p>Wes Roth shows significant GPT5 capabilities, such as a very good Minecraft one-shot.<br \/>\n<iframe loading=\"lazy\" width=\"560\" height=\"315\" src=\"https:\/\/www.youtube.com\/embed\/4wXQt6SVO_U?si=LjeZKsLkj10lii0T\" title=\"YouTube video player\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><\/p>\n<p><iframe loading=\"lazy\" width=\"560\" height=\"315\" src=\"https:\/\/www.youtube.com\/embed\/v3zirumCo9A?si=EukLREXNu34No43R\" title=\"YouTube video player\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><\/p>\n<p><iframe loading=\"lazy\" width=\"560\" height=\"315\" src=\"https:\/\/www.youtube.com\/embed\/osnVKnD7W9M?si=7ABkp0UIwHGCmqt0\" title=\"YouTube video player\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><\/p>\n<p><iframe loading=\"lazy\" width=\"560\" height=\"315\" src=\"https:\/\/www.youtube.com\/embed\/mfIMmkLiP-Q?si=Q4_nZYh4-kPEhQrJ\" title=\"YouTube video player\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" 
referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><\/p>\n<div itemtype=\"http:\/\/schema.org\/Person\" itemscope itemprop=\"author\">\n<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/nextbigfuture.s3.amazonaws.com\/uploads\/2020\/08\/brianheadshot.jpg\" width=\"100\" height=\"100\" alt itemprop=\"image\"><\/p>\n<div itemprop=\"description\">\n<p>Brian Wang is a Futurist Thought Leader and a popular Science blogger with 1 million readers per month. His blog Nextbigfuture.com is ranked #1 Science News Blog. It covers many disruptive technologies and trends including Space, Robotics, Artificial Intelligence, Medicine, Anti-aging Biotechnology, and Nanotechnology.<\/p>\n<p>Known for identifying cutting edge technologies, he is currently a Co-Founder of a startup and fundraiser for high potential early-stage companies. He is the Head of Research for Allocations for deep technology investments and an Angel Investor at Space Angels.<\/p>\n<p>A frequent speaker at corporations, he has been a TEDx speaker, a Singularity University speaker and guest at numerous interviews for radio and podcasts. He is open to public speaking and advising engagements.<\/p>\n<\/div>\n<\/div><\/div>\n<p>\t\t\t\t\t<\/main>\n\t<\/div>\n<p><a href=\"https:\/\/www.nextbigfuture.com\/2025\/08\/more-testing-of-gpt5-and-comparing-against-other-models.html\" class=\"button purchase\" rel=\"nofollow noopener\" target=\"_blank\">Read More<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Several AI-focused YouTubers are giving their opinions on OpenAI's GPT5. Theo is crowning GPT5 as the best model, while others consider it a good, fast model but are not blown away by it. 
GPT5 is [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":869584,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[3786,285,104640],"tags":[],"class_list":{"0":"post-869583","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-comparing","8":"category-testing","9":"category-youtube-videos"},"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts\/869583","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/comments?post=869583"}],"version-history":[{"count":0,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts\/869583\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/media\/869584"}],"wp:attachment":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/media?parent=869583"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/categories?post=869583"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/tags?post=869583"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}