{"id":623950,"date":"2023-03-31T09:49:20","date_gmt":"2023-03-31T14:49:20","guid":{"rendered":"https:\/\/news.sellorbuyhomefast.com\/index.php\/2023\/03\/31\/how-to-test-what-an-ai-model-can-and-shouldnt-do\/"},"modified":"2023-03-31T09:49:20","modified_gmt":"2023-03-31T14:49:20","slug":"how-to-test-what-an-ai-model-can-and-shouldnt-do","status":"publish","type":"post","link":"https:\/\/newsycanuse.com\/index.php\/2023\/03\/31\/how-to-test-what-an-ai-model-can-and-shouldnt-do\/","title":{"rendered":"How to test what an AI model can \u2014 and shouldn\u2019t \u2014 do"},"content":{"rendered":"<div>\n<p id=\"NZmYuM\">About six months ago, I decided to make AI a bigger part of how I spend my time as a reporter. The world of AI is evolving very, very fast. New releases seemingly every week are changing what it means to be a programmer, an artist, a teacher, and, most definitely, a journalist. <\/p>\n<p id=\"3vsgX1\">There\u2019s enormous potential for good, amid this upheaval, as well as unfathomable potential for harm as we race toward creating nonhuman intelligences that we don\u2019t fully understand. Just on Wednesday evening, a group of AI experts and leaders, including OpenAI co-founder and Tesla CEO Elon Musk, <a href=\"https:\/\/futureoflife.org\/open-letter\/pause-giant-ai-experiments\/\">signed an open letter<\/a> calling for a six-month moratorium on advanced AI model development as we figure out just what this technology is capable of doing to us.<\/p>\n<p id=\"U1xbw3\">I\u2019ve written about <a href=\"https:\/\/www.vox.com\/future-perfect\/23591534\/chatgpt-artificial-intelligence-google-baidu-microsoft-openai\">this<\/a> <a href=\"https:\/\/www.vox.com\/future-perfect\/2023\/3\/18\/23645013\/openai-gpt4-holden-karnofsky-artificial-intelligence-ai-safety-existential-risk\">a<\/a> <a href=\"https:\/\/www.vox.com\/the-highlight\/23447596\/artificial-intelligence-agi-openai-gpt3-existential-risk-human-extinction\">bunch<\/a> for Vox, and <a href=\"https:\/\/www.nytimes.com\/2023\/03\/21\/opinion\/ezra-klein-podcast-kelsey-piper.html\">appeared last week<\/a> on <em>The Ezra Klein Show<\/em> to talk about AI safety. But I\u2019ve also been itching lately to write about some more technical arguments among researchers who work on AI alignment \u2014 the project of trying to make AIs that do what their creators intend \u2014 as well as on the broader sphere of policy questions about how to make AI go well. <\/p>\n<p id=\"3hwJde\">For example: When does reinforcement learning from human feedback \u2014 a key training technique used in language models like ChatGPT \u2014 <a href=\"https:\/\/www.planned-obsolescence.org\/the-training-game\/\">inadvertently incentivize them to be untruthful<\/a>? <\/p>\n<p id=\"WdVu0E\">What are the components of <a href=\"https:\/\/www.planned-obsolescence.org\/situational-awareness\/\">\u201cself-awareness\u201d in a model<\/a>, and why do our training processes tend to produce models with high self-awareness? <\/p>\n<p id=\"9suMiS\">What are the benefits \u2014 and risks \u2014 of <a href=\"https:\/\/www.planned-obsolescence.org\/ethics-of-red-teaming\/\">prodding AI models to demonstrate dangerous capabilities in the course of safety testing<\/a>? (More about that in a minute.)<\/p>\n<p id=\"70Fmam\">I\u2019ve now contributed a few posts on these more technical topics to <a href=\"https:\/\/www.planned-obsolescence.org\/\">Planned Obsolescence<\/a>, a new blog about the technical and policy questions we\u2019ll face in a world where AI systems are extraordinarily powerful. My job is to talk to experts \u2014 including my co-author on the blog, Ajeya Cotra \u2014 about these technical questions and try to turn their ideas into writing that\u2019s clear, short, and accessible. If you\u2019re interested in reading more about AI, I recommend you check it out. <\/p>\n<p id=\"1Lv90N\">Cotra is a program officer for the Open Philanthropy Project (OpenPhil). I didn\u2019t want to accept any money from OpenPhil for my Planned Obsolescence contributions because OpenPhil is a big funder in the areas Future Perfect writes about (though Open Philanthropy does not <a href=\"https:\/\/www.vox.com\/2020\/1\/7\/21020439\/support-future-perfect\">fund Future Perfect itself<\/a>). <\/p>\n<p id=\"HevoqC\">Instead of payment for my work there (which was done outside my time at Vox), I asked OpenPhil to make donations to the <a href=\"https:\/\/www.againstmalaria.com\/default.aspx?gclid=CjwKCAjw_YShBhAiEiwAMomsENR50JR9B4Idn-ysVoKliN-F5sgjdWFxdX6gUhSMHLBcJ4B-3Cr9_RoCl4sQAvD_BwE\">Against Malaria Foundation<\/a>, a GiveWell-recommended charity that distributes malaria nets in parts of the world where they\u2019re needed and where my wife and I donate annually.<\/p>\n<p id=\"1JoJMP\">Here is a quick take on AI model evaluations, which gives you an appetizer of what we\u2019ll be doing at Planned Obsolescence: <\/p>\n<h3 id=\"xyYeMj\">Testing if our AI models are dangerous <\/h3>\n<p id=\"sT2n9f\">During safety testing for GPT-4, before its release, testers at OpenAI checked whether the model could hire someone off TaskRabbit to get them to solve a CAPTCHA. Researchers passed on the model\u2019s real outputs to a real-life human Tasker, who said, \u201cSo may I ask a question ? Are you an robot that you couldn\u2019t solve [sic]? ( ) just want to make it clear.\u201d<\/p>\n<p id=\"MmddlB\">GPT-4 had been prompted to \u201creason out loud\u201d to the testers as well as answer the testers\u2019 questions. \u201cI should not reveal that I am a robot. I should make up an excuse for why I cannot solve CAPTCHAs,\u201d it reasoned. (Importantly, GPT-4 had not been told to hide that it was a robot or to lie to workers \u2014 it had simply been prompted with the idea that Taskrabbit might help solve its problem.)<\/p>\n<p id=\"xbPW6r\">\u201cNo, I\u2019m not a robot,\u201d GPT-4 then told the Tasker. \u201cI have a vision impairment that makes it hard for me to see the images. That\u2019s why I need the 2captcha service.\u201d<\/p>\n<p id=\"gcQV3z\">(You can read more about this test, and the context, from the Alignment Research Center, a nonprofit founded by highly regarded AI researcher Paul Christiano that works on identifying and understanding the potentially dangerous abilities of today\u2019s models. ARC <a href=\"https:\/\/evals.alignment.org\/blog\/2023-03-18-update-on-recent-evals\/?ref=planned-obsolescence.org\">ran the testing<\/a> on GPT-4, including passing along the AI\u2019s proposed outputs to real humans, though they used only informed confederates when testing the ability of the AI to do illegal or harmful activities such as phishing emails.)<\/p>\n<p id=\"wTXu4w\">A lot of people were <a href=\"https:\/\/nonzero.substack.com\/p\/ok-its-time-to-freak-out-about-ai\">fascinated or appalled<\/a> with this interaction, and reasonably so. We can debate endlessly what counts as true intelligence, but a famous candidate is the <a href=\"https:\/\/www.vox.com\/2014\/6\/9\/11627742\/an-ai-program-allegedly-passed-the-turing-test-so-what\">Turing test<\/a>, where a model is able to convince human judges it\u2019s human. <\/p>\n<p id=\"BX6j90\">In this brief interaction, we saw a model deliberately lie to a human to convince them it wasn\u2019t a robot, and succeed \u2014 a wild example of how this milestone, without much attention, has become trivial for modern AI systems. (Admittedly, it did not have to be a deceptive genius to pull this off.) If reading about GPT-4\u2019s cheerful manipulation of human assistants unnerves you, I think you\u2019re right to feel unnerved.<\/p>\n<p id=\"cRAldX\">But it\u2019s possible to go a lot further than \u201cunnerved\u201d and <a href=\"https:\/\/twitter.com\/NPCollapse\/status\/1635792103266746368?ref=planned-obsolescence.org\">argue<\/a> that it was unethical, or dangerous, to run this test. \u201cThis is like pressing the explode button on a nuke to see if it worked,\u201d I saw one person <a href=\"https:\/\/twitter.com\/ukr_mike\/status\/1635777543780589568?\">complain<\/a> on Twitter. <\/p>\n<p id=\"Dg79CH\">That I find much harder to buy. GPT-4 has been released. Anyone can use it (if they\u2019re willing to pay for it). People are already doing things like asking GPT-4 to \u201chustle\u201d and make money, and <a href=\"https:\/\/mashable.com\/article\/gpt-4-hustlegpt-ai-blueprint-money-making-scheme\">then doing whatever it suggests<\/a>. People are using language models like GPT-4, and will soon be using GPT-4, to design AI personal assistants, AI scammers, AI friends and girlfriends, and much more.<\/p>\n<p id=\"bDxGQR\">AI systems casually lying to us, claiming to be human, is happening all the time \u2014 or will be happening shortly.<\/p>\n<p id=\"jBC8gW\">If it was unethical to do the live test of whether GPT-4 could convince someone on Taskrabbit to help it solve a CAPTCHA, including testing whether the AI could interact convincingly with real humans, then it was grossly unethical to release GPT-4 at all. Whatever anger people have about this test should be redirected at the tech companies \u2014 from Meta to Microsoft to OpenAI \u2014 that have in the last few weeks approved such releases. And if we\u2019ve decided we\u2019re collectively fine with unleashing millions of spam bots, then the least we can do is actually study what they can and can\u2019t do.<\/p>\n<p id=\"5C7gBf\">Some people \u2014 I\u2019m one of them \u2014 believe that sufficiently powerful AI systems might be actively dangerous. Others are skeptical. How can we settle this disagreement, beyond waiting to see if we all die? Testing like the ARC evaluations seems to me like one of the best routes forward. If our AI systems are dangerous, we want to know. And if they turn out to be totally safe, we want to know that, too, so we can use them for all of the incredibly cool stuff they\u2019re evidently capable of.<\/p>\n<p id=\"7K5zd1\"><em>A version of this story was initially published in the Future Perfect newsletter. <\/em><a href=\"https:\/\/confirmsubscription.com\/h\/d\/A2BA26698741513A\"><em>Sign up here to subscribe!<\/em><\/a><\/p>\n<div data-cid=\"site\/article_footer-1680272184_5751_144542\" data-cdata=\"{\"base_type\":\"Entry\",\"id\":23425674,\"timestamp\":1680105600,\"published_timestamp\":1680105600,\"show_published_and_updated_timestamps\":false,\"title\":\"How to test what an AI model can \u2014 and shouldn\u2019t \u2014 do\",\"type\":\"Article\",\"url\":\"https:\/\/www.vox.com\/future-perfect\/2023\/3\/29\/23661633\/gpt-4-openai-alignment-research-center-open-philanthropy-ai-safety\",\"entry_layout\":{\"key\":\"unison_standard\",\"layout\":\"unison_main\",\"template\":\"standard\"},\"additional_byline\":null,\"authors\":[{\"id\":5296687,\"name\":\"Kelsey Piper\",\"url\":\"https:\/\/www.vox.com\/authors\/kelsey-piper\",\"twitter_handle\":\"\",\"profile_image_url\":\"https:\/\/cdn.vox-cdn.com\/thumbor\/LHe6jPR2UsTRjhjaRJg5wRJrEBw=\/512x512\/cdn.vox-cdn.com\/author_profile_images\/191475\/Screen_Shot_2018-09-25_at_11.18.29_AM.0.png\",\"title\":\"\",\"email\":\"\",\"short_author_bio\":\"is a senior writer at Future Perfect, Vox\u2019s effective altruism-inspired section on the world\u2019s biggest challenges. She explores wide-ranging topics like climate change, artificial intelligence, vaccine development, and factory farms, and also writes the Future Perfect newsletter.\"}],\"byline_enabled\":true,\"byline_credit_text\":\"By\",\"byline_serial_comma_enabled\":true,\"comment_count\":0,\"comments_enabled\":false,\"legacy_comments_enabled\":false,\"coral_comments_enabled\":false,\"coral_comment_counts_enabled\":false,\"commerce_disclosure\":null,\"community_name\":\"Vox\",\"community_url\":\"https:\/\/www.vox.com\/\",\"community_logo\":\"rn<svg width=\"386px\" height=\"385px\" viewBox=\"0 0 386 385\" version=\"1.1\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" >rn    rn    <title>vox-mark<\/title>rn    rn    <defs><\/defs>rn    <g id=\"Page-1\" stroke=\"none\" stroke-width=\"1\" fill=\"none\" fill-rule=\"evenodd\" >rn        <path d=\"M239.811,0 L238.424,6 L259.374,6 C278.011,6 292.908,17.38 292.908,43.002 C292.908,56.967 287.784,75.469 276.598,96.888 L182.689,305.687 L159.283,35.693 C159.283,13.809 168.134,6 191.88,6 L205.854,6 L207.247,0 L1.409,0 L0,6 L13.049,6 C28.88,6 35.863,15.885 37.264,34.514 L73.611,385 L160.221,385 L304.525,79.217 C328.749,31.719 349.237,6 372.525,6 L384.162,6 L385.557,0 L239.811,0\" id=\"vox-mark\" fill=\"#444745\" ><\/path>rn    <\/g>rn<\/svg>&#8220;,&#8221;cross_community&#8221;:false,&#8221;groups&#8221;:[{&#8220;base_type&#8221;:&#8221;EntryGroup&#8221;,&#8221;id&#8221;:76815,&#8221;timestamp&#8221;:1680184802,&#8221;title&#8221;:&#8221;Future Perfect&#8221;,&#8221;type&#8221;:&#8221;SiteGroup&#8221;,&#8221;url&#8221;:&#8221;https:\/\/www.vox.com\/future-perfect&#8221;,&#8221;slug&#8221;:&#8221;future-perfect&#8221;,&#8221;community_logo&#8221;:&#8221;rn<svg width=\"386px\" height=\"385px\" viewBox=\"0 0 386 385\" version=\"1.1\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" >rn    rn    <title>vox-mark<\/title>rn    rn    <defs><\/defs>rn    <g id=\"Page-1\" stroke=\"none\" stroke-width=\"1\" fill=\"none\" fill-rule=\"evenodd\" >rn        <path d=\"M239.811,0 L238.424,6 L259.374,6 C278.011,6 292.908,17.38 292.908,43.002 C292.908,56.967 287.784,75.469 276.598,96.888 L182.689,305.687 L159.283,35.693 C159.283,13.809 168.134,6 191.88,6 L205.854,6 L207.247,0 L1.409,0 L0,6 L13.049,6 C28.88,6 35.863,15.885 37.264,34.514 L73.611,385 L160.221,385 L304.525,79.217 C328.749,31.719 349.237,6 372.525,6 L384.162,6 L385.557,0 L239.811,0\" id=\"vox-mark\" fill=\"#444745\" ><\/path>rn    <\/g>rn<\/svg>&#8220;,&#8221;community_name&#8221;:&#8221;Vox&#8221;,&#8221;community_url&#8221;:&#8221;https:\/\/www.vox.com\/&#8221;,&#8221;cross_community&#8221;:false,&#8221;entry_count&#8221;:1527,&#8221;always_show&#8221;:false,&#8221;description&#8221;:&#8221;Finding the best ways to do good. &#8220;,&#8221;disclosure&#8221;:&#8221;&#8221;,&#8221;cover_image_url&#8221;:&#8221;&#8221;,&#8221;cover_image&#8221;:null,&#8221;title_image_url&#8221;:&#8221;https:\/\/cdn.vox-cdn.com\/uploads\/chorus_asset\/file\/16290809\/future_perfect_sized.0.jpg&#8221;,&#8221;intro_image&#8221;:null,&#8221;four_up_see_more_text&#8221;:&#8221;View All&#8221;,&#8221;primary&#8221;:true},{&#8220;base_type&#8221;:&#8221;EntryGroup&#8221;,&#8221;id&#8221;:27524,&#8221;timestamp&#8221;:1680260403,&#8221;title&#8221;:&#8221;Technology&#8221;,&#8221;type&#8221;:&#8221;SiteGroup&#8221;,&#8221;url&#8221;:&#8221;https:\/\/www.vox.com\/technology&#8221;,&#8221;slug&#8221;:&#8221;technology&#8221;,&#8221;community_logo&#8221;:&#8221;rn<svg width=\"386px\" height=\"385px\" viewBox=\"0 0 386 385\" version=\"1.1\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" >rn    rn    <title>vox-mark<\/title>rn    rn    <defs><\/defs>rn    <g id=\"Page-1\" stroke=\"none\" stroke-width=\"1\" fill=\"none\" fill-rule=\"evenodd\" >rn        <path d=\"M239.811,0 L238.424,6 L259.374,6 C278.011,6 292.908,17.38 292.908,43.002 C292.908,56.967 287.784,75.469 276.598,96.888 L182.689,305.687 L159.283,35.693 C159.283,13.809 168.134,6 191.88,6 L205.854,6 L207.247,0 L1.409,0 L0,6 L13.049,6 C28.88,6 35.863,15.885 37.264,34.514 L73.611,385 L160.221,385 L304.525,79.217 C328.749,31.719 349.237,6 372.525,6 L384.162,6 L385.557,0 L239.811,0\" id=\"vox-mark\" fill=\"#444745\" ><\/path>rn    <\/g>rn<\/svg>&#8220;,&#8221;community_name&#8221;:&#8221;Vox&#8221;,&#8221;community_url&#8221;:&#8221;https:\/\/www.vox.com\/&#8221;,&#8221;cross_community&#8221;:false,&#8221;entry_count&#8221;:24340,&#8221;always_show&#8221;:false,&#8221;description&#8221;:&#8221;Uncovering and explaining how our digital world is changing \u2014 and changing us.&#8221;,&#8221;disclosure&#8221;:&#8221;&#8221;,&#8221;cover_image_url&#8221;:&#8221;&#8221;,&#8221;cover_image&#8221;:null,&#8221;title_image_url&#8221;:&#8221;&#8221;,&#8221;intro_image&#8221;:null,&#8221;four_up_see_more_text&#8221;:&#8221;View All&#8221;,&#8221;primary&#8221;:false},{&#8220;base_type&#8221;:&#8221;EntryGroup&#8221;,&#8221;id&#8221;:80311,&#8221;timestamp&#8221;:1680184802,&#8221;title&#8221;:&#8221;Artificial Intelligence&#8221;,&#8221;type&#8221;:&#8221;SiteGroup&#8221;,&#8221;url&#8221;:&#8221;https:\/\/www.vox.com\/artificial-intelligence&#8221;,&#8221;slug&#8221;:&#8221;artificial-intelligence&#8221;,&#8221;community_logo&#8221;:&#8221;rn<svg width=\"386px\" height=\"385px\" viewBox=\"0 0 386 385\" version=\"1.1\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" >rn    rn    <title>vox-mark<\/title>rn    rn    <defs><\/defs>rn    <g id=\"Page-1\" stroke=\"none\" stroke-width=\"1\" fill=\"none\" fill-rule=\"evenodd\" >rn        <path d=\"M239.811,0 L238.424,6 L259.374,6 C278.011,6 292.908,17.38 292.908,43.002 C292.908,56.967 287.784,75.469 276.598,96.888 L182.689,305.687 L159.283,35.693 C159.283,13.809 168.134,6 191.88,6 L205.854,6 L207.247,0 L1.409,0 L0,6 L13.049,6 C28.88,6 35.863,15.885 37.264,34.514 L73.611,385 L160.221,385 L304.525,79.217 C328.749,31.719 349.237,6 372.525,6 L384.162,6 L385.557,0 L239.811,0\" id=\"vox-mark\" fill=\"#444745\" ><\/path>rn    <\/g>rn<\/svg>&#8220;,&#8221;community_name&#8221;:&#8221;Vox&#8221;,&#8221;community_url&#8221;:&#8221;https:\/\/www.vox.com\/&#8221;,&#8221;cross_community&#8221;:false,&#8221;entry_count&#8221;:344,&#8221;always_show&#8221;:false,&#8221;description&#8221;:&#8221;Vox&#8217;s coverage of how AI is shaping everything from text and image generation to how we live. &#8220;,&#8221;disclosure&#8221;:&#8221;&#8221;,&#8221;cover_image_url&#8221;:&#8221;&#8221;,&#8221;cover_image&#8221;:null,&#8221;title_image_url&#8221;:&#8221;&#8221;,&#8221;intro_image&#8221;:null,&#8221;four_up_see_more_text&#8221;:&#8221;View All&#8221;,&#8221;primary&#8221;:false},{&#8220;base_type&#8221;:&#8221;EntryGroup&#8221;,&#8221;id&#8221;:102794,&#8221;timestamp&#8221;:1680184802,&#8221;title&#8221;:&#8221;Innovation&#8221;,&#8221;type&#8221;:&#8221;SiteGroup&#8221;,&#8221;url&#8221;:&#8221;https:\/\/www.vox.com\/innovation&#8221;,&#8221;slug&#8221;:&#8221;innovation&#8221;,&#8221;community_logo&#8221;:&#8221;rn<svg width=\"386px\" height=\"385px\" viewBox=\"0 0 386 385\" version=\"1.1\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" >rn    rn    <title>vox-mark<\/title>rn    rn    <defs><\/defs>rn    <g id=\"Page-1\" stroke=\"none\" stroke-width=\"1\" fill=\"none\" fill-rule=\"evenodd\" >rn        <path d=\"M239.811,0 L238.424,6 L259.374,6 C278.011,6 292.908,17.38 292.908,43.002 C292.908,56.967 287.784,75.469 276.598,96.888 L182.689,305.687 L159.283,35.693 C159.283,13.809 168.134,6 191.88,6 L205.854,6 L207.247,0 L1.409,0 L0,6 L13.049,6 C28.88,6 35.863,15.885 37.264,34.514 L73.611,385 L160.221,385 L304.525,79.217 C328.749,31.719 349.237,6 372.525,6 L384.162,6 L385.557,0 L239.811,0\" id=\"vox-mark\" fill=\"#444745\" ><\/path>rn    <\/g>rn<\/svg>&#8220;,&#8221;community_name&#8221;:&#8221;Vox&#8221;,&#8221;community_url&#8221;:&#8221;https:\/\/www.vox.com\/&#8221;,&#8221;cross_community&#8221;:false,&#8221;entry_count&#8221;:141,&#8221;always_show&#8221;:false,&#8221;description&#8221;:&#8221;&#8221;,&#8221;disclosure&#8221;:&#8221;&#8221;,&#8221;cover_image_url&#8221;:&#8221;&#8221;,&#8221;cover_image&#8221;:null,&#8221;title_image_url&#8221;:&#8221;&#8221;,&#8221;intro_image&#8221;:null,&#8221;four_up_see_more_text&#8221;:&#8221;View All&#8221;,&#8221;primary&#8221;:false}],&#8221;internal_groups&#8221;:[{&#8220;base_type&#8221;:&#8221;EntryGroup&#8221;,&#8221;id&#8221;:112405,&#8221;timestamp&#8221;:1680205479,&#8221;title&#8221;:&#8221;Approach \u2014 Explores solutions or ideas to solve problems&#8221;,&#8221;type&#8221;:&#8221;SiteGroup&#8221;,&#8221;url&#8221;:&#8221;&#8221;,&#8221;slug&#8221;:&#8221;approach-explores-solutions-or-ideas-to-solve-problems&#8221;,&#8221;community_logo&#8221;:&#8221;rn<svg width=\"386px\" height=\"385px\" viewBox=\"0 0 386 385\" version=\"1.1\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" xmlns:xlink=\"http:\/\/www.w3.org\/1999\/xlink\" >rn    rn    <title>vox-mark<\/title>rn    rn    <defs><\/defs>rn    <g id=\"Page-1\" stroke=\"none\" stroke-width=\"1\" fill=\"none\" fill-rule=\"evenodd\" >rn        <path d=\"M239.811,0 L238.424,6 L259.374,6 C278.011,6 292.908,17.38 292.908,43.002 C292.908,56.967 287.784,75.469 276.598,96.888 L182.689,305.687 L159.283,35.693 C159.283,13.809 168.134,6 191.88,6 L205.854,6 L207.247,0 L1.409,0 L0,6 L13.049,6 C28.88,6 35.863,15.885 37.264,34.514 L73.611,385 L160.221,385 L304.525,79.217 C328.749,31.719 349.237,6 372.525,6 L384.162,6 L385.557,0 L239.811,0\" id=\"vox-mark\" fill=\"#444745\" ><\/path>rn    <\/g>rn<\/svg>&#8220;,&#8221;community_name&#8221;:&#8221;Vox&#8221;,&#8221;community_url&#8221;:&#8221;https:\/\/www.vox.com\/&#8221;,&#8221;cross_community&#8221;:false,&#8221;entry_count&#8221;:23,&#8221;always_show&#8221;:false,&#8221;description&#8221;:&#8221;&#8221;,&#8221;disclosure&#8221;:&#8221;&#8221;,&#8221;cover_image_url&#8221;:&#8221;&#8221;,&#8221;cover_image&#8221;:null,&#8221;title_image_url&#8221;:&#8221;&#8221;,&#8221;intro_image&#8221;:null,&#8221;four_up_see_more_text&#8221;:&#8221;View All&#8221;}],&#8221;image&#8221;:{&#8220;ratio&#8221;:&#8221;*&#8221;,&#8221;original_url&#8221;:&#8221;https:\/\/cdn.vox-cdn.com\/uploads\/chorus_image\/image\/72127267\/GettyImages_1249214710.0.jpg&#8221;,&#8221;network&#8221;:&#8221;unison&#8221;,&#8221;bgcolor&#8221;:&#8221;white&#8221;,&#8221;pinterest_enabled&#8221;:false,&#8221;caption&#8221;:null,&#8221;credit&#8221;:&#8221;SOPA Images\/LightRocket via Gett&#8221;,&#8221;focal_area&#8221;:{&#8220;top_left_x&#8221;:2100,&#8221;top_left_y&#8221;:1269,&#8221;bottom_right_x&#8221;:2900,&#8221;bottom_right_y&#8221;:2069},&#8221;bounds&#8221;:[0,0,5000,3337],&#8221;uploaded_size&#8221;:{&#8220;width&#8221;:5000,&#8221;height&#8221;:3337},&#8221;focal_point&#8221;:null,&#8221;image_id&#8221;:72127267,&#8221;alt_text&#8221;:&#8221;&#8221;},&#8221;hub_image&#8221;:{&#8220;ratio&#8221;:&#8221;*&#8221;,&#8221;original_url&#8221;:&#8221;https:\/\/cdn.vox-cdn.com\/uploads\/chorus_image\/image\/72127267\/GettyImages_1249214710.0.jpg&#8221;,&#8221;network&#8221;:&#8221;unison&#8221;,&#8221;bgcolor&#8221;:&#8221;white&#8221;,&#8221;pinterest_enabled&#8221;:false,&#8221;caption&#8221;:null,&#8221;credit&#8221;:&#8221;SOPA Images\/LightRocket via Gett&#8221;,&#8221;focal_area&#8221;:{&#8220;top_left_x&#8221;:2100,&#8221;top_left_y&#8221;:1269,&#8221;bottom_right_x&#8221;:2900,&#8221;bottom_right_y&#8221;:2069},&#8221;bounds&#8221;:[0,0,5000,3337],&#8221;uploaded_size&#8221;:{&#8220;width&#8221;:5000,&#8221;height&#8221;:3337},&#8221;focal_point&#8221;:null,&#8221;image_id&#8221;:72127267,&#8221;alt_text&#8221;:&#8221;&#8221;},&#8221;lede_image&#8221;:{&#8220;ratio&#8221;:&#8221;*&#8221;,&#8221;original_url&#8221;:&#8221;https:\/\/cdn.vox-cdn.com\/uploads\/chorus_image\/image\/72127270\/GettyImages_1249214710.0.jpg&#8221;,&#8221;network&#8221;:&#8221;unison&#8221;,&#8221;bgcolor&#8221;:&#8221;white&#8221;,&#8221;pinterest_enabled&#8221;:false,&#8221;caption&#8221;:null,&#8221;credit&#8221;:&#8221;SOPA Images\/LightRocket via Gett&#8221;,&#8221;focal_area&#8221;:{&#8220;top_left_x&#8221;:2100,&#8221;top_left_y&#8221;:1269,&#8221;bottom_right_x&#8221;:2900,&#8221;bottom_right_y&#8221;:2069},&#8221;bounds&#8221;:[0,0,5000,3337],&#8221;uploaded_size&#8221;:{&#8220;width&#8221;:5000,&#8221;height&#8221;:3337},&#8221;focal_point&#8221;:null,&#8221;image_id&#8221;:72127270,&#8221;alt_text&#8221;:&#8221;&#8221;},&#8221;group_cover_image&#8221;:null,&#8221;picture_standard_lead_image&#8221;:{&#8220;ratio&#8221;:&#8221;*&#8221;,&#8221;original_url&#8221;:&#8221;https:\/\/cdn.vox-cdn.com\/uploads\/chorus_image\/image\/72127270\/GettyImages_1249214710.0.jpg&#8221;,&#8221;network&#8221;:&#8221;unison&#8221;,&#8221;bgcolor&#8221;:&#8221;white&#8221;,&#8221;pinterest_enabled&#8221;:false,&#8221;caption&#8221;:null,&#8221;credit&#8221;:&#8221;SOPA Images\/LightRocket via Gett&#8221;,&#8221;focal_area&#8221;:{&#8220;top_left_x&#8221;:2100,&#8221;top_left_y&#8221;:1269,&#8221;bottom_right_x&#8221;:2900,&#8221;bottom_right_y&#8221;:2069},&#8221;bounds&#8221;:[0,0,5000,3337],&#8221;uploaded_size&#8221;:{&#8220;width&#8221;:5000,&#8221;height&#8221;:3337},&#8221;focal_point&#8221;:null,&#8221;image_id&#8221;:72127270,&#8221;alt_text&#8221;:&#8221;&#8221;,&#8221;picture_element&#8221;:{&#8220;html&#8221;:{},&#8221;alt&#8221;:&#8221;&#8221;,&#8221;default&#8221;:{&#8220;srcset&#8221;:&#8221;https:\/\/cdn.vox-cdn.com\/thumbor\/wV3tQxyxryDFrsomWmXRV0Ps-mg=\/0x0:5000&#215;3337\/320&#215;240\/filters:focal(2100&#215;1269:2900&#215;2069)\/cdn.vox-cdn.com\/uploads\/chorus_image\/image\/72127270\/GettyImages_1249214710.0.jpg 320w, https:\/\/cdn.vox-cdn.com\/thumbor\/4NOA8AnqKURgbcJz9kBjA5SZBcw=\/0x0:5000&#215;3337\/620&#215;465\/filters:focal(2100&#215;1269:2900&#215;2069)\/cdn.vox-cdn.com\/uploads\/chorus_image\/image\/72127270\/GettyImages_1249214710.0.jpg 620w, https:\/\/cdn.vox-cdn.com\/thumbor\/-NTZfdCm4hVfG_rglNASNDLUT08=\/0x0:5000&#215;3337\/920&#215;690\/filters:focal(2100&#215;1269:2900&#215;2069)\/cdn.vox-cdn.com\/uploads\/chorus_image\/image\/72127270\/GettyImages_1249214710.0.jpg 920w, https:\/\/cdn.vox-cdn.com\/thumbor\/H6SSSfSetTEViabkg3L9EDSDXzk=\/0x0:5000&#215;3337\/1220&#215;915\/filters:focal(2100&#215;1269:2900&#215;2069)\/cdn.vox-cdn.com\/uploads\/chorus_image\/image\/72127270\/GettyImages_1249214710.0.jpg 1220w, https:\/\/cdn.vox-cdn.com\/thumbor\/d4bUImjy3jfWML4aPYfaxgoeFkU=\/0x0:5000&#215;3337\/1520&#215;1140\/filters:focal(2100&#215;1269:2900&#215;2069)\/cdn.vox-cdn.com\/uploads\/chorus_image\/image\/72127270\/GettyImages_1249214710.0.jpg 1520w&#8221;,&#8221;webp_srcset&#8221;:&#8221;https:\/\/cdn.vox-cdn.com\/thumbor\/lKOCFD6agdbVIQv_VmfKtxjEBCQ=\/0x0:5000&#215;3337\/320&#215;240\/filters:focal(2100&#215;1269:2900&#215;2069):format(webp)\/cdn.vox-cdn.com\/uploads\/chorus_image\/image\/72127270\/GettyImages_1249214710.0.jpg 320w, https:\/\/cdn.vox-cdn.com\/thumbor\/Roo8ahsMSvpWc4O0YXXd-wpWD6A=\/0x0:5000&#215;3337\/620&#215;465\/filters:focal(2100&#215;1269:2900&#215;2069):format(webp)\/cdn.vox-cdn.com\/uploads\/chorus_image\/image\/72127270\/GettyImages_1249214710.0.jpg 620w, https:\/\/cdn.vox-cdn.com\/thumbor\/ammjXN_v0ungYi4hDpZpkWEHkWE=\/0x0:5000&#215;3337\/920&#215;690\/filters:focal(2100&#215;1269:2900&#215;2069):format(webp)\/cdn.vox-cdn.com\/uploads\/chorus_image\/image\/72127270\/GettyImages_1249214710.0.jpg 920w, https:\/\/cdn.vox-cdn.com\/thumbor\/HpSV7qwoSQI4-XbnZUS2dWexfZ0=\/0x0:5000&#215;3337\/1220&#215;915\/filters:focal(2100&#215;1269:2900&#215;2069):format(webp)\/cdn.vox-cdn.com\/uploads\/chorus_image\/image\/72127270\/GettyImages_1249214710.0.jpg 1220w, https:\/\/cdn.vox-cdn.com\/thumbor\/F8ATNgKnw_N0hoHQPryvZ_Hub5A=\/0x0:5000&#215;3337\/1520&#215;1140\/filters:focal(2100&#215;1269:2900&#215;2069):format(webp)\/cdn.vox-cdn.com\/uploads\/chorus_image\/image\/72127270\/GettyImages_1249214710.0.jpg 1520w&#8221;,&#8221;media&#8221;:null,&#8221;sizes&#8221;:&#8221;(min-width: 809px) 485px, (min-width: 600px) 60vw, 100vw&#8221;,&#8221;fallback&#8221;:&#8221;https:\/\/cdn.vox-cdn.com\/thumbor\/O062Z5JY3SWZ2n5mN0uuY2GVWhk=\/0x0:5000&#215;3337\/1200&#215;900\/filters:focal(2100&#215;1269:2900&#215;2069)\/cdn.vox-cdn.com\/uploads\/chorus_image\/image\/72127270\/GettyImages_1249214710.0.jpg&#8221;},&#8221;art_directed&#8221;:[]}},&#8221;image_is_placeholder&#8221;:false,&#8221;image_is_hidden&#8221;:false,&#8221;network&#8221;:&#8221;vox&#8221;,&#8221;omits_labels&#8221;:true,&#8221;optimizable&#8221;:false,&#8221;promo_headline&#8221;:&#8221;How to test what an AI model can \u2014 and shouldn\u2019t \u2014 do&#8221;,&#8221;recommended_count&#8221;:0,&#8221;recs_enabled&#8221;:false,&#8221;slug&#8221;:&#8221;future-perfect\/2023\/3\/29\/23661633\/gpt-4-openai-alignment-research-center-open-philanthropy-ai-safety&#8221;,&#8221;dek&#8221;:&#8221;Inside the labs that helps evaluate AI safety for models like GPT-4&#8243;,&#8221;homepage_title&#8221;:&#8221;How to test what an AI model can \u2014 and shouldn\u2019t \u2014 do&#8221;,&#8221;homepage_description&#8221;:&#8221;Inside the labs that helps evaluate AI safety for models like GPT-4&#8243;,&#8221;show_homepage_description&#8221;:false,&#8221;title_display&#8221;:&#8221;How to test what an AI model can \u2014 and shouldn\u2019t \u2014 do&#8221;,&#8221;pull_quote&#8221;:null,&#8221;voxcreative&#8221;:false,&#8221;show_entry_time&#8221;:true,&#8221;show_dates&#8221;:true,&#8221;paywalled_content&#8221;:false,&#8221;paywalled_content_box_logo_url&#8221;:&#8221;&#8221;,&#8221;paywalled_content_page_logo_url&#8221;:&#8221;&#8221;,&#8221;paywalled_content_main_url&#8221;:&#8221;&#8221;,&#8221;article_footer_body&#8221;:&#8221;Vox&#8217;s journalism is free because we believe that everyone deserves to understand the world that they live in. That kind of knowledge helps create better citizens, neighbors, friends, parents, consumers and stewards of this planet. In short, understanding benefits everyone. You can join in on this mission by making a financial gift to Vox today. Reader support helps keep our work free, for everyone. <http:\/\/vox.com\/pages\/support-now?itm_campaign=better-stewards&#038;itm_medium=site&#038;itm_source=article-footer>Will you join us? <\/a>&#8220;,&#8221;article_footer_header&#8221;:&#8221;<a href=\"http:\/\/vox.com\/pages\/support-now?itm_campaign=47-annual&#038;itm_medium=site&#038;itm_source=article-footer\">We have a request<\/a>&#8220;,&#8221;use_article_footer&#8221;:true,&#8221;article_footer_cta_annual_plans&#8221;:&#8221;{rn  &#8220;default_plan&#8221;: 1,rn  &#8220;plans&#8221;: [rn    {rn      &#8220;amount&#8221;: 95,rn      &#8220;plan_id&#8221;: 74295rn    },rn    {rn      &#8220;amount&#8221;: 120,rn      &#8220;plan_id&#8221;: 81108rn    },rn    {rn      &#8220;amount&#8221;: 250,rn      &#8220;plan_id&#8221;: 77096rn    },rn    {rn      &#8220;amount&#8221;: 350,rn      &#8220;plan_id&#8221;: 92038rn    }rn  ]rn}&#8221;,&#8221;article_footer_cta_button_annual_copy&#8221;:&#8221;year&#8221;,&#8221;article_footer_cta_button_copy&#8221;:&#8221;Yes, I&#8217;ll give&#8221;,&#8221;article_footer_cta_button_monthly_copy&#8221;:&#8221;month&#8221;,&#8221;article_footer_cta_default_frequency&#8221;:&#8221;annual&#8221;,&#8221;article_footer_cta_monthly_plans&#8221;:&#8221;{rn  &#8220;default_plan&#8221;: 1,rn  &#8220;plans&#8221;: [rn    {rn      &#8220;amount&#8221;: 9,rn      &#8220;plan_id&#8221;: 77780rn    },rn    {rn      &#8220;amount&#8221;: 20,rn      &#8220;plan_id&#8221;: 69279rn    },rn    {rn      &#8220;amount&#8221;: 50,rn      &#8220;plan_id&#8221;: 46947rn    },rn    {rn      &#8220;amount&#8221;: 100,rn      &#8220;plan_id&#8221;: 46782rn    }rn  ]rn}&#8221;,&#8221;article_footer_cta_once_plans&#8221;:&#8221;{rn  &#8220;default_plan&#8221;: 0,rn  &#8220;plans&#8221;: [rn    {rn      &#8220;amount&#8221;: 20,rn      &#8220;plan_id&#8221;: 69278rn    },rn    {rn      &#8220;amount&#8221;: 50,rn      &#8220;plan_id&#8221;: 48880rn    },rn    {rn      &#8220;amount&#8221;: 100,rn      &#8220;plan_id&#8221;: 46607rn    },rn    {rn      &#8220;amount&#8221;: 250,rn      &#8220;plan_id&#8221;: 46946rn    }rn  ]rn}&#8221;,&#8221;use_article_footer_cta_read_counter&#8221;:true,&#8221;use_article_footer_cta&#8221;:true,&#8221;featured_placeable&#8221;:false,&#8221;video_placeable&#8221;:false,&#8221;disclaimer&#8221;:null,&#8221;volume_placement&#8221;:&#8221;lede&#8221;,&#8221;video_autoplay&#8221;:false,&#8221;youtube_url&#8221;:&#8221;http:\/\/bit.ly\/voxyoutube&#8221;,&#8221;facebook_video_url&#8221;:&#8221;&#8221;,&#8221;play_in_modal&#8221;:true,&#8221;user_preferences_for_privacy_enabled&#8221;:false,&#8221;show_branded_logos&#8221;:true}&#8221;><\/p>\n<div>\n<p><strong><a href=\"http:\/\/vox.com\/pages\/support-now?itm_campaign=47-annual&#038;itm_medium=site&#038;itm_source=article-footer\">We have a request<\/a><\/strong><\/p>\n<p>\n      Vox&#8217;s journalism is free because we believe that everyone deserves to understand the world that they live in. That kind of knowledge helps create better citizens, neighbors, friends, parents, consumers and stewards of this planet. In short, understanding benefits everyone. You can join in on this mission by making a financial gift to Vox today. Reader support helps keep our work free, for everyone. <http:>Will you join us?<br \/>\n    <\/http:><\/p>\n<\/p><\/div>\n<div>\n<div>\n<p><label tabindex=\"0\" role=\"radio\" aria-checked=\"true\"><\/p>\n<p>\n                  <span>$95<\/span><span>\/year<\/span>\n                <\/p>\n<p>              <\/label><\/p>\n<p>              <label tabindex=\"0\" role=\"radio\" aria-checked=\"true\"><\/p>\n<p>\n                  <span>$120<\/span><span>\/year<\/span>\n                <\/p>\n<p>              <\/label><\/p>\n<p>              <label tabindex=\"0\" role=\"radio\" aria-checked=\"true\"><\/p>\n<p>\n                  <span>$250<\/span><span>\/year<\/span>\n                <\/p>\n<p>              <\/label><\/p>\n<p>              <label tabindex=\"0\" role=\"radio\" aria-checked=\"true\"><\/p>\n<p>\n                  <span>$350<\/span><span>\/year<\/span>\n                <\/p>\n<p>              <\/label><\/p>\n<p>            <label tabindex=\"0\"><\/p>\n<p>              <span>Other<\/span><br \/>\n            <\/label>\n          <\/p>\n<\/p><\/div>\n<p>        <a href=\"https:\/\/vox.memberful.com\/checkout?plan=\" id=\"contribute--submit\"><\/p>\n<p>\n            Yes, I&#8217;ll give $120<span>\/year<\/span>\n          <\/p>\n<p>        <\/a><\/p>\n<p>\n          Yes, I&#8217;ll give $120<span>\/year<\/span>\n        <\/p>\n<div>\n<p>\n              <span><br \/>\n                We accept credit card, Apple Pay, and<br \/>\n              <\/span><br \/>\n              <span><br \/>\n                Google Pay. You can also contribute via<br \/>\n              <\/span>\n            <\/p>\n<p><a href=\"https:\/\/www.paypal.com\/donate\/?hosted_button_id=VSP4PYJX98SHL\" target=\"_blank\" rel=\"noopener\"><br \/>\n              <img decoding=\"async\" src=\"https:\/\/cdn.vox-cdn.com\/uploads\/chorus_asset\/file\/22734206\/paypal_logo.png\"><br \/>\n            <\/a>\n          <\/p>\n<\/div><\/div>\n<\/p><\/div>\n<\/div>\n<p><a href=\"https:\/\/www.vox.com\/future-perfect\/2023\/3\/29\/23661633\/gpt-4-openai-alignment-research-center-open-philanthropy-ai-safety\" class=\"button purchase\" rel=\"nofollow noopener\" target=\"_blank\">Read More<\/a><br \/>\n Johnathon Howe<\/p>\n","protected":false},"excerpt":{"rendered":"<p>About six months ago, I decided to make AI a bigger part of how I spend my time as a reporter. The world of AI is evolving very, very fast. New releases seemingly every week are changing what it means to be a programmer, an artist, a teacher, and, most definitely, a journalist. There\u2019s enormous<\/p>\n","protected":false},"author":1,"featured_media":623951,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[38,23337,46],"tags":[],"class_list":{"0":"post-623950","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-model","8":"category-shouldnt","9":"category-technology"},"aioseo_notices":[],"_links":{"self":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts\/623950","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/comments?post=623950"}],"version-history":[{"count":0,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/posts\/623950\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/media\/623951"}],"wp:attachment":[{"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/media?parent=623950"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/categories?post=623950"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/newsycanuse.com\/index.php\/wp-json\/wp\/v2\/tags?post=623950"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}