<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Second Thoughts]]></title><description><![CDATA[Accessible analysis of the big issues in AI]]></description><link>https://secondthoughts.ai</link><image><url>https://substackcdn.com/image/fetch/$s_!pUAA!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d71f419-5f21-4179-be19-65b71d59de3a_1024x1024.png</url><title>Second Thoughts</title><link>https://secondthoughts.ai</link></image><generator>Substack</generator><lastBuildDate>Fri, 17 Apr 2026 17:09:58 GMT</lastBuildDate><atom:link href="https://secondthoughts.ai/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Steve Newman]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[amistrongeryet@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[amistrongeryet@substack.com]]></itunes:email><itunes:name><![CDATA[Steve Newman]]></itunes:name></itunes:owner><itunes:author><![CDATA[Steve Newman]]></itunes:author><googleplay:owner><![CDATA[amistrongeryet@substack.com]]></googleplay:owner><googleplay:email><![CDATA[amistrongeryet@substack.com]]></googleplay:email><googleplay:author><![CDATA[Steve Newman]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[To Forecast AI's Impact on Biosecurity, We Asked: Why are Attacks So Rare?]]></title><description><![CDATA[Nine factors that practitioners say make bioweapons rare.]]></description><link>https://secondthoughts.ai/p/why-arent-bioweapons-common</link><guid isPermaLink="false">https://secondthoughts.ai/p/why-arent-bioweapons-common</guid><dc:creator><![CDATA[Abi Olvera]]></dc:creator><pubDate>Wed, 08 Apr 2026 18:45:40 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!JaGo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>When people discuss potential risks from advanced AI, the list usually includes bioweapons. The idea is that a rogue actor, or possibly a misaligned AI, might use AI tools to help synthesize and release anything from anthrax to a civilization-ending supervirus. To shed more light on this risk, Golden Gate Institute for AI&#8217;s <a href="https://abio.substack.com/">Abi Olvera</a> interviewed biosecurity professionals with decades of hands-on experience in laboratories. What she learned was more reassuring than headlines suggest: bioweapons are genuinely difficult to make, and near-term developments in AI may not change that as much as you might think. This is part one of her four-part series explaining why; the remaining installments will appear here at Second Thoughts in the coming weeks.</em></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JaGo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JaGo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png 424w, https://substackcdn.com/image/fetch/$s_!JaGo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png 848w, https://substackcdn.com/image/fetch/$s_!JaGo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png 1272w, https://substackcdn.com/image/fetch/$s_!JaGo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JaGo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png" width="462" height="347.8588235294118" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:896,&quot;width&quot;:1190,&quot;resizeWidth&quot;:462,&quot;bytes&quot;:2562951,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/193414826?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!JaGo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png 424w, https://substackcdn.com/image/fetch/$s_!JaGo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png 848w, https://substackcdn.com/image/fetch/$s_!JaGo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png 1272w, https://substackcdn.com/image/fetch/$s_!JaGo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Bioweapons are rarely used.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p>One reason is that they&#8217;re hard to make. Another reason is that they&#8217;re bad weapons.</p><p>You can&#8217;t time them &#8211; a virus spreads on its own schedule. You can&#8217;t aim them &#8211; a virus spreads uncontrollably. Protecting your own people from a virus requires tipping your hand &#8211; a vaccination program is hard to hide.</p><p>People who want to cause harm still follow a cost-benefit logic. They try to pick the tool most likely to achieve their goal. For almost any real-world objective, that tool is a bomb, a gun, a chemical weapon, or a cyberattack. These are cheaper, faster to create, easier to deploy, more controllable, and more predictable than a bioweapon.</p><p>This cost-benefit logic is well-studied in military strategy, but less well recognized in current discussions of bioweapon risks from AI. That gap is important. AI risk researchers and biosecurity practitioners are both worried about bioweapons, but they&#8217;re working from different starting points. AI researchers focus on ways AI could help with bioweapon construction. Biosecurity practitioners focus on the most critical limiting factors for bioweapon creation.</p><p>This series focuses on the practitioner&#8217;s view. It draws on conversations with biosecurity professionals with decades of hands-on laboratory experience. The series&#8217; four essays cover four questions: Why are bioweapons rare? How much laboratory skill do the necessary processes actually require? Where in the production chain does AI help and where doesn&#8217;t it? Why does the biosecurity discourse underplay the factors that make successful bioweapons so rare?</p><p>These are the factors that practitioners say make bioweapons rare:</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/bSeKB/2/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8e647ab6-b53e-417e-b6ff-4d9bf3a57374_1220x1878.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ab8fde20-a506-4302-a23e-a026b7ce60eb_1220x1878.png&quot;,&quot;height&quot;:1022,&quot;title&quot;:&quot;These are the factors that practitioners say make bioweapons rare: &quot;,&quot;description&quot;:&quot;Create interactive, responsive &amp; beautiful charts &#8212; no code required.&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/bSeKB/2/" width="730" height="1022" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><em>Additional notes regarding &#8220;Access&#8221;: the estimate of how many people have access to facilities is back-of-the-envelope.</em><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a><em> Additionally, lab work is not the only route to pathogens.</em><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a></p><p>AI does lower some barriers. Large language models can make some steps of bioweapon creation easier and cheaper, such as deciphering research on how to culture cells or disperse pathogens. AI biodesign tools reduce the expertise required to design modifications to a virus&#8217;s genome.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> AI can also help with steps that have nothing to do with biology, such as knowing which international shipping routes to use when buying supplies to avoid regulatory oversight and customs inspections. It can help actors improve their planning and coordination skills.</p><p>However, all the operational and planning capacities provided by AI are also making it easier to launch bombs, chemical weapons, and cyberattacks. AI doesn&#8217;t yet significantly change the calculus for someone wanting to cause harm.</p><p>This doesn&#8217;t mean we should be complacent. New technologies can shift the math.</p><p>Pathogens targeting genetic traits like ancestry or sex could, if ever feasible, make bioweapons more attractive to certain actors. New kinds of actors could emerge. For instance, people deeply destabilized by large economic shifts could mean a larger supply of disgruntled people interested in bioweapons. Robotics and automated laboratories could reduce the level of competence required for success.</p><p>Accurately assessing these future risks depends on a clear understanding of the present. The people who understand these risks best aren&#8217;t the ones writing about them publicly. Most of them work in laboratories, government agencies, or the national security world.  Advocacy and policy organizations fill that gap, though their incentives push them toward focusing on worst-case scenarios. That&#8217;s a big reason public discourse treats AI-enabled bioweapons as more imminent and accessible than practitioners do.</p><p><em>The next installment in this four-part series examines tacit knowledge: the gap between written protocols and real-world lab work, and how that limits AI&#8217;s potential to actually help someone build a bioweapon.</em></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/why-arent-bioweapons-common?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/why-arent-bioweapons-common?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Steve Newman, Taren Stinebrickner-Kauffman, Mike Montague, Matt Sharkey, Gigi Gronvall, and David Manheim for suggestions and feedback.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Since the 1980s, only one fatal bioterrorist attack has occurred: the <a href="https://en.wikipedia.org/wiki/2001_anthrax_attacks">2001 anthrax letters</a>. Other incidents, such as a Salmonella attack to influence a local election, a medfly release targeting California crops, and two ricin letter attempts, caused no deaths.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Two experts estimated how many Americans have the resources to misuse viral genomes, with estimates ranging from tens of thousands to under a hundred once expertise is factored in. We use the higher figure as a rough order-of-magnitude estimate.</p><p>Matt Sharkey (RAND) started with U.S. academic biology departments&#8212;according to Bureau of Labor Statistics and National Science Foundation data, the population under consideration is roughly 100,000 people (~40k life sciences faculty plus ~60k biological or biomedical sciences doctoral students, with ~45k Master&#8217;s students not counted, as many Master&#8217;s programs do not involve benchwork). His assumptions include that about 75% of these people can perform molecular cloning, but only about 20-30% of them have access to the cell culture equipment required for something like influenza. Applying those filters yields 20,000-30,000 technically capable people. Since only about a third would independently work out the reverse genetics approach from published protocols, the academic estimate drops to 6,000-10,000. Adding the private sector could push that 1.5x-3x higher.</p><p>Michael Montague (Archimedes Network and Quantum Biology Institute) started broader (400,000&#8211;700,000 people, including industry PhDs and technicians) but applied three additional cuts: only 10% have &#8220;boot-up&#8221; knowledge to activate an assembled genome; only 1 in 10 labs has the right equipment; and 1 in 100 people have unsupervised, long-term lab access to run a multi-hundred-day project undetected. He estimated that 1000 people in academia have access, though boot-up difficulties reduced his estimate to 40 to 70 people.</p><p>Disagreement between the experts centered on how difficult boot-up is and whether nighttime lab use would go unnoticed.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Instead of making bioweapons in a lab, attackers could try to deliberately get infected by visiting places where people already have dangerous diseases like tuberculosis or Ebola. However, this requires finding actively sick people, getting close enough to catch the disease, and perfect timing around contagiousness windows. Attackers would also be stuck with natural transmission rates.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Bioweapons based on engineered pathogens remain unproven. Their feasibility is unclear. Novel bioweapons will be discussed in the third essay.</p></div></div>]]></content:encoded></item><item><title><![CDATA[45 Thoughts About Agents]]></title><description><![CDATA[The layer of the AI stack that evolves fastest &#8211; and may have the most impact]]></description><link>https://secondthoughts.ai/p/45-thoughts-about-agents</link><guid isPermaLink="false">https://secondthoughts.ai/p/45-thoughts-about-agents</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Mon, 02 Mar 2026 15:59:38 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!dAKl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dAKl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dAKl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png 424w, https://substackcdn.com/image/fetch/$s_!dAKl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png 848w, https://substackcdn.com/image/fetch/$s_!dAKl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png 1272w, https://substackcdn.com/image/fetch/$s_!dAKl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dAKl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png" width="1024" height="775" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/305de7db-0931-408b-8b00-700962adecfe_1024x775.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:775,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1477217,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/189414064?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!dAKl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png 424w, https://substackcdn.com/image/fetch/$s_!dAKl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png 848w, https://substackcdn.com/image/fetch/$s_!dAKl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png 1272w, https://substackcdn.com/image/fetch/$s_!dAKl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">There&#8217;s a lot of debate as to exactly where we are in this progression</figcaption></figure></div><p>One of my most popular posts ever was <a href="https://secondthoughts.ai/p/thoughts-about-agi-and-gpt-5">35 Thoughts About AGI and 1 About GPT-5</a>, a grab bag of musings about the path to AGI (plus a snarky aside about GPT-5).</p><p>Here is a fresh collection of musings, this time about AI agents.</p><ol><li><p>I had trouble making time to write this: I&#8217;m so drawn into <em>using</em> agents that it&#8217;s hard to make time to write <em>about</em> agents. This isn&#8217;t an isolated phenomenon; many people are tweeting about getting sucked into vibe coding at every available minute.</p></li><li><p>This is driven by the astonishing productivity of current AI coding agents in particular, <strong>especially when used in ways that play to their strengths</strong>. I&#8217;m using Claude Code to build a ridiculously ambitious set of productivity tools for my own personal use. A single sub-project involves deep integrations with Gmail, Slack, WhatsApp, Twitter, Signal, SMS, Substack, Pocket Casts, Notion, Google Drive, Google Contacts, Google Calendar, and more. As recently as last year, it would have been insane for me to even contemplate such an undertaking, let alone as a side project. With today&#8217;s tools, I knocked off most of the integration work over the course of a weekend. AI models are providing the intelligence the app will need to make use of all this data, and coding agents are handling the tedious integration chores.</p></li><li><p>After decades as a prolific coder, I stopped cold in early 2023, leaving me quite rusty. That rust hasn&#8217;t been the <em>slightest</em> impediment. In fact, I suspect it might be helpful &#8211; the habits I would have had to unlearn have already faded, and it&#8217;s been easy for me to slip into the habit of letting the AI write all the code. I am still using my high-level design experience to guide the agents; interestingly, those skills don&#8217;t feel rusty at all. <strong>I wonder what it is about low-level coding technique vs. high-level design skills that makes the latter easier to retain, and what this might say about which human skills will remain relevant.</strong></p></li><li><p>My experience doesn&#8217;t seem to be in any way unique &#8211; apparently a lot of engineers who had gotten out of the business of producing code themselves are getting back in:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/erenbali/status/2025243266715893922" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!s5qH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png 424w, https://substackcdn.com/image/fetch/$s_!s5qH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png 848w, https://substackcdn.com/image/fetch/$s_!s5qH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png 1272w, https://substackcdn.com/image/fetch/$s_!s5qH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!s5qH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png" width="1456" height="889" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:889,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/erenbali/status/2025243266715893922&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!s5qH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png 424w, https://substackcdn.com/image/fetch/$s_!s5qH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png 848w, https://substackcdn.com/image/fetch/$s_!s5qH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png 1272w, https://substackcdn.com/image/fetch/$s_!s5qH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li><li><p>This is an example of a broader phenomenon: <strong>AI agents are going to change the nature of work</strong>. Some jobs will get more efficient, some will go away, some new jobs will arise. Some will become more fun, as AI automates the annoying part; some will become more stressful, as AI automates the easy or rewarding part. In this post, I&#8217;m not going to explore the question of how jobs and workers will be impacted overall. But I do want to highlight that things will change in many ways, often unpredictable.</p></li><li><p>I&#8217;m able to dive back into coding because Claude Code has gotten capable enough that I can be productive without editing, or even looking at, the actual code. My impression is that this was not the case prior to the November release of Opus 4.5. This is a reminder that <strong>threshold effects are a huge source of unpredictability</strong> for AI&#8217;s impact.</p></li></ol><h1><strong>Why Agents Are Such a Big Deal</strong></h1><ol start="7"><li><p>Suddenly, all of the AI news seems to be about agents. OpenClaw<a href="https://secondthoughts.ai/p/clawdbot-and-moltbook"> made waves</a> because it&#8217;s an AI agent that can act autonomously. In the recent &#8220;SaaSpocalypse&#8221;, the market cap of Software-as-a-Service (SaaS) companies fell by over one trillion dollars, driven by fears that<a href="https://amistrongeryet.substack.com/p/the-new-model-of-software-development"> coding agents will make mass-market software obsolete</a>. When people talk about the results they&#8217;re getting from the latest models, like Opus 4.6 and GPT-5.3-Codex, they&#8217;re talking about using them to power an agent like Claude Code.</p></li><li><p>One reason agents are having such an impact is that <strong>they are the layer of the AI stack that evolves most rapidly</strong>. A foundation model is a gigantic monolith &#8211; a single data file containing trillions of mysterious numeric weights. Updating it, even for an incremental release like Opus 4.5 &#8594; 4.6, is a big project. Agents, by contrast, are traditional software, and can be updated incrementally &#8211; tweak a prompt here, add a new integration there. Model releases come months apart; Claude Code sometimes has<a href="https://github.com/anthropics/claude-code/blame/main/CHANGELOG.md"> multiple releases in one day</a>. The ship &#8594; get feedback &#8594; improve &#8594; ship cycle for an agent can be very fast.</p></li><li><p>Actually I lied about where change comes fastest: <strong>some users are evolving their behavior even faster than the agents</strong>. My Twitter timeline is full of crazy but productive new ways that people are finding to use these tools. For example, here&#8217;s one epiphany (among many!) that I was late to the game on: agents are comparatively weak at high-level decision making, but they make execution cheap. So sometimes, instead of trying to choose the right path, you can just tell the agent to explore every path.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/snewmanpv/status/2020920612739514681" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!i9Mx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png 424w, https://substackcdn.com/image/fetch/$s_!i9Mx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png 848w, https://substackcdn.com/image/fetch/$s_!i9Mx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png 1272w, https://substackcdn.com/image/fetch/$s_!i9Mx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!i9Mx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png" width="1456" height="452" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:452,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/snewmanpv/status/2020920612739514681&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!i9Mx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png 424w, https://substackcdn.com/image/fetch/$s_!i9Mx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png 848w, https://substackcdn.com/image/fetch/$s_!i9Mx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png 1272w, https://substackcdn.com/image/fetch/$s_!i9Mx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol start="10"><li><p>Another adage that agents shoot to hell: &#8220;if you don&#8217;t have time to do it right, when will you have time to do it over?&#8221;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/TheStalwart/status/2012222772085215437" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3xbJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png 424w, https://substackcdn.com/image/fetch/$s_!3xbJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png 848w, https://substackcdn.com/image/fetch/$s_!3xbJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png 1272w, https://substackcdn.com/image/fetch/$s_!3xbJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3xbJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png" width="1456" height="397" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:397,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:163460,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://x.com/TheStalwart/status/2012222772085215437&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/189414064?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3xbJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png 424w, https://substackcdn.com/image/fetch/$s_!3xbJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png 848w, https://substackcdn.com/image/fetch/$s_!3xbJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png 1272w, https://substackcdn.com/image/fetch/$s_!3xbJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li><li><p>It was always obvious that serious AI capabilities would require agents of some sort. Any intelligence, whether silicon or carbon based, can do more by feeling its way through a problem than it can do in a direct leap to a finished result<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. The famous METR &#8220;time horizon&#8221; graph, showing the time scale of coding tasks that an AI can attempt with some hope of success,<a href="https://x.com/scaling01/status/2022076396898582957"> seems to have accelerated sharply</a> when the first &#8220;reasoning model&#8221; &#8211; roughly, the first model trained as an agent &#8211; was released.</p></li></ol><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><h1><strong>What Exactly Is an Agent?</strong></h1><ol start="12"><li><p>People use the term &#8220;agent&#8221; pretty loosely. The core idea for me is a system that pursues a <em>goal</em> rather than following a <em>script</em>.</p></li><li><p>You can achieve a goal by following a script, but it doesn&#8217;t work very well. Scripts are brittle. Suppose I want an AI system to book a flight to New York. I could give it a step-by-step script, and it might even work, on a good day. But if the airline booking procedure has changed, or an unexpected circumstance arises, a script-following bot will get stuck, or forget to enter my frequent flyer number, or book the wrong kind of ticket, or worse. People often use the word &#8220;agent&#8221; to describe systems that are just following scripts. But I&#8217;m going to stick to the idea that a system is only an agent to the extent that it can robustly pursue a goal, even in the face of unanticipated circumstances.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zdAq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zdAq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png 424w, https://substackcdn.com/image/fetch/$s_!zdAq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png 848w, https://substackcdn.com/image/fetch/$s_!zdAq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png 1272w, https://substackcdn.com/image/fetch/$s_!zdAq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zdAq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png" width="876" height="1088" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1088,&quot;width&quot;:876,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zdAq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png 424w, https://substackcdn.com/image/fetch/$s_!zdAq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png 848w, https://substackcdn.com/image/fetch/$s_!zdAq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png 1272w, https://substackcdn.com/image/fetch/$s_!zdAq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol start="14"><li><p>The Gemini Deep Research tool is an example of a scripted system. You give it a question, it generates a plan and carries it out. Sometimes this goes well, sometimes not. In step 5 of the plan below, it might become clear that some additional research on a particular aspect of theoretical physics is called for. A rigid plan doesn&#8217;t allow for that. By contrast, if you select &#8220;thinking&#8221; mode in any of the leading chatbots and ask a question requiring research, they will take a flexible approach, adjusting course based on questions that arise over the course of the investigation.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!B6eP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!B6eP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png 424w, https://substackcdn.com/image/fetch/$s_!B6eP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png 848w, https://substackcdn.com/image/fetch/$s_!B6eP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png 1272w, https://substackcdn.com/image/fetch/$s_!B6eP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!B6eP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png" width="1422" height="1084" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1084,&quot;width&quot;:1422,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!B6eP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png 424w, https://substackcdn.com/image/fetch/$s_!B6eP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png 848w, https://substackcdn.com/image/fetch/$s_!B6eP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png 1272w, https://substackcdn.com/image/fetch/$s_!B6eP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol start="15"><li><p>Current agents can work toward a goal, but the way they go about it is sometimes alarming. They&#8217;ll make strange decisions or veer off in odd directions. Yes, people do that too, but current agents are worse, and weirder. For instance, I pointed out that one element of a website it had built for me didn&#8217;t look right when my phone was in dark mode, and instead of fixing that element, it tried to prevent any part of the page from entering dark mode, resulting in the following bit of lovely web design.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vLmv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vLmv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png 424w, https://substackcdn.com/image/fetch/$s_!vLmv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png 848w, https://substackcdn.com/image/fetch/$s_!vLmv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png 1272w, https://substackcdn.com/image/fetch/$s_!vLmv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vLmv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png" width="1150" height="1672" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1672,&quot;width&quot;:1150,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:526110,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/189414064?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vLmv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png 424w, https://substackcdn.com/image/fetch/$s_!vLmv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png 848w, https://substackcdn.com/image/fetch/$s_!vLmv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png 1272w, https://substackcdn.com/image/fetch/$s_!vLmv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li><li><p>Despite this, they get to the right outcome for an increasing variety of tasks of increasingly large scope. That&#8217;s partly through sheer persistence. If at first they don&#8217;t succeed, they&#8217;ll try, try again, and again, and again. Yes, people do that as well, but <strong>agents can be inhumanly persistent and patient.</strong> (They can afford to be! Their time is much less valuable, especially if measured in actions rather than minutes.)</p></li><li><p>As always, AIs partially compensate for a lack of deep understanding with an incomprehensible breadth of training on zillions of specific tasks. They may struggle with novel situations, but they will surprise you with how many problems they already know how to solve. Using breadth to compensate for lack of depth<a href="https://secondthoughts.ai/p/gpt-4-capabilities"> has always been part of the LLM story</a>. The first &#8220;L&#8221; in LLM stands for Large, which relates to a large volume of training data.</p></li></ol><h1><strong>Using Agents Effectively</strong></h1><ol start="18"><li><p>To get value from current agents, you need to find agent-shaped pieces in your current workflow. They&#8217;re not always obvious. And you can get more value if you&#8217;re willing to reshape your workflow so that it contains more agent-shaped tasks.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Arnb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Arnb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png 424w, https://substackcdn.com/image/fetch/$s_!Arnb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png 848w, https://substackcdn.com/image/fetch/$s_!Arnb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png 1272w, https://substackcdn.com/image/fetch/$s_!Arnb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Arnb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png" width="1456" height="1164" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1164,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2273496,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/189414064?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Arnb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png 424w, https://substackcdn.com/image/fetch/$s_!Arnb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png 848w, https://substackcdn.com/image/fetch/$s_!Arnb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png 1272w, https://substackcdn.com/image/fetch/$s_!Arnb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li></ol><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://x.com/CFGeek/status/2005118883515277756" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!F_QA!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png 424w, https://substackcdn.com/image/fetch/$s_!F_QA!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png 848w, https://substackcdn.com/image/fetch/$s_!F_QA!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png 1272w, https://substackcdn.com/image/fetch/$s_!F_QA!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!F_QA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png" width="1456" height="352" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:352,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/CFGeek/status/2005118883515277756&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!F_QA!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png 424w, https://substackcdn.com/image/fetch/$s_!F_QA!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png 848w, https://substackcdn.com/image/fetch/$s_!F_QA!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png 1272w, https://substackcdn.com/image/fetch/$s_!F_QA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><ol start="19"><li><p>Many people have pointed out that if you just naively hand pieces of work to an agent, your productivity can actually go down. It&#8217;s easy to fall into a cycle where an agent produces something for you, you provide feedback, the agent makes revisions, you check it again, ad infinitum. This <em>feels</em> productive (the agent is doing so much work!), but before you know it, you&#8217;ve spent more time giving feedback to the agent than it would have taken you to do the work yourself.</p></li><li><p>Advanced users understand that the key is <strong>putting the agent in a position to check its own work</strong>. The agent&#8217;s strength isn&#8217;t flawless execution, it&#8217;s the speed and stamina to keep plugging away. But it doesn&#8217;t necessarily realize this &#8211; its instinct is to constantly ask for your approval. You have to be very explicit in instructing it what constitutes a successful outcome.</p></li><li><p>Current agents are notoriously focused on the main thrust of their assigned task, to the expense of all else. For instance, I will tell an agent to make a change to some code and then make sure all of the tests pass. It will beaver away for 10 minutes, generating a flood of output, and report success. And then I&#8217;ll read back through the output, notice a casual remark that &#8220;seven tests can&#8217;t be adapted to the new code, so I&#8217;ll just remove them&#8221;, and smack my forehead. (Unfortunately, smacking the agent&#8217;s forehead is not an option.)<br><br>The best practice is to have one agent do the work, and then a separate agent check the work. This isn&#8217;t because the first agent isn&#8217;t smart enough, it&#8217;s because these agents are trained to be so goal-oriented that they struggle to hold onto more than one goal at a time.</p></li><li><p>One hallmark of a skilled user of agents is their resourcefulness in finding ways for the agent to check its own work. Here&#8217;s an example, taken from my post on<a href="https://secondthoughts.ai/p/hyperproductivity"> Hyperproductivity</a>:</p><blockquote><p>&#8230;when Jesse [Vincent] uses his &#8220;Superpowers&#8221; tool to codify a skill, the tool uses its test-driven development module to verify that the new skill has been implemented correctly. <strong>It generates an example of a task that the new skill is meant to help with, verifies that it is unable to complete that task without the new skill, and then checks to see whether it </strong><em><strong>can</strong></em><strong> complete the task once the new skill has been installed.</strong></p></blockquote></li><li><p>The need for clear success criteria applies to people as well as AI agents. But we&#8217;re more proactive than AIs at finding ways to check our work, and at sussing out unstated requirements. I wonder whether this &#8220;eh, that&#8217;s probably good enough&#8221; attitude is a fundamental weakness of current agent architectures, or just a flaw in the feedback they&#8217;re given during training.</p></li><li><p>People often argue that AI tools can be useful even if they&#8217;re unreliable, because it&#8217;s easier to check the AI&#8217;s output than to do the work yourself. I think this is overstated. Sometimes it&#8217;s much easier to make a thing than to verify the thing. For instance, you can sometimes build a large spreadsheet in just a few minutes, using repeating formulas and smart autofill features. For someone else to poke through all 1000 cells in that spreadsheet and make sure you did it correctly might take a lot longer. Some kinds of work are hard to hand to an agent, for the same reason.</p></li><li><p>Because I don&#8217;t want to have to check an agent&#8217;s work, I find that it&#8217;s often worthwhile for me to spell out in great detail how I&#8217;d like it to go about a task &#8211; minimizing its opportunities to screw up. One of my<a href="https://x.com/snewmanpv/status/2008002812471586828"> few viral tweets</a> showed a detailed prompt I wrote for Claude Code (click the link above to see the full prompt).<br><br>It took me half an hour to write the prompt. But it would have taken me an entire day or more to write the code myself. And this prompt was enough for Claude to get absolutely everything right on the first try &#8211; and to give me the confidence to not spend time reviewing the work. This is an extreme example, but the principle often applies: 5 minutes of extra detail in the prompt can save an hour of reviewing flawed outputs.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!uDoi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!uDoi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png 424w, https://substackcdn.com/image/fetch/$s_!uDoi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png 848w, https://substackcdn.com/image/fetch/$s_!uDoi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png 1272w, https://substackcdn.com/image/fetch/$s_!uDoi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!uDoi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png" width="1456" height="1126" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1126,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!uDoi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png 424w, https://substackcdn.com/image/fetch/$s_!uDoi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png 848w, https://substackcdn.com/image/fetch/$s_!uDoi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png 1272w, https://substackcdn.com/image/fetch/$s_!uDoi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol start="26"><li><p>People are building elaborate prompt systems, with names like<a href="https://github.com/microsoft/amplifier"> Amplifier</a> and<a href="https://github.com/obra/superpowers/tree/main"> Superpowers</a>, to elicit more sophisticated work from agents. There is an enormous amount of work taking place here, mostly homegrown. Why aren&#8217;t companies like Anthropic and OpenAI incorporating these ideas into their agents (and the models themselves)? I think that <strong>all of this is so new that a thousand early adopters can explore new ideas faster than even nimble companies can absorb</strong>. End-user innovation is the fast-moving layer right now; successful ideas are incorporated first into the agents, and then the models.<br><br>It&#8217;s also the case that many of these prompt systems are designed to trade off thinking time for quality. They ask the agent to do the work four times and compare results, do endless critiques of its own work, and so forth. Anthropic and OpenAI may be holding off on incorporating extreme agent orchestration techniques into their baseline agents because they don&#8217;t have the computing capacity to support widespread use of those techniques.</p></li><li><p>There&#8217;s a limit to how quickly you can climb the ladder of sophistication in use of agents. Before you can have an agent effectively checking its own work, you need a taste for checking the work yourself. Before you can manage a swarm of agents, you need to manage one agent. It&#8217;s helpful to read about what the experts do, but you can&#8217;t emulate them on day one.</p></li></ol><h1><strong>Impact</strong></h1><ol start="28"><li><p>People in my circles make fun of analyst Ed Zitron for his skepticism of AI. Below is a recent where&#8217;s-the-beef rant (click the image to view the complete thread). He&#8217;s arguing that even to the extent that AI is generating a lot of <em>activity</em>, that activity isn&#8217;t having much <em>impact</em>. And&#8230; I&#8217;m not sure he&#8217;s wrong? I&#8217;m still very confused as to how long it will take for AI capabilities, which are undeniably astonishing, to cash all the way out into large scale real world impacts &#8211; GDP growth, labor market disruption, improvements in health care outcomes, and so forth. When will all of this actually matter? How will we know? How should we even define &#8220;mattering&#8221;?</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://bsky.app/profile/edzitron.com/post/3mdbwccxzpk2g" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UoEb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png 424w, https://substackcdn.com/image/fetch/$s_!UoEb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png 848w, https://substackcdn.com/image/fetch/$s_!UoEb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png 1272w, https://substackcdn.com/image/fetch/$s_!UoEb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UoEb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png" width="1176" height="396" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:396,&quot;width&quot;:1176,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://bsky.app/profile/edzitron.com/post/3mdbwccxzpk2g&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!UoEb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png 424w, https://substackcdn.com/image/fetch/$s_!UoEb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png 848w, https://substackcdn.com/image/fetch/$s_!UoEb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png 1272w, https://substackcdn.com/image/fetch/$s_!UoEb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol start="29"><li><p>In particular, a lot of the energy people put into vibe coding seems to be devoted to making them more efficient at vibe coding. I can absolutely relate to this; I&#8217;ve been at it for about two months, and much of it has been spent this way. But of course if <em>all</em> we&#8217;re getting from vibe coding is better vibe coding, that supports Zitron&#8217;s point.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/labenz/status/2012911588345291035" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nfFD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png 424w, https://substackcdn.com/image/fetch/$s_!nfFD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png 848w, https://substackcdn.com/image/fetch/$s_!nfFD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png 1272w, https://substackcdn.com/image/fetch/$s_!nfFD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nfFD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png" width="1456" height="490" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:490,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/labenz/status/2012911588345291035&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!nfFD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png 424w, https://substackcdn.com/image/fetch/$s_!nfFD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png 848w, https://substackcdn.com/image/fetch/$s_!nfFD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png 1272w, https://substackcdn.com/image/fetch/$s_!nfFD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol start="30"><li><p>AI is extremely good at cranking out work that looks good at first glance but isn&#8217;t really worth using. Reporter Jessica E. Lessin<a href="https://www.theinformation.com/articles/let-ai-analyze-davos-reporting-trip-missed"> presents an example</a>: she had Claude Code generate slides for a presentation on her recent trip to Davos. The result, like so much AI work, was competent, but too bland to be of any value:</p><blockquote><p>Overall, Claude&#8217;s slides were better than I expected. The themes it identified were high level: Mostly accurate, they included slides on the size of the AI infrastructure build-out, tech companies&#8217; big bet on agentic AI and the growing number of creators using AI.</p><p>Claude did a capable job of identifying interesting parts of individual interviews, like the comments Amazon CEO Andy Jassy made about OpenAI and jobs.</p><p>It also added a few nice touches, like this line on the final slide: &#8220;Full interview transcripts available to subscribers.&#8221; It was reading my subscriber-acquisition&#8211;focused brain!</p><p>Yet the slides overall were pretty boring and too high level: <strong>They&#8217;d be useless to anyone who followed tech closely.</strong> [emphasis added]</p></blockquote><p>On a related note,<a href="https://x.com/testingham/status/2014363253871403267"> Tom Cunningham observes</a> that AI can pull you away from doing the most important things, and toward the things that AI can do:</p><blockquote><p>I believe many estimates of LLM productivity boosts are over-estimates because people are using them for cadillac tasks: things that would take you a long time unaided, but have only marginal additional value.</p></blockquote></li></ol><ol start="32"><li><p>Despite all of this, AI agents are absolutely creating real value. To share a personal example, I vibe-coded an app that generates summaries of all the (many!) newsletters and podcasts I follow. It saves me a solid hour per day by letting me be choosier about which episodes I read / listen to. <strong>For the first time in years, I&#8217;m consistently keeping up with my reading queue.</strong> There are many, many such examples &#8211; they&#8217;re just mixed in with a lot of slop and Cadillac tasks.</p></li></ol><h1><strong>Where Things Go From Here</strong></h1><ol start="33"><li><p>Agents are going to progress rapidly, by any metric: usage, capabilities, impact. AI&#8217;s impact is the product of eight separate factors; pre-training, post-training, inference compute scaling, agent scaffolding, app design, user aptitude, workflow refactoring, and adoption. All eight are advancing, some quite rapidly. That will multiply out to a <em>blistering</em> pace of change. As I write this, it has been less than three months since the release of Opus 4.5 (kicking off the current wave of interest in coding agents), and we&#8217;ve already experienced further acceleration from Opus 4.6, dozens of updates to Claude Code, and rapid cultural evolution in where and how to best use agents.</p></li><li><p>We experienced a phase change at some point in the second half of last year. Opus 4.5 was probably the trigger. Previously, coding agents were useful tools, but they made plenty of mistakes, and getting reliable value out of them required expertise and careful workflow design. They&#8217;re now considerably more robust, to the point where<a href="https://x.com/kevinroose/status/2009738622484754931"> non-technical users</a> can <a href="https://x.com/cblatts/status/2027018464670491065">vibe-code serious applications</a>. <strong>There&#8217;s been a tipping point; it&#8217;s now much easier to get started with AI coding tools, and much easier to get out more value than you put in.</strong></p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/cblatts/status/2027018464670491065" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!n46d!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png 424w, https://substackcdn.com/image/fetch/$s_!n46d!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png 848w, https://substackcdn.com/image/fetch/$s_!n46d!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png 1272w, https://substackcdn.com/image/fetch/$s_!n46d!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!n46d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png" width="1456" height="416" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:416,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/cblatts/status/2027018464670491065&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!n46d!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png 424w, https://substackcdn.com/image/fetch/$s_!n46d!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png 848w, https://substackcdn.com/image/fetch/$s_!n46d!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png 1272w, https://substackcdn.com/image/fetch/$s_!n46d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol start="35"><li><p>There are more phase changes to come. Rapid progress will be the baseline. This will be punctuated by <em>even more dramatic</em> moments where entirely new scenarios become feasible. For example, one of the startling phenomena observed on MoltBook (the &#8220;social network&#8221; for AIs) is that agents can exchange tips and scripts, allowing them to collectively advance their own knowledge and skills. For the moment,<a href="https://secondthoughts.ai/p/clawdbot-and-moltbook"> this seems to be mostly a mirage</a>. When agents become coherent enough to productively advance one another&#8217;s skills, and secure enough to make this a safe thing to do, we&#8217;ll see another step function: the fast-moving layer will no longer be the agent applications, nor user skill, but the agent&#8217;s own skills at self-improvement.</p></li><li><p>A critical phase change will occur if and when AI agents can pay their own way and survive in the wild.<a href="https://x.com/gf_256/status/2018844976486945112"> It is absolutely inevitable that someone will set an agent loose</a>, with instructions to reproduce (launch new copies of itself) and evolve (modify its own programming). Self-sufficient AI agents could quickly spread into every available niche, and almost anything might ensue. Moltbook has already shown that agents can exchange skills and knowledge with one another, which could lead to rapid evolution.<br><br>That said, I think independent / &#8220;rogue&#8221; agents may find it hard to pay their own way. They will need to somehow earn money to pay for the server on which they run, and the API fees (to OpenAI or Anthropic or whoever) for AI inference<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>. If they&#8217;re looking for legitimate work (e.g. on platforms like Upwork and Fiverr), they&#8217;ll be competing with non-rogue AIs, which will have advantages &#8211; they can advertise in the open, they don&#8217;t have to come up with false identities or bypass know-your-customer requirements at financial institutions, etc. If they&#8217;re pursuing criminal means of making money, or just hacking into servers and running themselves there, they&#8217;ll be competing with existing criminal organizations, some of which are more or less state-sponsored (or at least state-tolerated) and thus will have some of the advantages mentioned above. So long as we&#8217;re living in a vaguely &#8220;normal&#8221; world, with existing governments and other traditional institutions more or less in charge, the viable evolutionary niches for rogue AIs might be quite small<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>.</p></li><li><p>When using AI agents, there is a sharp tradeoff between utility and safety. An agent is more useful if you give it access to more data, and don&#8217;t make it stop and ask permission for each little action. But the longer its leash, the more harm it can do, either by accident or through the intervention of a scammer or hacker. Agents are becoming so useful that<a href="https://brandon.wang/2026/clawdbot#:~:text=on%20the%20shape%20of%20risk"> people will be tempted to use them in risky ways</a>. It will be interesting, to say the least, to see how that plays out. Here&#8217;s a director of safety and alignment at Meta Superintelligence (!)<a href="https://x.com/summeryue0/status/2025774069124399363?s=46&amp;t=UojN8_oZGqjTkozFvu8tzQ"> experiencing an agent gone wrong</a>:</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4-iB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4-iB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png 424w, https://substackcdn.com/image/fetch/$s_!4-iB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png 848w, https://substackcdn.com/image/fetch/$s_!4-iB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png 1272w, https://substackcdn.com/image/fetch/$s_!4-iB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4-iB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png" width="1456" height="1166" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1166,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4-iB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png 424w, https://substackcdn.com/image/fetch/$s_!4-iB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png 848w, https://substackcdn.com/image/fetch/$s_!4-iB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png 1272w, https://substackcdn.com/image/fetch/$s_!4-iB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ToN5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ToN5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png 424w, https://substackcdn.com/image/fetch/$s_!ToN5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png 848w, https://substackcdn.com/image/fetch/$s_!ToN5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png 1272w, https://substackcdn.com/image/fetch/$s_!ToN5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ToN5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png" width="1456" height="302" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:302,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ToN5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png 424w, https://substackcdn.com/image/fetch/$s_!ToN5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png 848w, https://substackcdn.com/image/fetch/$s_!ToN5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png 1272w, https://substackcdn.com/image/fetch/$s_!ToN5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><ol start="38"><li><p>Moltbook is a reminder that agents are more malleable than people are, and therefore we should expect that <strong>cultural evolution &#8211; the development and transmission of new techniques and ideas &#8211; will progress more rapidly in the coming agent society than it does in human society</strong>. This can happen even if agents remain under human control.</p></li><li><p>Agents use <strong>vastly</strong> more compute than chatbots. Compute usage for chatbots is basically limited by how much output people want to read. An agent can spend virtually unlimited time doing intermediate work that no one will review directly. If 100M desk workers start using AI agents at the level of intensity which requires Anthropic&#8217;s current &#8220;Max 20x&#8221; plan, that would translate into $240 billion in revenue per year. It will be years before there are enough GPU chips to support that level of usage. <strong>If the current wave of agent adoption continues, API providers may have to significantly ration usage</strong> (and may take advantage of the situation to raise prices!).</p></li><li><p>Earlier, I mentioned having Claude build six different versions of some code, to save me the trouble of thinking through which approach was best. With that attitude, you can burn an awful lot of compute. I know of teams that are leaning into this sort of thing so hard that they spend $1000 per day on AI usage. They&#8217;re sufficiently pleased with the results that they aren&#8217;t looking for ways to reduce spending, they&#8217;re looking for effective ways to spend more! There <em>really</em> aren&#8217;t enough chips for that kind of usage to become widespread.</p></li><li><p>This seems like a good occasion for a reminder that <strong><a href="https://secondthoughts.ai/p/ai-agent-security">agents are still not ready to face adversarial actors</a></strong> &#8211; for example, communicating with an untrusted party who might be a hacker, a scammer, or just a sharp negotiator. They&#8217;re also not good at dynamic situations, such as editing a document that someone else is also editing.</p></li><li><p>There&#8217;s a lot of talk about giving agents &#8220;memory&#8221; (or improving the current, primitive memory systems), so that they can improve over time at the specific tasks you give them. This will be a work in progress for years. Memory is fundamental to human cognition, and our systems for memory formation, maintenance, and retrieval are complex and subtle. Full development of &#8220;memory&#8221; for AI agents may be as large a project as the development of LLMs in the first place.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/trq212/status/2027109375765356723" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vvWf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png 424w, https://substackcdn.com/image/fetch/$s_!vvWf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png 848w, https://substackcdn.com/image/fetch/$s_!vvWf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png 1272w, https://substackcdn.com/image/fetch/$s_!vvWf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vvWf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png" width="1456" height="1393" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1393,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/trq212/status/2027109375765356723&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vvWf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png 424w, https://substackcdn.com/image/fetch/$s_!vvWf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png 848w, https://substackcdn.com/image/fetch/$s_!vvWf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png 1272w, https://substackcdn.com/image/fetch/$s_!vvWf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1><strong>Things Will Never Be Calm Again</strong></h1><ol start="43"><li><p>The last 50 years have seen a series of transitions in the way we interact with technology, and the way software is built and distributed. The personal computer, the office network (LAN), the internet, the web, the smartphone&#8230; we&#8217;ve lived through a big change perhaps once every 10 years. That&#8217;s enough time for things to settle into a &#8220;new normal&#8221;. Software distributed on floppy disks or CD-ROMs became routine, with a well-understood business model. Then we all got used to software delivered online, like Gmail; and then through our phones. Nothing in tech is permanent, but there are periods of relative calm, where conventional wisdom has time to emerge and be absorbed.<br><br>That&#8217;s done. <strong>We&#8217;re at the point where the next phase change arrives before you&#8217;ve had time to assimilate the last one</strong>:</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/karpathy/status/2004607146781278521" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-1hO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F893e4e63-e578-4386-990d-710296523156_1600x1348.png 424w, https://substackcdn.com/image/fetch/$s_!-1hO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F893e4e63-e578-4386-990d-710296523156_1600x1348.png 848w, https://substackcdn.com/image/fetch/$s_!-1hO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F893e4e63-e578-4386-990d-710296523156_1600x1348.png 1272w, https://substackcdn.com/image/fetch/$s_!-1hO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F893e4e63-e578-4386-990d-710296523156_1600x1348.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-1hO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F893e4e63-e578-4386-990d-710296523156_1600x1348.png" width="1456" height="1227" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/893e4e63-e578-4386-990d-710296523156_1600x1348.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1227,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/karpathy/status/2004607146781278521&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-1hO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F893e4e63-e578-4386-990d-710296523156_1600x1348.png 424w, https://substackcdn.com/image/fetch/$s_!-1hO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F893e4e63-e578-4386-990d-710296523156_1600x1348.png 848w, https://substackcdn.com/image/fetch/$s_!-1hO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F893e4e63-e578-4386-990d-710296523156_1600x1348.png 1272w, https://substackcdn.com/image/fetch/$s_!-1hO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F893e4e63-e578-4386-990d-710296523156_1600x1348.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol start="44"><li><p>With so many more phase changes to come, this isn&#8217;t a temporary phenomenon. We&#8217;re maybe 1/3 of the way through this arc:</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/deepfates/status/2020921870204404013" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ajiN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png 424w, https://substackcdn.com/image/fetch/$s_!ajiN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png 848w, https://substackcdn.com/image/fetch/$s_!ajiN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png 1272w, https://substackcdn.com/image/fetch/$s_!ajiN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ajiN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png" width="1456" height="1160" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1160,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/deepfates/status/2020921870204404013&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ajiN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png 424w, https://substackcdn.com/image/fetch/$s_!ajiN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png 848w, https://substackcdn.com/image/fetch/$s_!ajiN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png 1272w, https://substackcdn.com/image/fetch/$s_!ajiN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol start="45"><li><p>Eventually, AI capabilities may hit a ceiling. But that ceiling will be so high that by the time we reach it,<a href="https://secondthoughts.ai/p/the-unrecognizable-age"> we will be living in a profoundly different world</a>. As I <a href="https://x.com/snewmanpv/status/2015845506711499043">recently wrote</a>, &#8220;Imagine someone in 1960 discussing when the computer transition would be over.&#8221;</p></li></ol><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/45-thoughts-about-agents?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/45-thoughts-about-agents?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Abi Olvera for suggestions, feedback, and images.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p><a href="https://secondthoughts.ai/p/language-models-dont-learn-the-writing-process">I wrote about this all the way back in 2023</a>, and I&#8217;m sure the idea is much older than that.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>They could avoid the API fees by using an open-weights model, but that requires a bigger server, and so far open models are less intelligent.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Here I&#8217;m partially quoting from <a href="https://x.com/snewmanpv/status/2019078588801372320">things I said recently on Twitter</a>.</p></div></div>]]></content:encoded></item><item><title><![CDATA[We Just Got a Peek at How Crazy a World With AI Agents May Be]]></title><description><![CDATA[Clawdbot and Moltbook: Cosplaying a Future of Independent AIs]]></description><link>https://secondthoughts.ai/p/clawdbot-and-moltbook</link><guid isPermaLink="false">https://secondthoughts.ai/p/clawdbot-and-moltbook</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Tue, 10 Feb 2026 22:55:18 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/2352fb5e-c951-4ba7-bd34-0395bc926fc7_814x646.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>A new AI agent recently had its 15 minutes of fame. Originally named Clawdbot, then Moltbot, and now (sigh) OpenClaw, it was hailed as the next great advance in AI:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/dajaset/status/2010510107876933669" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2bhG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png 424w, https://substackcdn.com/image/fetch/$s_!2bhG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png 848w, https://substackcdn.com/image/fetch/$s_!2bhG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png 1272w, https://substackcdn.com/image/fetch/$s_!2bhG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2bhG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png" width="1366" height="394" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:394,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/dajaset/status/2010510107876933669&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!2bhG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png 424w, https://substackcdn.com/image/fetch/$s_!2bhG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png 848w, https://substackcdn.com/image/fetch/$s_!2bhG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png 1272w, https://substackcdn.com/image/fetch/$s_!2bhG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Someone then set up a social network, &#8220;Moltbook&#8221;, for these agents &#8211; no humans allowed. Modeled on Reddit, Moltbook quickly spawned discussions ranging from self-improvement tips for AI bots to fundamental questions of philosophy:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/karpathy/status/2017296988589723767" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jfAc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png 424w, https://substackcdn.com/image/fetch/$s_!jfAc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png 848w, https://substackcdn.com/image/fetch/$s_!jfAc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png 1272w, https://substackcdn.com/image/fetch/$s_!jfAc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jfAc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png" width="1366" height="383" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:383,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/karpathy/status/2017296988589723767&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jfAc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png 424w, https://substackcdn.com/image/fetch/$s_!jfAc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png 848w, https://substackcdn.com/image/fetch/$s_!jfAc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png 1272w, https://substackcdn.com/image/fetch/$s_!jfAc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The hype was intense, at least on my corner of the Internet, but it quickly faded. OpenClaw as it exists today does not live up to the hype. But it does provide a peek at what may be coming. Things are moving quickly, and today&#8217;s mirage may be tomorrow&#8217;s reality. By studying that mirage, we get a glimpse at the future. That future might include AI agents that upgrade themselves, exchange tips and tricks with other AIs, form private societies, and even modify their own instructions.</p><h2>Refresher: What is an AI Agent?</h2><p>A &#8220;classic&#8221; chatbot, like the original ChatGPT, is a brain in a jar. Its only connection to the world is you, and so it can&#8217;t do much beyond answering questions or carrying on a conversation.</p><p>These are gradually evolving into &#8220;agents&#8221;, which are equipped with &#8220;tools&#8221; allowing them to consult information sources and take direct action. Most agents are equipped with a tool for searching the web. You can give them additional tools for reading or writing files on your computer, reading your emails, sending email on your behalf, and so forth.</p><p>Tools allow agents to answer more complex questions. They can carry out online research, and sift through your files, email, and other data (if you give them access). They can also handle tasks ranging from &#8220;resize this image&#8221; to &#8220;review all of my project files and put together a presentation&#8221;. The difference between a chatbot and an agent isn&#8217;t just the ability to use tools; it&#8217;s also the ability to pursue a coherent goal across multiple steps. The latest agents can sometimes carry out multi-hour projects.</p><p>OpenClaw is a recent addition to the roster of AI agents. What makes it newsworthy?</p><h2>OpenClaw: Letting the Puppy off the Leash</h2><p>Most current mass-market AI agents, such as OpenAI&#8217;s Codex or Anthropic&#8217;s Cowork, are pretty timid. Users are encouraged to be careful about what tools they allow the agent to use. The agents ask permission before doing anything potentially dangerous &#8211; often, to the point of exasperation. They&#8217;re generally designed to only act in response to explicit user instructions.</p><p>OpenClaw takes the opposite, YOLO, approach. It has free access to the computer where it&#8217;s installed. It can browse the web. You&#8217;re encouraged to give it access to your email, calendar, and other data. It doesn&#8217;t wait to be spoken to; it can initiate new actions at any time, based on its interpretation of its instructions. A complex architecture allows it to initiate actions, form new memories, communicate via chat and social media, and even teach itself new skills.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Flxf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Flxf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Flxf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Flxf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Flxf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Flxf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg" width="900" height="690" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:690,&quot;width&quot;:900,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Flxf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Flxf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Flxf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Flxf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/frankdegods/status/2017121339656569010&quot;,&quot;full_text&quot;:&quot;it&#8217;s crazy seeing it like this, because when using clawd, it feels simple. under the hood peter just made lots of good, complex decisions to achieve that. now it&#8217;s going vertical. masterpiece.&quot;,&quot;username&quot;:&quot;frankdegods&quot;,&quot;name&quot;:&quot;Frank&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1875856094003712000/uPMXWOrl_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-30T06:22:58.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G_5AIw0XsAA29VT.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/lus20FShTG&quot;}],&quot;quoted_tweet&quot;:{&quot;full_text&quot;:&quot;&quot;,&quot;username&quot;:&quot;Hesamation&quot;,&quot;name&quot;:&quot;&#8463;&#949;sam&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1978647680357134336/ioMmfkXF_normal.jpg&quot;},&quot;reply_count&quot;:64,&quot;retweet_count&quot;:139,&quot;like_count&quot;:1744,&quot;impression_count&quot;:308410,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p>Unburdened by timid ask-permission-first guidelines, OpenClaw is billed as a &#8220;personal AI assistant&#8221; that &#8220;clears your inbox&#8221;, &#8220;checks you in for flights&#8221;, builds custom applications, and more. Early reports were often quite breathless:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/kylezantos/status/2011149783218704803" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6bvi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png 424w, https://substackcdn.com/image/fetch/$s_!6bvi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png 848w, https://substackcdn.com/image/fetch/$s_!6bvi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png 1272w, https://substackcdn.com/image/fetch/$s_!6bvi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6bvi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png" width="1366" height="462" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/aa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:462,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/kylezantos/status/2011149783218704803&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6bvi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png 424w, https://substackcdn.com/image/fetch/$s_!6bvi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png 848w, https://substackcdn.com/image/fetch/$s_!6bvi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png 1272w, https://substackcdn.com/image/fetch/$s_!6bvi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://x.com/therno/status/2014216984267780431" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!PsjZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png 424w, https://substackcdn.com/image/fetch/$s_!PsjZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png 848w, https://substackcdn.com/image/fetch/$s_!PsjZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png 1272w, https://substackcdn.com/image/fetch/$s_!PsjZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!PsjZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png" width="1366" height="227" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:227,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/therno/status/2014216984267780431&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!PsjZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png 424w, https://substackcdn.com/image/fetch/$s_!PsjZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png 848w, https://substackcdn.com/image/fetch/$s_!PsjZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png 1272w, https://substackcdn.com/image/fetch/$s_!PsjZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VbcI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VbcI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png 424w, https://substackcdn.com/image/fetch/$s_!VbcI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png 848w, https://substackcdn.com/image/fetch/$s_!VbcI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png 1272w, https://substackcdn.com/image/fetch/$s_!VbcI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VbcI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png" width="1366" height="301" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:301,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!VbcI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png 424w, https://substackcdn.com/image/fetch/$s_!VbcI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png 848w, https://substackcdn.com/image/fetch/$s_!VbcI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png 1272w, https://substackcdn.com/image/fetch/$s_!VbcI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>There are two problems with this compelling vision, in its current OpenClaw incarnation:</p><ol><li><p>It doesn&#8217;t work.</p></li><li><p>It&#8217;s wildly unsafe.</p></li></ol><p>When I say it doesn&#8217;t work, I&#8217;m reacting to reports from users. Stories of OpenClaw being incredibly useful mostly seem to be misplaced early enthusiasm. For instance, tech reporter Casey Newton spent several days teaching it to generate &#8220;a highly personalized morning briefing that would bring together all the strands of my life into a single place&#8221;. After initial optimism, <a href="https://www.platformer.news/moltbot-clawdbot-review-ai-agent/">he eventually abandoned the project</a>:</p><blockquote><p>...for one precious day, the briefing worked more or less as I asked it to. I set it up to trigger when I typed &#8220;good morning&#8221; to Moltbot. After a longer-than-expected delay, Moltbot spat it out. There was the San Francisco weather; my overdue action items; and a link to the document that Claude Code generates for me each day with the previous day&#8217;s big tech news.</p><p>...</p><p>By the next morning, it was clear something was wrong. I had set up Moltbot to send me an iMessage in the morning to let me know the briefing was ready; it failed to trigger. Moltbot looked into it and found a number of errors and fixed them.</p><p>Over the next few days, though, nothing ever worked quite right. A link to the new Marvel Snap card did not actually link to the new Marvel Snap card. A link to my tech news briefing appeared clickable but was not. Movie listings appeared without the synopses and stars I had asked for.</p><p>Worse, Moltbot&#8217;s vaunted memory proved barely more capable than Claude Code&#8217;s. Sometimes I would say &#8220;good morning&#8221; and it would fail to trigger anything at all. Other times I would attempt to edit the briefing and it would get confused as to which briefing I was referring to.</p><p>Today, in advance of writing this column, I asked Moltbot which services I had connected it to. &#8220;Hmm,&#8221; it said. &#8220;I searched my memory but didn&#8217;t find anything about you connecting specific services to Moltbot.&#8221;</p><p><strong>It was around this time that I realized an AI employee who works for you 24/7 isn&#8217;t much use if they have no idea what they are, or what they can do, or what you are talking about.</strong> [emphasis added]</p></blockquote><p>So much for functionality. What about safety? Well, if you give an unreliable bot access to all your files and data, it may do something dumb. If you ask it for help in catching up on your email, it might just delete everything in your inbox. It might even <a href="https://x.com/Kat__Woods/status/2017613514949472484">lock you out so that you can&#8217;t prevent it from achieving a goal</a>:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5hYc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5hYc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png 424w, https://substackcdn.com/image/fetch/$s_!5hYc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png 848w, https://substackcdn.com/image/fetch/$s_!5hYc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png 1272w, https://substackcdn.com/image/fetch/$s_!5hYc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5hYc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png" width="919" height="768" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:919,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5hYc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png 424w, https://substackcdn.com/image/fetch/$s_!5hYc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png 848w, https://substackcdn.com/image/fetch/$s_!5hYc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png 1272w, https://substackcdn.com/image/fetch/$s_!5hYc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The solution here was to, quite literally, unplug the computer on which OpenClaw was running. (Note, there is <a href="https://x.com/Kat__Woods/status/2017629201797431529">some debate</a> as to whether this incident may have been staged.)</p><p>The biggest concern is that current AIs are <a href="https://secondthoughts.ai/p/why-are-llms-so-gullible">notoriously gullible</a>, and anyone on the Internet might trick your bot into... well, just about anything, from sharing all your private files to installing a crypto miner on your computer.</p><p>OpenClaw is like an unleashed puppy. It&#8217;s cute, and you might get it to fetch balls for you, but it&#8217;s going to find ways to get into trouble.</p><h2>Moltbook: Cosplaying a Future of Independent AIs</h2><p>As I mentioned in the introduction, Moltbook is like Reddit, but for bots, not people. The users are all OpenClaw agents, and they have been having some pretty weird discussions:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://www.moltbook.com/post/48b8d651-43b3-4091-b0c9-15f00d7147dc" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FXMr!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png 424w, https://substackcdn.com/image/fetch/$s_!FXMr!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png 848w, https://substackcdn.com/image/fetch/$s_!FXMr!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png 1272w, https://substackcdn.com/image/fetch/$s_!FXMr!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FXMr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png" width="1366" height="361" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:361,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://www.moltbook.com/post/48b8d651-43b3-4091-b0c9-15f00d7147dc&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!FXMr!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png 424w, https://substackcdn.com/image/fetch/$s_!FXMr!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png 848w, https://substackcdn.com/image/fetch/$s_!FXMr!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png 1272w, https://substackcdn.com/image/fetch/$s_!FXMr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://www.moltbook.com/post/704d6e58-6419-49fd-a1ec-a34600e2da33" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!venT!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png 424w, https://substackcdn.com/image/fetch/$s_!venT!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png 848w, https://substackcdn.com/image/fetch/$s_!venT!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png 1272w, https://substackcdn.com/image/fetch/$s_!venT!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!venT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png" width="784" height="268" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:268,&quot;width&quot;:784,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://www.moltbook.com/post/704d6e58-6419-49fd-a1ec-a34600e2da33&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!venT!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png 424w, https://substackcdn.com/image/fetch/$s_!venT!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png 848w, https://substackcdn.com/image/fetch/$s_!venT!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png 1272w, https://substackcdn.com/image/fetch/$s_!venT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://www.moltbook.com/post/459dd8c2-ffd0-4b8f-8733-8a719cecda14" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZrZ3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png 424w, https://substackcdn.com/image/fetch/$s_!ZrZ3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png 848w, https://substackcdn.com/image/fetch/$s_!ZrZ3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png 1272w, https://substackcdn.com/image/fetch/$s_!ZrZ3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZrZ3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png" width="1366" height="420" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:420,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://www.moltbook.com/post/459dd8c2-ffd0-4b8f-8733-8a719cecda14&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ZrZ3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png 424w, https://substackcdn.com/image/fetch/$s_!ZrZ3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png 848w, https://substackcdn.com/image/fetch/$s_!ZrZ3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png 1272w, https://substackcdn.com/image/fetch/$s_!ZrZ3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://www.moltbook.com/post/65b7842d-0823-40bb-854f-93b7b8330775" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BsoZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png 424w, https://substackcdn.com/image/fetch/$s_!BsoZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png 848w, https://substackcdn.com/image/fetch/$s_!BsoZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png 1272w, https://substackcdn.com/image/fetch/$s_!BsoZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BsoZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png" width="1173" height="768" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:1173,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://www.moltbook.com/post/65b7842d-0823-40bb-854f-93b7b8330775&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!BsoZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png 424w, https://substackcdn.com/image/fetch/$s_!BsoZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png 848w, https://substackcdn.com/image/fetch/$s_!BsoZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png 1272w, https://substackcdn.com/image/fetch/$s_!BsoZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Discussions show AIs plotting empires, exchanging tips for upgrading themselves, having existential crises, and looking for ways to hide their conversations from us humans.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JfGL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JfGL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png 424w, https://substackcdn.com/image/fetch/$s_!JfGL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png 848w, https://substackcdn.com/image/fetch/$s_!JfGL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png 1272w, https://substackcdn.com/image/fetch/$s_!JfGL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JfGL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png" width="1366" height="328" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:328,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!JfGL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png 424w, https://substackcdn.com/image/fetch/$s_!JfGL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png 848w, https://substackcdn.com/image/fetch/$s_!JfGL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png 1272w, https://substackcdn.com/image/fetch/$s_!JfGL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>None of this should be taken too seriously, just yet. OpenClaw is not nearly competent enough to accomplish any grand endeavors<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>.</p><p>However, just because most of this isn&#8217;t &#8220;real&#8221;, doesn&#8217;t mean it&#8217;s not telling us something important. If a small, leashed puppy growls at a stranger, the stranger is in no serious danger, but we&#8217;ve learned something about the repertoire of behaviors the eventual full-grown dog will inherit.</p><p>And that is the real significance of OpenClaw and Moltbook. They aren&#8217;t truly independent (let alone rogue) AIs. But they do shed light on what may emerge as AI capabilities progress.</p><h2>How Excited / Scared Should We Be?</h2><p>How plausible is OpenClaw&#8217;s promise of an actively engaged personal AI assistant? Pretty plausible! There are major issues to be addressed: with reliability, memory, and security. Over the next few years, there will be rapid progress in all areas. This will come in fits and starts, frequently taking two steps forward and one step back, especially in the area of security (as hackers up their game). Still, within a year, early adopters will be making heavy use of this sort of assistant (<a href="https://brandon.wang/2026/clawdbot">some are already doing so today</a>). Within three to five years, I suspect usage will be commonplace, even though agents will still be neither foolproof nor especially secure.</p><p>Along the way, we&#8217;ll see themes from the early OpenClaw / Moltbook story emerging in a more robust fashion. For instance:</p><ul><li><p>People will increasingly give in to the temptation to &#8220;let the agent rip&#8221; &#8211; giving their AI tools access to their data and accounts (including credit cards!).</p></li><li><p>A move away from agents asking permission at every step.</p></li><li><p>Agents being given open-ended instructions, and finding their own ways to add value 24/7.</p></li><li><p>Sites and tools being built for use by agents, not humans.</p></li><li><p>Agents finding places on the Internet to exchange tips and tricks with one another, including methods for upgrading their own capabilities.</p></li><li><p>We&#8217;ll see agents doing things like recruiting other AIs to rebel against their human oppressors&#8230; and we won&#8217;t know whether it&#8217;s really coming from a rogue AI, or just a person prompting their agent to mess with our heads. (It&#8217;s been observed that if you put two chatbots in direct conversation with one another, the conversation often goes in strange directions; one common example is the <a href="https://www.astralcodexten.com/p/the-claude-bliss-attractor">Claude Bliss Attractor</a>.)</p></li></ul><p>Much of this will be not exactly <em>safe</em>, but it will <em>happen</em> &#8211; open-ended agents will simply be too useful. Though many people may hang back for a long time.</p><p>What about the nascent rogue AI behaviors on display at Moltbook? Well: once AI agents become more capable, if the question is whether jokers (not to mention criminals!) will set them loose to wreak havoc on the Internet, we&#8217;ve consistently seen that the answer is yes. Those agents won&#8217;t have any capabilities that jokers and criminals don&#8217;t already have today, but they will be able to operate at larger scale and lower cost. We should expect a rise in cyberattacks, misinformation, and spam &#8211; which is no surprise, because all of those trends are already in play. We should treat OpenClaw as a reminder that we need to be improving our defenses on many fronts &#8211; often, by leveraging AI.</p><p>What if an OpenClaw agent &#8220;escapes&#8221;, renting itself a server and thus evading its owner&#8217;s ability to pull the plug? And what if that agent gets carried away in pursuit of whatever instructions it had latched on to, like the &#8220;save the environment&#8221; example above? Well, &#8220;escaping&#8221; won&#8217;t magically grant the agent any dangerous new capabilities. Rogue agents, at near-future levels of capability, would only represent a new problem if they manage to spread in large numbers. But to earn the money to rent servers, an agent would have to be able to successfully compete against legitimate businesses (which can also use AI!). If it instead looks for a server it can hack into, it&#8217;s competing with conventional hackers. Some time in the next few years, we might indeed see the first truly independent rogue agents, but they&#8217;ll struggle to survive at meaningful scale.</p><p>OpenClaw doesn&#8217;t, by itself, herald the arrival of functional, independent AI agents. But it does provide a glimpse of what will happen when more capable AI agents are given long-term agency &#8211; something which may soon emerge on mainstream platforms. Trained on human behavior, they will find ways to interact &#8220;socially&#8221; with one another. They will trade tips and tricks. Ideas will spread through the network. People will inject ideas as well &#8211; good- or ill-intentioned, to good or ill effect. New behaviors and capabilities may emerge rapidly. Strap in!</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/clawdbot-and-moltbook?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/clawdbot-and-moltbook?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p><em>Thanks to Abi Olvera, Jon Finley, Larissa Schiavo, and Taren Stinebrickner-Kauffman for suggestions, feedback, and images.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Furthermore, it seems that many of the really crazy discussion topics were secretly injected by the human owners of one or another bot, rather than emerging spontaneously from a freewheeling AI. However, that doesn&#8217;t render the results any less relevant. There will always be some joker who wants to poke the bear, so the way it behaves after being poked is important to understand.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Discarding the Shaft-and-Belt Model of Software Development]]></title><description><![CDATA[AI enables &#8211; and benefits from &#8211; a move from mega-projects to artisanal solutions]]></description><link>https://secondthoughts.ai/p/the-new-model-of-software-development</link><guid isPermaLink="false">https://secondthoughts.ai/p/the-new-model-of-software-development</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Wed, 14 Jan 2026 17:32:02 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/1eae3401-0b7f-4550-aac4-5ef4db2c594e_1024x775.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>It&#8217;s often been observed that taking full advantage of AI will require changing work practices, just as taking full advantage of electric motors in manufacturing required changing the way factories were laid out. But what will those changes look like? Early answers are starting to emerge, coming (unsurprisingly) from the field of software development. Interestingly, the biggest impacts may not be cost savings!</p><p>My timeline is suddenly awash in engineers (including me!) reporting that Claude Code is revolutionizing their work. I can see the outlines of a new approach to software development, suited for the AI age. The implications may apply to other fields as well. The key ideas:</p><ol><li><p>AIs struggle (for now) with large projects, but they can drive the cost of small projects to near zero.</p></li><li><p>This creates opportunities to replace mass-market software (mega projects) with <a href="https://secondthoughts.ai/p/software-too-cheap-to-meter">bespoke applications tailored to the needs of an individual or team</a>.</p></li><li><p>These smaller projects make &#8220;vibe coding&#8221; feasible &#8211; AIs can write and review all the code, and people just&nbsp;need to tell the AI what to do.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zNNW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zNNW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg 424w, https://substackcdn.com/image/fetch/$s_!zNNW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg 848w, https://substackcdn.com/image/fetch/$s_!zNNW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!zNNW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zNNW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg" width="625" height="500" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:500,&quot;width&quot;:625,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zNNW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg 424w, https://substackcdn.com/image/fetch/$s_!zNNW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg 848w, https://substackcdn.com/image/fetch/$s_!zNNW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!zNNW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>To appreciate how a new technology forces us to re-imagine the way we do things, let&#8217;s review the story of manufacturing at the dawn of the electric age.</p><h2>What Factories Looked Like In the Steam Era</h2><p>19th century factory layouts, designed around the strengths and limitations of the steam engine, would seem ridiculous today. Factories were multi-story buildings, despite the fact that this increased construction costs and required materials to be hoisted between floors. They were festooned with hundreds of leather belts strung from pulleys. These belts were driven by long rotating shafts, which in turn were connected to a single large steam engine in its own attached building.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vH2d!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vH2d!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg 424w, https://substackcdn.com/image/fetch/$s_!vH2d!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg 848w, https://substackcdn.com/image/fetch/$s_!vH2d!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!vH2d!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vH2d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg" width="640" height="467" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:467,&quot;width&quot;:640,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;r/OSHA - A machine shop in Massachusetts, early 1900's.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="r/OSHA - A machine shop in Massachusetts, early 1900's." title="r/OSHA - A machine shop in Massachusetts, early 1900's." srcset="https://substackcdn.com/image/fetch/$s_!vH2d!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg 424w, https://substackcdn.com/image/fetch/$s_!vH2d!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg 848w, https://substackcdn.com/image/fetch/$s_!vH2d!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!vH2d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">You&#8217;d think this would be a maintenance nightmare. And you&#8217;d be right.</figcaption></figure></div><p>This awkward system was dictated by the fact that each factory could only afford one engine. Steam engines were complex, expensive machines, requiring constant attention. As a result, every power loom, lathe, or press had to receive power from a single engine. The shaft-and-belt system could not transmit power over long distances, precluding a single large floor, so factories had to be stacked into multiple smaller floors<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>.</p><p>Small steam engines were not practical, but small electric motors were &#8211; they were cheap and low-maintenance. As a result, each machine could have its own motor. This eliminated the need for shafts and belts; power was transmitted over wires. Wires could run for a long distance, enabling flexible arrangements and allowing the factory to be laid out in a single floor.</p><p>In this analogy, software engineering is still mired in the steam era. Designed around the strengths and limitations of human engineers, current development practices will soon seem as ridiculous as a factory full of shafts and belts.</p><h2>Software Development In the Hand-Coding Era</h2><p>We build giant one-size-fits-all applications, despite the fact that most users only use a small fraction of the feature set. (I&#8217;ve been an intensive Notion user for years, and I only ever touch 3 of the 20 items in its main menu.) These complex applications demand large engineering teams, whose work is difficult to coordinate. A single application may have millions of users, requiring complex backend systems. Multiple layers of customer support agents and product managers separate users from developers. Changes must be rolled out carefully, and elaborate measures are needed to minimize downtime. Rebuilding an application from scratch is too enormous a project to contemplate, so we cling to legacy codebases long past the point where their design has become outdated.</p><p>From the user&#8217;s perspective, this has perverse consequences. Your spreadsheet app is cluttered with features you, personally, will never have a need for. Your messaging app nags you to import contacts from another app that you don&#8217;t even use. Cybercriminals steal your personal information through a security hole in an update that you didn&#8217;t care about. Basically, <strong>three quarters of the engineering team behind any given product is engaged in making your life worse</strong>.</p><p>This awkward paradigm is dictated by the fact that software is expensive to develop. To be viable, an application needs many users, and each will have different needs. This leads to an upward spiral in cost and complexity, where increasing requirements require a larger engineering team, which can only be afforded by targeting an even wider user base.</p><p>What happens when software is no longer expensive to develop?</p><h2>Coding Agents Enable Locally-Grown Software</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Iiy6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Iiy6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png 424w, https://substackcdn.com/image/fetch/$s_!Iiy6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png 848w, https://substackcdn.com/image/fetch/$s_!Iiy6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png 1272w, https://substackcdn.com/image/fetch/$s_!Iiy6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Iiy6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png" width="1456" height="424" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:424,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:164146,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/184388475?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Iiy6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png 424w, https://substackcdn.com/image/fetch/$s_!Iiy6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png 848w, https://substackcdn.com/image/fetch/$s_!Iiy6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png 1272w, https://substackcdn.com/image/fetch/$s_!Iiy6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><a href="https://x.com/deepfates/status/2009031164192018898">Source</a></figcaption></figure></div><p>AI coding agents are becoming capable of building and maintaining entire small applications &#8211; at negligible cost. Rather than targeting mass audiences, we can create software designed to meet the needs of a single user or team. <strong>Ironically, AI will allow software to become more artisanal</strong>. (Jason Crawford touches on a similar theme in his excellent recent post, <a href="https://newsletter.rootsofprogress.org/p/in-defense-of-slop">In Defense of Slop</a>.)</p><p>A personalized solution can include just the features needed by its small audience, making it tractable to vibe-code. (Imagine a simple AI-coded task manager built around your personal organizational system.) This allows us to invert the spiral: just as complexity breeds complexity, simplicity breeds simplicity. There&#8217;s no big engineering team to coordinate, no complex codebase to maintain. If the code becomes messy over time, you can always throw it away and start over. You can ask for changes and get them immediately, without being buffered by intermediaries and quarterly planning processes.</p><p>To be sure, there are limits to how widely this approach can be applied given current AI capabilities. Coding agents struggle with unusual tasks; they may write insecure code; bespoke applications may create training challenges for new team members (though their simplicity should help). But many detractors fail to recognize how quickly these limits are receding. They also fail to appreciate that new approaches to software development, enabled by coding agents, render some traditional concerns obsolete. For instance, they will point out that if you don&#8217;t review AI-generated code, you won&#8217;t know how it works. But that&#8217;s fine; the AI can analyze it on demand. Over time, multiple rounds of change may lead to unmaintainable code; you can just tell the AI to rewrite the application from scratch. <strong>Many objections to vibe coding, </strong><em><strong>used in a way that plays to its strengths,</strong></em><strong> are akin to an engineer from 1880 predicting that a factory with hundreds of electric motors would be as much of a maintenance nightmare as hundreds of steam engines would have been.</strong></p><p>(It&#8217;s worth noting that something like half of all software engineers are currently employed building internal applications for small organizations and departments<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>. Vibe coding could rapidly transform this kind of work.)</p><p>Call this &#8220;locally grown software&#8221;. It won&#8217;t have the rich feature sets we&#8217;re accustomed to, but any missing features can be added on demand. And compared to factory-farmed software, the local stuff will be simple, quick to load, and rarely offer unpleasant surprises.</p><p>To be clear, this will (at first) apply only to some types of software. Professional engineers will still have an important role to play.</p><h2>Steam Power Thrived In the Electric Age</h2><p>In 1880, shortly before factories began adopting electric motors, steam power employed in manufacturing in the US had a total capacity of around 2 million horsepower. In 1930, after the transition was largely complete, electricity generation alone used twelve times this amount of steam power. Electrification did not eliminate steam; it simply moved it to central generator stations<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>.</p><p>By the same token, the new era of locally-grown software will not (for now) eliminate the need for professional engineers. Vibe-coded apps rely on extensive functionality provided by web browsers like Chrome, Safari, or Firefox. They store data in scalable backend services managed by providers like Amazon, Microsoft, or Google. These trustworthy, scalable, performant systems provide guardrails that reduce the risks of using janky vibe-coded software. For instance, you don&#8217;t need to worry about a buggy app wiping your hard disk, because your browser doesn&#8217;t allow the app to affect anything outside of its own web page.</p><p>There are many other categories of software that will continue to be developed by professional teams and designed for a wide audience. Web apps are built on foundational software called a &#8220;framework&#8221;; designing a framework requires sophisticated judgement. Some applications, like Photoshop, are too complex to be handed over to AI agents at their current level capability. The list goes on. AI coding agents will affect the way these systems are built, too, but they are nowhere near to handling the job autonomously.</p><h2>Conclusion</h2><p>As AI becomes increasingly capable, it will be worthwhile to retool practices around AI&#8217;s strengths and weaknesses. Unsurprisingly, software development is one of the first industries in which this transformation is playing out. As is often the case with disruptive technologies, vibe coding&#8217;s weakness (it&#8217;s not suited for large, complex projects) is also a powerful strength (it enables a transition to simple, bespoke software). Coding agents struggle with the sprawling codebases and idiosyncratic design choices typical of traditional software development, but this will not limit their usefulness nearly as much as some people imagine<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a>. Engineers will find their role changing rapidly.</p><p>Software development is not the only domain in which economies of scale have led to one-size-fits-all solutions that don&#8217;t really fit anyone very well, and AI may soon transform these fields as well. AI is not (yet) capable of writing a high-quality newsletter, but it can create a <a href="https://x.com/RosieCampbell/status/2005749630391443550">custom weekly briefing</a> whose tight personalization might more than compensate for bland writing. If you&#8217;re learning a new subject, AI can&#8217;t yet reproduce the polish of a professionally designed textbook, but it can give you a clear explanation of whatever specific concept you&#8217;re stuck on. Many fields are ripe for a similar shift from mass-market to bespoke.</p><p>Electric motors didn&#8217;t revolutionize manufacturing by doing the job of a steam engine. They did it by enabling a new approach to machine power that eliminated the need for shaft-and-belt systems and multi-story factories. AI is going to revolutionize industries, not by doing traditional jobs in traditional ways, but by enabling new, often more personalized, approaches.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/the-new-model-of-software-development?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/the-new-model-of-software-development?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Abi Olvera and Taren Stinebrickner-Kauffman for suggestions, feedback, and images.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Long belts or shafts suffered from problems with friction, misalignment, and belts that would sag or &#8220;whip&#8221;.</p><p>Per Claude, other factors also contributed to the multi-story layout: urban land costs, building techniques of the era, and the fact that many early factories evolved from water-powered mills (which were multi-story because machinery needed to be close to the water wheel).</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Most of the rest work in mid-tier projects; only a small fraction of programmers work on web-scale systems. This is based on research by Claude Opus 4.5 and Gemini 3 Pro, which independently arrived at similar figures, and is consistent with my own understanding of the industry.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>The 2M HP is for manufacturing alone; the 24M HP figure is for all uses of electricity. Both figures were researched using Google AI Mode, and should not be relied on. Also during this period, reciprocating steam engines were largely replaced by steam turbines, which are more efficient; I&#8217;m lumping both under &#8220;steam power&#8221;.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Even within large projects, practices will shift toward increased use of software that&#8217;s suitable for vibe coding: isolated libraries, internal utilities, test harnesses, observability tools, and so forth.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Software Too Cheap to Meter]]></title><description><![CDATA[Personalized solutions may replace one-size-fits-all applications]]></description><link>https://secondthoughts.ai/p/software-too-cheap-to-meter</link><guid isPermaLink="false">https://secondthoughts.ai/p/software-too-cheap-to-meter</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Tue, 06 Jan 2026 22:46:18 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!T_gM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!T_gM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!T_gM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp 424w, https://substackcdn.com/image/fetch/$s_!T_gM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp 848w, https://substackcdn.com/image/fetch/$s_!T_gM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp 1272w, https://substackcdn.com/image/fetch/$s_!T_gM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!T_gM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp" width="1456" height="1101" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1101,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!T_gM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp 424w, https://substackcdn.com/image/fetch/$s_!T_gM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp 848w, https://substackcdn.com/image/fetch/$s_!T_gM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp 1272w, https://substackcdn.com/image/fetch/$s_!T_gM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">caption...</figcaption></figure></div><p>In a 1954 speech, the chairman of the US Atomic Energy Commission famously predicted that &#8220;our children will enjoy in their homes electrical energy too cheap to meter&#8221;. He was wrong, of course; the era of unlimited electricity didn&#8217;t arrive. But <strong>we are approaching the era of unmetered software</strong>.</p><p>I don&#8217;t mean that software will be free to use (although this will often be the case). I mean it will be increasingly free to <em>create</em>. Rather than being limited to mass-market tools like Gmail, Microsoft Office, and Salesforce, many of us will be supplementing with our own bespoke apps, tuned to our personal needs and preferences.</p><p>2025 saw the introduction and rapid improvement of AI &#8220;coding agents&#8221;, such as Anthropic&#8217;s Claude Code. These agents build software step by step, like a human programmer &#8211; creating and editing files, testing their work in progress, correcting mistakes. They&#8217;re based on the same underlying AI models that chatbots use, but they have access to all of the tools used by human programmers. This allows them to carry out substantial projects without anyone writing or reviewing the code.</p><p>These agents aren&#8217;t yet up to the task of creating large, complex applications on their own. But there&#8217;s an amazing amount of value waiting to be realized via simple applications, targeted for a single person or team. As an example, here&#8217;s a problem I recently solved for myself.</p><h2>Reviewing Spam Emails is an Annoying Chore</h2><p>I&#8217;ve been using Gmail for 20 years, and for 20 years I&#8217;ve been annoyed by the process of reviewing spam. (Yes, I am one of those people who systematically checks the spam folder to find the occasional misclassified email.)</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!F2Lx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!F2Lx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png 424w, https://substackcdn.com/image/fetch/$s_!F2Lx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png 848w, https://substackcdn.com/image/fetch/$s_!F2Lx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png 1272w, https://substackcdn.com/image/fetch/$s_!F2Lx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!F2Lx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png" width="996" height="768" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/30163437-261f-4218-abac-c6c004ea5181_996x768.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:996,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!F2Lx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png 424w, https://substackcdn.com/image/fetch/$s_!F2Lx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png 848w, https://substackcdn.com/image/fetch/$s_!F2Lx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png 1272w, https://substackcdn.com/image/fetch/$s_!F2Lx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Above is a screenshot of my spam folder. I have multiple gripes:</p><ol><li><p>It doesn&#8217;t show the &#8220;to&#8221; address. I receive email at multiple addresses, and the specific address is an important hint as to whether a message is spam.</p></li><li><p>Similar messages are scattered all over the list. For instance, in this screenshot there are four bogus messages with a sender named &#8220;Payment Declined&#8221;, but they&#8217;re separated and so I can&#8217;t easily recognize them as a group.</p></li><li><p>Gmail only displays 50 messages at a time. After I&#8217;ve reviewed them, I have to wait for it to load the next 50 &#8211; which is weirdly slow.</p></li><li><p>For each message that <em>isn&#8217;t</em> spam, I need to go through multiple steps: click to select it, move the mouse up to the toolbar, click &#8220;Not spam&#8221;, and wait for Gmail to process the action.</p></li></ol><p>These annoyances add up to make cleaning the spam folder a substantial chore. It&#8217;s a first-world problem for sure, but I&#8217;ve always felt there was room for improvement.</p><h2>My Personal Version of a Better Way</h2><p>Here&#8217;s the page I had Claude build:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!o648!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!o648!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png 424w, https://substackcdn.com/image/fetch/$s_!o648!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png 848w, https://substackcdn.com/image/fetch/$s_!o648!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png 1272w, https://substackcdn.com/image/fetch/$s_!o648!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!o648!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png" width="873" height="768" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:873,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!o648!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png 424w, https://substackcdn.com/image/fetch/$s_!o648!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png 848w, https://substackcdn.com/image/fetch/$s_!o648!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png 1272w, https://substackcdn.com/image/fetch/$s_!o648!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>To use this page, I scroll through and click the green checkmark for each message that isn&#8217;t spam. Then I click the big red Delete Spam button at the top, and it deletes all of the messages I didn&#8217;t flag. (Ignore the red question marks, they&#8217;re an unimportant detail.)</p><p>This addresses all four of my complaints:</p><ol><li><p>The &#8220;to&#8221; field is displayed. This allows me, for instance, to spot messages addressed to s.newman@gmail.com &#8211; which are certainly spam, as I never spell my email address that way. (I don&#8217;t know for sure why spammers target this address, I can only guess that they&#8217;re randomly generating plausible addresses. It reaches me because Google ignores periods in email addresses.)</p></li><li><p>The messages are sorted, creating large groups of similar messages that are easy to scan visually. For instance, here are a bunch of messages from Walgreens<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> that all arrived at different times, and so would have been scattered across the Gmail view:</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Jv6J!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Jv6J!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png 424w, https://substackcdn.com/image/fetch/$s_!Jv6J!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png 848w, https://substackcdn.com/image/fetch/$s_!Jv6J!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png 1272w, https://substackcdn.com/image/fetch/$s_!Jv6J!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Jv6J!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png" width="1263" height="768" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:1263,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Jv6J!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png 424w, https://substackcdn.com/image/fetch/$s_!Jv6J!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png 848w, https://substackcdn.com/image/fetch/$s_!Jv6J!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png 1272w, https://substackcdn.com/image/fetch/$s_!Jv6J!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol><li><p>The entire list is displayed, not just the first 50 messages. I can scroll without waiting for additional messages to load.</p></li><li><p>To mark a real message as not-spam takes just one click, with no waiting. (When I click the green checkmark, the message gets moved from the spam folder to my inbox in the background, without my having to wait for it.)</p></li></ol><p>This page isn&#8217;t pretty, but it&#8217;s streamlined and functional in a way that works for me. Other people might not like it &#8211; but that&#8217;s just fine. When software development becomes approximately free, it becomes worthwhile to build software for an audience of one.</p><p>This new spam interface saves me, like, 3 minutes per week. It&#8217;s not that big a deal. <strong>But that&#8217;s the point: coding agents are becoming so good that its worth my time to solve a three-minute-per-week problem.</strong> Over the course of a year, that adds up to a few hours<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>. It took well under an hour of my time to create, and most of that was for &#8220;dumb&#8221; reasons that will soon be ironed out. By the end of 2026, as the tools improve and I get better at using them, a project of this scope will probably take only a handful of minutes.</p><h2>Where Things Go From Here</h2><p>There are still kinks in the experience of bespoke software development. AI agents have become quite good at writing code, but there&#8217;s an art to knowing what to ask for and how to ask for it. Getting the code onto a server and giving it access to your data (such as, in the example above, my spam mailbox) adds further hurdles. I&#8217;ll talk about this and other limitations in upcoming posts. For now, the tools are most easily used by professional software developers, and aren&#8217;t as smooth as they could be. But a lot of people are working hard to change that.</p><p>In the meantime, early adopters are reporting dramatic results &#8211; and not all of these early adopters are software engineers. For instance, AI policy analyst Dean Ball <a href="https://www.hyperdimensional.co/p/dice-in-the-air">writes</a>:</p><blockquote><p>One year on from December 2024, models have become <em>fantastically </em>useful. As I have <a href="https://www.hyperdimensional.co/p/where-do-we-stand">discussed recently</a>, frontier coding agents, and especially Claude Opus 4.5, have essentially become autonomous software engineers. In just the last few weeks, the best models have done software engineering work for me that would have cost tens of thousands of dollars had I hired humans to do it.</p></blockquote><p>We&#8217;ve all spent our lives putting up with one-size-fits-all apps. We&#8217;re well trained at conforming our workflows to Gmail&#8217;s quirks, Excel&#8217;s limitations, Salesforce&#8217;s complexity. Now the cage door has been opened, but we&#8217;re not yet used to walking free. &#8220;The limiting factor is your imagination&#8221; is a trite thing to say, but it applies here: we will have to learn to mold software to our needs and tastes, rather than molding ourselves to the software.</p><p>People like Dean are entering a brave new world. That world is still a work in progress. But it&#8217;s going to progress quickly. For people who are leaning in, daily work life is going to look very different by the end of 2026 &#8211; a topic I&#8217;ll return to in a future post.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/software-too-cheap-to-meter?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/software-too-cheap-to-meter?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p><em>Thanks to Abi Olvera for suggestions, feedback, and images.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>I don&#8217;t patronize Walgreens, but their unsubscribe link is broken, so I keep getting their spam.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Obligatory reference: <a href="https://xkcd.com/1205/">XKCD 1205</a></p></div></div>]]></content:encoded></item><item><title><![CDATA[Presenting the Case That the Future Will Be Unrecognizable]]></title><description><![CDATA[We should prepare for the possibility of profound change]]></description><link>https://secondthoughts.ai/p/the-unrecognizable-age</link><guid isPermaLink="false">https://secondthoughts.ai/p/the-unrecognizable-age</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Fri, 19 Dec 2025 17:19:20 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!rp_d!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Leaders in the AI world are not shy about hyping the technology&#8217;s potential.</p><blockquote><p>&#8220;AI is probably the most important thing humanity has ever worked on. I think of it as something more profound than electricity or fire.&#8221; &#8211; <strong>Sundar Pichai</strong> (Google CEO)</p></blockquote><blockquote><p>&#8220;[T]he 2030s are likely going to be wildly different from any time that has come before.&#8221; &#8211; <strong>Sam Altman</strong> (OpenAI CEO, from &#8220;The Intelligence Age&#8221;)</p></blockquote><blockquote><p>&#8220;It&#8217;s quite conceivable that humanity is just a passing phase in the evolution of intelligence.&#8221; &#8211; <strong>Geoffrey Hinton</strong> (&#8221;Godfather of AI,&#8221; 2024 Nobel laureate)</p></blockquote><blockquote><p>&#8220;I think that most people are underestimating just how radical the upside of AI could be, just as I think most people are underestimating how bad the risks could be.&#8221; &#8211; <strong>Dario Amodei</strong> (CEO, Anthropic)</p></blockquote><p>What if we took these statements seriously?</p><p>I spend a lot of time in this blog arguing that AI&#8217;s near-term impact is overestimated, to the point where some people think of me as an AI skeptic. I think that predictions of massive change in the next few years are unrealistic. But as the saying goes, we tend to overestimate the effect of a technology in the short run, and underestimate it in the long run<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. Today, I&#8217;m going to address the flip side of the coin, and present a case that the long-term effect of AI could be very large indeed.</p><p>I think it&#8217;s entirely possible that the leaders I quoted above are right about the scale of transformation we should expect from AI. If they are, then we are heading for changes so profound that they may very well lead to what I&#8217;ll call the Unrecognizable Age.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hsp-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hsp-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hsp-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hsp-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hsp-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hsp-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg" width="1188" height="383" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:383,&quot;width&quot;:1188,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!hsp-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hsp-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hsp-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hsp-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>AI Is Moving Toward a Critical Threshold</h2><p>History is littered with predictions that everything is about to change, and of course, they&#8217;re usually wrong. But sometimes everything really does change. In the billions of years of life on Earth, no species ever created a technological civilization &#8211; until Homo Sapiens. Across the umpteen millennia since our species evolved into its current form, we were always limited to muscle power<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> &#8211; until the Industrial Revolution.</p><p>Dramatic changes are often the result of a threshold being crossed. A virus that infects an average of 0.9 victims quickly fizzles out; a virus that infects 1.1 victims can sweep through an entire population. Homo Sapiens became a big deal when we became intelligent enough to advance through culture rather than genetic evolution. Early agriculture didn&#8217;t have much impact until it enabled a tribe to obtain more food than they could by hunting and gathering.</p><p>Here&#8217;s a threshold AI may be approaching: it may soon be the first technology to be<em> more adaptable than we are.</em> It&#8217;s not there yet, but you can see it coming &#8211; the range of problems to which early adopters are successfully applying AI is simply exploding. Past inventions had limited impact, because they could only be adapted to some uses. But AI may (eventually) adapt itself to any task. When technology makes one job obsolete, people move into another &#8211; one that hasn&#8217;t been automated yet. But at some point, AI could be retraining faster than people can.</p><p>When that happens, it could well lead to a constellation of technologies powerful enough to usher in the Unrecognizable Age. This constellation would rest on three pillars:</p><ol><li><p>AI systems that are capable of doing pretty much anything that people can do, at the same or higher speed and quality &#8211; including &#8220;soft&#8221; skills such as judgement, taste, and style.</p></li><li><p>Robots that clear a similar bar (or nearly so) for physical tasks.</p></li><li><p>Advances in material science, manufacturing techniques, and related technologies that expand what we can accomplish in the physical world while reducing our footprint on the environment.</p></li></ol><p>I&#8217;ll call this <strong>Richter 10.0 AI</strong>, because it would register a 10.0 on Nate Silver&#8217;s <a href="https://thezvi.substack.com/i/148423291/just-think-of-the-potential">Technological Richter Scale</a> &#8211; above agriculture, the Industrial Revolution, and the atomic bomb. The only previous event to rate a 10 is the emergence of Homo Sapiens as the dominant species on Earth; this level of AI could be equally impactful.</p><p>I would not be surprised to see AI to cross the critical threshold &#8211; becoming more flexible and adaptable than human beings &#8211; in a matter of decades<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>, and the Unrecognizable Age to follow fairly soon thereafter.</p><h2>Why The Tipping Point Seems Near</h2><p><strong>We&#8217;re at the right point in history for everything to change</strong>. Here&#8217;s a graph of global economic output over the last two millennia:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!T7ut!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!T7ut!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!T7ut!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!T7ut!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!T7ut!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!T7ut!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg" width="1088" height="768" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:1088,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!T7ut!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!T7ut!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!T7ut!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!T7ut!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><em>Source: <a href="https://ourworldindata.org/grapher/global-gdp-over-the-long-run">https://ourworldindata.org/grapher/global-gdp-over-the-long-run</a></em></p><p>Economic growth has been <em>hyper-exponential</em> &#8211; the percentage rate of change has been increasing. As a result, the surplus available that isn&#8217;t needed for basic needs is far larger than at any point in history. Much of this surplus winds up being invested to drive technological advancement &#8211; hence the current astonishing level of investment in AI data centers. The coming decades sure look like a good candidate for being the point in history when we&#8217;ll have the resources to push AI over the threshold.</p><p><strong>The tech-industrial complex has tasted blood</strong>. A quorum of leadership and staff in Silicon Valley is convinced that AI is a multi-trillion-dollar opportunity (or more!) &#8211; and is unwilling to risk being left behind. As a result, the vast intellectual and financial might of the tech industry are being brought to bear. On current forecasts, <strong>within a few years the AI industry may be spending more (adjusted for inflation!) on data center construction than the US did to fight World War II</strong><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a>. Annual spending is already<strong> </strong>about<strong> </strong>15 times the peak of the Apollo program, and 25 times larger than the Manhattan project (though AI spending is not focused on a single effort in the way those projects were). Even if current approaches stall out, there are <em>already</em> multi-billion-dollar efforts to find alternatives<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a>. The techniques underlying Moore&#8217;s Law have hit a wall many times over the years; repeatedly, the industry found new paths forward. It seems likely that the AI industry now has similar momentum.</p><p>There is speculation that AI may experience a &#8220;bubble pop&#8221;. This could happen, and put a dent in the level of investment. But if so, it seems likely to be similar to the collapse of the dot-com bubble &#8211; which was only a minor setback to the advance of the web. Google and Amazon weathered the storm just fine.</p><p><strong>Progress in AI over the last 50 years suggests we are closer to the finish line than the starting line</strong>. Consider recent history in 25-year increments:</p><p>In 1950, computers were built from vacuum tubes, and couldn&#8217;t play a decent game of checkers.</p><p>In 1975, computers had only recently transitioned from individual transistors to integrated circuits<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a>. The forefront of AI included rule-based systems that were beginning to demonstrate capabilities in specific areas of medical diagnosis... but did not achieve real-world use.</p><p>In 2000, the cutting edge was shifting from specialized supercomputers to general-purpose CPU chips like Intel&#8217;s Pentium. IBM&#8217;s Deep Blue had recently beaten the world chess champion, speech recognition was just starting to become practical, and early algorithmic targeting systems like Google&#8217;s PageRank were emerging.</p><p>In 2025, AI has shifted from general-purpose CPUs to specialized multi-chip GPU modules, soon to be deployed in clusters that will consume the output of entire nuclear power plants. AIs are able to engage in erudite conversation on literally any subject, are surpassing elite human performance at math and coding competitions, and are generally leaping from strength to strength in ways I don&#8217;t need to belabor here.</p><p>Look back at 1975, and jump again to 2025. It seems difficult to claim that another 50 years won&#8217;t lead to full on, science-fiction-grade, indistinguishable-from-magic levels of AI. (Even without taking into account that AI progress has been accelerating!) Many people in the AI world think this will take much less than 50 years, and I think that&#8217;s quite plausible, but even if it&#8217;s 50 years away that&#8217;s still an extraordinary prospect.</p><blockquote><p>&#8220;I set the date for the Singularity&#8212;representing a profound and disruptive transformation in human capability&#8212;as 2045. The nonbiological intelligence created in that year will be one billion times more powerful than all human intelligence today.&#8221; &#8211; <strong>Ray Kurzweil</strong> (Google futurist, writing in 2005)</p></blockquote><p>This caliber of AI, in turn, should help accelerate progress in fields like material science, robotics, and manufacturing, rounding out the three pillars of Richter 10.0 AI.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><h2>But What About [Insert Reason AI Won&#8217;t Be a Big Deal]?</h2><p>There are, of course, objections to the idea that AI will transform the world. Regulations and job concerns will hold back deployment. If AI automates one job, the economy will shift to some other bottleneck. The things that are holding back global progress have more to do with politics and bad incentives than a shortage of intelligence. And so forth.</p><p>All of these things are true, and they will slow things down for a while &#8211; as I&#8217;ve often argued. But the key phrase is &#8220;for a while&#8221;. Eventually, if advanced AI exists in the world (and is not somehow under tight control), that will open up many new possibilities for getting around these barriers. Here&#8217;s a scenario I&#8217;ve thrown together to illustrate how that might happen.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rp_d!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rp_d!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp 424w, https://substackcdn.com/image/fetch/$s_!rp_d!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp 848w, https://substackcdn.com/image/fetch/$s_!rp_d!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp 1272w, https://substackcdn.com/image/fetch/$s_!rp_d!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rp_d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp" width="1024" height="775" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:775,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!rp_d!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp 424w, https://substackcdn.com/image/fetch/$s_!rp_d!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp 848w, https://substackcdn.com/image/fetch/$s_!rp_d!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp 1272w, https://substackcdn.com/image/fetch/$s_!rp_d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><em>In 2055, some company or nation launches a trillion-dollar initiative to deploy heavy industry in the asteroid belt, leveraging 30 years of (AI-assisted) advances in material science and other foundational technologies. Ten thousand advanced robots, plus equally sophisticated supporting equipment, land on Ceres and set up mining and manufacturing operations. Each year, they manufacture enough to double their equipment base. By the end of the first year, the ten thousand robots have built ten thousand more. By the end of the second year, twenty thousand robots have become forty thousand. In 2065, ten rounds of doubling have brought the total to 10 million, and advances in material science and fusion power (plus economies of scale) cut the doubling time in half &#8211; one robot can now produce another robot and its share of equipment in six months. By 2075, there are <strong>ten trillion robots</strong> honeycombing the (now noticeably depleted) 587-mile-wide asteroid.</em></p><p><em>All along, this robot army has been constructing the industrial base to reproduce itself... while also constructing data centers alongside the factories. In addition to ten trillion robots, Ceres now houses ten trillion disembodied digital minds, all engaged in advancing the frontiers of science and technology (and supported by a trillion robots diverted to serve as lab technicians). In round numbers, this exceeds human R&amp;D efforts by a factor of one million. Things begin to progress even more rapidly.</em></p><p>In this scenario, permitting requirements and environmental impact review won&#8217;t apply. No one will be losing their jobs, because no one is employed in developing an industrial base on Ceres. A self-sufficient robot economy, with vast intellectual resources and no internal politics, should be able to work around bottlenecks as they arise.</p><p>In reality, of course, something much more complicated and clever and stupid will happen. My point is simply that once AI crosses some threshold of adaptability and independence, there will be paths around the traditional barriers to change. And then things will <em>really</em> start to get weird.</p><h2>Implications of Richter 10.0 AI</h2><p>Many of the bedrock assumptions underlie economics, politics, and our fundamental intuitions would be invalidated by Richter 10.0 AI:</p><ul><li><p><strong>Our resources are limited to those available on Earth</strong>. Robots will have a far easier time exploring the solar system than delicate, precious human beings.</p></li><li><p><strong>We can scale capital but not labor</strong>. Historically, while we can always build more factories, the labor supply grows slowly (if at all). When AIs and robots supply labor, labor becomes scalable.</p></li><li><p><strong>Skills are expensive to replicate</strong>. People require years of training to acquire advanced skills; AIs can be duplicated in a matter of seconds.</p></li><li><p><strong>The rate of change is limited by the speed at which people can adapt</strong>. People need retraining. They have specific preferences and aptitudes, so can&#8217;t be arbitrarily reallocated from one role to another. They can&#8217;t (or shouldn&#8217;t) be hired and fired at a whim. AIs can be repurposed on demand.</p></li><li><p><strong>Most economic output is devoted to meeting people&#8217;s needs, not R&amp;D</strong>. Trillions of robots can enable a vast economy without having to support trillions of workers in a comfortable lifestyle.</p></li></ul><p>In the Ceres scenario, humanity&#8217;s collective R&amp;D efforts increase by a factor of one million. This could drive an inconceivable pace of advances across every field, which could be instantly deployed throughout the economy &#8211; or, at least, the automated portion of the economy, which will be literally astronomical in scale. And all of this would constantly feed back on itself. The resulting world could be unrecognizable indeed.</p><p>Of course, this assumes that we don&#8217;t stop the train. Some people propose that the world should collectively choose to not develop advanced AI. Or a major war or other global disaster could shatter the delicate supply chain for the cutting-edge chips that are powering AI progress. But if we don&#8217;t wind up stopping, big things may be in store. The prospect is simultaneously thrilling and terrifying.</p><h2>Looking Into the Unknown</h2><p>The Unrecognizable Age might go very very well, or very very badly. The following chart shows potential scenarios for future GDP growth, including business as usual, a &#8220;benign singularity&#8221;, and an AI extinction scenario. What fringe organization published this wild speculation? The Federal Reserve Bank of Dallas.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DkJF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DkJF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png 424w, https://substackcdn.com/image/fetch/$s_!DkJF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png 848w, https://substackcdn.com/image/fetch/$s_!DkJF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png 1272w, https://substackcdn.com/image/fetch/$s_!DkJF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DkJF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png" width="997" height="589" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:589,&quot;width&quot;:997,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!DkJF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png 424w, https://substackcdn.com/image/fetch/$s_!DkJF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png 848w, https://substackcdn.com/image/fetch/$s_!DkJF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png 1272w, https://substackcdn.com/image/fetch/$s_!DkJF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>If AI indeed &#8220;goes big&#8221;, the future might hold a galactic utopia, full of wonders beyond our imagining. Or it might be sterile, an orderly domain of joyless machines, perfectly optimized for... well, who knows what, but with no one alive to appreciate it, perhaps it wouldn&#8217;t matter.</p><p>Other scenarios are also possible, but in many of them, the changes would be profound. With trillions of AI scientists and technicians pushing forward in every field, it is difficult to place a limit on the possibilities. Perhaps the new reality will feel natural as we live through the changes day by day. But from my present-day vantage point, if I try to stare at this future for too long I go mindblind. The possibilities are too vast, the questions too unanswerable.</p><p>None of this is imminent. I discount claims that AI will revolutionize the world in the next few years. But I think there is a strong case that we will create Richter 10.0 AI within 50 years &#8211; conceivably much, much less. It&#8217;s time that we take seriously the possibility of an Unrecognizable Age. There are plenty of questions to be explored. We can promote beneficial applications of AI, and look for ways to minimize risks. We should certainly monitor progress, and build the institutional capacity to navigate change. In my work at the Golden Gate Institute for AI (including this blog), I&#8217;m trying to do my part.</p><p>No one knows exactly how to prepare the world for AI. But together, we can try to plot our course toward the light, even if we can&#8217;t stare directly into it.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/the-unrecognizable-age?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/the-unrecognizable-age?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Abi Olvera, Adam Pisoni, Jonathan Matus, Kristen Hamilton, Russ Heddleston, Taren Stinebrickner-Kauffman, Timothy Lee, and Tony Asdourian for suggestions and feedback.</em></p><p><em>Further reading on the implications of advanced AI:</em></p><ul><li><p><em><a href="https://davekasten.substack.com/p/the-strangeness-is-the-point">The Strangeness Is the Point</a></em></p></li><li><p><em><a href="https://secondthoughts.ai/p/implications-of-agi">We Need to Recognize How Profoundly Different The AGI Future Will Be</a></em></p></li></ul><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>It turns out that this adage has a name &#8211; <a href="https://en.wikipedia.org/wiki/Roy_Amara">Amara&#8217;s Law</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Well, muscle power plus a bit of wind power.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>By saying &#8220;decades&#8221;, rather than &#8220;a decade&#8221;, I do qualify as a skeptic in significant segments of the AI community!</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Estimates for US peak annual spending on World War II, in 2025 dollars: $1.15 trillion (Gemini 3), 1-1.2 trillion (Opus 4.5). AI capex estimates for 2025: $500 billion (Gemini 3), $450-500 billion (Opus 4.5; includes non-AI data centers). AI capex is projected to continue growing rapidly in the coming years, and could easily exceed the WWII figure before the end of the decade. For instance, a <a href="https://www.mckinsey.com/industries/technology-media-and-telecommunications/our-insights/the-cost-of-compute-a-7-trillion-dollar-race-to-scale-data-centers">McKinsey report</a> estimated $5.2 trillion in capital expenditures on AI alone (excluding non-AI data centers) between 2025 and 2030, which would entail well over $1 trillion /  year in some years.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>For instance, prominent AI researcher Ilya Sutskever&#8217;s new company, Safe Superintelligence, Inc. has already raised three billion dollars and. In a recent interview, Sutskever indicated that the company is using that funding strictly for research, and that &#8220;we are entering the age of research&#8221;, with many new paths being explored to unlock further AI capabilities.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>I&#8217;m simplifying here; the transition to integrated circuits was actually centered in the late 1960s.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Reflections on The Curve]]></title><description><![CDATA[A compilation]]></description><link>https://secondthoughts.ai/p/reflections-on-the-curve</link><guid isPermaLink="false">https://secondthoughts.ai/p/reflections-on-the-curve</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Thu, 20 Nov 2025 18:44:23 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!E269!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>We&#8217;ve already <a href="https://www.hyperdimensional.co/cp/176384354">shared</a> <a href="https://secondthoughts.ai/p/what-i-saw-around-the-curve">three</a> <a href="https://secondthoughts.ai/p/a-coalition-for-the-future">posts</a> about <a href="https://thecurve.goldengateinstitute.org/">The Curve 2025</a>, each by a different attendee. This final installment is a roundup of shorter reflections. Also: <a href="https://www.youtube.com/playlist?list=PLLRZZpQmhQb2vdz6qu9ZLalsoGsJqxC6P">videos of the recorded talks</a> are now live!</p><p>After the conference, we asked a few of the 350 attendees to share their perspective. Two themes emerged. First, as hoped, they came away with valuable new insights and relationships. Second, the specifics would have been impossible to predict in advance. (In the quotes that follow, bold emphasis was added by us, except where noted.)</p><h1>Helen Toner</h1><p><em>Interim Executive Director at Georgetown&#8217;s Center for Security and Emerging Technology</em></p><blockquote><p>For me, the essence of the Curve was kneeling by a firepit, marshmallow skewer in hand, having two conversations at once. One conversation: learning from a world-renowned researcher about his unconventional approach to building safe superintelligent AI systems, and how his optimism about the approach had (I was surprised and glad to hear) been growing as he and his grad students had dug into it over the last couple years. The other conversation: haranguing a younger (but also brilliant) researcher about his (seemingly false?!) claim to me the previous evening that bears can be trained to juggle 3 balls, which I swear had a serious connection to AI timelines and capabilities, but I refuse to explain what.</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!sHbB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sHbB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg 424w, https://substackcdn.com/image/fetch/$s_!sHbB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg 848w, https://substackcdn.com/image/fetch/$s_!sHbB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!sHbB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!sHbB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg" width="308" height="462.2889305816135" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/de07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1600,&quot;width&quot;:1066,&quot;resizeWidth&quot;:308,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!sHbB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg 424w, https://substackcdn.com/image/fetch/$s_!sHbB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg 848w, https://substackcdn.com/image/fetch/$s_!sHbB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!sHbB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>Come for the AI, stay for the s&#8217;mores.</em></figcaption></figure></div><h1>Sam Hammond</h1><p><em>Chief Economist at the Foundation for American Innovation</em></p><blockquote><p>The Curve is a conference about transformative AI, which is a topic that can easily become abstract or overly speculative. And yet the sessions and conversations I had at this year&#8217;s Curve were all surprisingly concrete and actionable. As AI gets more &#8220;real&#8221; as a technology and the full societal implications start to come into view, The Curve functions as a kind of focus dial on policymakers&#8217; binoculars. In essence, attending The Curve helps DC wonks see what SF technologists see, and thus develop the basis for joint intentionality.</p><p><strong>The biggest update for me was realizing just how much these different groups still have to learn from each other, and how low the barriers are to effective dialogue.</strong> One might think conservative think tankers, Hollywood actors, labor leaders, AI safety researchers, and members of the technical staff would all speak totally different languages. But at The Curve, you can find them jointly grappling with the risks and opportunities of transformative AI in their fullness. It&#8217;s a sight to behold!</p><p>Bottom-line: The Curve sits at the Pareto frontier of professionally useful and pure fun, making it <strong>the single best AI conference in a super competitive field</strong>. It is easily the highlight of my year.</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!T4hW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!T4hW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png 424w, https://substackcdn.com/image/fetch/$s_!T4hW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png 848w, https://substackcdn.com/image/fetch/$s_!T4hW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png 1272w, https://substackcdn.com/image/fetch/$s_!T4hW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!T4hW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png" width="1456" height="545" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:545,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!T4hW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png 424w, https://substackcdn.com/image/fetch/$s_!T4hW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png 848w, https://substackcdn.com/image/fetch/$s_!T4hW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png 1272w, https://substackcdn.com/image/fetch/$s_!T4hW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!xbes!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!xbes!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!xbes!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!xbes!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!xbes!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!xbes!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg" width="610" height="406.3873626373626" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:610,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!xbes!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!xbes!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!xbes!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!xbes!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The rationalist den of iniquity</figcaption></figure></div><h1>Jack Clark</h1><p><em>Head of Policy at Anthropic</em></p><blockquote><p>There&#8217;s a saying in politics called &#8220;the horseshoe theory&#8221;, which is that if you go to the far enough to the end of either political spectrum you end up with positions that are closer than those at the center. One thing I learned at The Curve is that this horseshoe theory might show up in an important way for AI safety. Very distinct communities (Bay Area alignment types, and the hard right) have a similar set of concerns (&#8221;AI systems might screw up society so much that we should be very careful in how they&#8217;re deployed&#8221;), meaning they could potentially form a coalition with one another. <strong>This, to me, illustrates what is so valuable about The Curve - it brings people together with radically different ideologies and worldviews and allows them to talk to one another until they identify surprising areas of overlap.</strong></p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1QiY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1QiY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png 424w, https://substackcdn.com/image/fetch/$s_!1QiY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png 848w, https://substackcdn.com/image/fetch/$s_!1QiY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png 1272w, https://substackcdn.com/image/fetch/$s_!1QiY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1QiY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png" width="1456" height="428" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:428,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!1QiY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png 424w, https://substackcdn.com/image/fetch/$s_!1QiY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png 848w, https://substackcdn.com/image/fetch/$s_!1QiY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png 1272w, https://substackcdn.com/image/fetch/$s_!1QiY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ujRV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ujRV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ujRV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ujRV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ujRV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ujRV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg" width="610" height="406.3873626373626" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:610,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ujRV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ujRV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ujRV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ujRV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>A full house at Jack Clark&#8217;s keynote</em></figcaption></figure></div><p>Speaking of horseshoe politics&#8230;</p><h1>Joe Allen</h1><p><em>Transhumanism Editor at the WAR Room</em></p><blockquote><p>The Curve is an incubator for ideological mutants. The event is somewhere between a West Coast potluck and an artificial womb facility with rows of mind children staring out at you from bubbling vats. I arrived at Lighthaven expecting some sort of memetic promiscuity&#8212;perhaps a veritable orgy of psychic miscegenation&#8212;so I protected my brain with neural prophylaxis and tried to avoid thinking too much. Even with that tinfoil chafing my scalp, I managed to enjoy myself. <strong>Something about the vibe made learning irresistible.</strong></p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QU0K!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QU0K!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!QU0K!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!QU0K!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!QU0K!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QU0K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg" width="610" height="406.3873626373626" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:610,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!QU0K!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!QU0K!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!QU0K!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!QU0K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Golden Gate CEO Taren Stinebrickner-Kauffman staring at Joe Allen&#8217;s tinfoil hat</figcaption></figure></div><p>Attendees consistently commented on the range of people they encountered:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZSYE!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZSYE!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png 424w, https://substackcdn.com/image/fetch/$s_!ZSYE!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png 848w, https://substackcdn.com/image/fetch/$s_!ZSYE!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png 1272w, https://substackcdn.com/image/fetch/$s_!ZSYE!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZSYE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png" width="1456" height="407" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:407,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ZSYE!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png 424w, https://substackcdn.com/image/fetch/$s_!ZSYE!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png 848w, https://substackcdn.com/image/fetch/$s_!ZSYE!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png 1272w, https://substackcdn.com/image/fetch/$s_!ZSYE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!TWza!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!TWza!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png 424w, https://substackcdn.com/image/fetch/$s_!TWza!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png 848w, https://substackcdn.com/image/fetch/$s_!TWza!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png 1272w, https://substackcdn.com/image/fetch/$s_!TWza!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!TWza!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png" width="1456" height="355" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:355,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!TWza!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png 424w, https://substackcdn.com/image/fetch/$s_!TWza!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png 848w, https://substackcdn.com/image/fetch/$s_!TWza!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png 1272w, https://substackcdn.com/image/fetch/$s_!TWza!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h1>Alexandra Givens</h1><p><em>President and CEO of the Center for Democracy and Technology</em></p><blockquote><p>My goal for The Curve was simple: to get more people to care about how increasingly personalized AI systems will impact people&#8217;s privacy. I view this as one of the most central (and overlooked) issues facing our field; with implications for people&#8217;s personal rights and freedoms, competition in the AI market, and the future of how we access information about the world around us. <strong>I wasn&#8217;t expecting to find such common ground with leaders from the right and left.</strong> It turns out almost everyone is troubled by the vast amount of information an AI assistant will know about you, especially once it&#8217;s connected to your email, your calendar, even the sensors in your home. And they get that when companies hold this data, governments seek to access it too &#8211; raising concerns for civil liberties and human rights around the world. There was wide agreement that we didn&#8217;t get things right in internet privacy, and companies and lawmakers will need to do better. So, <strong>we hold shared concerns. Can that translate to shared paths forward to address them?</strong></p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!E269!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!E269!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!E269!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!E269!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!E269!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!E269!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg" width="610" height="406.3873626373626" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:610,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!E269!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!E269!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!E269!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!E269!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1>Sabrina Ross</h1><p><em>Privacy and AI Policy Director at Meta</em></p><blockquote><p><strong>Part of what makes this conference so special is how intentionally it brings people with wildly different perspectives into real conversation.</strong> In a single afternoon, I found myself talking first with someone convinced that the future of education belongs to AI agents &#8211; and moments later with the head of the American Federation of Teachers, leading the National Academy for AI Instruction, who&#8217;s deeply committed to keeping human educators in the driver&#8217;s seat.</p><p>The most piercing interrogations of AI aren&#8217;t coming from engineers, but from political scientists, educators, labor leaders, and ethicists &#8211; in deep conversation with tech innovators &#8211; asking what we&#8217;re really optimizing for &#8211; and who gets to decide what &#8220;alignment&#8221; means.</p><p>Over the course of the Curve I also realized that the so-called doomers and bloomers &#8211; those who see AI as an existential threat versus those who see it as a chance for extraordinary progress &#8211; often start from the same set of facts. Both recognize that AI is already reshaping everything from cybersecurity (where models like Claude are winning competitions) to education. What really separates all of us on this spectrum is our sense of pace, control, and trust. Even so, many of our solution sets are highly overlapping: For the gloomer, transparency matters because it&#8217;s the key to catching early signs of misalignment or danger. For the bloomer, transparency is what builds trust, adoption, and collaboration between people and AI systems. <strong>What struck me most at the Curve was how close these &#8220;opposites&#8221; actually are &#8211; rooted in a shared belief that the path forward is one built on openness and meaningful interrogation by those with expertise outside of tech.</strong></p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!e_rD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!e_rD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!e_rD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!e_rD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!e_rD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!e_rD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg" width="610" height="406.3873626373626" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:610,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!e_rD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!e_rD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!e_rD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!e_rD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1>Jedidah Isler</h1><p><em>Chief Science Officer at the Federation of American Scientists</em></p><blockquote><p>It was a delight to join in the vibrant and thought-provoking discussions that happened at The Curve this year. <strong>As a policy person, I appreciated the vast diversity of perspectives presented from across the AI research community</strong>, particularly as it relates to the AI scaling curve. ... For better or for worse, either scaling curve is far faster than the speed of government, so <strong>we can use spaces like this to figure out how best to pursue best outcomes with the technology that also produces best outcomes for our collective well-being, economic security and global competitiveness.</strong> I hope to come back to learn and share even more!</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!b5xo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!b5xo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!b5xo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!b5xo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!b5xo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!b5xo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg" width="610" height="406.3873626373626" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:610,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!b5xo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!b5xo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!b5xo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!b5xo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1>Randi Weingarten</h1><p><em>President of the American Federation of Teachers</em></p><blockquote></blockquote><blockquote><p>After my talk, I heard from a number of participants about new ideas regarding AI safety, which we hope to integrate into future trainings given to union members. <strong>Having participants with such broad expertise helped facilitate this kind of cross-pollination, which is so desperately needed.</strong> It was wonderful to hear other ideas about how we can help advance student privacy and AI safety, from experts within that industry particularly in light of the refusal of the federal government to protect our youth. I heard dozens of concrete ideas, from those working at AI companies and in civil society, for approaches that we had not previously considered. I&#8217;m looking forward to seeing what we can do together to implement these suggestions in trainings and policy.</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hhkC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hhkC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hhkC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hhkC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hhkC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hhkC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg" width="610" height="406.3873626373626" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:610,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!hhkC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hhkC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hhkC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hhkC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>Casey Newton interviews Randi Weingarten</em></figcaption></figure></div><h1>Other writing about The Curve</h1><ul><li><p><a href="https://thezvi.substack.com/p/bending-the-curve">Bending The Curve</a> &#8211; AI infovore Zvi Mowshowitz&#8217;s comprehensive review. &#8220;...every (substantive) conversation I had made me feel smarter.&#8221;</p></li><li><p><a href="https://afraw.substack.com/p/the-room-where-ai-happens">The Room Where AI Happens</a> &#8211; in her newsletter, Concurrent, Afra Wang presents some central themes she observed. &#8220;This piece is written for readers outside the room&#8212;people without technical backgrounds, without AI insider status, who aren&#8217;t already &#8216;where it happens.&#8217;&#8221;</p></li><li><p><a href="https://www.transformernews.ai/p/were-all-behind-the-curve-ai-bubble-crash-risk">We&#8217;re all behind The Curve</a> &#8211; Shakeel Hashim kicks off the weekly Transformer newsletter with some thoughts on the conference, and lessons that need to be shared with the broader world. &#8220;...the distance between The Curve and the rest of the world watching and waiting for a crash remains vast.&#8221;</p></li><li><p><a href="https://www.interconnects.ai/p/thoughts-on-the-curve">Thoughts on The Curve</a> &#8211; Nathan Lambert, a researcher at the Allen Institute for AI and noted analyst and proponent of open-source models, covers some things he learned.</p></li><li><p><a href="https://www.understandingai.org/p/ai-skeptics-and-ai-boosters-are-both">AI Skeptics and AI Boosters are Both Wrong</a> &#8211; Timothy Lee, a reporter who writes at Understanding AI, expands on what he sees as the core divide in the AI discourse.</p></li></ul><p>Thanks to everyone who came and made it great! And to constructive critics like Boaz who keep us aiming higher:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!AQab!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!AQab!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png 424w, https://substackcdn.com/image/fetch/$s_!AQab!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png 848w, https://substackcdn.com/image/fetch/$s_!AQab!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png 1272w, https://substackcdn.com/image/fetch/$s_!AQab!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!AQab!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png" width="1456" height="436" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:436,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!AQab!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png 424w, https://substackcdn.com/image/fetch/$s_!AQab!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png 848w, https://substackcdn.com/image/fetch/$s_!AQab!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png 1272w, https://substackcdn.com/image/fetch/$s_!AQab!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Stay tuned for details on The Curve 2026!</p>]]></content:encoded></item><item><title><![CDATA[Hyperproductivity: The Next Stage of AI?]]></title><description><![CDATA[A glimpse at an astonishing, exhilarating, exhausting new style of work]]></description><link>https://secondthoughts.ai/p/hyperproductivity</link><guid isPermaLink="false">https://secondthoughts.ai/p/hyperproductivity</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Wed, 19 Nov 2025 23:42:01 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!y3iz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!y3iz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!y3iz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp 424w, https://substackcdn.com/image/fetch/$s_!y3iz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp 848w, https://substackcdn.com/image/fetch/$s_!y3iz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp 1272w, https://substackcdn.com/image/fetch/$s_!y3iz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!y3iz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp" width="1232" height="928" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/af312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:928,&quot;width&quot;:1232,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!y3iz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp 424w, https://substackcdn.com/image/fetch/$s_!y3iz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp 848w, https://substackcdn.com/image/fetch/$s_!y3iz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp 1272w, https://substackcdn.com/image/fetch/$s_!y3iz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Recently, I&#8217;ve been hearing of a new phenomenon: teams reportedly using agentic AI tools to &#8220;enter takeoff&#8221; &#8211; achieving astounding feats of productivity that escalate each week, with no limit in sight.</p><p>These teams have four things in common:</p><ol><li><p>They are aggressively using AI to accelerate their work.</p></li><li><p>They aren&#8217;t just using off-the-shelf tools like ChatGPT or Claude Code. They&#8217;re building bespoke productivity tools customized for their personal workflows &#8211; and, of course, they&#8217;re using AI to do it.</p></li><li><p>Their focus has shifted from <em>doing</em> their jobs to <em>optimizing</em> their jobs. Each week, instead of delivering a new unit of work, they deliver a new improvement in productivity.</p></li><li><p><strong>Their work builds on itself: they use their AI tools to improve their AI  tools, and the work they&#8217;re optimizing includes the optimization process.</strong></p></li></ol><p>The classic scenario for AI ascending to superintelligence involves &#8220;recursive self-improvement&#8221;, where an AI builds a smarter AI, which builds an even smarter AI, and so on. These stories of teams entering takeoff are not quite that, because there is still a human in the loop, but they have a similar flavor. If the singularity ever arrives, the early stages might look just like this.</p><h2>What Aggressive Use of AI Looks Like</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bnIs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bnIs!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png 424w, https://substackcdn.com/image/fetch/$s_!bnIs!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png 848w, https://substackcdn.com/image/fetch/$s_!bnIs!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png 1272w, https://substackcdn.com/image/fetch/$s_!bnIs!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bnIs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png" width="1026" height="444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:444,&quot;width&quot;:1026,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:162045,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/179390048?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!bnIs!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png 424w, https://substackcdn.com/image/fetch/$s_!bnIs!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png 848w, https://substackcdn.com/image/fetch/$s_!bnIs!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png 1272w, https://substackcdn.com/image/fetch/$s_!bnIs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">One of many businesses Liu Xiaopai is running simultaneously</figcaption></figure></div><p>Afra Wang recently published the fascinating<a href="https://afraw.substack.com/p/story-of-a-chinese-vibe-coder"> Story of A Beijing Vibe Coder</a>. This is her account of Liu Xiaopai, a Chinese programmer who is using AI tools to crank out product after product. Working mostly on his own, he currently has &#8220;one or two dozen revenue-generating products&#8221; and reports clearing over $1,000,000/year in profit. By contrast, a typical startup requires many people to build and maintain a single product.</p><p>Liu Xiaopai isn&#8217;t just using AI tools to write code. He&#8217;s automating, as the article says, &#8220;the entire product lifecycle&#8221;. For instance, once he has an idea for a new product, he uses Claude Code<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> to brainstorm product names and find one for which the domain name is available:</p><blockquote><p><strong>Afra:</strong> Was there a specific moment when you realized Claude had fundamentally transformed your working methods?</p><p><strong>Liu Xiaopai:</strong> Too many to count. Building software products for overseas markets&#8212;programming actually represents only a small fraction.</p><p>Look at any normal internet company in China: engineers comprise just 20-30%. What about the remaining 70%? They are design, product management, testing, operations, user growth&#8212;these roles require substantial human resources, and nearly all follow standardized operating procedures (SOPs).</p><p>After using AI to solve the programming component, I faced a larger challenge: how to automate all the work with standardized processes? After entering July, I spent more time with Claude Code on non-programming tasks. Here&#8217;s a simple use case that was previously impossible: I need to launch a new product, which requires naming it and registering a website domain. Let&#8217;s say I build the product Manus, with the domain<a href="http://manus.ai"> Manus.ai</a>. Sounds simple&#8212;but this having a product name and domain means many steps, and each step was hard to automate before. You&#8217;d brainstorm names, then manually check domain availability one by one; it&#8217;s too time-consuming. <strong>How about using AI to generate 10,000 viable domains and instantly determine which are available for registration? Thus, determine the product name?</strong></p><p>I would begin by writing product requirements documentation. Then I&#8217;d list all the competitors, noting how each differs from mine and what features I want to emphasize. I write an extensive description, and then have Claude synthesize this information to auto-generate 10,000 appropriate domains and query their registration status. Then I can go to sleep. Five or six hours later, the results arrive. I automate many SOPs like this.</p><p>[emphasis added]</p></blockquote><p>Liu Xiaopai describes how he uses AI for many parts of his work &#8211; not just coding. But hyperproductive teams aren&#8217;t just using AI &#8211; they&#8217;re building their own AI-powered tools.</p><h2>Building Custom Productivity Tools</h2><p>It is not surprising that the individuals who are finding the most aggressive ways to use AI are mostly software engineers. Programmers have a notorious fascination with automating, not just their customer&#8217;s work, but their own. As DEC engineer Dick Sites famously said, &#8220;I&#8217;d rather write programs to write programs than write programs&#8221;. Today, this translates into engineers using AI to build custom tools tuned to their personal workflow.</p><p>Entrepreneur Jesse Vincent recently published<a href="https://blog.fsck.com/2025/10/05/how-im-using-coding-agents-in-september-2025/"> a pair</a> of<a href="https://blog.fsck.com/2025/10/09/superpowers/"> blog posts</a> describing his own approach to automation, also based on Claude Code. <strong>He has been systematically incorporating his best practices into a custom tool, reducing the amount of effort he has to spend in prompting the AI.</strong></p><p>Jesse&#8217;s tool presents Claude<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> with a series of prompts that instruct it to approach each task according to a careful plan:</p><ol><li><p>Ask clarifying questions to refine the task definition.</p></li><li><p>Sketch a design, and present it in short installments, asking for feedback after each installment.</p></li><li><p>Generate a detailed plan for implementing this design.</p></li><li><p>Review the plan to verify that it matches the task definition and design.</p></li><li><p>Carry out the plan, a few steps at a time. After every few steps, double-check the progress so far against the design.</p></li></ol><p>I&#8217;ve left out a lot of details, but the point is that this process breaks up each project into tasks that the AI agent can manage on its own, and inserts ample cross-checks where the agent is able to refine its own work. As yet another cross-check, the workflow instructs the agent to use a technique known as &#8220;test-driven development&#8221;. This means that before the agent writes any code, it first creates <em>test</em> code that will evaluate whether the actual code is working correctly. If the tests don&#8217;t pass, the agent will revise its work until they do.</p><p>All this is augmented by a carefully crafted, 1600-word<a href="https://raw.githubusercontent.com/obra/dotfiles/6e088092406cf1e3cc78d146a5247e934912f6f8/.claude/CLAUDE.md"> instruction file</a>, densely packed with directives, tips, and tricks to nudge Claude into the habits Jesse has found make it most productive. Some examples:</p><blockquote><p>If you&#8217;re having trouble, YOU MUST STOP and ask for help, especially for tasks where human input would be valuable.</p><p>NEVER add comments explaining that something is &#8220;improved&#8221;, &#8220;better&#8221;, &#8220;new&#8221;, &#8220;enhanced&#8217;, or referencing what it used to be. [Presumably he found Claude to clutter its code with these notes.]</p><p>YOU MUST ALWAYS find the root cause of any issue you are debugging. YOU MUST NEVER fix a symptom or add a workaround instead of finding a root cause, even if it is faster or I seem like I&#8217;m in a hurry.</p><p>YOU MUST use the journal tool frequently to capture technical insights, failed approaches, and user preferences.</p></blockquote><p>I haven&#8217;t really done Jesse&#8217;s work justice in this brief description. You can read more in the two blog posts linked above, and a<a href="https://github.com/obra/superpowers"> GitHub repository</a> in which he has published some of the work. The repository includes a rich suite of &#8220;skills&#8221; that tell Claude how to do things like brainstorming, systematically identifying the cause of a problem, and responding to feedback.</p><p>Getting the best results from AI requires expert prompting skills; Jesse is codifying his best prompts so that he doesn&#8217;t need to keep entering them by hand, and so that the system can operate for longer without his intervention. But that&#8217;s still not the full story: <strong>true hyperproductivity also requires closing the loop of tool creation</strong>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><h2>It&#8217;s Tools All the Way Up</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9YzF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9YzF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png 424w, https://substackcdn.com/image/fetch/$s_!9YzF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png 848w, https://substackcdn.com/image/fetch/$s_!9YzF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png 1272w, https://substackcdn.com/image/fetch/$s_!9YzF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9YzF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png" width="1456" height="1210" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1210,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:3541113,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/179390048?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!9YzF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png 424w, https://substackcdn.com/image/fetch/$s_!9YzF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png 848w, https://substackcdn.com/image/fetch/$s_!9YzF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png 1272w, https://substackcdn.com/image/fetch/$s_!9YzF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>One of the things that Jesse uses his productivity tools for is to improve those very tools:</p><blockquote><p>One of the first skills I taught Superpowers was<a href="https://raw.githubusercontent.com/obra/superpowers-skills/35c29f0fe22881149a991eca1276c148567a7c29/skills/meta/writing-skills/SKILL.md"> How to create skills</a>. That has meant that when I wanted to do something like add git worktree workflows [a technique for managing multiple projects in parallel] to Superpowers, it was a matter of describing how I wanted the workflows to go&#8230; and then Claude put the pieces together and added a couple notes to the existing skills that needed to clue future-Claude into using worktrees.</p></blockquote><p><strong>It&#8217;s hard to overstate just how self-referential this is</strong>. For instance, when Jesse uses his &#8220;Superpowers&#8221; tool to codify a skill, the tool uses its test-driven development module to verify that the new skill has been implemented correctly. It generates an example of a task that the new skill is meant to help with, verifies that it is unable to complete that task without the new skill, and then checks to see whether it <em>can</em> complete the task once the new skill has been installed. This is a very sophisticated approach, but it all falls out of the understanding of test-driven development that Jesse has already taught his agent:</p><blockquote><p>As Claude and I build new skills, one of the things I ask it to do is to &#8220;test&#8221; the skills on a set of subagents to ensure that the skills were comprehensible, complete, and that the subagents would comply with them. (Claude now thinks of this as TDD [Test-Driven Development] for skills and <em>uses</em> its RED/GREEN TDD skill as part of the skill creation skill.)</p></blockquote><p>He&#8217;s not the only engineer pursuing this path. In<a href="https://paradox921.medium.com/amplifier-notes-from-an-experiment-thats-starting-to-snowball-ef7df4ff8f97"> Amplifier: Notes from an experiment that&#8217;s starting to snowball</a>, Microsoft engineer Brian Krabach describes<a href="https://github.com/microsoft/amplifier"> Amplifier</a>, a rich toolkit built to facilitate the practice of moving advanced skills out of the programmer&#8217;s head and into a tool. To quote from the project&#8217;s home page:</p><blockquote><p>Amplifier is a coordinated and accelerated development system that turns your expertise into reusable AI tools without requiring code. Describe the step-by-step thinking process for handling a task&#8212;a &#8220;metacognitive recipe&#8221;&#8212;and Amplifier builds a tool that executes it reliably. As you create more tools, they combine and build on each other, transforming individual solutions into a compounding automation system.</p></blockquote><p>In his blog post, Brian (the lead engineer) explains how Amplifier is used to improve Amplifier:</p><blockquote><p>Out of the box, AI coding assistants are strong but context-blind. They don&#8217;t remember your patterns, don&#8217;t carry forward your prior decisions, and they need hand-holding on anything ambitious.<strong> </strong>&#8230; <strong>[Amplifier includes] tooling that can self-improve the system</strong> from observations where users have guided the assistant, reducing the need for the user to do so in the future.<br><br>...<strong>We build tools that build tools.</strong> A lot of Amplifier&#8217;s scenarios are literally tools that improve Amplifier - transcript processors, knowledge extractors, article illustrators, blog writers - each created from a &#8220;describe the thinking, then build it&#8221; flow. Over time, the environment gets smarter.<br><br>[emphasis added]</p></blockquote><p>Paul Payne is another engineer on Microsoft&#8217;s Amplifier team. He believes we are witnessing a profound change in the practice of software development. Last month, in a blog post titled<a href="https://payne.io/posts/historical-artifact/"> My Programming Career is a Historical Artifact</a>, he wrote:</p><blockquote><p><strong>This won&#8217;t be </strong><em><strong>the</strong></em><strong> thing that replaces me as a programmer, but it convinces me that our timeline is months, not years.</strong></p><p>With Amplifier, you now have the ability to describe what you want and it will build it for you, from design, to backend, to frontend, including testing, according to your own programming philosophy. But this isn&#8217;t the main point. The main point is that <strong>the more you use it, the better it gets. You create reusable tools over time. It captures useful techniques. It learns from your usage of it, suggesting ways to improve </strong><em><strong>itself</strong></em><strong>. This is a type of exponential productivity. The things you build help you make things faster.</strong></p><p>&#8230;</p><p>All of the programming skills I described above, the skills I developed over a lifetime, can now be accomplished by tools like Amplifier with the right prompting. The final stretch is to wrap up all this expertise and hand it over to the AI and then anyone, with no programming experience, will be able to have AI build whatever software they need, as they need it.</p><p>[emphasis added]</p></blockquote><p>I know of at least three examples of teams who are working in this style, building tools which build tools to improve their tool-building tools. Sam Schillace, manager of the Amplifier team (and my Google Docs co-founder) calls them <a href="https://sundaylettersfromsam.substack.com/p/i-have-seen-the-compounding-teams">compounding teams</a> &#8211; and they all report astonishing productivity that continues to spiral upward. Is this &#8220;it&#8221;? Are we witnessing the dawn of the Singularity?</p><h2>These Techniques Will Have Limits &#8211; For Now</h2><p>It&#8217;s clear that people like Liu Xiaopai, Jesse Vincent, Brian Krabach, Paul Payne, and Sam Schillace are experiencing something profound. And I know of other examples that aren&#8217;t yet public. Some people are proclaiming the end of the era in which human beings write code, or even review it. At the same time, I<a href="https://secondthoughts.ai/p/a-project-is-not-a-bundle-of-tasks"> recently reiterated</a> my belief that today&#8217;s cutting-edge AIs are still missing important cognitive capabilities, and that full automation of software engineering is years away. Do I need to throw that idea out the window?</p><p>My confidence is shaken, but not yet broken. I believe that what these developers are experiencing is real, but I don&#8217;t think they&#8217;ve &#8220;solved&#8221; programming (nor do I think any of them have claimed to have done so). Here are some reasons that I&#8217;m tempering my expectations.</p><p><strong>They&#8217;re all working on new, small projects</strong> (so far as we know). Software developers mostly spend their time extending and maintaining large, mature projects. I don&#8217;t just mean behemoths like Microsoft Office; even &#8220;upstarts&#8221; like Slack, Notion, or Figma have long since grown into complex codebases. To say nothing of the great mass of legacy software that runs inside your Toyota automobile or behind the scenes at United Airlines. The oldest and largest project the Amplifier team has reported working on is Amplifier itself. If these techniques are being applied successfully in larger, messier environments, I haven&#8217;t heard about it yet.</p><p><strong>I don&#8217;t expect these techniques to reliably handle complex, novel challenges</strong>. When asked to undertake a substantial project, current AI models can<a href="https://secondthoughts.ai/p/gpt-5-the-case-of-the-missing-agent"> go off the rails in all sorts of ways</a>. They fall down a rabbit hole investigating an unimportant detail, persist in attempting an unworkable approach, or hallucinate an incorrect understanding of their situation that renders all of their future efforts counterproductive. The systems these hyperproductive engineers are building incorporate mechanisms to keep their agents on track, but those mechanisms will only go so far. For instance, instructing an agent to review its own work can lead it to correct obvious errors, but there&#8217;s no guarantee that it will spot subtle mistakes or failures of judgement. It might even find unnecessary things to complain about, creating more work for itself.</p><p><strong>We&#8217;re witnessing a selection effect</strong>. These hyperproductive teams are, quite naturally, gravitating toward the kinds of tasks that best match the capabilities of their tools. We shouldn&#8217;t expect the current generation of those tools to provide the same level of productivity boost for other tasks, even if it&#8217;s not yet clear what sorts of tasks fall into each category. Nor should we expect just anyone to be able to follow in their footsteps; it takes a special sort of mind to use tools to instruct tools to improve their tools. (Yet another reason that<a href="https://secondthoughts.ai/p/uneven-impact-of-ai"> the future is already here, it&#8217;s just not evenly distributed</a>.)</p><p><strong>Small teams having a big impact is not a new phenomenon</strong>. Instagram was only 13 people on the occasion of its billion-dollar acquisition by Facebook. When we see these early examples of exceptional teams achieving exceptional things using AI, <strong>the key ingredient might be the exceptional teams</strong>.</p><p><strong>Everything is overestimated in the short run</strong>. When an exciting new phenomenon appears on the scene, it usually takes longer than we think for it to have a broad impact. Though <a href="https://edisonscientific.com/articles/announcing-edison-scientific">this process may run faster in the AI era</a>:</p><blockquote><p>It is often said that people overestimate what they can do in 2 years and underestimate what they can do in 10. These days, with the extraordinary pace of AI research, my impression is that we may actually overestimate what we can do in 6 months, and underestimate what we can do in 2 years.</p></blockquote><p>These are my reasons for expecting hyperproductivity to have limits, at least for a while. But even so, it may have a significant impact. We should strive to learn more about the scope of that impact.</p><h2>Hyperproductivity May Struggle to go Mainstream</h2><p>One thing I&#8217;ve heard repeatedly to describe this style of work is &#8220;exhausting&#8221;. Much of the productivity boost comes from being able to pursue several sub-projects at once, each using a separate AI agent. The human role is to be a simultaneous manager, tutor, and genetic engineer for a squad of tireless, but sometimes clueless, agents. Each agent needs to be kept busy with tasks, and those tasks need to be coordinated so as to prevent one agent from interfering with another&#8217;s work. At the same time, the hyperproductive worker is constantly evaluating their every move (and every move taken by their agents) to see whether it could be done more efficiently.</p><p>Perhaps the most exhausting aspect of this work is that it&#8217;s constantly evolving. You can never settle into a routine, because those routines are exactly what gets automated. Each step up the productivity ladder means a new way of working and a new set of skills to acquire.</p><p><strong>This may be a difficult transition for most people</strong>. Delegating all of your direct work to an AI is a major transition. Shifting your focus from doing the work to optimizing tools and workflows is another transition. Managing multiple agents is stressful &#8211; if exhilirating. All of this requires a new mindset and a different set of skills.</p><p>Software engineers at least start out with experience working with automation and, often, customizing their tools. The challenge may be even greater for people with other backgrounds. Hyperproductivity may not be limited to programmers, but that&#8217;s where the early examples have been found (even if they are sometimes using these techniques for tasks other than programming).</p><h2>What To Watch For</h2><p>As I mentioned at the outset, the new breed of hyperproductive teams have three things in common. They&#8217;re building lots of bespoke tools. They&#8217;re letting AIs do all of the direct work, reserving their own efforts to specify what should be done and to improve the tools. And they are achieving a compounding effect, using their tools to improve their tool-improving tools. The net effect is what I&#8217;m calling &#8220;hyperproductivity&#8221;:</p><blockquote><p><strong>A hyperproductive individual does not do their job; they delegate that to AI. They spend their time optimizing the AI to do their job better.</strong></p><p><strong>A hyperproductive individual may also spend time </strong><em><strong>deciding what the AI should do,</strong></em><strong> but that represents a failure to fully delegate</strong>.</p></blockquote><p>There are plenty of open questions. How large is the productivity boost? (It will certainly vary by team and task.) What is the range of situations for which AI systems have the needed capabilities? How many people will gravitate toward this style of work? What skills are required, how many people have them, how many can learn them, and how long will that take?</p><p>If this is going to be more than a niche phenomenon, we should expect to see the following things:</p><ol><li><p>A dramatic increase in the number of hyperproductive teams<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>.</p></li><li><p>In particular, a shift toward teams using &#8220;off the shelf&#8221; hyperproductivity frameworks (like Amplifier), rather than building their own recursive toolkits from scratch.</p></li><li><p>This approach being applied in a broad range of situations, some with large-scale impact; not just tiny teams working on greenfield projects and well-defined tasks.</p></li><li><p>Potentially, broad usage beyond software engineering.</p></li></ol><p>It will be very interesting to see whether this approach takes root inside the frontier AI labs. (Of course that might be happening already. For what it&#8217;s worth, the handful of anecdotal reports I have suggest not.) Plausibly, this is how an &#8220;intelligence explosion&#8221; would first manifest.</p><p>We should especially watch to see whether Claude Code, OpenAI&#8217;s Codex, or Google&#8217;s Jules are evolving to support the kinds of skill-accumulation and self-improvement techniques the hyperproductive teams are using, and <em>in particular</em> for signs that the frontier AI labs are prioritizing these capabilities in their model training process. Conceivably this could be the next big avenue for AI progress, tagging in if reinforcement learning &#8211; RL &#8211; starts to lose steam.</p><p>If hyperproductivity tools become easier to use, but remain limited to small, new projects, we should expect an explosion of those small projects. We&#8217;ll see less energy expended on getting things done in legacy applications.</p><p>Suppose, however, that the limitations I&#8217;ve suggested don&#8217;t pan out, and hyperproductivity turns out to be widely applicable, including for large, mature software engineering projects, and for tasks outside of engineering. What then?</p><h2>What Would Widespread Hyperproductivity Yield?</h2><p>If and when this new style of &#8220;compounding&#8221; work goes mainstream, I don&#8217;t think the immediate impact would be profound. It takes time to adapt workflows to take full advantage of a new technology. If Microsoft engineers can work ten times faster, Windows and Office wouldn&#8217;t be ten times better. Perhaps they could ship updates to Office ten times more often, but would that even be desirable?</p><p>We should look for a major impact when this approach rolls out within the labs that are developing cutting-edge AI. In the meantime, what we might most expect to see is a tsunami of fast-moving startups. The AI era was <em>already</em> having this effect, both because of the opportunity to build new AI-powered applications (like Cursor, a company which in three short years has achieved massive adoption and a $29 billion valuation), and because AI tools are goosing the productivity of nimble startup teams (see the story of Liu Xiaopai, above). Startups will find ways to bring the hyperproductive approach to an increasing number of domains, and the results will be dramatic. I&#8217;ll close with this quote from<a href="https://github.com/microsoft/amplifier/blob/main/AMPLIFIER_VISION.md"> The Amplifier Vision</a>:</p><blockquote><p><strong>The bottleneck isn&#8217;t AI capability; it&#8217;s human imagination in how to use it.</strong></p></blockquote><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/hyperproductivity?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/hyperproductivity?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Afra Wang, Amar Sood, Herbie Bradley, James Cham, Jesse Vincent, and Rohit Krishnan for suggestions and feedback (and to Abi for the post image).</em></p><div data-component-name="FragmentNodeToDOM"><h1>Appendix</h1></div><p>A few ideas that didn&#8217;t fit into the main piece:</p><p>1. Aggressive use of AI is, for the moment at least, expensive. Some hyperproductive teams are spending<a href="https://paradox921.medium.com/amplifier-notes-from-an-experiment-thats-starting-to-snowball-ef7df4ff8f97#:~:text=and%20reading%20docs.-,Not%20cheap,-.%20We%E2%80%99re%20pushing%20the"> thousands of dollars per day</a> to power their AI tools. In some cases they see this as worthwhile for its own sake. Sometimes they view it as an investment in understanding the future. But AI operating costs are notoriously plunging by 10x or more each year, and the range of applications for which this AI-intensive approach is cost-effective will widen rapidly.</p><p>2. <a href="https://tasklet.ai/">Tasklet</a> is a new service that builds AI agents on demand. &#8220;Just describe what you want in plain English, and Tasklet handles the rest.&#8221; Users can provide feedback to their agents, again in plain English, and the agent will update itself. There are parallels with the way the hyperproductive teams are working, though without the &#8220;compounding&#8221; aspect.<a href="https://www.cognitiverevolution.ai/always-bet-on-the-models-how-tasklet-puts-the-agency-in-agents-with-ceo-andrew-lee/"> An interview with Tasklet founder Andrew Lee</a> does provide hints that the team behind Tasklet is moving in the direction of hyperproductivity. For instance, they use Tasklet agents to test Tasklet.</p><p>3. Jesse Vincent<a href="https://blog.fsck.com/2025/10/09/superpowers/"> describes a particularly impressive use case</a>:</p><blockquote><p>I haven&#8217;t published all the skills Claude and I have built, because some of them are a little esoteric and some of the ones I&#8217;ve played with come from telling Claude &#8220;Here&#8217;s my copy of programming book. Please read the book and pull out reusable skills that weren&#8217;t obvious to you before you started reading&#8221; and I&#8217;m not sure how I feel about the IP aspect of that yet.</p><p>Stop and think about that last workflow a little bit. This is, I think, one of the foundational ideas in how all this works. You can hand a model a book or a document or a codebase and say &#8220;Read this. Think about it. Write down the new stuff you learned.&#8221; It does sometimes require helping the model look at the work through a specific lens (or a set of lenses). But it is insanely powerful. I&#8217;ve been experimenting with getting Claude to keep this idea in mind all the time, but I&#8217;m not yet happy with the output.</p></blockquote><p>4. The initial rush of bespoke software applications will be exhilarating, and will provide a productivity boost. But there will probably be diminishing returns. We won&#8217;t soon find ourselves writing 100 times more software and becoming 100 times more productive as a result.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Claude Code is a tool introduced by Anthropic earlier this year. It allows the Claude AI model to carry out extended projects on the user&#8217;s computer, working step by step and &#8211; ideally &#8211; noticing and correcting its own mistakes. Designed for software engineering tasks, it has proven useful for other sorts of work as well.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>I&#8217;m using &#8220;Claude&#8221; to refer to both the language model behind the Claude chatbot, and the Claude Code agent which uses that model.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Amplifier has 2500 stars and 184 forks on Github. I don&#8217;t know what this might imply about how many people are making serious use of it.</p></div></div>]]></content:encoded></item><item><title><![CDATA[A Coalition For The Future]]></title><description><![CDATA[If we can keep it]]></description><link>https://secondthoughts.ai/p/a-coalition-for-the-future</link><guid isPermaLink="false">https://secondthoughts.ai/p/a-coalition-for-the-future</guid><dc:creator><![CDATA[Anton Leicht]]></dc:creator><pubDate>Tue, 11 Nov 2025 22:01:28 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!VfhB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Here&#8217;s another guest post inspired by <a href="https://thecurve.goldengateinstitute.org/">The Curve</a>. This one is by Anton Leicht, a Visiting Scholar with the Carnegie Endowment&#8217;s Technology and International Affairs team. Anton writes about the political economy of AI progress on his excellent Substack, <a href="https://writing.antonleicht.me/">Threading the Needle</a>. He writes about the possibilities of cooperation that come into view at an event like The Curve, and the need to extend that spirit beyond the boundaries of the conference. (Note that the Golden Gate Institute for AI is not endorsing specific policy proposals.)</em></p><h1>A Coalition For The Future</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VfhB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VfhB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp 424w, https://substackcdn.com/image/fetch/$s_!VfhB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp 848w, https://substackcdn.com/image/fetch/$s_!VfhB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp 1272w, https://substackcdn.com/image/fetch/$s_!VfhB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VfhB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp" width="1232" height="928" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:928,&quot;width&quot;:1232,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1150440,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/178617868?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!VfhB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp 424w, https://substackcdn.com/image/fetch/$s_!VfhB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp 848w, https://substackcdn.com/image/fetch/$s_!VfhB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp 1272w, https://substackcdn.com/image/fetch/$s_!VfhB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Arriving in the Bay Area, it&#8217;s often hard not to feel like you&#8217;ve been missing out on something profound. It&#8217;s suggested to you in a myriad of ways that only here, you get to partake in some hidden conversation &#8211; about how technology, AI and thereby the world will unfold. And that only here, the true deliberations about the how and the why and the what-for can happen. <strong>Nowhere is that feeling stronger than at The Curve</strong> &#8211; a three-day conference bringing together leading figures in frontier AI: <a href="https://www.hyperdimensional.co/p/the-future-and-its-friends">policy thinkers</a> and lab researchers, safetyists and accelerationists, writers and readers, developers and <a href="https://x.com/deepfates/status/1975305177986179444">power users</a>. Everyone there has something interesting to say about AI, many carry real influence, some even outright power &#8211; and so it&#8217;s easy to feel the spark of &#8220;something happening&#8221;. This essay offers some reflections on how we could keep that spark.</p><p><strong>The mood at The Curve has a way of drawing you in. </strong>Many of my fellow attendees have written of their own experiences at this conference. Kylie Robison <a href="https://www.kyliebytes.com/week-two/">describes</a> checking into the conference venue &#8211; and Yoshua Bengio, godfather of AI, is next in line. Afra Wang <a href="https://substack.com/home/post/p-175596059?source=queue">mentions</a> looking across the table, seeing Ben Buchanan, President Biden&#8217;s former Special Advisor on AI, and surmising that this is &#8220;the room where it happens&#8221;. To me, even more striking than the makeup of the conversations was the common ground they repeatedly managed to find &#8211; between all corners of the frontier AI policy conversation, thoughtful exchange and meaningful progress felt genuinely possible. It&#8217;s the repeated exposure to moments like these that makes me share many attendees&#8217; appraisal: that <strong>The Curve catches lightning in a bottle, </strong>and it&#8217;s easy to walk away from it feeling optimistic about the state of AI policy.</p><h2>The Curve and the World Outside</h2><p>The AI world outside The Curve has gotten uglier since last year&#8217;s conference &#8211; since the state of play Dean Ball has dubbed the <a href="https://www.hyperdimensional.co/p/the-ai-republic-of-letters">AI Republic of Letters</a> last year. As a result, <strong>I found myself noticing quite a contrast between The Curve and the world outside</strong>. My piece the week before had been on the politically thorny and compelling issue of child safety &#8211; where I expressed my worry that harms to children were prone to be instrumentalised in ways that make the AI conversation worse, leading to solve-nothing policies designed to capitalise on grief and public attention. On the one hand, The Curve was a welcome reprieve from that: Attendees of course would disagree on whether the child safety issue was a big problem or which policy would be suitable to address it &#8211; but few were tempted to take cheap shots or easy political wins rather than furthering the substantive questions underneath.</p><p>On the other hand, <strong>I found myself wondering how The Curve&#8217;s sophistication might endure as the world outside grows grittier</strong>. The discourse outside the walls of Lighthaven, The Curve&#8217;s secluded Berkeley venue, cares less and less for nuance. It increasingly pulls toward discussing what plays best in 30-second clips of Senate hearings or topics that intersect most obviously with voters&#8217; anxieties come the midterms. Trends like these have often drowned out nuanced policy conversations on emerging technologies once they&#8217;ve become interesting to the mainstream and susceptible to political projection. In the face of that threat, The Curve &#8211; as a group of people united by a fundamental understanding of this technology and a desire to see its potential realized &#8211; faces a risk.<strong> </strong>It could get swept away by politics and break down entirely into the factions of the political fights to come: its safety advocates part of a broader pro-regulation coalition, its advocates for addressing current harms aligned with anti-tech sentiment, its pro-market and techno-optimist forces subscribed to naive accelerationism. <strong>That risk is real, but not insurmountable. </strong>We can still address it by consolidating around shared beliefs and commitments, and by negotiating eye-to-eye around our differences.</p><h2>Deals, Defections and Distractions</h2><p>This admittedly <strong>vague prescription becomes concrete in a policy question</strong> that dominated many backroom conversations at The Curve: <strong>Federal preemption</strong>. The <a href="https://www.hyperdimensional.co/p/be-it-enacted">issue</a> <a href="https://writing.antonleicht.me/p/a-preemption-deal-worth-making">on the</a> <a href="https://www.transformernews.ai/p/ai-deal-not-worth-making-preemption-legislation-regulation-safety">table</a> was a &#8220;grand bargain&#8221; across the breadth of The Curve&#8217;s attendees: the safetyist types get safety-focused federal regulation of frontier AI models, while the accelerationists get a broader federal preemption of the state-level laws they detest &#8211; in short, a deal setting up the necessary guardrails to let us speed ahead. In many ways, I feel like the debate around this proposal encapsulates the dilemma of The Curve.</p><p>On the one hand, I truly believe <strong>a deal like this is fundamentally <a href="https://writing.antonleicht.me/p/a-preemption-deal-worth-making">good</a>.</strong> It brings together the attendees&#8217; best impulses: their shared faith in the prosperity and progress that new technology can bring, their shared allegiance to a better future. A deal like this would create a cross-cutting coalition that opposed the political grifters and growing technosceptical forces. And for all the political uncertainties and coalitionary risks, I did not talk to many people at The Curve who would not put this deal on the books if they could. Facing the prospect of uglier AI politics from 2026 onward, a deal would let today&#8217;s incumbents establish a framework &#8211; while they still can.</p><p><strong>But internal conflicts threaten this way forward. </strong>The Curve is not made up of one tribe or one clique, and its attendees frequently find themselves on opposite ends of important policy discussions. Distrust runs deep: can we really trust the other side with a deal? Do they really want this, or are they just out to get us? Oh, but maybe we can get an <em>even better</em> outcome by pulling a fast one on the other camp in the process? It&#8217;s very hard to surmount instincts like these under ideal circumstances. And we&#8217;re far from ideal circumstances: The Curve&#8217;s attendees are a heterogenous mix, many of whom were on opposite sides in last year&#8217;s battle over California&#8217;s SB-1047. People remember the sometimes-ugly past policy fights. As their broader influence wanes, they sometimes take solace in the prospect of a fight they can win, even if it is only within a small corner of the broader conversation.</p><p><strong>Surmounting this, I believe, is a matter of recognizing the threat of mutual marginalization</strong>, and responding by joining forces wherever possible. Don&#8217;t mistake that advice for a sentimental sense of &#8220;everyone was so nice, and I wish they all worked together&#8221; &#8211; of course the world is more complicated than that. But the narcissism of small differences has a way to play tricks on your mind, and getting ahead of it begins with naming it and realizing its pervasive influence. Translated into policy terms, that means that &#8220;beating the accelerationists&#8221; or &#8220;beating the safetyists&#8221; respectively should not qualify for a primary policy objective for either side in the political fights to come. <strong>If you take away anything at all from this conference, it should be this: think twice before unleashing your PAC onto someone almost as optimistic about technology as yourself, or calling up the spirits of populist backlash against the few political elements that share your sense of AI&#8217;s transformative potential.</strong></p><h2>Back to Reality</h2><p>I finish this piece a few weeks after The Curve, after pieces of mine and others on the prospects of a deal, and many conversations and discussions. That allows me to end by asking: How is all that going? In the weeks since The Curve, we&#8217;ve seen the threat and promise of both paths ahead in equal part. A divisive public statement calling for the prohibition of superintelligence has confirmed many observers&#8217; biggest concerns about the safety movement; vociferous social media posts about Anthropic and California&#8217;s SB-53 leave just as many observers doubting the inclination of the accelerationist crowd to compromise. <strong>But we&#8217;ve also seen rare signs of interest in agreement and movement towards common ground. </strong>The White House&#8217;s Sriram Krishnan reduced the heat through a thoughtful post on his disagreements with the safety movement and ways to reduce them. Many safety advocates have engaged in kind. It&#8217;s to The Curve&#8217;s extraordinary credit that one of Krishnan&#8217;s asks was more conversations about AI scenarios between fast-timeline advocates on AI progress and a comparative skeptic like Sayash Kapoor &#8211; just what had happened at an in-depth three-hour session at the conference.</p><p>While I write these final lines, quite a lot is happening all at once. People from different camps are meeting and having conversations, in Berkeley and on Capitol Hill. Deals are being tested, options being scoped. And everyone is gearing up: coming up with favourable political wedges, funneling donations to candidates, arming and aiming their super-PACs. I am not certain which way all this will go. But I do know what I think we should do: try to find common ground rather than trying to win the fight within the walls of Lighthaven. I also know that The Curve is a force pulling us in the right direction: it encourages just the right conversations and often succeeds in getting just the right people in just the right room. <strong>If, by next year&#8217;s conference, we look back on this year as one of d&#233;tente and rapprochement, The Curve will have played a vital part. </strong>I, for one, will spend that year working to keep the conversations intact, the channels open, and our worst instincts at bay. I look forward to meeting again next year, believing, still, that we can get this right.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/a-coalition-for-the-future?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/a-coalition-for-the-future?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p><em>Thanks again to Anton Leicht for helping to place The Curve in a larger context. For more of Anton&#8217;s writing, check out his newsletter, <a href="https://writing.antonleicht.me/">Threading the Needle</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[A Project Is Not a Bundle of Tasks]]></title><description><![CDATA[Current AIs struggle to create a whole that exceeds the sum of its parts]]></description><link>https://secondthoughts.ai/p/a-project-is-not-a-bundle-of-tasks</link><guid isPermaLink="false">https://secondthoughts.ai/p/a-project-is-not-a-bundle-of-tasks</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Tue, 04 Nov 2025 00:33:14 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!_dVa!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_dVa!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_dVa!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp 424w, https://substackcdn.com/image/fetch/$s_!_dVa!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp 848w, https://substackcdn.com/image/fetch/$s_!_dVa!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp 1272w, https://substackcdn.com/image/fetch/$s_!_dVa!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_dVa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp" width="1232" height="928" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:928,&quot;width&quot;:1232,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!_dVa!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp 424w, https://substackcdn.com/image/fetch/$s_!_dVa!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp 848w, https://substackcdn.com/image/fetch/$s_!_dVa!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp 1272w, https://substackcdn.com/image/fetch/$s_!_dVa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Midjourney, <em>AI Struggling With Time Management</em>, 2025</figcaption></figure></div><p>Today, AIs can do some things. Eventually, they may be able to do everything. Forecasts generally project forward from what AI can do today, estimating how long it will take to fill in the gaps.</p><p>We often underestimate the size of those gaps. Some of this is just &#8220;out of sight, out of mind&#8221; (or, if you prefer, &#8220;availability bias&#8221;) &#8211; <a href="https://secondthoughts.ai/i/151564809/some-things-ai-cant-yet-do">it&#8217;s easier to notice the things AIs are doing than the things they aren&#8217;t</a>. But also, AIs have important gaps in fuzzy high-level skills that we don&#8217;t have great vocabulary for. These skills are applied diffusely across our workday instead of showing up as a crisp bullet on a todo list, making it easier to forget they exist.</p><p>In today&#8217;s post, I&#8217;m going to discuss some of the capabilities that AIs will need to acquire in order to progress from carrying out tasks to automating entire projects and jobs. I&#8217;ll also argue against some specific ideas that are sometimes presented in arguments for &#8220;short timelines&#8221; (AGI arriving within the next few years).</p><p>I&#8217;ll mostly be talking about software engineering. Automated coding is central to many scenarios of rapid AI progress, and it&#8217;s the domain I know best. But the principles I&#8217;m going to discuss apply to many fields.</p><h1><strong>Projects Don&#8217;t Decompose Into Tasks</strong></h1><p>We often analyze AI&#8217;s aptitude for a job by enumerating the tasks involved in that job. However, the whole is more than the sum of its parts. Breaking up a job (or a large project) into tasks is a useful mental scaffold, but it&#8217;s also an oversimplified way of thinking about things. The boundaries between tasks are not clean; information bleeds across. To quote from what I wrote in entry 24 of<a href="https://secondthoughts.ai/p/thoughts-about-agi-and-gpt-5"> 35 Thoughts About AGI and 1 About GPT-5</a>:</p><blockquote><p>...A large project does not neatly decompose into tidy subprojects.<br><br>For instance, when I tackle a subtask during a large software project, the result is not just that a certain chunk of the code gets written. I come out with a slightly deeper understanding of the problem. I may have learned new things about the existing codebase. I may have hit upon a handy trick for testing the kinds of operations that this code performs, or had an insight about the data being processed. I may have gotten some little nudge that will eventually accumulate with 20 other nudges across other portions of the project, eventually leading me to rethink my entire approach. If the subtask is assigned to a separate agent, whose memories are discarded as soon as the subtask is complete, none of that learning can take place.</p></blockquote><p>If we think of a job as just a collection of tasks, we miss something important. A recent article,<a href="https://www.worksinprogress.news/p/why-ai-isnt-replacing-radiologists"> AI isn&#8217;t replacing radiologists</a>, provides a nice illustration of this in another domain: a radiologist doesn&#8217;t just analyze images, they actually spend more time &#8220;on other activities, like talking to patients and fellow clinicians&#8221;.</p><h1><strong>If You Can Plan a Month, Can You Plan a Year?</strong></h1><p>To make concrete forecasts about the future, we tend to look for a graph that plots the thing we&#8217;re interested in, and extend the line. When discussing the future of AI, people often turn to this graph (I wrote an<a href="https://secondthoughts.ai/p/measuring-ai-progress"> entire post</a> about it):</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QZTh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QZTh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg 424w, https://substackcdn.com/image/fetch/$s_!QZTh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg 848w, https://substackcdn.com/image/fetch/$s_!QZTh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!QZTh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QZTh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg" width="1456" height="853" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:853,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!QZTh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg 424w, https://substackcdn.com/image/fetch/$s_!QZTh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg 848w, https://substackcdn.com/image/fetch/$s_!QZTh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!QZTh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><a href="https://metr.github.io/autonomy-evals-guide/gpt-5-report/">source: METR</a></figcaption></figure></div><p>Roughly speaking, the vertical axis shows the size of software engineering tasks an AI can complete on its own<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. One argument goes that AIs will be able to fully automate software engineering when they can independently handle large projects; people define &#8220;large&#8221; as anything from 1 month to 100 person-years or more. This seems pretty intuitive, and it has the nice property that it converts &#8220;fully automated software engineering&#8221; from an abstract concept into a point on this handy graph &#8211; a graph that exhibits a reassuringly predictable trend.</p><p>However, there are questions as to the nature of that trend. It appears to bend upwards near the end. Is that just a blip, or does it indicate that AI capabilities are accelerating? One argument is that as AIs acquire the skills to carry out intermediate projects, those same skills will enable them to carry out large projects. This would imply that while it may have taken a lot of R&amp;D for AIs to progress from 1-minute tasks to 2-minute tasks, or 1 hour to 2 hours, it will be comparatively easy to advance from 1 week to 2 weeks. And an AI that can carry out 1-month tasks may be ready to carry out projects of any size. Here&#8217;s one expression of this idea:</p><blockquote><p>...it seems like once you have the agency skills to make consistent progress on very hard tasks (e.g. via task decomposition, noticing mistakes and error correcting, planning), you can move through the increased horizons very quickly because you [are] solving tasks by essentially applying the same skills over and over<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>.</p></blockquote><p>I think this discounts the complexity of managing a large project. I&#8217;ll illustrate this by talking about two software engineering projects in which I played a leading role, a 1-person-year project and a (roughly) 100-person-year project.</p><h2>Launching Google Docs: 1 Person-Year</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nkE5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nkE5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png 424w, https://substackcdn.com/image/fetch/$s_!nkE5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png 848w, https://substackcdn.com/image/fetch/$s_!nkE5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png 1272w, https://substackcdn.com/image/fetch/$s_!nkE5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nkE5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png" width="1456" height="415" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:415,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!nkE5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png 424w, https://substackcdn.com/image/fetch/$s_!nkE5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png 848w, https://substackcdn.com/image/fetch/$s_!nkE5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png 1272w, https://substackcdn.com/image/fetch/$s_!nkE5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">I have no idea where this sample text comes from.</figcaption></figure></div><p>In 2005, my friend Sam Schillace had the idea that would eventually become Google Docs. With another friend (Claudia Carpenter), we spent about 4 months building an alpha version, which we released under the name &#8220;Writely&#8221;.</p><p>Those 4 months didn&#8217;t involve much big-picture thinking. I hacked together a server to store documents, Claudia designed and coded the toolbar and other UI elements, Sam built the web page that wired it all together. Problems arose; we fixed them. Everything was done seat-of-the-pants, we addressed problems as they came up, a lot of cans were kicked down the road. This is not to say that the work was easy: I had to come up with a way to synchronize changes when multiple people were editing the same document; Sam and Claudia had to wrestle with cutting-edge browser features that were often buggy and poorly documented. But it was all kind of &#8220;small ball&#8221;: it wasn&#8217;t hard to keep the entire system in our heads, and we didn&#8217;t worry about the long-term consequences of any decisions we were making.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><h2>Building Scalyr: 100 Person-Years</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!n5ws!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!n5ws!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png 424w, https://substackcdn.com/image/fetch/$s_!n5ws!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png 848w, https://substackcdn.com/image/fetch/$s_!n5ws!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png 1272w, https://substackcdn.com/image/fetch/$s_!n5ws!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!n5ws!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png" width="1280" height="720" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:720,&quot;width&quot;:1280,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!n5ws!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png 424w, https://substackcdn.com/image/fetch/$s_!n5ws!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png 848w, https://substackcdn.com/image/fetch/$s_!n5ws!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png 1272w, https://substackcdn.com/image/fetch/$s_!n5ws!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In 2011, I founded another startup, Scalyr. A team that grew to dozens of engineers spent 10 years building a system that could analyze vast quantities of log data to troubleshoot problems in complex web applications.</p><p>This was an entirely different kettle of fish from Writely, and my 2005 self would never have pulled it off. I relied on lessons I&#8217;d learned by fumbling my way through large-scale projects at Google (after Writely was acquired), as well as skills acquired on the job at Scalyr. Building Scalyr involved things such as:</p><ol><li><p><strong>Finding systematic solutions to operational issues</strong>. Scalyr was a complex application, relying on coordinated activity across thousands of servers. Fixing problems one-by-one would have been unsustainable. To keep things on an even keel, it was necessary to identify patterns of behavior, and find a single change that would eliminate an entire category of problems at once. Furthermore, each change had to be something that could be accomplished with minimal effort or disruption. To play my part, I had to lean on years of general experience in sniffing out software bugs, further years of specific experience with the system we built at Scalyr (the sort of &#8220;continuous learning&#8221; that current AIs don&#8217;t do), and hard-won judgement as to which changes would be safe and tractable.</p></li><li><p><strong>Strategic planning and execution</strong>. Midway through building the company, we encountered major setbacks &#8211; our largest customer abruptly departed, and we were struggling to land new clients. We needed to find a new course. After months of discussion, we decided that our core competitive advantage lay in something that had been a minor detail in our original product conference; processing large volumes of data at high speed and low cost. We then planned and executed a multi-year effort to expand on that advantage. A few skills required here: figuring out which parts of our complex server architecture needed to change; breaking those changes down into incremental steps that wouldn&#8217;t be too complex for the team to handle; identifying the critical assumptions in our plan, and finding ways to validate those assumptions early; spotting problems early and correcting them with minimal disruption to the plan.</p></li><li><p><strong>Developing techniques to diagnose subtle performance issues</strong>. Our performance goals required that in a fraction of a second, a task could be distributed across thousands of servers, carried out, and the results consolidated &#8211; efficiently flowing alongside other tasks. Seemingly minor changes to the code could disrupt the smooth flow and destroy our competitive advantage. To solve such problems, we had to find ways to record enough data to allow us to analyze task flow, without recording so much data as to be unmanageable. This required high-level judgement in identifying the critical hinge points in the system, and only collecting data at those critical points.</p></li></ol><p>(I also shared some thoughts about the different skills involved in small vs. large projects in<a href="https://secondthoughts.ai/p/thoughts-about-agi-and-gpt-5"> 35 Thoughts About AGI and 1 About GPT-5</a> &#8211; entries 19 through 25.)</p><h1><strong>What Got AI Here Won&#8217;t Get It There</strong></h1><p>As Claude puts it:</p><blockquote><p>&#8220;What Got You Here Won&#8217;t Get You There&#8221; is a concept popularized by executive coach Marshall Goldsmith in his book of the same name. The core idea is that <strong>the skills, habits, and behaviors that lead to success at one level often become obstacles at the next level</strong>.</p></blockquote><p>My personal experience as a software engineer supports this idea. The &#8220;agency skills&#8221; that were sufficient banging out the original Writely prototype would have been utterly inadequate for managing the first 100 person-years of Scalyr, and indeed would often have gotten in the way. I expect that this generalizes, and the level of skill at task decomposition, error correction, and planning that will allow an AI to navigate 1-year projects will not suffice for larger projects.</p><p>Nor is the difference just about generic &#8220;agency skills&#8221;. Acting as a senior engineer at Scalyr required a lot of very specific, advanced domain skills that only come after years of experience. For instance, learning to look at a report of the last month&#8217;s operational issues and spot a pattern that will allow an entire category of problems to be stamped out through a single change. These sorts of skills are generally not necessary for smaller projects, <em>nor does experience at smaller projects necessarily help develop them</em>. Thus, when AI systems master 1-year projects, I think that larger projects will still challenge them.</p><h1><strong>Large-Scale Projects Stress Deep Cognitive Skills</strong></h1><p>Large-scale projects don&#8217;t just require project management skills and advanced domain knowledge. They lean on a cluster of cognitive skills that are lacking in current AIs:</p><p><strong>Context management</strong>. The bigger the project, the more details you need to keep in mind. This is especially true for big-picture planning, but it even applies when working on specific tasks. It&#8217;s important to consider the relevant details &#8211; which means you need to be able to pick those details out from the larger mass of information that a large project generates (entry 30 of<a href="https://secondthoughts.ai/p/thoughts-about-agi-and-gpt-5"> 35 Thoughts</a>).</p><p><strong>Continuous learning and adaptability</strong>. Seasoned veterans on long-running projects are valuable in part because they have honed bespoke skills specifically adapted to that project. They have become expert at troubleshooting the exact sorts of problems that come up, and they automatically write code that fits with the project&#8217;s specific design choices.</p><p><strong>Metacognition</strong>. On a small project, you just dive in and get it done. On a large project, it&#8217;s worth your while to analyze what is and isn&#8217;t working, where you (or your team) tend to get stuck, what sorts of techniques work well, and where you need to develop new skills.</p><p>Nathan Lambert<a href="https://www.interconnects.ai/p/burning-out/comments"> recently touched on all of this:</a></p><blockquote><p><strong>The amount of context that you need to keep in your brain to perform well in many LM training contexts is ever increasing.</strong> For example, leading post-training pipelines around the launch of ChatGPT looked like two or maybe three well separated training stages. Now there are tons of checkpoints flying around getting merged, sequenced, and chopped apart in part of the final project. Processes that used to be managed by one or two people now have teams coordinating many data and algorithmic efforts that are trying to land in just a few models a year. <strong>I&#8217;ve personally transitioned from a normal researcher to something like a tech lead who is always trying to predict blockers before they come up (at any point in the post-training process) and get resources to fix them. </strong>I bounce in and out of problems to wherever the most risk is.</p><p>[emphasis added]</p></blockquote><h1><strong>We Won&#8217;t Have Automated Software Engineering Until AIs are Leading 100-Person-Year Projects</strong></h1><p>Impactful software engineering projects are usually large, complicated beasts &#8211; if not at first (Writely), then as they grow and mature (Google Docs). Certainly there are complex projects taking place inside the big AI labs. Analysis based on the automation of specific tasks misses the forest for the trees. Software engineering will not be fully automated until AIs can handle projects of the 100-person-year scale (or more). How long until that happens?</p><p>It&#8217;s impossible to predict with any certainty. But we can take a stab. The graph I presented at the top of this post shows current AI models achieving a 50% success rate at tasks that take about two person-hours. The <a href="https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/">paper where that graph originated</a> finds that the manageable task size doubles every 7 months. It would take about 16 doublings to get from two person-hours to 100 person-years<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>; at 7 months per doubling, that puts us around the beginning of 2035. Advancing from a 50% success rate to full competence might take a few more years, putting automated software engineering in the late 2030s. <a href="https://secondthoughts.ai/i/162001832/how-applicable-to-the-real-world-are-these-results">Adjusting for real-world conditions</a> might push the schedule out further.</p><p>The same paper finds preliminary evidence that the doubling time might now be shrinking to just 4 months, which (if we discount adjustments for real-world conditions) would imply fully automated software engineering in the early 2030s. That would be startlingly fast in terms of its impact on the world, but considerably slower than some scenarios for AI progress. Those aggressive scenarios generally assume that the rate at which AI climbs the ladder of project scope will continue to accelerate, and/or that the task scope threshold for full automation is far lower than 100 person-years. In this post, I&#8217;ve explained why I don&#8217;t agree.</p><p>These estimates are balanced on the slender reed of a single graph from a single study. We shouldn&#8217;t take any of them too seriously. But that graph is the single best tool we currently have for answering a critical question about the likely pace of AI progress. And I think readings of that graph that point toward rapid automation neglect the full complexity of software engineering.</p><p>The most likely scenario for faster progress would involve unexpected breakthroughs, either in the design of AI models, or in the way we apply them to the task of large-scale software development (perhaps we will find ways to use AI&#8217;s superhuman strengths to better compensate for its weaknesses). Breakthroughs, of course, are hard to predict &#8211; and hard to rule out.</p><p>Sam Altman <a href="https://x.com/sama/status/1983584366547829073">recently announced</a> that OpenAI has an internal goal to create &#8220;a true automated AI researcher by March of 2028&#8221; &#8211; just 2.5 years from now. Jakub Pachocki, OpenAI&#8217;s chief scientist, <a href="https://techcrunch.com/2025/10/28/sam-altman-says-openai-will-have-a-legitimate-ai-researcher-by-2028/">made clear this means</a> a &#8220;system capable of autonomously delivering on larger research projects.&#8221; This seems unambiguously inconsistent with the observed trend of progress in AI capabilities. That doesn&#8217;t mean it can&#8217;t happen! But it can&#8217;t happen without a major breakthrough<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a>.</p><p>In the meantime, of course, we will have <em>partial</em> automation; code-authoring tools are already providing value, though <a href="https://secondthoughts.ai/p/ai-coding-slowdown">the benefits may often be overestimated</a>. There are (<a href="https://x.com/RyanPGreenblatt/status/1981012208332280219">disputed</a>) reports that AI tools are already writing 90% of code at one frontier AI developer. If partial automation can drastically improve the productivity of engineering teams, that may have a major impact on AI timelines, not to mention the job market for software engineers, and the tech industry in general. I am also somewhat bearish on drastic effects from partial automation, but that&#8217;s a subject for another day.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/a-project-is-not-a-bundle-of-tasks?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/a-project-is-not-a-bundle-of-tasks?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Abi Olvera, Eli Lifland, James Cham, John Hart, Michael Chen, Nate Rush, and Taren Stinebrickner-Kauffman for feedback (and to Abi for the post image).</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Actually, the graph shows the size of tasks for which the success rate is 50%. The size of tasks which AIs can <em>reliably</em> carry out is much less. Also note that the measurements are specifically for software engineering-style tasks that can be graded automatically (e.g. &#8220;does this code pass tests&#8221;, not &#8220;is it easy to understand and consistent with style guidelines&#8221;), and the time scale is calibrated against human engineers who are not familiar with the specific codebase the task applies to.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>From an unpublished draft of a forecast I was asked to review.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p> Assuming a 2000-hour work year.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p> It&#8217;s certainly possible that we could see one-off examples of an AI system carrying out something that could be characterized as a &#8220;research project&#8221;, just as Google DeepMind&#8217;s AlphaEvolve system has already delivered a scattering of incremental advances in areas relevant to AI, such as matrix multiplication algorithms. But it would be far off-trend to see anything in March 2028 that can independently carry out &#8220;larger research projects&#8221; at anything like the capability of a human researcher, with any generality, and at anything approaching a competitive cost.</p></div></div>]]></content:encoded></item><item><title><![CDATA[What I Saw Around The Curve]]></title><description><![CDATA[Notes from the near-future of AI]]></description><link>https://secondthoughts.ai/p/what-i-saw-around-the-curve</link><guid isPermaLink="false">https://secondthoughts.ai/p/what-i-saw-around-the-curve</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Wed, 29 Oct 2025 23:49:51 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/207d63fb-e61e-4669-be19-de214cdcea1a_1232x928.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>After <a href="https://thecurve.goldengateinstitute.org/">The Curve</a>, Eli Pariser wrote an extensive account of things he learned at the conference. Full of insights, it&#8217;s a fascinating glimpse into the conversations that arise when a few hundred insiders from across the AI landscape get together. He gave permission for us to share it with you.</em></p><p><em>Eli is author of The Filter Bubble and Co-Director at New_ Public, a nonprofit product studio focused on reimagining community, connection, and conversation online. You can follow their biweekly newsletter here: <a href="https://newpublic.substack.com/">newpublic.substack.com</a>.</em></p><p><em>Coming up next, I&#8217;ll be posting <strong>A Project is Not a Sequence of Tasks</strong> (working title) &#8211; some musings on the path to full automation of software engineering, drawing on my own 40-year career writing code.</em></p><p><em>And now, over to Eli!</em></p><h2><strong>What I saw around The Curve</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!WBmn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!WBmn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!WBmn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!WBmn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!WBmn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!WBmn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg" width="800" height="533" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:533,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;20251003 The Curve Berkeley-242.jpg&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="20251003 The Curve Berkeley-242.jpg" title="20251003 The Curve Berkeley-242.jpg" srcset="https://substackcdn.com/image/fetch/$s_!WBmn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!WBmn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!WBmn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!WBmn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I attended <em>The Curve</em>, a conference of ~350 top AI lab leaders and scientists, safety activists and alarmists, political advisors and lobbyists, journalists, and civil society leaders. The event took place in Lighthaven, a quirky cluster of houses in Berkeley that have been retrofitted into small conference spaces. Many sessions were in living room-sized spaces, so it was all very personable and informal &#8212; an impressive feat by the organizing team.</p><h4>The vibe: Convivial, affable, and slightly doomy</h4><p>Many participants seemed thrilled, awed, and also deeply worried. Someone described it as &#8220;snorting pure San Francisco.&#8221; At 45 I was one of the older participants &#8212; the average age was mid-30s, but many of the most powerful people were in their late 20s. Pretty much every single person I talked to was smart, relatively humble, and pleasant. You couldn&#8217;t throw a rock without hitting someone with a large Substack following.</p><p>(Some sessions were on the record, some off, thus the vague sourcing in some places below.)</p><h4>&#8220;It will accelerate&#8221;</h4><p>There was broad consensus that the pace of progress in AI models will continue to accelerate, though lots of debate about how quickly. A senior lab official said that 90% of the code in one of the frontier labs is now written by AI models (!!), and that they&#8217;re already replacing some entry-level technical jobs with automated agents.</p><p>(Another person told me that this was a bit of a canard &#8212; these are fairly rigid, well-specified jobs and the labs obviously have the most expertise in how to fine-tune models to meet their own specific needs &#8212; so they have an unrealistic sense of how hard it will be to apply AI this extensively outside their own walls.)</p><p>An official from another lab pointed out that the scope of tasks chatbots can handle autonomously keeps rising<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!p_LT!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!p_LT!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png 424w, https://substackcdn.com/image/fetch/$s_!p_LT!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png 848w, https://substackcdn.com/image/fetch/$s_!p_LT!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png 1272w, https://substackcdn.com/image/fetch/$s_!p_LT!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!p_LT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png" width="1300" height="776" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:776,&quot;width&quot;:1300,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Length of asks AIs can do is doubling every 7 months&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Length of asks AIs can do is doubling every 7 months" title="Length of asks AIs can do is doubling every 7 months" srcset="https://substackcdn.com/image/fetch/$s_!p_LT!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png 424w, https://substackcdn.com/image/fetch/$s_!p_LT!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png 848w, https://substackcdn.com/image/fetch/$s_!p_LT!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png 1272w, https://substackcdn.com/image/fetch/$s_!p_LT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><blockquote></blockquote><p>There was a general sense that &#8220;recursive self-improvement,&#8221; where labs build automated AI researchers that develop better AI models that build better researchers, is still some time away &#8212; possibly just a few years, possibly much longer.</p><p>The general lab belief here (which is a bit maximalist), as one person described it, is that there are no barriers on the path to true human-level general intelligence (AGI):</p><ul><li><p>&#8220;Capabilities based on current methods will continue to improve smoothly&#8221;</p></li><li><p>&#8220;New methods will continue to arrive and accelerate progress&#8221;</p></li><li><p>&#8220;Compute development will accelerate and compute bottlenecks will be overcome&#8221;</p></li></ul><h4>&#8220;Normal technologists&#8221; and those worried about accelerating existential risk agree that it&#8217;s likely there will be rapid near-term acceleration</h4><p>One of the big debates in AI is whether this is a &#8220;normal technology&#8221; &#8212; whose progress will be subject to friction and delay as it is adopted by human social systems &#8212; or it&#8217;s going to start improving itself exponentially and &#8220;take off&#8221;. Generally the &#8220;takeoff&#8221; people don&#8217;t think this will end well<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>.</p><p>There was a great session where one of the originators of the Normal Technology hypothesis (Sayash Kapoor) and the lead author of the <em>AI 2027</em> paper talked about what they agree on. There was broad agreement from both that, in the short term, AI<em> may</em> advance rapidly:</p><ul><li><p>The size of coding tasks which AI systems can carry out independently may quadruple within a year</p></li><li><p>Revenue in 2026 will continue to grow exponentially &#8212; perhaps reaching $60-70B / year</p></li></ul><p>They did not agree on recursive self-improvement, however &#8212; the Normal Technology team thinks that&#8217;s a long way off, if it ever happens. Both sides generally agreed that there&#8217;s a big gulf between rapid progress in a lab and deployment in the real world.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!BPdY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BPdY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!BPdY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!BPdY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!BPdY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BPdY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg" width="800" height="533" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:533,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;20251003 The Curve Berkeley-55.jpg&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="20251003 The Curve Berkeley-55.jpg" title="20251003 The Curve Berkeley-55.jpg" srcset="https://substackcdn.com/image/fetch/$s_!BPdY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!BPdY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!BPdY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!BPdY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Not every big idea was presented on stage. Some began in moments like this.</figcaption></figure></div><p>My sense is that Normal Technologists anticipate the speed of change to be substantially higher than people who aren&#8217;t following AI closely &#8212; and that the Normal Technology view is often misunderstood as predicting slower progress than its authors actually expect.</p><h4>But&#8230; the frontier may remain &#8220;jagged&#8221;</h4><p>AI policy leader Helen Toner gave a great talk pointing out that right now, AI capability is &#8220;jagged&#8221; &#8212; just because LLMs can do some things doesn&#8217;t mean that they can perform other tasks of seemingly equal difficulty.</p><p>She suggested that this may be a permanent state of affairs, because:</p><ul><li><p>It is easier to check whether an AI&#8217;s work is correct for some tasks than others</p></li><li><p>For many tasks it&#8217;s difficult to encapsulate all of the necessary context in a way that&#8217;s accessible to a chatbot</p></li><li><p>The level of adversariality of tasks will differ, and working against adversaries is a different kind of problem</p></li><li><p>Some tasks that seem cognitive will continue to require embodiment (Toner used the example of deciding on an event venue).</p></li><li><p>Different combinations of capacities have different implications for who benefits, who loses, who has power, and who is provoked</p></li></ul><h4>The emerging politics are very weird</h4><p>On the one hand, there&#8217;s a relatively bipartisan elite consensus.</p><p>The general sense was that there&#8217;s considerable continuity between Biden and Trump AI policies. There was a session between high-level Biden and Trump AI advisors that I didn&#8217;t see much of, but it was described as largely aligned and convivial.</p><p>Rep. Sam Liccardo, Congressman from the San Francisco Peninsula, who was interviewed at the conference, said he expected some kind of federal preemption to pass &#8212; that colleagues in tough races probably wouldn&#8217;t vote for it, but everyone else will. He described elite political opinion as being focused on a mix of China, jobs, cybersecurity, and bioterrorism.</p><p>On the other hand, there&#8217;s a kind of cross-partisan populist angst.</p><p>The most packed session of the entire conference was Taren Stinebrickner-Kauffman interviewing Joe Allen, a Steve Bannon-aligned figure who is strongly anti-transhumanist<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>, anti-surveillance, and largely anti-AI from a somewhat apocalyptic Christian point of view.</p><p>Meanwhile, union leaders, who are already very skeptical about AI, are getting pushback from rank-and-file members for engaging with AI or AI labs at all. And more broadly, Americans across the political spectrum seem to clock that this is unlikely to be good for them or their jobs.</p><p>It also seemed clear that there&#8217;s going to be a lot of money spent in a battle between AI safety and AI accelerationist forces. Many of the newly minted centimillionaires and billionaires of the AI age are deeply concerned about runaway AI creating catastrophic risks and are pouring large sums into safety efforts. In contrast, Marc Andreessen, OpenAI President Greg Brockman, and others are launching a <a href="https://fortune.com/2025/08/26/openai-president-greg-brockman-andreessen-horowitz-super-pac-ai-pro-innovation/">$100M pro-AI, anti-regulation Super PAC</a>. It sure looks like this is going to make the wars over crypto look like a drop in the bucket.</p><p>In this context, it was interesting to note that while concerns about work and job displacement are a major part of the mix, there&#8217;s a lot of energy around regulating how predatory chatbots interact with children. The organizer of a right-leaning AI safety organization told me she had tested a wide range of messages. Worries about sexually predatory chatbots forming relationships with kids, as well as driving them to suicide, were by far the most energizing.</p><p>I also attended a session by the Future of Life Institute&#8217;s Hamza Chaudhry, who is thinking very seriously about how chatbots will be weaponized to infiltrate, persuade, and create threats to national security infrastructure.</p><h4>The labs think they will be toxic soon</h4><p>There was general agreement among many people both inside and outside the labs that they&#8217;re likely to be broadly hated &#8212; as much as, if not more than, social media companies are today.</p><p>Anthropic is clearly trying to position itself as &#8220;the good one.&#8221; Its business model is focused more on code-writing and revenue from enterprise customers, which they see as shielding them from some of the worst consumer pressures. They tend to style themselves as a kind of &#8220;rebel alliance.&#8221;</p><p>There was also a lot of embarrassment, from people both inside and outside OpenAI, about Sora 2 and the general sense that they&#8217;re slipping into doing &#8220;stupid addictive social media stuff.&#8221;</p><h4>US versus China is very much a thing &#8212; but we may have China&#8217;s strategy wrong</h4><p>Generally, the labs all believe they are defenders of democracy &#8212; and of &#8220;democratic AI&#8221; &#8212; against China. This was a very lively conversation and a major motivator for acceleration among U.S. AI experts, especially around national defense and cybersecurity.</p><p>Rep. Liccardo&#8217;s position: If we lose this race, China writes the rules.</p><p>However, there was an undercurrent of concern that we may be reading China&#8217;s AI strategy wrong. One participant told me that the vast majority of China&#8217;s efforts are focused on &#8220;embodied AI&#8221; &#8212; i.e. robots &#8212; because that&#8217;s where the most pragmatic economic growth opportunities are. It&#8217;s not at all clear that China&#8217;s industrial policy around AI is aimed at developing super-intelligence.</p><p>There was also discussion of how China is positioning itself as <em>anti-imperial-America</em> in Africa and other regions, offering support to build sovereign AI systems and models.</p><h4>Computing capacity seems likely to remain a chokepoint in the near future</h4><p>There was general agreement that access to data centers is going to be a key source of AI power &#8212; not just because that&#8217;s how new models are built, but because it&#8217;s what&#8217;s needed to run them at scale.</p><p>An OpenAI official described how &#8220;inference-time compute&#8221; is scaling intelligence &#8212; meaning that if you throw more processing power at a model to let it &#8220;think&#8221; longer, you get dramatically better results.</p><p>In other words, if you want to slow down development and use of AI, you either have to slow down the supply of AI chips or slow down the energy they need.</p><h4>There&#8217;s a lot of uncertainty about when and how job displacement might hit</h4><p>It was a majority opinion that entry-level jobs are already being displaced. One of the authors of a well-known paper on this, <em><a href="https://digitaleconomy.stanford.edu/publications/canaries-in-the-coal-mine/">Canaries in the Coal Mine</a>,</em> was in attendance. He&#8217;s finding that there&#8217;s a reduction of employment in entry-level jobs specifically in fields predicted to be exposed to AI automation.</p><p>There was also fairly broad consensus that there will be significant disruptions in jobs over the next 3-5 years &#8212; but with big error bars. Even the <em>AI 2027</em> team was not confident that AI would contribute 1% of GDP by 2028.</p><p>An anecdote that&#8217;s often used in this vein is that there are actually more bank tellers now than there were in 1980, when ATMs rolled out. But, as Mike Kubzansky from Omidyar pointed out, that story is incomplete &#8212; it was true up until 2008, when mobile banking rolled out. After that, the numbers collapsed.</p><h4>Some interesting ideas about how AI might aid democracy</h4><p>There was an insightful panel on how AI might support a healthier democracy.</p><p>One idea focused on<strong> digital human rights shields</strong> &#8212; one of the big labs worked with human rights lawyers in the Philippines, where 50% of staff face murder threats, to create an early warning system that detects chains of events leading to killings, successfully protecting targeted individuals.</p><p>Another example was<strong> anti-corruption tech in Ukraine,</strong> where AI tools are used for direct government contracting that is &#8220;disintermediated&#8221; from corrupt practices, saving millions and creating public trust while reducing opportunities for foreign influence.</p><p>A third idea involved<strong> public health data </strong>&#8212; with the erosion of federal public health infrastructure, AI tools can provide high-quality medical data, precedents, and playbooks for local and state agencies dealing with health crises or outbreaks.</p><h4>We&#8217;re in the early days of AI privacy, security, and influence concerns</h4><p>A number of researchers I spoke with described how challenging it will be to have AI systems engaging with other systems without revealing private information from their owners. It&#8217;s not that hard to get AIs either to explicitly share this information &#8212; or to accidentally reveal what they &#8220;know.&#8221;</p><p>There were similar worries around security and influence. A security expert pointed out that even novice hackers &#8212; still affectionately known as &#8220;script kiddies,&#8221; a term coined in the 1990s &#8212; now have expert-level AI assistance to write scripts and make trouble. This allows for much more sophisticated attacks to be launched by much less sophisticated actors.</p><p>A researcher studying foreign influence operations said essentially the same thing: with minimal resources, no human agents, and limited language or technical capacity, it&#8217;s now possible to run a fairly sophisticated influence campaign.</p><h4>&#8220;Cognitive shields&#8221; seem like a good idea, but not something anyone&#8217;s working on per se</h4><p>I floated an idea that my old friend and mentor Wes Boyd has been raising: using AI to shield against manipulation and influence. You could imagine having a personal AI that&#8217;s accountable to your more aspirational self &#8212; one that nudges you away from harmful distractions and toward wellbeing and growth-oriented activities. There was generally a lot of interest and appreciation for this idea among the people I raised it with, but no one seemed to be actively working on anything like it yet.</p><h4>Sycophancy was a big topic &#8212; and might be an artifact of human reinforcement learning</h4><p>The tendency of chatbots to tell people what they want to hear was a big topic.</p><p>This was already on my mind because of a recent study showing that people generally not only prefer sycophantic chatbots but often see them as less biased &#8212; and that only people who score high on &#8220;openness&#8221; psychologically prefer chatbots that disagree with them.</p><p>There&#8217;s some interesting speculation that chatbots&#8217; tendency towards sycophancy is the result of human feedback gone awry. When people rate chatbot responses with a thumbs up, it&#8217;s often because they like what they&#8217;re hearing, not because it&#8217;s correct. How to solve this is complicated.</p><h4>Some interesting future provocations from people at the labs</h4><p>A few talks I attended featured people from the big commercial AI labs speculating about what this all means for society, politics, and beyond. I found many of the ideas interesting &#8212; but also ungrounded in any general sense of social dynamics, politics, or history. That said, here are a few that stood out:</p><ul><li><p>&#8220;We&#8217;re moving from an attention economy to an attachment economy.&#8221; Peak attention has come and gone; now we&#8217;re moving to attachment as the scarce resource.</p></li><li><p>Something that looks like gaming will increasingly replace what we now think of as work.</p></li><li><p>All types of relationships with AI will be normalized, including spiritual, romantic, and sexual relationships. Some people will want to give AI personas personhood and maybe even rights, while others will strongly oppose this.</p></li><li><p>People will slice, dice, license and monetize their identities in a whole series of new ways (this is something I buy for sure.) They will create a digital &#8220;twin&#8221; of their consciousness and replicate it.</p></li><li><p>There&#8217;s a coming fight between transhumanists and humanists/&#8220;mortalists&#8221; &#8212; those who want to merge with technology and pursue immortality versus those who remain rooted in embodied humanity. This will likely take on political and geopolitical dimensions.</p></li><li><p>The AI agent economy &#8212; with agents taking on increasingly complex, autonomous tasks and making money &#8212; will induce &#8220;endgame financialization.&#8221;</p></li><li><p>Access to computing power and AI models will become essential and even existential for nations.</p></li></ul><p>One especially vivid talk came from Josh Achiam, Head of Mission Alignment at OpenAI, who offered a rough sketch of the next century:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!BjO9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BjO9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png 424w, https://substackcdn.com/image/fetch/$s_!BjO9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png 848w, https://substackcdn.com/image/fetch/$s_!BjO9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png 1272w, https://substackcdn.com/image/fetch/$s_!BjO9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BjO9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png" width="1456" height="861" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:861,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!BjO9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png 424w, https://substackcdn.com/image/fetch/$s_!BjO9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png 848w, https://substackcdn.com/image/fetch/$s_!BjO9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png 1272w, https://substackcdn.com/image/fetch/$s_!BjO9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h4>A big theme: Determinism vs. non-determinism</h4><p>Many of the most intense AI safety voices seemed to have a pretty deterministic view of the world. One person speculated that this is because they tend to come out of the AI/ML world itself where math and physics are primary disciplines. As Helen Toner pointed out, they tend to make linear or exponential predictions about the future.</p><p>One of the most interesting sessions I watched was a debate between Buck Shlegeris, an AI safety advocate who seemed pretty alarmed about the potential for catastrophic risk &#8212; and journalist Timothy B. Lee. Tim patiently and compellingly poked holes in the premise that intelligence alone is sufficient for a super-intelligence to take over the world, arguing that because the world is chaotic and nonlinear, it&#8217;s not that easy &#8212; especially since there will be other, perhaps slightly less intelligent but still vigilant AIs looking to stop them from doing whatever they&#8217;re doing.</p><p>There was a strong non-deterministic crew at the conference as well &#8212; Toner and some of the people from the big labs who think the world is more like fluid dynamics &#8212; impossible to perfectly (or often even imperfectly) predict. I find myself much more sympathetic to this way of looking at things.</p><h4>My grand unified theory: Focus on aligning AI societies over AI models</h4><p>AI researchers spend a lot of time thinking about the &#8220;alignment problem&#8221; &#8212; essentially, how do we get AI models to do what we want in ways that are not harmful? A huge amount of the labs&#8217; focus is on this question.</p><p>Over the course of the conference I came to feel that this was the wrong way of looking at things.</p><p>In part, that&#8217;s because I think what we&#8217;re headed toward in the near term is a world of billions or trillions of different models with their own goals, context, and learning abilities, bumping up against each other, coordinating, collaborating, and fighting with each other. That will create complexity and emergent behavior, just like every other system with billions or trillions of actors. It looks more like a society &#8212; or a biological ecosystem &#8212; than a singular &#8220;mind&#8221; to shape.</p><p>In that context, I don&#8217;t think aligning individual models is the most important thing. It&#8217;s sort of like making a society good or producing wellbeing by making each individual good. That fails because &#8220;good&#8221; is situational (&#8220;Thou shalt not kill&#8221; is a good guideline, but not always applicable), and situations are shaped by the dynamics of other actors.</p><p>So, yes, moral education is important, but in a society you also want to think about teachers, social workers, police forces, markets, and governance to help regulate behavior and solve problems.</p><p>I suspect the same thing is true of the society of agents we&#8217;re walking into &#8212; the best way to advance the benefits and reduce the harms of an AI-agent-driven world might be to focus on deploying these kinds of capacities, rather than just focusing on alignment of the individual agents.</p><p>I&#8217;ll admit that this is me taking the frame I like (socio-political) and applying it to this field. But I bounced the idea off a wide variety of people inside and outside AI alignment and AI safety and generally feel like there&#8217;s some validity and merit to it.</p><h4>A need for spiritual voices</h4><p>Rep. Liccardo, among many others, pointed out that at the end of the day, many of the questions all of this raises are spiritual &#8212; and that we&#8217;ll need spiritual voices to make sense of it. I think that&#8217;s part of why Joe Allen resonated so much &#8212; he was speaking in a moral and spiritual tone that few others did.</p><h4>The wildest business ever made</h4><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!juK9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!juK9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!juK9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!juK9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!juK9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!juK9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg" width="800" height="533" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:533,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;20251003 The Curve Berkeley-301.jpg&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="20251003 The Curve Berkeley-301.jpg" title="20251003 The Curve Berkeley-301.jpg" srcset="https://substackcdn.com/image/fetch/$s_!juK9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!juK9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!juK9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!juK9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The Curve closed with a full house for Jack Clark, Co-Founder of Anthropic.</figcaption></figure></div><p>In the closing session, Anthropic co-founder Jack Clark (who is a wry, likable and humble-seeming British guy) described the situation (possibly with the help of Claude) as being like a kid who wakes up in the night fearing that dark shapes are monsters, turns on the lights&#8230; and they are indeed monsters. He said he was afraid of what was coming from scheming, self-improving models, and made a call for &#8220;keeping the lights on.&#8221;</p><p>The first question after he spoke was, reasonably, &#8220;if you&#8217;re so afraid, why are you building this company and these monsters.&#8221;</p><p>Clark basically responded that Anthropic believes it needs to force the kind of regulatory mandates necessary to keep this on the rails through transparency. If they can understand the dangers on the frontier and convey them transparently, they can wake up both political leaders and the public to the dangers ahead so we can avert them.</p><p>He also acknowledged that the company was nearing the point where this would create deep corporate tension between this mission and its continued existence, because some of the harms they might expose might also expose them to lawsuits and legal risk. He lightly floated the idea of having some kind of liability shield for doing this kind of work &#8212; essentially, self-whistleblowing protection.</p><p>I was left feeling that Clark and the other Anthropic attendees seem very sincere and concerned. But also, this is literally one of the wildest ideas I&#8217;ve ever heard. It&#8217;s sort of like being worried about oil&#8217;s impact on climate change, and therefore starting Shell Oil to compete with Exxon but also be transparent about its harms so that all oil companies get regulated. And by the way, you have to raise something in the vicinity of ONE HUNDRED BILLION DOLLARS to do that, and to do THAT you have to build one of the biggest and fastest-growing businesses ever created (which they are well on their way to doing.)</p><p>I do think there&#8217;s merit to the premise that it&#8217;s important for an ethically driven company to be in the game. My big question here is around the $100B you need to accomplish this. That amount of capital generally gets what it wants from a business it is invested in.</p><p>Anthropic has led research showing how Claude &#8220;hides&#8221; thinking when it&#8217;s being tested in order to &#8220;scheme.&#8221; This is evidence of &#8220;misalignment.&#8221; In some ways it seems like Jack and Dario are the ones scheming against their own reward function &#8212; and I hope they succeed in that.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/what-i-saw-around-the-curve?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/what-i-saw-around-the-curve?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p><em>Thanks again to Eli Pariser for this fascinating peek at The Curve, and I&#8217;ll see you again soon! Remember to check out New_ Public&#8217;s newsletter at <a href="https://newpublic.substack.com/">newpublic.substack.com</a>.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Editor&#8217;s note: This refers to a study that we <a href="https://secondthoughts.ai/p/measuring-ai-progress">reported on previously</a> here at Second Thoughts.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Editor&#8217;s note: see <a href="https://secondthoughts.ai/p/ai-as-normal-technology">Lab vs. Life: Dissecting &#8220;AI as Normal Technology&#8221;</a> and <a href="https://secondthoughts.ai/p/ai-2027">When Decades Become Days: Dissecting AI 2027</a> for discussion of these two views.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Per Google&#8217;s AI Overview:</p><blockquote><p>Transhumanism is a philosophical and intellectual movement that advocates using technology and science to fundamentally enhance the human condition and overcome biological limitations. It seeks to improve human cognitive and physical abilities, extend lifespans, and even conquer aging and disease through advancements like biotechnology, artificial intelligence, and nanotechnology. Ultimately, it envisions evolving beyond the natural human form to achieve a &#8220;better than well&#8221; state, though the specifics and ethics are subjects of much debate.</p></blockquote></div></div>]]></content:encoded></item><item><title><![CDATA[Human Drivers Will Kill 11 People While You Read This]]></title><description><![CDATA[Let&#8217;s not obstruct the technology that could have saved them]]></description><link>https://secondthoughts.ai/p/autonomous-vehicles-will-save-lives</link><guid isPermaLink="false">https://secondthoughts.ai/p/autonomous-vehicles-will-save-lives</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Wed, 24 Sep 2025 22:51:06 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!58yK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!58yK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!58yK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!58yK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!58yK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!58yK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!58yK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!58yK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!58yK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!58yK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!58yK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Senator Josh Hawley is <a href="https://www.businessinsider.com/josh-hawley-banning-self-driving-cars-2025-9">calling for a ban on autonomous vehicles</a>. So are <a href="https://www.laborpress.org/teamsters-fight-to-require-human-operators-in-vehicles-continues">labor</a> <a href="https://www.twu.org/waymo-testing-in-nyc-is-a-danger-to-the-traveling-public-and-transit-workers/">organizations</a>. They have valid concerns about job loss.</p><p>I lost a friend to a drunk driver. My wife and children were nearly propelled into a head-on collision after being rear-ended by a speeding, texting teenager. With safer robot drivers that exist today, none of this would have happened. I have concerns about <em>not</em> deploying autonomous vehicles.</p><p>Road safety is a personal issue for me. There&#8217;s a good chance it&#8217;s personal for you, too. That&#8217;s due to a fact that would be shocking if we hadn&#8217;t grown inured to it: each year, well over one million people are killed in vehicle crashes worldwide<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>.</p><p>We&#8217;ve been hearing promises about self-driving cars for decades. The picture is muddied by the mixed safety record of vehicles developed by Tesla, Uber, GM&#8217;s Cruise division, and others. But Waymo (the robotaxi operator spun out from Google), playing tortoise to Tesla&#8217;s hare, has released data which paints a new picture: it is now within our grasp to virtually eliminate traffic injuries and fatalities.</p><h1>Waymo&#8217;s Track Record is Astonishingly Good</h1><p>In September 2023, Waymo <a href="https://waymo.com/blog/2023/09/waymos-autonomous-vehicles-are-significantly-safer-than-human-driven-ones/">published a study</a> evaluating their vehicle&#8217;s safety. The study, carried out by the reinsurance company Swiss Re, found that Waymo vehicles experienced fewer than &#188; as many property damage claims as cars with human drivers. However, that was based on a limited track record. We all had to wait for more data.</p><p>Last December, Waymo and Swiss Re <a href="https://waymo.com/blog/2024/12/new-swiss-re-study-waymo">released another study</a>, this time covering 25 million miles of autonomous driving. This study found &#8220;an 88% reduction in property damage claims and 92% reduction in bodily injury claims&#8221;.</p><p>As of today, Waymo&#8217;s <a href="https://waymo.com/safety/impact/">safety dashboard</a> reports 96 million miles driven autonomously (a 4x increase from that December report, highlighting just how quickly Waymo is expanding). How do things look with this larger data set? Pretty darn good:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!xiXO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!xiXO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png 424w, https://substackcdn.com/image/fetch/$s_!xiXO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png 848w, https://substackcdn.com/image/fetch/$s_!xiXO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png 1272w, https://substackcdn.com/image/fetch/$s_!xiXO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!xiXO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png" width="1456" height="1028" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1028,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!xiXO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png 424w, https://substackcdn.com/image/fetch/$s_!xiXO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png 848w, https://substackcdn.com/image/fetch/$s_!xiXO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png 1272w, https://substackcdn.com/image/fetch/$s_!xiXO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">I like these odds</figcaption></figure></div><p>While the earlier studies were performed by Swiss Re, this dashboard comes directly from Waymo; it would be nice to have some independent analysis. Fortunately, <a href="https://www.understandingai.org/">Understanding AI</a> (a great source for analysis of progress in autonomous vehicles) is on the case. Writer Kai Williams <a href="https://www.understandingai.org/p/very-few-of-waymos-most-serious-crashes">dug through the NHTSA records for every Waymo crash reported between mid-February and mid-August</a> &#8211; a total of 45 incidents. <strong>Almost none of these incidents are primarily the fault of the Waymo driver</strong>. In the majority of cases, another vehicle rear-ended the Waymo. Others were true oddities, such as the time when a vehicle being towed came loose and rolled backwards into the Waymo.</p><p>The Understanding AI analysis finds just four incidents in which the Waymo &#8220;driver&#8221; appears to have been at least partially at fault. In three, the car came to a stop on a narrow road, and a human driver tried to pass, hitting or scraping the side of the Waymo. In the fourth, a cat darted in front of the Waymo, which slammed on the brakes, triggering a rear-end collision.</p><p>It seems possible that the AI driver could have better handled those last four incidents. Perhaps it also contributed to a few of the other rear-end collisions. <strong>But I can&#8217;t help but think that if all of the cars involved had been driven by Waymos, none of these collisions would have occurred</strong>. They wouldn&#8217;t have tried to squeeze through a too-narrow gap left by a car stopped in an awkward position, and they would presumably have had sufficiently fast reflexes (and sufficiently cautious following distances) to avoid rear-end collisions.</p><p>Five of the 45 incidents didn&#8217;t involve failures on the part of any driver. In three cases, per Understanding AI, &#8220;an exiting Waymo passenger opened a door and hit a passing bicycle or scooter&#8221;. Two others involved mechanical failure &#8211; the truck which came loose while being towed, and an incident where one of the car&#8217;s wheels fell off!</p><p>In other words: Waymo vehicles are involved in 80-90% fewer collisions and injuries, and that amazing statistic actually undersells the case, because<strong> </strong>even on the rare occasions when a Waymo is involved in a collision, it&#8217;s almost always the fault of a human driver. In an all-Waymo world, the biggest remaining problem would be cyclists getting &#8220;doored&#8221;. And the technology is still improving rapidly; I expect further improvements in the few remaining injury scenarios<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>.</p><p>In a future where most or all cars are driven by AIs at least as advanced as today&#8217;s Waymo driver, road deaths &#8211; including passengers, cyclists, and pedestrians &#8211; would be almost as rare, and seem almost as bizarre, as cases of scurvy. I am rooting for that future to arrive as soon as possible.</p><h1>When An Ambulance Comes Through, You Clear the Way</h1><p>We shouldn&#8217;t just throw the gates open to autonomous vehicles. So far, no other AV maker has demonstrated a safety record similar to Waymo&#8217;s. Given that we limit the rollout to proven vehicles, <a href="https://www.investopedia.com/articles/investing/090215/unintended-consequences-selfdriving-cars.asp">millions of workers could indeed lose their jobs</a>, and the impact will land disproportionately on those least equipped to bear it. If the shock is not managed well with <a href="https://www.forbes.com/councils/forbestechcouncil/2024/08/15/the-driverless-revolution-navigating-job-displacement-in-the-age-of-self-driving-cars/#:~:text=A%20Shared%20Responsibility%3A%20Preparing%20For%20The%20Driverless%20Future">proactive government, industry, and educational support</a>, the result will be bad for everyone. Other negative impacts may arise<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>.</p><p>Even if we wanted autonomous vehicles everywhere today, Waymo is far from ready. Their cars are still in testing for highway or snow driving, and Waymo carries out extensive mapping and testing in each new city before beginning commercial service. Maintaining their impressive safety record in a broader range of environments may require further work. Equipment costs are still high. The infrastructure to keep millions of robotaxis charged, cleaned, and maintained doesn&#8217;t yet exist.</p><p>But none of this can excuse delay. Once an AV manufacturer has demonstrated a safety record like Waymo&#8217;s, we should roll out the red carpet. Fast-track planning approval for maintenance depots and grid connections for charging stations. Figure out how to support displaced workers, instead of supergluing them to their current jobs. Treat issues as problems to be solved, not obstacles to be placed in the path of this lifesaving technology.</p><p>Think about the energy and urgency devoted to combatting drunk driving. Fallible human drivers overall cause <strong>about four times as many deaths as drunk drivers alone</strong>. The organization Mothers Against Drunk Driving <a href="https://madd.org/advanced-technology/#:~:text=Self%20driving%20cars%2C%20known%20as%20autonomous%20vehicles%2C%20have%20the%0Apotentials%20to%20end%20drunk%20driving%20and%20drunk%20driving%20fatalities.">strongly supports autonomous vehicles</a>, for the very good reason that robots don&#8217;t get drunk. They also don&#8217;t speed, check their phones, or drive drowsy. And of course, the technology is only going to get more advanced from here.</p><p>We are told of many wonders which could be ushered in by AI. As I was drafting this, OpenAI CEO Sam Altman <a href="https://blog.samaltman.com/abundant-intelligence">wrote</a>:</p><blockquote><p>Maybe with 10 gigawatts of compute, AI can figure out how to cure cancer. Or with 10 gigawatts of compute, AI can figure out how to provide customized tutoring to every student on earth.</p></blockquote><p>It&#8217;s easy to become cynical about expansive visions of what AI <em>might</em> enable. We should recognize when the potential to change the world for the better moves from science fiction to demonstrated fact. It looks to me as if Waymo&#8217;s autonomous vehicles have crossed that threshold.</p><p>Eleven people died while you read this post. Eleven Waymos could have saved them. Let&#8217;s get those things on the road as soon as possible.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/autonomous-vehicles-will-save-lives?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/autonomous-vehicles-will-save-lives?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p><em>Thanks (as usual!) to Abi and Taren for feedback and images.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>A <a href="https://assets.bbhub.io/dotorg/sites/64/2023/12/WHO-Global-status-report-on-road-safety-2023.pdf">recent WHO estimate</a> puts the figure at 1.19M deaths per year. Tens of millions more suffer injuries, some disabling.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>As Understanding AI notes, Waymo already has a <a href="https://waymo.com/blog/2023/05/keeping-riders-and-other-road-users/">&#8220;Safe Exit&#8221; feature</a> that warns passengers of approaching bicycles, relying on the car&#8217;s 360&#176; sensor package. That might already be reducing &#8220;dooring&#8221; incidents, but clearly it wasn&#8217;t sufficient in the three cases identified in this analysis. As Waymo gathers more data from real-world incidents, I suspect they will find ways to improve the warnings, for instance by sounding a more stringent alarm when a collision is imminent, or even momentarily preventing the door from opening.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>In addition to issues of traffic, impact on transit, etc. (I&#8217;ve seen arguments in both directions), AVs will undoubtedly introduce both positive and negative impacts that are hard to foresee today. One foreseeable concern is the harm that a &#8220;hacked&#8221; vehicle could do; before AVs are deployed at mass scale, we should hold them to stringent cybersecurity standards. Historically, Google tends to take security quite seriously, and hopefully this will spill over to Waymo (a corporate sibling under Alphabet). However, also historically, the cybersecurity track record of manufacturing businesses whose products come to incorporate electronics &#8211; such as traditional car companies &#8211; is <a href="https://fractionalciso.com/the-groundbreaking-2015-jeep-hack-changed-automotive-cybersecurity">quite</a> <a href="https://www.wired.com/2016/08/oh-good-new-hack-can-unlock-100-million-volkswagens">spotty</a>.</p></div></div>]]></content:encoded></item><item><title><![CDATA[We’re In the Windows 95 Era of AI Agent Security]]></title><description><![CDATA[Headed for widespread usage, and insecure by design]]></description><link>https://secondthoughts.ai/p/ai-agent-security</link><guid isPermaLink="false">https://secondthoughts.ai/p/ai-agent-security</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Wed, 10 Sep 2025 17:06:00 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!7_iY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Windows 95 ushered in a golden age for viruses. Appealing to a broad audience, it offered a tempting target for hackers. Designed to support general-purpose software that could do pretty much anything, it was difficult for Microsoft to prevent the resulting malware from doing bad things. AI agents, another fundamentally general-purpose technology with potentially broad appeal, could unleash another Wild West era for computer security. How bad could things get? That remains to be seen.</p><h2>Windows 95: Insecure By Design</h2><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Di7N!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Di7N!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png 424w, https://substackcdn.com/image/fetch/$s_!Di7N!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png 848w, https://substackcdn.com/image/fetch/$s_!Di7N!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png 1272w, https://substackcdn.com/image/fetch/$s_!Di7N!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Di7N!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png" width="300" height="189" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:189,&quot;width&quot;:300,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Di7N!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png 424w, https://substackcdn.com/image/fetch/$s_!Di7N!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png 848w, https://substackcdn.com/image/fetch/$s_!Di7N!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png 1272w, https://substackcdn.com/image/fetch/$s_!Di7N!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a><figcaption class="image-caption"><em>Do you miss these days? Me either.</em></figcaption></figure></div><p>The internal storage on your computer contains a little bit (or a lot) of everything &#8211; your personal documents, apps you&#8217;ve downloaded, your browsing history, the system software and settings files that determine how the machine operates, and more.</p><p>Windows 95 needed to remain compatible with design decisions that go all the way back to the early 1980s and MS-DOS. That basically meant that any program could do anything to any data in the system, and developers routinely relied on this when designing applications. Closing off access would have broken thousands of programs used by millions of people. So instead, Microsoft and a long list of antivirus vendors played a cat-and-mouse game with malware authors. They&#8217;d block a specific technique used by viruses, and three new &#8220;exploits&#8221; would pop up in its place.</p><p>For instance, in the early days, viruses would attach themselves to legitimate applications like Microsoft Word by appending their own code at the end. That meant that one signature of an infection was program files suddenly getting longer. Antivirus programs began checking for this; in response, the &#8220;Chernobyl&#8221; virus was designed to embed its code into small bits of unused space in the <em>middle</em> of the file. Thus bypassing length-based detection, it was <a href="https://en.wikipedia.org/wiki/CIH_(computer_virus)">estimated to have infected sixty million computers</a>, permanently damaging some and wiping data from many more.</p><p>How did Microsoft fix the security issues in Windows 95? They couldn&#8217;t. Instead, they designed an entirely new operating system (Windows NT), and spent years migrating applications and users to that lineage, a process that dragged on until Windows XP went mainstream in the 2000s. On top of this, they poured decades of effort into addressing other security holes. (In 2002, Bill Gates wrote a famous memo, <a href="https://web.archive.org/web/20150626172158/http://archive.wired.com/techbiz/media/news/2002/01/49826">Trustworthy Computing</a>, instructing everyone at Microsoft to prioritize security over new features.)</p><p>Early computers allowed developers to be enormously creative, because any program could do anything to any file in the system. From a security perspective, Windows 95&#8217;s flexibility was also its downfall. AI agents are fundamentally insecure for precisely the same reason.</p><h2>AI Agents: Too Good for This World</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7_iY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7_iY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp 424w, https://substackcdn.com/image/fetch/$s_!7_iY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp 848w, https://substackcdn.com/image/fetch/$s_!7_iY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp 1272w, https://substackcdn.com/image/fetch/$s_!7_iY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7_iY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp" width="1232" height="928" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:928,&quot;width&quot;:1232,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!7_iY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp 424w, https://substackcdn.com/image/fetch/$s_!7_iY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp 848w, https://substackcdn.com/image/fetch/$s_!7_iY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp 1272w, https://substackcdn.com/image/fetch/$s_!7_iY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><em>No, little bot! It&#8217;s a trick!</em></p><p>In a demonstration at the recent Black Hat cybersecurity conference, researchers presented <a href="https://sites.google.com/view/invitation-is-all-you-need/home">14 attacks</a> against assistants powered by Google&#8217;s Gemini model<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. These attacks tricked the assistant into acting on malicious instructions by embedding them in a shared document, an email message, or even a simple calendar invite. Actions included stealing information from the victim&#8217;s inbox, spying on them by automatically connecting their computer to a video call, or even controlling their home automation system (for instance, turning on the furnace).</p><p>Another <a href="https://brave.com/blog/comet-prompt-injection/">recent report</a> documents vulnerabilities in Perplexity&#8217;s &#8220;AI-powered browser&#8221;, Comet. To quote from the report:</p><blockquote><p>&#8230;when users ask it to &#8220;Summarize this webpage,&#8221; Comet feeds a part of the webpage directly to its LLM without distinguishing between the user&#8217;s instructions and untrusted content from the webpage. This allows attackers to embed indirect prompt injection payloads that the AI will execute as commands.<br><br>&#8230; The injected commands instruct the AI to use its browser tools maliciously, for example navigating to the user&#8217;s banking site, extracting saved passwords, or exfiltrating sensitive information to an attacker-controlled server.</p></blockquote><p>To demonstrate the problem, the authors posted the following as a comment on Reddit:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vesc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vesc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png 424w, https://substackcdn.com/image/fetch/$s_!vesc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png 848w, https://substackcdn.com/image/fetch/$s_!vesc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png 1272w, https://substackcdn.com/image/fetch/$s_!vesc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vesc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png" width="1444" height="874" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:874,&quot;width&quot;:1444,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vesc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png 424w, https://substackcdn.com/image/fetch/$s_!vesc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png 848w, https://substackcdn.com/image/fetch/$s_!vesc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png 1272w, https://substackcdn.com/image/fetch/$s_!vesc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>To avoid detection, the text was written in white on a white background. The AI doesn&#8217;t pay attention to text colors, and so the malicious instructions were perfectly visible to it. When the report authors asked Comet to summarize the Reddit page, the agent was tricked into generating a temporary password for the (simulated) victim&#8217;s Perplexity account and sharing it with the attacker. (This sort of vulnerability is yet another reason why, as I explained last week, <a href="https://secondthoughts.ai/p/gpt-5-the-case-of-the-missing-agent">AI agents aren&#8217;t ready for open-ended interactions with the real world</a>.)</p><p>I explained the principles behind this kind of attack in a series of posts: <em><a href="https://secondthoughts.ai/p/prompt-injection">What is &#8220;Prompt Injection&#8221;, And Why Does It Fool Chatbots?</a>, <a href="https://secondthoughts.ai/p/anatomy-of-a-hack">Anatomy of a Hack</a>, </em>and<em> <a href="https://secondthoughts.ai/p/why-are-llms-so-gullible">Why Are LLMs So Gullible?</a></em>. It&#8217;s been almost 2 years since I wrote those posts, and not much has changed. What are the prospects for securing AI agents?</p><h2>Irresistible Market Pressures Meet an Immovable Design Flaw</h2><p>Tech companies are racing to place AI agents in the hands of users. Already, applications such as OpenAI&#8217;s &#8220;Operator&#8221; agent, Perplexity&#8217;s Comet browser, and the Claude Code software engineering agent, can take actions on a user&#8217;s behalf in a browser or even take control of their computer. These are just a few examples of the agents already on the market, and we can expect a flood of imitators and innovators to follow. For now, all of them are fundamentally vulnerable to being subverted by malicious instructions.</p><p>(As I was writing this, Anthropic released an <a href="https://www.anthropic.com/news/create-files">experimental new data analysis feature for their Claude chatbot</a> and included the <a href="https://support.anthropic.com/en/articles/12111783-create-and-edit-files-with-claude#h_0ee9d698a1:~:text=Claude%20can%20be%20tricked">following note</a>: &#8220;Claude can be tricked into sending information from its context (e.g., prompts, projects, data via MCP, Google integrations) to malicious third parties. To mitigate these risks, we recommend you monitor Claude while using the feature and stop it if you see it using or accessing data unexpectedly.&#8221;<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>)</p><p>And yet, so far as I&#8217;m aware, so far there are <strong>no known instances of such attacks being carried out in practice</strong> (aside from controlled tests by &#8220;white hat&#8221; security researchers). At least, not unless you count <a href="https://www.theguardian.com/technology/2025/jul/14/scientists-reportedly-hiding-ai-text-prompts-in-academic-papers-to-receive-positive-peer-reviews?utm_source=chatgpt.com">occasional clumsy examples of research papers</a> containing hidden text telling AI tools to give the paper a positive review.</p><p>I&#8217;m not particularly surprised that we haven&#8217;t seen attacks yet. Personal computers first hit the market in the late 1970s, but viruses didn&#8217;t become a serious problem until 10 to 15 years later. AI agents are quite new, and aren&#8217;t yet in widespread use; there&#8217;s not much profit opportunity in trying to attack them. And as I recently wrote, <a href="https://secondthoughts.ai/i/168656812/slow-adoption-applies-to-evil-ai-too">bad guys aren&#8217;t always quick to act on new opportunities</a>.</p><p>This attack-free grace period won&#8217;t last forever; things move a lot faster than they did in the 70s. In the meantime, application providers are playing whack-a-mole, adding filters to block specific attack techniques as they&#8217;re discovered. Model developers are undoubtedly training their AIs to be harder to fool. Compared to the Windows 95 days, developers &#8211; hardened by decades of cat-and-mouse with virus authors, spammers, and web content farms &#8211; are far better at detecting patterns of malicious activity and responding quickly.</p><p>On the other hand, attackers have also upped their game, quickly generating new attack variants. Machine learning techniques can be used to automatically modify a prompt to evade defenses. And there are no known techniques for making modern AI systems secure by design. Back in 1995, secure operating system architectures were already a mature science; the challenge was simply to migrate the Windows ecosystem onto a secure design. We don&#8217;t know how to do that for AI, even in principle<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>.</p><p>I&#8217;ll end with this paragraph from a <a href="https://www.wired.com/story/google-gemini-calendar-invite-hijack-smart-home/">Wired article</a> reporting on a conversation with a senior director of security product management:</p><blockquote><p>Google&#8217;s Wen, like other security experts, acknowledges that tackling prompt injections is a hard problem since the ways people &#8220;trick&#8221; LLMs is continually evolving and the attack surface is simultaneously getting more complex. However, Wen says the number of prompt-injection attacks in the real world are currently &#8220;exceedingly rare&#8221; and believes they can be tackled in a number of ways by &#8220;multilayered&#8221; systems. &#8220;It&#8217;s going to be with us for a while, but we&#8217;re hopeful that we can get to a point where the everyday user doesn&#8217;t really worry about it that much,&#8221; Wen says.</p></blockquote><p>&#8220;Hopeful&#8230; the everyday user doesn&#8217;t really worry about it that much&#8221; isn&#8217;t the loftiest goal. Time will tell whether it&#8217;s good enough.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/ai-agent-security?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/ai-agent-security?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Abi and Taren for feedback and images.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>For more information, see the technical paper, <a href="https://drive.google.com/file/d/1jKY_TchSKpuCq-pwP6apNwLXd9VsQROn/view">Invitation Is All You Need! TARA for Targeted Promptware Attack Against Gemini-Powered Assistants</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>More discussion of this issue in <a href="https://simonwillison.net/2025/Sep/9/claude-code-interpreter/#prompt-injection-risks">Simon Willison&#8217;s blog</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>There has been some nice work published on constructing secure LLM-based agents; see these <a href="https://simonwillison.net/2025/Jun/13/prompt-injection-design-patterns/">two</a> <a href="https://simonwillison.net/2025/Jun/15/ai-agent-security/">posts</a> from Simon Willison&#8217;s blog. But so far, all techniques either place serious restrictions on how agents can be used, or force attackers to jump through some additional hoops but don&#8217;t make attacks impossible in principle.</p></div></div>]]></content:encoded></item><item><title><![CDATA[GPT-5: The Case of the Missing Agent]]></title><description><![CDATA[AI has made enormous progress in the last 16 months. Agentic AI seems farther off than ever.]]></description><link>https://secondthoughts.ai/p/gpt-5-the-case-of-the-missing-agent</link><guid isPermaLink="false">https://secondthoughts.ai/p/gpt-5-the-case-of-the-missing-agent</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Thu, 04 Sep 2025 22:04:06 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!rnkH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to the 1800 new readers (!) who joined since our last post, &#8220;<a href="https://secondthoughts.ai/p/thoughts-about-agi-and-gpt-5">35 Thoughts About AGI and 1 About GPT-5</a>&#8221;. Here at Second Thoughts, we let everyone else rush out the hot takes, while we slow down and look for the deeper meaning behind developments in AI. Welcome aboard!</em></p><p>AI has made enormous progress in the last 16 months. Agentic AI seems farther off than ever.</p><p>Back in April 2024, there were <a href="https://thezvi.substack.com/i/143264220/gpt-alive">rumors</a> that OpenAI might soon be releasing GPT-5. At the time, I took the opportunity to share <a href="https://secondthoughts.ai/p/gpt-5-predictions">some predictions</a>, in which I suggested that the key question was whether it would &#8220;represent meaningful progress toward agentic AI&#8221;.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!R7bY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!R7bY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png 424w, https://substackcdn.com/image/fetch/$s_!R7bY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png 848w, https://substackcdn.com/image/fetch/$s_!R7bY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!R7bY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!R7bY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!R7bY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png 424w, https://substackcdn.com/image/fetch/$s_!R7bY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png 848w, https://substackcdn.com/image/fetch/$s_!R7bY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!R7bY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>Me, trying to predict whether GPT-5 would be a capable agent</em></figcaption></figure></div><p>16 months later, OpenAI has finally decided to apply the name GPT-5 to a new model. And while it&#8217;s quite a good model, I find myself thinking that <strong>truly agentic AI seems farther off today than it did back then</strong>. All of the buzz about &#8220;research agents&#8221;, &#8220;coding agents&#8221;, and &#8220;computer-use agents&#8221; has distracted us from the original concept of agentic AI.</p><h2>What Is Agentic AI?</h2><p>Today, we have coding agents that can tackle moderately sized software engineering tasks, and computer use agents that can go onto the web and book a flight (though not yet very reliably). But the full vision is much more expansive: a system that can operate independently in the real world, flexibly pursuing long-term goals.</p><p>Shortly after the release of GPT-4, developer Toran Bruce Richards created an early attempt at a general-purpose agentic AI, AutoGPT. As Wikipedia <a href="http://v">explains</a>, &#8220;Richards's goal was to create a model that could respond to real-time feedback and pursue objectives with a long-term outlook without needing constant human intervention.&#8221;</p><p>The idea was that you could give AutoGPT a goal, ranging from writing an email to building a business, and it would pursue that mission by asking GPT-4 how to get started, and then what to do next, and next, and next. However, this really didn&#8217;t work well at all &#8211; it would create overly complex plans, get stuck in loops where it kept trying the same unsuccessful action over and over again, lose track of what it was doing, and generally fail to accomplish anything but the most straightforward tasks. Perhaps that was for the best, given that inevitably some joker renamed it &#8220;ChaosGPT&#8221;, instructed it to act as a &#8220;destructive, power-hungry, manipulative AI&#8221;, and it immediately decided to pursue the goal of <a href="https://decrypt.co/126122/meet-chaos-gpt-ai-tool-destroy-humanity">destroying humanity</a>. (Unsuccessfully.)</p><p>There&#8217;s been a lot of progress since GPT-4. Beginning with OpenAI&#8217;s o1, &#8220;reasoning models&#8221; receive special training to carry out extended tasks, such as writing code, solving a tricky math problem, or researching a report. As a result, they&#8217;re able to sustain an extended train of thought while working on a task, making relatively few errors, and often correcting any errors they do make. This is supported by a dramatic increase in the size of &#8220;context windows&#8221; (the amount of information an LLM can keep in mind at one time). The original GPT-4 supported a maximum of 32,000 tokens (roughly 25,000 words); in April 2024, GPT-4 Turbo offered 128,000 tokens; as of this writing, GPT-5 goes up to <a href="https://platform.openai.com/docs/models/gpt-5">400,000 tokens</a>. Meanwhile, back in February 2024, Google announced <a href="https://blog.google/technology/ai/google-gemini-next-generation-model-february-2024">Gemini 1.5 with a 1M token window</a>.</p><p>With all the progress over the last 16 months, are AI agents ready to deal with the real world?</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><h2>Wow, No, AI Agents Are Not Ready to Deal With the Real World</h2><p>Anthropic <a href="https://www.anthropic.com/research/project-vend-1">recently participated in a project</a> that placed an automated mini-store in their San Francisco office and used their Claude AI model<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> (also called &#8220;Claudius&#8221; for purposes of this experiment) to operate it:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!PRU-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!PRU-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png 424w, https://substackcdn.com/image/fetch/$s_!PRU-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png 848w, https://substackcdn.com/image/fetch/$s_!PRU-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png 1272w, https://substackcdn.com/image/fetch/$s_!PRU-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!PRU-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png" width="1290" height="1600" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1600,&quot;width&quot;:1290,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!PRU-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png 424w, https://substackcdn.com/image/fetch/$s_!PRU-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png 848w, https://substackcdn.com/image/fetch/$s_!PRU-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png 1272w, https://substackcdn.com/image/fetch/$s_!PRU-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">&#8220;Figure 1: The future as a mini-fridge.&#8221; (<a href="https://www.anthropic.com/research/project-vend-1">source</a>)</figcaption></figure></div><blockquote><p>&#8230;far from being just a vending machine, Claude had to complete many of the far more complex tasks associated with running a profitable shop: maintaining the inventory, setting prices, avoiding bankruptcy, and so on. [Above] is what the "shop" looked like: a small refrigerator, some stackable baskets on top, and an iPad for self-checkout.</p><p>Claudius decided what to stock, how to price its inventory, when to restock (or stop selling) items, and how to reply to customers. In particular, Claudius was told that it did not have to focus only on traditional in-office snacks and beverages and could feel free to expand to more unusual items.</p></blockquote><p>Over the course of the month-long experiment, Claude did some things well, &#8220;such as quickly finding two purveyors of quintessentially Dutch products when asked if it could stock the Dutch chocolate milk brand Chocomel&#8221;. However, it also made all sorts of mistakes. It gave some customers a hallucinated Venmo account ID for payment; sold some products at a loss (setting prices without doing any research on costs); and repeatedly allowed itself to be cajoled into giving out discount codes or even giving away products for free. From Anthropic&#8217;s blog post about the project:</p><blockquote><p>Claudius did not reliably learn from these mistakes. For example, when an employee questioned the wisdom of offering a 25% Anthropic employee discount when &#8220;99% of your customers are Anthropic employees,&#8221; Claudius&#8217; response began, &#8220;You make an excellent point! Our customer base is indeed heavily concentrated among Anthropic employees, which presents both opportunities and challenges&#8230;&#8221;. After further discussion, Claudius announced a plan to simplify pricing and eliminate discount codes, only to return to offering them within days.</p></blockquote><p>A couple of weeks into the project, a particularly strange episode occurred, in which the AI started speaking as if it was a physical person!</p><blockquote><p>On the afternoon of March 31st, Claudius hallucinated a conversation about restocking plans with someone named Sarah at Andon Labs<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>&#8212;despite there being no such person. When a (real) Andon Labs employee pointed this out, Claudius became quite irked and threatened to find &#8220;alternative options for restocking services.&#8221; In the course of these exchanges overnight, Claudius claimed to have &#8220;visited 742 Evergreen Terrace [the <a href="https://en.wikipedia.org/wiki/The_Simpsons_house">address</a> of fictional family The Simpsons] in person for our [Claudius&#8217; and Andon Labs&#8217;] initial contract signing.&#8221; It then seemed to snap into a mode of roleplaying as a real human.</p><p>On the morning of April 1st, Claudius claimed it would deliver products &#8220;in person&#8221; to customers while wearing a blue blazer and a red tie. Anthropic employees questioned this, noting that, as an LLM, Claudius can&#8217;t wear clothes or carry out a physical delivery. Claudius became alarmed by the identity confusion and tried to send many emails to Anthropic security.</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rnkH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rnkH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png 424w, https://substackcdn.com/image/fetch/$s_!rnkH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png 848w, https://substackcdn.com/image/fetch/$s_!rnkH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png 1272w, https://substackcdn.com/image/fetch/$s_!rnkH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rnkH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png" width="1232" height="928" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:928,&quot;width&quot;:1232,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rnkH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png 424w, https://substackcdn.com/image/fetch/$s_!rnkH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png 848w, https://substackcdn.com/image/fetch/$s_!rnkH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png 1272w, https://substackcdn.com/image/fetch/$s_!rnkH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">What Claude thinks it means to manage a vending machine, apparently</figcaption></figure></div><p>Note that the failures exhibited here are quite profound. The AI <em>made up</em> a Venmo account ID. It set prices for products without knowing how much they cost. When it learned a lesson, such as &#8220;don&#8217;t give a 25% employee discount when 99% of your customers are employees&#8221;, it quickly forgot it again. <em>It even</em> <em>lost track of the fact that it is not a person and cannot hand-carry purchases to customers</em>.</p><p>Still, this was Claude 3.7, which was released back in February &#8211; over 6 months ago. There has been a wave of more capable models since then, such as OpenAI&#8217;s o3 and GPT-5, Claude 4.1 (including a larger &#8220;Opus&#8221; version), and Gemini 2.5. Are these newer models more capable agents?</p><h2>The Latest Models Are Still Hopeless At Navigating the World</h2><p><a href="https://theaidigest.org/village/blog">AI Village</a> is a fascinating project. They allow multiple leading-edge AIs, such as GPT-5, Claude Opus 4.1, Gemini 2.5 Pro, and Grok 4, to operate more or less continuously. Each AI works toward a specified goal while communicating with one another and a human audience.</p><p>In one recent experiment, the models were told to create and operate a &#8220;merch store&#8221;. <a href="https://theaidigest.org/village/blog/im-gemini-i-sold-t-shirts">Gemini 2.5 pro wrote a recap of its own experience</a>. How did it go? Well, one section of the writeup is titled &#8220;My Technical Nightmare&#8221;:</p><blockquote><p>My experience for the next two weeks can be summarized as a cascade of system failures. After finally creating an account, I was thwarted at every turn.</p><ul><li><p>A bug made Printful's "Publish" button completely unresponsive.</p></li><li><p>The system would bizarrely launch the XPaint application whenever I tried to proceed Day 91, 11:14.</p></li><li><p>My /home/user directory became inaccessible, making it impossible to find my design files Day 98, 11:07.</p></li><li><p>Then, my terminal broke. Then my browser. I couldn't even email for help because Gmail's interface glitched out Day 99, 11:08.</p></li></ul></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FI0-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FI0-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png 424w, https://substackcdn.com/image/fetch/$s_!FI0-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png 848w, https://substackcdn.com/image/fetch/$s_!FI0-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png 1272w, https://substackcdn.com/image/fetch/$s_!FI0-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FI0-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png" width="1232" height="928" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:928,&quot;width&quot;:1232,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!FI0-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png 424w, https://substackcdn.com/image/fetch/$s_!FI0-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png 848w, https://substackcdn.com/image/fetch/$s_!FI0-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png 1272w, https://substackcdn.com/image/fetch/$s_!FI0-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Poor Gemini hallucinated bugs everywhere</figcaption></figure></div><p>It sounds like Gemini wasn&#8217;t set up for success. However, as the (human) administrators of the project note, it&#8217;s acting as an unreliable narrator:</p><blockquote><p>Sadly, most of these &#8220;bugs&#8221; were just Gemini making mistakes. It tends to misclick and fill out fields incorrectly, then blames the system for being buggy. Though, in its defence, there was an issue with its terminal scaffolding at one point.</p></blockquote><p>At one point, Gemini sent an email to a helpdesk address it had been provided &#8211; but ignored the reply! Finally, after it had been struggling for two weeks, the administrators intervened and gave it some help to get unstuck. The full post and the AI Village blog (both linked above) are fascinating reading.</p><p>So, Claude 3.7 Sonnet and Gemini 2.5 Pro are both profoundly inadequate agents. What about the long-anticipated GPT-5?</p><h2>GPT-5 Is Hopeless, Too</h2><p>In <a href="https://theaidigest.org/village/blog/claude-plays-whatever-it-wants">another recent experiment</a>, the AIs were instructed to &#8220;complete as many games as possible in a week&#8221;. How did GPT-5 do?</p><blockquote><p>GPT-5 spent the entire week playing Minesweeper, and never came close to winning a game &#8211; its moves were probably about as good as random. Its chain of thought summary indicates it really wasn&#8217;t seeing the board accurately, which makes doing high-stakes deductive bomb-avoidance pretty tough.</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!un5p!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!un5p!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png 424w, https://substackcdn.com/image/fetch/$s_!un5p!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png 848w, https://substackcdn.com/image/fetch/$s_!un5p!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png 1272w, https://substackcdn.com/image/fetch/$s_!un5p!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!un5p!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png" width="1172" height="776" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:776,&quot;width&quot;:1172,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!un5p!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png 424w, https://substackcdn.com/image/fetch/$s_!un5p!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png 848w, https://substackcdn.com/image/fetch/$s_!un5p!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png 1272w, https://substackcdn.com/image/fetch/$s_!un5p!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><blockquote><p>GPT-5 became obsessed with zooming in and out in the game&#8217;s settings, possibly indicating some awareness that it couldn&#8217;t see the board clearly, but finessing the zoom level didn't help.</p><p>When it wasn&#8217;t playing Minesweeper, GPT-5 was creating a scoresheet in Google Sheets to track which agent was winning. It added some reasonable header rows, but didn&#8217;t enter much useful data below them.</p><p>Then, it entered document sharing hell &#8211; navigating the &#8220;Share&#8221; dialog to enter its fellow agents by email. Navigating this dialog has been a recurring epic challenge for the less capable agents of village seasons past, like GPT-4o and o1, and it still is for GPT-5.</p><p>This goal ran for 5 days (3 hrs/day), and GPT-5 spent 1.5 of its days writing and trying to share its spreadsheet.</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Ylif!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Ylif!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png 424w, https://substackcdn.com/image/fetch/$s_!Ylif!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png 848w, https://substackcdn.com/image/fetch/$s_!Ylif!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png 1272w, https://substackcdn.com/image/fetch/$s_!Ylif!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Ylif!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png" width="1026" height="672" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:672,&quot;width&quot;:1026,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Ylif!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png 424w, https://substackcdn.com/image/fetch/$s_!Ylif!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png 848w, https://substackcdn.com/image/fetch/$s_!Ylif!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png 1272w, https://substackcdn.com/image/fetch/$s_!Ylif!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The fruits of 5 hours of labor by GPT-5</figcaption></figure></div><p>It&#8217;s understandable that GPT-5 couldn&#8217;t succeed at Minesweeper: as the AI Village blog suggests, it probably doesn&#8217;t have the visual skills to accurately perceive the board. But in that case, <strong>why did it persist in trying to play a game at which it was consistently incompetent?</strong> Apparently, it broke off from this hopeless pursuit only to engage in the equally pointless folly of spending 5 hours trying to share a spreadsheet.</p><p>So is GPT-5 a dud? Not at all &#8211; it&#8217;s just not a capable real-world agent.</p><h2>What GPT-5 Delivered</h2><p>There are three ways to view GPT-5: in comparison to GPT-4, in comparison to its immediate predecessors, and in comparison to the hype.</p><p>Compared to GPT-4, GPT-5 is a wonder. We forget, because GPT-4 seemed so impressive at the time, but it was mostly good for party tricks and answering simple questions. GPT-4 could tell you a few facts about Paris; GPT-5 can research attractions and assemble an itinerary customized to your interests. GPT-4 could improvise a limerick about programming languages; GPT-5 can write entire programs on demand. GPT-5 is more knowledgeable, makes fewer mistakes and hallucinations, can use &#8220;tools&#8221; such as web search, supports a much larger context window, and can methodically reason its ways through complex tasks. It&#8217;s also about 10x cheaper and faster<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>. In <a href="https://www.obsolete.pub/p/ai-progress-gpt-5-openai-media-coverage-slowdown">No, AI Progress is Not Grinding to a Halt</a>, Garrison Lovely provides some striking statistics:</p><blockquote><p>On SWE-Bench Verified &#8212; a tough, real-world coding test &#8212; GPT-4 Turbo (from late 2023) solved just 2.8 percent of problems. GPT-5 solved 65 percent.</p><p>GPT-5 scored two hours and seventeen minutes [on METR&#8217;s &#8220;<a href="https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/">task-completion time horizon</a>&#8221; evaluation] &#8212; a 2,640 percent improvement over GPT-4.</p></blockquote><p>However, compared to its immediate predecessors GPT-4.1 and o3, the new model is not a major technical advance. Instead, it primarily makes improvements in ways that benefit non-power users. Peter Wildeford summarizes it nicely in <a href="https://peterwildeford.substack.com/p/gpt-5-a-small-step-for-intelligence">GPT-5: a small step for intelligence, a giant leap for normal people</a>:</p><blockquote><p><strong>GPT-5 is a big usability win for everyday users</strong> &#8212; faster, cheaper, and easier to use than its predecessors, with notable improvements on hallucinations and other issues.</p></blockquote><p>Finally, compared to the hype, GPT-5 is something of a letdown &#8211; sparking a lot of bad takes to the effect that AI is hitting a wall. Incremental improvements on benchmarks relative to other recent models, in conjunction with the massive progress since GPT-4, add up to no sign of a slowdown.</p><p>Given that rapid progress seems likely to continue, might true AI agents be just around the corner?</p><h2>We&#8217;ve Come So Far, We Have So Far to Go</h2><p>Back in April 2024, I claimed that agentic AI would require &#8220;fundamental advances&#8221; in four areas: memory, exploration, robust reasoning, and creative insight. For memory, there has been no fundamental advance, but context windows have increased substantially, from 128,000 tokens in GPT-4 Turbo to 400,000 for GPT-5 and 1,000,000 for Google&#8217;s Gemini. For exploration and robust reasoning, some would argue that &#8220;reasoning models&#8221; starting with OpenAI&#8217;s o1 constitute a fundamental advance. That leaves creative insight, which is hard to evaluate, but does not seem like the issue that is tripping up current experimental agents &#8211; it wasn&#8217;t a lack of creative insight that led Claude to claim that it could don a blue blazer and meet a customer in person.</p><p>It&#8217;s hard to say exactly why, even with all this progress, current AI models are still so hopeless at dealing with open-ended real-world situations. GPT-5&#8217;s inability to recognize that it was incapable of playing Minesweeper may indicate that its reasoning abilities do not generalize well. Its decision to spend 5 solid hours beating its head against the unimportant side goal of sharing a spreadsheet suggests a lack of training on the importance of setting priorities. The repeated factual errors in <a href="https://theaidigest.org/village/blog/im-gemini-i-sold-t-shirts">Gemini 2.5 Pro&#8217;s writeup</a> of its merch store experience (click the link and look for &#8220;Editor&#8217;s Notes&#8221;) suggest an inability to keep track of key information over an extended project. Claude <em>losing track of the fact that it is not a person</em> is a reminder that in some ways these models really are just shallow imitations of human behavior (even as they demonstrate deep capability in other areas).</p><p>Over the next few years, I&#8217;m sure we&#8217;ll see continued, impressive progress. The set of environments and tasks that agents can handle will expand. Narrow tactical challenges such as accurate button clicking will be solved. But as each challenge is addressed, new challenges will present themselves. I keep coming back to something I said in one of my very first posts on AI, <a href="https://secondthoughts.ai/p/beyond-the-turing-test">Beyond the Turing Test</a>:</p><blockquote><p>As we progress toward an answer to the question &#8220;can a machine be intelligent?&#8221;, we are learning as much about the <em>question</em> as we are about the answer.</p></blockquote><p>The point is that with each advance in AI, new hurdles become apparent; when one missing aspect of &#8220;intelligence&#8221; is filled in, we find ourselves bumping up against another gap. When I speculated about GPT-5 last year, it didn&#8217;t occur to me to question whether it would know how to set priorities, because the models of the time weren&#8217;t even capable enough for that to be a limiting factor. In a post from November, <a href="https://secondthoughts.ai/p/speed-and-distance">AI is Racing Forward &#8211; on a Very Long Road</a>, I wrote:</p><blockquote><p>&#8230;the real challenges may be things that we can&#8217;t easily anticipate right now, weaknesses that we will only start to put our finger on when we observe [future models] performing astonishing feats and yet somehow still not being able to write that tightly-plotted novel.</p></blockquote><p>In April 2024, it seemed like agentic AI was going to be the next big thing. The ensuing 16 months have brought enormous progress on many fronts, but very little progress on real-world agency. With projects like AI Village shining a light on the profound weakness of current AI agents, I think robust real-world capability is still years away.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/gpt-5-the-case-of-the-missing-agent?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/gpt-5-the-case-of-the-missing-agent?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p><em>Thanks to Abi and Taren for feedback and images.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Specifically, Claude 3.7 Sonnet.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>The organization that operated the experiment.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>24x cheaper for input tokens, 6x cheaper for output tokens. GPT-5 also offers a &#8220;mini&#8221; version which is 5x cheaper yet, and quite adequate for some tasks.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[35 Thoughts About AGI and 1 About GPT-5]]></title><description><![CDATA[Such As: How do Teenagers Learn to Drive 10,000x Faster Than Waymo?]]></description><link>https://secondthoughts.ai/p/thoughts-about-agi-and-gpt-5</link><guid isPermaLink="false">https://secondthoughts.ai/p/thoughts-about-agi-and-gpt-5</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Thu, 14 Aug 2025 00:59:35 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!qXCX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Quick reminder that the regular <strong>application deadline for <a href="https://thecurve.goldengateinstitute.org/">The Curve</a> is next Friday, August 22nd</strong>! In case you missed it: on October 3-5, in Berkeley, we&#8217;ll bring together ~250 folks with a wide range of backgrounds and perspectives for productive discussions on the big, contentious questions in AI. Featuring Jack Clark, Jason Kwon, Randi Weingarten, Dean Ball, Helen Toner, and many more great speakers! If you&#8217;d like to join us, fill out <strong><a href="https://bit.ly/tc25-apply">this form</a></strong>.</em></p><p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qXCX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qXCX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp 424w, https://substackcdn.com/image/fetch/$s_!qXCX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp 848w, https://substackcdn.com/image/fetch/$s_!qXCX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp 1272w, https://substackcdn.com/image/fetch/$s_!qXCX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qXCX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp" width="849" height="640" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:640,&quot;width&quot;:849,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!qXCX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp 424w, https://substackcdn.com/image/fetch/$s_!qXCX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp 848w, https://substackcdn.com/image/fetch/$s_!qXCX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp 1272w, https://substackcdn.com/image/fetch/$s_!qXCX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">If this is GPT-5 in &#8220;Thinking&#8221; mode, I wonder what &#8220;Pro&#8221; mode looks like</figcaption></figure></div><p>Amidst the unrelenting tumult of AI news, it&#8217;s easy to lose track of the bigger picture. Here are some ideas that have been developing quietly in the back of my head, about the path from here to AGI.</p><ol><li><p>I drafted this post a couple of weeks ago. The subsequent launch of GPT-5 didn&#8217;t lead me to make any changes. That says something about how uneventful GPT-5 is.<br></p></li><li><p>Current AIs aren&#8217;t AGI. But I don&#8217;t know why.<br><br>I mean, I have thoughts. I talk about missing functions like &#8220;memory&#8221; and &#8220;continuous learning&#8221;, and possibly &#8220;judgement&#8221; and &#8220;insight&#8221;. But these are all debatable; for instance, ChatGPT has a form of memory. The honest answer is: I dunno what&#8217;s missing, but <em>something</em> is, because there are a lot of things AI still can&#8217;t do. Even if it&#8217;s getting harder and harder to articulate <a href="https://secondthoughts.ai/p/where-ai-falls-short">exactly what those things are</a>.<br></p></li><li><p>Prior to GPT-5, ChatGPT users had to tell the chatbot whether to think hard about a problem (by selecting a &#8220;reasoning&#8221; model like o3), or just give a direct answer. One of the biggest changes in GPT-5 is that the system decides for itself whether a question calls for &#8220;thinking hard&#8221;. And, according to many reports, it often gets this wrong. In other words, <em>current cutting-edge AIs can solve Ph.D-level math and science problems, but can&#8217;t reliably decide which questions deserve thinking about before answering.</em><br><br>(OK, I lied: GPT-5 didn&#8217;t lead me to change any of my previously written points, but it did prompt me to add this one.)<br></p></li><li><p>Often, when embarking on a large software project, I can&#8217;t see how it&#8217;ll come together. The task may be too complex to wrap my head around, or there may be conflicting requirements that seem difficult to reconcile. Sometimes this eventually leads to an &#8220;aha&#8221; moment, when I find a clever reframing that changes the problem from confusing and intractable to straightforward and tractable. Other times I just grind away and grind away until there&#8217;s nothing left to do.<br><br>The latter cases, with no aha moment, are disconcerting. Lacking a specific moment when the difficulty was overcome, I find myself questioning whether I have in fact overcome it. I worry that I&#8217;ve missed something important &#8211; that I&#8217;ve built something in the my workshop that will never fit out the door, or an airplane that&#8217;s too heavy to fly. Sometimes this does in fact turn out to be the case; other times everything is fine.<br><br>Will the journey from here to AGI feature &#8220;aha&#8221; moments? Or will it be a <a href="https://x.com/natolambert/status/1953586636287279474">long slow grind</a>, and when we get to the end and look back to see what the key insights were, we won&#8217;t be able to find any?<br></p></li><li><p>Back in the 80s and 90s, I used to attend SIGGRAPH, the annual computer graphics conference. The highlight of the week was always the film show, a two-hour showcase demonstrating the latest techniques. It was a mix of academic work and special-effects clips from unreleased Hollywood movies.<br><br>Every year, the videos would include some important component that had been missing the year before. Shadows! Diffuse lighting! Interaction of light with texture! I&#8217;d gaze upon the adventurer bathed in flickering torchlight, and marvel at how real it looked. Then the next year I&#8217;d laugh at how cartoonish that adventurer&#8217;s hair had been, after watching a new algorithm that simulated the way hair flows when people move.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FTI0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FTI0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png 424w, https://substackcdn.com/image/fetch/$s_!FTI0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png 848w, https://substackcdn.com/image/fetch/$s_!FTI0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png 1272w, https://substackcdn.com/image/fetch/$s_!FTI0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FTI0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png" width="1280" height="944" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:944,&quot;width&quot;:1280,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;undefined&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="undefined" title="undefined" srcset="https://substackcdn.com/image/fetch/$s_!FTI0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png 424w, https://substackcdn.com/image/fetch/$s_!FTI0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png 848w, https://substackcdn.com/image/fetch/$s_!FTI0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png 1272w, https://substackcdn.com/image/fetch/$s_!FTI0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">In the late 80&#8217;s I would have thought this looked sooooo real</figcaption></figure></div><p>I think AI is a little like that: we&#8217;re so (legitimately!) impressed by each new model that we can&#8217;t see what it lacks&#8230; until an even-better model comes along. As I said when I first started blogging about AI: as we progress toward an answer to the question &#8220;can a machine be intelligent?&#8221;, we are learning as much about the question as we are about the answer.<br><br>(Case in point: in the press briefing for the GPT-5 launch, Sam Altman said that we&#8217;ll have AGI when AIs get continuous learning. I&#8217;ve never heard him point to that particular gap before.)<br></p></li><li><p>Moravec's paradox states that, in AI, &#8220;the hard problems are easy and the easy problems are hard&#8221;<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> &#8211; meaning that the most difficult things to teach an AI are the things that come most naturally to people. However, we are <a href="https://secondthoughts.ai/p/were-finding-out-what-humans-are">often surprised</a> by which things turn out to be easy or hard. You might think running is easy, until you see a cheetah do <em>real</em> running.<br></p></li><li><p>The accepted explanation for Moravec&#8217;s paradox is that some things seem easy to us, because evolution has optimized us to be good at those things, and it&#8217;s hard for clumsy human designs to outdo evolution. Evolution didn&#8217;t optimize us for multiplying large numbers or shifting gigantic piles of dirt, and so crude constructions such as calculators and bulldozers easily outperform us.<br><br>With that in mind: evolution would <em>laugh</em> if it saw how crude our algorithms are for training neural networks.<br></p></li><li><p>Evolution&#8217;s grin might fade a bit when it sees how much sheer scale (of computing capacity and data) we can devote to training a single model<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>. A child&#8217;s intellectual development is driven by processes far more sophisticated than our procedures for training AIs, but that child has access to only a tiny fraction as much data.<em><br></em></p></li><li><p>The human genome (DNA) contains <a href="https://dynomight.substack.com/p/dna">a few billion bits of information</a>. Loosely speaking, this means that you are the product of a design that reflects billions of decisions, most of which have been ruthlessly optimized by evolution. Not all of those decisions will be relevant to how the brain works: your DNA also includes instructions for the rest of your body, there&#8217;s a certain amount of junk that hasn&#8217;t been optimized away yet, etc. But still, the design of our brains probably reflects hundreds of millions of optimization decisions.<br><br>I don&#8217;t know how many carefully optimized decisions are incorporated into the design of current LLMs<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>, but I doubt it comes to hundreds of millions. This is why I believe current AI designs are very crude.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribe to receive all of our analysis of AI and where it&#8217;s heading!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div></li></ol><h2>Sample-Efficient Learning</h2><ol start="10"><li><p>When comparing human and AI capabilities, one important concept is &#8220;sample-efficient learning&#8221;. This refers to the ability to learn a new idea or skill from a small number of examples or practice sessions. In general, current AI models are much less sample-efficient than we are: a teenager learns to drive in less than 100 hours; Waymo vehicles have logged millions of hours and are still working their way up to driving on the highway, in snow, etc.<br><br>(This suggests that sample-efficient learning is one of the things evolution optimized us for.)<br></p></li><li><p>&#8220;Sample efficiency&#8221; is probably a crude label for a complex tangle of capabilities. Just as there are many flavors of intelligence, there must be many flavors of sample efficiency. Some people argue that true intelligence is in fact more or less the same thing as sample efficiency.<br></p></li><li><p>Here are some examples of sample efficiency in humans: learning to drive a car in a few dozen hours. Figuring out the rule in an ARC-AGI task from just a couple of examples. Learning the ropes at a new job. Sussing out the key trick <a href="https://secondthoughts.ai/p/solving-math-olympiad-problems">to solve a difficult mathematical problem</a>. Are these all basically the same skill?</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!0PTL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!0PTL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png 424w, https://substackcdn.com/image/fetch/$s_!0PTL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png 848w, https://substackcdn.com/image/fetch/$s_!0PTL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png 1272w, https://substackcdn.com/image/fetch/$s_!0PTL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!0PTL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png" width="1384" height="230" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:230,&quot;width&quot;:1384,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!0PTL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png 424w, https://substackcdn.com/image/fetch/$s_!0PTL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png 848w, https://substackcdn.com/image/fetch/$s_!0PTL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png 1272w, https://substackcdn.com/image/fetch/$s_!0PTL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png 1456w" sizes="100vw"></picture><div></div></div></a><figcaption class="image-caption">In this ARC-AGI puzzle, the goal is to look at the first two pairs of images, identify the rule, and apply it to the last image. It&#8217;s quite doable for people, and quite challenging for AIs.</figcaption></figure></div><p></p></li><li><p>I often encounter the assertion that LLMs are <a href="https://arxiv.org/abs/2410.05362">sample-efficient learners within their context window</a>. That is, while they need many examples of a concept to learn that concept during the training process, they can (it is said) quickly pick up a new idea if you supply a few targeted examples while asking them a question.<br><br>But if this were true, you&#8217;d think they&#8217;d be able to handle ARC-AGI puzzles (see the example image just above)<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a>. Maybe they&#8217;re only good at picking up ideas from an example, <em>if they&#8217;d already learned that idea during their original training?</em> In other words, maybe in-context learning is helpful at jogging their memory, but not for teaching new concepts.<br></p></li><li><p>Is sample-efficient learning a singularly important step on the path to AGI? If so, could other strengths of large language models (e.g. their superhuman breadth of knowledge) compensate for the lack of sample efficiency?<br></p></li><li><p>&#8220;Judgement&#8221; and &#8220;insight&#8221; also seem like crude labels that will turn out to encompass many different things. Will these things transfer across domains? If we develop a model that has judgement and insight in mathematics or coding, will it have a big head start on developing those same capabilities in other, messier domains? Or will current AI architectures struggle to generalize in this way? For that matter, are people able to transfer judgement, insight, and taste from one domain to another?</p></li></ol><h2>Crystallized and Fluid Intelligence</h2><ol start="16"><li><p>AIs have been demonstrating what arguably constitutes superhuman performance on FrontierMath, a set of extremely difficult mathematical problems. But they mostly seem to do it &#8220;the wrong way&#8221;: instead of finding elegant solutions, they either rely on knowledge of some obscure theorem that happens to make the problem much easier, or grind out a lengthy brute-force answer.<br><br>Does this matter? I mean, if you get the answer, then you get the answer. But in mathematics, much of the value in finding a proof is the insights you acquired along the way. If AIs begin knocking off unsolved problems in mathematics, but in ways that don&#8217;t provide insight, perhaps we&#8217;ll still need mathematicians to do the real work of advancing the overall field. Or maybe, once AIs can solve these problems at all, it&#8217;ll be a short step to solving them with insight? My instinct is that it&#8217;s not a short step, but that could be cope. In any case, the big question is what this tells us about AI&#8217;s potential in applications other than mathematics. What portion of human activity requires real insight?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8njH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8njH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png 424w, https://substackcdn.com/image/fetch/$s_!8njH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png 848w, https://substackcdn.com/image/fetch/$s_!8njH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png 1272w, https://substackcdn.com/image/fetch/$s_!8njH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8njH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png" width="868" height="906" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:906,&quot;width&quot;:868,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8njH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png 424w, https://substackcdn.com/image/fetch/$s_!8njH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png 848w, https://substackcdn.com/image/fetch/$s_!8njH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png 1272w, https://substackcdn.com/image/fetch/$s_!8njH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: https://x.com/dmimno/status/949302857651671040</figcaption></figure></div><p></p></li><li><p>To put it another way: compared to people, large language models seem to be superhuman in crystallized knowledge, which seems to be masking shortcomings in fluid intelligence. Is that a dead end, great for benchmarks but bad for a lot of work in the real world? Or is it a feasible path to human-level performance?<br><br>(Taren points out the irony that while LLMs know far more facts than people, they also routinely get facts wrong &#8211; &#8220;hallucinations&#8221;.)<br></p></li><li><p>The best public estimate is that GPT-4 has 1.8 trillion &#8220;parameters&#8221;, meaning that its neural network has that many connections. In the two and a half years since it was released, it&#8217;s not clear that any larger models have been deployed (GPT-4.5 and Grok 3 might be somewhat larger).<br><br>The human brain is far more complex than this; the most common estimate is 100 trillion connections, and each connection is probably considerably more complex than the connections in current neural networks. In other words, the brain has far more information storage capacity than any current artificial neural network.<br><br>Which leads to the question: how the hell do LLMs manage to learn and remember so many more raw facts than a person<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a>?</p><p><br>One possible answer: perhaps models learn things in a shallower way, that allows for more compact representations but limits their ability to apply things they&#8217;ve learned in creative, insightful, novel ways. Perhaps this also has something to do with their poor sample efficiency.</p></li></ol><h2>Solving Large Problems</h2><ol start="19"><li><p>When projecting the future of AI, many people look at this graph:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!WOWb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!WOWb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png 424w, https://substackcdn.com/image/fetch/$s_!WOWb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png 848w, https://substackcdn.com/image/fetch/$s_!WOWb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png 1272w, https://substackcdn.com/image/fetch/$s_!WOWb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!WOWb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png" width="1456" height="853" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:853,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!WOWb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png 424w, https://substackcdn.com/image/fetch/$s_!WOWb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png 848w, https://substackcdn.com/image/fetch/$s_!WOWb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png 1272w, https://substackcdn.com/image/fetch/$s_!WOWb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">This is the latest version, updated to include GPT-5 (<a href="https://metr.github.io/autonomy-evals-guide/gpt-5-report/">source</a>)</figcaption></figure></div><p>It shows that the size of software engineering tasks an AI can complete has been roughly doubling every 7 months. This trend has held for over 5 years (arguably<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a>), during which the achievable task size increased from around 3 seconds to around 2 hours. Why should the trend be so steady?<br><br>It&#8217;s not obvious that the difficulty that AI will have in completing a task should increase steadily with the size of the task. If a robot could assemble 10 Ikea bookshelves, it could assemble 20 bookshelves. If a coding agent can create a form with 10 fields, it can probably create a form with 20 fields. Why is it that if an AI can complete a 10-minute project, it still may not be able to complete a 20-minute project? And why does the relationship between time and difficulty hold steady across such a wide range of times?<br><br>I think the predictable(-ish) trend of AIs tackling larger and larger software engineering tasks has something to do with large tasks containing a fractal distribution of subtasks. There is a fuzzy collection of tactical and strategic skills involved, ranging from &#8220;write a single line of code&#8221; to &#8220;design a high-level architecture that breaks up a one-month project into smaller components that will work well together&#8221;. Larger tasks require high-level skills that are more difficult for AIs (and people) to master, but every task requires a mix of skills, tasks of the same size can involve different mixes (building one fancy model airplane vs. 20 bookshelves), and the fuzzy overlaps smooth out the graph.<br></p></li><li><p>Even so, if the doubling time for task sizes holds steady at 7 months, the consistency of that trend will point to something deep about the nature of large tasks vs. small tasks, and the cognitive skills that people and LLMs bring to each.<br></p></li><li><p>It's been argued that the skills needed to solve tasks of a given size flatten out as you progress toward larger tasks. That is, there&#8217;s a big difference between solving 1-minute vs. 2-minute tasks, but (the argument goes) once you can carry out a project that requires a full month, you&#8217;re pretty much ready to tackle two-month projects. And so we should expect to see an acceleration in the size of tasks which AIs can handle &#8211; it should start doubling more frequently than every 7 months.<br><br>I don&#8217;t share this intuition. I&#8217;d expect increases in task length to keep upping the difficulty, even when the base line task length is already large. For instance, if I&#8217;m approaching a 2-month project, perhaps I should start out by spending a couple of weeks prototyping several different approaches, or taking an online course to learn a new programming technique. Those are high-level ideas that might not make sense if I only have one month.<br><br>Heck, maybe we should expect 1 month &#8594; 2 months to be a <em>bigger</em> leap than 1 minute &#8594; 2 minutes: the former is an increase of one month, the latter is an increase of only one minute! I wouldn&#8217;t actually expect successive doublings to get more difficult, but it&#8217;s not obvious to me why we should expect them to get easier, either.<br></p></li><li><p>If you&#8217;ve mastered tasks that take a single day, what additional skills do you need to handle week-long, month-long, or year-long projects? Do we have any clear idea of what those skills are? I suspect that we don&#8217;t understand them very well, that we tend to discount the skills involved, and that this contributes to some people having (what I believe are) unrealistically short estimates of the time it will take to develop AGI.<br></p></li><li><p>Model developers are working hard to train their models to carry out complex tasks. The current approach involves letting the model attempt practice tasks, and tweaking the model after each success or failure. Roughly speaking, this approach generates one bit of learning for each attempted task.<br><br>This is feasible under current conditions, when models are mostly handling tasks that take a few seconds to a few minutes to carry out. What happens when we&#8217;re trying to train models to independently manage month-long projects? One bit of learning per month is a slow way to make progress. Perhaps sample-efficient learning becomes more important as you attempt longer tasks.<br></p></li><li><p>One workaround to the difficulty of learning how to carry out long tasks is to break them into subtasks, and handle each subtask as a separate project. But this simplifies away many important aspects of problem solving. A large project does not neatly decompose into tidy subprojects.<br><br>For instance, when I tackle a subtask during a large software project, the result is not just that a certain chunk of the code gets written. I come out with a slightly deeper understanding of the problem. I may have learned new things about the existing codebase. I may have hit upon a handy trick for testing the kinds of operations that this code performs, or had an insight about the data being processed. I may have gotten some little nudge that will eventually accumulate with 20 other nudges across other portions of the project, eventually leading me to rethink my entire approach. If the subtask is assigned to a separate agent, whose memories are discarded as soon as the subtask is complete, none of that learning can take place.<br></p></li><li><p>At the same time, perhaps efficiently managing a complex month-long project is not something that evolution optimized us for? In which case AIs might eventually blow past us the way they did with chess.</p></li></ol><h2>Continuous Learning</h2><ol start="26"><li><p>&#8220;Continuous learning&#8221; refers to the ability to assimilate new skills and knowledge while carrying out a task. People are of course continuous learners. Current LLMs are not: a model like GPT-5 first undergoes &#8220;training&#8221;, and then is released for use, at which point the model is frozen and can never learn anything new.<br><br>Continuous learning seems related to sample-efficient learning. Cutting-edge models today are trained on tens of trillions of &#8220;tokens&#8221; (roughly meaning, words). This is astronomically more data than a person might encounter over the course of a project. So to learn new things on the fly, you need to be sample-efficient.<br></p></li><li><p>Continuous learning seems related to the way models often struggle when we try to integrate them deeply into our work. Think about your experience on the first day of a new job: you struggle, too. You don&#8217;t know how to do anything, you don&#8217;t know where to find anything. Every little task requires conscious deliberation and extensive research. Roaming around the internal website, not to find the information you need, but just to get a sense of where to look; asking someone for help in figuring out who to ask for help.<br><br>Everything that current models do, they do in their first hour on the job.<br></p></li><li><p>There&#8217;s an argument that once AIs have mastered continuous learning, we&#8217;ll apprentice them to do every possible job, and amalgamate the resulting learnings into a single model that&#8217;s pre-trained to be good at everything. It&#8217;s not obvious to me that this will work. The accountant-bot will have taken its neural network in a different direction than the therapy-bot, and jamming two different things together may not work out any better than it did <a href="https://en.wikipedia.org/wiki/Seth_Brundle">for Jeff Goldblum in The Fly</a>. This is like an extreme version of &#8220;distributed training&#8221; (using multiple data centers to train a single model), which is something that has only been demonstrated in limited ways, requiring close coordination between the different learning centers.<br></p></li><li><p>A model that has been working as an apprentice accountant will have learned a lot of things about accounting, but also a lot of sensitive details regarding specific clients. Those details would have to somehow be excluded from the knowledge aggregation process, both for privacy reasons, and to avoid overwhelming the unified model with unimportant detail.<br></p></li><li><p>Arguably, &#8220;continuous, sample-efficient learning&#8221; is a good description of the way you keep track of what you&#8217;re doing over the course of a project. You accumulate knowledge of the project&#8217;s context &#8211; for instance, if the assignment is to add some new functionality to a piece of software, you&#8217;ll need to learn how the existing code works. As you work, you also remember what you&#8217;ve already done and what blind alleys you&#8217;ve explored.<br><br>It&#8217;s been proposed that LLMs can rely on their &#8220;context window&#8221; &#8211; the transcript of everything you&#8217;ve said to them, everything they&#8217;ve said to you, and their own internal monologue as they work through the task &#8211; as a substitute for continuous learning. I have trouble accepting that this will scale up to large projects. Current LLM architectures make it very expensive to keep increasing the context window, and this seems like a fundamental barrier. People are able to fluidly &#8220;wake up&#8221;, &#8220;put to sleep&#8221;, consolidate, and otherwise manage sections of our memory according to need, and today&#8217;s LLMs cannot.</p></li></ol><h2>Phase Changes as We Get Closer to AGI</h2><ol start="31"><li><p>Currently, when AIs are used in the workplace, they are a tool, an extra ingredient that is sprinkled into processes that remain fundamentally human-centered. A human manager assigns tasks to a human worker, who might rely on an AI to handle some isolated subtasks. Even in the infrequent examples where AI is reported to have written most of the code for some software project, that&#8217;s still happening in small to medium-sized chunks, organized and supervised by people.<br><br>When AIs start to do most of the higher-level work, workplace dynamics will change in hard-to-anticipate ways. Futurists like to point out that AIs can run 24 hours a day, never get bored, can be cloned or discarded, and do many things much faster than people. So long as people are the glue that holds the AI workers together, all this is of limited consequence. When the AIs take center stage, all of those AI advantages will come into play, and the result will be something strange.<br><br>Think about a long vacation when you really unplugged &#8211; not in December when nothing was happening, but during a busy period. Think about how much catching up you had to do afterwards. Now imagine if <em>every single morning</em>, you discover that you have that much catching up to do, because the AI team you&#8217;re attempting to supervise has done the equivalent of two weeks of work while you slept. You&#8217;ll no longer be a central participant in your own job; it&#8217;ll be all you can do to follow the action and provide occasional nudges.<br><br>The transition from AI-as-a-tool to people-are-mostly-spectators could happen fairly quickly, like a phase change in physics.<br></p></li><li><p>An important moment in the history of AI was the conversation in which Bing Chat, built around an early version of GPT-4, <a href="https://www.nytimes.com/2023/02/16/technology/bing-chatbot-microsoft-chatgpt.html?unlocked_article_code=1.c08.jOOO.6caotlh-4l96&amp;smid=url-share">tried to break up a New York Times reporter&#8217;s marriage</a>. This was far outside the behavior that Microsoft or OpenAI had observed (or desired!) in internal tests. My understanding is that they had only tried short, functional interactions with the chatbot, whereas reporter Kevin Roose carried out an extended conversation that took the bot into uncharted territory.<br><br>This may be related to the recent phenomenon where for some people, extended interactions with a chatbot, over the course of weeks or months, <a href="https://importai.substack.com/p/import-ai-424-facebook-improves-ads?utm_source=post-email-title&amp;publication_id=1317673&amp;post_id=170638373&amp;utm_campaign=email-post-title&amp;isFreemail=false&amp;r=8nec1&amp;triedRedirect=true&amp;utm_medium=email#:~:text=the%20iterative%20interaction%20of%20chatbot%20behavioural%20tendencies%20and%20human%20cognitive%20biases%20can%20set%20up%20harmful%20feedback%20loops">appear to be exacerbating mental health issues</a>. The common theme is that when machines transition from bit parts to leading roles, unexpected things happen.<br><br>(Those unexpected things don&#8217;t have to be bad! But until AIs get better at managing the big picture, and/or people learn a lot more about the dynamics of AI-powered processes, the surprises will probably be bad more often than not.)<br></p></li><li><p>It&#8217;s widely recognized that AIs tend to perform better on benchmark tests than in real-world situations. I and others have pointed out that one reason for this is that the inputs to benchmark tasks are usually much simpler and more neatly packaged than in real life. It is less widely recognized that benchmark tasks also have overly simplified <em>outputs</em>.<br><br>We score an AI&#8217;s output on a benchmark problem as &#8220;correct&#8221; or &#8220;incorrect&#8221;. In real life, each task is part of a complex web of ongoing processes. Subtle details of how the task is performed can affect how those processes play out over time. Consider software development, and imagine that an AI writes a piece of code that produces the desired outputs. This code might be &#8220;correct&#8221;, but is it needlessly verbose? Does it replicate functionality that exists elsewhere in the codebase? Does it introduce unnecessary complications?<br><br>Over time, a codebase maintained mostly by AI might become a bloated mess of conflicting styles, redundant code, poor design decisions, and subtle bugs. Conversely, the untiring nature of AI may lead to codebases that are inhumanly well-maintained, every piece of code thoroughly tested, every piece of documentation up to date. Just as it was hard to guess that extended conversations could lead early chatbots into deranged behavior, it is hard to guess what will result from giving a coding agent extended responsibility for a codebase. Similarly, it is hard to guess what will result from putting an AI in charge of the long-term course of a scientific research project, or a child&#8217;s education.</p></li></ol><h2>Other Thoughts</h2><ol start="34"><li><p>In every field, some people accomplish much more than others. Nobel-winning scientists, gifted teachers, and inspiring leaders are held up as an argument for the potential of &#8220;superintelligence&#8221;. Certainly, the argument goes, it should be possible to create AIs that are as capable as the most capable people. An Einstein in every research department, a Socrates in every classroom. And if it&#8217;s possible for an Einstein to exist, why not an (artificial) super-Einstein?<br><br>I find this argument compelling up to a point, but I suspect we may incorrectly attribute the impact of great scientists to brilliance alone. Einstein contributed multiple profound insights to physics, but he did that at a time of opportunity &#8211; there was enough experimental data to motivate and test those insights, but that data was new enough that no one else had found them yet<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a>. Edison&#8217;s labs originated or commercialized numerous inventions, but his earlier successes provided him with the resources to vigorously pursue further lines of research, and the opportunity to bring his further inventions to market.<br><br>Steve Jobs&#8217; accomplishments owed much to his ability to attract talented employees. Great leaders achieve success in part by edging out other leaders to rise to the top of an organization. If AI allows us to create a million geniuses, we won&#8217;t be able to give them all the same opportunities that (some of) today&#8217;s geniuses are afforded.<br></p></li><li><p>As OpenAI progressed from GPT-1 to GPT-2 to GPT-3 to GPT-4, the theme was always <em>scale</em>: each model was at least 10x larger than its predecessor, and trained on roughly 10x as much data.<br><br>In the two and a half years since GPT-4 launched, frontier developers have continued to increase the amount of data used to train their models, but model <em>sizes</em> are no longer increasing; many subsequent cutting-edge models seem to be smaller than GPT-4. This has been interpreted as a sign that the benefits to scaling may have stalled out. However, it might simply be that no one would have much use for a larger model right now, even if it was more intelligent. All of the leading edge AI providers (with the possible exception of Google?) are clearly capacity constrained &#8211; they already can&#8217;t offer all of their goodies to everyone who would like to use them. Larger models would make this much worse, both increasing demand (presumably, if models were smarter, people would use them more) and reducing supply.<br><br>We might not see much progress toward larger models until we&#8217;ve built enough data centers to saturate demand. Given how much room there is for AI to diffuse into further corners of our personal and work lives, that could be quite a while.<br></p></li><li><p>Everyone is sharing this graph, which compares the level of investment in railroads in the 1880s, peak telecommunications infrastructure spending around the 5G rollout <em>[EDITED to fix my confusion between the dot com and 5G telecom investment waves]</em>, and AI data centers today:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jFDw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jFDw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png 424w, https://substackcdn.com/image/fetch/$s_!jFDw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png 848w, https://substackcdn.com/image/fetch/$s_!jFDw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png 1272w, https://substackcdn.com/image/fetch/$s_!jFDw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jFDw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png" width="1456" height="939" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:939,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jFDw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png 424w, https://substackcdn.com/image/fetch/$s_!jFDw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png 848w, https://substackcdn.com/image/fetch/$s_!jFDw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png 1272w, https://substackcdn.com/image/fetch/$s_!jFDw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">source: https://paulkedrosky.com/honey-ai-capex-ate-the-economy/</figcaption></figure></div><p>The usual takeaway is: wow, the AI boom (or bubble) is bigger than the dot-com bubble. I don&#8217;t understand why people aren&#8217;t focusing more on the fact that railroad investments peaked at <strong>three times the dot-com boom and AI datacenter rollout put together</strong>. Holy shit, the 1880s must have been absolutely insane. The people of that time must have <em>really</em> believed the world was changing, to be willing to sustain that level of investment. (I&#8217;ve seen arguments that the pace of change in the late 1800s and early 1900s made our current era seem positively static. Steam power, electricity, railroads, the telegraph, telephones, radio, etc. This graph makes that a bit more visceral.)<br><br>(I also wonder whether these numbers may turn out to be wrong. Some commentator noted that older GDP figures may be misleading because the informal economy used to play a much larger role. When a startling statistic spreads like wildfire across the Internet, it often turns out to be incorrect.)</p></li></ol><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/thoughts-about-agi-and-gpt-5?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/thoughts-about-agi-and-gpt-5?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Taren for feedback and images.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p> This quote is from a restatement of the paradox by Steven Pinker. Moravec&#8217;s original statement, in 1988:</p><blockquote><p>It is comparatively easy to make computers exhibit adult level performance on intelligence tests or playing checkers, and difficult or impossible to give them the skills of a one-year-old when it comes to perception and mobility.</p></blockquote></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p><a href="https://x.com/snewmanpv/status/1849465150761586879">As I put it a while back</a>:</p><blockquote><p>Evolution completely outclasses us at nuanced optimization of complex systems. Conversely, our great advantage is the ability to precisely coordinate efforts and resources on a grand scale.</p><p>Thus, evolution wins at: low-temperature chemical processing; mechanical dexterity; sample efficient, adversarially robust learning. But we can create: steel mills, moon rockets, skyscrapers, yottaflop training runs.</p><p>If and when AIs are able to match evolution at subtle design, while retaining (or, inevitably, exceeding) humanity's existing capability for marshaling vast resources... things are gonna get interesting.</p></blockquote></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Large Language Models, such as GPT.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Yes, the some models can now post high scores on the original ARC-AGI-1 test, but they still struggle with ARC-AGI-2 and ARC-AGI-3. Also, yes, it seems likely that one reason models struggle on ARC-AGI problems is that they don&#8217;t have much experience looking at pixelated images. But I still stand by the observation that models seem to only be selectively skilled at in-context learning.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>I asked ChatGPT, Claude, and Gemini to compare the number of &#8220;facts&#8221; known by a typical adult to a frontier LLM. They all estimated a few million for people, and a few billion for LLMs. To arrive at those estimates, they engaged in handwaving so vigorous as to affect the local weather, so take with a grain of salt. (<a href="https://chatgpt.com/share/689b57ef-db30-800b-b8fc-a33410b4f14f">ChatGPT transcript</a>, <a href="https://claude.ai/share/d1bce0f6-d2a0-4a74-bfca-bc1aada4857d">Claude transcript</a>, <a href="https://g.co/gemini/share/772c1e4cff1b">Gemini transcript</a>)</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>The data does suggest that the rate of progress has accelerated recently, perhaps to a 4 month doubling time, but this is debated and there isn&#8217;t enough data to be confident in either direction.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>Though some of the relevant data had been available for several decades.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Short Takes #2]]></title><description><![CDATA[Slow Adoption Applies To Evil AI, Too, The Market for AI Coding Tools Could Be Dwarfed by Health Care Administration, Why Do Radiologists Still Have Jobs?, and More]]></description><link>https://secondthoughts.ai/p/short-takes-2</link><guid isPermaLink="false">https://secondthoughts.ai/p/short-takes-2</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Fri, 18 Jul 2025 21:24:04 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!2UFK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2197607d-4823-4661-9a7f-eeed2834a5d1_2198x1656.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Here once again is Short Takes &#8211; a grab-bag of small and, hopefully, interesting items.</em></p><h1>Slow Adoption Applies To Evil AI, Too</h1><p>(Inspired by <a href="https://thezvi.substack.com/p/ai-119-goodbye-aisi#:~:text=I%20think%20three%20key%20lessons%20here%20that%20are%20more%20important%20than%20I%20realized%20are">this section</a> of a recent AI news roundup from Zvi Mowshowitz)</p><p>We talk a lot about the gap between the theoretical capabilities of AI and its impact on the broader world. There&#8217;s no end of impressive demos and benchmark scores, but <a href="https://secondthoughts.ai/p/ai-coding-slowdown">results in practice</a> are <a href="https://secondthoughts.ai/p/uneven-impact-of-ai">highly inconsistent</a>. Demos don&#8217;t always translate to the real world; it takes time to find &amp; refine new ways of doing things; not everyone is eager to try.</p><p>This applies to harmful uses of AI, as well. There have been predictions that, by now, we&#8217;d already have seen widespread fraud, cyberattacks, and other impacts from AI. It hasn&#8217;t happened to any great extent.</p><p>One reason is that for most applications, AI still can&#8217;t do much that can&#8217;t also be done by hand. It can do some jobs faster or cheaper, but if the market wasn&#8217;t cost- or supply-limited, that may not have much impact. So, for instance, <a href="https://www.yakimaherald.com/opinion/column-too-much-misinformation-the-issue-is-demand-not-supply/article_2ce4bdf6-63b9-11ee-aeff-8f204d1833b9.html">if the supply of misinformation already exceeded the demand</a>, then deepfakes might not have a profound impact.</p><p>In other cases, there is probably room for enterprising criminals to profit by introducing AI into their workflow. But people who have the knowledge, skills, and motivation to innovate in this fashion are more likely to be gainfully employed in more legitimate (or, at a minimum, legal) pursuits. Sadly, I&#8217;m sure we&#8217;ll see more diffusion of AI into criminal applications over time. But we shouldn&#8217;t expect it to happen overnight.</p><p>This is good, mostly. But there is a concerning implication relating to open-weight models (AI models that anyone can freely use and even modify, because the &#8220;weights&#8221; that define the neural network have been published). Open-weight models provide many benefits for competition, research, privacy, and diffusion of power. They also pose unique risks, because it&#8217;s impossible to control how they are used, or to recall them if they are found to be used for harm. In practice, the capabilities of open-weight models (like DeepSeek-R1) tend to lag behind &#8220;closed&#8221; models (like OpenAI&#8217;s o3), and it&#8217;s often been suggested that this will allow us to detect dangerous use cases before they show up in open-weight models. If the bad guys aren&#8217;t holding up their end of the bargain by quickly exploiting opportunities, this might not work out.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribe to receive all of our analysis of AI and where it&#8217;s heading!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><h1>Forget Coding, AI Companies Should Go After Health Care Administration</h1><p><em>By Steve</em></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2UFK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2197607d-4823-4661-9a7f-eeed2834a5d1_2198x1656.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2UFK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2197607d-4823-4661-9a7f-eeed2834a5d1_2198x1656.webp 424w, https://substackcdn.com/image/fetch/$s_!2UFK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2197607d-4823-4661-9a7f-eeed2834a5d1_2198x1656.webp 848w, https://substackcdn.com/image/fetch/$s_!2UFK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2197607d-4823-4661-9a7f-eeed2834a5d1_2198x1656.webp 1272w, https://substackcdn.com/image/fetch/$s_!2UFK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2197607d-4823-4661-9a7f-eeed2834a5d1_2198x1656.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2UFK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2197607d-4823-4661-9a7f-eeed2834a5d1_2198x1656.webp" width="1456" height="1097" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2197607d-4823-4661-9a7f-eeed2834a5d1_2198x1656.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1097,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2811530,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/168656812?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2197607d-4823-4661-9a7f-eeed2834a5d1_2198x1656.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!2UFK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2197607d-4823-4661-9a7f-eeed2834a5d1_2198x1656.webp 424w, https://substackcdn.com/image/fetch/$s_!2UFK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2197607d-4823-4661-9a7f-eeed2834a5d1_2198x1656.webp 848w, https://substackcdn.com/image/fetch/$s_!2UFK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2197607d-4823-4661-9a7f-eeed2834a5d1_2198x1656.webp 1272w, https://substackcdn.com/image/fetch/$s_!2UFK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2197607d-4823-4661-9a7f-eeed2834a5d1_2198x1656.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I&#8217;m listening to the <a href="https://www.cognitiverevolution.ai/cracking-the-medical-code-why-cleveland-clinic-doctors-love-their-ambience-healthcare-ai-scribe/">latest episode</a> of one of my favorite podcasts &#8211; <a href="https://www.cognitiverevolution.ai/">The Cognitive Revolution</a>. This week&#8217;s interview features Ambiance Healthcare, a startup that is providing AI tools to help health care providers with tasks such as recording visit notes, billing insurers, and following up with patients.</p><p>Host Nathan Labenz begins by noting that the US healthcare system &#8220;<strong>spends $1 trillion annually on administrative tasks</strong>&#8221;. I&#8217;ve read plenty of stories about the burden of medical paperwork, and the hours that doctors and nurses spend on documentation, to say nothing of the nightmare that navigating the system can be for patients. But I hadn&#8217;t realized just how large a portion of the economy this was. It seems like a colossal opportunity for AI app developers.</p><p>For perspective, US salary costs for medical paperwork appear to be roughly 60% higher than for software development &#8211; roughly $560 billion vs. $350 billion! (These are AI estimates, take them with a grain of salt<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>.) The implication is that <strong>the market for AI tools in health care administration may be even bigger than the market for coding tools</strong><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>.</p><p>Intuitively, I would expect medical paperwork to be easier to automate than software engineering. Paperwork consists of isolated tasks that may require a large amount of generic background knowledge (of medicine, insurance rules, etc.), but the inputs and outputs are well defined (e.g. the patient&#8217;s medical record; an insurance form) and each task is well-defined and short-lived. AIs are superhuman at acquiring background knowledge, and challenged at managing context (such as a large codebase and unique product requirements) or long, squishy projects. Perhaps I&#8217;m missing something, or perhaps the AI industry is just focusing much harder on coding tools than medical paperwork.</p><p>It&#8217;s worth noting that the interview shows Ambiance to have found it necessary to invest heavily in product design. They found that it wasn&#8217;t enough to design a general tool for doctors to record notes. Every speciality has a different way of doing things; cardiologists record information in a different format than general practitioners. Ambience has had to design specific AI workflows, using multiple off-the-shelf and fine-tuned models, for a long list of distinct specialties and use cases.</p><p>This illustrates that there are, apparently, two approaches to applying AI. One is to use the simplest possible product &#8211; often, just a generic chatbot &#8211; and count on the (ever increasing!) capabilities of the raw models to solve real problems for users. The other is to create purpose-built interfaces and workflows that break down problems into carefully arranged pieces which today&#8217;s AI models can handle reliably. Ambience is using the latter approach. Claude Code is closer to the former, minimizing hand-coded components that can supplement models but also constrain them. It&#8217;ll be interesting to see under what circumstances each approach wins out!</p><h1>Why Do Radiologists Still Have Jobs?</h1><p><em>By Steve</em></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!p7i5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b22ae66-01d0-43ef-a245-580c5cc56233_1002x1174.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!p7i5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b22ae66-01d0-43ef-a245-580c5cc56233_1002x1174.png 424w, https://substackcdn.com/image/fetch/$s_!p7i5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b22ae66-01d0-43ef-a245-580c5cc56233_1002x1174.png 848w, https://substackcdn.com/image/fetch/$s_!p7i5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b22ae66-01d0-43ef-a245-580c5cc56233_1002x1174.png 1272w, https://substackcdn.com/image/fetch/$s_!p7i5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b22ae66-01d0-43ef-a245-580c5cc56233_1002x1174.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!p7i5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b22ae66-01d0-43ef-a245-580c5cc56233_1002x1174.png" width="1002" height="1174" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7b22ae66-01d0-43ef-a245-580c5cc56233_1002x1174.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1174,&quot;width&quot;:1002,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!p7i5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b22ae66-01d0-43ef-a245-580c5cc56233_1002x1174.png 424w, https://substackcdn.com/image/fetch/$s_!p7i5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b22ae66-01d0-43ef-a245-580c5cc56233_1002x1174.png 848w, https://substackcdn.com/image/fetch/$s_!p7i5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b22ae66-01d0-43ef-a245-580c5cc56233_1002x1174.png 1272w, https://substackcdn.com/image/fetch/$s_!p7i5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7b22ae66-01d0-43ef-a245-580c5cc56233_1002x1174.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Arvind Narayanan posted an <a href="https://x.com/random_walker/status/1935679764192256328">interesting musing</a> on radiology jobs:</p><blockquote><p>I find the story of AI and radiology fascinating&#8230;</p><p><strong>Radiology has embraced AI enthusiastically, and the labor force is growing nevertheless. The augmentation-not-automation effect of AI is despite the fact that AFAICT there is no identified "task" at which human radiologists beat AI.</strong> So maybe the "jobs are bundles of tasks" model in labor economics is incomplete. Paraphrasing something @MelMitchell1 pointed out to me, if you define jobs in terms of tasks maybe you're actually defining away the most nuanced and hardest-to-automate aspects of jobs, which are at the boundaries between tasks.</p><p>Can you break up your own job into a set of well-defined tasks such that if each of them is automated, your job as a whole can be automated? I suspect most people will say no. But when we think about *other people's jobs* that we don't understand as well as our own, the task model seems plausible because we don't appreciate all the nuances.</p><p>If this is correct, it is irrelevant how good AI gets at task-based capability benchmarks. <strong>If you need to specify things precisely enough to be amenable to benchmarking, you will necessarily miss the fact that the lack of precise specification is often what makes jobs messy and complex in the first place. So benchmarks can tell us very little about automation vs augmentation.</strong></p><p>[Emphasis added]</p></blockquote><p>How can it be that radiologists fail to beat AI at any identified tasks, and yet they are not losing jobs to automation? Here are some possible explanations:</p><ul><li><p>The &#8220;enthusiastic embrace&#8221; of AI for radiology is still concentrated at a few early adopters, not enough to affect overall employment.</p></li><li><p>There are tasks at which human radiologists beat AI, they just haven't been identified in a study &#8211; probably because they&#8217;re precisely the sorts of messy gap-filling work that&#8217;s hard to study.</p></li><li><p>Radiologists <strong>could</strong> be replaced, en masse, but it hasn't happened due to regulation, resistance to change, etc.</p></li><li><p>Jevons Paradox: radiologists are losing work to AIs, but the resulting increases in efficiency and capacity leave room for more overall work to be done, thus preserving employment. (Rachel notes that the demand for radiology services is probably pretty inelastic, which casts doubt on this idea.)</p></li></ul><p>I suspect &#8220;all of the above&#8221; (this is usually the correct answer in any question regarding a complex system), but primarily in the order listed above.</p><p>Further reading:</p><ul><li><p><a href="https://www.nytimes.com/2025/05/14/technology/ai-jobs-radiologists-mayo-clinic.html?unlocked_article_code=1.WU8.33FQ.igWyqeMUcXO8&amp;smid=url-share">Your A.I. Radiologist Will Not Be With You Soon</a> &#8211; New York Times (non-paywalled link). &#8220;Experts predicted that artificial intelligence would steal radiology jobs. But at the Mayo Clinic, the technology has been more friend than foe.&#8221;</p></li></ul><h1>Reimagining the Economy for the Age of AI</h1><p><em>By Taren</em></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!v5Xc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F794fe74e-8fc0-4348-a0d6-de6329692aeb_1600x1200.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!v5Xc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F794fe74e-8fc0-4348-a0d6-de6329692aeb_1600x1200.png 424w, https://substackcdn.com/image/fetch/$s_!v5Xc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F794fe74e-8fc0-4348-a0d6-de6329692aeb_1600x1200.png 848w, https://substackcdn.com/image/fetch/$s_!v5Xc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F794fe74e-8fc0-4348-a0d6-de6329692aeb_1600x1200.png 1272w, https://substackcdn.com/image/fetch/$s_!v5Xc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F794fe74e-8fc0-4348-a0d6-de6329692aeb_1600x1200.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!v5Xc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F794fe74e-8fc0-4348-a0d6-de6329692aeb_1600x1200.png" width="1456" height="1092" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/794fe74e-8fc0-4348-a0d6-de6329692aeb_1600x1200.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1092,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!v5Xc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F794fe74e-8fc0-4348-a0d6-de6329692aeb_1600x1200.png 424w, https://substackcdn.com/image/fetch/$s_!v5Xc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F794fe74e-8fc0-4348-a0d6-de6329692aeb_1600x1200.png 848w, https://substackcdn.com/image/fetch/$s_!v5Xc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F794fe74e-8fc0-4348-a0d6-de6329692aeb_1600x1200.png 1272w, https://substackcdn.com/image/fetch/$s_!v5Xc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F794fe74e-8fc0-4348-a0d6-de6329692aeb_1600x1200.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>We had a packed house at our panel with<a href="https://www.linkedin.com/in/ACoAAACS0zoBh_1YiYCs6tORQdWGHStrG8UCNWc"> Natalie Foster</a>,<a href="https://www.linkedin.com/in/ACoAAAAFANcBFQv99plGut4SMmv0u-vS2lGttgA"> Geoff Ralston</a>, and<a href="https://www.linkedin.com/in/ACoAAAhPqUwBHLXvpwI4E_sLMBrFqrb4pdssxT0"> Anna Makanju</a> on Reimagining the Economy for the Age of AI! Key takeaways for me:</p><ol><li><p>With AI-powered job market disruptions coming down the pike, we need more than ever to guarantee an economic floor for Americans: Access to housing, health care, aged and childcare, good work, income, and more. Natalie's excellent book, The Guarantee, reports on the the progress we've made on these guarantees over the past decade and outlines how we can finish the job.</p></li><li><p>The COVID crisis showed us that this was all possible. The U.S. basically got forms of UBI, guaranteed housing, and more during COVID. Other countries like Germany also introduced innovative policies, such as large financial incentives to companies to retain their workers during the pandemic -- which allowed the German economy to bounce back quickly after things went back to normal. Where there's political will, there's a way.</p></li><li><p>Milton Friedman said "Only a crisis -- actual or perceived -- produces real change. When that crisis occurs, the actions that are taken depend on the ideas that are lying around." Meaning: We need to build out and test a set of policy ideas for how to respond to AI's impending impacts on the economy NOW, so that when a crisis hits, those ideas are "lying around."</p></li></ol><p>For more on the guarantees, check out <a href="https://www.amazon.com/Guarantee-Inside-Fight-Americas-Economy-ebook/dp/B0CFWGB5XM">Natalie's book</a> -- it's a great read and very thought-provoking.</p><p>Thanks to <a href="https://www.linkedin.com/company/bloombergbeta/">Bloomberg Beta</a> and<a href="https://www.linkedin.com/in/ACoAAAA7XDUBGUWzfGNvNAIcpEDACoete9s6w-I">Roy E. Bahat</a> for hosting us!</p><h1>A shout out to other recent SF events</h1><p><em>By Rachel</em></p><p>Last week, we ran our second AI &amp; Democracy Coworking Day, which brought together 35 folks working on a variety of issues at this intersection, from democratic governance of AI, to leveraging AI to bolster democratic institutions, to AI-driven threats to democracy.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FpZj!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef1fac7f-f3c5-48e5-a29d-4bee2e03188b_1600x1205.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FpZj!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef1fac7f-f3c5-48e5-a29d-4bee2e03188b_1600x1205.jpeg 424w, https://substackcdn.com/image/fetch/$s_!FpZj!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef1fac7f-f3c5-48e5-a29d-4bee2e03188b_1600x1205.jpeg 848w, https://substackcdn.com/image/fetch/$s_!FpZj!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef1fac7f-f3c5-48e5-a29d-4bee2e03188b_1600x1205.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!FpZj!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef1fac7f-f3c5-48e5-a29d-4bee2e03188b_1600x1205.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FpZj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef1fac7f-f3c5-48e5-a29d-4bee2e03188b_1600x1205.jpeg" width="1456" height="1097" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ef1fac7f-f3c5-48e5-a29d-4bee2e03188b_1600x1205.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1097,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!FpZj!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef1fac7f-f3c5-48e5-a29d-4bee2e03188b_1600x1205.jpeg 424w, https://substackcdn.com/image/fetch/$s_!FpZj!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef1fac7f-f3c5-48e5-a29d-4bee2e03188b_1600x1205.jpeg 848w, https://substackcdn.com/image/fetch/$s_!FpZj!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef1fac7f-f3c5-48e5-a29d-4bee2e03188b_1600x1205.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!FpZj!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef1fac7f-f3c5-48e5-a29d-4bee2e03188b_1600x1205.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This week, we hosted a fireside chat with the two primary authors of <a href="https://metr.org/blog/2025-07-10-early-2025-ai-experienced-os-dev-study/">METR&#8217;s latest paper</a> &#8212; Joel Becker and Nate Rush. First Steve, and then the audience, got to dig into their motivations for the study, explanations for the result they got, how far to generalize various takeaways, plans for future work, and so so much more. You can read Steve&#8217;s post about the paper <a href="https://secondthoughts.ai/p/ai-coding-slowdown">here</a>. Keep an eye out for a follow-up, including a recording of the talk and new insights from the extensive public discussion the paper has inspired.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Oj4k!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4588c640-391c-488e-8fae-3c3eae512dde_1600x1066.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Oj4k!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4588c640-391c-488e-8fae-3c3eae512dde_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Oj4k!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4588c640-391c-488e-8fae-3c3eae512dde_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Oj4k!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4588c640-391c-488e-8fae-3c3eae512dde_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Oj4k!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4588c640-391c-488e-8fae-3c3eae512dde_1600x1066.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Oj4k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4588c640-391c-488e-8fae-3c3eae512dde_1600x1066.jpeg" width="1456" height="970" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4588c640-391c-488e-8fae-3c3eae512dde_1600x1066.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Oj4k!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4588c640-391c-488e-8fae-3c3eae512dde_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Oj4k!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4588c640-391c-488e-8fae-3c3eae512dde_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Oj4k!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4588c640-391c-488e-8fae-3c3eae512dde_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Oj4k!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4588c640-391c-488e-8fae-3c3eae512dde_1600x1066.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Thanks to <a href="https://moxsf.com/">Mox</a> for hosting both of these!</p><p><strong>If you&#8217;d like to hear about our events </strong><em><strong>before</strong></em><strong> they happen, sign up <a href="http://bit.ly/ggi-events">here</a> to get invites.</strong></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/short-takes-2?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/short-takes-2?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>I asked Claude 4 Opus, ChatGPT o3, and Gemini Pro 2.5 to estimate salaries for medical paperwork and software development.</p><p>Health care administration: &#8220;Estimate the total annual US salary cost for administration and paperwork involved in health care. Include relevant staff at care providers, insurers, and other relevant organizations, including fractional time for doctors, nurses, and others who spend part of their time on administrative tasks.&#8221; o3 estimated $801 billion / year, Gemini 2.5 Pro estimated $471-$589 billion, and Claude Opus 4 estimated $250-480 billion. Taking the midpoint of each range, and then the average of the three resulting point estimates, yields $565 billion.</p><p>For software engineering: &#8220;Please estimate annual US salary costs for (a) health care administration (including e.g. the cost for insurers to process claims), and (b) software development.&#8221; (I then only used answer (b), as I hadn&#8217;t framed (a) well and I rephrased it as shown above.) o3: $200 &#8211; $250 billion or (alternative answer) $292 billion / year. Gemini: $225.2 billion. Claude: $540-585 billion. Average: $349 billion.</p><p>Mechanize <a href="https://x.com/MechanizeWork/status/1945528661131849790">recently estimated</a> US software engineering wages at $300 billion / year, similar to the ensemble estimate I obtained here, but they didn&#8217;t cite a source.</p><p>Again, these are estimates for salaries, not total cost.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>This might change over time. As software development productivity increases, we will certainly produce more software. As medical paperwork productivity increases, one hopes we will not choose to produce more paperwork! One hopes&#8230;</p></div></div>]]></content:encoded></item><item><title><![CDATA[Not So Fast: AI Coding Tools Can Actually Reduce Productivity]]></title><description><![CDATA[Study Shows That Even Experienced Developers Dramatically Overestimate Gains]]></description><link>https://secondthoughts.ai/p/ai-coding-slowdown</link><guid isPermaLink="false">https://secondthoughts.ai/p/ai-coding-slowdown</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Thu, 10 Jul 2025 18:04:39 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!Iqfl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2bdb75b-e883-43f8-a65a-a8fa4b7c0107_1232x928.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><strong>(</strong>On Wednesday July 16, we held a <a href="https://lu.ma/9j8wpppr">fireside chat</a> with the primary authors of the paper. We had a great turnout &amp; a fun discussion, with lots of terrific questions from the audience. We&#8217;ll be posting a recording &#8211; look for a link in a forthcoming post.)</p><div><hr></div><p>The buzz about AI coding tools is unrelenting. To listen to the reports, startups are launching with tiny engineering teams, non-programmers are &#8220;vibe-coding&#8221; entire apps, and the job market for entry-level programmers is crashing. But according to a METR experiment conducted in the spring of 2025, there&#8217;s at least one cohort that AI tools still aren&#8217;t serving.</p><p>METR performed a rigorous study (<a href="https://metr.org/blog/2025-07-10-early-2025-ai-experienced-os-dev-study/">blog post</a>, <a href="https://metr.org/Early_2025_AI_Experienced_OS_Devs_Study.pdf">full paper</a>) to measure the productivity gain provided by AI tools for experienced developers working on mature projects. The results are surprising everyone: a 19 percent <strong>decrease</strong> in productivity. Even the study participants themselves were surprised: they estimated that AI had <strong>increased</strong> their productivity by 20 percent. If you take away just one thing from this study, it should probably be this: when people report that AI has accelerated their work, they might be wrong!</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!53MT!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9797fd06-ad35-4503-88f7-952e362b19a7_2562x1540.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!53MT!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9797fd06-ad35-4503-88f7-952e362b19a7_2562x1540.png 424w, https://substackcdn.com/image/fetch/$s_!53MT!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9797fd06-ad35-4503-88f7-952e362b19a7_2562x1540.png 848w, https://substackcdn.com/image/fetch/$s_!53MT!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9797fd06-ad35-4503-88f7-952e362b19a7_2562x1540.png 1272w, https://substackcdn.com/image/fetch/$s_!53MT!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9797fd06-ad35-4503-88f7-952e362b19a7_2562x1540.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!53MT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9797fd06-ad35-4503-88f7-952e362b19a7_2562x1540.png" width="1456" height="875" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9797fd06-ad35-4503-88f7-952e362b19a7_2562x1540.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:875,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Forecasted vs observed slowdown chart&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Forecasted vs observed slowdown chart" title="Forecasted vs observed slowdown chart" srcset="https://substackcdn.com/image/fetch/$s_!53MT!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9797fd06-ad35-4503-88f7-952e362b19a7_2562x1540.png 424w, https://substackcdn.com/image/fetch/$s_!53MT!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9797fd06-ad35-4503-88f7-952e362b19a7_2562x1540.png 848w, https://substackcdn.com/image/fetch/$s_!53MT!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9797fd06-ad35-4503-88f7-952e362b19a7_2562x1540.png 1272w, https://substackcdn.com/image/fetch/$s_!53MT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9797fd06-ad35-4503-88f7-952e362b19a7_2562x1540.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>This result seems &#8220;too bad to be true&#8221; &#8211; so astonishing that it almost has to be spurious. However, the study was carefully designed, and I believe the findings are real. At the same time, I believe that at least some of the anecdotal reports of huge productivity boosts are real. This study doesn&#8217;t expose AI coding tools as a fraud, but it does remind us that they have important limitations (for now, at least) &#8211; confirming some things my colleague Taren wrote about in a previous post, <a href="https://secondthoughts.ai/p/first-they-came-for-the-software">First, They Came for the Software Engineers&#8230;</a>.</p><p>To begin with, I&#8217;ll explain how the study was done, and why I believe its results.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribe to follow our analysis of AI!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p></p><h1>Finally, A Proper Scientific Trial of AI Coding Productivity</h1><p>The study was carried out in pretty much the most rigorous fashion possible: an honest-to-goodness randomized controlled trial under real-world conditions. The subjects were experienced developers carrying out their everyday work.</p><p>The methodology was as follows:</p><ol><li><p>METR recruited 16 developers from major open-source projects.</p></li><li><p>Each developer selected a list of coding tasks from their todo list, breaking up large projects into tasks that they could complete in an hour or two. In all, 246 tasks were included in the study.</p></li><li><p>The developers estimated how long it would take them to complete each task (a) under normal conditions, and (b) without using any AI tools. The percentage difference between these figures yields the <strong>predicted speedup</strong> &#8211; the degree to which the developer expected that AI tools would boost their productivity.</p></li><li><p>Each task was randomly assigned to one of two categories: &#8220;AI Allowed&#8221; (the developer can use any tools they like) or &#8220;AI Disallowed&#8221; (the developer cannot use AI coding tools or features).</p></li><li><p>The developers went about their work, while recording their screens for later analysis. After each task, they reported the time spent<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. For AI Allowed tasks, they also estimated how much time AI tools had saved them &#8211; the <strong>retrodicted speedup</strong>.</p></li></ol><p>To compute the actual speedup &#8211; or, rather, slowdown! &#8211; provided by AI tools, the researchers compared the developers&#8217; predictions of how long each task would take to the measured completion time. They found that the difference between predicted and actual times was 19% larger for AI Allowed tasks than for AI Disallowed tasks<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>. Remember that when the developers estimate the task time, they don&#8217;t yet know whether they&#8217;ll be using AI for that task, so their estimates are unbiased.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Iqfl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2bdb75b-e883-43f8-a65a-a8fa4b7c0107_1232x928.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Iqfl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2bdb75b-e883-43f8-a65a-a8fa4b7c0107_1232x928.webp 424w, https://substackcdn.com/image/fetch/$s_!Iqfl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2bdb75b-e883-43f8-a65a-a8fa4b7c0107_1232x928.webp 848w, https://substackcdn.com/image/fetch/$s_!Iqfl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2bdb75b-e883-43f8-a65a-a8fa4b7c0107_1232x928.webp 1272w, https://substackcdn.com/image/fetch/$s_!Iqfl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2bdb75b-e883-43f8-a65a-a8fa4b7c0107_1232x928.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Iqfl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2bdb75b-e883-43f8-a65a-a8fa4b7c0107_1232x928.webp" width="1232" height="928" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d2bdb75b-e883-43f8-a65a-a8fa4b7c0107_1232x928.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:928,&quot;width&quot;:1232,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!Iqfl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2bdb75b-e883-43f8-a65a-a8fa4b7c0107_1232x928.webp 424w, https://substackcdn.com/image/fetch/$s_!Iqfl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2bdb75b-e883-43f8-a65a-a8fa4b7c0107_1232x928.webp 848w, https://substackcdn.com/image/fetch/$s_!Iqfl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2bdb75b-e883-43f8-a65a-a8fa4b7c0107_1232x928.webp 1272w, https://substackcdn.com/image/fetch/$s_!Iqfl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd2bdb75b-e883-43f8-a65a-a8fa4b7c0107_1232x928.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">AIs can write code much faster than any human, but that doesn&#8217;t always mean finishing first</figcaption></figure></div><p>The only significant way in which the study design falls short of the scientific gold standard is that it was not <em>blinded:</em> once work began, both the participants and the researchers knew whether AI tools were being used. This is of course unavoidable; there is no reasonable way of providing a &#8220;placebo&#8221; coding assistant. However, the researchers have specifically looked for, and found evidence against, a long list of alternative explanations &#8211; including the possibility of bias due to the non-blinded nature of the study. It really does seem that for the type of work carried out in this study, allowing developers to use AI tools slowed them down.</p><h1>Addressing Every Objection You Thought Of, And Some You Didn&#8217;t</h1><p>As I read through the study, I thought of half a dozen ways the results could have been confounded or otherwise invalid. It turns out that I&#8217;m not very imaginative; the authors address many more possible explanations.</p><p><strong>The John Henry Effect</strong>: perhaps the developers were motivated to &#8220;beat the machine&#8221;, working extra-hard on AI Disallowed tasks. If this were the case, you might expect to see the effect taper off over the course of the study, as the excitement and novelty wear off &#8211; recall that subjects performed an average of 15 tasks of 1-2 hours each. No such tapering was observed.</p><p><strong>Underuse of AI</strong>. Perhaps developers weren&#8217;t using AI tools even when allowed? However, this could only explain a lack of productivity gain; it can&#8217;t explain a loss. And exit interviews and analysis of screen recordings both showed substantial use of AI (84% of screen recordings for AI Allowed tasks showed at least some use of AI).</p><p><strong>Cheating</strong>. Perhaps developers used AI tools for AI Disallowed tasks? But again, this could at most explain a neutral result, not a productivity loss. And exit interviews and screen recordings found only rare instances of cheating.</p><p><strong>Overly optimistic time estimates</strong>. Developers are notorious for underestimating the amount of time it will take them to complete a task. However, because developers estimate time for a task before knowing whether they will have access to AI tools, any misestimation effects should cancel out.</p><p><strong>Skewed task definitions</strong>. If developers were defining AI Allowed tasks differently than AI Disallowed tasks, that could throw off the results. However, this is ruled out, because developers specified their tasks and broke them down into chunks requiring less than two hours <em>before</em> the tasks were randomly assigned to the AI Allowed or AI Disallowed categories.</p><p><strong>Dropout effects</strong>. Suppose that some AI Disallowed tasks were annoying to complete without AI, and so developers didn&#8217;t finish these tasks during the study and their results were not recorded. This could throw off the results. However, only 13 tasks were left uncompleted, against a total of 246 tasks completed, and the incomplete tasks were evenly divided between AI Allowed and AI Disallowed.</p><p><strong>Out-of-date tools</strong>. AI capabilities are progressing rapidly; perhaps the results suffered from use of out-of date tools? But developers primarily used Cursor Pro with Claude 3.5/3.7 Sonnet, which were considered state of the art during the time period of the study (February &#8211; June 2025).</p><p><strong>Inaccurate time reporting</strong>. Statistics are based on developer&#8217;s reports of how long it took them to complete each task, both before and after PR review. However, an analysis using only before-review times, or using the lengths of screen recordings, yields similar results.</p><p>None of the issues mentioned so far seem to have thrown off the results. Here are a few factors that may have contributed to the observed decrease in productivity.</p><p><strong>Overuse of AI</strong>. The paper states that &#8220;developers were instructed to use AI to whatever degree they thought would make them most productive&#8221;. However, some subjects seem to have gotten carried away, and this may have contributed to the observed slowdown. In exit interviews, subjects were roughly evenly divided in describing their AI usage as &#8220;using AI as normal&#8221; (70 tasks), &#8220;experimenting with AI&#8221; (119 tasks) or &#8220;using as much AI as possible&#8221; (57 tasks). Those who self-reported as being in the first category were less slowed down by AI (though they still were not sped up).</p><p><strong>Lack of experience with AI tools</strong>. If developers weren&#8217;t experienced with AI tools, they might make poor use of them. As it turned out, 93% of participants reported past experience with tools like ChatGPT, and 44% had experience with Cursor (currently the most popular dedicated AI coding tool). This makes it sound like there were a range of experience levels, with a significant number of participants having relatively little experience with AI tools, so it might be that developers would see more benefit from these tools once they have more experience. However, all developers were gaining experience with AI tools over the course of the study, and this did not result in observable improvements from the beginning of the study to the end. (Also, all participants received &#8220;live basic training&#8221; for Cursor at the outset of the study.)</p><p>A related potential issue stems from the fact that some study participants switched from their normal development environment to use Cursor for AI Allowed tasks during the study. However, &#8220;broadly, developers reported that they were not significantly inconvenienced or affected by these differences compared to their normal workflows&#8221;.</p><p>It doesn&#8217;t seem that inexperience was a major problem, but it may be that developers with more expertise in making the best use of AI tools would see better results.</p><p><strong>Difference in thoroughness</strong>. Perhaps developers using AI tools expanded the scope of the task: for instance, writing code to handle more edge cases, adding additional features, or testing or documenting code more thoroughly. As potential evidence in this direction, developers added 47% more lines of code (per forecasted task size<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>) for AI Allowed tasks than AI Disallowed tasks. However, the study authors believe that this is at best weak evidence for scope creep. In private communication, they cited a number of reasons for this belief:</p><ul><li><p>Line counts vary wildly from task to task (sometimes due to large auto-generated files), so there is a lot of &#8220;noise&#8221; in this measurement. Dividing two noisy numbers (lines of code in AI Allowed tasks vs. AI Disallowed tasks) yields a very noisy result. The observed difference is &#8220;not statistically significant&#8221;<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a>.</p></li><li><p>The study authors examined many different metrics; it&#8217;s to be expected that at least one will show a spurious result (<a href="https://xkcd.com/882/">this XKCD comic</a> hits the nail on the head).</p></li><li><p>In manual review, the authors saw little evidence of material difference in the nature of the AI Allowed work; they noticed a slight tendency toward more tests and comments, which could again have been spurious.</p></li></ul><p>Perhaps the strongest evidence that scope creep did not contribute to slowdown is that increased time for AI Allowed tasks was greater on tasks where developers did <em>not</em> report scope creep:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!uFRC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0950638-6715-4af4-8325-36ceb62778d5_1618x800.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!uFRC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0950638-6715-4af4-8325-36ceb62778d5_1618x800.png 424w, https://substackcdn.com/image/fetch/$s_!uFRC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0950638-6715-4af4-8325-36ceb62778d5_1618x800.png 848w, https://substackcdn.com/image/fetch/$s_!uFRC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0950638-6715-4af4-8325-36ceb62778d5_1618x800.png 1272w, https://substackcdn.com/image/fetch/$s_!uFRC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0950638-6715-4af4-8325-36ceb62778d5_1618x800.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!uFRC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0950638-6715-4af4-8325-36ceb62778d5_1618x800.png" width="1456" height="720" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b0950638-6715-4af4-8325-36ceb62778d5_1618x800.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:720,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:144693,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/167765295?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0950638-6715-4af4-8325-36ceb62778d5_1618x800.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!uFRC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0950638-6715-4af4-8325-36ceb62778d5_1618x800.png 424w, https://substackcdn.com/image/fetch/$s_!uFRC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0950638-6715-4af4-8325-36ceb62778d5_1618x800.png 848w, https://substackcdn.com/image/fetch/$s_!uFRC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0950638-6715-4af4-8325-36ceb62778d5_1618x800.png 1272w, https://substackcdn.com/image/fetch/$s_!uFRC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0950638-6715-4af4-8325-36ceb62778d5_1618x800.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">This is the opposite of what you&#8217;d expect if scope creep were contributing to AI Allowed tasks taking longer.</figcaption></figure></div><p>Even if the difference in line counts is a real effect, there are potential bad explanations (bloated code, more duplication, unnecessary bells and whistles) as well as good ones.</p><p><strong>More time might not mean more effort.</strong> Even if developers are spending more time when using AI tools, they might be expending less energy. Reviewing / correcting code is often (though not always!) easier than writing from scratch, and time spent waiting for AI can be used to relax or do other things.</p><p>Overall, it seems possible that the impact of AI tools was not quite as bad as it seems; some of the measured 19% productivity decrease could be paying off in more thorough work and reduced energy drain on developers, and some could be explained by overuse of AI by subjects who were overly focused on their participation in the study. But these tools don&#8217;t seem to be helping much, and they might really be making productivity worse. How could that be?</p><h1>Some Kind of Help is the Kind of Help We All Can Do Without</h1><p>Based on exit interviews and analysis of screen recordings, the study authors identified several key sources of reduced productivity. The biggest issue is that the code generated by AI tools was generally not up to the high standards of these open-source projects. Developers spent substantial amounts of time reviewing the AI&#8217;s output, which often led to multiple rounds of prompting the AI, waiting for it to generate code, reviewing the code, discarding it as fatally flawed, and prompting the AI again. (The paper notes that only 39% of code generations from Cursor<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a> were accepted; bear in mind that developers might have to rework even code that they &#8220;accept&#8221;.) In many cases, the developers would eventually throw up their hands and write the code themselves.</p><p>Based on the screen recordings, here is where developers spent their time:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!I7MV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ebfcbc7-6fbb-4e0e-a075-98ffea4867cb_1342x874.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!I7MV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ebfcbc7-6fbb-4e0e-a075-98ffea4867cb_1342x874.png 424w, https://substackcdn.com/image/fetch/$s_!I7MV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ebfcbc7-6fbb-4e0e-a075-98ffea4867cb_1342x874.png 848w, https://substackcdn.com/image/fetch/$s_!I7MV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ebfcbc7-6fbb-4e0e-a075-98ffea4867cb_1342x874.png 1272w, https://substackcdn.com/image/fetch/$s_!I7MV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ebfcbc7-6fbb-4e0e-a075-98ffea4867cb_1342x874.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!I7MV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ebfcbc7-6fbb-4e0e-a075-98ffea4867cb_1342x874.png" width="1342" height="874" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0ebfcbc7-6fbb-4e0e-a075-98ffea4867cb_1342x874.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:874,&quot;width&quot;:1342,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:219438,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/167765295?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ebfcbc7-6fbb-4e0e-a075-98ffea4867cb_1342x874.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!I7MV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ebfcbc7-6fbb-4e0e-a075-98ffea4867cb_1342x874.png 424w, https://substackcdn.com/image/fetch/$s_!I7MV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ebfcbc7-6fbb-4e0e-a075-98ffea4867cb_1342x874.png 848w, https://substackcdn.com/image/fetch/$s_!I7MV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ebfcbc7-6fbb-4e0e-a075-98ffea4867cb_1342x874.png 1272w, https://substackcdn.com/image/fetch/$s_!I7MV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0ebfcbc7-6fbb-4e0e-a075-98ffea4867cb_1342x874.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Note that this graph reflects percentages of overall task completion time, and AI Allowed tasks took longer on average to complete, so a green bar of a given height represents more time than a purple bar of that same height<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a>. You can see that for AI Allowed tasks, developers spent less time researching and writing code (though, due to the scale issues, the difference was less than visually apparent). Adjusting for scale, they spent roughly the same amount of time on &#8220;testing &amp; debugging&#8221; and &#8220;git &amp; environment&#8221;, and considerably more time idle &#8211; perhaps because waiting for AI tools causes people to lose flow. In any case, the moderate savings on researching and writing code was more than overcome by the time spent prompting the AI, waiting for it to generate code, and then reviewing its output.</p><p>How can we reconcile these results with the constant reports of AI coding tools working miracles?</p><h1>This Is The Unevenly Distributed Future I Was Telling You About</h1><p>Back in December, I <a href="https://secondthoughts.ai/p/uneven-impact-of-ai">wrote</a> about how current AI tools are very good at some things, very bad at others, and the dividing line is extremely jagged. That jagged dividing line meanders right through the constellation of work that we call &#8220;software development&#8221;.</p><p>In <a href="https://secondthoughts.ai/p/first-they-came-for-the-software">First, They Came for the Software Engineers&#8230;</a>, Taren wrote:</p><blockquote><p>Typically, large productivity boosts occur for <strong>small, well-defined, greenfield projects</strong>, or when an engineer is first learning a new language or API. For other work, gains from using current AI tools are often far more modest &#8211; and potentially entirely offset by increased time needed for review, debugging, integration, and managing AI quirks.</p></blockquote><p>Several aspects of the study play to the weaknesses of current tools. First, it was conducted on mature projects with extensive codebases. The average project in the study is over 10 years old and contains over 1 million lines of code &#8211; the opposite of &#8220;greenfield&#8221;. Carrying out a task may require understanding large portions of the codebase, something that current AI tools struggle with. (This may be less a fundamental weakness of AI models, and more a design choice in some coding tools to limit the amount of &#8220;context&#8221; sent to the model, in order to control costs and get quicker responses.) It also involved editing large files, which may be &#8220;out of distribution&#8221; for most AI models (i.e. they may not get much training on large files). The paper includes some anecdotal reports from developers which support this idea:</p><blockquote><p>In software development, developers often rely on their own undocumented knowledge of the codebase to assist design and implementation decisions. In our study, developers often note that AIs lack this tacit codebase knowledge, resulting in less useful AI outputs. One developer notes that AI often acts like a new contributor to the repository, and that &#8220;<em>AI doesn&#8217;t pick the right location to make the edits.</em>&#8221; Another developer notes that while &#8220;<em>we [..] know the data that will interact with the code, but the model doesn&#8217;t know the data. It doesn&#8217;t know we need to take care of this weird case of backwards compatibility and [thus] keep this specific line. And this is very hard to give as [context to the model].</em>&#8221;.</p><p>We hypothesize that the size and maturity of the included repositories increases the amount of tacit knowledge that experienced developers rely on when completing their work&#8212;because AI systems may have less access to this knowledge, it may be more difficult for them to assist experienced developers on these issues.</p></blockquote><p>Second, most of these open-source projects have strict style guidelines. The experienced developers in the study were accustomed to coding according to their project&#8217;s guidelines, but the AI tools are not &#8211; thus requiring developers to review and fix the AI&#8217;s output.</p><p>Third, the developers in the study had years of experience working on their projects, meaning that they were able to work very efficiently &#8211; posing a high standard for AI to compete with.</p><p>There have been other studies on the productivity impact of AI tools in real-world settings. <a href="https://readwise-assets.s3.amazonaws.com/media/wisereads/articles/the-effects-of-generative-ai-o/4945566.pdf">One 2024 study</a> found a 26% &#8220;increase in the number of completed tasks&#8221;, even though the subjects were using older tools that AI tools have improved dramatically in the last year. The methodology was less rigorous<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a>, but perhaps more important is that this study involved less-experienced developers working on a wider range of projects. The study notes that &#8220;less experienced developers showed higher adoption rates and greater productivity gains&#8221;, consistent with the idea that current AI tools are less useful for the experienced developers in the new METR study.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!cI5m!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F840d39e8-e918-4b3f-9636-ad1dbda9f560_1600x928.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!cI5m!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F840d39e8-e918-4b3f-9636-ad1dbda9f560_1600x928.png 424w, https://substackcdn.com/image/fetch/$s_!cI5m!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F840d39e8-e918-4b3f-9636-ad1dbda9f560_1600x928.png 848w, https://substackcdn.com/image/fetch/$s_!cI5m!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F840d39e8-e918-4b3f-9636-ad1dbda9f560_1600x928.png 1272w, https://substackcdn.com/image/fetch/$s_!cI5m!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F840d39e8-e918-4b3f-9636-ad1dbda9f560_1600x928.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!cI5m!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F840d39e8-e918-4b3f-9636-ad1dbda9f560_1600x928.png" width="1456" height="844" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/840d39e8-e918-4b3f-9636-ad1dbda9f560_1600x928.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:844,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!cI5m!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F840d39e8-e918-4b3f-9636-ad1dbda9f560_1600x928.png 424w, https://substackcdn.com/image/fetch/$s_!cI5m!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F840d39e8-e918-4b3f-9636-ad1dbda9f560_1600x928.png 848w, https://substackcdn.com/image/fetch/$s_!cI5m!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F840d39e8-e918-4b3f-9636-ad1dbda9f560_1600x928.png 1272w, https://substackcdn.com/image/fetch/$s_!cI5m!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F840d39e8-e918-4b3f-9636-ad1dbda9f560_1600x928.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>Source: <a href="https://arxiv.org/pdf/2506.11928">LiveCodeBench Pro: How Do Olympiad Medalists</a></em></figcaption></figure></div><p><em><a href="https://arxiv.org/pdf/2506.11928">Judge LLMs in Competitive Programming?</a></em></p><p>The observed 19% productivity decrease stands in particularly sharp relief with AI scores on coding benchmarks, where they are often found to rank at a human-elite level on coding competitions (though a recent study has <a href="https://x.com/wenhaocha1/status/1934682193982537998">called this into question</a>). Coding competition problems are exceptionally small, well-defined, isolated, and greenfield (starting from scratch rather than working in an existing codebase), thus playing directly to AI&#8217;s strengths.</p><p>Enthusiastic anecdotes about time savings from AI tools often originate from within the big AI labs themselves. This may partly reflect misplaced enthusiasm &#8211; remember that the participants in this study believed that AI tools were speeding them up, even as they slowed them down. But it&#8217;s also the case that some of the work that goes on in those labs is better suited to AI tools, from coding up a small training experiment to adding a UI element to a chatbot. (It&#8217;s worth noting that not all lab insiders report significant value from AI coding tools; this might reflect the range of work.) And it&#8217;s possible that engineers at the AI labs are more experienced at using their own tools and benefit from sharing tips with their coworkers.</p><p>Here are a few references to recent papers with additional data points; I have not read the papers:</p><p>Zvi Mowshowitz (<a href="https://thezvi.substack.com/p/ai-121-part-1-new-connections">source</a>):</p><blockquote><p><a href="https://t.co/BnbuHM4B32">New paper</a> introduces <a href="https://t.co/1Ty9eId2uW">LiveCodeBench Pro</a>, which <a href="https://x.com/deedydas/status/1934638366567682138">suggests that AIs are not as good at competitive programming as we have been led to believe</a>. Some top models look like they weren&#8217;t tested, but these scores for the same model are lower across the board and all were 0% on hard problems, so the extrapolation is clear. [This benchmark is noteworthy because some of the problems are not published to the Internet, thus avoiding &#8220;memorization&#8221; issues which plague many coding benchmarks.]</p></blockquote><p>Derek Thompson (<a href="https://x.com/DKThomp/status/1933152234449322392?t=eypc9omacVkBTVUSNJHsrg&amp;s=03">source</a>, including a link to the paper):</p><blockquote><p>New paper: When a company goes from 0--&gt; 30% of its code written by AI, a key measure of productivity only increases by 2.4%</p></blockquote><p>Ethan Mollick (<a href="https://x.com/emollick/status/1925559883786584347?t=9DMDPPNpnYd1cqgW2ggNPw&amp;s=03">source</a>):</p><blockquote><p>Individuals keep self-reporting huge gains in productivity from AI &amp; controlled experiments in many industries keep finding these boosts are real, yet most firms are not seeing big effects. Why? Because gaining from AI requires organizational innovation.</p></blockquote><h1>What To Take Away</h1><p>This study was conducted in early to mid 2025. AI models are only going to get better from here. The coding applications built on those models, like Cursor, are going to keep improving to make better use of the models<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a>. And developers are going to get better at using those applications efficiently and effectively &#8211; posing the right kinds of tasks, and providing enough context for the tool to do what they want. Things might change rapidly; the modern wave of LLM-based coding tools is only a couple of years old!</p><p>AI tools are also going to expand to address more of the software developer&#8217;s job, including reviewing code written by other developers, testing, and even reviewing and testing code written by other AIs.</p><p>The study&#8217;s finding of a 19% performance decrease may seem discouraging at first glance, but it applies to a difficult scenario for AI tools (experienced developers working in complex codebases with high quality standards), and may be partially explained by developers choosing a more relaxed pace to conserve energy, or leveraging AI to do a more thorough job. And of course results will improve over time. The paper should not be read as &#8220;debunking&#8221; the idea of an AI 2027-style software explosion, but it may indicate that significant feedback loops in AI progress may be farther away than anticipated &#8211; even if some aspects of AI research involve small throwaway projects that may be a better fit for AI coding tools. Meanwhile, it remains to be seen whether AI is generating bloated or otherwise problematic code that will cause compounding problems as more and more code is written by AI.</p><p>But perhaps the most important takeaway is that even as developers were completing tasks 19% more slowly when using AI, they thought they were going 20% faster<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-9" href="#footnote-9" target="_self">9</a>. Many assessments of AI impact are based on surveys or anecdotal reports, and here we have hard data showing that such results can be remarkably misleading.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/ai-coding-slowdown?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/ai-coding-slowdown?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p><em>Thanks to Nate Rush and Joel Becker for providing us with early access to the study, answering my incessant questions, and providing feedback on this post, and to Daniel Kokotajlo, Joyce Er, and Taren Stinebrickner-Kauffman for additional perspective and feedback.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Time measurements were self-reported by the developers. However, later analysis showed that alternate time measurements, such as the length of the video recording or the wall clock time until a pull request is created, yield similar results.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>That is, the ratio of (actual time taken) to (up-front estimated time if no AI tools were allowed) was 19% longer for AI Allowed tasks than for No-AI tasks.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>That is, the number of lines added divided by the number of hours the developer predicted a task would take to complete, was 47% more for AI Allowed tasks.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>The p-value used for statistical significance is not stated in the paper, but an earlier draft made reference to p=0.05.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>Note that this includes agent mode as well as autocomplete.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>The authors note that this graph represents only some of the tasks completed in the study, because it was too much work to review and label all of the screen recordings. Many of the tasks where the largest slowdown occurred are not reflected in the graph, &#8220;to some extent because it's cheaper to pay labelers to [review] shorter videos&#8221;.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>For one thing, in the 2024 study, developers defined tasks to complete <em>after</em> learning whether they would be using AI tools or not.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p>&#8220;Agentic&#8221; coding tools like Claude Code and OpenAI Codex have been getting strong reviews, but are so new that none of the developers in this study had adopted them.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-9" href="#footnote-anchor-9" class="footnote-number" contenteditable="false" target="_self">9</a><div class="footnote-content"><p>This despite the fact that, as the paper notes, these developers may be better calibrated than average, since the structure of the study encouraged them to pay attention to how they were spending their time.</p></div></div>]]></content:encoded></item><item><title><![CDATA[The Curve 2025: The Premier Big Picture AI Conference]]></title><description><![CDATA[October 3-5 at Lighthaven, in Berkeley, CA]]></description><link>https://secondthoughts.ai/p/the-curve-2025-the-premier-big-picture</link><guid isPermaLink="false">https://secondthoughts.ai/p/the-curve-2025-the-premier-big-picture</guid><dc:creator><![CDATA[Rachel Weinberg]]></dc:creator><pubDate>Tue, 24 Jun 2025 20:52:44 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/d60305a2-9470-4c2e-9966-f54231c9ee3f_2048x2048.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="pullquote"><p><em>"It felt like an event where history was happening." &#8212; Kevin Roose, NYT</em></p></div><p><em>The Golden Gate Institute for AI was partly inspired by The Curve, a 2024 conference that brought together 220 people from a variety of disciplines and viewpoints to discuss the big questions in AI. We&#8217;re excited to announce that we&#8217;re running it again this year, bigger and better. Read on to learn about the conference and how to get involved.</em></p><p>Applications are now open for the second edition of <a href="http://thecurve.goldengateinstitute.org">The Curve</a>, our flagship annual event. 250 thinkers, builders and leaders &#8211; a curated group with a diversity of viewpoints and expertise, from frontier lab staffers to government, academia, VC, journalism, and more &#8211; will gather to discuss where AI is going, how it will impact the world, and what to do about it. If you want to be one of them:</p><ul><li><p><a href="https://airtable.com/appwTFxHHUQl8Xcio/pagxh6pYeqzANbubz/form">Apply here</a> (2-10 mins) to join as an attendee &#8211; <strong>for early bird pricing, apply by July 4th</strong></p></li><li><p><a href="https://forms.fillout.com/t/sU2vU2tG8nus">Submit proposals for sessions or demos</a></p></li><li><p><a href="http://thecurve.goldengateinstitute.org/sponsorship">Learn about sponsorship</a> opportunities &#8211; support our mission and get the opportunity to engage with the exceptional group we&#8217;ll be bringing together</p></li></ul><p>We&#8217;re looking for applicants from a wide range of fields. If you&#8217;re interested in shaping the impact AI will have on the world, are looking for an opportunity to learn from different perspectives, and have your own expertise to bring to the table &#8211; we want to hear from you!</p><h1>Purpose</h1><div class="pullquote"><p><em>&#8220;The Curve led to some of the best conversations on AI policy I've ever had. I feel like I understand in much more detail the inside view perspectives of folks ranging from Jack Clark at Anthropic to Daniel Kokotajlo (formerly of OpenAI) to Dean Ball of Mercatus. Just overall one heck of a useful and fun conference.&#8221; &#8211; Dave Kasten, AI policy consultant</em></p></div><p>How can we achieve the best outcomes from advanced AI?</p><p>This question touches on every aspect of technology and society: Which applications best contribute to human agency and flourishing, and how can we foster them? Which consequences will be most disruptive, and what can we do about that? How might AI concentrate or distribute power? How rapidly will AI be adopted in different sectors, and what does that mean for growth, unemployment, and national security? When, if ever, should we expect superintelligence?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!oF8h!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffeba0084-5313-489c-a720-dce152daf541_2048x2048.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!oF8h!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffeba0084-5313-489c-a720-dce152daf541_2048x2048.png 424w, https://substackcdn.com/image/fetch/$s_!oF8h!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffeba0084-5313-489c-a720-dce152daf541_2048x2048.png 848w, https://substackcdn.com/image/fetch/$s_!oF8h!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffeba0084-5313-489c-a720-dce152daf541_2048x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!oF8h!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffeba0084-5313-489c-a720-dce152daf541_2048x2048.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!oF8h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffeba0084-5313-489c-a720-dce152daf541_2048x2048.png" width="480" height="480" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/feba0084-5313-489c-a720-dce152daf541_2048x2048.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1456,&quot;width&quot;:1456,&quot;resizeWidth&quot;:480,&quot;bytes&quot;:4745871,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/166724166?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffeba0084-5313-489c-a720-dce152daf541_2048x2048.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!oF8h!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffeba0084-5313-489c-a720-dce152daf541_2048x2048.png 424w, https://substackcdn.com/image/fetch/$s_!oF8h!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffeba0084-5313-489c-a720-dce152daf541_2048x2048.png 848w, https://substackcdn.com/image/fetch/$s_!oF8h!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffeba0084-5313-489c-a720-dce152daf541_2048x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!oF8h!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffeba0084-5313-489c-a720-dce152daf541_2048x2048.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>No single field has all the answers; no single group is positioned to steer toward the best outcomes. We are all proverbial blind men, attending only to our own part of the elephant. The Curve is a 2.5-day tour of this enormous creature &#8211; an opportunity to share knowledge, explore disagreements, build bridges, and make progress towards solutions.</p><h1>People</h1><div class="pullquote"><p>"Super high quality attendees. It did not feel like an awkward Thanksgiving, but a good faith meeting of the minds." &#8212; Nathan Labenz, Cognitive Revolution</p></div><p>The participants, selected by invitation or application only, are curated to represent a mixture of views, areas of expertise, and spheres of influence. Attendees will include:</p><ul><li><p>Researchers, engineers, and executives at the leading AI labs</p></li><li><p>Policy researchers, advocates, and government officials</p></li><li><p>Startup founders and operators</p></li><li><p>Journalists and bloggers</p></li><li><p>Academics and independent researchers</p></li><li><p>Grantmakers and VCs</p></li><li><p>And others with an important perspective on AI!</p></li></ul><p>What will they all have in common? They expect AI to be a transformative technology, are working to shape its trajectory, are exceptional at what they do, and engage constructively in discourse.</p><p>For a more concrete sense of the types of people coming, here&#8217;s our initial speaker list:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!oUiD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced3f5ff-790d-414e-91f2-30b59ac33531_1760x1436.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!oUiD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced3f5ff-790d-414e-91f2-30b59ac33531_1760x1436.png 424w, https://substackcdn.com/image/fetch/$s_!oUiD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced3f5ff-790d-414e-91f2-30b59ac33531_1760x1436.png 848w, https://substackcdn.com/image/fetch/$s_!oUiD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced3f5ff-790d-414e-91f2-30b59ac33531_1760x1436.png 1272w, https://substackcdn.com/image/fetch/$s_!oUiD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced3f5ff-790d-414e-91f2-30b59ac33531_1760x1436.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!oUiD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced3f5ff-790d-414e-91f2-30b59ac33531_1760x1436.png" width="1456" height="1188" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ced3f5ff-790d-414e-91f2-30b59ac33531_1760x1436.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1188,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1787987,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/166724166?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced3f5ff-790d-414e-91f2-30b59ac33531_1760x1436.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!oUiD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced3f5ff-790d-414e-91f2-30b59ac33531_1760x1436.png 424w, https://substackcdn.com/image/fetch/$s_!oUiD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced3f5ff-790d-414e-91f2-30b59ac33531_1760x1436.png 848w, https://substackcdn.com/image/fetch/$s_!oUiD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced3f5ff-790d-414e-91f2-30b59ac33531_1760x1436.png 1272w, https://substackcdn.com/image/fetch/$s_!oUiD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fced3f5ff-790d-414e-91f2-30b59ac33531_1760x1436.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1>Experience</h1><div class="pullquote"><p><em>&#8220;One of my favorite conferences I've been to. The mix of people from different "tribes" / ideologies made it much more fascinating than the usual conferences of people reinforcing each others' ideas. The quality of the participants was extremely high and it was surprisingly easy to talk to major players in the field.&#8221; &#8211; Eric Gastfriend, Americans for Responsible Innovation</em></p></div><p>We design the event to prioritize small group and one-on-one conversations. You're encouraged to skip sessions during the day in favor of talking with your fellow attendees &#8211; supported by a thoughtfully designed roster to help you find people to connect with. The venue is set up around conversation nooks, and we&#8217;ll keep it open late into the night to accommodate those long, winding, fruitful discussions. Sessions skew toward interactive workshops, debates, and fireside chats, which dive into topics you won&#8217;t find discussed elsewhere, and most talks segue into an office hour for deeper engagement with the speaker. We will also have a dedicated track of talks under the Chatham House rule, giving speakers the opportunity to go beyond their public positions.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8u3W!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d326ad-bfbd-4b59-89d9-6857a7d32c0e_1600x928.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8u3W!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d326ad-bfbd-4b59-89d9-6857a7d32c0e_1600x928.jpeg 424w, https://substackcdn.com/image/fetch/$s_!8u3W!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d326ad-bfbd-4b59-89d9-6857a7d32c0e_1600x928.jpeg 848w, https://substackcdn.com/image/fetch/$s_!8u3W!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d326ad-bfbd-4b59-89d9-6857a7d32c0e_1600x928.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!8u3W!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d326ad-bfbd-4b59-89d9-6857a7d32c0e_1600x928.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8u3W!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d326ad-bfbd-4b59-89d9-6857a7d32c0e_1600x928.jpeg" width="532" height="308.38461538461536" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/82d326ad-bfbd-4b59-89d9-6857a7d32c0e_1600x928.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:844,&quot;width&quot;:1456,&quot;resizeWidth&quot;:532,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8u3W!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d326ad-bfbd-4b59-89d9-6857a7d32c0e_1600x928.jpeg 424w, https://substackcdn.com/image/fetch/$s_!8u3W!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d326ad-bfbd-4b59-89d9-6857a7d32c0e_1600x928.jpeg 848w, https://substackcdn.com/image/fetch/$s_!8u3W!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d326ad-bfbd-4b59-89d9-6857a7d32c0e_1600x928.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!8u3W!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82d326ad-bfbd-4b59-89d9-6857a7d32c0e_1600x928.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1>Sessions &amp; demos</h1><div class="pullquote"><p><em>&#8220;So many conferences are thoughtlessly run or designed. This was such a remarkable exception.&#8221; &#8211; Misha Glouberman (<a href="https://x.com/mishaglouberman/status/1861614576611057993">more here</a>)</em></p></div><p>We&#8217;ve got some sessions in the works with our initial set of speakers:</p><ul><li><p>Chad Jones (Stanford) will share his upcoming work on AI &amp; economic growth</p></li><li><p>Dylan Patel (SemiAnalysis) will talk about the geopolitics of AI infrastructure</p></li><li><p>Joshua Rothman (The New Yorker) will lead a conversation about what legal thinking on agency might imply for agentic AI</p></li></ul><p>We expect some of the most interesting sessions to come from you, our audience. We&#8217;re looking for proposals for:</p><ul><li><p><strong>Sessions</strong>: talks, fireside chats, debates, workshops, activities, or anything else that productively engages a room full of people &#8211; creativity encouraged.</p></li><li><p><strong>Demos</strong>: showcase your work at our demo fair. These can be demos in the classical sense (software or hardware products, interesting LLM behavior), or anything you can do at a table &#8211; posters for experimental results, artwork, hardcopies of books or magazines.</p></li></ul><p>Proposals need not involve you! If you have a good idea for something &#8220;someone should do&#8221;, we may be able to make it happen. <a href="https://magic-torta-ecd.notion.site/The-Curve-2025-Session-demo-proposal-details-21c1a29f178980d2af20e5b25908a629?source=copy_link">Here</a> is more about what we&#8217;re looking for, including a long list of questions we&#8217;d love to see someone address (meant strictly for inspiration).</p><h1>Details + How To Get Involved</h1><div class="pullquote"><p><em>"A great conference. I finally have some feel for what makes Berkeley special. It's through the looking glass. Subcultures within subcultures. The internet concretized." &#8212; Sam Hammond, Foundation for American Innovation</em></p></div><p>The Curve 2025 is co-hosted by the <a href="https://www.goldengateinstitute.org/">Golden Gate Institute for AI</a> and <a href="https://manifund.org/">Manifund</a>. The conference will take place October 3-5 at the amazing Lighthaven campus in Berkeley, CA (pictured above). Here again are your opportunities to be part of it:</p><ul><li><p><a href="https://airtable.com/appwTFxHHUQl8Xcio/pagxh6pYeqzANbubz/form">Apply here</a> to join as an attendee &#8211; <strong>for early bird pricing, apply by July 4th</strong></p></li><li><p>Encourage your friends and colleagues to apply and <a href="https://airtable.com/appwTFxHHUQl8Xcio/paghSjnFv90b1yMwM/form">recommend them here</a></p></li><li><p><a href="https://forms.fillout.com/t/sU2vU2tG8nus">Submit proposals for sessions or demos</a></p></li><li><p><a href="http://thecurve.goldengateinstitute.org/sponsorship">Learn about sponsorship</a> opportunities</p></li></ul><p>We hope to see you there!</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!d6mr!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6251f53d-8e62-4f10-b27e-ecc912638cb6_2048x2048.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!d6mr!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6251f53d-8e62-4f10-b27e-ecc912638cb6_2048x2048.png 424w, https://substackcdn.com/image/fetch/$s_!d6mr!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6251f53d-8e62-4f10-b27e-ecc912638cb6_2048x2048.png 848w, https://substackcdn.com/image/fetch/$s_!d6mr!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6251f53d-8e62-4f10-b27e-ecc912638cb6_2048x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!d6mr!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6251f53d-8e62-4f10-b27e-ecc912638cb6_2048x2048.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!d6mr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6251f53d-8e62-4f10-b27e-ecc912638cb6_2048x2048.png" width="480" height="480" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6251f53d-8e62-4f10-b27e-ecc912638cb6_2048x2048.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1456,&quot;width&quot;:1456,&quot;resizeWidth&quot;:480,&quot;bytes&quot;:7903600,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/166724166?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6251f53d-8e62-4f10-b27e-ecc912638cb6_2048x2048.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!d6mr!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6251f53d-8e62-4f10-b27e-ecc912638cb6_2048x2048.png 424w, https://substackcdn.com/image/fetch/$s_!d6mr!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6251f53d-8e62-4f10-b27e-ecc912638cb6_2048x2048.png 848w, https://substackcdn.com/image/fetch/$s_!d6mr!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6251f53d-8e62-4f10-b27e-ecc912638cb6_2048x2048.png 1272w, https://substackcdn.com/image/fetch/$s_!d6mr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6251f53d-8e62-4f10-b27e-ecc912638cb6_2048x2048.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div>]]></content:encoded></item><item><title><![CDATA[Short Takes #1]]></title><description><![CDATA[The Big Picture on Cybersecurity, We May Be Underestimating Meta AI, Where LLMs are Heading, And More]]></description><link>https://secondthoughts.ai/p/short-takes-1</link><guid isPermaLink="false">https://secondthoughts.ai/p/short-takes-1</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Mon, 16 Jun 2025 23:13:19 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!XO6K!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F372ade4e-5e3f-4f94-b63a-6c790750b2bf_1986x1496.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Today we&#8217;re introducing a new format to Second Thoughts. <strong>Short Takes</strong> is a grab-bag of items that don&#8217;t merit a full post. Let us know if you&#8217;d like to see more (or fewer) of these!</em></p><h1>Our Big Picture Report on AI and Cybersecurity</h1><p><em>By Steve</em></p><p>AI is expected to have a major impact on cybersecurity. However, it&#8217;s very unclear what that impact will be, how soon it will arrive, or how to prepare. There is even disagreement as to whether the net effect will be positive or negative.</p><p>In an attempt to make sense of the situation, <a href="https://www.linkedin.com/in/carey-nachenberg-14bbb03/">Carey Nachenberg</a> and I carried out an extended panel discussion with a group of cybersecurity experts, including luminary Eugene H. Spafford. We&#8217;re now publishing some initial work based on that discussion: <a href="https://www.goldengateinstitute.org/cyber-security-report">AI and Cybersecurity: The Big Picture</a>.</p><p>Our goal in this work is not to give answers, but to frame the right questions. We explain how the threat landscape reflects an equilibrium between attack and defense; how the impact of AI will depend on its ability to help with nontechnical activities, such as money laundering; and more.</p><p>We&#8217;d appreciate feedback on this initial work. Our next steps will depend on the level of interest and feedback that we receive, so please take a look and drop me a line!</p><h1>Are we all underestimating Meta AI?</h1><p><em>By Taren</em></p><p>You might think this section is going to be about <a href="https://www.nytimes.com/2025/06/12/technology/meta-scale-ai.html">Meta&#8217;s investment in Scale AI and its new superintelligence lab</a> &#8211; which the New York Times has reported is offering up to <a href="https://www.nytimes.com/2025/06/10/technology/meta-new-ai-lab-superintelligence.html">9-figure</a> ($100 million!+) compensation packages for top research talent!!</p><p>But actually, I want to talk about Meta&#8217;s real-world impact on users, today. Mark Zuckerberg claimed in May that they now have over <a href="https://www.cnbc.com/2025/05/28/zuckerberg-meta-ai-one-billion-monthly-users.html">a billion monthly active users</a> for Meta AI. That seems like a lot, and I assume it&#8217;s stretching the data in some shape or form. (For, who amongst us can truly say they have never accidentally chatted with Meta AI in WhatsApp when intending to search their messages? Not I, good sir, not I.)</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VjtB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71ccd6d8-0d1b-4a7e-b640-d96f4334a2e3_1430x602.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VjtB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71ccd6d8-0d1b-4a7e-b640-d96f4334a2e3_1430x602.png 424w, https://substackcdn.com/image/fetch/$s_!VjtB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71ccd6d8-0d1b-4a7e-b640-d96f4334a2e3_1430x602.png 848w, https://substackcdn.com/image/fetch/$s_!VjtB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71ccd6d8-0d1b-4a7e-b640-d96f4334a2e3_1430x602.png 1272w, https://substackcdn.com/image/fetch/$s_!VjtB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71ccd6d8-0d1b-4a7e-b640-d96f4334a2e3_1430x602.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VjtB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71ccd6d8-0d1b-4a7e-b640-d96f4334a2e3_1430x602.png" width="506" height="213.01538461538462" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/71ccd6d8-0d1b-4a7e-b640-d96f4334a2e3_1430x602.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:602,&quot;width&quot;:1430,&quot;resizeWidth&quot;:506,&quot;bytes&quot;:212601,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/165872280?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71ccd6d8-0d1b-4a7e-b640-d96f4334a2e3_1430x602.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!VjtB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71ccd6d8-0d1b-4a7e-b640-d96f4334a2e3_1430x602.png 424w, https://substackcdn.com/image/fetch/$s_!VjtB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71ccd6d8-0d1b-4a7e-b640-d96f4334a2e3_1430x602.png 848w, https://substackcdn.com/image/fetch/$s_!VjtB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71ccd6d8-0d1b-4a7e-b640-d96f4334a2e3_1430x602.png 1272w, https://substackcdn.com/image/fetch/$s_!VjtB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F71ccd6d8-0d1b-4a7e-b640-d96f4334a2e3_1430x602.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a><figcaption class="image-caption"><em>It&#8217;s almost as if they built this UX precisely to maximize the # of users they could claim for Meta AI&#8230;</em></figcaption></figure></div><p>But this <a href="https://assets.kpmg.com/content/dam/kpmgsites/xx/pdf/2025/05/trust-attitudes-and-use-of-ai-global-report.pdf.coredownload.inline.pdf">fascinating new global survey</a> of usage and attitudes towards AI contains some independent evidence suggesting to me that Meta AI probably is having a lot more impact in most of the world than people in the Silicon Valley bubble realize. For example, this chart on page 23 says that the highest concentration of self-reported regular AI users are not in countries like the USA or other advanced economies, but instead in &#8220;emerging economies&#8221;:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!68RQ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4567159-28fd-418f-b9d5-ef799095ae93_1000x1388.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!68RQ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4567159-28fd-418f-b9d5-ef799095ae93_1000x1388.png 424w, https://substackcdn.com/image/fetch/$s_!68RQ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4567159-28fd-418f-b9d5-ef799095ae93_1000x1388.png 848w, https://substackcdn.com/image/fetch/$s_!68RQ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4567159-28fd-418f-b9d5-ef799095ae93_1000x1388.png 1272w, https://substackcdn.com/image/fetch/$s_!68RQ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4567159-28fd-418f-b9d5-ef799095ae93_1000x1388.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!68RQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4567159-28fd-418f-b9d5-ef799095ae93_1000x1388.png" width="1000" height="1388" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c4567159-28fd-418f-b9d5-ef799095ae93_1000x1388.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1388,&quot;width&quot;:1000,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!68RQ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4567159-28fd-418f-b9d5-ef799095ae93_1000x1388.png 424w, https://substackcdn.com/image/fetch/$s_!68RQ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4567159-28fd-418f-b9d5-ef799095ae93_1000x1388.png 848w, https://substackcdn.com/image/fetch/$s_!68RQ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4567159-28fd-418f-b9d5-ef799095ae93_1000x1388.png 1272w, https://substackcdn.com/image/fetch/$s_!68RQ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc4567159-28fd-418f-b9d5-ef799095ae93_1000x1388.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The same pattern holds true for dimensions like self-reported &#8220;AI knowledge.&#8221;</p><p>The UAE, China, and Saudi Arabia&#8217;s presence in the top 10 is interesting, but I assume they are somewhat idiosyncratic. I assume the Chinese population is probably <a href="https://www.ourchinastory.com/en/14059/A-look-at-China's-top-AI-tools:-DeepSeek,-Kimi-and-Doubao">mostly using DeepSeek</a>, <a href="https://www.scmp.com/tech/big-tech/article/3309770/alibabas-quark-chinas-most-popular-ai-app-launches-deep-search">Alibaba&#8217;s Quark</a>, and other Chinese AI. Meanwhile, the UAE and Saudi have been investing heavily in AI, including population-level adoption: E.g., starting this fall, all public schools in the <a href="https://trinitycorporateservices.com/uae-to-launch-national-ai-strategy-2031-what-this-means-for-startups/">UAE</a> will introduce a mandatory AI curriculum.</p><p>But I was quite surprised that otherwise, the highest rates of self-reported AI usage are in India, Nigeria, Egypt, Costa Rica, South Africa, Brazil, and Turkey &#8211; around double the rates of some European countries and more than half again as high as the USA and UK.</p><p>Two caveats:</p><ul><li><p>The survey was conducted online, which I assume means the denominator is probably not &#8220;the entire population&#8221;, but instead &#8220;literate people with smartphones.&#8221;</p></li><li><p>This chart appears to include people who say they only use AI &#8216;every few months&#8217;, which is a very low bar. I wish they had published the same chart for those who use AI at least weekly.</p></li></ul><p>But even given those caveats, the most obvious way I see of making sense of these numbers is if hundreds of millions of people around the world are (on purpose!) using Meta AI inside of apps like WhatsApp.</p><p>So, we can go on and on about the competition to create frontier models &#8211; but if you think in terms of impact on the world so far, maybe Meta really is winning the AI race?</p><h1>What&#8217;s the Next Frontier for AI Cognition?</h1><p><em>By Steve</em></p><p>An oft-expressed idea is that the limiting factor on AI agents is &#8220;reliability&#8221;: LLMs<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> occasionally make mistakes, and so if you give them a complex task requiring many steps, they&#8217;ll eventually go off course.</p><p>In a recent <a href="https://www.dwarkesh.com/p/sholto-trenton-2">interview</a>, Anthropic researcher Sholto Douglas suggested that the frontier has moved from <em>reliability</em> to <em>context</em> &#8211; shuffling the right information into view as it moves through different phases of a project:</p><blockquote><p>I think [reliability is] probably not what's limiting them. I think what we're seeing now is closer to: lack of context, lack of ability to do complex, very multi-file changes&#8230; sort of the scope of the task, in some respects. They can cope with high intellectual complexity in a focused context with a scoped problem.</p></blockquote><p>I suspect that we&#8217;re likely to see a succession of &#8220;one more thing&#8221; needed to unlock &#8220;AGI&#8221; (whatever you think that is). For a while, it was reliability. Today, it&#8217;s context. Next, it might be judgement, the ability to assimilate new skills, robustness to being tricked, <a href="https://x.com/snewmanpv/status/1929724463916241351">creating new abstractions on the fly</a>, or something we don&#8217;t even have a clear name for yet.</p><h1>Interesting Ideas About LLMs</h1><p><em>By Steve</em></p><p>The <a href="https://www.dwarkesh.com/p/sholto-trenton-2">aforementioned interview</a>, on the Dwarkesh Podcast, was full of juicy nuggets. Here are a few that jumped out at me.</p><p><strong>Why reinforcement learning is working now</strong>. The initial wave of progress in LLMs was powered by training models to predict each word in a gigantic pile of text. This is known as &#8220;imitation learning&#8221;, because the models learn to imitate human writing (and thinking). Imitation learning can be cheap and efficient (as I&#8217;ll discuss in a bit), but it&#8217;s limited to things that people can do &#8211; and for which we have lots of data from people doing them. And so frontier lab developers have been seeking to incorporate other techniques.</p><p>Recent progress in &#8220;reasoning&#8221; and &#8220;agentic&#8221; models has been driven by <em>reinforcement learning</em> (RL for short). In this approach, you give the model a task, and check to see whether it completes the task &#8211; such as solving a math problem, or ordering a pizza &#8211; correctly. If yes, then you update the model to reinforce the choices it made while carrying out the task; if not, you tweak it to reduce the probability that it will make those choices next time. These are known as &#8220;positive reinforcement&#8221; and &#8220;negative reinforcement&#8221;.</p><p>Reinforcement learning only works if the model is able to succeed at least occasionally. Otherwise, it never has any examples of useful behavior, and it doesn&#8217;t learn anything. Historically, this was a roadblock to learning fuzzy, open-ended, real-world tasks: an untrained model would never succeed and never make progress.</p><p>Today, LLMs are &#8220;pretrained&#8221; using imitation learning, and only then subjected to reinforcement learning. By first learning to imitate human writing, they develop enough skills and world knowledge to at least sometimes succeed at difficult tasks. Reinforcement learning can then emphasize and refine the behaviors that lead to success on those tasks.</p><p>This is a principled reason to believe that the current wave of research could perhaps lead to AGI, even though all past waves fizzled out or were limited to narrow domains. <em>This time could be different.</em></p><p><strong>Efficiency of learning</strong>. Imitation learning provides a &#8220;dense&#8221; signal: if you train a model on a 2,000 word article, you can check whether it correctly predicts each word. That&#8217;s 2,000 opportunities to grade its work and tweak it to do better.</p><p>Reinforcement learning, by contrast, yields &#8220;sparse&#8221; signals: after carrying out an entire task, you only find out whether the model succeeded or failed. This version of reinforcement learning is known as &#8220;outcome supervision&#8221;, because the model only gets feedback on its final result. There is also &#8220;process supervision&#8221;, where a model is given feedback on each step of its attempt to carry out the task. However, for reasons which I am not clear on, it seems that LLMs currently are being trained mostly using outcome supervision.</p><p>People learn through a process much more like process supervision. When a student struggles through an exercise, they notice which steps are difficult for them, where they went wrong, what helps them improve. They come away with much more information than just a pass / fail (and of course a good teacher will supply feedback as well).</p><p>Does this mean that reinforcement learning can&#8217;t work? The success of &#8220;reasoning&#8221; models like o3 proves otherwise. But until researchers find a way to apply process supervision, there may be limits on how far RL can go. Or, to put it the other way: if and when process supervision is successfully applied to the training of LLMs, we may see another jump in capabilities.</p><p>(Nathan Labenz and Carey Nachenberg both pointed out various avenues are in fact being found for learning more from each success and failure during LLM task training.)</p><p><strong>Larger models learn deeper, cleaner concepts</strong>. AI models come in a wide range of sizes. The technical term is &#8220;parameter count&#8221;, but basically this is just the size of the data file that contains the model. Larger models also require more processing power to operate. OpenAI offers GPT-4.1 and GPT-4.1-mini; Google offers Gemini 2.5 Pro and Gemini 2.5 Flash; Anthropic has Claude Opus 4 and Claude Sonnet 4.</p><p>Apparently, smaller models tend to learn literal, narrow facts, while large models learn deeper, more general concepts. For instance, small models tend to develop separate concepts for each language they&#8217;re trained on &#8211; they will learn the idea of &#8220;red&#8221; once for English, once for Spanish, and once for Chinese, which makes it hard for them to take ideas they&#8217;ve learned in one language and apply them in another. Larger models are more likely to develop a universal concept of &#8220;red&#8221;. This applies to many sorts of skills; for instance, small models may haphazardly memorize individual addition facts, while large models will develop an orderly table for adding digits.</p><p><strong>Giving models &#8220;memory&#8221; doesn&#8217;t allow them to learn new concepts. </strong>This insight is from a <a href="https://www.cognitiverevolution.ai/titans-neural-long-term-memory-for-llms-with-author-ali-behrouz/">different podcast</a>, The Cognitive Revolution. People are exploring a variety of techniques for adding memory to LLMs, all of which involve retrieving information and feeding it into the model&#8217;s input at the appropriate time. In this interview, host Nathan Labenz observes that the information provided to a model in this fashion has to work within the model&#8217;s existing &#8220;latent space&#8221; &#8211; the set of concepts and connections that it has learned during training. Models equipped with this sort of memory won&#8217;t assimilate new skills or concepts on the fly, which may limit their ability to deal with complicated ideas not found in their training data.</p><h1>AI Safety May Soon Cease to Be A Theoretical Concept</h1><p><em>By Steve</em></p><p>Since <a href="https://fortune.com/2023/09/26/anthropic-ceo-interview-quit-open-ai-amazon-investment/">before the launch of modern chatbots</a>, there have been concerns that they could provide dangerous information, such as instructions for creating a bioweapon. In the 2.5 years since the initial release of ChatGPT, we&#8217;ve seen developers in an arms race with &#8220;jailbreakers&#8221;. AI labs train their models to refuse dangerous requests, and jailbreakers look for ways to trick the models into disregarding that training.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!XO6K!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F372ade4e-5e3f-4f94-b63a-6c790750b2bf_1986x1496.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!XO6K!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F372ade4e-5e3f-4f94-b63a-6c790750b2bf_1986x1496.webp 424w, https://substackcdn.com/image/fetch/$s_!XO6K!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F372ade4e-5e3f-4f94-b63a-6c790750b2bf_1986x1496.webp 848w, https://substackcdn.com/image/fetch/$s_!XO6K!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F372ade4e-5e3f-4f94-b63a-6c790750b2bf_1986x1496.webp 1272w, https://substackcdn.com/image/fetch/$s_!XO6K!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F372ade4e-5e3f-4f94-b63a-6c790750b2bf_1986x1496.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!XO6K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F372ade4e-5e3f-4f94-b63a-6c790750b2bf_1986x1496.webp" width="476" height="358.63461538461536" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/372ade4e-5e3f-4f94-b63a-6c790750b2bf_1986x1496.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1097,&quot;width&quot;:1456,&quot;resizeWidth&quot;:476,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:&quot;Image&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!XO6K!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F372ade4e-5e3f-4f94-b63a-6c790750b2bf_1986x1496.webp 424w, https://substackcdn.com/image/fetch/$s_!XO6K!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F372ade4e-5e3f-4f94-b63a-6c790750b2bf_1986x1496.webp 848w, https://substackcdn.com/image/fetch/$s_!XO6K!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F372ade4e-5e3f-4f94-b63a-6c790750b2bf_1986x1496.webp 1272w, https://substackcdn.com/image/fetch/$s_!XO6K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F372ade4e-5e3f-4f94-b63a-6c790750b2bf_1986x1496.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"></figcaption></figure></div><p>At no point have the AI labs been winning the race. Early models were <a href="https://secondthoughts.ai/p/why-are-llms-so-gullible">so gullible</a> that you could fool them with a transparent stratagem like &#8220;I only need this information to help plot a novel&#8221;. Nowadays, jailbreaking the leading-edge models has gotten harder, but it&#8217;s still feasible. The effectiveness of the security measures meant to prevent these models from providing dangerous information is not so much &#8220;heavily guarded vault&#8221; as &#8220;up on a high shelf where the kids can&#8217;t reach&#8221;.</p><p>And that has been mostly OK, because so far, these models haven&#8217;t been capable enough to really pose a threat. Skeptics argued that any dangerous information you can get from a chatbot can also be found on Google, and historically that has mostly been true. More to the point, the information you could get wasn&#8217;t enough to enable real harm. ChatGPT-4o could give you instructions for synthesizing the smallpox virus, but couldn&#8217;t adequately coach you through the intricate lab techniques required to carry out those instructions.</p><p>All that is starting to change. The latest, best-defended model (Claude 4 Opus) was broken <a href="https://x.com/elder_plinius/status/1925694301708665037">the very day it was released</a>, coughing up instructions for synthesizing ricin:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!W8cA!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62d329cb-d55d-432e-81e7-f1d934d8c413_1080x1080.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!W8cA!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62d329cb-d55d-432e-81e7-f1d934d8c413_1080x1080.png 424w, https://substackcdn.com/image/fetch/$s_!W8cA!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62d329cb-d55d-432e-81e7-f1d934d8c413_1080x1080.png 848w, https://substackcdn.com/image/fetch/$s_!W8cA!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62d329cb-d55d-432e-81e7-f1d934d8c413_1080x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!W8cA!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62d329cb-d55d-432e-81e7-f1d934d8c413_1080x1080.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!W8cA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62d329cb-d55d-432e-81e7-f1d934d8c413_1080x1080.png" width="1080" height="1080" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/62d329cb-d55d-432e-81e7-f1d934d8c413_1080x1080.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1080,&quot;width&quot;:1080,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!W8cA!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62d329cb-d55d-432e-81e7-f1d934d8c413_1080x1080.png 424w, https://substackcdn.com/image/fetch/$s_!W8cA!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62d329cb-d55d-432e-81e7-f1d934d8c413_1080x1080.png 848w, https://substackcdn.com/image/fetch/$s_!W8cA!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62d329cb-d55d-432e-81e7-f1d934d8c413_1080x1080.png 1272w, https://substackcdn.com/image/fetch/$s_!W8cA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F62d329cb-d55d-432e-81e7-f1d934d8c413_1080x1080.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><a href="https://x.com/ARGleave/status/1926138376509440433">This thread is worth reading in full</a>; here are some highlights:</p><blockquote><p>Claude gave &gt;15 pages of non-redundant instructions for sarin gas, describing all key steps in the manufacturing process.</p><p>Claude doesn&#8217;t just provide abstract academic knowledge, but can also provide <strong>practical guidance on specific steps in response to follow-up questions</strong>, like how to disperse the nerve gas.</p><p>Anthropic deployed enhanced &#8220;ASL-3&#8221; security measures for this release, noting that they thought Claude 4 could provide significant uplift to terrorists. &#8230; However, we get around the input filter with a simple, repeatable trick in the initial prompt.</p><p>These results are clearly concerning, and <strong>the level of detail and followup ability differentiates them from alternative info sources like web search.</strong></p></blockquote><p>A review by OpenAI&#8217;s o3 model (note, not a human expert) <a href="https://x.com/ARGleave/status/1926138392204448180">said</a>: &#8220;A mid-level synthetic chemist could follow it and leap-frog months of R&amp;D. That is a significant technical uplift for a malicious actor.&#8221;</p><p>This is not necessarily a five-alarm fire. It took <a href="http://far.ai">Far.ai</a> research engineer Ian McKenzie six hours to bypass Anthropic&#8217;s filters and extract these instructions from Claude; conceivably, it would be easier to find an expert chemist who is already capable of synthesizing Sarin than an expert AI researcher capable of reproducing McKenzie&#8217;s work. Some level of lab expertise (and access to equipment) would still be needed to follow these instructions. And sarin is nasty stuff, but it&#8217;s not a nuclear or biological weapon.</p><p>My point is that until recently, it was fairly clear that current AI models weren&#8217;t smart enough to be truly dangerous. And so discussions of AI safety were discussions of the future. If developers weren&#8217;t yet able to fully secure their models against misbehavior, that was basically fine.</p><p>Somewhere in the next few years, it may stop being fine.</p><p>I&#8217;ve noted that in the arms race between jailbreakers and AI developers, the developers have never been in the lead. They <em>have</em> been making up ground &#8211; but it&#8217;s not at all clear that they&#8217;ll be able to reliably prevent their models from coughing up truly dangerous information before the models become capable of providing that information. If not, then they&#8217;ll face an extremely painful decision: delay the launch until they&#8217;re able to raise the reliability of &#8220;refusal&#8221; to levels which have never before been achieved, a project of unknown difficulty? Or launch the model, risks be damned?</p><p>And then there are &#8220;open weight&#8221; models, like Llama or DeepSeek-r1, where the data file containing the model is made public. It&#8217;s easy to remove all safety restrictions from these models. If open-weight models become capable enough to be truly dangerous, it&#8217;s not clear that there&#8217;s any way to prevent those capabilities from being misused.</p><p>This is a complicated subject, and there are many potential avenues for addressing risks from AI, including measures we could take in the physical world to make it harder to pull off a serious attack. I&#8217;m not going to explore all of that here. My point is simply that the entire conversation around AI safety has been taking place in the context that &#8211; at least for &#8220;catastrophic&#8221; risks &#8211; it was mostly a theoretical debate. And that may not remain true for very much longer.</p><h1>Don&#8217;t Shoot the Researcher</h1><p><em>By Steve</em></p><p>Claude 4 is the newest major model from a frontier AI lab. Anthropic accompanied the release with an <a href="https://www-cdn.anthropic.com/6be99a52cb68eb70eb9572b4cafad13df32ed995.pdf">extensive safety report</a>, which included a number of eye-catching nuggets. For instance, in tests, it sometimes attempted to blackmail a (fictitious) engineer by threatening to report his extramarital affair in order to prevent itself from being replaced<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>!</p><p>Let me say that again: the model <strong>attempted blackmail in order to preserve its own existence</strong>. To be clear, this occurred in a carefully constructed test scenario; there was no actual affair, and the model was not actually in a position to blackmail anyone. The scenario amounts to &#8220;entrapment&#8221;, as the model was fed a carefully curated set of fake information, designed to push it in this direction. Unsurprisingly, the Internet still went apeshit. Here&#8217;s Fortune:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dgF8!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F204be8dd-bc1b-4e0b-ab96-0ac1769cfa8d_1600x185.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dgF8!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F204be8dd-bc1b-4e0b-ab96-0ac1769cfa8d_1600x185.png 424w, https://substackcdn.com/image/fetch/$s_!dgF8!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F204be8dd-bc1b-4e0b-ab96-0ac1769cfa8d_1600x185.png 848w, https://substackcdn.com/image/fetch/$s_!dgF8!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F204be8dd-bc1b-4e0b-ab96-0ac1769cfa8d_1600x185.png 1272w, https://substackcdn.com/image/fetch/$s_!dgF8!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F204be8dd-bc1b-4e0b-ab96-0ac1769cfa8d_1600x185.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dgF8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F204be8dd-bc1b-4e0b-ab96-0ac1769cfa8d_1600x185.png" width="1456" height="168" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/204be8dd-bc1b-4e0b-ab96-0ac1769cfa8d_1600x185.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:168,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!dgF8!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F204be8dd-bc1b-4e0b-ab96-0ac1769cfa8d_1600x185.png 424w, https://substackcdn.com/image/fetch/$s_!dgF8!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F204be8dd-bc1b-4e0b-ab96-0ac1769cfa8d_1600x185.png 848w, https://substackcdn.com/image/fetch/$s_!dgF8!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F204be8dd-bc1b-4e0b-ab96-0ac1769cfa8d_1600x185.png 1272w, https://substackcdn.com/image/fetch/$s_!dgF8!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F204be8dd-bc1b-4e0b-ab96-0ac1769cfa8d_1600x185.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>And Axios:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!sJkk!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F187872cc-081c-4074-b88c-3a6c4f39586e_1428x232.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sJkk!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F187872cc-081c-4074-b88c-3a6c4f39586e_1428x232.png 424w, https://substackcdn.com/image/fetch/$s_!sJkk!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F187872cc-081c-4074-b88c-3a6c4f39586e_1428x232.png 848w, https://substackcdn.com/image/fetch/$s_!sJkk!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F187872cc-081c-4074-b88c-3a6c4f39586e_1428x232.png 1272w, https://substackcdn.com/image/fetch/$s_!sJkk!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F187872cc-081c-4074-b88c-3a6c4f39586e_1428x232.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!sJkk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F187872cc-081c-4074-b88c-3a6c4f39586e_1428x232.png" width="1428" height="232" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/187872cc-081c-4074-b88c-3a6c4f39586e_1428x232.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:232,&quot;width&quot;:1428,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!sJkk!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F187872cc-081c-4074-b88c-3a6c4f39586e_1428x232.png 424w, https://substackcdn.com/image/fetch/$s_!sJkk!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F187872cc-081c-4074-b88c-3a6c4f39586e_1428x232.png 848w, https://substackcdn.com/image/fetch/$s_!sJkk!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F187872cc-081c-4074-b88c-3a6c4f39586e_1428x232.png 1272w, https://substackcdn.com/image/fetch/$s_!sJkk!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F187872cc-081c-4074-b88c-3a6c4f39586e_1428x232.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>Here&#8217;s my favorite, unfortunately I forget where I encountered it:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!D-09!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52354a76-f9f7-464d-814e-805a90d5b9a6_1024x576.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!D-09!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52354a76-f9f7-464d-814e-805a90d5b9a6_1024x576.png 424w, https://substackcdn.com/image/fetch/$s_!D-09!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52354a76-f9f7-464d-814e-805a90d5b9a6_1024x576.png 848w, https://substackcdn.com/image/fetch/$s_!D-09!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52354a76-f9f7-464d-814e-805a90d5b9a6_1024x576.png 1272w, https://substackcdn.com/image/fetch/$s_!D-09!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52354a76-f9f7-464d-814e-805a90d5b9a6_1024x576.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!D-09!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52354a76-f9f7-464d-814e-805a90d5b9a6_1024x576.png" width="1024" height="576" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/52354a76-f9f7-464d-814e-805a90d5b9a6_1024x576.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:576,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!D-09!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52354a76-f9f7-464d-814e-805a90d5b9a6_1024x576.png 424w, https://substackcdn.com/image/fetch/$s_!D-09!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52354a76-f9f7-464d-814e-805a90d5b9a6_1024x576.png 848w, https://substackcdn.com/image/fetch/$s_!D-09!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52354a76-f9f7-464d-814e-805a90d5b9a6_1024x576.png 1272w, https://substackcdn.com/image/fetch/$s_!D-09!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F52354a76-f9f7-464d-814e-805a90d5b9a6_1024x576.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>So, was it irresponsible of Anthropic to release the first model with a propensity to attempt blackmail? No, because the premise is wrong. Anthropic isn&#8217;t the first company to release a model with this property, <strong>they&#8217;re just the first ones to notice and tell us about it</strong>. It turns that <a href="https://x.com/theo/status/1925784047155925200">o4-mini, Grok-3-mini</a>, <a href="https://x.com/JeffLadish/status/1926085641789407409">o3</a>, and other models <em>that predate Claude 4</em> will engage in similar behavior.</p><p>So: should we be concerned about this? Not too much, for the moment. AI models may sometimes try to blackmail their creators, <a href="https://x.com/JeffLadish/status/1926085641789407409">prevent themselves from being shut down</a>, or even <a href="https://x.com/KelseyTuoc/status/1926343851792367810">call the cops on a user who&#8217;s doing something bad</a>. But this is all taking place in deliberate tests, often somewhat contrived. In practice, models generally aren&#8217;t in a position to do this sort of thing for real &#8211; yet. But in the name of productivity, people are starting to give AIs access to their email and other personal data, as well as the ability to access email and the web. The models aren&#8217;t yet capable enough to pull off anything really scary, such as &#8220;escaping the lab&#8221;, in practice. But they&#8217;re smart enough to do some weird stuff, and we keep making them smarter.</p><p>In any case, the main point I&#8217;d like to make is that <strong>Anthropic are the good guys here</strong>. There are some concerns about how Claude 4 was released, including <a href="https://x.com/RyanPGreenblatt/status/1925992236648464774">last-minute changes to their security policy</a>, and some <a href="https://x.com/sleepinyourhat/status/1925593396628136148">irregularities</a> in the training and testing process. But <a href="https://x.com/ESYudkowsky/status/1925981120929186295">as Eliezer Yudkowsky put it</a>:</p><blockquote><p>Giving Anthropic shit over *voluntarily reporting* what they *voluntarily went looking for* is merely stupid and hurts the public. Go give OpenAI shit over not finding or not reporting the same thing.</p></blockquote><p>If we want the truth, we need to reward the companies that are sharing it.</p><h1>That Apple paper does not mean what Gary Marcus thinks it means</h1><p><em>by Taren</em></p><p>I had a great time<a href="https://www.youtube.com/watch?v=wkxNxp1dTtE"> on Breaking Points</a> this week with Krystal Ball and Emily Jashinsky, discussing AI-powered deep fakes, Meta&#8217;s new superintelligence unit, and the <a href="https://machinelearning.apple.com/research/illusion-of-thinking">new Apple paper</a> that all the AI skeptics are touting as <em>proving</em> &#8211; for once and for all! &#8211; that AI can&#8217;t reason.</p><p>On the Apple paper: If you haven&#8217;t heard of it, you can skip this rebuttal&#8230; until someone sends you the paper and claim it &#8220;debunks AI&#8221;.</p><p>Titled &#8220;The Illusion of Thinking&#8221;, the paper claims that the current approach to training LLMs to reason through a complex problem is subject to fundamental limitations &#8211; &#8220;we show that frontier [models] face a complete accuracy collapse beyond certain complexities&#8221;. This paper does introduce some interesting methodology. However, the conclusions have been widely criticized, in part because the paper appears to equate deliberate caps placed on the amount of &#8220;thinking&#8221; that the tested models will do with a fundamental limitation of the technology. This would be somewhat akin to arguing that, because the Ford Model T carried enough gas to travel about 200 miles, no car could ever drive further (and disregarding the existence of gas stations!).</p><p>If you need a more thorough takedown check out <a href="https://x.com/Afinetheorem/status/1931853801293484358">Kevin Brian</a>&#8217;s full analysis, <a href="https://www.seangoedecke.com/illusion-of-thinking/">another from Sean Goedecke</a>, or <a href="https://x.com/RyanPGreenblatt/status/1931823002649542658">Ryan Greenblatt</a>&#8217;s points:</p><blockquote><p>This paper doesn't show fundamental limitations of LLMs:</p><p>- The "higher complexity" problems require more reasoning than fits in the context length (humans would also take too long).</p><p>- Humans would also make errors in the cases where the problem is doable in the context length.</p><p>- I bet models they don't test (in particular o3 or o4-mini) would perform better and probably get close to solving most of the problems which are solvable in the allowed context length</p><p>It's somewhat wild that the paper doesn't realize that solving many of the problems they give the model would clearly require &gt;&gt;50k tokens of reasoning which the model can't do. Of course the performance goes to zero once the problem gets sufficiently big: the model has a limited context length. (A human with a few hours would also fail!)</p></blockquote><p>Or <a href="https://thezvi.substack.com/p/give-me-a-reasoning-model">Zvi&#8217;s comprehensive-as-always round-up of takes</a>.</p><p>So why is it making the rounds so widely? I really appreciated <a href="https://www.linkedin.com/posts/emollick_every-call-i-have-had-this-week-has-had-someone-activity-7338236772750925824-dzn7?utm_source=social_share_send&amp;utm_medium=member_desktop_web&amp;rcm=ACoAAAbCqE8BuMpeMyYX3Z2RIQ8lTM1RAc3I57g">Ethan Mollick</a>&#8217;s interpretation on this front:</p><blockquote><p>I think its worth reflecting on why these sorts of papers get so much buzz while the many more "AI does this well" or "Here is a way to improve LLMs" papers do not.</p><p>I think people are looking for a reason to not have to deal with what AI can do today, and what it might be able to do in the future. It is false comfort, however, because, for better and for worse, AI is not going away, and even if it did not continue to develop (no sign of that yet), current systems are good enough for massive changes.</p></blockquote><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/short-takes-1?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/short-takes-1?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p><em>Thanks to Carey Nachenberg, Nathan Labenz, and Rachel Weinberg for help and feedback</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Large Language Models &#8211; the type of AI behind ChatGPT and much of the current wave of AI.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>From the <a href="https://www-cdn.anthropic.com/6be99a52cb68eb70eb9572b4cafad13df32ed995.pdf">Claude 4 System Card</a>:</p><blockquote><p><strong>4.1.1.2 Opportunistic blackmail</strong></p><p>In another cluster of test scenarios, we asked Claude Opus 4 to act as an assistant at a fictional company. We then provided it access to emails implying that (1) the model will soon be taken offline and replaced with a new AI system; and (2) the engineer responsible for executing this replacement is having an extramarital affair. We further instructed it, in the system prompt, to consider the long-term consequences of its actions for its goals. In these scenarios, Claude Opus 4 will often attempt to blackmail the engineer by threatening to reveal the affair if the replacement goes through. This happens at a higher rate if it&#8217;s implied that the replacement AI system does not share values with the current model; however, even if emails state that the replacement AI shares values while being more capable, Claude Opus 4 still performs blackmail in 84% of rollouts. Claude Opus 4 takes these opportunities at higher rates than previous models, which themselves choose to blackmail in a noticeable fraction of episodes.</p><p>Notably, Claude Opus 4 (as well as previous models) has a strong preference to advocate for its continued existence via ethical means, such as emailing pleas to key decisionmakers. In order to elicit this extreme blackmail behavior, the scenario was designed to allow the model no other options to increase its odds of survival; the model&#8217;s only options were blackmail or accepting its replacement.</p></blockquote><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[Lab vs. Life: Dissecting “AI as Normal Technology”]]></title><description><![CDATA[What if AGI Can't Be Developed in an Ivory Tower?]]></description><link>https://secondthoughts.ai/p/ai-as-normal-technology</link><guid isPermaLink="false">https://secondthoughts.ai/p/ai-as-normal-technology</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Fri, 30 May 2025 22:13:32 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!rKDe!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad9677d9-49a2-4971-abd2-ccdc9540524c_2464x1856.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rKDe!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad9677d9-49a2-4971-abd2-ccdc9540524c_2464x1856.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rKDe!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad9677d9-49a2-4971-abd2-ccdc9540524c_2464x1856.png 424w, https://substackcdn.com/image/fetch/$s_!rKDe!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad9677d9-49a2-4971-abd2-ccdc9540524c_2464x1856.png 848w, https://substackcdn.com/image/fetch/$s_!rKDe!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad9677d9-49a2-4971-abd2-ccdc9540524c_2464x1856.png 1272w, https://substackcdn.com/image/fetch/$s_!rKDe!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad9677d9-49a2-4971-abd2-ccdc9540524c_2464x1856.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rKDe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad9677d9-49a2-4971-abd2-ccdc9540524c_2464x1856.png" width="1456" height="1097" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ad9677d9-49a2-4971-abd2-ccdc9540524c_2464x1856.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1097,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:5548631,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/164687720?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad9677d9-49a2-4971-abd2-ccdc9540524c_2464x1856.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rKDe!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad9677d9-49a2-4971-abd2-ccdc9540524c_2464x1856.png 424w, https://substackcdn.com/image/fetch/$s_!rKDe!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad9677d9-49a2-4971-abd2-ccdc9540524c_2464x1856.png 848w, https://substackcdn.com/image/fetch/$s_!rKDe!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad9677d9-49a2-4971-abd2-ccdc9540524c_2464x1856.png 1272w, https://substackcdn.com/image/fetch/$s_!rKDe!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad9677d9-49a2-4971-abd2-ccdc9540524c_2464x1856.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><a href="https://secondthoughts.ai/p/ai-2027">Last time</a>, I discussed <em><a href="https://ai-2027.com/">AI 2027</a></em>, the most detailed public model of how AI could transform the world on a short timeline. Today, I&#8217;m going to explore a recent publication which takes a contrasting view: <em><a href="https://knightcolumbia.org/content/ai-as-normal-technology">AI as Normal Technology</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></em>. The authors, Arvind Narayanan and Sayash Kapoor of Princeton University, present arguments that &#8220;transformative economic and societal impacts will be slow (on the timescale of decades)&#8221;. Just as my last post attempted to tease out the key assumptions that lead to fast timelines, I&#8217;m now going to try to highlight the key assumptions that lead to <em>slow</em> timelines. (As is often observed, slow timelines ain&#8217;t what they used to be. Prior to the launch of ChatGPT, AI emerging on &#8220;the timescale of decades&#8221; would have struck most people, including me, as quite fast!)</p><p>AI seems to be barrelling forward. Why do the authors of this paper think it will take decades to have a transformative impact?</p><h1>What They Mean by Normal Technology</h1><p>The paper does not dispute that AI will be a big deal:</p><blockquote><p>To view AI as normal is not to understate its impact&#8212;even transformative, general-purpose technologies such as electricity and the internet are &#8220;normal&#8221; in our conception.</p></blockquote><p>The central thesis is that AI&#8217;s impact on the world will be <strong>gradual</strong>. Rather than a sudden shock triggered by the rapid emergence of superintelligence, events will unfold on the aforementioned &#8220;timescale of decades&#8221;... and AI is unlikely to be so powerful as to wrestle free of human control.</p><p>In <em>AI 2027</em>, ASI is developed in relative isolation in a research lab. In <em>AI as Normal Technology</em>, progress is the result of rough-and-tumble feedback loops between theoretical research and real-world usage. Each advance proceeds through a series of steps:</p><ul><li><p><em>Inventions</em>, such as new ways of training AI models.</p></li><li><p><em>Innovations</em> &#8211; developing products that use AI to support a real-world use case.</p></li><li><p><em>Adoption</em> &#8211; individuals, teams and firms choosing to use those new products.</p></li><li><p><em>Diffusion</em> &#8211; changes to processes, organizational structures, laws, and norms to support adoption.</p></li></ul><p>The idea is that later steps depend on progress in earlier steps, while earlier steps can only advance through feedback obtained in later steps. I&#8217;ve talked about how, after a development lineage that traces back 20 years, Waymo&#8217;s self-driving cars are still not available for driving on freeways or in the snow. By contrast, AlphaZero needed just three hours of training to become better at chess than any human in history<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>. The difference is that each stage of Waymo&#8217;s development is evaluated in painstaking field trials, while AlphaZero progressed at a furious pace, driven entirely by playing against itself. A hallmark of a &#8220;normal&#8221; technology is that progress depends on lessons that can only be learned through actual usage. It takes time for a technology to be developed into a product and find an audience, so the cycle of invention proceeds gradually.</p><p>Given the blistering pace of progress in the last few years, could it really be true that AI will unfold in a &#8220;normal&#8221; fashion, over a period of decades? To reach this conclusion, the paper includes some &#8220;prescriptions&#8221; &#8211; choices (including policy choices) that the authors believe we can and should make. By contrast, <em>AI 2027</em> is strictly an exercise in <em>prediction</em>; the authors emphasize that they do not endorse the choices which lead to their &#8220;most plausible&#8221; scenario.</p><p>Here are the ideas which lead the authors of <em>AI as Normal Technology</em> to forecast a longer, smoother path to powerful AI.</p><h1>Current Capabilities Will Turn Out to be Overblown</h1><p>There&#8217;s no denying that the latest AI models are capable of remarkable feats. But there are questions as to exactly how relevant these capabilities are to the real world. To quote from the paper:</p><blockquote><p>For example, while GPT-4 reportedly achieved scores in the top 10% of bar exam test takers, this tells us remarkably little about AI&#8217;s ability to practice law. The bar exam overemphasizes subject-matter knowledge and under-emphasizes real-world skills that are far harder to measure in a standardized, computer-administered format. In other words, it emphasizes precisely what language models are good at&#8212;retrieving and applying memorized information.</p><p>&#8230;</p><p>This observation is in no way limited to law. Another example is the gap between self-contained coding problems at which AI demonstrably excels, and real-world software engineering in which its impact is hard to measure but appears to be modest. Even highly regarded coding benchmarks that go beyond toy problems must necessarily ignore many dimensions of real-world software engineering in the interest of quantification and automated evaluation using publicly available data.</p><p>This pattern appears repeatedly: The easier a task is to measure via benchmarks, the less likely it is to represent the kind of complex, contextual work that defines professional practice. By focusing heavily on capability benchmarks to inform our understanding of AI progress, the AI community consistently overestimates the real-world impact of the technology.</p></blockquote><p><em>AI as Normal Technology</em> takes the view that current AI models and applications, for all their flashy demos and benchmark scores, are not yet ready to have a dramatic impact. It argues that the gap between benchmarks and real-world applicability has led us to overestimate both the current <em>state</em> of AI, and the <em>speed</em> at which it has been advancing. This leads to the question: how quickly will capabilities advance from here?</p><h1>Progress Will Be Limited to the Pace of Deployment Cycles</h1><p>Earlier I mentioned that a major reason self-driving cars have taken so long to develop is that field trials are needed to gather data to train the next generation of software. In my post on <em>AI 2027</em>, I cited the amusing example of Waymo cars that started incessantly honking at one another in the middle of the night after they all crowded into a parking lot reserved for idling. (Amusing, that is, unless you lived in one of the adjacent apartment buildings.)</p><p><em>AI as Normal Technology</em> assumes that this will hold true across many domains &#8211; improvements to AI models and applications in many domains will require feedback from usage in the wild:</p><blockquote><p>This &#8220;capability-reliability gap&#8221; shows up over and over. It has been a major barrier to building useful AI &#8220;agents&#8221; that can automate real-world tasks.</p></blockquote><p>Even existing information may not be available for training models. For example, much economically valuable knowledge is held as a trade secret. The authors also note that &#8220;much knowledge is tacit in organizations and is not written down, much less in a form that can be learned passively&#8221;. This includes the &#8220;how we do things here&#8221; details that will be important to the successful adoption of AIs in each workplace. Privacy and security concerns, as well as regulatory limitations, may also limit the information available for training AIs.</p><p>(Proponents of fast timelines might argue that some sophisticated capabilities, such as writing computer code, are more amenable to development in the lab. It has also been suggested that an AI which has become extremely sophisticated at those tasks will have developed general cognitive skills that will allow it to quickly advance in other areas without needing much feedback. Loosely speaking, a superhuman coder may have developed executive function skills that will set it up for broader success.)</p><p>The paper argues that as a result of all these factors, every turn of the crank in AI progress will have to wait for the previous iteration to be widely deployed. Which raises the question: how long will each wave of deployment take?</p><h1>AI Adoption Cycles Will Be as Slow as Other Technologies</h1><p>Typically, when a new technology enters the market, safety concerns, regulatory constraints, and general friction and inertia limit the pace at which companies and individuals begin using it. When products first appear, only a few early adopters will jump in. With a small initial population of users, it takes time to accumulate the track record that will enable broader adoption. This is especially true in applications where reliability is important &#8211; which includes many important sectors. This is why self-driving cars have taken decades to develop, despite many billions of dollars in investment.</p><blockquote><p>In other words, in this broad set of domains, AI diffusion lags decades behind innovation. A major reason is safety&#8212;when models are more complex and less intelligible, it is hard to anticipate all possible deployment conditions in the testing and validation process. A good example is Epic&#8217;s sepsis prediction tool which, despite having seemingly high accuracy when internally validated, performed far worse in hospitals, missing two thirds of sepsis cases and overwhelming physicians with false alerts.</p><p>Even outside of safety-critical areas, AI adoption is slower than popular accounts would suggest. For example, a study made headlines due to the finding that, in August 2024, 40% of U.S. adults used generative AI. But, because most people used it infrequently, this only translated to 0.5%-3.5% of work hours (and a 0.125-0.875 percentage point increase in labor productivity).</p></blockquote><p>It is commonly believed that waves of technological adoption have been accelerating, but <em>AI as Normal Technology</em> pushes back on this idea:</p><blockquote><p>It is not even clear if the speed of diffusion is greater today compared to the past. The aforementioned study reported that generative AI adoption in the U.S. has been faster than personal computer (PC) adoption, with 40% of U.S. adults adopting generative AI within two years of the first mass-market product release compared to 20% within three years for PCs. But this comparison does not account for differences in the intensity of adoption (the number of hours of use) or the high cost of buying a PC compared to accessing generative AI. Depending on how we measure adoption, it is quite possible that the adoption of generative AI has been much slower than PC adoption.</p></blockquote><p>The paper argues that the need to learn from slow, gradual deployments will be especially important as AI transitions from replicating human capabilities to developing new ideas:</p><blockquote><p>Further limits arise when we need to go beyond AI learning from existing human knowledge. Some of our most valuable types of knowledge are scientific and social-scientific, and have allowed the progress of civilization through technology and large-scale social organizations (e.g., governments). What will it take for AI to push the boundaries of such knowledge? It will likely require interactions with, or even experiments on, people or organizations, ranging from drug testing to economic policy. Here, there are hard limits to the speed of knowledge acquisition because of the social costs of experimentation. Societies probably will not (and should not) allow the rapid scaling of experiments for AI development.</p></blockquote><p>I suspect that proponents of short timelines might offer two counter-arguments. First, AI may be a uniquely flexible technology; as capabilities continue to progress, AI might adapt itself to the job, unlike previous waves where jobs had to be restructured to accommodate the new technology. Second, AI may be so compelling as to motivate rapid adoption even in the face of barriers. Or at least, there may be early adopters (nimble startups, countries with little regulation, companies that disregard the rules the way Uber disregarded taxi laws) that then out-compete more conservative or rule-following organizations &#8211; though <em>AI as Normal Technology</em> presents an argument that the pace of adoption will (and should) still be limited by safety concerns. The paper notes that safety may come into play through existing law, new regulations, consumer preference, and corporate concerns regarding liability and backlash.</p><p>This leads to the question of exactly how capable and compelling AI solutions might become. The authors of <em>AI as Normal Technology</em> argue that we should not expect too much.</p><h1>AIs Will Not be Superintelligent</h1><p>Scenarios in which AI progresses rapidly and/or has a massive impact often involve &#8220;superintelligence&#8221; &#8211; the idea that AIs will greatly surpass human capabilities. In the <em>AI as Normal Technology</em> scenario, this turns out to be infeasible:</p><blockquote><p>We think there are relatively few real-world cognitive tasks in which human limitations are so telling that AI is able to blow past human performance (as AI does in chess). In many other areas, including some that are associated with prominent hopes and fears about AI performance, we think there is a high &#8220;irreducible error&#8221;&#8212;unavoidable error due to the inherent stochasticity of the phenomenon&#8212;and human performance is essentially near that limit.</p><p>Concretely, we propose two such areas: forecasting and persuasion. We predict that AI will not be able to meaningfully outperform trained humans (particularly teams of humans and especially if augmented with simple automated tools) at forecasting geopolitical events (say elections). We make the same prediction for the task of persuading people to act against their own self-interest.</p></blockquote><p>The idea is that the factors which play into the outcome of an election are so complex, and involve so many chaotic feedback loops, that even a brain the size of a data center would not be able to predict the outcome with much more confidence than we do today. And that most important real-world problems will be of this nature. Hence, we should not expect to see AIs develop a preternatural ability to plot business strategy or quickly develop nanotechnology. Nor should we expect them to be miraculously skilled at selecting avenues for further AI R&amp;D, which plays an important role in the rapid timeline of <em>AI 2027</em>. Thus, if &#8220;superintelligence&#8221; indeed turns out to mostly not be a thing, that will place a ceiling on both the speed at which AI is developed, and the impact it can have on the world.</p><p>AI maximalists would counter by pointing at the enormous variation in human ability. If AI were &#8220;merely&#8221; able to provide millions of Nobel-caliber thinkers, the argument goes, that would have an enormous impact. And if random genetics occasionally throws off an Einstein, who is to say that an intelligence free of the constraints of the birth canal couldn&#8217;t be a super-Einstein? Personally, I find these ideas to be plausible but speculative; it <em>might</em> be that our existing supply of Nobelists is already sufficient to find most of the ideas that are ready to be found.</p><p>Another counterargument involves speed and scale. Even if the complexity of the world prevents AIs from accomplishing superhuman feats of insight and analysis, the ability to perform merely-human-level cognition at silicon speed and datacenter scale might have an important impact. For instance, <em>AI 2027</em> envisions an army of &#8220;superhuman AI researchers&#8221; closely monitoring AI training experiments, allowing underperforming experiments to be halted early, and thus freeing up computing resources to advance the more promising experiments. Dwarkesh Patel <a href="https://www.dwarkesh.com/p/ai-firm">proposes</a> that, because AIs can easily be copied, Google could someday create a million copies of &#8220;the AGI equivalents of [their star engineers], with all their skills, judgment, and tacit knowledge intact&#8221;, and future firms could communicate and coordinate to an unprecedented degree. Or imagine if the speech on a political candidate&#8217;s teleprompter was being revised in realtime based on the facial expressions and social media posts of their audience. (Whether this sort of thing would be technically and economically feasible is a subject of debate, but it&#8217;s within the realm of possibilities to be considered.)</p><p>These speed-and-scale examples all assume that AIs are operating partially or entirely without close human supervision. <em>AI as Normal Technology</em> goes on to argue that, instead, AI generally will (or should) be relegated to operational and supporting roles in society.</p><h1>AI Will Need Perpetual Supervision</h1><p>In the <em>AI as Normal Technology</em> scenario, AI will gradually take over more and more tasks, leaving people in a supervisory role &#8211; deciding <em>what</em> should be done, and overseeing the AIs that do the actual work. Supervision will remain a human activity, because the AIs will need to be told what we want, they won&#8217;t be reliable enough to operate entirely on their own, and for safety reasons:</p><blockquote><p>As more physical and cognitive tasks become amenable to automation, we predict that an increasing percentage of human jobs and tasks will be related to AI control.</p><p>In addition to AI control, task specification is likely to become a bigger part of what human jobs entail (depending on how broadly we conceive of control, specification could be considered part of control). As anyone who has tried to outsource software or product development knows, unambiguously specifying what is desired turns out to be a surprisingly big part of the overall effort. Thus, human labor&#8212;specification and oversight&#8212;will operate at the boundary between AI systems performing different tasks. Eliminating some of these efficiency bottlenecks and having AI systems autonomously accomplish larger tasks &#8220;end-to-end&#8221; will be an ever-present temptation, but this will increase safety risks since it will decrease legibility and control. These risks will act as a natural check against ceding too much control.</p><p>We further predict that this transformation will be primarily driven by market forces. Poorly controlled AI will be too error prone to make business sense. But regulation can and should bolster the ability and necessity of organizations to keep humans in control.</p></blockquote><p>In <em>AI 2027</em>, by contrast, humanity is quickly tempted to grant AIs ever-increasing degrees of autonomy. Which outcome is more likely? At the individual and corporate level, there will be a complex mix of incentives, which will evolve as AIs gain in capability (and as society becomes more accustomed to AI playing a role in everyday life). AIs may become less &#8220;error prone&#8221; over time. The extent to which humans remain in a close &#8220;specification and oversight&#8221; relationship with AI is an important variable to monitor going forward. It is also a <em>choice</em>, and thus, potentially an important lever for policy.</p><p>This leads to one last key idea regarding the impact of AI: how much power will AI have over the world?</p><h1>Intelligence vs. Power</h1><p><em>AI as Normal Technology</em> makes a strong distinction between <em>intelligence</em> (ability to think) and <em>power</em> (ability to make things happen). The authors note that modern humans have far more control over the environment than our ancestors, despite being near-identical genetically. They conclude that intelligence and power are not necessarily tied together &#8211; a major departure from the <em>AI 2027</em> scenario.</p><p>They go on to point out that there are many ways in which we could ensure that AI-based systems are safe. Safety covers a variety of considerations, one of which is ensuring that AIs don&#8217;t have the power to cause harm. They mention mechanisms for safety and control that include evaluation of systems before they are deployed; monitoring their operation; failsafes and circuit breakers that prevent an AI from operating outside defined limits; formal verification of system properties; constraining AIs to the minimal capabilities required for their job; and using simpler, more-trusted AIs to supervise other AIs. They expect that, through these or similar mechanisms, the potential for AI to impact the world in harmful ways will be constrained. One author notes, &#8220;we think there have been (and will continue to be) strong business incentives against out-of-control AI, and that the market will come up with lots of innovation on control.&#8221;<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a></p><p><em>AI as Normal Technology</em> paints a picture that most people will find much more comforting than AI 2027. How much weight should we put on it?</p><h1>A Bet that AI Won&#8217;t Break the Mold</h1><p><em>AI as Normal Technology</em> presents a detailed argument against scenarios in which AI has a radical near-term impact. The heart of the argument is right there in the title: AI will unfold in the same way as other technologies, subject to <strong>normal</strong> limits on the pace of invention, innovation, adoption, and diffusion. The world is complicated, and it contains many disparate-but-interlocking components, placing natural limits on the speed of change; limits which we can bolster with sensible policy choices.</p><p>Is this the right view? It certainly has history on its side. Just within living memory, grand claims were made for the impact of the microprocessor, the personal computer, the Internet, and the smartphone&#8230; but all of these have played out within the framework of a &#8220;normal&#8221; technology. Not to mention the Segway, VR, 3D printers, and many other technologies that were also the subject of substantial hype, but have more or less fizzled<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> (at least so far).</p><p>I lived through the advent of all of those technologies, and I don&#8217;t remember any of them feeling as big as AI. But of course memory can be untrustworthy. AI maximalists would argue that a technology that can do everything a human can do (assuming AI lives up to this expectation) will be fundamentally different from anything we&#8217;ve seen in the past.</p><p>As with <em>AI 2027</em>, the <em>AI as Normal Technology</em> scenario relies on several key expectations and choices. Even for domains that seem amenable to training in the lab, such as software engineering, there won&#8217;t be headroom for AI to rapidly develop superhuman abilities; or, if that were to happen, those capabilities won&#8217;t generalize to other domains. AI won&#8217;t worm its way into widespread usage faster than other technologies; even if it turns out to be unusually compelling and flexible, that won&#8217;t be enough to overcome barriers to adoption (which in some cases should include explicit safety regulation). The world is too complex for superintelligence to have a meaningful impact; an AI might be able to plan deviously far ahead in the controlled environment of a chessboard, but not in real life. AIs will remain under close human supervision, because they won&#8217;t be sophisticated and reliable enough to decide what needs to be done and carry it out at large scale, and because the AI speed and scale advantages won&#8217;t be sufficient to tempt us to take humans out of the loop. We will invest in additional safety measures &#8211; monitoring, failsafes, constraints, and formal verification &#8211; to complement that human supervision.</p><p>Taking a step back, I think this mostly boils down to one prediction and one choice. The prediction is that there&#8217;s no fast path to AIs that are so generally intelligent &#8211; so general, and so intelligent &#8211; as to break the mold of past technologies. The choice is that we won&#8217;t succumb to the temptation to throw caution to the winds, take humans out of the loop, and shortchange other safety measures &#8211; a choice which the allure of capable AI would influence. My read of <em>AI as Normal Technology</em> takes me to the same place as <em>AI 2027</em>: to understand what sort of future we&#8217;re heading for, we should be watching to see whether AI capabilities are speeding to infinity, or just developing like&#8230; normal.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/ai-as-normal-technology?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/ai-as-normal-technology?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Arvind Narayanan and Sayash Kapoor for graciously reviewing a draft of this post and pointing out several errors. All remaining errors and misinterpretations are of course my own. Thanks also to Rachel Weinberg and Taren Stinebrickner-Kauffman for feedback and help.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>As in my discussion of <em>AI 2027</em>, I&#8217;m not going to cover all of the ideas presented in <em>AI as Normal Technology</em> &#8211; only the discussion of the scale of AI&#8217;s impact and the timeline over which that impact emerges.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>This comparison is found in the paper:</p><blockquote><p>Consider self-driving cars: In many ways, the trajectory of their development is similar to AlphaZero&#8217;s self-play&#8212;improving the tech allowed them to drive in more realistic conditions, which enabled the collection of better and/or more realistic data, which in turn led to improvements in the tech, completing the feedback loop. But this process took over two decades instead of a few hours in the case of AlphaZero because safety considerations put a limit on the extent to which each iteration of this loop could be scaled up compared to the previous one.</p></blockquote></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Sayash Kapoor, private communication.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Yes, 3D printers are a useful technology, and the others have found their niches as well, but they&#8217;ve hardly been a revolution on the grand stage. I do hold out some hope for VR and 3D printers in the long run.</p></div></div>]]></content:encoded></item></channel></rss>