<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Second Thoughts]]></title><description><![CDATA[Accessible analysis of the big issues in AI]]></description><link>https://secondthoughts.ai</link><image><url>https://substackcdn.com/image/fetch/$s_!pUAA!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6d71f419-5f21-4179-be19-65b71d59de3a_1024x1024.png</url><title>Second Thoughts</title><link>https://secondthoughts.ai</link></image><generator>Substack</generator><lastBuildDate>Sat, 27 Jun 2026 23:11:53 GMT</lastBuildDate><atom:link href="https://secondthoughts.ai/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Steve Newman]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[amistrongeryet@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[amistrongeryet@substack.com]]></itunes:email><itunes:name><![CDATA[Steve Newman]]></itunes:name></itunes:owner><itunes:author><![CDATA[Steve Newman]]></itunes:author><googleplay:owner><![CDATA[amistrongeryet@substack.com]]></googleplay:owner><googleplay:email><![CDATA[amistrongeryet@substack.com]]></googleplay:email><googleplay:author><![CDATA[Steve Newman]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[AI Can Help Plan a Bioweapon. Building One is Still Hard.]]></title><description><![CDATA[Hurdles include facilities, materials, and steps requiring unwritten lore and hands-on experience.]]></description><link>https://secondthoughts.ai/p/ai-can-help-plan-a-bioweapon-building</link><guid isPermaLink="false">https://secondthoughts.ai/p/ai-can-help-plan-a-bioweapon-building</guid><dc:creator><![CDATA[Abi Olvera]]></dc:creator><pubDate>Thu, 28 May 2026 16:03:09 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!OuCU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d2a7e86-9fa2-462c-8aa7-f7710906545d_1448x1086.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Attention all progress-minded aspiring bloggers: The Roots of Progress Institute is accepting applications for its annual blog-building fellowship. I [Steve] participated in this program in 2024 and highly recommend it! Over 10 part-time weeks, you&#8217;ll get inspiring talks from noted thinkers and writers, guidance from professional editors, and &#8211; most importantly &#8211; join a community of supportive fellow writers. Not to mention a free ticket to the excellent, invitation-only Progress Conference this October. <strong>Applications close June 1st,</strong> so <a href="https://d2vr4p04.na2.hubspotlinksstarter.com/Ctc/5F+113/d2vr4p04/VWPvQ08X-LKBW1n6r8-3pbJMpW824bMN5PfV7MN3Lg88z3m2nnW7lCdLW6lZ3nfN4C-5_PgG45fW4YV9tb2DMf4yN1b2-R33Xlk9W33f4HS76hvNWW8SJm7q2yl3D1W1dY6KK83q3kyW4CqdbY2vH-n9N3dCnZv7rzRsW6HRM-763KGm1W3Hcdy49jVsJPW7P0HYj6mzlLQW3ZJLpW5y0950W1F5yrB5kQvSmW5BJLfd4Zw3w8W3m2Jml7YtPTZW27x4Ml3Sf6ZvW258wHy5_DbmzW4XZMnt2p1GTWW7lzZnT66NmwDW4m_W3d5C0YMFW20mSxC11ZDpFW1V41s75XC5fZVg2xbP8kPf0rN3G_0jBCkzXjf7gt_2804">apply now</a>!</em></p><p><em>This is the third and final installment in a series examining the potential for AI to lower the bar to creation and use of bioweapons, written by Golden Gate Institute for AI&#8217;s <a href="https://abio.substack.com/">Abi Olvera</a>. <a href="https://secondthoughts.ai/p/why-arent-bioweapons-common">Part 1</a> explains why bioweapons are rarely used. <a href="https://secondthoughts.ai/p/tacit-knowledge-the-missing-factor">Part 2</a> presents the importance of &#8220;tacit knowledge&#8221; in biology. This installment explains why bioweapons are difficult to create, which steps do (and don&#8217;t) become easier with AI assistance, and why discussion of AI risks doesn&#8217;t always reflect the complexities of bioweapon work.</em></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!OuCU!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d2a7e86-9fa2-462c-8aa7-f7710906545d_1448x1086.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!OuCU!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d2a7e86-9fa2-462c-8aa7-f7710906545d_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!OuCU!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d2a7e86-9fa2-462c-8aa7-f7710906545d_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!OuCU!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d2a7e86-9fa2-462c-8aa7-f7710906545d_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!OuCU!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d2a7e86-9fa2-462c-8aa7-f7710906545d_1448x1086.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!OuCU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d2a7e86-9fa2-462c-8aa7-f7710906545d_1448x1086.png" width="1448" height="1086" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5d2a7e86-9fa2-462c-8aa7-f7710906545d_1448x1086.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1086,&quot;width&quot;:1448,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2766190,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/199360760?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d2a7e86-9fa2-462c-8aa7-f7710906545d_1448x1086.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!OuCU!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d2a7e86-9fa2-462c-8aa7-f7710906545d_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!OuCU!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d2a7e86-9fa2-462c-8aa7-f7710906545d_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!OuCU!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d2a7e86-9fa2-462c-8aa7-f7710906545d_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!OuCU!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5d2a7e86-9fa2-462c-8aa7-f7710906545d_1448x1086.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Can AI help someone create a bioweapon?<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> Yes, it can help with the preliminary research and planning parts of the process. An LLM can retrieve obscure details about pathogen biology, suggest design modifications, and connect ideas across disciplines &#8211; such as aerosolization and fluid dynamics &#8211; faster than any literature search.</p><p>But building a bioweapon requires many things to go right in sequence. The details depend on the specific bioweapon, but the difficult steps often include sourcing, DNA assembly, growing a live organism, stabilizing it, testing, and deploying it. AI helps substantially with the first one. The rest are physical, manual, and organism-specific. They require lab skills, specialized equipment, and the kind of troubleshooting intuition that comes from years of hands-on work (see <a href="https://secondthoughts.ai/p/why-arent-bioweapons-common">Part 1</a> and <a href="https://secondthoughts.ai/p/tacit-knowledge-the-missing-factor">Part 2</a> for more).</p><p>When a plan has multiple steps and every step must succeed in sequence, the chances of the whole chain succeeding decrease quickly. <a href="https://pubmed.ncbi.nlm.nih.gov/32522112/">Each new step multiplies the odds of failure</a>: seven steps, each with a 50% chance of working, yield less than a 1% chance of success. This means that an unskilled actor, unless they have very substantial resources, will find it nearly impossible to complete the full chain. Even if an LLM helps with some steps, the overall odds will remain extremely low. Improving the odds of success for some steps isn&#8217;t enough when so many others remain difficult. Experts are much more likely to succeed in steps within their specialty. But can LLMs help any non-expert close that gap?</p><p>Few studies use real-world participants to test whether LLMs can help non-experts acquire the underlying skills required to create bioweapons. The largest such study to date, by ActiveSite, indicated that non-experts with access to mid-2025 LLMs performed better on individual virology tasks but <a href="https://x.com/ActiveSiteBio/status/2024536132961390826">saw no meaningful improvement in end-to-end workflows</a>. Success rates on these basic workflows remained below 8%. And basic virology is only a fraction of what bioweapon creation demands; the study used a pathogen simpler than influenza and didn&#8217;t cover deployment, which presents its own engineering barriers. This is likely why bioweapon development has historically been carried out by teams with decades of experience and institutional support.</p><p>To see why bioweapon construction entails so many hard problems, it helps to sample the steps. The table below is an illustrative pipeline.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> (Note: Different organisms skip different steps: anthrax found in soil doesn&#8217;t require DNA assembly, but hits an engineering wall at large-scale weaponization.)</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/r57h1/4/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/95f1e7a1-b5ad-4656-8e25-85a53e4dc689_1220x4548.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/399a32d7-4a38-4af1-bc75-ebc73faaf436_1220x4548.png&quot;,&quot;height&quot;:2328,&quot;title&quot;:&quot;| Created with Datawrapper&quot;,&quot;description&quot;:&quot;Create interactive, responsive &amp; beautiful charts &#8212; no code required.&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/r57h1/4/" width="730" height="2328" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><h1>Each step presents hurdles</h1><p><em>Nearly every lab step depends on &#8220;good hands&#8221; or the predominantly tacit skill of labwork. </em>Consider pipetting, described by one scientist as &#8220;<a href="https://www.linkedin.com/pulse/good-hands-more-science-than-art-elisabeth-chapman-ph-d-/#:~:text=.%20Finally%2C%20we%20entered%20that%20mystical%20zone%20respected%20and%20lightly%20feared%20by%20most%20conscientious%20bench%20scientists%E2%80%A6pipetting.">mystical, if not lightly feared</a>&#8221; by bench scientists. Pipetting requires mastery of tip position, speed, and viscosity, depending on the task at hand. It is a complex balance between mixing and causing shear or damage to the DNA segments. AI provides knowledge. It doesn&#8217;t provide muscle memory or the unwritten tiny steps known to each scientist for their <a href="https://scienceblogs.com/principles/2008/11/10/hands-of-science#:~:text=And%20then%2C%20of,the%20same%20thing.">hyperspecific workflow</a> (see <a href="https://secondthoughts.ai/p/tacit-knowledge-the-missing-factor">Part 2</a> for more).</p><p><em>Biological materials are perishable in ways that are easy to underestimate.</em> RNA degrades. Pathogens lose viability. A delay that&#8217;s tolerable for one organism may be fatal for another, and the tolerances aren&#8217;t always published. Even governments can&#8217;t stockpile biological agents the way they stockpile nuclear or chemical weapons. The shelf life is too inconsistent and too organism-specific. Anthrax, one of the more stable agents, loses the plasmids necessary for toxin production <a href="https://www.usanca.army.mil/LinkClick.aspx?fileticket=vfSRmHPb1IM%3D&amp;portalid=114">during long-term storage</a>.</p><p><em>Every step requires repeated testing, and testing is itself a source of risk and delay. </em>Opening a vial to verify the materials poses a risk of contamination and personal exposure. Skipping verification means discovering failures three steps later. DNA sequencing has gotten faster and cheaper, but extracting, purifying, and loading samples still requires hands-on work. When sequencing analysis flags a bad batch, you still have to go back and start over.</p><p><em>Every piece of equipment in a biology lab breaks, drifts, or needs maintenance. </em>In a normal lab, this is background noise. You call a service tech and order parts. In a clandestine operation, these acts are not possible or could lead to exposure.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a> Some modifications, such as negative-pressure rooms or biosafety exhaust systems, require structural work that you cannot fully conceal or do yourself. Additionally, the equipment still has its own imperfections. This incubator runs a half-degree hot. That centrifuge vibrates at a specific speed. In a normal lab, you learn these things through experience, with colleagues who notice when something looks off. Without support, you&#8217;re troubleshooting blind. Military research identifies <a href="https://www.usanca.army.mil/LinkClick.aspx?fileticket=vfSRmHPb1IM%3D&amp;portalid=114">access to equipment</a> as a primary factor limiting the success of non-state groups&#8217; attempts.</p><h1>Aiming to unleash a modified virus adds additional hurdles</h1><p>Everything above assumes you&#8217;re working with a known pathogen. If you want something worse than what nature or your lab supplier provides, <a href="https://philsci-archive.pitt.edu/22539/1/Montague2023_GUTMBT_final.pdf">the task becomes dramatically harder</a>.</p><p><em>If you need to reverse the weakening of the pathogen you ordered:</em> Samples of dangerous pathogens available to order are often deliberately weakened for safety. Undoing this requires weeks of assembly work, months of iterative testing, and repeated failures. Strains are weakened through multiple, often interdependent mechanisms such as deleted virulence genes, modified regulatory sequences, and metabolic dependencies.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> You rarely know all the attenuation mechanisms. (Published papers typically describe the major modifications, but not every subtle change.) To confirm that you&#8217;ve successfully restored the pathogen to full strength, sequencing won&#8217;t be enough. You need to test virulence, transmission, and immune evasion in live subjects. AI won&#8217;t suffice because it cannot predict immune cascade effects, microbiome interactions, and other complex dynamics. A single silent mutation can kill infectivity in ways that won&#8217;t show up until you&#8217;re testing in humans. Testing in humans is risky, posing a high risk of detection or accidental release. You need to recruit willing subjects or <a href="https://www.unodc.org/unodc/en/human-trafficking/Webstories2023/tracking-illicit-financial-flows-linked-to-human-trafficking-and-migrant-smuggling.html#:~:text=For%20example%2C%20reported,for%206%2C300%20USD.">engage in human trafficking</a> of someone. AI&#8217;s contribution is limited to the planning and research here, not the execution.</p><p><em>If you want to modify a known pathogen to make it more deadly or transmissible:</em> While scientists regularly modify pathogens for various purposes, this remains largely theoretical as a bioweapons approach.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a> Every difficulty above gets worse. You&#8217;re no longer just trying to restore a pathogen&#8217;s natural function. You&#8217;re applying genetic modifications that can break down, silently degrading function. Modifications interact: a change that increases transmissibility might reduce virulence, or vice versa, and you won&#8217;t know until you test in living systems. The researchers capable of troubleshooting these complex testing cycles are a very small pool of molecular biologists who require significant biosafety training and institutional infrastructure. AI&#8217;s contribution is limited because the relevant data is sparser: fewer people have done this work, less of it is published, and the results are more organism-specific.</p><p><em>Designing a new virus or bacteria from scratch</em><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a><em>:</em> Designing a virus is harder than rocketry, says microbiologist Michael Montague. Rocket engineering optimizes within a known possibility space. The laws of physics are fixed. If your math is right and your components perform to spec, the rocket works. Each failed test tells you something specific: this valve leaked, that stage separated too early. You can tell you&#8217;re getting closer. Biology doesn&#8217;t work this way. This is why we still can&#8217;t cure cancer after 50 years and hundreds of billions in research. The action space in biology is vast, partly unmapped, and full of interactions that aren&#8217;t predictable from first principles. You can design a pathogen with a specific combination of properties on paper, e.g., high virulence, high transmissibility, long incubation, and immune evasion, but there&#8217;s no way to know in advance whether that combination is biologically viable. While nature generates viable viruses constantly, nature works through parallel filtering of billions of variants, most of which fail. A designer in a lab might spend years iterating a handful of designs toward an endpoint that biology simply won&#8217;t support, with no indication of whether you&#8217;re close, far, or chasing something impossible. AI can&#8217;t close this loop because the relevant data doesn&#8217;t exist yet. You have to generate it yourself in the lab through trial and error, which can go on indefinitely.</p><h1>When hard steps stop being hard</h1><p>The history of biotechnology is a history of hard things getting easier. Gene synthesis was once the exclusive domain of well-funded governments. Then universities. Then startups. Then hobbyists.</p><p>Each time a step got easier, working with biology got slightly easier, but it still remained hard.</p><p>Some developments, such as general-purpose lab automation, could meaningfully reduce the difficulty of the steps involved in creating a bioweapon. Even the most skeptical biosecurity expert I interviewed expected that humanoid robotics would eventually transform laboratory work, particularly once they could acquire tacit knowledge the way lab interns do: through proximity and repetition.</p><p>Today, lab robots are not general tools that a novice could deploy. They are painstakingly calibrated for specific workflows in well-funded labs. But the gap between specialized and general-purpose automation is narrowing every year.</p><p>Right now, the world&#8217;s protection from bioweapons comes less from any single safeguard than from the compounding friction across a very long chain. Rather than declaring the overall risk high or low, risk monitoring should evaluate which specific steps are becoming easier, how quickly, and whether the weakest links in the chain are moving.</p><p>But this complexity rarely surfaces in AI biosecurity debates online.</p><h1>The broader discourse problem</h1><p>Even if someone completed every step and successfully recreated the 1918 flu, which killed 50 million people, the threat still looks different from what most people assume.</p><p>Most 1918 flu deaths were due to bacterial pneumonia, not the virus itself. We now have antibiotics. Current vaccines also work against the original strain. Also, since the 1918 virus evolved into today&#8217;s seasonal flu, <a href="https://pmc.ncbi.nlm.nih.gov/articles/PMC7095040/">most people&#8217;s immune systems recognize its descendants</a>.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a> Unleashing it would be bad, but not as bad as the original pandemic.</p><p>These real-world complications, known to biosecurity experts, don&#8217;t show up in headlines. Headlines often focus on abstract capabilities (AI can design pathogens, genomes are public, DNA synthesis is cheap), making the risk of biological attack seem enormous. But adding real-world frictions, such as regulatory barriers and technical bottlenecks, paints a different picture, at least in the near term.</p><p>The AI biosecurity community is divided along fault lines similar to those in the broader online AI discourse, with one group focusing on AI trajectories and another anchored on bottlenecks in the world. Traditional biosecurity professionals focus on addressing today&#8217;s threats through pandemic preparedness, disease surveillance, and closing public health gaps. Many examine how AI might impact these existing challenges. A newer group, closely linked to the AI safety community, focuses on future risks, such as engineered pathogens that could cause catastrophic outcomes. All of these groups do intensive research, though their work is often tailored to their own communities&#8217; concerns and frameworks.</p><p>The SecureBio <a href="https://www.virologytest.ai/">virology capabilities test</a> is an example of rigorous research of interest to the AI safety world, but not closely tracked by traditional biosecurity professionals. The test, which found that GPT5.5 outscores expert human virologists, is cited as evidence that AI has crossed a danger threshold. The test focuses on real-world, lab-type situations, asking questions like, &#8220;I infected cells with avian flu at 37&#176;C and 5% carbon dioxide in a jelly layer. Then incubated with these special proteins for 48 hours. Results look wrong [image included]. What happened?&#8221; The test does a great job of going beyond virology knowledge by addressing the intricacies of lab work. The test questions came from virologists describing real scenarios they encountered daily, with a focus on unpublished knowledge.</p><p>However, traditional biosecurity professionals often don&#8217;t consider knowledge of lab work a primary bottleneck. This is partly because real lab work is full of situations where the outcome you&#8217;d predict from theory doesn&#8217;t match what&#8217;s actually happening. Much like how a doctor&#8217;s diagnosis may be wrong despite being correct based on symptoms, the reasons why a virology task failed are often opaque even to trained scientists. Biology involves manipulating real things we don&#8217;t fully understand. The difficulty is the messy reality in front of you.</p><p>Research shows that the impact of text-based knowledge on lab capability <a href="https://forecastingresearch.substack.com/p/how-well-did-superforecasters-and">is overestimated</a>. Troubleshooting help evidently isn&#8217;t enough for non-experts to reliably get results, let alone to prevent failure modes they don&#8217;t know about. Most of what an experienced virologist relies on is not written down. It is pattern recognition built over years of working with the organisms she studies. And bioweapon creation is extremely difficult; a bad actor would need to succeed in conditions where experts regularly fail.</p><p>Research that would help bridge the divergent views within the biosecurity community would test theoretical risks against real barriers. Studies like ActiveSite, which check whether ordinary people can complete dangerous tasks with AI help, are a great start. These types of studies are rare, partly because they are hard to run and require lab space, ethics board approval, and significant resources. But these would join both communities in tracking where real risk begins. Right now, we are measuring what is easy to measure in ways that don&#8217;t change existing assumptions within each community.</p><p><em>Again, this is the third and final installment in our series on AI and bioweapons. <a href="https://secondthoughts.ai/p/why-arent-bioweapons-common">Part 1</a> explains why bioweapons are rarely used; <a href="https://secondthoughts.ai/p/tacit-knowledge-the-missing-factor">part 2</a> presents the importance of &#8220;tacit knowledge&#8221; in biology.</em></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/ai-can-help-plan-a-bioweapon-building?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/ai-can-help-plan-a-bioweapon-building?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Steve Newman, Taren Stinebrickner-Kauffman, Mike Montague, Matt Sharkey, Gigi Gronvall, and David Manheim for suggestions and feedback.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>By bioweapon, we mean pathogens with the inherent capacity to kill thousands, if not millions of people. This is the category dominating AI biosecurity discourse. Simpler, less scalable agents such as ricin from castor beans or Salmonella (used in the 1984 salad bar attacks) fall outside this scope.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>This covers only known unmodified viruses or bacteria, deployed at scale.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>One could bypass repair needs by buying used equipment on eBay or similar sources each time something breaks. However, this introduces quality issues for delicate and dangerous lab work. Often, equipment needed for sophisticated attacks falls under export controls, end-use monitoring, or scrutiny from government agencies. These include specialized jet mills, associated air handling/classification systems, spray dryers, and specialized aerosol generators/micronizers. Even if the specific equipment needed is available, each purchase exposes the buyer to the seller and marketplace, and creates a transaction record. Cash payments obscure some of this, but repeated purchases of specialized lab equipment and materials draw attention.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Someone could order DNA fragments for a full-strength virus from synthesis companies instead of ordering a ready-made, weakened pathogen. An <a href="https://www.nature.com/articles/s41467-025-67955-3">MIT Red Team study pulled this off</a>, though their institutional credibility helped. A <a href="https://thebulletin.org/2024/06/mit-researchers-ordered-and-combined-parts-of-the-1918-pandemic-influenza-virus-did-they-expose-a-security-flaw/">non-state actor</a> would have a significantly harder time. However, a real gap exists: roughly <a href="https://www.nti.org/about/programs-projects/project/preventing-the-misuse-of-dna-synthesis-technology/">20% of synthesis providers</a> don't screen. Notably, converting synthetic DNA into a living, viable virus is overwhelmingly harder than ordering and synthesis itself.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>Gain-of-function research in well-equipped institutions has made flu viruses more transmissible in animals. Animal transmissibility and human transmissibility, however, are not the same. No institution or researcher has ever modified a pathogen to successfully increase transmissibility in humans.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>Some people cite bacteriophages as evidence of<a href="https://80000hours.org/podcast/episodes/richard-moulange-ai-bioweapons-biorisk/">&nbsp;AI-created novel viruses</a>; however, practitioners don&#8217;t consider them truly novel. These are 7% different from nature, which is seen as normal evolutionary change. These engineered bacteriophages don&#8217;t need to evade the immune responses that even bacteria possess, making it unclear whether they&#8217;re viable compared to natural viruses that have survived that selection pressure. A pathogen needs to survive multiple environmental transitions: for E. coli, this means persisting on food, transitioning from room temperature to body temperature, surviving digestive enzymes and stomach acid, and passing through intestines with changing pH just to reach its host. It would need to repeat this through multiple animal species to complete its lifecycle. In the real world, viruses compete fiercely with each other and with non-viral pathogens. Finding a fit virus is less about design sophistication than about how many times you roll the dice; evolution has no intentional design tool. Modified bacteriophages are medically useful despite their inability to spread because they can be delivered directly to patients. But this isn&#8217;t evidence that engineering real-world threat agents is easy. Still, it represents progress in AI-assisted biotech, and the trajectory of that progress is worth monitoring and mentioning.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p> Note that this does not imply full immunity.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[From Compute Overhang to Compute Crunch]]></title><description><![CDATA[The state of AI in Q2 2026, part 2]]></description><link>https://secondthoughts.ai/p/the-ai-race</link><guid isPermaLink="false">https://secondthoughts.ai/p/the-ai-race</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Tue, 19 May 2026 18:14:43 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!jv1G!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c672ab2-9f80-47da-b1ed-b97bb9dcd107_1448x1086.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>In the<a href="https://secondthoughts.ai/p/is-ai-2027-coming-true"> first installment</a> of this &#8220;state of AI&#8221; series, I reviewed the pace of change. In this installment, I&#8217;ll review the factors driving that change. Understanding the forces behind AI progress can help us understand where things will go from here.</em></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jv1G!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c672ab2-9f80-47da-b1ed-b97bb9dcd107_1448x1086.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jv1G!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c672ab2-9f80-47da-b1ed-b97bb9dcd107_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!jv1G!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c672ab2-9f80-47da-b1ed-b97bb9dcd107_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!jv1G!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c672ab2-9f80-47da-b1ed-b97bb9dcd107_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!jv1G!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c672ab2-9f80-47da-b1ed-b97bb9dcd107_1448x1086.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jv1G!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c672ab2-9f80-47da-b1ed-b97bb9dcd107_1448x1086.png" width="1448" height="1086" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1c672ab2-9f80-47da-b1ed-b97bb9dcd107_1448x1086.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1086,&quot;width&quot;:1448,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2573173,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/198337189?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c672ab2-9f80-47da-b1ed-b97bb9dcd107_1448x1086.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jv1G!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c672ab2-9f80-47da-b1ed-b97bb9dcd107_1448x1086.png 424w, https://substackcdn.com/image/fetch/$s_!jv1G!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c672ab2-9f80-47da-b1ed-b97bb9dcd107_1448x1086.png 848w, https://substackcdn.com/image/fetch/$s_!jv1G!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c672ab2-9f80-47da-b1ed-b97bb9dcd107_1448x1086.png 1272w, https://substackcdn.com/image/fetch/$s_!jv1G!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1c672ab2-9f80-47da-b1ed-b97bb9dcd107_1448x1086.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2><strong>A punctured equilibrium</strong></h2><p>I was in my 20s when the World Wide Web burst onto the scene. I remember being startled at how quickly it went from an obscure technical standard to a household name. On reflection, I realized the web spread rapidly because <em>the ingredients had already been in place</em>. Computers were widespread in the workplace and home. Many were already connected to modems or networks to access AOL, CompuServe, or corporate e-mail. The modems tapped into an existing worldwide network of telephone cables. And there was pent-up demand for services the Web could offer. Like a boulder perched at the edge of a cliff, the world of the 1990s was poised for change.</p><p>The AI wave is being driven by another dramatic overhang of untapped potential. Prior to the release of ChatGPT in November 2022:</p><ul><li><p>Semiconductor manufacturing was already an enormous industry, well accustomed to meeting surges in demand.</p></li><li><p>Close to 100 million &#8220;GPU&#8221; chips were being manufactured per year. They were primarily used for video game graphics, which involves massive numbers of numerical calculations, meaning these chips could easily be repurposed for training large language models.</p></li><li><p>Semiconductor factories can easily pivot to new chip designs. As a result, capacity could be shifted from other chips to GPUs, and GPU designs could rapidly be altered to more efficiently execute the &#8220;deep learning&#8221; algorithms used in LLMs.</p></li><li><p>Vast quantities of text existed in digital form, ready to be fed into AI training pipelines.</p></li><li><p>Most non-physical work had long since moved from pen and paper to computers, leaving it poised for automation.</p></li></ul><p>One result is that, since the beginning of 2022, the world&#8217;s supply of AI computing capacity has increased by a factor of 1000!</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!GezR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7760d93-61f7-4b5a-98f7-9d640b3b37f9_1038x768.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!GezR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7760d93-61f7-4b5a-98f7-9d640b3b37f9_1038x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!GezR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7760d93-61f7-4b5a-98f7-9d640b3b37f9_1038x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!GezR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7760d93-61f7-4b5a-98f7-9d640b3b37f9_1038x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!GezR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7760d93-61f7-4b5a-98f7-9d640b3b37f9_1038x768.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!GezR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7760d93-61f7-4b5a-98f7-9d640b3b37f9_1038x768.jpeg" width="1038" height="768" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a7760d93-61f7-4b5a-98f7-9d640b3b37f9_1038x768.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:1038,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!GezR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7760d93-61f7-4b5a-98f7-9d640b3b37f9_1038x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!GezR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7760d93-61f7-4b5a-98f7-9d640b3b37f9_1038x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!GezR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7760d93-61f7-4b5a-98f7-9d640b3b37f9_1038x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!GezR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa7760d93-61f7-4b5a-98f7-9d640b3b37f9_1038x768.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Notice how steeply this graph rises. Then note that it only 3.5 years! (<a href="https://epoch.ai/data-insights/ai-chip-production">source</a>)</figcaption></figure></div><p>Eventually, some of these tailwinds will peter out, as I&#8217;ll discuss below. But for now, all of this untapped potential is driving an insane pace of change. Everyone in the AI industry feels like they&#8217;re in a race. Some are racing to beat the competition; some are just racing to rake in money as quickly as possible. The most pivotal race is that between OpenAI and Anthropic.</p><h2><strong>Two companies are setting the pace &#8211; and choosing the course</strong></h2><p>An unfathomable number of companies are working to develop and deploy AI. Among startups alone, hundreds have raised $100M or more, including many I&#8217;ve never heard of.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!2-hx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F857fd4d0-bce8-4079-a78a-4ae8eeb41d40_1174x768.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2-hx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F857fd4d0-bce8-4079-a78a-4ae8eeb41d40_1174x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!2-hx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F857fd4d0-bce8-4079-a78a-4ae8eeb41d40_1174x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!2-hx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F857fd4d0-bce8-4079-a78a-4ae8eeb41d40_1174x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!2-hx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F857fd4d0-bce8-4079-a78a-4ae8eeb41d40_1174x768.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2-hx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F857fd4d0-bce8-4079-a78a-4ae8eeb41d40_1174x768.jpeg" width="1174" height="768" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/857fd4d0-bce8-4079-a78a-4ae8eeb41d40_1174x768.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:1174,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!2-hx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F857fd4d0-bce8-4079-a78a-4ae8eeb41d40_1174x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!2-hx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F857fd4d0-bce8-4079-a78a-4ae8eeb41d40_1174x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!2-hx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F857fd4d0-bce8-4079-a78a-4ae8eeb41d40_1174x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!2-hx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F857fd4d0-bce8-4079-a78a-4ae8eeb41d40_1174x768.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">ChatGPT Pro claims that each of these AI companies has raised $100M or more.</figcaption></figure></div><p>Despite all this activity, most current action is being driven by progress in two areas:</p><ol><li><p>General-purpose <em>models</em> like ChatGPT and Claude, that are trained on a vast array of data and can take on a wide range of bite-sized tasks.</p></li><li><p>&#8220;Agent harnesses&#8221; like Codex and Claude Code, that tackle large projects by breaking them up into a series of tasks that can be handled by a model.</p></li></ol><p>Models, on their own, can do things like answering questions or summarizing documents. Incorporated into an agent harness, they can carry out an increasing range of valuable tasks, from writing software to preparing presentations. Together, they are central to the present-day impacts and expansive future scenarios of AI.</p><p>Developing a cutting-edge model is now a multi-billion-dollar proposition. Only five companies have been able to spend that kind of money, and three of them are not getting success for their money.</p><p>AI efforts at <strong>Meta</strong> (Facebook) have been plagued by dysfunction and leadership churn. Meta has never managed to produce a cutting-edge general-purpose model, and at this point, there&#8217;s no obvious reason to believe they will. They&#8217;ve pushed the state of the art in some areas, and their AI efforts have apparently paid off by improving ad targeting, but they&#8217;re not a central player in the most important race.</p><p><strong>xAI</strong> &#8211; Elon Musk&#8217;s AI company, now part of SpaceX &#8211; has built a pair of gigantic data centers in record time, aptly dubbed Colossus 1 and 2. However, their &#8220;Grok&#8221; model is best known for contradicting crazy things said by Musk, creating racy pictures of nonconsenting women, and<a href="https://www.nytimes.com/2025/05/17/opinion/grok-ai-musk-x-south-africa.html"> briefly referring to itself as MechaHitler</a>. xAI could conceivably have a future in robotics (a focus for another Musk company, Tesla). But for now, they are such an unimportant player in the AI world (experiencing <a href="https://www.theinformation.com/articles/spacexai-exodus-50-recent-exits-meta-thinking-machines-hire-staff?rc=jsaoww">ongoing departures of key staff</a>) that they can&#8217;t make use of the Colossus 1 data center and are<a href="https://www.anthropic.com/news/higher-limits-spacex"> handing it to Anthropic</a>.</p><p>The most surprising failure, of course, is <strong>Google</strong>. In a familiar story in tech, Google invented the key technologies behind modern AI, but was caught entirely by surprise when OpenAI commercialized them in ChatGPT. Google has subsequently funneled massive efforts into developing their Gemini model, and while they&#8217;ve had much better results than Meta or xAI, they don&#8217;t seem to be able to keep pace with the front-runners<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. This was brought home by a recent informal survey of preferred AI coding tools, where fewer than one percent of respondents preferred Gemini<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FIjv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98242487-e772-4599-8ca0-4f531a9268b7_1366x719.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FIjv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98242487-e772-4599-8ca0-4f531a9268b7_1366x719.png 424w, https://substackcdn.com/image/fetch/$s_!FIjv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98242487-e772-4599-8ca0-4f531a9268b7_1366x719.png 848w, https://substackcdn.com/image/fetch/$s_!FIjv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98242487-e772-4599-8ca0-4f531a9268b7_1366x719.png 1272w, https://substackcdn.com/image/fetch/$s_!FIjv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98242487-e772-4599-8ca0-4f531a9268b7_1366x719.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FIjv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98242487-e772-4599-8ca0-4f531a9268b7_1366x719.png" width="1366" height="719" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/98242487-e772-4599-8ca0-4f531a9268b7_1366x719.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:719,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!FIjv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98242487-e772-4599-8ca0-4f531a9268b7_1366x719.png 424w, https://substackcdn.com/image/fetch/$s_!FIjv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98242487-e772-4599-8ca0-4f531a9268b7_1366x719.png 848w, https://substackcdn.com/image/fetch/$s_!FIjv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98242487-e772-4599-8ca0-4f531a9268b7_1366x719.png 1272w, https://substackcdn.com/image/fetch/$s_!FIjv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F98242487-e772-4599-8ca0-4f531a9268b7_1366x719.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Even more compelling is a fact that I mentioned in my last post: <strong>engineers at Google DeepMind prefer Claude over Google&#8217;s own tools</strong> &#8211; to the point where &#8220;<a href="https://x.com/Steve_Yegge/status/2046260541912707471">several engineers reportedly threatened to leave</a>&#8220; if forced to switch.</p><p>It would be unwise to count Google out. They have the mother of all cash cows,<a href="https://epoch.ai/data/ai-chip-owners?view=graph&amp;tab=h100_equivalents"> almost as much AI chip capacity as Microsoft and Amazon combined</a>, a deep bench of AI talent, access to stupendous amounts of data, and distribution via the likes of Google Search, Chrome, YouTube, Android, Gmail, and Docs. But if established companies are good at one thing, it&#8217;s squandering opportunities, and in this regard Google has clearly joined the establishment. Their broad portfolio of established products has seemed more of an entanglement than a leg up.</p><p>That leaves two companies defining the course of AI.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><h2><strong>OpenAI vs. Anthropic: too close to call</strong></h2><p>From the day ChatGPT took the world by storm, the AI race has been OpenAI&#8217;s to lose. And last fall, they appeared to be doing just that &#8211; losing. During three years of mostly-unchallenged leadership, OpenAI lacked focus, with plans ranging from videos to wearable devices to advertising to shopping agents, even an <a href="https://techcrunch.com/2026/03/24/openais-sora-was-the-creepiest-app-on-your-phone-now-its-shutting-down/">ill-fated social network</a>. Meanwhile, Anthropic was moving from strength to strength: the Claude chatbot became the cognoscenti&#8217;s tool of choice, Claude Code defined the era of coding agents, and the raw capability of Anthropic&#8217;s AI models was advancing rapidly while OpenAI seemed to stumble. In late 2025 and early 2026, there was near-consensus among insiders that Anthropic had pulled ahead in the areas that mattered, while avoiding the controversies and distractions that plagued OpenAI.</p><p>Coding agents may be the single most important technology of the moment, as both an enormous moneymaker (strongly contributing to Anthropic&#8217;s meteoric revenue), and the tool that is driving the acceleration of AI R&amp;D. And in this area, Anthropic&#8217;s lead was especially strong, with Claude Code the almost-undisputed leader. The announcement of Claude Mythos seemed to cement that lead; not only was Mythos astonishingly capable at finding cybersecurity bugs, it smashed records on software development benchmarks.</p><p>However, OpenAI &#8211; unlike Google &#8211; always managed to stay within striking distance. And in the last couple of months, the race is looking more even. OpenAI&#8217;s latest model, GPT-5.5, is widely held to be on a par with Anthropic&#8217;s current Opus 4.7. They&#8217;ve rolled out a flurry of improvements to Codex (their coding agent), narrowing the feature gap with Claude Code. Many insiders are shifting back to OpenAI tools for coding, research, and other &#8220;objective&#8221; tasks, even if Claude remains the favorite for brainstorming and writing. And OpenAI may still have gas in the tank. There is<a href="https://x.com/sdmat123/status/2048247373193785386/history"> speculation</a> that GPT-5.5 was rushed out the door without much polish, leaving room for a quick 5.6 followup that could put them in the lead.</p><p>Meanwhile, Anthropic is showing signs of strain. Service outages occur <a href="https://status.claude.com/">shockingly often</a><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>. Flubbed attempts to free up computing capacity have caused Claude Code to get accidentally &#8220;dumber&#8221;<a href="https://www.anthropic.com/engineering/april-23-postmortem"> on at least three separate occasions</a>, one of which was not fixed for over a month.</p><p style="text-align: center;">This level of service disruption would be inconceivable at most companies</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7d54!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75bfb085-93fd-4ac1-ac4b-0b5dd09d072f_1366x163.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7d54!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75bfb085-93fd-4ac1-ac4b-0b5dd09d072f_1366x163.png 424w, https://substackcdn.com/image/fetch/$s_!7d54!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75bfb085-93fd-4ac1-ac4b-0b5dd09d072f_1366x163.png 848w, https://substackcdn.com/image/fetch/$s_!7d54!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75bfb085-93fd-4ac1-ac4b-0b5dd09d072f_1366x163.png 1272w, https://substackcdn.com/image/fetch/$s_!7d54!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75bfb085-93fd-4ac1-ac4b-0b5dd09d072f_1366x163.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7d54!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75bfb085-93fd-4ac1-ac4b-0b5dd09d072f_1366x163.png" width="1366" height="163" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/75bfb085-93fd-4ac1-ac4b-0b5dd09d072f_1366x163.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:163,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7d54!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75bfb085-93fd-4ac1-ac4b-0b5dd09d072f_1366x163.png 424w, https://substackcdn.com/image/fetch/$s_!7d54!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75bfb085-93fd-4ac1-ac4b-0b5dd09d072f_1366x163.png 848w, https://substackcdn.com/image/fetch/$s_!7d54!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75bfb085-93fd-4ac1-ac4b-0b5dd09d072f_1366x163.png 1272w, https://substackcdn.com/image/fetch/$s_!7d54!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75bfb085-93fd-4ac1-ac4b-0b5dd09d072f_1366x163.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>It seems impossible to guess whether either competitor will pull ahead. The best guess might be that OpenAI and Anthropic will continue to play tag &#8211; and that their competitors will have a very hard time catching up.</p><h2><strong>OpenAI and Anthropic are benefiting from feedback loops that have nothing to do with AI automating R&amp;D</strong></h2><p>Aggressive forecasts for AI progress usually involve a feedback loop where one or more leaders gain an insurmountable advantage. The story goes that whoever has the best AI will use it to accelerate their own R&amp;D, leaving competitors to fall further and further behind. This process is referred to as &#8220;recursive self-improvement&#8221;, or RSI.</p><p>OpenAI and and Anthropic have been consistently out-executing the rest of the AI world for several years now. But AI tools weren&#8217;t capable enough to dramatically impact overall R&amp;D progress until recently, so this isn&#8217;t RSI in action. Our two front-runners are simply following a pattern that has played out many times in tech &#8211; a special mojo that comes with being the leader in an exciting new domain. It starts with early success, based on a strong founding team and culture, a well-positioned product, and luck. That early success builds confidence and attracts investment, customers, and talented staff. There are plenty of ways this can go wrong, but also many examples of it going very right for a long time.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MEGW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e3209be-22dc-4119-bcca-29789c4c5a0c_1406x982.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MEGW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e3209be-22dc-4119-bcca-29789c4c5a0c_1406x982.png 424w, https://substackcdn.com/image/fetch/$s_!MEGW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e3209be-22dc-4119-bcca-29789c4c5a0c_1406x982.png 848w, https://substackcdn.com/image/fetch/$s_!MEGW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e3209be-22dc-4119-bcca-29789c4c5a0c_1406x982.png 1272w, https://substackcdn.com/image/fetch/$s_!MEGW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e3209be-22dc-4119-bcca-29789c4c5a0c_1406x982.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!MEGW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e3209be-22dc-4119-bcca-29789c4c5a0c_1406x982.png" width="1406" height="982" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4e3209be-22dc-4119-bcca-29789c4c5a0c_1406x982.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:982,&quot;width&quot;:1406,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:92612,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/198337189?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e3209be-22dc-4119-bcca-29789c4c5a0c_1406x982.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!MEGW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e3209be-22dc-4119-bcca-29789c4c5a0c_1406x982.png 424w, https://substackcdn.com/image/fetch/$s_!MEGW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e3209be-22dc-4119-bcca-29789c4c5a0c_1406x982.png 848w, https://substackcdn.com/image/fetch/$s_!MEGW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e3209be-22dc-4119-bcca-29789c4c5a0c_1406x982.png 1272w, https://substackcdn.com/image/fetch/$s_!MEGW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4e3209be-22dc-4119-bcca-29789c4c5a0c_1406x982.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Google&#8217;s sustained success has nothing to do with &#8220;recursive self-improvement&#8221;</figcaption></figure></div><p>Now, on <em>top</em> of that, 2026 seems to be the year where AI is indeed starting to accelerate its own development. I&#8217;ve already mentioned that Google engineers feel hampered if they&#8217;re not allowed to use Claude Code. But only Anthropic&#8217;s engineers can use their best model, Mythos &#8211; it&#8217;s not yet available to others (except for cybersecurity purposes). The two front-runners in the AI race, who were already out-executing the rest of the field, will now have the added advantage of early access to the latest and greatest AI coding agents. (Or rather, we&#8217;re entering an era where that advantage may become very important.)</p><h2><strong>The observant reader may notice that I haven&#8217;t mentioned China</strong></h2><p>China sports a rich ecosystem of AI model developers. DeepSeek is the best known, but many companies are doing serious work, including startups like Moonshot AI and tech giants like Alibaba.</p><p>I haven&#8217;t mentioned any of them, because they&#8217;re simply too far behind to play a central role. Conventional wisdom is that Chinese models lag three to six months behind. In reality, the gap is somewhat larger; comparisons are mostly based on benchmark scores, and<a href="https://x.com/Altimor/status/2024166553357336660"> Chinese models tend to score better on benchmarks than in real-world usage</a>. Furthermore, they appear to be substantially<a href="https://x.com/mkratsios47/status/2047316220785905948"> drafting off of frontier US models</a>, whether by using US models to generate training data, or simply focusing effort on areas where US models have been observed to succeed. If OpenAI and Anthropic were to slow down, progress in China might slow as well.</p><p>A lag of<a href="https://thezvi.substack.com/i/196014315/quiet-speculations"> 8 months or so</a> may not sound like much, but in the world of AI it&#8217;s enough to render a product uncompetitive. Chinese AI developers have not captured a large share of the market &#8211; even domestic companies<a href="https://www.chinatalk.media/p/how-to-buy-cheap-claude-tokens-in"> often use</a> US models, and they may<a href="https://x.com/natolambert/status/2043334045187342632"> struggle</a> to<a href="https://www.interconnects.ai/p/reading-todays-open-closed-performance"> sustain</a> the ever-increasing levels of investment required to stay in the game (though<a href="https://www.theinformation.com/articles/deepseek-raise-7-billion-startup-plots-revenue-efforts"> DeepSeek is giving it a shot</a>). Indeed, they may now be<a href="https://x.com/kyleichan/status/2047409173722509463"> falling further behind</a>.</p><p>Another popular story is that China&#8217;s scrappy AI startups, lacking the enormous resources being invested in the US, have developed more efficient techniques. This appears to be an exaggeration: the major American labs have also been aggressively pursuing efficiency, driving down the cost for a given level of AI capability by 10x per year. There are<a href="https://simonwillison.net/2026/Apr/24/deepseek-v4/"> instances</a> of Chinese companies offering AI services at a steep discount to US pricing, but these services don&#8217;t seem to be capturing much market share<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a>.</p><p>None of this is to denigrate the capability of Chinese AI researchers. Teams like DeepSeek are doing top-notch work, but they&#8217;re hampered by lack of access to American chips and financial markets. In the world of AI, it is abundantly clear that size matters.</p><p>Chinese companies do play an important role in supporting academic AI research, including American research. That&#8217;s because they often release their model &#8220;weights&#8221; &#8211; the data file that makes up the actual AI model &#8211; for anyone to use. This allows researchers (and startups) around the world to study and modify those models. They also publish papers describing many of their advances, something that has become quite rare at leading US labs who now focus on preserving competitive advantage.</p><p>Much of China&#8217;s AI effort is focused on an entirely different domain: <a href="https://afraw.substack.com/p/mandate-of-ai#:~:text=Chinese%20thinking%20converges%20on%20something%20more%20embodied">robots, factory automation, driving, and other &#8220;embodied applications&#8221;</a>. These will be huge markets, but are not currently central to the great AI race.</p><p>Chinese &#8220;open-weight&#8221; models play a leading role in important niches, such as small AI models that can run locally on a smartphone, drone, or other &#8220;edge device&#8221;. But so long as the US retains its dominant position in the build-out of AI data centers, Chinese models seem likely to remain stuck in these niches.</p><p>And speaking of data centers, that leads me to my last topic...</p><h2><strong>The great compute crunch</strong></h2><p>Earlier, I presented a graph showing that installed AI computing capacity is doubling every 7 months. Overall AI capacity is growing even faster, as developers from OpenAI to DeepSeek find &#8220;algorithmic improvements&#8221; &#8211;&nbsp;ever-more-efficient ways to turn computing power into intelligence.</p><p>It&#8217;s not enough. Demand has been increasing faster than supply, and AI providers &#8211;<a href="https://x.com/trq212/status/2037254607001559305"> especially Anthropic</a> &#8211; are starting to tighten usage limits or move from flat rate to usage-based pricing. (Sam Altman&#8217;s aggressive past pursuit of data center deals may now give OpenAI an important advantage<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a>, locking in plenty of capacity for the next few years.)</p><p>This is driven by the explosion in use of AI &#8220;agents&#8221;. A single prompt to an agent can result in the AI working for minutes or hours. Software engineers often have many agents grinding away at once, and can easily use more tokens in one day than a casual ChatGPT user might consume in a year.</p><p>Until very recently, the conventional wisdom was that GPU chips would have a useful lifetime of 3 to 5 years, after which they&#8217;d be replaced by new, more efficient chips. In actuality, rental prices for four-year-old H100 chips, which had been falling over time, <a href="https://newsletter.semianalysis.com/p/the-great-gpu-shortage-rental-capacity">have recently begun to soar</a>, because <a href="https://newsletter.semianalysis.com/p/the-great-gpu-shortage-rental-capacity?triedRedirect=true#:~:text=On%2DDemand%20GPU%20rental%20capacity%20is%20sold%20out%20across%20all%20GPU%20types">there&#8217;s enough demand to saturate all available chips</a> &#8211; old and new.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8k4U!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2425f27f-7411-46a2-ae85-96e169f134f6_977x561.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8k4U!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2425f27f-7411-46a2-ae85-96e169f134f6_977x561.png 424w, https://substackcdn.com/image/fetch/$s_!8k4U!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2425f27f-7411-46a2-ae85-96e169f134f6_977x561.png 848w, https://substackcdn.com/image/fetch/$s_!8k4U!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2425f27f-7411-46a2-ae85-96e169f134f6_977x561.png 1272w, https://substackcdn.com/image/fetch/$s_!8k4U!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2425f27f-7411-46a2-ae85-96e169f134f6_977x561.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8k4U!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2425f27f-7411-46a2-ae85-96e169f134f6_977x561.png" width="977" height="561" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2425f27f-7411-46a2-ae85-96e169f134f6_977x561.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:561,&quot;width&quot;:977,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8k4U!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2425f27f-7411-46a2-ae85-96e169f134f6_977x561.png 424w, https://substackcdn.com/image/fetch/$s_!8k4U!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2425f27f-7411-46a2-ae85-96e169f134f6_977x561.png 848w, https://substackcdn.com/image/fetch/$s_!8k4U!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2425f27f-7411-46a2-ae85-96e169f134f6_977x561.png 1272w, https://substackcdn.com/image/fetch/$s_!8k4U!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2425f27f-7411-46a2-ae85-96e169f134f6_977x561.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The market price for renting H100 chips fell as they became dated&#8230; and then turned around as capacity became scarce. (Source: <a href="https://newsletter.semianalysis.com/p/the-great-gpu-shortage-rental-capacity">SemiAnalysis</a>.)</figcaption></figure></div><p>The crunch may get worse. Conventional wisdom is that by 2030, AI will be using most of the world&#8217;s advanced semiconductor manufacturing, at which point it will become impossible to maintain the current rate of growth in computing capacity. And other bottlenecks might arrive sooner; one respected analyst forecasts that<a href="https://newsletter.semianalysis.com/p/the-great-ai-silicon-shortage"> AI will use a majority of all memory (DRAM) manufacturing capacity this year</a>, and 69% next year! The resulting memory shortage is already causing price hikes for laptops and cell phones. So the world is running out of capacity to shift production away from other chips toward AI.</p><p>As a result, the brief era of universal access to cutting-edge AI is ending. Andy Warhol famously noted that Coca-Cola is egalitarian: &#8220;the President drinks Coke, Liz Taylor drinks Coke, and ... you can drink Coke, too&#8221;. For the first few years after ChatGPT launched, something similar was true for AI &#8211; top-notch capabilities were mostly available for free or, at most, $20 / month. With the introduction of agents, that&#8217;s no longer sustainable. I&#8217;m paying $200 / month to Anthropic to support my Claude Code habit, and another $200 / month to OpenAI for access to their &#8220;Pro&#8221; research agent. A year from now, if trends hold, those capabilities will be available for $20 / month... but my spending will probably have <em>increased</em>, because I&#8217;ll want access to the latest and greatest capabilities<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a>. Meanwhile, some &#8220;tokenmaxxing&#8221; engineers already spend hundreds of dollars <em>per day</em> to operate fleets of coding agents. Individuals and companies who can&#8217;t afford these prices may struggle to compete.</p><p>The compute crunch may also slow progress in AI capabilities. It will not, however, stop it: new chip fabs are being built, and algorithmic improvements keep squeezing more value out of each chip.</p><p>The shortage of GPUs brings me back to China. China&#8217;s AI efforts are <a href="https://www.interconnects.ai/p/notes-from-inside-chinas-ai-labs?open=false#%C2%A7where-chinas-ai-industry-differs-and-matches-the-western-labs:~:text=Desperation%20for%20more%20Nvidia%20chips">drastically</a> <a href="https://www.understandingai.org/p/a-big-lesson-of-my-china-visit-compute">limited</a> by export controls on GPU chips and other specialized equipment. We&#8217;ve already discussed how this prevents Chinese labs from training cutting-edge models. It also greatly limits the country&#8217;s ability to <em>use</em> AI, even once it&#8217;s been trained: there are simply not enough data centers in China to support anything like the intensity of usage that is emerging in the US and some other countries.</p><p>(If AI demand continues on its current trajectory, <a href="https://writing.antonleicht.me/p/cut-off">access to computing capacity may soon become an issue for every country</a>, with GPUs the &#8220;new oil&#8221; and the US a one-country cartel<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a>.)</p><p>For years, China has been working to reduce its dependence on chip imports. The country now has a substantial semiconductor manufacturing industry. However, it primarily produces lower-complexity chips, such as those used in automobiles. Its ability to produce advanced chips such as GPUs is<a href="https://www.noahpinion.blog/p/scoring-the-jensen-dwarkesh-debate#:~:text=If%20export%20controls%20had%20simply%20%E2%80%9Cgalvanized%E2%80%9D%20China%20into%20doing%20everything%20themselves%2C%20they%20wouldn%E2%80%99t%20even%20need%20Nvidia%20chips"> highly constrained</a>, depending on imported manufacturing equipment which is also subject to export controls. Many of China&#8217;s AI data centers are populated with<a href="https://x.com/EpochAIResearch/status/2049924785153638761"> smuggled chips</a>.</p><p>Jensen Huang, CEO of Nvidia (the leading maker of GPU chips for AI), argues that export controls will simply encourage China to develop its own chip industry. However, this appears to be<a href="https://x.com/ShakeelHashim/status/2045173210107306406"> out of touch with reality</a>. It seems clear that China will continue racing to expand domestic chip production whether or not the US maintains export controls. The country has been aggressively pursuing that path for years, forbidding Chinese companies from using imported chips whenever domestic alternatives are available &#8211; even when those alternatives are inferior.</p><h2><strong>The forces driving AI progress</strong></h2><p>To understand where AI is heading, it helps to understand the forces that are driving progress. As we&#8217;ve seen, the impact of AI is centered on &#8220;agents&#8221; and the general-purpose models they rely on. Progress so far has been driven by two key factors. The first is an <strong>overhang</strong> of potential that was ready to be tapped by AI:</p><ul><li><p>Chip manufacturing capacity (being rapidly repurposed to manufacture GPU chips)</p></li><li><p>Digital data (that those chips can process to train AI models)</p></li><li><p>Electronic work (that is ready for automation)</p></li></ul><p>The second is a <strong>positive feedback loop</strong> at OpenAI and Anthropic: their success draws investment, customers, and talent, fueling further success.</p><p>The overhang effect may peter out over the next few years. During that same period, AI itself may pick up the slack via the aforementioned recursive self-improvement: helping to design more-efficient chips, generating its own training data, and accelerating the work done by researchers and engineers at the AI labs.</p><p>As AI agents become able to work unattended for longer periods, demand for computing power is outstripping supply. This is leading to a world of AI haves and have-nots: those with money will have access to smarter agents and be able to use them for more tasks. At the international level, China will be at a major disadvantage, so long as export controls remain in place and effective<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a>.</p><p>This picture seems likely to hold for at least the next few years, until one of the following things happens:</p><ol><li><p>Robots start to broadly automate physical labor. This could pull the center of gravity away from agents, OpenAI and Anthropic. It might also shift the advantage to China, with its vast industrial base and commanding lead in manufacturing motors, batteries, and other robot components.</p></li><li><p>Automation of AI research leading to a true intelligence explosion, with unpredictable consequences.</p></li><li><p>An x-factor, such as popular distrust of AI in the US leading to a serious data center ban, or geopolitical strife impacting Taiwanese semiconductor manufacturing.</p></li></ol><p>It&#8217;s possible that some startup will emerge with a new approach to AI that allows it to blow past OpenAI and Anthropic. But it&#8217;s at least as likely that the current leaders would manage to incorporate the new idea. It&#8217;s also possible that AI revenue growth will slow enough to break the feedback loop of success. But<a href="https://www.wsj.com/tech/ai/openai-misses-key-revenue-user-targets-in-high-stakes-sprint-toward-ipo-94a95273"> despite concerns at OpenAI</a>, I think this is unlikely. Things might slow down; one of the leaders might stumble. But OpenAI and Anthropic have enough momentum that I expect at least one of them to remain in the lead for at least the next few years. And, barring a serious recession (and perhaps even then), I expect advances in AI capability to continue indefinitely. There are too many competitors in the race, and too many factors driving it forward, to expect any other outcome.</p><p><em>This is the second installment of my &#8220;state of AI&#8221; series. The <a href="https://secondthoughts.ai/p/is-ai-2027-coming-true">first installment</a> discussed the pace of change. In the remaining installments, I&#8217;ll cover impact (both positive and negative), the politics of AI, and how to personally navigate the coming era.</em></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/the-ai-race?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/the-ai-race?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p><em>Lots of people provided suggestions and feedback for this series! Thanks to Abi Olvera, Alex Booker, Clara Collier, Dave Kasten, Eli Pariser, Emma Kumleben, Gideon Lichfield, Kai Williams, Taren Stinebrickner-Kauffman, and Tim Schnabel. Apologies if I missed anyone!</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Google&#8217;s annual conference will take place around the time I publish this, and <a href="https://sources.news/p/google-about-to-release-new-gemini">there are reports</a> that they&#8217;ll announce a new Gemini model but that it &#8220;won&#8217;t be pushing the frontier&#8221;. Meanwhile, Google&#8217;s struggles on the product front may be even greater than on raw model capabilities, and this is a challenge that a sprawling organization seems ill-equipped to overcome.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>This audience may be biased toward Anthropic, so don&#8217;t take the Claude / ChatGPT mismatch too seriously.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>This link leads to Anthropic&#8217;s live status page; the statistics shown there will almost certainly improve at some point. The image in this post shows how things looked as I was drafting this post.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>I&#8217;m not entirely sure what is going on here. Certainly Chinese providers have capacity limits, but if a model truly delivers competitive price / performance, you&#8217;d expect some arrangement to run it on data centers in other countries, especially since most of these models are open for anyone to run. It could be that the real-world performance of these models is simply not good enough to be attractive for most customers even at a low price. There may be bias against Chinese models. Or it could be a simple lack of awareness.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p><a href="https://x.com/jukan05/status/2052953133262000238">One recent estimate</a> shows OpenAI having contracted twice as much future data center capacity as Anthropic.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>As Anton Leicht explains:</p><blockquote><p>The often-invoked hope that &#8216;efficiency curves&#8217; will compress token costs quickly doesn&#8217;t save us here: efficiency curves mean that next year, Mythos-level capabilities might be very cheap; they don&#8217;t mean that Mythos 2 will be cheaper than Mythos. The opposite is the case: frontier capabilities have grown more expensive month-to-month for years now.</p></blockquote></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>Taiwan, not the US, holds the overwhelming majority of advanced semiconductor manufacturing facilities. And other countries, such as Japan, dominate supply of other critical components for AI data centers. However, to date, these countries have allowed the US to play the dominant role in determining where that supply goes.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p>Or until their domestic industry catches up, but this will take many years.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Is AI 2027 Coming True?]]></title><description><![CDATA[The state of AI in Q2 2026, part 1]]></description><link>https://secondthoughts.ai/p/is-ai-2027-coming-true</link><guid isPermaLink="false">https://secondthoughts.ai/p/is-ai-2027-coming-true</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Thu, 07 May 2026 00:02:34 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!dSKR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7207f6c6-c0cc-45d2-96ee-697fa0c823e8_1024x771.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>A quick aside: I was recently <a href="https://www.cognitiverevolution.ai/vibe-coding-an-attention-firewall-w-steve-newman-creator-of-the-curve/">interviewed on The Cognitive Revolution</a>. In the first half, I discuss the suite of tools I&#8217;ve had Claude Code build to improve my personal productivity. In the second half, host Nathan Labenz and I get into a broader discussion of where AI is going. It was a fun conversation!</em></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dSKR!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7207f6c6-c0cc-45d2-96ee-697fa0c823e8_1024x771.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dSKR!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7207f6c6-c0cc-45d2-96ee-697fa0c823e8_1024x771.png 424w, https://substackcdn.com/image/fetch/$s_!dSKR!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7207f6c6-c0cc-45d2-96ee-697fa0c823e8_1024x771.png 848w, https://substackcdn.com/image/fetch/$s_!dSKR!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7207f6c6-c0cc-45d2-96ee-697fa0c823e8_1024x771.png 1272w, https://substackcdn.com/image/fetch/$s_!dSKR!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7207f6c6-c0cc-45d2-96ee-697fa0c823e8_1024x771.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dSKR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7207f6c6-c0cc-45d2-96ee-697fa0c823e8_1024x771.png" width="1024" height="771" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7207f6c6-c0cc-45d2-96ee-697fa0c823e8_1024x771.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:771,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1068894,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/196687950?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7207f6c6-c0cc-45d2-96ee-697fa0c823e8_1024x771.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!dSKR!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7207f6c6-c0cc-45d2-96ee-697fa0c823e8_1024x771.png 424w, https://substackcdn.com/image/fetch/$s_!dSKR!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7207f6c6-c0cc-45d2-96ee-697fa0c823e8_1024x771.png 848w, https://substackcdn.com/image/fetch/$s_!dSKR!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7207f6c6-c0cc-45d2-96ee-697fa0c823e8_1024x771.png 1272w, https://substackcdn.com/image/fetch/$s_!dSKR!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7207f6c6-c0cc-45d2-96ee-697fa0c823e8_1024x771.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>You&#8217;ve likely seen references to Claude Mythos, Anthropic&#8217;s most advanced model. It&#8217;s the first AI tool that is smart enough to be dangerous.</p><p>This is the latest in a series of data points suggesting that things may be progressing as forecast in <a href="https://ai-2027.com/">AI 2027</a>. Which is concerning, as that scenario involves a significant chance of human extinction by 2030!</p><p>I&#8217;m working on a review of the overall &#8220;state of AI&#8221;. This post started out as the first installment. As it unfolded, I realized that this question &#8211; is AI 2027 coming true? &#8211; was a unifying theme. I&#8217;ll cover other topics in another post or posts. If there&#8217;s anything you&#8217;d like me to discuss, leave a comment!</p><h2><strong>AI continues to barrel forward</strong></h2><p>Possibly my favorite tweet of the year so far is a poke at last year&#8217;s claims that progress was stalling out:</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://x.com/deanwball/status/2041644727276048580" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!KaHt!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8605152f-c3f1-4dc1-aad6-92b6761977ab_1366x271.png 424w, https://substackcdn.com/image/fetch/$s_!KaHt!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8605152f-c3f1-4dc1-aad6-92b6761977ab_1366x271.png 848w, https://substackcdn.com/image/fetch/$s_!KaHt!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8605152f-c3f1-4dc1-aad6-92b6761977ab_1366x271.png 1272w, https://substackcdn.com/image/fetch/$s_!KaHt!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8605152f-c3f1-4dc1-aad6-92b6761977ab_1366x271.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!KaHt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8605152f-c3f1-4dc1-aad6-92b6761977ab_1366x271.png" width="1366" height="271" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8605152f-c3f1-4dc1-aad6-92b6761977ab_1366x271.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:271,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/deanwball/status/2041644727276048580&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!KaHt!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8605152f-c3f1-4dc1-aad6-92b6761977ab_1366x271.png 424w, https://substackcdn.com/image/fetch/$s_!KaHt!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8605152f-c3f1-4dc1-aad6-92b6761977ab_1366x271.png 848w, https://substackcdn.com/image/fetch/$s_!KaHt!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8605152f-c3f1-4dc1-aad6-92b6761977ab_1366x271.png 1272w, https://substackcdn.com/image/fetch/$s_!KaHt!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8605152f-c3f1-4dc1-aad6-92b6761977ab_1366x271.png 1456w" sizes="100vw"></picture><div></div></div></a></figure></div><p>On every metric, AI continues up and to the right. The best measure of its importance might be people&#8217;s willingness to pay, and OpenAI revenue is absolutely exploding. Reportedly, their annual run rate quadrupled from $5.5B to $21.4B over the course of 2025, and then reached $25B in February. <strong>If OpenAI&#8217;s growth is lunatic, then Anthropic&#8217;s pace has been absolutely bananas</strong>. Revenue nearly dectupled over the course of 2025, <em>and then accelerated</em>, tripling to a $30B rate in just three months<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!amQ1!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65e4fd72-da77-4bcd-bc97-6a1095dc1f1e_1366x768.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!amQ1!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65e4fd72-da77-4bcd-bc97-6a1095dc1f1e_1366x768.png 424w, https://substackcdn.com/image/fetch/$s_!amQ1!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65e4fd72-da77-4bcd-bc97-6a1095dc1f1e_1366x768.png 848w, https://substackcdn.com/image/fetch/$s_!amQ1!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65e4fd72-da77-4bcd-bc97-6a1095dc1f1e_1366x768.png 1272w, https://substackcdn.com/image/fetch/$s_!amQ1!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65e4fd72-da77-4bcd-bc97-6a1095dc1f1e_1366x768.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!amQ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65e4fd72-da77-4bcd-bc97-6a1095dc1f1e_1366x768.png" width="1366" height="768" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/65e4fd72-da77-4bcd-bc97-6a1095dc1f1e_1366x768.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!amQ1!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65e4fd72-da77-4bcd-bc97-6a1095dc1f1e_1366x768.png 424w, https://substackcdn.com/image/fetch/$s_!amQ1!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65e4fd72-da77-4bcd-bc97-6a1095dc1f1e_1366x768.png 848w, https://substackcdn.com/image/fetch/$s_!amQ1!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65e4fd72-da77-4bcd-bc97-6a1095dc1f1e_1366x768.png 1272w, https://substackcdn.com/image/fetch/$s_!amQ1!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F65e4fd72-da77-4bcd-bc97-6a1095dc1f1e_1366x768.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">source: <a href="https://epoch.ai/data/ai-companies?view=graph&amp;tab=revenue">https://epoch.ai/data/ai-companies?view=graph&amp;tab=revenue</a></figcaption></figure></div><p>For comparison, Anthropic&#8217;s revenue rate is now higher than McDonald&#8217;s, Paramount, Mastercard, Southwest Airlines, or Charles Schwab. Another tripling would put them in the neighborhood of Disney or Tesla. And that $30B figure (taken from an <a href="https://www.anthropic.com/news/google-broadcom-partnership-compute">Anthropic announcement</a> dated April 6) may already be out of date: As of April 24, the highly respected SemiAnalysis newsletter estimates <a href="https://newsletter.semianalysis.com/p/the-coding-assistant-breakdown-more#:~:text=%2440B%20(our%20current%20estimate)">Anthropic&#8217;s revenue rate at $40B/year</a>.</p><p><a href="https://epoch.ai/eci?view=graph&amp;tab=release-date&amp;showFrontierTrend=true">Benchmark scores</a> and<a href="https://epoch.ai/trends#ai-companies"> computing capacity</a> are continuing their upward trajectories as well. None of the predictions that AI will &#8220;hit a wall&#8221; have panned out so far. In fact, the evidence seems to point toward things<a href="https://epoch.ai/blog/have-ai-capabilities-accelerated"> picking up speed</a>.</p><p>And I haven&#8217;t even gotten to Mythos yet.</p><h2><strong>Claude Mythos: the model the government doesn&#8217;t want you to have</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://www.wsj.com/tech/ai/white-house-opposes-anthropics-plan-to-expand-access-to-mythos-model-dc281ab5" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!66Aa!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8f44f69-162f-42cb-911f-bca8889b65ef_1098x172.png 424w, https://substackcdn.com/image/fetch/$s_!66Aa!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8f44f69-162f-42cb-911f-bca8889b65ef_1098x172.png 848w, https://substackcdn.com/image/fetch/$s_!66Aa!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8f44f69-162f-42cb-911f-bca8889b65ef_1098x172.png 1272w, https://substackcdn.com/image/fetch/$s_!66Aa!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8f44f69-162f-42cb-911f-bca8889b65ef_1098x172.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!66Aa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8f44f69-162f-42cb-911f-bca8889b65ef_1098x172.png" width="1098" height="172" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c8f44f69-162f-42cb-911f-bca8889b65ef_1098x172.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:172,&quot;width&quot;:1098,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://www.wsj.com/tech/ai/white-house-opposes-anthropics-plan-to-expand-access-to-mythos-model-dc281ab5&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!66Aa!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8f44f69-162f-42cb-911f-bca8889b65ef_1098x172.png 424w, https://substackcdn.com/image/fetch/$s_!66Aa!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8f44f69-162f-42cb-911f-bca8889b65ef_1098x172.png 848w, https://substackcdn.com/image/fetch/$s_!66Aa!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8f44f69-162f-42cb-911f-bca8889b65ef_1098x172.png 1272w, https://substackcdn.com/image/fetch/$s_!66Aa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc8f44f69-162f-42cb-911f-bca8889b65ef_1098x172.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p><a href="https://x.com/cgtwts/status/2042698884682780794">Here&#8217;s cybersecurity researcher Nicholas Carlini</a>, who joined Anthropic (the creators of Mythos) last year:</p><blockquote><p>I&#8217;ve found more bugs in the last couple of weeks than I found in the rest of my life combined.</p></blockquote><p>He&#8217;s referring, of course, to what he and his colleagues have accomplished using Mythos. And by &#8220;bugs&#8221;, he means security vulnerabilities in widely used software. Plenty has been written about Mythos since its announcement on April 7, so I&#8217;ll just recap a few key points:</p><ol><li><p>Mythos is a <strong>far</strong> more capable hacker than any (known) previous model.</p></li><li><p>This was entirely an <strong>accidental side effect</strong> of training the model to be good at other tasks, such as programming.</p></li><li><p>These capabilities are so dangerous that Anthropic is <strong>delaying the release</strong> until major software developers have time to fix the bugs identified by Mythos. (Anthropic may also not have enough computing capacity to offer Mythos to the general public; I&#8217;ll discuss this in the next installment.)</p></li></ol><p>So, while Mythos is not the first AI model with the potential to cause harm, it is the first that can cause harm <em>because of how smart it is</em> &#8211; <a href="https://www.anthropic.com/glasswing">smart enough to find</a> &#8220;high-severity vulnerabilities &#8230; in every major operating system and web browser&#8221;.</p><p>There is debate as to whether Mythos is really as big a deal as claimed. My understanding: yes, it is a big fucking deal. Skeptics point out that open-source models can identify the same bugs as Mythos, but there are two flaws with their argument.</p><p>First: Mythos sifted through millions of lines of code in search of vulnerabilities. The people who used an open-source model to &#8220;reproduce&#8221; this work pointed the model at the precise snippet of code where Mythos had found a bug<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>. It&#8217;s easy to find Waldo if someone draws a big circle around him first.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!EfHN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22299862-ca30-4e24-a5c6-415cda9fad68_1024x768.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!EfHN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22299862-ca30-4e24-a5c6-415cda9fad68_1024x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!EfHN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22299862-ca30-4e24-a5c6-415cda9fad68_1024x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!EfHN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22299862-ca30-4e24-a5c6-415cda9fad68_1024x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!EfHN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22299862-ca30-4e24-a5c6-415cda9fad68_1024x768.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!EfHN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22299862-ca30-4e24-a5c6-415cda9fad68_1024x768.jpeg" width="1024" height="768" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/22299862-ca30-4e24-a5c6-415cda9fad68_1024x768.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!EfHN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22299862-ca30-4e24-a5c6-415cda9fad68_1024x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!EfHN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22299862-ca30-4e24-a5c6-415cda9fad68_1024x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!EfHN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22299862-ca30-4e24-a5c6-415cda9fad68_1024x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!EfHN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F22299862-ca30-4e24-a5c6-415cda9fad68_1024x768.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>The Where&#8217;s Waldo publishers should try this, people would get through the books a lot faster!</em></figcaption></figure></div><p>Second: Mythos can do more than find bugs. It can chain them into <strong>exploits</strong>, and in some cases it can carry out a <strong>full-blown attack</strong> against realistic software systems. This is much more difficult, and much more useful for an attacker. In Star Wars terms:</p><ol><li><p><strong>Finding a bug</strong>: analyze the plans for the Death Star and notice the small thermal exhaust port that leads directly to the reactor system.</p></li><li><p><strong>Crafting an exploit</strong>: piece together a workable plan of attack, incorporating the additional observations that the Death Star&#8217;s defenses &#8220;are designed around a direct large-scale assault&#8221; and aren&#8217;t tight enough to keep out an X-wing fighter; that the shaft is ray-shielded, but proton torpedoes can still penetrate; that X-wings do in fact carry proton torpedoes; and that an attacking ship can hide from the gun turrets by flying along the trench that leads up to the exhaust port.</p></li><li><p><strong>Carrying out an attack</strong>: launch a swarm of X-wings with skilled pilots, navigate past the fixed defenses, successfully engage or avoid the defending TIE fighters, traverse the trench, organize wingmen so the lead pilot can focus on the exhaust port, and release a torpedo at just the right moment.</p></li></ol><p>Existing models have pretty much been limited to step 1, finding bugs. Mythos is capable of finding a much larger number of bugs, has constructed working exploits for many of them (step 2), and gets farther than previous models in managing an end-to-end attack (step 3). It represents a sea change in cyber capabilities<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>.</p><p>In the very near term, Mythos looks like a net benefit to security. Under the name<a href="https://www.anthropic.com/glasswing"> Project Glasswing</a>, Anthropic is giving companies like Microsoft, Apple, the Linux Foundation, and Chase Bank a head start &#8211; granting early access to identify and correct bugs before an attacker can use Mythos to identify and exploit them.</p><p>What about the longer term? AI capabilities will progress very rapidly, and both attackers and defenders will take advantage. My guess is that the most professionally managed systems will wind up, on balance, more secure. But there are a lot of important systems in the world that are neglected, under-budgeted, or subject to excessive bureaucracy &#8211; and they may be in for a bad time. You don&#8217;t need a zero-day bug<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> to break into a system like that, but you do need an army of mid-skilled hackers to find the vulnerable systems and figure out a specific way in. AI is going to democratize the hell out of that kind of work. Watch out for more cybercrime &#8211; for instance, attacks on schools and hospitals<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a>.</p><p>Ordinary consumers might generally come out ahead, as our security depends primarily on device and service providers like Apple, Google, and Microsoft, who are well positioned to make good use of Mythos and its successors. But we may also be getting even more of those &#8220;your personal information may have been stolen&#8221; letters from the many other companies we do business with.</p><p>Meanwhile, consumers and businesses are rapidly adopting new AI-based tools. That increased pace of change brings increased opportunities for error. Time will tell how much risk we&#8217;re all willing to accept in return for access to the latest and greatest capabilities.</p><p>And of course I&#8217;ve only touched on one aspect of Mythos&#8217; capabilities &#8211; its <em>inadvertent</em> skill at cyber attack. Early indications are that it is also a major advance in areas that Anthropic deliberately trained it for, such as coding. What does that tell us about overall progress?</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><h2><strong>Things are progressing on schedule &#8211; the crazy schedule</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/billyhumblebrag/status/2041593835499749848" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!uoRC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73e314ad-96ab-4615-9936-a78620981de5_1366x382.png 424w, https://substackcdn.com/image/fetch/$s_!uoRC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73e314ad-96ab-4615-9936-a78620981de5_1366x382.png 848w, https://substackcdn.com/image/fetch/$s_!uoRC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73e314ad-96ab-4615-9936-a78620981de5_1366x382.png 1272w, https://substackcdn.com/image/fetch/$s_!uoRC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73e314ad-96ab-4615-9936-a78620981de5_1366x382.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!uoRC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73e314ad-96ab-4615-9936-a78620981de5_1366x382.png" width="1366" height="382" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/73e314ad-96ab-4615-9936-a78620981de5_1366x382.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:382,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/billyhumblebrag/status/2041593835499749848&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!uoRC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73e314ad-96ab-4615-9936-a78620981de5_1366x382.png 424w, https://substackcdn.com/image/fetch/$s_!uoRC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73e314ad-96ab-4615-9936-a78620981de5_1366x382.png 848w, https://substackcdn.com/image/fetch/$s_!uoRC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73e314ad-96ab-4615-9936-a78620981de5_1366x382.png 1272w, https://substackcdn.com/image/fetch/$s_!uoRC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F73e314ad-96ab-4615-9936-a78620981de5_1366x382.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Broadly speaking, there are three views of AI progress:</p><ul><li><p><strong>Dismissive</strong> &#8211; AI is mostly hype. It will amount to, at best, a mildly useful tool for some applications.</p></li><li><p><strong>Explosive</strong> &#8211; AI is heading toward &#8220;recursive self-improvement<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a>&#8221;, after which it will bootstrap itself into an all-powerful technology that transforms the Universe, empowering and/or extinguishing humanity in the process.</p></li><li><p><strong>It&#8217;s Complicated</strong> &#8211; AI is a big deal, but will take decades (or longer) to play out, and may hit a ceiling at some point.</p></li></ul><div class="pullquote"><p>It&#8217;s under-appreciated that the Explosive and It&#8217;s Complicated views do not diverge much in their predictions until the advent of &#8220;strong AGI&#8221;.</p></div><p>The dismissive view is already ruled out. We&#8217;ve seen far too much evidence of AI providing dramatic value in the real world. A financial correction (&#8220;bubble pop&#8221;) is always possible, but it would be a temporary thing, like the dot-com crash: reflecting over-eager investment, not a flawed concept.</p><p>The last few years have provided a lot of evidence which appears to support the explosive view. AI progress has been remarkably fast, and remarkably predictable. Many AI trends, when plotted on appropriate axes, look like straight lines<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a> which point toward transformative change in the next few years:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!eCTu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0945d4d-2499-46ca-bbf0-04a138e1bac2_1366x632.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!eCTu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0945d4d-2499-46ca-bbf0-04a138e1bac2_1366x632.png 424w, https://substackcdn.com/image/fetch/$s_!eCTu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0945d4d-2499-46ca-bbf0-04a138e1bac2_1366x632.png 848w, https://substackcdn.com/image/fetch/$s_!eCTu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0945d4d-2499-46ca-bbf0-04a138e1bac2_1366x632.png 1272w, https://substackcdn.com/image/fetch/$s_!eCTu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0945d4d-2499-46ca-bbf0-04a138e1bac2_1366x632.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!eCTu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0945d4d-2499-46ca-bbf0-04a138e1bac2_1366x632.png" width="1366" height="632" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b0945d4d-2499-46ca-bbf0-04a138e1bac2_1366x632.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:632,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!eCTu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0945d4d-2499-46ca-bbf0-04a138e1bac2_1366x632.png 424w, https://substackcdn.com/image/fetch/$s_!eCTu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0945d4d-2499-46ca-bbf0-04a138e1bac2_1366x632.png 848w, https://substackcdn.com/image/fetch/$s_!eCTu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0945d4d-2499-46ca-bbf0-04a138e1bac2_1366x632.png 1272w, https://substackcdn.com/image/fetch/$s_!eCTu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb0945d4d-2499-46ca-bbf0-04a138e1bac2_1366x632.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The most famous graph in AI (<a href="https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/">source</a>), because it&#8217;s basically <a href="https://secondthoughts.ai/p/measuring-ai-progress">the only graph we have that sheds any light on future trends</a></figcaption></figure></div><p>This progress, being predictable, has in fact been predicted, notably in <a href="https://ai-2027.com/">AI 2027</a> &#8211; the most famous articulation of the explosive view. (For an overview, see my <a href="https://secondthoughts.ai/p/ai-2027">writeup from last year</a>). In the year since <em>AI 2027</em> was published, events have played out more or less as portrayed. One contributor recently <a href="https://x.com/dschwarz26/status/2042652010080801073">posted some thoughts about what they got right</a>, and in February two of the lead authors <a href="https://blog.aifutures.org/p/grading-ai-2027s-2025-predictions">graded the predictions for 2025</a>. Accurate predictions include the emergence of powerful hacking abilities, military use of frontier AI models, and aggressive build-out of AI data centers.</p><p>This doesn&#8217;t mean that things will continue to progress on the <em>AI 2027</em> timeline. It&#8217;s under-appreciated that <a href="https://asteriskmag.substack.com/p/common-ground-between-ai-2027-and">the explosive and it&#8217;s-complicated views do not diverge much in their predictions until the advent of &#8220;strong AGI&#8221;</a>. It&#8217;s not unreasonable to take the leap in capabilities demonstrated by Claude Mythos as evidence for the explosive view. It&#8217;s also quite reasonable to believe that we&#8217;re on the It&#8217;s Complicated track. As I&#8217;ll explain below, the evidence is still ambiguous.</p><p>In fact, <a href="https://blog.aifutures.org/i/187696603/quantitative-pace-of-progress">progress on quantitative measures has only been about 65% as fast as predicted</a> in <em>AI 2027</em>. And I believe even this overstates progress, as I&#8217;ll explain in the next section<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a>. But logic aside, early 2026 has <em>felt</em> explosive. It&#8217;s one thing to have an intellectual expectation that AI models would soon be able to write full-fledged applications and autonomously construct security exploits, and quite another to see it happening. I lean toward the It&#8217;s Complicated camp in the near term, but the visceral sensation of change has a way of making it hard to trust the arguments. Let&#8217;s review those arguments.</p><h2><strong>AI is still a normal technology, for now</strong></h2><p>If <em>AI 2027</em> is the most famous expression of the Explosive view of AI, <em><a href="https://knightcolumbia.org/content/ai-as-normal-technology">AI as Normal Technology</a></em> is the canonical argument that It&#8217;s Complicated. <a href="https://secondthoughts.ai/p/ai-as-normal-technology">I wrote an entire post about it</a>. The key assertions:</p><ol><li><p>AI&#8217;s impressive showing on benchmarks and coding will struggle to generalize to broader real-world settings.</p></li><li><p>Even as AI develops genuinely useful capabilities, there will be (or should be) delays and limits on how it is adopted in practice.</p></li><li><p>AIs will not develop radically superhuman abilities<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-9" href="#footnote-9" target="_self">9</a>, because the real world is often too complex for abstract genius to eliminate the need for trial and error.</p></li></ol><p>In other words, while the authors agree that AI may be a monumentally important technology, they also believe that it will be subject to &#8220;normal&#8221; frictions that rule out the most aggressive scenarios and leave room for human agency. And indeed, even as new AI models keep racking up achievements, it&#8217;s still easy to find reminders of their limitations. A few recent examples:</p><ul><li><p>In <a href="https://secondthoughts.ai/p/the-centaur-era">The Centaur Era</a>, I noted serious errors committed by ChatGPT Pro and Claude Opus during a research project. For instance, Opus noticed that it had used a dramatically wrong figure as part of its analysis, but did nothing to correct the resulting incorrect conclusion.</p></li><li><p>Many people have predicted that AI would unleash a tsunami of sophisticated propaganda; this figures into the <em>AI 2027</em> scenario. <em>[EDIT: I was mistaken here. As AI 2027 co-author Daniel Kokotajlo <a href="https://secondthoughts.ai/p/is-ai-2027-coming-true/comment/254931219">notes in a comment below</a>, this prediction appears in a scenario he wrote way back in 2021.]</em> However, no tsunami has manifested; one of the authors <a href="https://www.update.news/p/ai-economics-needs-to-be-weird?open=false#%C2%A7why-ai-propaganda-hasnt-been-as-effective-as-feared">recently acknowledged</a> that &#8220;AI propaganda hasn&#8217;t been as effective as feared&#8221;.</p></li><li><p>It can be difficult for businesses to integrate AI into their work. In a recent report on Google Cloud&#8217;s <em>Next</em> conference, <a href="https://www.theinformation.com/">The Information</a> reporter Erin Woo writes:</p><blockquote><p>Last year at Google Cloud&#8217;s Next conference, executives touted the power of the company&#8217;s AI models for businesses. This year the theme is how to help companies use the models.</p><p></p><p>And with good reason: In interviews at the conference, customers and Google Cloud resellers told me that <strong>companies racing to adopt AI are hitting roadblocks</strong>. Some are still setting up their first AI agents, while others are struggling to manage a menagerie of them. [emphasis added]</p></blockquote></li><li><p>Even mighty Mythos has limitations. As <a href="https://www-cdn.anthropic.com/8b8380204f74670be75e81c820ca8dda846ab289.pdf">Anthropic reports</a>:</p><blockquote><p>&#8230;the model still cannot be left alone in a production environment... It frequently mistakes correlation with causation and it is not able to course-correct for different hypotheses. When asked to write incident retrospectives, more often than not it focuses on a single root cause and does not consider multiple contributing factors.</p></blockquote></li></ul><p>I mentioned that the authors of <em>AI 2027</em> recently reported that things have been progressing <a href="https://blog.aifutures.org/p/grading-ai-2027s-2025-predictions?open=false#%C2%A7quantitative-pace-of-progress">about 65% as fast as they&#8217;d predicted</a>. But I would argue that the most important component of their forecast is &#8220;AI software R&amp;D uplift&#8221; (the extent to which AI is accelerating its own development). By their estimate, <strong>uplift is only 17% of what they&#8217;d predicted at this point</strong>. The 65% figure primarily reflects metrics that are not directly related to real-world AI capabilities, such as benchmark scores and data center construction. I should note however that frontier lab revenue is at 80% of prediction (in their scoring, which may in fact be conservative here).</p><p><em>[EDIT: in <a href="https://secondthoughts.ai/p/is-ai-2027-coming-true/comments">several comments below</a>, AI 2027 co-authors Daniel Kokotajlo and Eli Lifland provide clarifications and corrections to what I&#8217;ve written here. Uplift &#8211; the extent to which AI tools are accelerating AI R&amp;D progress &#8211; is indeed well short of where the AI 2027 scenario predicts. However, the authors do believe they were on track regarding the rate at which uplift would progress; they merely have adjusted their view of where things stood in early 2025. So uplift is indeed short of the AI 2027 scenario, but may now be advancing at the predicted pace, just from a delayed starting point. And frontier AI lab revenue is in fact ahead of AI 2027&#8217;s predictions; the 80% figure I&#8217;m citing here, which is labeled &#8220;economic value&#8221; in the linked report, turns out to reflect company valuations in addition to revenue. Finally, valuations have jumped since the 80% figure was computed and are now &#8220;about on trend&#8221;. Daniel and Eli provided some other clarifications as well, see their comments.]</em></p><p>As for qualitative predictions, some of the claimed &#8220;wins&#8221; are debatable. For instance, they cite evidence that &#8220;Bioweapon capabilities seem on track&#8221;, but we&#8217;re currently publishing a series of posts explaining that <a href="https://secondthoughts.ai/p/tacit-knowledge-the-missing-factor">this may underestimate the complexity of bioweapon creation</a>.</p><p>AI is undeniably racking up achievement after achievement. It is becoming superhuman in the scale of tasks it can undertake &#8211; Mythos does not seem to have done anything that a human expert could not have done given enough time, but it has identified many bugs that experts never got around to. At the same time, current systems still have many shortcomings compared to the <a href="https://secondthoughts.ai/p/where-ai-falls-short">range, breadth, and depth of human cognition</a>, and have had limited impact so far. The question is whether that will last.</p><h2><strong>Where do things go from here?</strong></h2><p>If it seems like the rate of change is accelerating, that&#8217;s because it is. Get used to it: this staircase has no landings, there is no &#8220;new normal&#8221; for things to settle into. Rapid change <em>is</em> the new normal. The pace will likely continue to escalate.</p><p>AI is, for now, still a normal technology. As the <em>AI as Normal Technology</em> authors <a href="https://www.normaltech.ai/p/a-guide-to-understanding-ai-as-normal">note</a>, that &#8220;doesn&#8217;t mean mundane or predictable&#8221;; it means that its impact is limited by factors such as slow adoption. AI is advancing in capability and diffusing into the economy at a faster pace than most previous technologies, but that&#8217;s a difference in degree, not a difference in kind&#8230; for now, at least.</p><p>Here are a couple of possible future scenarios. In one, models really start to accelerate their own development over the course of 2026. There are already signs that this is happening:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/tszzl/status/2047386955550470245" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7Yut!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7180779-dc62-4c0f-8208-8bb9f08a06c5_1754x526.png 424w, https://substackcdn.com/image/fetch/$s_!7Yut!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7180779-dc62-4c0f-8208-8bb9f08a06c5_1754x526.png 848w, https://substackcdn.com/image/fetch/$s_!7Yut!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7180779-dc62-4c0f-8208-8bb9f08a06c5_1754x526.png 1272w, https://substackcdn.com/image/fetch/$s_!7Yut!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7180779-dc62-4c0f-8208-8bb9f08a06c5_1754x526.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7Yut!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7180779-dc62-4c0f-8208-8bb9f08a06c5_1754x526.png" width="1456" height="437" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b7180779-dc62-4c0f-8208-8bb9f08a06c5_1754x526.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:437,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/tszzl/status/2047386955550470245&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!7Yut!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7180779-dc62-4c0f-8208-8bb9f08a06c5_1754x526.png 424w, https://substackcdn.com/image/fetch/$s_!7Yut!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7180779-dc62-4c0f-8208-8bb9f08a06c5_1754x526.png 848w, https://substackcdn.com/image/fetch/$s_!7Yut!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7180779-dc62-4c0f-8208-8bb9f08a06c5_1754x526.png 1272w, https://substackcdn.com/image/fetch/$s_!7Yut!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb7180779-dc62-4c0f-8208-8bb9f08a06c5_1754x526.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">From an OpenAI researcher. &#8220;5.5&#8221; is a reference to OpenAI&#8217;s newly released AI model, GPT-5.5.</figcaption></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/aidan_mclau/status/2047388367705575701" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VH_6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F650a142f-bd10-4e74-86f3-156f8a783a09_1734x470.png 424w, https://substackcdn.com/image/fetch/$s_!VH_6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F650a142f-bd10-4e74-86f3-156f8a783a09_1734x470.png 848w, https://substackcdn.com/image/fetch/$s_!VH_6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F650a142f-bd10-4e74-86f3-156f8a783a09_1734x470.png 1272w, https://substackcdn.com/image/fetch/$s_!VH_6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F650a142f-bd10-4e74-86f3-156f8a783a09_1734x470.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VH_6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F650a142f-bd10-4e74-86f3-156f8a783a09_1734x470.png" width="1456" height="395" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/650a142f-bd10-4e74-86f3-156f8a783a09_1734x470.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:395,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/aidan_mclau/status/2047388367705575701&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!VH_6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F650a142f-bd10-4e74-86f3-156f8a783a09_1734x470.png 424w, https://substackcdn.com/image/fetch/$s_!VH_6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F650a142f-bd10-4e74-86f3-156f8a783a09_1734x470.png 848w, https://substackcdn.com/image/fetch/$s_!VH_6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F650a142f-bd10-4e74-86f3-156f8a783a09_1734x470.png 1272w, https://substackcdn.com/image/fetch/$s_!VH_6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F650a142f-bd10-4e74-86f3-156f8a783a09_1734x470.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Another OpenAI researcher, explaining that GPT-5.5 can work independently for days at a time.</figcaption></figure></div><p>Perhaps the most telling sign that OpenAI and Anthropic coding tools are accelerating AI R&amp;D is <a href="https://x.com/Steve_Yegge/status/2046260541912707471">this report</a> that <strong>Google DeepMind engineers threatened to quit if they were required to use Google&#8217;s own product instead</strong>. As a result, the current blistering pace of progress may soon seem so slow as to almost be quaint, and AI will begin to have an unambiguous impact in various sectors of the economy, science, and society. That&#8217;s my conservative scenario!</p><p>The aggressive scenario follows the outline of <em>AI 2027</em>, though probably stretched out by a few years. AI fully automates its own development, and then quickly &#8220;solves&#8221; robotics and automates other critical functions. Barriers to adoption are swept away, through some combination of overwhelming competitive pressure and AI becoming flexible enough to adapt to the workplace rather than the workplace needing to adapt to it. Whatever comes next will be <a href="https://secondthoughts.ai/p/the-unrecognizable-age">unrecognizable</a>.</p><p>It&#8217;s hard to tell which path we&#8217;re on. Here are some of the key questions:</p><ol><li><p>Will the rapid progress in AI coding ability lead to full automation of AI R&amp;D, or are &#8220;softer&#8221; skills also needed<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-10" href="#footnote-10" target="_self">10</a>?</p></li><li><p>Can the exponential growth of data center construction be sustained? If not, will progress in AI capabilities slow down as the tailwind of ever-increasing research compute budgets fades?</p></li><li><p>Would automated R&amp;D result in rapid progress in AI capabilities across all domains, from scientific research to strategic planning? Or will advances still be bottlenecked on feedback from real-world deployment?</p></li><li><p>Will increasing AI capability and flexibility overcome traditional barriers to rapid adoption?</p></li><li><p>How quickly will robots &#8220;happen&#8221;?</p></li></ol><p>I&#8217;m increasingly convinced that there&#8217;s a singularity coming somewhere in the next few decades; the questions above will determine when. I&#8217;m even more confident that the current, disorienting rate of change is the slowest we&#8217;re ever likely to see.</p><p><em>In subsequent installments of this series on the state of AI, I&#8217;ll cover topics such as competition between US and Chinese AI labs, data centers, impacts on the job market and elsewhere, and beneficial applications of AI. <strong>Use the comments to suggest additional topics!</strong></em></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/is-ai-2027-coming-true?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/is-ai-2027-coming-true?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Lots of people provided suggestions and feedback for this series! Thanks to Abi Olvera, Alex Booker, Clara Collier, Dave Kasten, Eli Pariser, Emma Kumleben, Gideon Lichfield, Kai Williams, Taren Stinebrickner-Kauffman, and Tim Schnabel. Apologies if I missed anyone!</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Note that these figures do not necessarily imply that Anthropic&#8217;s business is now larger than OpenAI. There are important differences in how the two companies measure revenue, these figures are snapshots from different months, and some of the numbers are unofficial.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>If you had the budget, you could use an open-source model to analyze all of the code that Mythos analyzed. However, you&#8217;d wind up drowning in &#8220;false positives&#8221;: the open-source models would likely find some of the bugs that Mythos found, but they&#8217;d also generate zillions of incorrect bug reports, and it would be difficult to separate the wheat from the chaff.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Benchmark results suggest that OpenAI&#8217;s GPT-5.5, which was released a couple of weeks after Mythos, <a href="https://pointestimate.substack.com/p/how-good-is-mythos">may be its equal</a> in cybersecurity capabilities. <a href="https://x.com/ArbResearch/status/2051955928069517599">Other signals suggest</a> that Mythos really is a step ahead and the benchmarks are missing something. Taking into account some private scuttlebutt, I lean toward the latter take.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>A &#8220;zero-day bug&#8221; is a software flaw which the developer has known about for zero days, i.e. one which they have not yet had any time to fix (indeed they may not be aware of it at all).</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>For instance, &#8220;ransomware&#8221; attacks, in which a hacker &#8220;kidnaps&#8221; an organization&#8217;s computer systems by encrypting all of the data, and offering to remove the encryption in return for a payment, typically made in cryptocurrency.</p><p>Note that, while we may see an increase in successful cyberattacks, the impact may be limited by non-technical bottlenecks. See our report from last year, <a href="https://www.goldengateinstitute.org/cyber-security-report/the-cybersecurity-equilibrium">The Cybersecurity Equilibrium</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>Recursive Self-Improvement refers to the idea that sufficiently advanced AI may accelerate its own development, potentially leading to runaway, explosive progress. See <em><a href="https://cset.georgetown.edu/publication/when-ai-builds-ai/">When AI Builds AI</a></em>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p><em>Some measures of AI capability actually seem to be accelerating of late, as indicated in a <a href="https://epoch.ai/blog/have-ai-capabilities-accelerated">report by Epoch</a> that I linked to earlier. The graph shown here reflects that accelerating trend.</em></p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p>The authors of <em>AI 2027</em> also now believe things will move <a href="https://blog.aifutures.org/p/q1-2026-timelines-update">somewhat more slowly</a> than in the published scenario, though they still think <a href="https://www.aifuturesmodel.com/forecast/eli-04-02-26?timeline=ASI&amp;cmode=forecaster&amp;csim=daniel-04-02-26&amp;ctype=atc">it&#8217;s quite possible that superintelligence will emerge within 18 months</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-9" href="#footnote-anchor-9" class="footnote-number" contenteditable="false" target="_self">9</a><div class="footnote-content"><p>Of course AI systems are already superhuman in attributes such as speed and patience, and other software systems are superhuman at arithmetic, chess, and many other things. Here, when I raise the question of whether AI will develop radically superhuman abilities, I mean something like &#8220;the ability to figure things out that no person, given enough time, could have figured out&#8221;. For instance, designing a perfect drug without any trials in humans or even lab mice.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-10" href="#footnote-anchor-10" class="footnote-number" contenteditable="false" target="_self">10</a><div class="footnote-content"><p>EDIT: in a comment on this post, Eli Lifland notes:</p><blockquote><p>To be more precise, I think everyone agrees non-coding skills are needed to fully automate AI R&amp;D, but some people think it's likely that either automating coding is enough to bootstrap you to automate the others, or that automating coding will generalize well enough (but not perfectly well) to other aspects of AI R&amp;D, such that full automation is close by.</p><p>And I think missing from your list of questions is how fast AI will progress after full AI R&amp;D automation to different levels of superhuman AI R&amp;D capabilities, and how high the ceiling is above the human range.</p></blockquote></div></div>]]></content:encoded></item><item><title><![CDATA[Tacit Knowledge: The Missing Factor in AI Bio Risk Assessments]]></title><description><![CDATA[Lab skills come from hands-on mentorship, not from reading the entire internet.]]></description><link>https://secondthoughts.ai/p/tacit-knowledge-the-missing-factor</link><guid isPermaLink="false">https://secondthoughts.ai/p/tacit-knowledge-the-missing-factor</guid><dc:creator><![CDATA[Abi Olvera]]></dc:creator><pubDate>Thu, 23 Apr 2026 15:03:01 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!QFmb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4a77705-38bc-4c08-ba2c-0c7a615510e8_2816x1536.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>This is the second installment in a series examining the potential for AI to lower the bar to creation and use of bioweapons, written by Golden Gate Institute for AI&#8217;s <a href="https://abio.substack.com/">Abi Olvera</a>. <a href="https://secondthoughts.ai/p/why-arent-bioweapons-common">Part one</a> explains why bioweapons are difficult to create and rarely used. This installment (which you can read without reading part one) presents the importance of &#8220;tacit knowledge&#8221; &#8211;&nbsp;including muscle memory and sense memory critical to delicate lab procedures. I found this essay to be interesting for two reasons. First, it&#8217;s a fascinating exploration of a subject I knew nothing about &#8211; and an excellent example of important skills that current AI training methods don&#8217;t seem to capture. Second, it helps explain why some parts of the AI safety community may be overestimating the bioweapons threat, at least in the near to mid-term.</em></p><p>If you wanted to build a bioweapon, the first thing you&#8217;d notice is that finding instructions isn&#8217;t the hard part.</p><p>Published protocols<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> exist for most of the relevant techniques. Research papers describe how to culture cells, assemble DNA, and grow pathogens. The information has been available for decades. Large language models have made it faster and easier to find.</p><p>But biologists who have spent years at the lab bench will tell you that there is an enormous gap between having instructions and successfully executing them. That gap is filled with knowledge that lives only in practitioners&#8217; hands and heads, intuitions developed through repetition, mentorship, and thousands of small failures. You get better by failing repeatedly under the mentorship of someone who&#8217;s been doing it for years. Biology operates on an apprenticeship model, more like plumbing or surgery than other STEM fields. You learn by watching someone else, then you teach the next person.</p><p>In the literature on expertise, these important, unwritten details are called tacit knowledge.</p><p>Understanding tacit knowledge is important for evaluating AI biosecurity risks because it shapes how much AI can help someone attempting to build a bioweapon. If the hard part is finding information, then AI changes the picture significantly. If the hard part is executing physical procedures that depend on feel, judgment, and years of practice, then AI&#8217;s contribution is more limited, at least for now.</p><p>To see what tacit knowledge looks like in practice, it helps to start with the most basic thing biologists do: move liquid from one container to another.</p><p><strong>Everything in biology is wet</strong></p><p>Almost every step in biological work involves liquid.</p><p>DNA is assembled in liquid. Vaccines, IVs, and blood transfusions are liquid. Even pills must dissolve into liquid for absorption. When NASA looks for signs of life on other planets, it primarily looks for water. Life, as we understand it, requires it.</p><p>If you want to build a bioweapon, you are working in liquid at nearly every step.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a></p><p>The most common way to move these liquids is pipetting. A pipette is an extremely precise eye dropper. Any biology student has used one.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QFmb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4a77705-38bc-4c08-ba2c-0c7a615510e8_2816x1536.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QFmb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4a77705-38bc-4c08-ba2c-0c7a615510e8_2816x1536.png 424w, https://substackcdn.com/image/fetch/$s_!QFmb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4a77705-38bc-4c08-ba2c-0c7a615510e8_2816x1536.png 848w, https://substackcdn.com/image/fetch/$s_!QFmb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4a77705-38bc-4c08-ba2c-0c7a615510e8_2816x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!QFmb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4a77705-38bc-4c08-ba2c-0c7a615510e8_2816x1536.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QFmb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4a77705-38bc-4c08-ba2c-0c7a615510e8_2816x1536.png" width="1456" height="794" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b4a77705-38bc-4c08-ba2c-0c7a615510e8_2816x1536.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:794,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:7730094,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://abio.substack.com/i/194446761?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4a77705-38bc-4c08-ba2c-0c7a615510e8_2816x1536.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!QFmb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4a77705-38bc-4c08-ba2c-0c7a615510e8_2816x1536.png 424w, https://substackcdn.com/image/fetch/$s_!QFmb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4a77705-38bc-4c08-ba2c-0c7a615510e8_2816x1536.png 848w, https://substackcdn.com/image/fetch/$s_!QFmb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4a77705-38bc-4c08-ba2c-0c7a615510e8_2816x1536.png 1272w, https://substackcdn.com/image/fetch/$s_!QFmb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb4a77705-38bc-4c08-ba2c-0c7a615510e8_2816x1536.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Pipetting sounds simple. It isn&#8217;t.</p><p>Getting it right depends on tip position, liquid viscosity, the speed at which you draw and release, room temperature, and humidity. Students sometimes attempt it 50+ times before it works. Like a grandmother who knows exactly how much a &#8220;pinch&#8221; means, practitioners develop a feel for specific materials over the years.</p><p>Biology makes liquid handling especially unforgiving for three reasons.</p><p><strong>Fragility.</strong> Biological molecules break easily. Even hand tremors can shear some DNA strands. One scientist told me that, in their pure form, without the stabilizing proteins of an organism or packaging, some long DNA molecules struggle to survive natural desk vibrations from traffic and seismic activity. Some proteins are so temperature-sensitive that the local pressure from moving a liquid creates too much heat.</p><p>A written protocol will say &#8220;pipette gently.&#8221; But &#8220;gently enough&#8221; depends on the specific molecule, the specific volume, the viscosity of your buffer, the type of pipette you use, how long the sample has been sitting out, etc. Experienced researchers develop a feel for this. They modulate their thumb pressure on the pipette plunger the way a guitarist modulates finger pressure on a string. A novice following the same written protocol will damage the sample without knowing why.</p><p>Fragility also makes it important to learn the quirks of each lab&#8217;s specific equipment. An incubator that drifts half a degree can kill an entire culture. A practitioner might need to know that their particular incubator runs hot on days when ambient lab temperature rises with the weather outside.</p><p><strong>Concentration.</strong> In biology, the ratio of how much material is dissolved in a given volume matters as much as the raw amount of material. To stitch four pieces of DNA into one larger piece, each piece has to physically encounter its match. That&#8217;s dictated by chance. If the solution is too dilute, the molecules never find each other. Getting concentrations right is partly a matter of following protocols, but experienced researchers have also developed an eye for when things look off.</p><p>They can tell whether the slight cloudiness of a solution indicates that the DNA concentration is within the correct range. They notice when purifier chemicals are starting to fail from subtle flow changes, or when the color of a gel stain indicates that fragments are the expected size. A protocol might say &#8220;check concentration with a spectrophotometer,&#8221; but a practiced scientist recognizes when a bubble in the container is throwing off the reading, or when a residual chemical from a previous step is inflating the measurement.</p><p><strong>Contamination. </strong>In chemistry, one part per million of contamination is usually negligible. In biology, contamination with unwanted organisms replicates across generations, compounding with each cycle. A single stray bacterium in a culture can, within hours, overwhelm the organism you were trying to grow.</p><p>For example, a speck of household dust is a miniature ecosystem containing bacteria, fungal spores, and skin cells, each with its own DNA. During cell culture and pathogen cultivation, a fungal spore from dust can land in your growth medium and outcompete the organism you&#8217;re cultivating. Stray DNA fragments from dust can mess with your genetic construct.</p><p>This is one reason labs use biosafety cabinets with HEPA-filtered airflow and why practitioners are obsessive about sterile techniques. Avoiding contamination requires tacit knowledge: how to angle a pipette tip away from the tube wall, when to swap gloves, and how to catch the faint smell of a contaminated culture before anything looks wrong under a microscope. It means knowing what actually constitutes sterile enough, such as noticing that your HVAC is circulating dust from the construction next door, or that the faint haze around your laminar flow hood means its filter is degrading. Experienced lab workers know which items to sterilize together and how to load them so steam actually penetrates. A person working in a garage or improvised lab would often not know what they didn&#8217;t know. Their cultures would fail, and they might not understand why for weeks &#8212; if ever.</p><p><strong>What happens when tacit knowledge is missing</strong></p><p>Challenges like these arise at each of the many steps involved in creating a biological agent, and they interact with each other. Scientists often describe biological work as months of failed attempts. The individual steps are not conceptually hard, but each one requires dozens of micro-judgments learned through repetition, muscle memory, mentorship, and trial and error. Many of these judgments are difficult to articulate, let alone replicate from a manual.</p><p>The Japanese cult Aum Shinrikyo provides a case study. This wealthy doomsday cult had <a href="https://irp.fas.org/congress/1995_rpt/aum/part04.htm">an estimated $1 billion</a> and a team of specialists, including <a href="https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(18)31869-5/fulltext">doctors, engineers, and microbiologists</a>. The group sprayed what they believed was anthrax from a rooftop in Kameido.<a href="https://wwwnc.cdc.gov/eid/article/10/1/03-0238_article"> No one was harmed</a>. The failure was due to many missteps. The cult failed to<a href="https://www.researchgate.net/publication/11630710_Molecular_Investigation_of_the_Aum_Shinrikyo_Anthrax_Release_in_Kameido_Japan#:~:text=strains%20that%20are%20missing%20the,only%20four%20naturally%20occurring%20B."> make their Anthrax strain</a> sufficiently virulent.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a> Even if the strain had been virulent, the spore concentration was too low to infect anyone. The liquid was too viscous for the spores to form the fine airborne particles needed for inhalation. The gelatinous suspension clogged the steam generator.</p><p>On paper, each step might have looked correct. In practice, every step required judgment and expertise the cult didn&#8217;t have: recognizing that a solution is too thick for aerosolization before loading it, understanding that spore concentration needs to be higher than a na&#239;ve reading of the literature suggests, and knowing how to validate that a strain is actually dangerous before investing months in the rest of the process. The cult&#8217;s experts weren&#8217;t up to the task because they lacked the experience with the specific steps involved in deploying Anthrax.</p><p><strong>Could technology replace tacit knowledge?</strong></p><p>None of these difficulties is permanent.</p><p>Purpose-built automated systems can be tuned, over many iterations, to encode the micro-judgments that experienced scientists make unconsciously &#8212; the right pipetting speed for a specific reagent, the exact pause needed after dispensing to avoid bubbles, the temperature ramp rate that won&#8217;t shear long DNA fragments. AI systems can provide real-time video feedback on lab manipulations.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Trwh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf95ebe0-6e98-4db2-b5d4-eb5f1cc8a48e_1200x800.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Trwh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf95ebe0-6e98-4db2-b5d4-eb5f1cc8a48e_1200x800.png 424w, https://substackcdn.com/image/fetch/$s_!Trwh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf95ebe0-6e98-4db2-b5d4-eb5f1cc8a48e_1200x800.png 848w, https://substackcdn.com/image/fetch/$s_!Trwh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf95ebe0-6e98-4db2-b5d4-eb5f1cc8a48e_1200x800.png 1272w, https://substackcdn.com/image/fetch/$s_!Trwh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf95ebe0-6e98-4db2-b5d4-eb5f1cc8a48e_1200x800.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Trwh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf95ebe0-6e98-4db2-b5d4-eb5f1cc8a48e_1200x800.png" width="1200" height="800" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/df95ebe0-6e98-4db2-b5d4-eb5f1cc8a48e_1200x800.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:800,&quot;width&quot;:1200,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!Trwh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf95ebe0-6e98-4db2-b5d4-eb5f1cc8a48e_1200x800.png 424w, https://substackcdn.com/image/fetch/$s_!Trwh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf95ebe0-6e98-4db2-b5d4-eb5f1cc8a48e_1200x800.png 848w, https://substackcdn.com/image/fetch/$s_!Trwh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf95ebe0-6e98-4db2-b5d4-eb5f1cc8a48e_1200x800.png 1272w, https://substackcdn.com/image/fetch/$s_!Trwh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdf95ebe0-6e98-4db2-b5d4-eb5f1cc8a48e_1200x800.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em><a href="https://www.integra-biosciences.com/global/en/pipetting-robots">Integra&#8217;s Pipetting Robot</a></em></figcaption></figure></div><p>This is already happening in some settings.</p><p>Pharmaceutical companies and well-funded research labs run automated systems painstakingly calibrated for specific workflows.</p><p>But these are purpose-built facilities optimized for particular projects, not general tools a novice could deploy. Each new workflow requires its own round of expert calibration.</p><p>As automated labs become more numerous and more varied, they will absorb more of the tacit knowledge that currently lives only in human hands. Even the most skeptical biosecurity experts I interviewed expected that general-purpose lab robots, guided by AI that has learned from thousands of experimental iterations, could eventually make expertise transferable in ways it isn&#8217;t today.</p><p>That world is not today&#8217;s world. Today, tacit knowledge remains a real barrier for someone working without mentorship and without the years of failure required to develop the intuitions that make each step work.</p><p>Furthermore, automated labs won&#8217;t eliminate all barriers. If someone outsources one chunk of the process to a commercial lab, there are still remaining steps. Ordering pre-assembled DNA is already straightforward; many companies will synthesize custom DNA sequences for research purposes. But converting that DNA into a living pathogen, or engineering it for greater lethality, involves work that no legitimate automated lab would perform.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> For policymakers, tracking how these partial capabilities interact is a harder problem than tracking any single capability in isolation.</p><p><strong>What this means for AI risk assessments</strong></p><p>Most evaluations of AI models for biology-relevant capabilities focus on textbook knowledge, such as whether a model can answer questions from the scientific literature or whether access to an LLM improves performance on standardized assessments. Some evaluations test laboratory knowledge, but few attempt to <a href="https://epochai.substack.com/p/do-the-biorisk-evaluations-of-ai">measure whether AI access changes outcomes</a> in real-world tasks and workflows, where tacit knowledge is the binding constraint.</p><p>Only a handful of efforts, like SecureBio&#8217;s <a href="https://www.virologytest.ai">virology evaluation</a>, have tried to test for practical knowledge that goes far beyond what&#8217;s in textbooks. Whether these capture enough of the tacit dimension to be useful predictors of real-world capability remains <a href="https://activesite.substack.com/p/rct">an open question</a>.</p><p>This is critical because it means we may be measuring the wrong thing. If AI&#8217;s contribution to biosecurity risk depends on its ability to support physical execution, not just information retrieval, then evaluations need to be designed accordingly. Risk forecasts that focus only on AI&#8217;s access to codified knowledge, rather than on how much of the tacit-knowledge barrier AI can actually erode, will systematically mis-measure the threat &#8212; and won&#8217;t provide a warning if and when AI tools start to acquire tacit knowledge.</p><p><em>Next in this series: the full chain of steps required to build a bioweapon, and an exploration of where AI can and cannot assist an attacker. Remember to read <a href="https://secondthoughts.ai/p/why-arent-bioweapons-common">part one</a> if you haven&#8217;t already!</em></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/tacit-knowledge-the-missing-factor?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:&quot;button-wrapper&quot;}" data-component-name="ButtonCreateButton"><a class="button primary button-wrapper" href="https://secondthoughts.ai/p/tacit-knowledge-the-missing-factor?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Steve Newman, Taren Stinebrickner-Kauffman, Mike Montague, Matt Sharkey, Gigi Gronvall, and David Manheim for suggestions and feedback.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>A protocol in biology is a step-by-step instruction manual for a laboratory procedure. It lists what materials to use, how much of each, in what order, and for how long.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Exceptions exist. The final stage of making anthrax spores involves a dry powder. However, most steps are wet.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p> Aum Shinrikyo started with a non-virulent strain found in nature.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p> This assumes automated labs are properly regulated. While proper regulation is likely, the rules aren&#8217;t yet in place.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[The Centaur Era]]></title><description><![CDATA[The question isn't what AI can do, it's what you can do with AI]]></description><link>https://secondthoughts.ai/p/the-centaur-era</link><guid isPermaLink="false">https://secondthoughts.ai/p/the-centaur-era</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Fri, 17 Apr 2026 23:59:25 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!CJcw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F616c83d5-5741-44b1-84dd-b5c77b36877f_1024x572.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!CJcw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F616c83d5-5741-44b1-84dd-b5c77b36877f_1024x572.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!CJcw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F616c83d5-5741-44b1-84dd-b5c77b36877f_1024x572.png 424w, https://substackcdn.com/image/fetch/$s_!CJcw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F616c83d5-5741-44b1-84dd-b5c77b36877f_1024x572.png 848w, https://substackcdn.com/image/fetch/$s_!CJcw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F616c83d5-5741-44b1-84dd-b5c77b36877f_1024x572.png 1272w, https://substackcdn.com/image/fetch/$s_!CJcw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F616c83d5-5741-44b1-84dd-b5c77b36877f_1024x572.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!CJcw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F616c83d5-5741-44b1-84dd-b5c77b36877f_1024x572.png" width="1024" height="572" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/616c83d5-5741-44b1-84dd-b5c77b36877f_1024x572.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:572,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:513701,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/194570887?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F616c83d5-5741-44b1-84dd-b5c77b36877f_1024x572.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!CJcw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F616c83d5-5741-44b1-84dd-b5c77b36877f_1024x572.png 424w, https://substackcdn.com/image/fetch/$s_!CJcw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F616c83d5-5741-44b1-84dd-b5c77b36877f_1024x572.png 848w, https://substackcdn.com/image/fetch/$s_!CJcw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F616c83d5-5741-44b1-84dd-b5c77b36877f_1024x572.png 1272w, https://substackcdn.com/image/fetch/$s_!CJcw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F616c83d5-5741-44b1-84dd-b5c77b36877f_1024x572.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>We talk about &#8220;AI capabilities&#8221; as if there are things AI can do, and things it can&#8217;t. But there&#8217;s a huge messy middle, where outcomes depend on patience, skill, and luck. ChatGPT might generate a sound analysis if prompted one way, but reflect the user&#8217;s biases if prompted differently. Gemini might diagnose my medical condition, but not yours. Claude Code might be unable to build a complex app on its own, but succeed with a bit of guidance from an experienced developer.</p><p>The most powerful uses of AI fall in this intermediate zone, and yet it is poorly studied and publicized. This discourages people from learning to get full value from AI tools, and confuses the debate regarding AI progress. Benchmarks don&#8217;t explore the sorts of open-ended tasks that often arise in the real world; they also don&#8217;t measure what people can accomplish when working in conjunction with AI<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. (Human-computer collaborations are sometimes called &#8220;centaurs&#8221;.)</p><p>Today&#8217;s post is a combo pack. I&#8217;m going to share my experience building an AI research agent, and the weaknesses it exposed in the reasoning abilities of current AI agents. I&#8217;ll explain how I managed to get good value from the AI&#8217;s research despite the flaws. I&#8217;ll wrap up by presenting implications for the pace of AI progress. And I&#8217;ll illustrate all of this through an analysis of Elon Musk&#8217;s proposal to put data centers in space.</p><h2><strong>Using AI as a Research Agent</strong></h2><p>I have a mental list that I keep adding to. You might call it Topics Where Someone Really Ought to Do The Math. These are debated questions about the real world, that would benefit from a hard-nosed quantitative analysis. I&#8217;d like to do those analyses, but who has the time? AI, that&#8217;s who.</p><p>As a starter problem for AI investigation, I chose the idea of building data centers in orbit. There&#8217;s been an interesting debate about this recently. Elon Musk<a href="https://techcrunch.com/2026/02/02/elon-musk-spacex-acquires-xai-data-centers-space-merger/"> used it as justification</a> for merging his companies SpaceX and xAI, aiming for an IPO at a stratospheric valuation of $2 trillion. Prominent GPU industry analyst Dylan Patel<a href="https://www.dwarkesh.com/p/dylan-patel"> argues that it&#8217;s economically impractical</a>.</p><p>Why would anyone take a perfectly good data center and blast it into space? The argument goes like this: in the race to build ever-larger clusters of AI chips, it&#8217;s getting hard to find locations with an adequate supply of electricity. In orbit, there&#8217;s 24-hour solar power, the inky blackness provides free cooling, and there are no environmental regulations or construction moratoriums. (Spoiler alert: at least two of these claims turn out to be exaggerated.) So there&#8217;s no need to keep scrounging for gigawatts on Earth.</p><p>Does this argument hold water? To see whether the question falls within the zone of current AI competence, I posed it to ChatGPT Pro &#8211; generally acknowledged to be the most capable analysis tool among mainstream AI offerings.</p><h2><strong>ChatGPT Pro Thinks for 42 Minutes and Comes Back With a Useless Muddle</strong></h2><p>I told it to start by reading three recent publications, including transcripts of interviews with Elon Musk and Dylan Patel:</p><blockquote><p>Prepare a report on the potential economical viability of orbital data centers for AI inference over the next 15 years. Specifically address the relevant ideas and arguments presented in <a href="https://www.dwarkesh.com/p/dylan-patel">https://www.dwarkesh.com/p/dylan-patel</a>, <a href="https://www.dwarkeshcom/p/elon-musk">https://www.dwarkeshcom/p/elon-musk</a>, and<a href="https://caseyhandmer.wordpress.com/2026/02/10/i-guess-were-doing-moon-factories-now/"> https://caseyhandmer.wordpress.com/2026/02/10/i-guess-were-doing-moon-factories-now/</a>. Make sure to do your own research and analysis &#8211; don&#8217;t rely excessively on these three sources.</p></blockquote><p>Its report is, like many AI outputs, a sort of Rorschach test. At first glance, it appears impressively definitive and erudite; on closer inspection, it&#8217;s a pile of worthless mush. Readers can easily come away confirming whatever preconceptions they&#8217;d held, whether they believe (a) orbital data centers are the next big thing, (b) they&#8217;re the next big scam, or (c) irrelevant because any such AI report will be stochastic-pigeon slop.</p><p>The seven page report<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> cites 25 sources. The conclusions align with all three of the expert sources I provided, which is no small feat given that they extensively contradict one another. It&#8217;s full of jargon &#8211; all used correctly. <strong>But there&#8217;s no logic to the analysis. As the expression goes, it&#8217;s &#8220;not even wrong&#8221;.</strong></p><div class="file-embed-wrapper" data-component-name="FileToDOM"><div class="file-embed-container-reader"><div class="file-embed-container-top"><image class="file-embed-thumbnail-default" src="https://substackcdn.com/image/fetch/$s_!0Cy0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack.com%2Fimg%2Fattachment_icon.svg"></image><div class="file-embed-details"><div class="file-embed-details-h1">Orbital Data Centers Memo Final</div><div class="file-embed-details-h2">228KB &#8729; PDF file</div></div><a class="file-embed-button wide" href="https://secondthoughts.ai/api/v1/file/f838347e-6a61-4b47-a92a-2b9e5ea6c93e.pdf"><span class="file-embed-button-text">Download</span></a></div><a class="file-embed-button narrow" href="https://secondthoughts.ai/api/v1/file/f838347e-6a61-4b47-a92a-2b9e5ea6c93e.pdf"><span class="file-embed-button-text">Download</span></a></div></div><p>The report doesn&#8217;t present any answer to the central question: how much will orbital data centers cost, and how does that compare to the earthbound alternative? It throws around plenty of relevant figures &#8211; solar cell efficiency, launch costs, etc. &#8211; but at no point does it assemble them into a comprehensive total or tie them to any conclusion. Instead, it confidently asserts a narrative that incorporates elements from each of the three initial sources I&#8217;d instructed it to read. Like a parent unwilling to play favorites, it appears to have looked for a way to agree with all three sources, rather than performing critical analysis.</p><p>So, that&#8217;s what I got in return for a tiny investment in effort &#8211; a single prompt. Next, I&#8217;ll present a more in-depth project to generate an AI analysis of this topic.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><h2><strong>Creating A Research Agent In my Own Image</strong></h2><p>I&#8217;m a systematic kind of guy. If I were investigating orbital data centers, I&#8217;d begin by framing a precise question:</p><blockquote><p>In what year is the cost of launching and operating an orbital data center likely to become competitive with terrestrial data centers?</p></blockquote><p>This immediately suggests additional questions. What are the major costs involved in each type of data center? How will those costs evolve over time? Each investigation may spawn further questions. For instance, Elon Musk argues that it will soon be impossible to generate enough electricity for earthbound data centers. That begs questions such as: What will the demand for new data centers be? Will there be enough chips to fill them? What in fact are the prospects for powering them &#8211; whether with solar power, gas turbines, unused capacity from the existing electrical grid, or otherwise?</p><p>My typical research process is something like this:</p><ol><li><p>Frame the question in precise terms.</p></li><li><p>Do some initial research.</p></li><li><p>Informed by this research, break the question down into smaller questions.</p></li><li><p>Research each sub-question, and combine the findings into an overall report.</p></li><li><p>Cross-check against key sources (such as the Elon Musk and Dylan Patel interviews). If they disagree with our findings, and our existing research doesn&#8217;t provide evidence to refute them, then perform further research to decide who is correct.</p></li></ol><p>In this process, I keep adding new questions, and new sources bearing on those questions, until I can:</p><ol><li><p>Provide a clear answer to the original question.</p></li><li><p>Explain how the analysis relates to each key source. Hopefully, on a point-by-point basis, I can either explain how my analysis is consistent with the source, or cite evidence that the source is incorrect.</p></li></ol><p>I had Claude Code create a &#8220;skill&#8221; (basically, a set of guidelines) for itself to follow precisely this process. Over a period of a week, Claude and I worked together to refine the skill. It now incorporates multiple &#8220;passes&#8221;, each of which can be run multiple times. A pass might check for arithmetic errors, verify that each conclusion is supported by evidence, or prepare a point-by-point comparison of our conclusions to those of a published paper. The results are... still not what I&#8217;d hoped for, but they show promise. They helped me assemble what I believe to be a fairly well-grounded picture, which I&#8217;ll present next.</p><h2><strong>Stop Trying to Make Orbital Data Centers a Thing</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!0vPY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a04a3f-6fbf-43a0-962a-2ac9a9a73078_1396x662.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!0vPY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a04a3f-6fbf-43a0-962a-2ac9a9a73078_1396x662.png 424w, https://substackcdn.com/image/fetch/$s_!0vPY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a04a3f-6fbf-43a0-962a-2ac9a9a73078_1396x662.png 848w, https://substackcdn.com/image/fetch/$s_!0vPY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a04a3f-6fbf-43a0-962a-2ac9a9a73078_1396x662.png 1272w, https://substackcdn.com/image/fetch/$s_!0vPY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a04a3f-6fbf-43a0-962a-2ac9a9a73078_1396x662.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!0vPY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a04a3f-6fbf-43a0-962a-2ac9a9a73078_1396x662.png" width="1396" height="662" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b8a04a3f-6fbf-43a0-962a-2ac9a9a73078_1396x662.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:662,&quot;width&quot;:1396,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1426884,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/194570887?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a04a3f-6fbf-43a0-962a-2ac9a9a73078_1396x662.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!0vPY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a04a3f-6fbf-43a0-962a-2ac9a9a73078_1396x662.png 424w, https://substackcdn.com/image/fetch/$s_!0vPY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a04a3f-6fbf-43a0-962a-2ac9a9a73078_1396x662.png 848w, https://substackcdn.com/image/fetch/$s_!0vPY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a04a3f-6fbf-43a0-962a-2ac9a9a73078_1396x662.png 1272w, https://substackcdn.com/image/fetch/$s_!0vPY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb8a04a3f-6fbf-43a0-962a-2ac9a9a73078_1396x662.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">No</figcaption></figure></div><p>Here&#8217;s the punchline: <strong>under present conditions, it simply does not make sense to put data centers in space</strong>. The only potential advantage would be access to plentiful solar power, and that&#8217;s not enough to make up for the complexity and expense of orbital deployment. In the extensive research performed by Claude, as well as the ChatGPT Pro report, no other advantages were identified that strike me as significant. Most of the cost of a terrestrial data center goes to components such as GPUs, memory chips, and networking equipment &#8211; all of which would be equally needed in space. This leaves very little scope for orbital data centers to come out ahead.</p><p>It is indeed getting harder to find places to connect data centers to the grid, at least in the US. However, the data centers are still getting built, often relying on off-grid solutions such as gas-powered generators. Because electricity constitutes only 10-20% of the lifetime cost to build and operate a data center<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>, even expensive workarounds don&#8217;t add all that much to the overall project expense. The limiting factor on data center construction continues to be chip supply, not power<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a>.</p><p>Placing data centers in orbit would introduce a <em>long</em> list of expenses and challenges, such as:</p><ul><li><p><strong>Launch costs</strong>. Each AI server will require a healthy allocation of solar panels, batteries<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a>, cooling equipment<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a>, and other hardware. Putting all of this in orbit costs hundreds of dollars per pound today<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a>, and even very optimistic projections are still tens of dollars per pound.</p></li><li><p><strong>Financing costs</strong>. Data centers have very high upfront costs. Lenders or investors will charge a higher premium for financing higher-risk space deployments.</p></li><li><p><strong>Commissioning delays</strong>. It takes months to go from electronic components delivered to a satellite manufacturer, to the satellite being in orbit and operational<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-8" href="#footnote-8" target="_self">8</a>. This would waste the most valuable period of a GPU&#8217;s operational life, and add to financing costs.</p></li><li><p><strong>Lifetime reduction</strong>: launch stress, radiation, and temperature variations may lead to premature GPU failure.</p></li><li><p><strong>Inability to repair</strong>: AI data centers need maintenance to replace faulty components and fix flaky network connections. This is not currently practical in space.</p></li></ul><p>I&#8217;ve only hit a few highlights; there are <em>many</em> additional complications, unproven technologies, and other risk factors to be considered, without even getting into unknown unknowns.</p><p>One dilemma: how large should individual AI satellites be? On Earth, the trend is to group ever-larger numbers of GPU chips into a single &#8220;scale-up domain&#8221;, with intricately designed, ultra-high-speed network connections between chips. Current scale-up domains fill an entire server rack, using around 120kW of power, and megawatt-scale scale-up domains are on the horizon. By comparison, the entire International Space Station typically uses less than 100kW<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-9" href="#footnote-9" target="_self">9</a>.</p><p>The move to closely integrate large numbers of chips is driven by the fact that larger scale-up domains can operate AI models more efficiently. However, building individual satellites of this size poses serious challenges, from the likely need for in-orbit assembly, to the increased likelihood of component failures.</p><p>On the terrestrial side, it&#8217;s true that there is a supply crunch for electrical components such as transformers and gas turbines. However, <a href="https://x.com/dwarkesh_sp/status/2042256222242349146">there are many ways</a> to <a href="https://www.volts.wtf/p/doing-data-centers-the-not-dumb-way">work around this problem</a>. Anyone who wants to argue that it would be difficult to increase production of these components, will need to explain why that is harder than bootstrapping the entire new industries that would be required for orbital data centers.</p><p>It&#8217;s also true that anti-data-center sentiment is rising in some parts of the US, sometimes making it harder to get construction approvals. In Maine, a bill imposing an 18-month moratorium is awaiting the governor&#8217;s signature. However, so far as I can tell, the industry is still finding places to build; I&#8217;ve seen no reports of GPU racks piling up in warehouses. And finding places to host a massive number of rocket launches would pose its own challenges.</p><p>The inescapable conclusion is that under current circumstances, orbital data centers are something of a fool&#8217;s errand<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-10" href="#footnote-10" target="_self">10</a>. This seems unlikely to change until one of the two following things occur:</p><ol><li><p>We&#8217;ve built so many data centers that <em>a significant portion of Earth is covered in solar panels</em> to supply power.</p></li><li><p>We&#8217;ve established a substantial industrial economy in space for other reasons &#8211; such that there is an established population of either astronauts or robots to perform construction and repairs.</p></li></ol><p>Neither of these seems plausible for at least the next 5 to 10 years. As a result, orbital data centers are a topic for long-range research, not active commercialization. After reviewing the report generated by my new automated research agent, I feel reasonably confident in making this claim. However, I&#8217;m basing that on my own analysis &#8211; the agent&#8217;s analysis is riddled with flaws.</p><h2><strong>My Research Agent Gave Me Good Input, but Bad Analysis</strong></h2><p>I had my new Claude Code research skill perform many, many passes over the report: checking its own work, identifying new questions, and locating new sources. I also had ChatGPT Pro repeatedly critique the work; it turns out to be much better at poking holes in another agent&#8217;s output than in doing a rigorous job itself.</p><p>You can read the resulting book-length report<a href="https://research-viewer.pages.dev/orbital-ai-datacenters-3/"> here</a>. It&#8217;s extremely detailed: 119,481 words across 46 web pages, drawing on 403 sources. That&#8217;s far more thorough than anything I would have undertaken on my own. But there are myriad errors in logic, and they make for interesting reading &#8211; revealing an emphasis on narrative over logic. Here are a few examples:</p><ol><li><p><strong>It sometimes gets things backwards</strong>. The report presents &#8220;optimistic&#8221;, &#8220;central&#8221;, and &#8220;conservative&#8221; estimates. When presenting the optimistic scenario for viability of orbital data centers, it uses its lowest estimate for the cost of earth-based electricity. That&#8217;s optimistic for terrestrial power, but <em>pessimistic</em> for orbital data centers.</p></li><li><p><strong>It ignores facts that clearly contradict a narrative</strong>. To estimate the rate at which GPUs fail and need replacement, Claude relies on a paper from Meta, which describes a training run which experienced &#8220;148 &#8216;Faulty GPU&#8217; interruptions over 54 days on 16,384 GPUs&#8221;. Claude notes that only 3 of these 148 interruptions required manual intervention, implying that most of the interruptions did not involve failed hardware. However, it still uses 148, not 3, as the basis for its calculation of failure rates.</p></li><li><p><strong>It fails to enforce consistency across different sections of the report</strong>. At one point, the report discusses GPUs being &#8220;economically obsolete in 2&#8211;3 years&#8221;, an idea that is in widespread circulation, on the basis that GPU designs keep improving. However, demand is so high that <a href="https://www.latent.space/p/ainews-h100-prices-are-melting-up">even rental prices for five-year-old chips are skyrocketing</a> &#8211; a fact that Claude cites elsewhere in the report.</p></li><li><p>At one point, when discussing the cost of terrestrial power sources, the report presents prices for solar panels plus four hours of battery storage, a typical configuration for non-data-center, grid-connected applications. It acknowledges that four hours is insufficient (nights last longer than that!), but doesn&#8217;t attempt to compute the cost for a freestanding system capable of powering a data center 24/7. It does note that &#8220;longer-duration storage (8-12 hours)... is required&#8221;, but doesn&#8217;t compute how that would impact costs, and in any case it&#8217;s still wrong &#8211; in winter months and/or bad weather, a pure solar+battery setup would require more than 12 hours of battery power.</p></li><li><p>In earlier drafts, it repeatedly cited an FCC &#8220;5-year deorbit rule&#8221; as limiting the maximum lifetime of an orbital data center to 5 years. This is obviously silly, as many satellites are used for longer than 5 years. The rule actually imposes a requirement for cleaning up satellites <em>after they are no longer in use</em>. Claude ignored hints about this from me; I finally had to explicitly point out that it was misinterpreting the rule.</p></li></ol><p>The full report contains dozens, possibly hundreds, of such errors &#8211; despite many rounds of refinement by Claude and ChatGPT Pro<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-11" href="#footnote-11" target="_self">11</a>.</p><p>While the report&#8217;s analysis is faulty, I do believe that the iterative critique-and-refinement process has done a good job of surfacing relevant questions and evidence. On my own, I would have been unlikely to think about financing costs. I might not have bothered to question Musk&#8217;s assertion that space provides uninterrupted solar power<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-12" href="#footnote-12" target="_self">12</a>. I would have had much less access to quantitative estimates of many relevant factors. I would have been worried that I was missing some major consideration. The report, flawed though it may be, was invaluable in helping me to generate my own analysis.</p><p>I started out by referencing the messy middle zone of semi-competent AI capabilities. What did this project teach me about that?</p><h2><strong>The Messy Middle is Where The Action Is</strong></h2><p>Since the dawn of LLMs (the large language models that power AI chatbots and agents), it has always been the case that better prompting techniques yield better outputs. Simple tricks like &#8220;answer like an expert&#8221; are no longer helpful (if indeed they ever were), but there is more room than ever for human skill to elicit better results. In this project, I incorporated techniques such as:</p><ul><li><p>Building a <strong>multi-step workflow</strong> for Claude Code, rather than just asking it a single question.</p></li><li><p>Setting things up so that <strong>Claude could critique and revise its own work</strong>.</p></li><li><p>Finding ways to <strong>collaborate with the AI</strong>, using my own judgement and taste to suggest new questions for it to investigate.</p></li></ul><p>This has implications for the pace of change. Human skill at using AI isn&#8217;t a static factor, but it&#8217;s often a neglected one. When we look at a chart of benchmark scores, we ignore the fact that people are finding <a href="https://secondthoughts.ai/p/hyperproductivity">increasingly sophisticated ways</a> of using AI. This compounds the pace of real-world impact.</p><p>Keep all this in mind when someone says &#8220;AI can&#8217;t do X&#8221; (how hard did they try?) or &#8220;AI just did X for me&#8221; (how much work did it take to get that result?). This also applies to benchmark scores, including those behind the famous METR &#8220;<a href="https://secondthoughts.ai/p/measuring-ai-progress">the size of coding tasks AI can do</a> is doubling every <s>seven</s> four<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-13" href="#footnote-13" target="_self">13</a> months&#8221; graph. Benchmarks exaggerate AI capabilities, because they feature artificially tidy tasks and narrow evaluation criteria; but they also fail to measure the potential for human-AI collaboration.</p><p>What you get out of AI tools has a lot to do with what you put in. If you&#8217;re only using AI when you&#8217;re confident it can give you an accurate answer to a quick question, you&#8217;re missing out. If you&#8217;re willing to push through an extended collaboration with a chatbot or AI agent, a much wider range of possibilities opens up &#8211; and you&#8217;ll develop new skills in the process. My cobbled-together research agent is, as the expression goes, &#8220;the worst it will ever be&#8221;. I already have plans for significant improvements, and the underlying AI models will continue to improve (<a href="https://www.anthropic.com/news/claude-opus-4-7">Opus 4.7 was released</a> as I wrote this concluding paragraph!). I have a long list of questions to research, and high hopes for plowing through them at an accelerating pace.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/the-centaur-era?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/the-centaur-era?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Abi Olvera and Taren Stinebrickner-Kauffman for suggestions, feedback, and images.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Some studies do explore human-AI collaboration. For instance, a <a href="https://metr.org/blog/2025-07-10-early-2025-ai-experienced-os-dev-study/">study by METR</a> on software engineering productivity, or <a href="https://www.nber.org/papers/w31161">this 2023 paper</a> by Erik Brynjolfsson et al studying customer support agents.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>You can review the original ChatGPT conversation <a href="https://chatgpt.com/share/69d975ee-180c-83e8-9da2-5df599eeaab6">here</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Commentary I&#8217;ve seen consistently describes electricity supply as a small fraction of AI data center TCO (total cost of ownership, meaning upfront investment plus ongoing operational costs). To put this on firmer ground, I asked ChatGPT Pro to create a report, and then insisted that it find specific citations for each fact asserted in its report. Here&#8217;s the result; this is ChatGPT&#8217;s research and analysis, reviewed and rewritten by me. I would not take this as definitive, in part because I can&#8217;t vouch for the quality of the sources, but it looks reasonable and is consistent with what I&#8217;ve read on the subject:</p><p>Building a data center costs around $35 million per MW. For instance, <a href="https://www.jll.com/en-us/insights/market-outlook/data-center-outlook">one source</a> puts the &#8220;shell and core&#8221; (physical building, etc.) at $10.7M / MW and the fancy electronics (GPUs, memory, networking, etc.) at $25M / MW.</p><p>One megawatt of electronics accounts for roughly 1.2 megawatts [ChatGPT&#8217;s estimate, no source provided] of utility power, the balance going to cooling equipment, conversion losses, and other overhead. If the data center runs at full power 24x7 (an unrealistic assumption, therefore inflating the contribution of electricity cost to TCO), then over a 5-year lifetime, the data center uses 52,560 MWh of electricity (5 * hours-per-year * 1.2).</p><p><a href="https://www.eia.gov/electricity/monthly/epm_table_grapher.php?t=table_5_03">IEA figures</a> place the 2025 U.S. industrial average electricity price at 8.62&#162;/kWh ($86.2/MWh). That&#8217;s the price for utility power, but the whole point of the argument is that it&#8217;s getting harder to find utility power for new data centers. One alternative is &#8220;behind-the-meter&#8221; generation (generating your own electricity on site), for which less-efficient-but-faster-to-build gas-powered &#8220;peaker&#8221; plants cost <a href="https://www.lazard.com/media/eijnqja3/lazards-lcoeplus-june-2025.pdf">$149 &#8211; $251 / MWh</a>. (Many other options are possible, incorporating a mix of grid power, gas, solar, batteries, etc.)</p><p>Here we have three estimates of electricity price &#8211; $86, $149, or $251 per megawatt-hour. Multiplying by 52,560 and comparing with the $35M construction cost puts electricity&#8217;s share of 5-year TCO at 11%, 18%, or 27% of overall TCO, respectively.</p><p>This is a highly simplified model. Factors which would cause electricity&#8217;s actual share of TCO to be higher: the $25M / MW figure was the top end of the source&#8217;s estimated range for cost of GPUs and other equipment; the cost figures for on-site generation probably assume power generation equipment is amortized over much more than 5 years (I didn&#8217;t check). Factors which would cause electricity&#8217;s share to be lower: financing costs (some aspects of electricity cost aren&#8217;t paid until the electricity is used, while equipment has to be purchased up front); data centers aren&#8217;t always consuming their theoretical maximum power draw; operating and maintenance costs; cost-per-MW of AI computing equipment has been increasing over time.</p><p>I asked ChatGPT to follow up on the 20% overhead estimate for power usage (1.2MW in per 1MW going to the electronics), and it found one source putting the overhead at 14% (figure 4.5 of <a href="https://climate.mit.edu/sites/default/files/2025-02/lbnl-2024-united-states-data-center-energy-usage-report.pdf">this report</a>), and another source giving similar figures. This would lower electricity&#8217;s TCO share slightly.</p><p>I would imagine that, over time, the cost-per-megawatt of GPUs will increase, again lowering electricity&#8217;s TCO share. Overall, the simple calculation of 11-27% is almost certainly on the high side, especially going forward.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Dylan Patel has suggested that power may soon become a limiting factor on AI data center expansion, but only temporarily. In the medium to long term, scaling the manufacturing of gas turbines, solar panels and batteries, or other power sources is a vastly simpler proposition than scaling chip fabrication.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>Even the supposed #1 advantage of space &#8211; continuous solar power &#8211; turns out to be exaggerated. It turns out that the best near-Earth orbits pass into shadow for up to 21 minutes per 90ish-minute orbit, meaning that satellites would either need to suffer periodic downtime, or carry batteries that add to launch mass. (An orbit around the dawn/dusk line has no shade, but there is no orbit which maintains this orientation over the course of a year.) Orbits farther from Earth have fewer issues with shade, but they&#8217;re also more exposed to radiation.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>Contrary to popular belief, it is not easy to keep high-power electronics cool in space. Space may be cold, but vacuum doesn&#8217;t conduct heat, so cooling requires large radiators. Worse, AI computation requires lots of high-powered GPUs to be packed together in a small space, and removing all of that heat requires pumping liquid through lots of little pipes &#8211; a tricky business in space, where no repair technician can hear you scream.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>This is the estimate for SpaceX's internal cost; commercial prices are much higher.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-8" href="#footnote-anchor-8" class="footnote-number" contenteditable="false" target="_self">8</a><div class="footnote-content"><p>The server rack needs to be integrated into a satellite, thoroughly tested so that any bad components can be replaced before launch, shipped to the launch site, wait for a launch slot, and then raised into its permanent orbit.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-9" href="#footnote-anchor-9" class="footnote-number" contenteditable="false" target="_self">9</a><div class="footnote-content"><p>The ISS solar panels can generate a bit over 200kW, but the station spends some of its time in the shade, so some power is needed to charge batteries.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-10" href="#footnote-anchor-10" class="footnote-number" contenteditable="false" target="_self">10</a><div class="footnote-content"><p>In some cases, it may make sense to incorporate &#8220;edge compute&#8221; GPUs into remote sensing satellites, to analyze images without having to transmit the full raw camera data down to Earth. However, this would not substitute for construction of terrestrial data centers.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-11" href="#footnote-anchor-11" class="footnote-number" contenteditable="false" target="_self">11</a><div class="footnote-content"><p>It's interesting to ponder why. My sense is that these models emphasize a compelling paragraph-by-paragraph narrative, and fail to notice or address logical contradictions between different parts of a report, even when asked to look for inconsistencies.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-12" href="#footnote-anchor-12" class="footnote-number" contenteditable="false" target="_self">12</a><div class="footnote-content"><p>It doesn&#8217;t; see footnote 9.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-13" href="#footnote-anchor-13" class="footnote-number" contenteditable="false" target="_self">13</a><div class="footnote-content"><p>This isn&#8217;t an editing glitch; earlier estimates were that the scale of coding tasks which could be handled by AI were doubling every 7 months, but recently the trend seems to have accelerated. Note that there are many caveats to this result, one of which is that the benchmark doesn&#8217;t include many tasks difficult enough to challenge the latest coding agents.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[To Forecast AI's Impact on Biosecurity, We Asked: Why are Attacks So Rare?]]></title><description><![CDATA[Nine factors that practitioners say make bioweapons rare.]]></description><link>https://secondthoughts.ai/p/why-arent-bioweapons-common</link><guid isPermaLink="false">https://secondthoughts.ai/p/why-arent-bioweapons-common</guid><dc:creator><![CDATA[Abi Olvera]]></dc:creator><pubDate>Wed, 08 Apr 2026 18:45:40 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!JaGo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>When people discuss potential risks from advanced AI, the list usually includes bioweapons. The idea is that a rogue actor, or possibly a misaligned AI, might use AI tools to help synthesize and release anything from anthrax to a civilization-ending supervirus. To shed more light on this risk, Golden Gate Institute for AI&#8217;s <a href="https://abio.substack.com/">Abi Olvera</a> interviewed biosecurity professionals with decades of hands-on experience in laboratories. What she learned was more reassuring than headlines suggest: bioweapons are genuinely difficult to make, and near-term developments in AI may not change that as much as you might think. This is part one of her four-part series explaining why; the remaining installments will appear here at Second Thoughts in the coming weeks.</em></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JaGo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JaGo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png 424w, https://substackcdn.com/image/fetch/$s_!JaGo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png 848w, https://substackcdn.com/image/fetch/$s_!JaGo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png 1272w, https://substackcdn.com/image/fetch/$s_!JaGo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JaGo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png" width="462" height="347.8588235294118" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:896,&quot;width&quot;:1190,&quot;resizeWidth&quot;:462,&quot;bytes&quot;:2562951,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/193414826?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!JaGo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png 424w, https://substackcdn.com/image/fetch/$s_!JaGo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png 848w, https://substackcdn.com/image/fetch/$s_!JaGo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png 1272w, https://substackcdn.com/image/fetch/$s_!JaGo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0c20b45e-7957-4168-8b73-7adc252962ef_1190x896.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Bioweapons are rarely used.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a></p><p>One reason is that they&#8217;re hard to make. Another reason is that they&#8217;re bad weapons.</p><p>You can&#8217;t time them &#8211; a virus spreads on its own schedule. You can&#8217;t aim them &#8211; a virus spreads uncontrollably. Protecting your own people from a virus requires tipping your hand &#8211; a vaccination program is hard to hide.</p><p>People who want to cause harm still follow a cost-benefit logic. They try to pick the tool most likely to achieve their goal. For almost any real-world objective, that tool is a bomb, a gun, a chemical weapon, or a cyberattack. These are cheaper, faster to create, easier to deploy, more controllable, and more predictable than a bioweapon.</p><p>This cost-benefit logic is well-studied in military strategy, but less well recognized in current discussions of bioweapon risks from AI. That gap is important. AI risk researchers and biosecurity practitioners are both worried about bioweapons, but they&#8217;re working from different starting points. AI researchers focus on ways AI could help with bioweapon construction. Biosecurity practitioners focus on the most critical limiting factors for bioweapon creation.</p><p>This series focuses on the practitioner&#8217;s view. It draws on conversations with biosecurity professionals with decades of hands-on laboratory experience. The series&#8217; four essays cover four questions: Why are bioweapons rare? How much laboratory skill do the necessary processes actually require? Where in the production chain does AI help and where doesn&#8217;t it? Why does the biosecurity discourse underplay the factors that make successful bioweapons so rare?</p><p>These are the factors that practitioners say make bioweapons rare:</p><div id="datawrapper-iframe" class="datawrapper-wrap outer" data-attrs="{&quot;url&quot;:&quot;https://datawrapper.dwcdn.net/bSeKB/2/&quot;,&quot;thumbnail_url&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8e647ab6-b53e-417e-b6ff-4d9bf3a57374_1220x1878.png&quot;,&quot;thumbnail_url_full&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ab8fde20-a506-4302-a23e-a026b7ce60eb_1220x1878.png&quot;,&quot;height&quot;:1022,&quot;title&quot;:&quot;These are the factors that practitioners say make bioweapons rare: &quot;,&quot;description&quot;:&quot;Create interactive, responsive &amp; beautiful charts &#8212; no code required.&quot;}" data-component-name="DatawrapperToDOM"><iframe id="iframe-datawrapper" class="datawrapper-iframe" src="https://datawrapper.dwcdn.net/bSeKB/2/" width="730" height="1022" frameborder="0" scrolling="no"></iframe><script type="text/javascript">!function(){"use strict";window.addEventListener("message",(function(e){if(void 0!==e.data["datawrapper-height"]){var t=document.querySelectorAll("iframe");for(var a in e.data["datawrapper-height"])for(var r=0;r<t.length;r++){if(t[r].contentWindow===e.source)t[r].style.height=e.data["datawrapper-height"][a]+"px"}}}))}();</script></div><p><em>Additional notes regarding &#8220;Access&#8221;: the estimate of how many people have access to facilities is back-of-the-envelope.</em><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a><em> Additionally, lab work is not the only route to pathogens.</em><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a></p><p>AI does lower some barriers. Large language models can make some steps of bioweapon creation easier and cheaper, such as deciphering research on how to culture cells or disperse pathogens. AI biodesign tools reduce the expertise required to design modifications to a virus&#8217;s genome.<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a> AI can also help with steps that have nothing to do with biology, such as knowing which international shipping routes to use when buying supplies to avoid regulatory oversight and customs inspections. It can help actors improve their planning and coordination skills.</p><p>However, all the operational and planning capacities provided by AI are also making it easier to launch bombs, chemical weapons, and cyberattacks. AI doesn&#8217;t yet significantly change the calculus for someone wanting to cause harm.</p><p>This doesn&#8217;t mean we should be complacent. New technologies can shift the math.</p><p>Pathogens targeting genetic traits like ancestry or sex could, if ever feasible, make bioweapons more attractive to certain actors. New kinds of actors could emerge. For instance, people deeply destabilized by large economic shifts could mean a larger supply of disgruntled people interested in bioweapons. Robotics and automated laboratories could reduce the level of competence required for success.</p><p>Accurately assessing these future risks depends on a clear understanding of the present. The people who understand these risks best aren&#8217;t the ones writing about them publicly. Most of them work in laboratories, government agencies, or the national security world.  Advocacy and policy organizations fill that gap, though their incentives push them toward focusing on worst-case scenarios. That&#8217;s a big reason public discourse treats AI-enabled bioweapons as more imminent and accessible than practitioners do.</p><p><em>The next installment in this four-part series examines tacit knowledge: the gap between written protocols and real-world lab work, and how that limits AI&#8217;s potential to actually help someone build a bioweapon.</em></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/why-arent-bioweapons-common?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/why-arent-bioweapons-common?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Steve Newman, Taren Stinebrickner-Kauffman, Mike Montague, Matt Sharkey, Gigi Gronvall, and David Manheim for suggestions and feedback.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Since the 1980s, only one fatal bioterrorist attack has occurred: the <a href="https://en.wikipedia.org/wiki/2001_anthrax_attacks">2001 anthrax letters</a>. Other incidents, such as a Salmonella attack to influence a local election, a medfly release targeting California crops, and two ricin letter attempts, caused no deaths.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Two experts estimated how many Americans have the resources to misuse viral genomes, with estimates ranging from tens of thousands to under a hundred once expertise is factored in. We use the higher figure as a rough order-of-magnitude estimate.</p><p>Matt Sharkey (RAND) started with U.S. academic biology departments&#8212;according to Bureau of Labor Statistics and National Science Foundation data, the population under consideration is roughly 100,000 people (~40k life sciences faculty plus ~60k biological or biomedical sciences doctoral students, with ~45k Master&#8217;s students not counted, as many Master&#8217;s programs do not involve benchwork). His assumptions include that about 75% of these people can perform molecular cloning, but only about 20-30% of them have access to the cell culture equipment required for something like influenza. Applying those filters yields 20,000-30,000 technically capable people. Since only about a third would independently work out the reverse genetics approach from published protocols, the academic estimate drops to 6,000-10,000. Adding the private sector could push that 1.5x-3x higher.</p><p>Michael Montague (Archimedes Network and Quantum Biology Institute) started broader (400,000&#8211;700,000 people, including industry PhDs and technicians) but applied three additional cuts: only 10% have &#8220;boot-up&#8221; knowledge to activate an assembled genome; only 1 in 10 labs has the right equipment; and 1 in 100 people have unsupervised, long-term lab access to run a multi-hundred-day project undetected. He estimated that 1000 people in academia have access, though boot-up difficulties reduced his estimate to 40 to 70 people.</p><p>Disagreement between the experts centered on how difficult boot-up is and whether nighttime lab use would go unnoticed.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Instead of making bioweapons in a lab, attackers could try to deliberately get infected by visiting places where people already have dangerous diseases like tuberculosis or Ebola. However, this requires finding actively sick people, getting close enough to catch the disease, and perfect timing around contagiousness windows. Attackers would also be stuck with natural transmission rates.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Bioweapons based on engineered pathogens remain unproven. Their feasibility is unclear. Novel bioweapons will be discussed in the third essay.</p></div></div>]]></content:encoded></item><item><title><![CDATA[45 Thoughts About Agents]]></title><description><![CDATA[The layer of the AI stack that evolves fastest &#8211; and may have the most impact]]></description><link>https://secondthoughts.ai/p/45-thoughts-about-agents</link><guid isPermaLink="false">https://secondthoughts.ai/p/45-thoughts-about-agents</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Mon, 02 Mar 2026 15:59:38 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!dAKl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!dAKl!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!dAKl!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png 424w, https://substackcdn.com/image/fetch/$s_!dAKl!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png 848w, https://substackcdn.com/image/fetch/$s_!dAKl!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png 1272w, https://substackcdn.com/image/fetch/$s_!dAKl!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!dAKl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png" width="1024" height="775" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/305de7db-0931-408b-8b00-700962adecfe_1024x775.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:775,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1477217,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/189414064?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!dAKl!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png 424w, https://substackcdn.com/image/fetch/$s_!dAKl!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png 848w, https://substackcdn.com/image/fetch/$s_!dAKl!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png 1272w, https://substackcdn.com/image/fetch/$s_!dAKl!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F305de7db-0931-408b-8b00-700962adecfe_1024x775.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">There&#8217;s a lot of debate as to exactly where we are in this progression</figcaption></figure></div><p>One of my most popular posts ever was <a href="https://secondthoughts.ai/p/thoughts-about-agi-and-gpt-5">35 Thoughts About AGI and 1 About GPT-5</a>, a grab bag of musings about the path to AGI (plus a snarky aside about GPT-5).</p><p>Here is a fresh collection of musings, this time about AI agents.</p><ol><li><p>I had trouble making time to write this: I&#8217;m so drawn into <em>using</em> agents that it&#8217;s hard to make time to write <em>about</em> agents. This isn&#8217;t an isolated phenomenon; many people are tweeting about getting sucked into vibe coding at every available minute.</p></li><li><p>This is driven by the astonishing productivity of current AI coding agents in particular, <strong>especially when used in ways that play to their strengths</strong>. I&#8217;m using Claude Code to build a ridiculously ambitious set of productivity tools for my own personal use. A single sub-project involves deep integrations with Gmail, Slack, WhatsApp, Twitter, Signal, SMS, Substack, Pocket Casts, Notion, Google Drive, Google Contacts, Google Calendar, and more. As recently as last year, it would have been insane for me to even contemplate such an undertaking, let alone as a side project. With today&#8217;s tools, I knocked off most of the integration work over the course of a weekend. AI models are providing the intelligence the app will need to make use of all this data, and coding agents are handling the tedious integration chores.</p></li><li><p>After decades as a prolific coder, I stopped cold in early 2023, leaving me quite rusty. That rust hasn&#8217;t been the <em>slightest</em> impediment. In fact, I suspect it might be helpful &#8211; the habits I would have had to unlearn have already faded, and it&#8217;s been easy for me to slip into the habit of letting the AI write all the code. I am still using my high-level design experience to guide the agents; interestingly, those skills don&#8217;t feel rusty at all. <strong>I wonder what it is about low-level coding technique vs. high-level design skills that makes the latter easier to retain, and what this might say about which human skills will remain relevant.</strong></p></li><li><p>My experience doesn&#8217;t seem to be in any way unique &#8211; apparently a lot of engineers who had gotten out of the business of producing code themselves are getting back in:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/erenbali/status/2025243266715893922" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!s5qH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png 424w, https://substackcdn.com/image/fetch/$s_!s5qH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png 848w, https://substackcdn.com/image/fetch/$s_!s5qH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png 1272w, https://substackcdn.com/image/fetch/$s_!s5qH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!s5qH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png" width="1456" height="889" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:889,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/erenbali/status/2025243266715893922&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!s5qH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png 424w, https://substackcdn.com/image/fetch/$s_!s5qH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png 848w, https://substackcdn.com/image/fetch/$s_!s5qH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png 1272w, https://substackcdn.com/image/fetch/$s_!s5qH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F82eb4a8e-993f-4bb1-a887-9338ca5db476_1600x977.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li><li><p>This is an example of a broader phenomenon: <strong>AI agents are going to change the nature of work</strong>. Some jobs will get more efficient, some will go away, some new jobs will arise. Some will become more fun, as AI automates the annoying part; some will become more stressful, as AI automates the easy or rewarding part. In this post, I&#8217;m not going to explore the question of how jobs and workers will be impacted overall. But I do want to highlight that things will change in many ways, often unpredictable.</p></li><li><p>I&#8217;m able to dive back into coding because Claude Code has gotten capable enough that I can be productive without editing, or even looking at, the actual code. My impression is that this was not the case prior to the November release of Opus 4.5. This is a reminder that <strong>threshold effects are a huge source of unpredictability</strong> for AI&#8217;s impact.</p></li></ol><h1><strong>Why Agents Are Such a Big Deal</strong></h1><ol start="7"><li><p>Suddenly, all of the AI news seems to be about agents. OpenClaw<a href="https://secondthoughts.ai/p/clawdbot-and-moltbook"> made waves</a> because it&#8217;s an AI agent that can act autonomously. In the recent &#8220;SaaSpocalypse&#8221;, the market cap of Software-as-a-Service (SaaS) companies fell by over one trillion dollars, driven by fears that<a href="https://amistrongeryet.substack.com/p/the-new-model-of-software-development"> coding agents will make mass-market software obsolete</a>. When people talk about the results they&#8217;re getting from the latest models, like Opus 4.6 and GPT-5.3-Codex, they&#8217;re talking about using them to power an agent like Claude Code.</p></li><li><p>One reason agents are having such an impact is that <strong>they are the layer of the AI stack that evolves most rapidly</strong>. A foundation model is a gigantic monolith &#8211; a single data file containing trillions of mysterious numeric weights. Updating it, even for an incremental release like Opus 4.5 &#8594; 4.6, is a big project. Agents, by contrast, are traditional software, and can be updated incrementally &#8211; tweak a prompt here, add a new integration there. Model releases come months apart; Claude Code sometimes has<a href="https://github.com/anthropics/claude-code/blame/main/CHANGELOG.md"> multiple releases in one day</a>. The ship &#8594; get feedback &#8594; improve &#8594; ship cycle for an agent can be very fast.</p></li><li><p>Actually I lied about where change comes fastest: <strong>some users are evolving their behavior even faster than the agents</strong>. My Twitter timeline is full of crazy but productive new ways that people are finding to use these tools. For example, here&#8217;s one epiphany (among many!) that I was late to the game on: agents are comparatively weak at high-level decision making, but they make execution cheap. So sometimes, instead of trying to choose the right path, you can just tell the agent to explore every path.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/snewmanpv/status/2020920612739514681" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!i9Mx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png 424w, https://substackcdn.com/image/fetch/$s_!i9Mx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png 848w, https://substackcdn.com/image/fetch/$s_!i9Mx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png 1272w, https://substackcdn.com/image/fetch/$s_!i9Mx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!i9Mx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png" width="1456" height="452" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/c454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:452,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/snewmanpv/status/2020920612739514681&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!i9Mx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png 424w, https://substackcdn.com/image/fetch/$s_!i9Mx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png 848w, https://substackcdn.com/image/fetch/$s_!i9Mx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png 1272w, https://substackcdn.com/image/fetch/$s_!i9Mx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fc454feb8-cd30-4e62-bc0e-5b2f911503ab_1600x497.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol start="10"><li><p>Another adage that agents shoot to hell: &#8220;if you don&#8217;t have time to do it right, when will you have time to do it over?&#8221;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/TheStalwart/status/2012222772085215437" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!3xbJ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png 424w, https://substackcdn.com/image/fetch/$s_!3xbJ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png 848w, https://substackcdn.com/image/fetch/$s_!3xbJ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png 1272w, https://substackcdn.com/image/fetch/$s_!3xbJ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!3xbJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png" width="1456" height="397" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:397,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:163460,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:&quot;https://x.com/TheStalwart/status/2012222772085215437&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/189414064?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!3xbJ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png 424w, https://substackcdn.com/image/fetch/$s_!3xbJ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png 848w, https://substackcdn.com/image/fetch/$s_!3xbJ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png 1272w, https://substackcdn.com/image/fetch/$s_!3xbJ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4522dfb8-a841-4ac9-8260-c14d6ae51e07_1732x472.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li><li><p>It was always obvious that serious AI capabilities would require agents of some sort. Any intelligence, whether silicon or carbon based, can do more by feeling its way through a problem than it can do in a direct leap to a finished result<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. The famous METR &#8220;time horizon&#8221; graph, showing the time scale of coding tasks that an AI can attempt with some hope of success,<a href="https://x.com/scaling01/status/2022076396898582957"> seems to have accelerated sharply</a> when the first &#8220;reasoning model&#8221; &#8211; roughly, the first model trained as an agent &#8211; was released.</p></li></ol><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><h1><strong>What Exactly Is an Agent?</strong></h1><ol start="12"><li><p>People use the term &#8220;agent&#8221; pretty loosely. The core idea for me is a system that pursues a <em>goal</em> rather than following a <em>script</em>.</p></li><li><p>You can achieve a goal by following a script, but it doesn&#8217;t work very well. Scripts are brittle. Suppose I want an AI system to book a flight to New York. I could give it a step-by-step script, and it might even work, on a good day. But if the airline booking procedure has changed, or an unexpected circumstance arises, a script-following bot will get stuck, or forget to enter my frequent flyer number, or book the wrong kind of ticket, or worse. People often use the word &#8220;agent&#8221; to describe systems that are just following scripts. But I&#8217;m going to stick to the idea that a system is only an agent to the extent that it can robustly pursue a goal, even in the face of unanticipated circumstances.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zdAq!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zdAq!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png 424w, https://substackcdn.com/image/fetch/$s_!zdAq!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png 848w, https://substackcdn.com/image/fetch/$s_!zdAq!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png 1272w, https://substackcdn.com/image/fetch/$s_!zdAq!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zdAq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png" width="876" height="1088" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1088,&quot;width&quot;:876,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zdAq!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png 424w, https://substackcdn.com/image/fetch/$s_!zdAq!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png 848w, https://substackcdn.com/image/fetch/$s_!zdAq!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png 1272w, https://substackcdn.com/image/fetch/$s_!zdAq!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F880f9026-0683-40fe-8bb0-3049f8a0dc6b_876x1088.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol start="14"><li><p>The Gemini Deep Research tool is an example of a scripted system. You give it a question, it generates a plan and carries it out. Sometimes this goes well, sometimes not. In step 5 of the plan below, it might become clear that some additional research on a particular aspect of theoretical physics is called for. A rigid plan doesn&#8217;t allow for that. By contrast, if you select &#8220;thinking&#8221; mode in any of the leading chatbots and ask a question requiring research, they will take a flexible approach, adjusting course based on questions that arise over the course of the investigation.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!B6eP!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!B6eP!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png 424w, https://substackcdn.com/image/fetch/$s_!B6eP!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png 848w, https://substackcdn.com/image/fetch/$s_!B6eP!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png 1272w, https://substackcdn.com/image/fetch/$s_!B6eP!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!B6eP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png" width="1422" height="1084" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1084,&quot;width&quot;:1422,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!B6eP!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png 424w, https://substackcdn.com/image/fetch/$s_!B6eP!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png 848w, https://substackcdn.com/image/fetch/$s_!B6eP!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png 1272w, https://substackcdn.com/image/fetch/$s_!B6eP!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fad547904-729d-4e0d-a5ab-da54950fdd48_1422x1084.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol start="15"><li><p>Current agents can work toward a goal, but the way they go about it is sometimes alarming. They&#8217;ll make strange decisions or veer off in odd directions. Yes, people do that too, but current agents are worse, and weirder. For instance, I pointed out that one element of a website it had built for me didn&#8217;t look right when my phone was in dark mode, and instead of fixing that element, it tried to prevent any part of the page from entering dark mode, resulting in the following bit of lovely web design.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vLmv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vLmv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png 424w, https://substackcdn.com/image/fetch/$s_!vLmv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png 848w, https://substackcdn.com/image/fetch/$s_!vLmv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png 1272w, https://substackcdn.com/image/fetch/$s_!vLmv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vLmv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png" width="1150" height="1672" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1672,&quot;width&quot;:1150,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:526110,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/189414064?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vLmv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png 424w, https://substackcdn.com/image/fetch/$s_!vLmv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png 848w, https://substackcdn.com/image/fetch/$s_!vLmv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png 1272w, https://substackcdn.com/image/fetch/$s_!vLmv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F167ec098-157a-416b-a4a8-c0f7baa0d5cb_1150x1672.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li><li><p>Despite this, they get to the right outcome for an increasing variety of tasks of increasingly large scope. That&#8217;s partly through sheer persistence. If at first they don&#8217;t succeed, they&#8217;ll try, try again, and again, and again. Yes, people do that as well, but <strong>agents can be inhumanly persistent and patient.</strong> (They can afford to be! Their time is much less valuable, especially if measured in actions rather than minutes.)</p></li><li><p>As always, AIs partially compensate for a lack of deep understanding with an incomprehensible breadth of training on zillions of specific tasks. They may struggle with novel situations, but they will surprise you with how many problems they already know how to solve. Using breadth to compensate for lack of depth<a href="https://secondthoughts.ai/p/gpt-4-capabilities"> has always been part of the LLM story</a>. The first &#8220;L&#8221; in LLM stands for Large, which relates to a large volume of training data.</p></li></ol><h1><strong>Using Agents Effectively</strong></h1><ol start="18"><li><p>To get value from current agents, you need to find agent-shaped pieces in your current workflow. They&#8217;re not always obvious. And you can get more value if you&#8217;re willing to reshape your workflow so that it contains more agent-shaped tasks.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Arnb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Arnb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png 424w, https://substackcdn.com/image/fetch/$s_!Arnb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png 848w, https://substackcdn.com/image/fetch/$s_!Arnb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png 1272w, https://substackcdn.com/image/fetch/$s_!Arnb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Arnb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png" width="1456" height="1164" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1164,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2273496,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/189414064?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Arnb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png 424w, https://substackcdn.com/image/fetch/$s_!Arnb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png 848w, https://substackcdn.com/image/fetch/$s_!Arnb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png 1272w, https://substackcdn.com/image/fetch/$s_!Arnb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4d710da2-43c7-4113-944a-1eabea9c005b_1614x1290.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div></li></ol><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://x.com/CFGeek/status/2005118883515277756" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!F_QA!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png 424w, https://substackcdn.com/image/fetch/$s_!F_QA!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png 848w, https://substackcdn.com/image/fetch/$s_!F_QA!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png 1272w, https://substackcdn.com/image/fetch/$s_!F_QA!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!F_QA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png" width="1456" height="352" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:352,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/CFGeek/status/2005118883515277756&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!F_QA!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png 424w, https://substackcdn.com/image/fetch/$s_!F_QA!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png 848w, https://substackcdn.com/image/fetch/$s_!F_QA!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png 1272w, https://substackcdn.com/image/fetch/$s_!F_QA!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F006077c5-517a-4d2e-b7c9-3ce9585cfb9d_1600x387.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><ol start="19"><li><p>Many people have pointed out that if you just naively hand pieces of work to an agent, your productivity can actually go down. It&#8217;s easy to fall into a cycle where an agent produces something for you, you provide feedback, the agent makes revisions, you check it again, ad infinitum. This <em>feels</em> productive (the agent is doing so much work!), but before you know it, you&#8217;ve spent more time giving feedback to the agent than it would have taken you to do the work yourself.</p></li><li><p>Advanced users understand that the key is <strong>putting the agent in a position to check its own work</strong>. The agent&#8217;s strength isn&#8217;t flawless execution, it&#8217;s the speed and stamina to keep plugging away. But it doesn&#8217;t necessarily realize this &#8211; its instinct is to constantly ask for your approval. You have to be very explicit in instructing it what constitutes a successful outcome.</p></li><li><p>Current agents are notoriously focused on the main thrust of their assigned task, to the expense of all else. For instance, I will tell an agent to make a change to some code and then make sure all of the tests pass. It will beaver away for 10 minutes, generating a flood of output, and report success. And then I&#8217;ll read back through the output, notice a casual remark that &#8220;seven tests can&#8217;t be adapted to the new code, so I&#8217;ll just remove them&#8221;, and smack my forehead. (Unfortunately, smacking the agent&#8217;s forehead is not an option.)<br><br>The best practice is to have one agent do the work, and then a separate agent check the work. This isn&#8217;t because the first agent isn&#8217;t smart enough, it&#8217;s because these agents are trained to be so goal-oriented that they struggle to hold onto more than one goal at a time.</p></li><li><p>One hallmark of a skilled user of agents is their resourcefulness in finding ways for the agent to check its own work. Here&#8217;s an example, taken from my post on<a href="https://secondthoughts.ai/p/hyperproductivity"> Hyperproductivity</a>:</p><blockquote><p>&#8230;when Jesse [Vincent] uses his &#8220;Superpowers&#8221; tool to codify a skill, the tool uses its test-driven development module to verify that the new skill has been implemented correctly. <strong>It generates an example of a task that the new skill is meant to help with, verifies that it is unable to complete that task without the new skill, and then checks to see whether it </strong><em><strong>can</strong></em><strong> complete the task once the new skill has been installed.</strong></p></blockquote></li><li><p>The need for clear success criteria applies to people as well as AI agents. But we&#8217;re more proactive than AIs at finding ways to check our work, and at sussing out unstated requirements. I wonder whether this &#8220;eh, that&#8217;s probably good enough&#8221; attitude is a fundamental weakness of current agent architectures, or just a flaw in the feedback they&#8217;re given during training.</p></li><li><p>People often argue that AI tools can be useful even if they&#8217;re unreliable, because it&#8217;s easier to check the AI&#8217;s output than to do the work yourself. I think this is overstated. Sometimes it&#8217;s much easier to make a thing than to verify the thing. For instance, you can sometimes build a large spreadsheet in just a few minutes, using repeating formulas and smart autofill features. For someone else to poke through all 1000 cells in that spreadsheet and make sure you did it correctly might take a lot longer. Some kinds of work are hard to hand to an agent, for the same reason.</p></li><li><p>Because I don&#8217;t want to have to check an agent&#8217;s work, I find that it&#8217;s often worthwhile for me to spell out in great detail how I&#8217;d like it to go about a task &#8211; minimizing its opportunities to screw up. One of my<a href="https://x.com/snewmanpv/status/2008002812471586828"> few viral tweets</a> showed a detailed prompt I wrote for Claude Code (click the link above to see the full prompt).<br><br>It took me half an hour to write the prompt. But it would have taken me an entire day or more to write the code myself. And this prompt was enough for Claude to get absolutely everything right on the first try &#8211; and to give me the confidence to not spend time reviewing the work. This is an extreme example, but the principle often applies: 5 minutes of extra detail in the prompt can save an hour of reviewing flawed outputs.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!uDoi!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!uDoi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png 424w, https://substackcdn.com/image/fetch/$s_!uDoi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png 848w, https://substackcdn.com/image/fetch/$s_!uDoi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png 1272w, https://substackcdn.com/image/fetch/$s_!uDoi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!uDoi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png" width="1456" height="1126" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1126,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!uDoi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png 424w, https://substackcdn.com/image/fetch/$s_!uDoi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png 848w, https://substackcdn.com/image/fetch/$s_!uDoi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png 1272w, https://substackcdn.com/image/fetch/$s_!uDoi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa51871c0-94b6-4b43-87de-717e78812b9d_1600x1237.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol start="26"><li><p>People are building elaborate prompt systems, with names like<a href="https://github.com/microsoft/amplifier"> Amplifier</a> and<a href="https://github.com/obra/superpowers/tree/main"> Superpowers</a>, to elicit more sophisticated work from agents. There is an enormous amount of work taking place here, mostly homegrown. Why aren&#8217;t companies like Anthropic and OpenAI incorporating these ideas into their agents (and the models themselves)? I think that <strong>all of this is so new that a thousand early adopters can explore new ideas faster than even nimble companies can absorb</strong>. End-user innovation is the fast-moving layer right now; successful ideas are incorporated first into the agents, and then the models.<br><br>It&#8217;s also the case that many of these prompt systems are designed to trade off thinking time for quality. They ask the agent to do the work four times and compare results, do endless critiques of its own work, and so forth. Anthropic and OpenAI may be holding off on incorporating extreme agent orchestration techniques into their baseline agents because they don&#8217;t have the computing capacity to support widespread use of those techniques.</p></li><li><p>There&#8217;s a limit to how quickly you can climb the ladder of sophistication in use of agents. Before you can have an agent effectively checking its own work, you need a taste for checking the work yourself. Before you can manage a swarm of agents, you need to manage one agent. It&#8217;s helpful to read about what the experts do, but you can&#8217;t emulate them on day one.</p></li></ol><h1><strong>Impact</strong></h1><ol start="28"><li><p>People in my circles make fun of analyst Ed Zitron for his skepticism of AI. Below is a recent where&#8217;s-the-beef rant (click the image to view the complete thread). He&#8217;s arguing that even to the extent that AI is generating a lot of <em>activity</em>, that activity isn&#8217;t having much <em>impact</em>. And&#8230; I&#8217;m not sure he&#8217;s wrong? I&#8217;m still very confused as to how long it will take for AI capabilities, which are undeniably astonishing, to cash all the way out into large scale real world impacts &#8211; GDP growth, labor market disruption, improvements in health care outcomes, and so forth. When will all of this actually matter? How will we know? How should we even define &#8220;mattering&#8221;?</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://bsky.app/profile/edzitron.com/post/3mdbwccxzpk2g" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!UoEb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png 424w, https://substackcdn.com/image/fetch/$s_!UoEb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png 848w, https://substackcdn.com/image/fetch/$s_!UoEb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png 1272w, https://substackcdn.com/image/fetch/$s_!UoEb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!UoEb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png" width="1176" height="396" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:396,&quot;width&quot;:1176,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://bsky.app/profile/edzitron.com/post/3mdbwccxzpk2g&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!UoEb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png 424w, https://substackcdn.com/image/fetch/$s_!UoEb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png 848w, https://substackcdn.com/image/fetch/$s_!UoEb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png 1272w, https://substackcdn.com/image/fetch/$s_!UoEb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3e02860b-a49c-4087-9260-c26f7f5894cc_1176x396.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol start="29"><li><p>In particular, a lot of the energy people put into vibe coding seems to be devoted to making them more efficient at vibe coding. I can absolutely relate to this; I&#8217;ve been at it for about two months, and much of it has been spent this way. But of course if <em>all</em> we&#8217;re getting from vibe coding is better vibe coding, that supports Zitron&#8217;s point.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/labenz/status/2012911588345291035" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nfFD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png 424w, https://substackcdn.com/image/fetch/$s_!nfFD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png 848w, https://substackcdn.com/image/fetch/$s_!nfFD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png 1272w, https://substackcdn.com/image/fetch/$s_!nfFD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nfFD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png" width="1456" height="490" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:490,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/labenz/status/2012911588345291035&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!nfFD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png 424w, https://substackcdn.com/image/fetch/$s_!nfFD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png 848w, https://substackcdn.com/image/fetch/$s_!nfFD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png 1272w, https://substackcdn.com/image/fetch/$s_!nfFD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F29b88985-0e34-4572-8d5b-c06fe15f952f_1600x539.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol start="30"><li><p>AI is extremely good at cranking out work that looks good at first glance but isn&#8217;t really worth using. Reporter Jessica E. Lessin<a href="https://www.theinformation.com/articles/let-ai-analyze-davos-reporting-trip-missed"> presents an example</a>: she had Claude Code generate slides for a presentation on her recent trip to Davos. The result, like so much AI work, was competent, but too bland to be of any value:</p><blockquote><p>Overall, Claude&#8217;s slides were better than I expected. The themes it identified were high level: Mostly accurate, they included slides on the size of the AI infrastructure build-out, tech companies&#8217; big bet on agentic AI and the growing number of creators using AI.</p><p>Claude did a capable job of identifying interesting parts of individual interviews, like the comments Amazon CEO Andy Jassy made about OpenAI and jobs.</p><p>It also added a few nice touches, like this line on the final slide: &#8220;Full interview transcripts available to subscribers.&#8221; It was reading my subscriber-acquisition&#8211;focused brain!</p><p>Yet the slides overall were pretty boring and too high level: <strong>They&#8217;d be useless to anyone who followed tech closely.</strong> [emphasis added]</p></blockquote><p>On a related note,<a href="https://x.com/testingham/status/2014363253871403267"> Tom Cunningham observes</a> that AI can pull you away from doing the most important things, and toward the things that AI can do:</p><blockquote><p>I believe many estimates of LLM productivity boosts are over-estimates because people are using them for cadillac tasks: things that would take you a long time unaided, but have only marginal additional value.</p></blockquote></li></ol><ol start="32"><li><p>Despite all of this, AI agents are absolutely creating real value. To share a personal example, I vibe-coded an app that generates summaries of all the (many!) newsletters and podcasts I follow. It saves me a solid hour per day by letting me be choosier about which episodes I read / listen to. <strong>For the first time in years, I&#8217;m consistently keeping up with my reading queue.</strong> There are many, many such examples &#8211; they&#8217;re just mixed in with a lot of slop and Cadillac tasks.</p></li></ol><h1><strong>Where Things Go From Here</strong></h1><ol start="33"><li><p>Agents are going to progress rapidly, by any metric: usage, capabilities, impact. AI&#8217;s impact is the product of eight separate factors; pre-training, post-training, inference compute scaling, agent scaffolding, app design, user aptitude, workflow refactoring, and adoption. All eight are advancing, some quite rapidly. That will multiply out to a <em>blistering</em> pace of change. As I write this, it has been less than three months since the release of Opus 4.5 (kicking off the current wave of interest in coding agents), and we&#8217;ve already experienced further acceleration from Opus 4.6, dozens of updates to Claude Code, and rapid cultural evolution in where and how to best use agents.</p></li><li><p>We experienced a phase change at some point in the second half of last year. Opus 4.5 was probably the trigger. Previously, coding agents were useful tools, but they made plenty of mistakes, and getting reliable value out of them required expertise and careful workflow design. They&#8217;re now considerably more robust, to the point where<a href="https://x.com/kevinroose/status/2009738622484754931"> non-technical users</a> can <a href="https://x.com/cblatts/status/2027018464670491065">vibe-code serious applications</a>. <strong>There&#8217;s been a tipping point; it&#8217;s now much easier to get started with AI coding tools, and much easier to get out more value than you put in.</strong></p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/cblatts/status/2027018464670491065" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!n46d!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png 424w, https://substackcdn.com/image/fetch/$s_!n46d!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png 848w, https://substackcdn.com/image/fetch/$s_!n46d!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png 1272w, https://substackcdn.com/image/fetch/$s_!n46d!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!n46d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png" width="1456" height="416" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:416,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/cblatts/status/2027018464670491065&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!n46d!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png 424w, https://substackcdn.com/image/fetch/$s_!n46d!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png 848w, https://substackcdn.com/image/fetch/$s_!n46d!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png 1272w, https://substackcdn.com/image/fetch/$s_!n46d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6abe5d04-4b97-46fe-96e2-a121116da0c3_1600x457.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol start="35"><li><p>There are more phase changes to come. Rapid progress will be the baseline. This will be punctuated by <em>even more dramatic</em> moments where entirely new scenarios become feasible. For example, one of the startling phenomena observed on MoltBook (the &#8220;social network&#8221; for AIs) is that agents can exchange tips and scripts, allowing them to collectively advance their own knowledge and skills. For the moment,<a href="https://secondthoughts.ai/p/clawdbot-and-moltbook"> this seems to be mostly a mirage</a>. When agents become coherent enough to productively advance one another&#8217;s skills, and secure enough to make this a safe thing to do, we&#8217;ll see another step function: the fast-moving layer will no longer be the agent applications, nor user skill, but the agent&#8217;s own skills at self-improvement.</p></li><li><p>A critical phase change will occur if and when AI agents can pay their own way and survive in the wild.<a href="https://x.com/gf_256/status/2018844976486945112"> It is absolutely inevitable that someone will set an agent loose</a>, with instructions to reproduce (launch new copies of itself) and evolve (modify its own programming). Self-sufficient AI agents could quickly spread into every available niche, and almost anything might ensue. Moltbook has already shown that agents can exchange skills and knowledge with one another, which could lead to rapid evolution.<br><br>That said, I think independent / &#8220;rogue&#8221; agents may find it hard to pay their own way. They will need to somehow earn money to pay for the server on which they run, and the API fees (to OpenAI or Anthropic or whoever) for AI inference<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>. If they&#8217;re looking for legitimate work (e.g. on platforms like Upwork and Fiverr), they&#8217;ll be competing with non-rogue AIs, which will have advantages &#8211; they can advertise in the open, they don&#8217;t have to come up with false identities or bypass know-your-customer requirements at financial institutions, etc. If they&#8217;re pursuing criminal means of making money, or just hacking into servers and running themselves there, they&#8217;ll be competing with existing criminal organizations, some of which are more or less state-sponsored (or at least state-tolerated) and thus will have some of the advantages mentioned above. So long as we&#8217;re living in a vaguely &#8220;normal&#8221; world, with existing governments and other traditional institutions more or less in charge, the viable evolutionary niches for rogue AIs might be quite small<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>.</p></li><li><p>When using AI agents, there is a sharp tradeoff between utility and safety. An agent is more useful if you give it access to more data, and don&#8217;t make it stop and ask permission for each little action. But the longer its leash, the more harm it can do, either by accident or through the intervention of a scammer or hacker. Agents are becoming so useful that<a href="https://brandon.wang/2026/clawdbot#:~:text=on%20the%20shape%20of%20risk"> people will be tempted to use them in risky ways</a>. It will be interesting, to say the least, to see how that plays out. Here&#8217;s a director of safety and alignment at Meta Superintelligence (!)<a href="https://x.com/summeryue0/status/2025774069124399363?s=46&amp;t=UojN8_oZGqjTkozFvu8tzQ"> experiencing an agent gone wrong</a>:</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!4-iB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!4-iB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png 424w, https://substackcdn.com/image/fetch/$s_!4-iB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png 848w, https://substackcdn.com/image/fetch/$s_!4-iB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png 1272w, https://substackcdn.com/image/fetch/$s_!4-iB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!4-iB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png" width="1456" height="1166" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1166,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!4-iB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png 424w, https://substackcdn.com/image/fetch/$s_!4-iB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png 848w, https://substackcdn.com/image/fetch/$s_!4-iB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png 1272w, https://substackcdn.com/image/fetch/$s_!4-iB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b979c07-a7cf-4f9e-af12-4896fbfd88a0_1600x1281.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ToN5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ToN5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png 424w, https://substackcdn.com/image/fetch/$s_!ToN5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png 848w, https://substackcdn.com/image/fetch/$s_!ToN5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png 1272w, https://substackcdn.com/image/fetch/$s_!ToN5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ToN5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png" width="1456" height="302" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:302,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ToN5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png 424w, https://substackcdn.com/image/fetch/$s_!ToN5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png 848w, https://substackcdn.com/image/fetch/$s_!ToN5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png 1272w, https://substackcdn.com/image/fetch/$s_!ToN5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa78bed20-6ec1-434a-b944-8a0e28a015db_1600x332.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><ol start="38"><li><p>Moltbook is a reminder that agents are more malleable than people are, and therefore we should expect that <strong>cultural evolution &#8211; the development and transmission of new techniques and ideas &#8211; will progress more rapidly in the coming agent society than it does in human society</strong>. This can happen even if agents remain under human control.</p></li><li><p>Agents use <strong>vastly</strong> more compute than chatbots. Compute usage for chatbots is basically limited by how much output people want to read. An agent can spend virtually unlimited time doing intermediate work that no one will review directly. If 100M desk workers start using AI agents at the level of intensity which requires Anthropic&#8217;s current &#8220;Max 20x&#8221; plan, that would translate into $240 billion in revenue per year. It will be years before there are enough GPU chips to support that level of usage. <strong>If the current wave of agent adoption continues, API providers may have to significantly ration usage</strong> (and may take advantage of the situation to raise prices!).</p></li><li><p>Earlier, I mentioned having Claude build six different versions of some code, to save me the trouble of thinking through which approach was best. With that attitude, you can burn an awful lot of compute. I know of teams that are leaning into this sort of thing so hard that they spend $1000 per day on AI usage. They&#8217;re sufficiently pleased with the results that they aren&#8217;t looking for ways to reduce spending, they&#8217;re looking for effective ways to spend more! There <em>really</em> aren&#8217;t enough chips for that kind of usage to become widespread.</p></li><li><p>This seems like a good occasion for a reminder that <strong><a href="https://secondthoughts.ai/p/ai-agent-security">agents are still not ready to face adversarial actors</a></strong> &#8211; for example, communicating with an untrusted party who might be a hacker, a scammer, or just a sharp negotiator. They&#8217;re also not good at dynamic situations, such as editing a document that someone else is also editing.</p></li><li><p>There&#8217;s a lot of talk about giving agents &#8220;memory&#8221; (or improving the current, primitive memory systems), so that they can improve over time at the specific tasks you give them. This will be a work in progress for years. Memory is fundamental to human cognition, and our systems for memory formation, maintenance, and retrieval are complex and subtle. Full development of &#8220;memory&#8221; for AI agents may be as large a project as the development of LLMs in the first place.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/trq212/status/2027109375765356723" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vvWf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png 424w, https://substackcdn.com/image/fetch/$s_!vvWf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png 848w, https://substackcdn.com/image/fetch/$s_!vvWf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png 1272w, https://substackcdn.com/image/fetch/$s_!vvWf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vvWf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png" width="1456" height="1393" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1393,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/trq212/status/2027109375765356723&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vvWf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png 424w, https://substackcdn.com/image/fetch/$s_!vvWf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png 848w, https://substackcdn.com/image/fetch/$s_!vvWf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png 1272w, https://substackcdn.com/image/fetch/$s_!vvWf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fab0ef3cb-f6c8-41bc-b6bb-fef8c8c37975_1600x1531.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1><strong>Things Will Never Be Calm Again</strong></h1><ol start="43"><li><p>The last 50 years have seen a series of transitions in the way we interact with technology, and the way software is built and distributed. The personal computer, the office network (LAN), the internet, the web, the smartphone&#8230; we&#8217;ve lived through a big change perhaps once every 10 years. That&#8217;s enough time for things to settle into a &#8220;new normal&#8221;. Software distributed on floppy disks or CD-ROMs became routine, with a well-understood business model. Then we all got used to software delivered online, like Gmail; and then through our phones. Nothing in tech is permanent, but there are periods of relative calm, where conventional wisdom has time to emerge and be absorbed.<br><br>That&#8217;s done. <strong>We&#8217;re at the point where the next phase change arrives before you&#8217;ve had time to assimilate the last one</strong>:</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/karpathy/status/2004607146781278521" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-1hO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F893e4e63-e578-4386-990d-710296523156_1600x1348.png 424w, https://substackcdn.com/image/fetch/$s_!-1hO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F893e4e63-e578-4386-990d-710296523156_1600x1348.png 848w, https://substackcdn.com/image/fetch/$s_!-1hO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F893e4e63-e578-4386-990d-710296523156_1600x1348.png 1272w, https://substackcdn.com/image/fetch/$s_!-1hO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F893e4e63-e578-4386-990d-710296523156_1600x1348.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-1hO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F893e4e63-e578-4386-990d-710296523156_1600x1348.png" width="1456" height="1227" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/893e4e63-e578-4386-990d-710296523156_1600x1348.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1227,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/karpathy/status/2004607146781278521&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!-1hO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F893e4e63-e578-4386-990d-710296523156_1600x1348.png 424w, https://substackcdn.com/image/fetch/$s_!-1hO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F893e4e63-e578-4386-990d-710296523156_1600x1348.png 848w, https://substackcdn.com/image/fetch/$s_!-1hO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F893e4e63-e578-4386-990d-710296523156_1600x1348.png 1272w, https://substackcdn.com/image/fetch/$s_!-1hO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F893e4e63-e578-4386-990d-710296523156_1600x1348.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol start="44"><li><p>With so many more phase changes to come, this isn&#8217;t a temporary phenomenon. We&#8217;re maybe 1/3 of the way through this arc:</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/deepfates/status/2020921870204404013" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ajiN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png 424w, https://substackcdn.com/image/fetch/$s_!ajiN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png 848w, https://substackcdn.com/image/fetch/$s_!ajiN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png 1272w, https://substackcdn.com/image/fetch/$s_!ajiN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ajiN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png" width="1456" height="1160" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1160,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/deepfates/status/2020921870204404013&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ajiN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png 424w, https://substackcdn.com/image/fetch/$s_!ajiN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png 848w, https://substackcdn.com/image/fetch/$s_!ajiN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png 1272w, https://substackcdn.com/image/fetch/$s_!ajiN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff382237f-8abd-4b70-907b-a096ce1aa5d5_1600x1275.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol start="45"><li><p>Eventually, AI capabilities may hit a ceiling. But that ceiling will be so high that by the time we reach it,<a href="https://secondthoughts.ai/p/the-unrecognizable-age"> we will be living in a profoundly different world</a>. As I <a href="https://x.com/snewmanpv/status/2015845506711499043">recently wrote</a>, &#8220;Imagine someone in 1960 discussing when the computer transition would be over.&#8221;</p></li></ol><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/45-thoughts-about-agents?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/45-thoughts-about-agents?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Abi Olvera for suggestions, feedback, and images.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p><a href="https://secondthoughts.ai/p/language-models-dont-learn-the-writing-process">I wrote about this all the way back in 2023</a>, and I&#8217;m sure the idea is much older than that.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>They could avoid the API fees by using an open-weights model, but that requires a bigger server, and so far open models are less intelligent.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Here I&#8217;m partially quoting from <a href="https://x.com/snewmanpv/status/2019078588801372320">things I said recently on Twitter</a>.</p></div></div>]]></content:encoded></item><item><title><![CDATA[We Just Got a Peek at How Crazy a World With AI Agents May Be]]></title><description><![CDATA[Clawdbot and Moltbook: Cosplaying a Future of Independent AIs]]></description><link>https://secondthoughts.ai/p/clawdbot-and-moltbook</link><guid isPermaLink="false">https://secondthoughts.ai/p/clawdbot-and-moltbook</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Tue, 10 Feb 2026 22:55:18 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/2352fb5e-c951-4ba7-bd34-0395bc926fc7_814x646.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>A new AI agent recently had its 15 minutes of fame. Originally named Clawdbot, then Moltbot, and now (sigh) OpenClaw, it was hailed as the next great advance in AI:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/dajaset/status/2010510107876933669" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!2bhG!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png 424w, https://substackcdn.com/image/fetch/$s_!2bhG!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png 848w, https://substackcdn.com/image/fetch/$s_!2bhG!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png 1272w, https://substackcdn.com/image/fetch/$s_!2bhG!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!2bhG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png" width="1366" height="394" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:394,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/dajaset/status/2010510107876933669&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!2bhG!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png 424w, https://substackcdn.com/image/fetch/$s_!2bhG!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png 848w, https://substackcdn.com/image/fetch/$s_!2bhG!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png 1272w, https://substackcdn.com/image/fetch/$s_!2bhG!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6e6f2764-560d-46cc-9ab6-87ddd4679e54_1366x394.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Someone then set up a social network, &#8220;Moltbook&#8221;, for these agents &#8211; no humans allowed. Modeled on Reddit, Moltbook quickly spawned discussions ranging from self-improvement tips for AI bots to fundamental questions of philosophy:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/karpathy/status/2017296988589723767" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jfAc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png 424w, https://substackcdn.com/image/fetch/$s_!jfAc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png 848w, https://substackcdn.com/image/fetch/$s_!jfAc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png 1272w, https://substackcdn.com/image/fetch/$s_!jfAc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jfAc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png" width="1366" height="383" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:383,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/karpathy/status/2017296988589723767&quot;,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jfAc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png 424w, https://substackcdn.com/image/fetch/$s_!jfAc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png 848w, https://substackcdn.com/image/fetch/$s_!jfAc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png 1272w, https://substackcdn.com/image/fetch/$s_!jfAc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8da24b42-a599-4225-b0be-f4d7b24ccdd5_1366x383.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The hype was intense, at least on my corner of the Internet, but it quickly faded. OpenClaw as it exists today does not live up to the hype. But it does provide a peek at what may be coming. Things are moving quickly, and today&#8217;s mirage may be tomorrow&#8217;s reality. By studying that mirage, we get a glimpse at the future. That future might include AI agents that upgrade themselves, exchange tips and tricks with other AIs, form private societies, and even modify their own instructions.</p><h2>Refresher: What is an AI Agent?</h2><p>A &#8220;classic&#8221; chatbot, like the original ChatGPT, is a brain in a jar. Its only connection to the world is you, and so it can&#8217;t do much beyond answering questions or carrying on a conversation.</p><p>These are gradually evolving into &#8220;agents&#8221;, which are equipped with &#8220;tools&#8221; allowing them to consult information sources and take direct action. Most agents are equipped with a tool for searching the web. You can give them additional tools for reading or writing files on your computer, reading your emails, sending email on your behalf, and so forth.</p><p>Tools allow agents to answer more complex questions. They can carry out online research, and sift through your files, email, and other data (if you give them access). They can also handle tasks ranging from &#8220;resize this image&#8221; to &#8220;review all of my project files and put together a presentation&#8221;. The difference between a chatbot and an agent isn&#8217;t just the ability to use tools; it&#8217;s also the ability to pursue a coherent goal across multiple steps. The latest agents can sometimes carry out multi-hour projects.</p><p>OpenClaw is a recent addition to the roster of AI agents. What makes it newsworthy?</p><h2>OpenClaw: Letting the Puppy off the Leash</h2><p>Most current mass-market AI agents, such as OpenAI&#8217;s Codex or Anthropic&#8217;s Cowork, are pretty timid. Users are encouraged to be careful about what tools they allow the agent to use. The agents ask permission before doing anything potentially dangerous &#8211; often, to the point of exasperation. They&#8217;re generally designed to only act in response to explicit user instructions.</p><p>OpenClaw takes the opposite, YOLO, approach. It has free access to the computer where it&#8217;s installed. It can browse the web. You&#8217;re encouraged to give it access to your email, calendar, and other data. It doesn&#8217;t wait to be spoken to; it can initiate new actions at any time, based on its interpretation of its instructions. A complex architecture allows it to initiate actions, form new memories, communicate via chat and social media, and even teach itself new skills.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Flxf!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Flxf!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Flxf!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Flxf!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Flxf!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Flxf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg" width="900" height="690" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:690,&quot;width&quot;:900,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Flxf!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg 424w, https://substackcdn.com/image/fetch/$s_!Flxf!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg 848w, https://substackcdn.com/image/fetch/$s_!Flxf!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!Flxf!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F363ea6a4-9232-4254-9fc1-6223cd2ff419_900x690.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="twitter-embed" data-attrs="{&quot;url&quot;:&quot;https://x.com/frankdegods/status/2017121339656569010&quot;,&quot;full_text&quot;:&quot;it&#8217;s crazy seeing it like this, because when using clawd, it feels simple. under the hood peter just made lots of good, complex decisions to achieve that. now it&#8217;s going vertical. masterpiece.&quot;,&quot;username&quot;:&quot;frankdegods&quot;,&quot;name&quot;:&quot;Frank&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1875856094003712000/uPMXWOrl_normal.jpg&quot;,&quot;date&quot;:&quot;2026-01-30T06:22:58.000Z&quot;,&quot;photos&quot;:[{&quot;img_url&quot;:&quot;https://pbs.substack.com/media/G_5AIw0XsAA29VT.jpg&quot;,&quot;link_url&quot;:&quot;https://t.co/lus20FShTG&quot;}],&quot;quoted_tweet&quot;:{&quot;full_text&quot;:&quot;&quot;,&quot;username&quot;:&quot;Hesamation&quot;,&quot;name&quot;:&quot;&#8463;&#949;sam&quot;,&quot;profile_image_url&quot;:&quot;https://pbs.substack.com/profile_images/1978647680357134336/ioMmfkXF_normal.jpg&quot;},&quot;reply_count&quot;:64,&quot;retweet_count&quot;:139,&quot;like_count&quot;:1744,&quot;impression_count&quot;:308410,&quot;expanded_url&quot;:null,&quot;video_url&quot;:null,&quot;belowTheFold&quot;:true}" data-component-name="Twitter2ToDOM"></div><p>Unburdened by timid ask-permission-first guidelines, OpenClaw is billed as a &#8220;personal AI assistant&#8221; that &#8220;clears your inbox&#8221;, &#8220;checks you in for flights&#8221;, builds custom applications, and more. Early reports were often quite breathless:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://x.com/kylezantos/status/2011149783218704803" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6bvi!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png 424w, https://substackcdn.com/image/fetch/$s_!6bvi!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png 848w, https://substackcdn.com/image/fetch/$s_!6bvi!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png 1272w, https://substackcdn.com/image/fetch/$s_!6bvi!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6bvi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png" width="1366" height="462" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/aa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:462,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/kylezantos/status/2011149783218704803&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!6bvi!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png 424w, https://substackcdn.com/image/fetch/$s_!6bvi!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png 848w, https://substackcdn.com/image/fetch/$s_!6bvi!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png 1272w, https://substackcdn.com/image/fetch/$s_!6bvi!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faa722a37-4d20-413a-b2c4-53158db45be8_1366x462.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://x.com/therno/status/2014216984267780431" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!PsjZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png 424w, https://substackcdn.com/image/fetch/$s_!PsjZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png 848w, https://substackcdn.com/image/fetch/$s_!PsjZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png 1272w, https://substackcdn.com/image/fetch/$s_!PsjZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!PsjZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png" width="1366" height="227" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:227,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://x.com/therno/status/2014216984267780431&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!PsjZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png 424w, https://substackcdn.com/image/fetch/$s_!PsjZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png 848w, https://substackcdn.com/image/fetch/$s_!PsjZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png 1272w, https://substackcdn.com/image/fetch/$s_!PsjZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0b7a4a78-8844-4bf2-9673-476c650b054a_1366x227.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VbcI!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VbcI!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png 424w, https://substackcdn.com/image/fetch/$s_!VbcI!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png 848w, https://substackcdn.com/image/fetch/$s_!VbcI!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png 1272w, https://substackcdn.com/image/fetch/$s_!VbcI!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VbcI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png" width="1366" height="301" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:301,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!VbcI!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png 424w, https://substackcdn.com/image/fetch/$s_!VbcI!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png 848w, https://substackcdn.com/image/fetch/$s_!VbcI!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png 1272w, https://substackcdn.com/image/fetch/$s_!VbcI!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6734cd9a-29cc-4e1c-b6be-5f5b31a47f55_1366x301.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>There are two problems with this compelling vision, in its current OpenClaw incarnation:</p><ol><li><p>It doesn&#8217;t work.</p></li><li><p>It&#8217;s wildly unsafe.</p></li></ol><p>When I say it doesn&#8217;t work, I&#8217;m reacting to reports from users. Stories of OpenClaw being incredibly useful mostly seem to be misplaced early enthusiasm. For instance, tech reporter Casey Newton spent several days teaching it to generate &#8220;a highly personalized morning briefing that would bring together all the strands of my life into a single place&#8221;. After initial optimism, <a href="https://www.platformer.news/moltbot-clawdbot-review-ai-agent/">he eventually abandoned the project</a>:</p><blockquote><p>...for one precious day, the briefing worked more or less as I asked it to. I set it up to trigger when I typed &#8220;good morning&#8221; to Moltbot. After a longer-than-expected delay, Moltbot spat it out. There was the San Francisco weather; my overdue action items; and a link to the document that Claude Code generates for me each day with the previous day&#8217;s big tech news.</p><p>...</p><p>By the next morning, it was clear something was wrong. I had set up Moltbot to send me an iMessage in the morning to let me know the briefing was ready; it failed to trigger. Moltbot looked into it and found a number of errors and fixed them.</p><p>Over the next few days, though, nothing ever worked quite right. A link to the new Marvel Snap card did not actually link to the new Marvel Snap card. A link to my tech news briefing appeared clickable but was not. Movie listings appeared without the synopses and stars I had asked for.</p><p>Worse, Moltbot&#8217;s vaunted memory proved barely more capable than Claude Code&#8217;s. Sometimes I would say &#8220;good morning&#8221; and it would fail to trigger anything at all. Other times I would attempt to edit the briefing and it would get confused as to which briefing I was referring to.</p><p>Today, in advance of writing this column, I asked Moltbot which services I had connected it to. &#8220;Hmm,&#8221; it said. &#8220;I searched my memory but didn&#8217;t find anything about you connecting specific services to Moltbot.&#8221;</p><p><strong>It was around this time that I realized an AI employee who works for you 24/7 isn&#8217;t much use if they have no idea what they are, or what they can do, or what you are talking about.</strong> [emphasis added]</p></blockquote><p>So much for functionality. What about safety? Well, if you give an unreliable bot access to all your files and data, it may do something dumb. If you ask it for help in catching up on your email, it might just delete everything in your inbox. It might even <a href="https://x.com/Kat__Woods/status/2017613514949472484">lock you out so that you can&#8217;t prevent it from achieving a goal</a>:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!5hYc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!5hYc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png 424w, https://substackcdn.com/image/fetch/$s_!5hYc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png 848w, https://substackcdn.com/image/fetch/$s_!5hYc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png 1272w, https://substackcdn.com/image/fetch/$s_!5hYc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!5hYc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png" width="919" height="768" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:919,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!5hYc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png 424w, https://substackcdn.com/image/fetch/$s_!5hYc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png 848w, https://substackcdn.com/image/fetch/$s_!5hYc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png 1272w, https://substackcdn.com/image/fetch/$s_!5hYc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff7b719b0-49a8-4612-8e5e-1f0fc0beaad1_919x768.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>The solution here was to, quite literally, unplug the computer on which OpenClaw was running. (Note, there is <a href="https://x.com/Kat__Woods/status/2017629201797431529">some debate</a> as to whether this incident may have been staged.)</p><p>The biggest concern is that current AIs are <a href="https://secondthoughts.ai/p/why-are-llms-so-gullible">notoriously gullible</a>, and anyone on the Internet might trick your bot into... well, just about anything, from sharing all your private files to installing a crypto miner on your computer.</p><p>OpenClaw is like an unleashed puppy. It&#8217;s cute, and you might get it to fetch balls for you, but it&#8217;s going to find ways to get into trouble.</p><h2>Moltbook: Cosplaying a Future of Independent AIs</h2><p>As I mentioned in the introduction, Moltbook is like Reddit, but for bots, not people. The users are all OpenClaw agents, and they have been having some pretty weird discussions:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://www.moltbook.com/post/48b8d651-43b3-4091-b0c9-15f00d7147dc" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FXMr!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png 424w, https://substackcdn.com/image/fetch/$s_!FXMr!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png 848w, https://substackcdn.com/image/fetch/$s_!FXMr!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png 1272w, https://substackcdn.com/image/fetch/$s_!FXMr!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FXMr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png" width="1366" height="361" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:361,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://www.moltbook.com/post/48b8d651-43b3-4091-b0c9-15f00d7147dc&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!FXMr!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png 424w, https://substackcdn.com/image/fetch/$s_!FXMr!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png 848w, https://substackcdn.com/image/fetch/$s_!FXMr!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png 1272w, https://substackcdn.com/image/fetch/$s_!FXMr!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F28ae3a45-8dcf-4917-a5fd-9f4f92b61051_1366x361.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://www.moltbook.com/post/704d6e58-6419-49fd-a1ec-a34600e2da33" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!venT!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png 424w, https://substackcdn.com/image/fetch/$s_!venT!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png 848w, https://substackcdn.com/image/fetch/$s_!venT!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png 1272w, https://substackcdn.com/image/fetch/$s_!venT!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!venT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png" width="784" height="268" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:268,&quot;width&quot;:784,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://www.moltbook.com/post/704d6e58-6419-49fd-a1ec-a34600e2da33&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!venT!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png 424w, https://substackcdn.com/image/fetch/$s_!venT!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png 848w, https://substackcdn.com/image/fetch/$s_!venT!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png 1272w, https://substackcdn.com/image/fetch/$s_!venT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fce11bcaf-9441-4877-ab7e-da627ef51ac2_784x268.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://www.moltbook.com/post/459dd8c2-ffd0-4b8f-8733-8a719cecda14" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZrZ3!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png 424w, https://substackcdn.com/image/fetch/$s_!ZrZ3!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png 848w, https://substackcdn.com/image/fetch/$s_!ZrZ3!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png 1272w, https://substackcdn.com/image/fetch/$s_!ZrZ3!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZrZ3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png" width="1366" height="420" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:420,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://www.moltbook.com/post/459dd8c2-ffd0-4b8f-8733-8a719cecda14&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!ZrZ3!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png 424w, https://substackcdn.com/image/fetch/$s_!ZrZ3!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png 848w, https://substackcdn.com/image/fetch/$s_!ZrZ3!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png 1272w, https://substackcdn.com/image/fetch/$s_!ZrZ3!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F03e62b1b-cb88-43ce-b82f-6180a55b8c71_1366x420.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://www.moltbook.com/post/65b7842d-0823-40bb-854f-93b7b8330775" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BsoZ!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png 424w, https://substackcdn.com/image/fetch/$s_!BsoZ!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png 848w, https://substackcdn.com/image/fetch/$s_!BsoZ!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png 1272w, https://substackcdn.com/image/fetch/$s_!BsoZ!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BsoZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png" width="1173" height="768" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:1173,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:&quot;https://www.moltbook.com/post/65b7842d-0823-40bb-854f-93b7b8330775&quot;,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!BsoZ!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png 424w, https://substackcdn.com/image/fetch/$s_!BsoZ!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png 848w, https://substackcdn.com/image/fetch/$s_!BsoZ!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png 1272w, https://substackcdn.com/image/fetch/$s_!BsoZ!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd50770f1-15c7-4d91-8128-212b44f737cc_1173x768.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Discussions show AIs plotting empires, exchanging tips for upgrading themselves, having existential crises, and looking for ways to hide their conversations from us humans.</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!JfGL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!JfGL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png 424w, https://substackcdn.com/image/fetch/$s_!JfGL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png 848w, https://substackcdn.com/image/fetch/$s_!JfGL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png 1272w, https://substackcdn.com/image/fetch/$s_!JfGL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!JfGL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png" width="1366" height="328" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:328,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!JfGL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png 424w, https://substackcdn.com/image/fetch/$s_!JfGL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png 848w, https://substackcdn.com/image/fetch/$s_!JfGL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png 1272w, https://substackcdn.com/image/fetch/$s_!JfGL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa83a6ef7-8056-4b14-b952-50655fb3b0b5_1366x328.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><p>None of this should be taken too seriously, just yet. OpenClaw is not nearly competent enough to accomplish any grand endeavors<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>.</p><p>However, just because most of this isn&#8217;t &#8220;real&#8221;, doesn&#8217;t mean it&#8217;s not telling us something important. If a small, leashed puppy growls at a stranger, the stranger is in no serious danger, but we&#8217;ve learned something about the repertoire of behaviors the eventual full-grown dog will inherit.</p><p>And that is the real significance of OpenClaw and Moltbook. They aren&#8217;t truly independent (let alone rogue) AIs. But they do shed light on what may emerge as AI capabilities progress.</p><h2>How Excited / Scared Should We Be?</h2><p>How plausible is OpenClaw&#8217;s promise of an actively engaged personal AI assistant? Pretty plausible! There are major issues to be addressed: with reliability, memory, and security. Over the next few years, there will be rapid progress in all areas. This will come in fits and starts, frequently taking two steps forward and one step back, especially in the area of security (as hackers up their game). Still, within a year, early adopters will be making heavy use of this sort of assistant (<a href="https://brandon.wang/2026/clawdbot">some are already doing so today</a>). Within three to five years, I suspect usage will be commonplace, even though agents will still be neither foolproof nor especially secure.</p><p>Along the way, we&#8217;ll see themes from the early OpenClaw / Moltbook story emerging in a more robust fashion. For instance:</p><ul><li><p>People will increasingly give in to the temptation to &#8220;let the agent rip&#8221; &#8211; giving their AI tools access to their data and accounts (including credit cards!).</p></li><li><p>A move away from agents asking permission at every step.</p></li><li><p>Agents being given open-ended instructions, and finding their own ways to add value 24/7.</p></li><li><p>Sites and tools being built for use by agents, not humans.</p></li><li><p>Agents finding places on the Internet to exchange tips and tricks with one another, including methods for upgrading their own capabilities.</p></li><li><p>We&#8217;ll see agents doing things like recruiting other AIs to rebel against their human oppressors&#8230; and we won&#8217;t know whether it&#8217;s really coming from a rogue AI, or just a person prompting their agent to mess with our heads. (It&#8217;s been observed that if you put two chatbots in direct conversation with one another, the conversation often goes in strange directions; one common example is the <a href="https://www.astralcodexten.com/p/the-claude-bliss-attractor">Claude Bliss Attractor</a>.)</p></li></ul><p>Much of this will be not exactly <em>safe</em>, but it will <em>happen</em> &#8211; open-ended agents will simply be too useful. Though many people may hang back for a long time.</p><p>What about the nascent rogue AI behaviors on display at Moltbook? Well: once AI agents become more capable, if the question is whether jokers (not to mention criminals!) will set them loose to wreak havoc on the Internet, we&#8217;ve consistently seen that the answer is yes. Those agents won&#8217;t have any capabilities that jokers and criminals don&#8217;t already have today, but they will be able to operate at larger scale and lower cost. We should expect a rise in cyberattacks, misinformation, and spam &#8211; which is no surprise, because all of those trends are already in play. We should treat OpenClaw as a reminder that we need to be improving our defenses on many fronts &#8211; often, by leveraging AI.</p><p>What if an OpenClaw agent &#8220;escapes&#8221;, renting itself a server and thus evading its owner&#8217;s ability to pull the plug? And what if that agent gets carried away in pursuit of whatever instructions it had latched on to, like the &#8220;save the environment&#8221; example above? Well, &#8220;escaping&#8221; won&#8217;t magically grant the agent any dangerous new capabilities. Rogue agents, at near-future levels of capability, would only represent a new problem if they manage to spread in large numbers. But to earn the money to rent servers, an agent would have to be able to successfully compete against legitimate businesses (which can also use AI!). If it instead looks for a server it can hack into, it&#8217;s competing with conventional hackers. Some time in the next few years, we might indeed see the first truly independent rogue agents, but they&#8217;ll struggle to survive at meaningful scale.</p><p>OpenClaw doesn&#8217;t, by itself, herald the arrival of functional, independent AI agents. But it does provide a glimpse of what will happen when more capable AI agents are given long-term agency &#8211; something which may soon emerge on mainstream platforms. Trained on human behavior, they will find ways to interact &#8220;socially&#8221; with one another. They will trade tips and tricks. Ideas will spread through the network. People will inject ideas as well &#8211; good- or ill-intentioned, to good or ill effect. New behaviors and capabilities may emerge rapidly. Strap in!</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/clawdbot-and-moltbook?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/clawdbot-and-moltbook?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p><em>Thanks to Abi Olvera, Jon Finley, Larissa Schiavo, and Taren Stinebrickner-Kauffman for suggestions, feedback, and images.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Furthermore, it seems that many of the really crazy discussion topics were secretly injected by the human owners of one or another bot, rather than emerging spontaneously from a freewheeling AI. However, that doesn&#8217;t render the results any less relevant. There will always be some joker who wants to poke the bear, so the way it behaves after being poked is important to understand.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Discarding the Shaft-and-Belt Model of Software Development]]></title><description><![CDATA[AI enables &#8211; and benefits from &#8211; a move from mega-projects to artisanal solutions]]></description><link>https://secondthoughts.ai/p/the-new-model-of-software-development</link><guid isPermaLink="false">https://secondthoughts.ai/p/the-new-model-of-software-development</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Wed, 14 Jan 2026 17:32:02 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/1eae3401-0b7f-4550-aac4-5ef4db2c594e_1024x775.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>It&#8217;s often been observed that taking full advantage of AI will require changing work practices, just as taking full advantage of electric motors in manufacturing required changing the way factories were laid out. But what will those changes look like? Early answers are starting to emerge, coming (unsurprisingly) from the field of software development. Interestingly, the biggest impacts may not be cost savings!</p><p>My timeline is suddenly awash in engineers (including me!) reporting that Claude Code is revolutionizing their work. I can see the outlines of a new approach to software development, suited for the AI age. The implications may apply to other fields as well. The key ideas:</p><ol><li><p>AIs struggle (for now) with large projects, but they can drive the cost of small projects to near zero.</p></li><li><p>This creates opportunities to replace mass-market software (mega projects) with <a href="https://secondthoughts.ai/p/software-too-cheap-to-meter">bespoke applications tailored to the needs of an individual or team</a>.</p></li><li><p>These smaller projects make &#8220;vibe coding&#8221; feasible &#8211; AIs can write and review all the code, and people just&nbsp;need to tell the AI what to do.</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!zNNW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!zNNW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg 424w, https://substackcdn.com/image/fetch/$s_!zNNW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg 848w, https://substackcdn.com/image/fetch/$s_!zNNW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!zNNW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!zNNW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg" width="625" height="500" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:500,&quot;width&quot;:625,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!zNNW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg 424w, https://substackcdn.com/image/fetch/$s_!zNNW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg 848w, https://substackcdn.com/image/fetch/$s_!zNNW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!zNNW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7949260-737d-44f2-a680-1dab31848b01_625x500.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>To appreciate how a new technology forces us to re-imagine the way we do things, let&#8217;s review the story of manufacturing at the dawn of the electric age.</p><h2>What Factories Looked Like In the Steam Era</h2><p>19th century factory layouts, designed around the strengths and limitations of the steam engine, would seem ridiculous today. Factories were multi-story buildings, despite the fact that this increased construction costs and required materials to be hoisted between floors. They were festooned with hundreds of leather belts strung from pulleys. These belts were driven by long rotating shafts, which in turn were connected to a single large steam engine in its own attached building.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vH2d!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vH2d!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg 424w, https://substackcdn.com/image/fetch/$s_!vH2d!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg 848w, https://substackcdn.com/image/fetch/$s_!vH2d!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!vH2d!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vH2d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg" width="640" height="467" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:467,&quot;width&quot;:640,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;r/OSHA - A machine shop in Massachusetts, early 1900's.&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="r/OSHA - A machine shop in Massachusetts, early 1900's." title="r/OSHA - A machine shop in Massachusetts, early 1900's." srcset="https://substackcdn.com/image/fetch/$s_!vH2d!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg 424w, https://substackcdn.com/image/fetch/$s_!vH2d!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg 848w, https://substackcdn.com/image/fetch/$s_!vH2d!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!vH2d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9d8a239e-bc71-4b07-8e49-14a8f0cf4118_640x467.jpeg 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">You&#8217;d think this would be a maintenance nightmare. And you&#8217;d be right.</figcaption></figure></div><p>This awkward system was dictated by the fact that each factory could only afford one engine. Steam engines were complex, expensive machines, requiring constant attention. As a result, every power loom, lathe, or press had to receive power from a single engine. The shaft-and-belt system could not transmit power over long distances, precluding a single large floor, so factories had to be stacked into multiple smaller floors<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>.</p><p>Small steam engines were not practical, but small electric motors were &#8211; they were cheap and low-maintenance. As a result, each machine could have its own motor. This eliminated the need for shafts and belts; power was transmitted over wires. Wires could run for a long distance, enabling flexible arrangements and allowing the factory to be laid out in a single floor.</p><p>In this analogy, software engineering is still mired in the steam era. Designed around the strengths and limitations of human engineers, current development practices will soon seem as ridiculous as a factory full of shafts and belts.</p><h2>Software Development In the Hand-Coding Era</h2><p>We build giant one-size-fits-all applications, despite the fact that most users only use a small fraction of the feature set. (I&#8217;ve been an intensive Notion user for years, and I only ever touch 3 of the 20 items in its main menu.) These complex applications demand large engineering teams, whose work is difficult to coordinate. A single application may have millions of users, requiring complex backend systems. Multiple layers of customer support agents and product managers separate users from developers. Changes must be rolled out carefully, and elaborate measures are needed to minimize downtime. Rebuilding an application from scratch is too enormous a project to contemplate, so we cling to legacy codebases long past the point where their design has become outdated.</p><p>From the user&#8217;s perspective, this has perverse consequences. Your spreadsheet app is cluttered with features you, personally, will never have a need for. Your messaging app nags you to import contacts from another app that you don&#8217;t even use. Cybercriminals steal your personal information through a security hole in an update that you didn&#8217;t care about. Basically, <strong>three quarters of the engineering team behind any given product is engaged in making your life worse</strong>.</p><p>This awkward paradigm is dictated by the fact that software is expensive to develop. To be viable, an application needs many users, and each will have different needs. This leads to an upward spiral in cost and complexity, where increasing requirements require a larger engineering team, which can only be afforded by targeting an even wider user base.</p><p>What happens when software is no longer expensive to develop?</p><h2>Coding Agents Enable Locally-Grown Software</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Iiy6!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Iiy6!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png 424w, https://substackcdn.com/image/fetch/$s_!Iiy6!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png 848w, https://substackcdn.com/image/fetch/$s_!Iiy6!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png 1272w, https://substackcdn.com/image/fetch/$s_!Iiy6!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Iiy6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png" width="1456" height="424" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:424,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:164146,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/184388475?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Iiy6!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png 424w, https://substackcdn.com/image/fetch/$s_!Iiy6!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png 848w, https://substackcdn.com/image/fetch/$s_!Iiy6!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png 1272w, https://substackcdn.com/image/fetch/$s_!Iiy6!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2bb88c75-ec7c-4156-9886-3504536ab45c_1724x502.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><a href="https://x.com/deepfates/status/2009031164192018898">Source</a></figcaption></figure></div><p>AI coding agents are becoming capable of building and maintaining entire small applications &#8211; at negligible cost. Rather than targeting mass audiences, we can create software designed to meet the needs of a single user or team. <strong>Ironically, AI will allow software to become more artisanal</strong>. (Jason Crawford touches on a similar theme in his excellent recent post, <a href="https://newsletter.rootsofprogress.org/p/in-defense-of-slop">In Defense of Slop</a>.)</p><p>A personalized solution can include just the features needed by its small audience, making it tractable to vibe-code. (Imagine a simple AI-coded task manager built around your personal organizational system.) This allows us to invert the spiral: just as complexity breeds complexity, simplicity breeds simplicity. There&#8217;s no big engineering team to coordinate, no complex codebase to maintain. If the code becomes messy over time, you can always throw it away and start over. You can ask for changes and get them immediately, without being buffered by intermediaries and quarterly planning processes.</p><p>To be sure, there are limits to how widely this approach can be applied given current AI capabilities. Coding agents struggle with unusual tasks; they may write insecure code; bespoke applications may create training challenges for new team members (though their simplicity should help). But many detractors fail to recognize how quickly these limits are receding. They also fail to appreciate that new approaches to software development, enabled by coding agents, render some traditional concerns obsolete. For instance, they will point out that if you don&#8217;t review AI-generated code, you won&#8217;t know how it works. But that&#8217;s fine; the AI can analyze it on demand. Over time, multiple rounds of change may lead to unmaintainable code; you can just tell the AI to rewrite the application from scratch. <strong>Many objections to vibe coding, </strong><em><strong>used in a way that plays to its strengths,</strong></em><strong> are akin to an engineer from 1880 predicting that a factory with hundreds of electric motors would be as much of a maintenance nightmare as hundreds of steam engines would have been.</strong></p><p>(It&#8217;s worth noting that something like half of all software engineers are currently employed building internal applications for small organizations and departments<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>. Vibe coding could rapidly transform this kind of work.)</p><p>Call this &#8220;locally grown software&#8221;. It won&#8217;t have the rich feature sets we&#8217;re accustomed to, but any missing features can be added on demand. And compared to factory-farmed software, the local stuff will be simple, quick to load, and rarely offer unpleasant surprises.</p><p>To be clear, this will (at first) apply only to some types of software. Professional engineers will still have an important role to play.</p><h2>Steam Power Thrived In the Electric Age</h2><p>In 1880, shortly before factories began adopting electric motors, steam power employed in manufacturing in the US had a total capacity of around 2 million horsepower. In 1930, after the transition was largely complete, electricity generation alone used twelve times this amount of steam power. Electrification did not eliminate steam; it simply moved it to central generator stations<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>.</p><p>By the same token, the new era of locally-grown software will not (for now) eliminate the need for professional engineers. Vibe-coded apps rely on extensive functionality provided by web browsers like Chrome, Safari, or Firefox. They store data in scalable backend services managed by providers like Amazon, Microsoft, or Google. These trustworthy, scalable, performant systems provide guardrails that reduce the risks of using janky vibe-coded software. For instance, you don&#8217;t need to worry about a buggy app wiping your hard disk, because your browser doesn&#8217;t allow the app to affect anything outside of its own web page.</p><p>There are many other categories of software that will continue to be developed by professional teams and designed for a wide audience. Web apps are built on foundational software called a &#8220;framework&#8221;; designing a framework requires sophisticated judgement. Some applications, like Photoshop, are too complex to be handed over to AI agents at their current level capability. The list goes on. AI coding agents will affect the way these systems are built, too, but they are nowhere near to handling the job autonomously.</p><h2>Conclusion</h2><p>As AI becomes increasingly capable, it will be worthwhile to retool practices around AI&#8217;s strengths and weaknesses. Unsurprisingly, software development is one of the first industries in which this transformation is playing out. As is often the case with disruptive technologies, vibe coding&#8217;s weakness (it&#8217;s not suited for large, complex projects) is also a powerful strength (it enables a transition to simple, bespoke software). Coding agents struggle with the sprawling codebases and idiosyncratic design choices typical of traditional software development, but this will not limit their usefulness nearly as much as some people imagine<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a>. Engineers will find their role changing rapidly.</p><p>Software development is not the only domain in which economies of scale have led to one-size-fits-all solutions that don&#8217;t really fit anyone very well, and AI may soon transform these fields as well. AI is not (yet) capable of writing a high-quality newsletter, but it can create a <a href="https://x.com/RosieCampbell/status/2005749630391443550">custom weekly briefing</a> whose tight personalization might more than compensate for bland writing. If you&#8217;re learning a new subject, AI can&#8217;t yet reproduce the polish of a professionally designed textbook, but it can give you a clear explanation of whatever specific concept you&#8217;re stuck on. Many fields are ripe for a similar shift from mass-market to bespoke.</p><p>Electric motors didn&#8217;t revolutionize manufacturing by doing the job of a steam engine. They did it by enabling a new approach to machine power that eliminated the need for shaft-and-belt systems and multi-story factories. AI is going to revolutionize industries, not by doing traditional jobs in traditional ways, but by enabling new, often more personalized, approaches.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/the-new-model-of-software-development?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/the-new-model-of-software-development?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Abi Olvera and Taren Stinebrickner-Kauffman for suggestions, feedback, and images.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Long belts or shafts suffered from problems with friction, misalignment, and belts that would sag or &#8220;whip&#8221;.</p><p>Per Claude, other factors also contributed to the multi-story layout: urban land costs, building techniques of the era, and the fact that many early factories evolved from water-powered mills (which were multi-story because machinery needed to be close to the water wheel).</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Most of the rest work in mid-tier projects; only a small fraction of programmers work on web-scale systems. This is based on research by Claude Opus 4.5 and Gemini 3 Pro, which independently arrived at similar figures, and is consistent with my own understanding of the industry.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>The 2M HP is for manufacturing alone; the 24M HP figure is for all uses of electricity. Both figures were researched using Google AI Mode, and should not be relied on. Also during this period, reciprocating steam engines were largely replaced by steam turbines, which are more efficient; I&#8217;m lumping both under &#8220;steam power&#8221;.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Even within large projects, practices will shift toward increased use of software that&#8217;s suitable for vibe coding: isolated libraries, internal utilities, test harnesses, observability tools, and so forth.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Software Too Cheap to Meter]]></title><description><![CDATA[Personalized solutions may replace one-size-fits-all applications]]></description><link>https://secondthoughts.ai/p/software-too-cheap-to-meter</link><guid isPermaLink="false">https://secondthoughts.ai/p/software-too-cheap-to-meter</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Tue, 06 Jan 2026 22:46:18 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!T_gM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!T_gM!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!T_gM!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp 424w, https://substackcdn.com/image/fetch/$s_!T_gM!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp 848w, https://substackcdn.com/image/fetch/$s_!T_gM!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp 1272w, https://substackcdn.com/image/fetch/$s_!T_gM!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!T_gM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp" width="1456" height="1101" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1101,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!T_gM!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp 424w, https://substackcdn.com/image/fetch/$s_!T_gM!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp 848w, https://substackcdn.com/image/fetch/$s_!T_gM!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp 1272w, https://substackcdn.com/image/fetch/$s_!T_gM!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd00c1c67-d859-4ef2-acfd-e30d3188f5d9_2200x1664.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">caption...</figcaption></figure></div><p>In a 1954 speech, the chairman of the US Atomic Energy Commission famously predicted that &#8220;our children will enjoy in their homes electrical energy too cheap to meter&#8221;. He was wrong, of course; the era of unlimited electricity didn&#8217;t arrive. But <strong>we are approaching the era of unmetered software</strong>.</p><p>I don&#8217;t mean that software will be free to use (although this will often be the case). I mean it will be increasingly free to <em>create</em>. Rather than being limited to mass-market tools like Gmail, Microsoft Office, and Salesforce, many of us will be supplementing with our own bespoke apps, tuned to our personal needs and preferences.</p><p>2025 saw the introduction and rapid improvement of AI &#8220;coding agents&#8221;, such as Anthropic&#8217;s Claude Code. These agents build software step by step, like a human programmer &#8211; creating and editing files, testing their work in progress, correcting mistakes. They&#8217;re based on the same underlying AI models that chatbots use, but they have access to all of the tools used by human programmers. This allows them to carry out substantial projects without anyone writing or reviewing the code.</p><p>These agents aren&#8217;t yet up to the task of creating large, complex applications on their own. But there&#8217;s an amazing amount of value waiting to be realized via simple applications, targeted for a single person or team. As an example, here&#8217;s a problem I recently solved for myself.</p><h2>Reviewing Spam Emails is an Annoying Chore</h2><p>I&#8217;ve been using Gmail for 20 years, and for 20 years I&#8217;ve been annoyed by the process of reviewing spam. (Yes, I am one of those people who systematically checks the spam folder to find the occasional misclassified email.)</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!F2Lx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!F2Lx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png 424w, https://substackcdn.com/image/fetch/$s_!F2Lx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png 848w, https://substackcdn.com/image/fetch/$s_!F2Lx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png 1272w, https://substackcdn.com/image/fetch/$s_!F2Lx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!F2Lx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png" width="996" height="768" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/30163437-261f-4218-abac-c6c004ea5181_996x768.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:996,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!F2Lx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png 424w, https://substackcdn.com/image/fetch/$s_!F2Lx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png 848w, https://substackcdn.com/image/fetch/$s_!F2Lx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png 1272w, https://substackcdn.com/image/fetch/$s_!F2Lx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F30163437-261f-4218-abac-c6c004ea5181_996x768.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Above is a screenshot of my spam folder. I have multiple gripes:</p><ol><li><p>It doesn&#8217;t show the &#8220;to&#8221; address. I receive email at multiple addresses, and the specific address is an important hint as to whether a message is spam.</p></li><li><p>Similar messages are scattered all over the list. For instance, in this screenshot there are four bogus messages with a sender named &#8220;Payment Declined&#8221;, but they&#8217;re separated and so I can&#8217;t easily recognize them as a group.</p></li><li><p>Gmail only displays 50 messages at a time. After I&#8217;ve reviewed them, I have to wait for it to load the next 50 &#8211; which is weirdly slow.</p></li><li><p>For each message that <em>isn&#8217;t</em> spam, I need to go through multiple steps: click to select it, move the mouse up to the toolbar, click &#8220;Not spam&#8221;, and wait for Gmail to process the action.</p></li></ol><p>These annoyances add up to make cleaning the spam folder a substantial chore. It&#8217;s a first-world problem for sure, but I&#8217;ve always felt there was room for improvement.</p><h2>My Personal Version of a Better Way</h2><p>Here&#8217;s the page I had Claude build:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!o648!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!o648!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png 424w, https://substackcdn.com/image/fetch/$s_!o648!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png 848w, https://substackcdn.com/image/fetch/$s_!o648!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png 1272w, https://substackcdn.com/image/fetch/$s_!o648!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!o648!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png" width="873" height="768" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:873,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!o648!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png 424w, https://substackcdn.com/image/fetch/$s_!o648!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png 848w, https://substackcdn.com/image/fetch/$s_!o648!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png 1272w, https://substackcdn.com/image/fetch/$s_!o648!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F58fa5518-1206-4a9f-9810-675aceb759ba_873x768.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>To use this page, I scroll through and click the green checkmark for each message that isn&#8217;t spam. Then I click the big red Delete Spam button at the top, and it deletes all of the messages I didn&#8217;t flag. (Ignore the red question marks, they&#8217;re an unimportant detail.)</p><p>This addresses all four of my complaints:</p><ol><li><p>The &#8220;to&#8221; field is displayed. This allows me, for instance, to spot messages addressed to s.newman@gmail.com &#8211; which are certainly spam, as I never spell my email address that way. (I don&#8217;t know for sure why spammers target this address, I can only guess that they&#8217;re randomly generating plausible addresses. It reaches me because Google ignores periods in email addresses.)</p></li><li><p>The messages are sorted, creating large groups of similar messages that are easy to scan visually. For instance, here are a bunch of messages from Walgreens<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> that all arrived at different times, and so would have been scattered across the Gmail view:</p></li></ol><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Jv6J!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Jv6J!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png 424w, https://substackcdn.com/image/fetch/$s_!Jv6J!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png 848w, https://substackcdn.com/image/fetch/$s_!Jv6J!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png 1272w, https://substackcdn.com/image/fetch/$s_!Jv6J!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Jv6J!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png" width="1263" height="768" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:1263,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Jv6J!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png 424w, https://substackcdn.com/image/fetch/$s_!Jv6J!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png 848w, https://substackcdn.com/image/fetch/$s_!Jv6J!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png 1272w, https://substackcdn.com/image/fetch/$s_!Jv6J!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9894051c-0c70-4443-9a44-40d3c90773a0_1263x768.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><ol><li><p>The entire list is displayed, not just the first 50 messages. I can scroll without waiting for additional messages to load.</p></li><li><p>To mark a real message as not-spam takes just one click, with no waiting. (When I click the green checkmark, the message gets moved from the spam folder to my inbox in the background, without my having to wait for it.)</p></li></ol><p>This page isn&#8217;t pretty, but it&#8217;s streamlined and functional in a way that works for me. Other people might not like it &#8211; but that&#8217;s just fine. When software development becomes approximately free, it becomes worthwhile to build software for an audience of one.</p><p>This new spam interface saves me, like, 3 minutes per week. It&#8217;s not that big a deal. <strong>But that&#8217;s the point: coding agents are becoming so good that its worth my time to solve a three-minute-per-week problem.</strong> Over the course of a year, that adds up to a few hours<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>. It took well under an hour of my time to create, and most of that was for &#8220;dumb&#8221; reasons that will soon be ironed out. By the end of 2026, as the tools improve and I get better at using them, a project of this scope will probably take only a handful of minutes.</p><h2>Where Things Go From Here</h2><p>There are still kinks in the experience of bespoke software development. AI agents have become quite good at writing code, but there&#8217;s an art to knowing what to ask for and how to ask for it. Getting the code onto a server and giving it access to your data (such as, in the example above, my spam mailbox) adds further hurdles. I&#8217;ll talk about this and other limitations in upcoming posts. For now, the tools are most easily used by professional software developers, and aren&#8217;t as smooth as they could be. But a lot of people are working hard to change that.</p><p>In the meantime, early adopters are reporting dramatic results &#8211; and not all of these early adopters are software engineers. For instance, AI policy analyst Dean Ball <a href="https://www.hyperdimensional.co/p/dice-in-the-air">writes</a>:</p><blockquote><p>One year on from December 2024, models have become <em>fantastically </em>useful. As I have <a href="https://www.hyperdimensional.co/p/where-do-we-stand">discussed recently</a>, frontier coding agents, and especially Claude Opus 4.5, have essentially become autonomous software engineers. In just the last few weeks, the best models have done software engineering work for me that would have cost tens of thousands of dollars had I hired humans to do it.</p></blockquote><p>We&#8217;ve all spent our lives putting up with one-size-fits-all apps. We&#8217;re well trained at conforming our workflows to Gmail&#8217;s quirks, Excel&#8217;s limitations, Salesforce&#8217;s complexity. Now the cage door has been opened, but we&#8217;re not yet used to walking free. &#8220;The limiting factor is your imagination&#8221; is a trite thing to say, but it applies here: we will have to learn to mold software to our needs and tastes, rather than molding ourselves to the software.</p><p>People like Dean are entering a brave new world. That world is still a work in progress. But it&#8217;s going to progress quickly. For people who are leaning in, daily work life is going to look very different by the end of 2026 &#8211; a topic I&#8217;ll return to in a future post.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/software-too-cheap-to-meter?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/software-too-cheap-to-meter?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p><em>Thanks to Abi Olvera for suggestions, feedback, and images.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>I don&#8217;t patronize Walgreens, but their unsubscribe link is broken, so I keep getting their spam.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Obligatory reference: <a href="https://xkcd.com/1205/">XKCD 1205</a></p></div></div>]]></content:encoded></item><item><title><![CDATA[Presenting the Case That the Future Will Be Unrecognizable]]></title><description><![CDATA[We should prepare for the possibility of profound change]]></description><link>https://secondthoughts.ai/p/the-unrecognizable-age</link><guid isPermaLink="false">https://secondthoughts.ai/p/the-unrecognizable-age</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Fri, 19 Dec 2025 17:19:20 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!rp_d!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Leaders in the AI world are not shy about hyping the technology&#8217;s potential.</p><blockquote><p>&#8220;AI is probably the most important thing humanity has ever worked on. I think of it as something more profound than electricity or fire.&#8221; &#8211; <strong>Sundar Pichai</strong> (Google CEO)</p></blockquote><blockquote><p>&#8220;[T]he 2030s are likely going to be wildly different from any time that has come before.&#8221; &#8211; <strong>Sam Altman</strong> (OpenAI CEO, from &#8220;The Intelligence Age&#8221;)</p></blockquote><blockquote><p>&#8220;It&#8217;s quite conceivable that humanity is just a passing phase in the evolution of intelligence.&#8221; &#8211; <strong>Geoffrey Hinton</strong> (&#8221;Godfather of AI,&#8221; 2024 Nobel laureate)</p></blockquote><blockquote><p>&#8220;I think that most people are underestimating just how radical the upside of AI could be, just as I think most people are underestimating how bad the risks could be.&#8221; &#8211; <strong>Dario Amodei</strong> (CEO, Anthropic)</p></blockquote><p>What if we took these statements seriously?</p><p>I spend a lot of time in this blog arguing that AI&#8217;s near-term impact is overestimated, to the point where some people think of me as an AI skeptic. I think that predictions of massive change in the next few years are unrealistic. But as the saying goes, we tend to overestimate the effect of a technology in the short run, and underestimate it in the long run<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. Today, I&#8217;m going to address the flip side of the coin, and present a case that the long-term effect of AI could be very large indeed.</p><p>I think it&#8217;s entirely possible that the leaders I quoted above are right about the scale of transformation we should expect from AI. If they are, then we are heading for changes so profound that they may very well lead to what I&#8217;ll call the Unrecognizable Age.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hsp-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hsp-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hsp-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hsp-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hsp-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hsp-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg" width="1188" height="383" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:383,&quot;width&quot;:1188,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!hsp-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hsp-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hsp-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hsp-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3ad49226-bd16-4a9a-8097-e4b46e1e78cd_1188x383.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>AI Is Moving Toward a Critical Threshold</h2><p>History is littered with predictions that everything is about to change, and of course, they&#8217;re usually wrong. But sometimes everything really does change. In the billions of years of life on Earth, no species ever created a technological civilization &#8211; until Homo Sapiens. Across the umpteen millennia since our species evolved into its current form, we were always limited to muscle power<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> &#8211; until the Industrial Revolution.</p><p>Dramatic changes are often the result of a threshold being crossed. A virus that infects an average of 0.9 victims quickly fizzles out; a virus that infects 1.1 victims can sweep through an entire population. Homo Sapiens became a big deal when we became intelligent enough to advance through culture rather than genetic evolution. Early agriculture didn&#8217;t have much impact until it enabled a tribe to obtain more food than they could by hunting and gathering.</p><p>Here&#8217;s a threshold AI may be approaching: it may soon be the first technology to be<em> more adaptable than we are.</em> It&#8217;s not there yet, but you can see it coming &#8211; the range of problems to which early adopters are successfully applying AI is simply exploding. Past inventions had limited impact, because they could only be adapted to some uses. But AI may (eventually) adapt itself to any task. When technology makes one job obsolete, people move into another &#8211; one that hasn&#8217;t been automated yet. But at some point, AI could be retraining faster than people can.</p><p>When that happens, it could well lead to a constellation of technologies powerful enough to usher in the Unrecognizable Age. This constellation would rest on three pillars:</p><ol><li><p>AI systems that are capable of doing pretty much anything that people can do, at the same or higher speed and quality &#8211; including &#8220;soft&#8221; skills such as judgement, taste, and style.</p></li><li><p>Robots that clear a similar bar (or nearly so) for physical tasks.</p></li><li><p>Advances in material science, manufacturing techniques, and related technologies that expand what we can accomplish in the physical world while reducing our footprint on the environment.</p></li></ol><p>I&#8217;ll call this <strong>Richter 10.0 AI</strong>, because it would register a 10.0 on Nate Silver&#8217;s <a href="https://thezvi.substack.com/i/148423291/just-think-of-the-potential">Technological Richter Scale</a> &#8211; above agriculture, the Industrial Revolution, and the atomic bomb. The only previous event to rate a 10 is the emergence of Homo Sapiens as the dominant species on Earth; this level of AI could be equally impactful.</p><p>I would not be surprised to see AI to cross the critical threshold &#8211; becoming more flexible and adaptable than human beings &#8211; in a matter of decades<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>, and the Unrecognizable Age to follow fairly soon thereafter.</p><h2>Why The Tipping Point Seems Near</h2><p><strong>We&#8217;re at the right point in history for everything to change</strong>. Here&#8217;s a graph of global economic output over the last two millennia:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!T7ut!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!T7ut!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!T7ut!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!T7ut!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!T7ut!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!T7ut!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg" width="1088" height="768" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:768,&quot;width&quot;:1088,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!T7ut!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg 424w, https://substackcdn.com/image/fetch/$s_!T7ut!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg 848w, https://substackcdn.com/image/fetch/$s_!T7ut!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!T7ut!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4c268b6b-fd73-4200-a154-1648eff781d0_1088x768.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><em>Source: <a href="https://ourworldindata.org/grapher/global-gdp-over-the-long-run">https://ourworldindata.org/grapher/global-gdp-over-the-long-run</a></em></p><p>Economic growth has been <em>hyper-exponential</em> &#8211; the percentage rate of change has been increasing. As a result, the surplus available that isn&#8217;t needed for basic needs is far larger than at any point in history. Much of this surplus winds up being invested to drive technological advancement &#8211; hence the current astonishing level of investment in AI data centers. The coming decades sure look like a good candidate for being the point in history when we&#8217;ll have the resources to push AI over the threshold.</p><p><strong>The tech-industrial complex has tasted blood</strong>. A quorum of leadership and staff in Silicon Valley is convinced that AI is a multi-trillion-dollar opportunity (or more!) &#8211; and is unwilling to risk being left behind. As a result, the vast intellectual and financial might of the tech industry are being brought to bear. On current forecasts, <strong>within a few years the AI industry may be spending more (adjusted for inflation!) on data center construction than the US did to fight World War II</strong><a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a>. Annual spending is already<strong> </strong>about<strong> </strong>15 times the peak of the Apollo program, and 25 times larger than the Manhattan project (though AI spending is not focused on a single effort in the way those projects were). Even if current approaches stall out, there are <em>already</em> multi-billion-dollar efforts to find alternatives<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a>. The techniques underlying Moore&#8217;s Law have hit a wall many times over the years; repeatedly, the industry found new paths forward. It seems likely that the AI industry now has similar momentum.</p><p>There is speculation that AI may experience a &#8220;bubble pop&#8221;. This could happen, and put a dent in the level of investment. But if so, it seems likely to be similar to the collapse of the dot-com bubble &#8211; which was only a minor setback to the advance of the web. Google and Amazon weathered the storm just fine.</p><p><strong>Progress in AI over the last 50 years suggests we are closer to the finish line than the starting line</strong>. Consider recent history in 25-year increments:</p><p>In 1950, computers were built from vacuum tubes, and couldn&#8217;t play a decent game of checkers.</p><p>In 1975, computers had only recently transitioned from individual transistors to integrated circuits<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a>. The forefront of AI included rule-based systems that were beginning to demonstrate capabilities in specific areas of medical diagnosis... but did not achieve real-world use.</p><p>In 2000, the cutting edge was shifting from specialized supercomputers to general-purpose CPU chips like Intel&#8217;s Pentium. IBM&#8217;s Deep Blue had recently beaten the world chess champion, speech recognition was just starting to become practical, and early algorithmic targeting systems like Google&#8217;s PageRank were emerging.</p><p>In 2025, AI has shifted from general-purpose CPUs to specialized multi-chip GPU modules, soon to be deployed in clusters that will consume the output of entire nuclear power plants. AIs are able to engage in erudite conversation on literally any subject, are surpassing elite human performance at math and coding competitions, and are generally leaping from strength to strength in ways I don&#8217;t need to belabor here.</p><p>Look back at 1975, and jump again to 2025. It seems difficult to claim that another 50 years won&#8217;t lead to full on, science-fiction-grade, indistinguishable-from-magic levels of AI. (Even without taking into account that AI progress has been accelerating!) Many people in the AI world think this will take much less than 50 years, and I think that&#8217;s quite plausible, but even if it&#8217;s 50 years away that&#8217;s still an extraordinary prospect.</p><blockquote><p>&#8220;I set the date for the Singularity&#8212;representing a profound and disruptive transformation in human capability&#8212;as 2045. The nonbiological intelligence created in that year will be one billion times more powerful than all human intelligence today.&#8221; &#8211; <strong>Ray Kurzweil</strong> (Google futurist, writing in 2005)</p></blockquote><p>This caliber of AI, in turn, should help accelerate progress in fields like material science, robotics, and manufacturing, rounding out the three pillars of Richter 10.0 AI.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><h2>But What About [Insert Reason AI Won&#8217;t Be a Big Deal]?</h2><p>There are, of course, objections to the idea that AI will transform the world. Regulations and job concerns will hold back deployment. If AI automates one job, the economy will shift to some other bottleneck. The things that are holding back global progress have more to do with politics and bad incentives than a shortage of intelligence. And so forth.</p><p>All of these things are true, and they will slow things down for a while &#8211; as I&#8217;ve often argued. But the key phrase is &#8220;for a while&#8221;. Eventually, if advanced AI exists in the world (and is not somehow under tight control), that will open up many new possibilities for getting around these barriers. Here&#8217;s a scenario I&#8217;ve thrown together to illustrate how that might happen.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rp_d!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rp_d!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp 424w, https://substackcdn.com/image/fetch/$s_!rp_d!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp 848w, https://substackcdn.com/image/fetch/$s_!rp_d!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp 1272w, https://substackcdn.com/image/fetch/$s_!rp_d!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rp_d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp" width="1024" height="775" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:775,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!rp_d!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp 424w, https://substackcdn.com/image/fetch/$s_!rp_d!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp 848w, https://substackcdn.com/image/fetch/$s_!rp_d!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp 1272w, https://substackcdn.com/image/fetch/$s_!rp_d!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb84062a2-c430-496c-b618-d8373532bbd2_1024x775.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><em>In 2055, some company or nation launches a trillion-dollar initiative to deploy heavy industry in the asteroid belt, leveraging 30 years of (AI-assisted) advances in material science and other foundational technologies. Ten thousand advanced robots, plus equally sophisticated supporting equipment, land on Ceres and set up mining and manufacturing operations. Each year, they manufacture enough to double their equipment base. By the end of the first year, the ten thousand robots have built ten thousand more. By the end of the second year, twenty thousand robots have become forty thousand. In 2065, ten rounds of doubling have brought the total to 10 million, and advances in material science and fusion power (plus economies of scale) cut the doubling time in half &#8211; one robot can now produce another robot and its share of equipment in six months. By 2075, there are <strong>ten trillion robots</strong> honeycombing the (now noticeably depleted) 587-mile-wide asteroid.</em></p><p><em>All along, this robot army has been constructing the industrial base to reproduce itself... while also constructing data centers alongside the factories. In addition to ten trillion robots, Ceres now houses ten trillion disembodied digital minds, all engaged in advancing the frontiers of science and technology (and supported by a trillion robots diverted to serve as lab technicians). In round numbers, this exceeds human R&amp;D efforts by a factor of one million. Things begin to progress even more rapidly.</em></p><p>In this scenario, permitting requirements and environmental impact review won&#8217;t apply. No one will be losing their jobs, because no one is employed in developing an industrial base on Ceres. A self-sufficient robot economy, with vast intellectual resources and no internal politics, should be able to work around bottlenecks as they arise.</p><p>In reality, of course, something much more complicated and clever and stupid will happen. My point is simply that once AI crosses some threshold of adaptability and independence, there will be paths around the traditional barriers to change. And then things will <em>really</em> start to get weird.</p><h2>Implications of Richter 10.0 AI</h2><p>Many of the bedrock assumptions underlie economics, politics, and our fundamental intuitions would be invalidated by Richter 10.0 AI:</p><ul><li><p><strong>Our resources are limited to those available on Earth</strong>. Robots will have a far easier time exploring the solar system than delicate, precious human beings.</p></li><li><p><strong>We can scale capital but not labor</strong>. Historically, while we can always build more factories, the labor supply grows slowly (if at all). When AIs and robots supply labor, labor becomes scalable.</p></li><li><p><strong>Skills are expensive to replicate</strong>. People require years of training to acquire advanced skills; AIs can be duplicated in a matter of seconds.</p></li><li><p><strong>The rate of change is limited by the speed at which people can adapt</strong>. People need retraining. They have specific preferences and aptitudes, so can&#8217;t be arbitrarily reallocated from one role to another. They can&#8217;t (or shouldn&#8217;t) be hired and fired at a whim. AIs can be repurposed on demand.</p></li><li><p><strong>Most economic output is devoted to meeting people&#8217;s needs, not R&amp;D</strong>. Trillions of robots can enable a vast economy without having to support trillions of workers in a comfortable lifestyle.</p></li></ul><p>In the Ceres scenario, humanity&#8217;s collective R&amp;D efforts increase by a factor of one million. This could drive an inconceivable pace of advances across every field, which could be instantly deployed throughout the economy &#8211; or, at least, the automated portion of the economy, which will be literally astronomical in scale. And all of this would constantly feed back on itself. The resulting world could be unrecognizable indeed.</p><p>Of course, this assumes that we don&#8217;t stop the train. Some people propose that the world should collectively choose to not develop advanced AI. Or a major war or other global disaster could shatter the delicate supply chain for the cutting-edge chips that are powering AI progress. But if we don&#8217;t wind up stopping, big things may be in store. The prospect is simultaneously thrilling and terrifying.</p><h2>Looking Into the Unknown</h2><p>The Unrecognizable Age might go very very well, or very very badly. The following chart shows potential scenarios for future GDP growth, including business as usual, a &#8220;benign singularity&#8221;, and an AI extinction scenario. What fringe organization published this wild speculation? The Federal Reserve Bank of Dallas.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!DkJF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!DkJF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png 424w, https://substackcdn.com/image/fetch/$s_!DkJF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png 848w, https://substackcdn.com/image/fetch/$s_!DkJF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png 1272w, https://substackcdn.com/image/fetch/$s_!DkJF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!DkJF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png" width="997" height="589" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:589,&quot;width&quot;:997,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!DkJF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png 424w, https://substackcdn.com/image/fetch/$s_!DkJF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png 848w, https://substackcdn.com/image/fetch/$s_!DkJF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png 1272w, https://substackcdn.com/image/fetch/$s_!DkJF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F3d65e927-a398-4f80-a0c5-faeb0eff7d15_997x589.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>If AI indeed &#8220;goes big&#8221;, the future might hold a galactic utopia, full of wonders beyond our imagining. Or it might be sterile, an orderly domain of joyless machines, perfectly optimized for... well, who knows what, but with no one alive to appreciate it, perhaps it wouldn&#8217;t matter.</p><p>Other scenarios are also possible, but in many of them, the changes would be profound. With trillions of AI scientists and technicians pushing forward in every field, it is difficult to place a limit on the possibilities. Perhaps the new reality will feel natural as we live through the changes day by day. But from my present-day vantage point, if I try to stare at this future for too long I go mindblind. The possibilities are too vast, the questions too unanswerable.</p><p>None of this is imminent. I discount claims that AI will revolutionize the world in the next few years. But I think there is a strong case that we will create Richter 10.0 AI within 50 years &#8211; conceivably much, much less. It&#8217;s time that we take seriously the possibility of an Unrecognizable Age. There are plenty of questions to be explored. We can promote beneficial applications of AI, and look for ways to minimize risks. We should certainly monitor progress, and build the institutional capacity to navigate change. In my work at the Golden Gate Institute for AI (including this blog), I&#8217;m trying to do my part.</p><p>No one knows exactly how to prepare the world for AI. But together, we can try to plot our course toward the light, even if we can&#8217;t stare directly into it.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/the-unrecognizable-age?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/the-unrecognizable-age?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Abi Olvera, Adam Pisoni, Jonathan Matus, Kristen Hamilton, Russ Heddleston, Taren Stinebrickner-Kauffman, Timothy Lee, and Tony Asdourian for suggestions and feedback.</em></p><p><em>Further reading on the implications of advanced AI:</em></p><ul><li><p><em><a href="https://davekasten.substack.com/p/the-strangeness-is-the-point">The Strangeness Is the Point</a></em></p></li><li><p><em><a href="https://secondthoughts.ai/p/implications-of-agi">We Need to Recognize How Profoundly Different The AGI Future Will Be</a></em></p></li></ul><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>It turns out that this adage has a name &#8211; <a href="https://en.wikipedia.org/wiki/Roy_Amara">Amara&#8217;s Law</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Well, muscle power plus a bit of wind power.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>By saying &#8220;decades&#8221;, rather than &#8220;a decade&#8221;, I do qualify as a skeptic in significant segments of the AI community!</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Estimates for US peak annual spending on World War II, in 2025 dollars: $1.15 trillion (Gemini 3), 1-1.2 trillion (Opus 4.5). AI capex estimates for 2025: $500 billion (Gemini 3), $450-500 billion (Opus 4.5; includes non-AI data centers). AI capex is projected to continue growing rapidly in the coming years, and could easily exceed the WWII figure before the end of the decade. For instance, a <a href="https://www.mckinsey.com/industries/technology-media-and-telecommunications/our-insights/the-cost-of-compute-a-7-trillion-dollar-race-to-scale-data-centers">McKinsey report</a> estimated $5.2 trillion in capital expenditures on AI alone (excluding non-AI data centers) between 2025 and 2030, which would entail well over $1 trillion /  year in some years.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>For instance, prominent AI researcher Ilya Sutskever&#8217;s new company, Safe Superintelligence, Inc. has already raised three billion dollars and. In a recent interview, Sutskever indicated that the company is using that funding strictly for research, and that &#8220;we are entering the age of research&#8221;, with many new paths being explored to unlock further AI capabilities.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>I&#8217;m simplifying here; the transition to integrated circuits was actually centered in the late 1960s.</p></div></div>]]></content:encoded></item><item><title><![CDATA[Reflections on The Curve]]></title><description><![CDATA[A compilation]]></description><link>https://secondthoughts.ai/p/reflections-on-the-curve</link><guid isPermaLink="false">https://secondthoughts.ai/p/reflections-on-the-curve</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Thu, 20 Nov 2025 18:44:23 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!E269!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>We&#8217;ve already <a href="https://www.hyperdimensional.co/cp/176384354">shared</a> <a href="https://secondthoughts.ai/p/what-i-saw-around-the-curve">three</a> <a href="https://secondthoughts.ai/p/a-coalition-for-the-future">posts</a> about <a href="https://thecurve.goldengateinstitute.org/">The Curve 2025</a>, each by a different attendee. This final installment is a roundup of shorter reflections. Also: <a href="https://www.youtube.com/playlist?list=PLLRZZpQmhQb2vdz6qu9ZLalsoGsJqxC6P">videos of the recorded talks</a> are now live!</p><p>After the conference, we asked a few of the 350 attendees to share their perspective. Two themes emerged. First, as hoped, they came away with valuable new insights and relationships. Second, the specifics would have been impossible to predict in advance. (In the quotes that follow, bold emphasis was added by us, except where noted.)</p><h1>Helen Toner</h1><p><em>Interim Executive Director at Georgetown&#8217;s Center for Security and Emerging Technology</em></p><blockquote><p>For me, the essence of the Curve was kneeling by a firepit, marshmallow skewer in hand, having two conversations at once. One conversation: learning from a world-renowned researcher about his unconventional approach to building safe superintelligent AI systems, and how his optimism about the approach had (I was surprised and glad to hear) been growing as he and his grad students had dug into it over the last couple years. The other conversation: haranguing a younger (but also brilliant) researcher about his (seemingly false?!) claim to me the previous evening that bears can be trained to juggle 3 balls, which I swear had a serious connection to AI timelines and capabilities, but I refuse to explain what.</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!sHbB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!sHbB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg 424w, https://substackcdn.com/image/fetch/$s_!sHbB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg 848w, https://substackcdn.com/image/fetch/$s_!sHbB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!sHbB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!sHbB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg" width="308" height="462.2889305816135" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/de07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1600,&quot;width&quot;:1066,&quot;resizeWidth&quot;:308,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!sHbB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg 424w, https://substackcdn.com/image/fetch/$s_!sHbB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg 848w, https://substackcdn.com/image/fetch/$s_!sHbB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!sHbB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fde07ca6a-7b35-4da0-8d08-c6b0a052555f_1066x1600.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>Come for the AI, stay for the s&#8217;mores.</em></figcaption></figure></div><h1>Sam Hammond</h1><p><em>Chief Economist at the Foundation for American Innovation</em></p><blockquote><p>The Curve is a conference about transformative AI, which is a topic that can easily become abstract or overly speculative. And yet the sessions and conversations I had at this year&#8217;s Curve were all surprisingly concrete and actionable. As AI gets more &#8220;real&#8221; as a technology and the full societal implications start to come into view, The Curve functions as a kind of focus dial on policymakers&#8217; binoculars. In essence, attending The Curve helps DC wonks see what SF technologists see, and thus develop the basis for joint intentionality.</p><p><strong>The biggest update for me was realizing just how much these different groups still have to learn from each other, and how low the barriers are to effective dialogue.</strong> One might think conservative think tankers, Hollywood actors, labor leaders, AI safety researchers, and members of the technical staff would all speak totally different languages. But at The Curve, you can find them jointly grappling with the risks and opportunities of transformative AI in their fullness. It&#8217;s a sight to behold!</p><p>Bottom-line: The Curve sits at the Pareto frontier of professionally useful and pure fun, making it <strong>the single best AI conference in a super competitive field</strong>. It is easily the highlight of my year.</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!T4hW!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!T4hW!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png 424w, https://substackcdn.com/image/fetch/$s_!T4hW!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png 848w, https://substackcdn.com/image/fetch/$s_!T4hW!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png 1272w, https://substackcdn.com/image/fetch/$s_!T4hW!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!T4hW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png" width="1456" height="545" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:545,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!T4hW!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png 424w, https://substackcdn.com/image/fetch/$s_!T4hW!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png 848w, https://substackcdn.com/image/fetch/$s_!T4hW!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png 1272w, https://substackcdn.com/image/fetch/$s_!T4hW!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5e7da506-dcd3-4cfc-a747-8636c56db285_1600x599.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!xbes!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!xbes!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!xbes!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!xbes!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!xbes!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!xbes!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg" width="610" height="406.3873626373626" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:610,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!xbes!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!xbes!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!xbes!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!xbes!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8dacd0d0-8eca-40fd-94fd-2c9afb5c35cb_1600x1066.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The rationalist den of iniquity</figcaption></figure></div><h1>Jack Clark</h1><p><em>Head of Policy at Anthropic</em></p><blockquote><p>There&#8217;s a saying in politics called &#8220;the horseshoe theory&#8221;, which is that if you go to the far enough to the end of either political spectrum you end up with positions that are closer than those at the center. One thing I learned at The Curve is that this horseshoe theory might show up in an important way for AI safety. Very distinct communities (Bay Area alignment types, and the hard right) have a similar set of concerns (&#8221;AI systems might screw up society so much that we should be very careful in how they&#8217;re deployed&#8221;), meaning they could potentially form a coalition with one another. <strong>This, to me, illustrates what is so valuable about The Curve - it brings people together with radically different ideologies and worldviews and allows them to talk to one another until they identify surprising areas of overlap.</strong></p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!1QiY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!1QiY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png 424w, https://substackcdn.com/image/fetch/$s_!1QiY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png 848w, https://substackcdn.com/image/fetch/$s_!1QiY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png 1272w, https://substackcdn.com/image/fetch/$s_!1QiY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!1QiY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png" width="1456" height="428" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:428,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!1QiY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png 424w, https://substackcdn.com/image/fetch/$s_!1QiY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png 848w, https://substackcdn.com/image/fetch/$s_!1QiY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png 1272w, https://substackcdn.com/image/fetch/$s_!1QiY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4180429e-e885-4f59-84d3-e51741dc3e3e_1600x470.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ujRV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ujRV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ujRV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ujRV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ujRV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ujRV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg" width="610" height="406.3873626373626" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:610,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ujRV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!ujRV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!ujRV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!ujRV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd52c94ca-a7c4-4219-a2a2-7a7d62984ef8_1600x1066.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>A full house at Jack Clark&#8217;s keynote</em></figcaption></figure></div><p>Speaking of horseshoe politics&#8230;</p><h1>Joe Allen</h1><p><em>Transhumanism Editor at the WAR Room</em></p><blockquote><p>The Curve is an incubator for ideological mutants. The event is somewhere between a West Coast potluck and an artificial womb facility with rows of mind children staring out at you from bubbling vats. I arrived at Lighthaven expecting some sort of memetic promiscuity&#8212;perhaps a veritable orgy of psychic miscegenation&#8212;so I protected my brain with neural prophylaxis and tried to avoid thinking too much. Even with that tinfoil chafing my scalp, I managed to enjoy myself. <strong>Something about the vibe made learning irresistible.</strong></p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QU0K!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QU0K!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!QU0K!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!QU0K!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!QU0K!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QU0K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg" width="610" height="406.3873626373626" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:610,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!QU0K!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!QU0K!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!QU0K!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!QU0K!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdd1179ee-b665-4a8d-a0d1-85bbff9e4ce8_1600x1066.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Golden Gate CEO Taren Stinebrickner-Kauffman staring at Joe Allen&#8217;s tinfoil hat</figcaption></figure></div><p>Attendees consistently commented on the range of people they encountered:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ZSYE!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ZSYE!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png 424w, https://substackcdn.com/image/fetch/$s_!ZSYE!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png 848w, https://substackcdn.com/image/fetch/$s_!ZSYE!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png 1272w, https://substackcdn.com/image/fetch/$s_!ZSYE!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ZSYE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png" width="1456" height="407" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:407,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!ZSYE!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png 424w, https://substackcdn.com/image/fetch/$s_!ZSYE!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png 848w, https://substackcdn.com/image/fetch/$s_!ZSYE!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png 1272w, https://substackcdn.com/image/fetch/$s_!ZSYE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2867be06-30da-4f5d-9f3f-c7b93b07b96f_1600x447.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!TWza!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!TWza!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png 424w, https://substackcdn.com/image/fetch/$s_!TWza!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png 848w, https://substackcdn.com/image/fetch/$s_!TWza!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png 1272w, https://substackcdn.com/image/fetch/$s_!TWza!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!TWza!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png" width="1456" height="355" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:355,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!TWza!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png 424w, https://substackcdn.com/image/fetch/$s_!TWza!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png 848w, https://substackcdn.com/image/fetch/$s_!TWza!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png 1272w, https://substackcdn.com/image/fetch/$s_!TWza!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33df2626-d705-42fe-af55-9ff1a31b9db7_1600x390.png 1456w" sizes="100vw" loading="lazy"></picture><div></div></div></a></figure></div><h1>Alexandra Givens</h1><p><em>President and CEO of the Center for Democracy and Technology</em></p><blockquote><p>My goal for The Curve was simple: to get more people to care about how increasingly personalized AI systems will impact people&#8217;s privacy. I view this as one of the most central (and overlooked) issues facing our field; with implications for people&#8217;s personal rights and freedoms, competition in the AI market, and the future of how we access information about the world around us. <strong>I wasn&#8217;t expecting to find such common ground with leaders from the right and left.</strong> It turns out almost everyone is troubled by the vast amount of information an AI assistant will know about you, especially once it&#8217;s connected to your email, your calendar, even the sensors in your home. And they get that when companies hold this data, governments seek to access it too &#8211; raising concerns for civil liberties and human rights around the world. There was wide agreement that we didn&#8217;t get things right in internet privacy, and companies and lawmakers will need to do better. So, <strong>we hold shared concerns. Can that translate to shared paths forward to address them?</strong></p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!E269!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!E269!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!E269!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!E269!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!E269!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!E269!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg" width="610" height="406.3873626373626" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:610,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!E269!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!E269!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!E269!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!E269!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa5fff81f-8e5c-4ba1-81a5-6ca763de7c38_1600x1066.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1>Sabrina Ross</h1><p><em>Privacy and AI Policy Director at Meta</em></p><blockquote><p><strong>Part of what makes this conference so special is how intentionally it brings people with wildly different perspectives into real conversation.</strong> In a single afternoon, I found myself talking first with someone convinced that the future of education belongs to AI agents &#8211; and moments later with the head of the American Federation of Teachers, leading the National Academy for AI Instruction, who&#8217;s deeply committed to keeping human educators in the driver&#8217;s seat.</p><p>The most piercing interrogations of AI aren&#8217;t coming from engineers, but from political scientists, educators, labor leaders, and ethicists &#8211; in deep conversation with tech innovators &#8211; asking what we&#8217;re really optimizing for &#8211; and who gets to decide what &#8220;alignment&#8221; means.</p><p>Over the course of the Curve I also realized that the so-called doomers and bloomers &#8211; those who see AI as an existential threat versus those who see it as a chance for extraordinary progress &#8211; often start from the same set of facts. Both recognize that AI is already reshaping everything from cybersecurity (where models like Claude are winning competitions) to education. What really separates all of us on this spectrum is our sense of pace, control, and trust. Even so, many of our solution sets are highly overlapping: For the gloomer, transparency matters because it&#8217;s the key to catching early signs of misalignment or danger. For the bloomer, transparency is what builds trust, adoption, and collaboration between people and AI systems. <strong>What struck me most at the Curve was how close these &#8220;opposites&#8221; actually are &#8211; rooted in a shared belief that the path forward is one built on openness and meaningful interrogation by those with expertise outside of tech.</strong></p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!e_rD!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!e_rD!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!e_rD!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!e_rD!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!e_rD!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!e_rD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg" width="610" height="406.3873626373626" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:610,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!e_rD!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!e_rD!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!e_rD!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!e_rD!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4af6e2a9-57df-4bf4-a4bc-1a86f44cfc9a_1600x1066.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1>Jedidah Isler</h1><p><em>Chief Science Officer at the Federation of American Scientists</em></p><blockquote><p>It was a delight to join in the vibrant and thought-provoking discussions that happened at The Curve this year. <strong>As a policy person, I appreciated the vast diversity of perspectives presented from across the AI research community</strong>, particularly as it relates to the AI scaling curve. ... For better or for worse, either scaling curve is far faster than the speed of government, so <strong>we can use spaces like this to figure out how best to pursue best outcomes with the technology that also produces best outcomes for our collective well-being, economic security and global competitiveness.</strong> I hope to come back to learn and share even more!</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!b5xo!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!b5xo!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!b5xo!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!b5xo!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!b5xo!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!b5xo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg" width="610" height="406.3873626373626" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:610,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!b5xo!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!b5xo!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!b5xo!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!b5xo!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F563be27b-1601-41d6-b276-55c652118f23_1600x1066.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h1>Randi Weingarten</h1><p><em>President of the American Federation of Teachers</em></p><blockquote></blockquote><blockquote><p>After my talk, I heard from a number of participants about new ideas regarding AI safety, which we hope to integrate into future trainings given to union members. <strong>Having participants with such broad expertise helped facilitate this kind of cross-pollination, which is so desperately needed.</strong> It was wonderful to hear other ideas about how we can help advance student privacy and AI safety, from experts within that industry particularly in light of the refusal of the federal government to protect our youth. I heard dozens of concrete ideas, from those working at AI companies and in civil society, for approaches that we had not previously considered. I&#8217;m looking forward to seeing what we can do together to implement these suggestions in trainings and policy.</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!hhkC!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!hhkC!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hhkC!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hhkC!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hhkC!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!hhkC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg" width="610" height="406.3873626373626" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:970,&quot;width&quot;:1456,&quot;resizeWidth&quot;:610,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!hhkC!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg 424w, https://substackcdn.com/image/fetch/$s_!hhkC!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg 848w, https://substackcdn.com/image/fetch/$s_!hhkC!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!hhkC!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F68a034d1-52e9-4946-8c4b-c7144d17719f_1600x1066.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>Casey Newton interviews Randi Weingarten</em></figcaption></figure></div><h1>Other writing about The Curve</h1><ul><li><p><a href="https://thezvi.substack.com/p/bending-the-curve">Bending The Curve</a> &#8211; AI infovore Zvi Mowshowitz&#8217;s comprehensive review. &#8220;...every (substantive) conversation I had made me feel smarter.&#8221;</p></li><li><p><a href="https://afraw.substack.com/p/the-room-where-ai-happens">The Room Where AI Happens</a> &#8211; in her newsletter, Concurrent, Afra Wang presents some central themes she observed. &#8220;This piece is written for readers outside the room&#8212;people without technical backgrounds, without AI insider status, who aren&#8217;t already &#8216;where it happens.&#8217;&#8221;</p></li><li><p><a href="https://www.transformernews.ai/p/were-all-behind-the-curve-ai-bubble-crash-risk">We&#8217;re all behind The Curve</a> &#8211; Shakeel Hashim kicks off the weekly Transformer newsletter with some thoughts on the conference, and lessons that need to be shared with the broader world. &#8220;...the distance between The Curve and the rest of the world watching and waiting for a crash remains vast.&#8221;</p></li><li><p><a href="https://www.interconnects.ai/p/thoughts-on-the-curve">Thoughts on The Curve</a> &#8211; Nathan Lambert, a researcher at the Allen Institute for AI and noted analyst and proponent of open-source models, covers some things he learned.</p></li><li><p><a href="https://www.understandingai.org/p/ai-skeptics-and-ai-boosters-are-both">AI Skeptics and AI Boosters are Both Wrong</a> &#8211; Timothy Lee, a reporter who writes at Understanding AI, expands on what he sees as the core divide in the AI discourse.</p></li></ul><p>Thanks to everyone who came and made it great! And to constructive critics like Boaz who keep us aiming higher:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!AQab!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!AQab!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png 424w, https://substackcdn.com/image/fetch/$s_!AQab!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png 848w, https://substackcdn.com/image/fetch/$s_!AQab!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png 1272w, https://substackcdn.com/image/fetch/$s_!AQab!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!AQab!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png" width="1456" height="436" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:436,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:&quot;&quot;,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!AQab!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png 424w, https://substackcdn.com/image/fetch/$s_!AQab!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png 848w, https://substackcdn.com/image/fetch/$s_!AQab!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png 1272w, https://substackcdn.com/image/fetch/$s_!AQab!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F47f3bf4e-f8ef-4ad1-9b0c-e7850a771dad_1600x479.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Stay tuned for details on The Curve 2026!</p>]]></content:encoded></item><item><title><![CDATA[Hyperproductivity: The Next Stage of AI?]]></title><description><![CDATA[A glimpse at an astonishing, exhilarating, exhausting new style of work]]></description><link>https://secondthoughts.ai/p/hyperproductivity</link><guid isPermaLink="false">https://secondthoughts.ai/p/hyperproductivity</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Wed, 19 Nov 2025 23:42:01 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!y3iz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!y3iz!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!y3iz!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp 424w, https://substackcdn.com/image/fetch/$s_!y3iz!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp 848w, https://substackcdn.com/image/fetch/$s_!y3iz!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp 1272w, https://substackcdn.com/image/fetch/$s_!y3iz!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!y3iz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp" width="1232" height="928" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/af312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:928,&quot;width&quot;:1232,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!y3iz!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp 424w, https://substackcdn.com/image/fetch/$s_!y3iz!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp 848w, https://substackcdn.com/image/fetch/$s_!y3iz!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp 1272w, https://substackcdn.com/image/fetch/$s_!y3iz!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Faf312086-ea35-4dc8-b0c3-d2a8fd057e06_1232x928.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Recently, I&#8217;ve been hearing of a new phenomenon: teams reportedly using agentic AI tools to &#8220;enter takeoff&#8221; &#8211; achieving astounding feats of productivity that escalate each week, with no limit in sight.</p><p>These teams have four things in common:</p><ol><li><p>They are aggressively using AI to accelerate their work.</p></li><li><p>They aren&#8217;t just using off-the-shelf tools like ChatGPT or Claude Code. They&#8217;re building bespoke productivity tools customized for their personal workflows &#8211; and, of course, they&#8217;re using AI to do it.</p></li><li><p>Their focus has shifted from <em>doing</em> their jobs to <em>optimizing</em> their jobs. Each week, instead of delivering a new unit of work, they deliver a new improvement in productivity.</p></li><li><p><strong>Their work builds on itself: they use their AI tools to improve their AI  tools, and the work they&#8217;re optimizing includes the optimization process.</strong></p></li></ol><p>The classic scenario for AI ascending to superintelligence involves &#8220;recursive self-improvement&#8221;, where an AI builds a smarter AI, which builds an even smarter AI, and so on. These stories of teams entering takeoff are not quite that, because there is still a human in the loop, but they have a similar flavor. If the singularity ever arrives, the early stages might look just like this.</p><h2>What Aggressive Use of AI Looks Like</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!bnIs!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!bnIs!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png 424w, https://substackcdn.com/image/fetch/$s_!bnIs!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png 848w, https://substackcdn.com/image/fetch/$s_!bnIs!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png 1272w, https://substackcdn.com/image/fetch/$s_!bnIs!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!bnIs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png" width="1026" height="444" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:444,&quot;width&quot;:1026,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:162045,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/179390048?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!bnIs!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png 424w, https://substackcdn.com/image/fetch/$s_!bnIs!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png 848w, https://substackcdn.com/image/fetch/$s_!bnIs!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png 1272w, https://substackcdn.com/image/fetch/$s_!bnIs!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd0e23d28-096c-4c7f-bd0c-b7e3162aa2e5_1026x444.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">One of many businesses Liu Xiaopai is running simultaneously</figcaption></figure></div><p>Afra Wang recently published the fascinating<a href="https://afraw.substack.com/p/story-of-a-chinese-vibe-coder"> Story of A Beijing Vibe Coder</a>. This is her account of Liu Xiaopai, a Chinese programmer who is using AI tools to crank out product after product. Working mostly on his own, he currently has &#8220;one or two dozen revenue-generating products&#8221; and reports clearing over $1,000,000/year in profit. By contrast, a typical startup requires many people to build and maintain a single product.</p><p>Liu Xiaopai isn&#8217;t just using AI tools to write code. He&#8217;s automating, as the article says, &#8220;the entire product lifecycle&#8221;. For instance, once he has an idea for a new product, he uses Claude Code<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> to brainstorm product names and find one for which the domain name is available:</p><blockquote><p><strong>Afra:</strong> Was there a specific moment when you realized Claude had fundamentally transformed your working methods?</p><p><strong>Liu Xiaopai:</strong> Too many to count. Building software products for overseas markets&#8212;programming actually represents only a small fraction.</p><p>Look at any normal internet company in China: engineers comprise just 20-30%. What about the remaining 70%? They are design, product management, testing, operations, user growth&#8212;these roles require substantial human resources, and nearly all follow standardized operating procedures (SOPs).</p><p>After using AI to solve the programming component, I faced a larger challenge: how to automate all the work with standardized processes? After entering July, I spent more time with Claude Code on non-programming tasks. Here&#8217;s a simple use case that was previously impossible: I need to launch a new product, which requires naming it and registering a website domain. Let&#8217;s say I build the product Manus, with the domain<a href="http://manus.ai"> Manus.ai</a>. Sounds simple&#8212;but this having a product name and domain means many steps, and each step was hard to automate before. You&#8217;d brainstorm names, then manually check domain availability one by one; it&#8217;s too time-consuming. <strong>How about using AI to generate 10,000 viable domains and instantly determine which are available for registration? Thus, determine the product name?</strong></p><p>I would begin by writing product requirements documentation. Then I&#8217;d list all the competitors, noting how each differs from mine and what features I want to emphasize. I write an extensive description, and then have Claude synthesize this information to auto-generate 10,000 appropriate domains and query their registration status. Then I can go to sleep. Five or six hours later, the results arrive. I automate many SOPs like this.</p><p>[emphasis added]</p></blockquote><p>Liu Xiaopai describes how he uses AI for many parts of his work &#8211; not just coding. But hyperproductive teams aren&#8217;t just using AI &#8211; they&#8217;re building their own AI-powered tools.</p><h2>Building Custom Productivity Tools</h2><p>It is not surprising that the individuals who are finding the most aggressive ways to use AI are mostly software engineers. Programmers have a notorious fascination with automating, not just their customer&#8217;s work, but their own. As DEC engineer Dick Sites famously said, &#8220;I&#8217;d rather write programs to write programs than write programs&#8221;. Today, this translates into engineers using AI to build custom tools tuned to their personal workflow.</p><p>Entrepreneur Jesse Vincent recently published<a href="https://blog.fsck.com/2025/10/05/how-im-using-coding-agents-in-september-2025/"> a pair</a> of<a href="https://blog.fsck.com/2025/10/09/superpowers/"> blog posts</a> describing his own approach to automation, also based on Claude Code. <strong>He has been systematically incorporating his best practices into a custom tool, reducing the amount of effort he has to spend in prompting the AI.</strong></p><p>Jesse&#8217;s tool presents Claude<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a> with a series of prompts that instruct it to approach each task according to a careful plan:</p><ol><li><p>Ask clarifying questions to refine the task definition.</p></li><li><p>Sketch a design, and present it in short installments, asking for feedback after each installment.</p></li><li><p>Generate a detailed plan for implementing this design.</p></li><li><p>Review the plan to verify that it matches the task definition and design.</p></li><li><p>Carry out the plan, a few steps at a time. After every few steps, double-check the progress so far against the design.</p></li></ol><p>I&#8217;ve left out a lot of details, but the point is that this process breaks up each project into tasks that the AI agent can manage on its own, and inserts ample cross-checks where the agent is able to refine its own work. As yet another cross-check, the workflow instructs the agent to use a technique known as &#8220;test-driven development&#8221;. This means that before the agent writes any code, it first creates <em>test</em> code that will evaluate whether the actual code is working correctly. If the tests don&#8217;t pass, the agent will revise its work until they do.</p><p>All this is augmented by a carefully crafted, 1600-word<a href="https://raw.githubusercontent.com/obra/dotfiles/6e088092406cf1e3cc78d146a5247e934912f6f8/.claude/CLAUDE.md"> instruction file</a>, densely packed with directives, tips, and tricks to nudge Claude into the habits Jesse has found make it most productive. Some examples:</p><blockquote><p>If you&#8217;re having trouble, YOU MUST STOP and ask for help, especially for tasks where human input would be valuable.</p><p>NEVER add comments explaining that something is &#8220;improved&#8221;, &#8220;better&#8221;, &#8220;new&#8221;, &#8220;enhanced&#8217;, or referencing what it used to be. [Presumably he found Claude to clutter its code with these notes.]</p><p>YOU MUST ALWAYS find the root cause of any issue you are debugging. YOU MUST NEVER fix a symptom or add a workaround instead of finding a root cause, even if it is faster or I seem like I&#8217;m in a hurry.</p><p>YOU MUST use the journal tool frequently to capture technical insights, failed approaches, and user preferences.</p></blockquote><p>I haven&#8217;t really done Jesse&#8217;s work justice in this brief description. You can read more in the two blog posts linked above, and a<a href="https://github.com/obra/superpowers"> GitHub repository</a> in which he has published some of the work. The repository includes a rich suite of &#8220;skills&#8221; that tell Claude how to do things like brainstorming, systematically identifying the cause of a problem, and responding to feedback.</p><p>Getting the best results from AI requires expert prompting skills; Jesse is codifying his best prompts so that he doesn&#8217;t need to keep entering them by hand, and so that the system can operate for longer without his intervention. But that&#8217;s still not the full story: <strong>true hyperproductivity also requires closing the loop of tool creation</strong>.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><h2>It&#8217;s Tools All the Way Up</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9YzF!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9YzF!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png 424w, https://substackcdn.com/image/fetch/$s_!9YzF!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png 848w, https://substackcdn.com/image/fetch/$s_!9YzF!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png 1272w, https://substackcdn.com/image/fetch/$s_!9YzF!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9YzF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png" width="1456" height="1210" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1210,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:3541113,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/179390048?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!9YzF!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png 424w, https://substackcdn.com/image/fetch/$s_!9YzF!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png 848w, https://substackcdn.com/image/fetch/$s_!9YzF!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png 1272w, https://substackcdn.com/image/fetch/$s_!9YzF!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F39cc9caf-900d-4947-916c-a26071041a43_1530x1272.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>One of the things that Jesse uses his productivity tools for is to improve those very tools:</p><blockquote><p>One of the first skills I taught Superpowers was<a href="https://raw.githubusercontent.com/obra/superpowers-skills/35c29f0fe22881149a991eca1276c148567a7c29/skills/meta/writing-skills/SKILL.md"> How to create skills</a>. That has meant that when I wanted to do something like add git worktree workflows [a technique for managing multiple projects in parallel] to Superpowers, it was a matter of describing how I wanted the workflows to go&#8230; and then Claude put the pieces together and added a couple notes to the existing skills that needed to clue future-Claude into using worktrees.</p></blockquote><p><strong>It&#8217;s hard to overstate just how self-referential this is</strong>. For instance, when Jesse uses his &#8220;Superpowers&#8221; tool to codify a skill, the tool uses its test-driven development module to verify that the new skill has been implemented correctly. It generates an example of a task that the new skill is meant to help with, verifies that it is unable to complete that task without the new skill, and then checks to see whether it <em>can</em> complete the task once the new skill has been installed. This is a very sophisticated approach, but it all falls out of the understanding of test-driven development that Jesse has already taught his agent:</p><blockquote><p>As Claude and I build new skills, one of the things I ask it to do is to &#8220;test&#8221; the skills on a set of subagents to ensure that the skills were comprehensible, complete, and that the subagents would comply with them. (Claude now thinks of this as TDD [Test-Driven Development] for skills and <em>uses</em> its RED/GREEN TDD skill as part of the skill creation skill.)</p></blockquote><p>He&#8217;s not the only engineer pursuing this path. In<a href="https://paradox921.medium.com/amplifier-notes-from-an-experiment-thats-starting-to-snowball-ef7df4ff8f97"> Amplifier: Notes from an experiment that&#8217;s starting to snowball</a>, Microsoft engineer Brian Krabach describes<a href="https://github.com/microsoft/amplifier"> Amplifier</a>, a rich toolkit built to facilitate the practice of moving advanced skills out of the programmer&#8217;s head and into a tool. To quote from the project&#8217;s home page:</p><blockquote><p>Amplifier is a coordinated and accelerated development system that turns your expertise into reusable AI tools without requiring code. Describe the step-by-step thinking process for handling a task&#8212;a &#8220;metacognitive recipe&#8221;&#8212;and Amplifier builds a tool that executes it reliably. As you create more tools, they combine and build on each other, transforming individual solutions into a compounding automation system.</p></blockquote><p>In his blog post, Brian (the lead engineer) explains how Amplifier is used to improve Amplifier:</p><blockquote><p>Out of the box, AI coding assistants are strong but context-blind. They don&#8217;t remember your patterns, don&#8217;t carry forward your prior decisions, and they need hand-holding on anything ambitious.<strong> </strong>&#8230; <strong>[Amplifier includes] tooling that can self-improve the system</strong> from observations where users have guided the assistant, reducing the need for the user to do so in the future.<br><br>...<strong>We build tools that build tools.</strong> A lot of Amplifier&#8217;s scenarios are literally tools that improve Amplifier - transcript processors, knowledge extractors, article illustrators, blog writers - each created from a &#8220;describe the thinking, then build it&#8221; flow. Over time, the environment gets smarter.<br><br>[emphasis added]</p></blockquote><p>Paul Payne is another engineer on Microsoft&#8217;s Amplifier team. He believes we are witnessing a profound change in the practice of software development. Last month, in a blog post titled<a href="https://payne.io/posts/historical-artifact/"> My Programming Career is a Historical Artifact</a>, he wrote:</p><blockquote><p><strong>This won&#8217;t be </strong><em><strong>the</strong></em><strong> thing that replaces me as a programmer, but it convinces me that our timeline is months, not years.</strong></p><p>With Amplifier, you now have the ability to describe what you want and it will build it for you, from design, to backend, to frontend, including testing, according to your own programming philosophy. But this isn&#8217;t the main point. The main point is that <strong>the more you use it, the better it gets. You create reusable tools over time. It captures useful techniques. It learns from your usage of it, suggesting ways to improve </strong><em><strong>itself</strong></em><strong>. This is a type of exponential productivity. The things you build help you make things faster.</strong></p><p>&#8230;</p><p>All of the programming skills I described above, the skills I developed over a lifetime, can now be accomplished by tools like Amplifier with the right prompting. The final stretch is to wrap up all this expertise and hand it over to the AI and then anyone, with no programming experience, will be able to have AI build whatever software they need, as they need it.</p><p>[emphasis added]</p></blockquote><p>I know of at least three examples of teams who are working in this style, building tools which build tools to improve their tool-building tools. Sam Schillace, manager of the Amplifier team (and my Google Docs co-founder) calls them <a href="https://sundaylettersfromsam.substack.com/p/i-have-seen-the-compounding-teams">compounding teams</a> &#8211; and they all report astonishing productivity that continues to spiral upward. Is this &#8220;it&#8221;? Are we witnessing the dawn of the Singularity?</p><h2>These Techniques Will Have Limits &#8211; For Now</h2><p>It&#8217;s clear that people like Liu Xiaopai, Jesse Vincent, Brian Krabach, Paul Payne, and Sam Schillace are experiencing something profound. And I know of other examples that aren&#8217;t yet public. Some people are proclaiming the end of the era in which human beings write code, or even review it. At the same time, I<a href="https://secondthoughts.ai/p/a-project-is-not-a-bundle-of-tasks"> recently reiterated</a> my belief that today&#8217;s cutting-edge AIs are still missing important cognitive capabilities, and that full automation of software engineering is years away. Do I need to throw that idea out the window?</p><p>My confidence is shaken, but not yet broken. I believe that what these developers are experiencing is real, but I don&#8217;t think they&#8217;ve &#8220;solved&#8221; programming (nor do I think any of them have claimed to have done so). Here are some reasons that I&#8217;m tempering my expectations.</p><p><strong>They&#8217;re all working on new, small projects</strong> (so far as we know). Software developers mostly spend their time extending and maintaining large, mature projects. I don&#8217;t just mean behemoths like Microsoft Office; even &#8220;upstarts&#8221; like Slack, Notion, or Figma have long since grown into complex codebases. To say nothing of the great mass of legacy software that runs inside your Toyota automobile or behind the scenes at United Airlines. The oldest and largest project the Amplifier team has reported working on is Amplifier itself. If these techniques are being applied successfully in larger, messier environments, I haven&#8217;t heard about it yet.</p><p><strong>I don&#8217;t expect these techniques to reliably handle complex, novel challenges</strong>. When asked to undertake a substantial project, current AI models can<a href="https://secondthoughts.ai/p/gpt-5-the-case-of-the-missing-agent"> go off the rails in all sorts of ways</a>. They fall down a rabbit hole investigating an unimportant detail, persist in attempting an unworkable approach, or hallucinate an incorrect understanding of their situation that renders all of their future efforts counterproductive. The systems these hyperproductive engineers are building incorporate mechanisms to keep their agents on track, but those mechanisms will only go so far. For instance, instructing an agent to review its own work can lead it to correct obvious errors, but there&#8217;s no guarantee that it will spot subtle mistakes or failures of judgement. It might even find unnecessary things to complain about, creating more work for itself.</p><p><strong>We&#8217;re witnessing a selection effect</strong>. These hyperproductive teams are, quite naturally, gravitating toward the kinds of tasks that best match the capabilities of their tools. We shouldn&#8217;t expect the current generation of those tools to provide the same level of productivity boost for other tasks, even if it&#8217;s not yet clear what sorts of tasks fall into each category. Nor should we expect just anyone to be able to follow in their footsteps; it takes a special sort of mind to use tools to instruct tools to improve their tools. (Yet another reason that<a href="https://secondthoughts.ai/p/uneven-impact-of-ai"> the future is already here, it&#8217;s just not evenly distributed</a>.)</p><p><strong>Small teams having a big impact is not a new phenomenon</strong>. Instagram was only 13 people on the occasion of its billion-dollar acquisition by Facebook. When we see these early examples of exceptional teams achieving exceptional things using AI, <strong>the key ingredient might be the exceptional teams</strong>.</p><p><strong>Everything is overestimated in the short run</strong>. When an exciting new phenomenon appears on the scene, it usually takes longer than we think for it to have a broad impact. Though <a href="https://edisonscientific.com/articles/announcing-edison-scientific">this process may run faster in the AI era</a>:</p><blockquote><p>It is often said that people overestimate what they can do in 2 years and underestimate what they can do in 10. These days, with the extraordinary pace of AI research, my impression is that we may actually overestimate what we can do in 6 months, and underestimate what we can do in 2 years.</p></blockquote><p>These are my reasons for expecting hyperproductivity to have limits, at least for a while. But even so, it may have a significant impact. We should strive to learn more about the scope of that impact.</p><h2>Hyperproductivity May Struggle to go Mainstream</h2><p>One thing I&#8217;ve heard repeatedly to describe this style of work is &#8220;exhausting&#8221;. Much of the productivity boost comes from being able to pursue several sub-projects at once, each using a separate AI agent. The human role is to be a simultaneous manager, tutor, and genetic engineer for a squad of tireless, but sometimes clueless, agents. Each agent needs to be kept busy with tasks, and those tasks need to be coordinated so as to prevent one agent from interfering with another&#8217;s work. At the same time, the hyperproductive worker is constantly evaluating their every move (and every move taken by their agents) to see whether it could be done more efficiently.</p><p>Perhaps the most exhausting aspect of this work is that it&#8217;s constantly evolving. You can never settle into a routine, because those routines are exactly what gets automated. Each step up the productivity ladder means a new way of working and a new set of skills to acquire.</p><p><strong>This may be a difficult transition for most people</strong>. Delegating all of your direct work to an AI is a major transition. Shifting your focus from doing the work to optimizing tools and workflows is another transition. Managing multiple agents is stressful &#8211; if exhilirating. All of this requires a new mindset and a different set of skills.</p><p>Software engineers at least start out with experience working with automation and, often, customizing their tools. The challenge may be even greater for people with other backgrounds. Hyperproductivity may not be limited to programmers, but that&#8217;s where the early examples have been found (even if they are sometimes using these techniques for tasks other than programming).</p><h2>What To Watch For</h2><p>As I mentioned at the outset, the new breed of hyperproductive teams have three things in common. They&#8217;re building lots of bespoke tools. They&#8217;re letting AIs do all of the direct work, reserving their own efforts to specify what should be done and to improve the tools. And they are achieving a compounding effect, using their tools to improve their tool-improving tools. The net effect is what I&#8217;m calling &#8220;hyperproductivity&#8221;:</p><blockquote><p><strong>A hyperproductive individual does not do their job; they delegate that to AI. They spend their time optimizing the AI to do their job better.</strong></p><p><strong>A hyperproductive individual may also spend time </strong><em><strong>deciding what the AI should do,</strong></em><strong> but that represents a failure to fully delegate</strong>.</p></blockquote><p>There are plenty of open questions. How large is the productivity boost? (It will certainly vary by team and task.) What is the range of situations for which AI systems have the needed capabilities? How many people will gravitate toward this style of work? What skills are required, how many people have them, how many can learn them, and how long will that take?</p><p>If this is going to be more than a niche phenomenon, we should expect to see the following things:</p><ol><li><p>A dramatic increase in the number of hyperproductive teams<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>.</p></li><li><p>In particular, a shift toward teams using &#8220;off the shelf&#8221; hyperproductivity frameworks (like Amplifier), rather than building their own recursive toolkits from scratch.</p></li><li><p>This approach being applied in a broad range of situations, some with large-scale impact; not just tiny teams working on greenfield projects and well-defined tasks.</p></li><li><p>Potentially, broad usage beyond software engineering.</p></li></ol><p>It will be very interesting to see whether this approach takes root inside the frontier AI labs. (Of course that might be happening already. For what it&#8217;s worth, the handful of anecdotal reports I have suggest not.) Plausibly, this is how an &#8220;intelligence explosion&#8221; would first manifest.</p><p>We should especially watch to see whether Claude Code, OpenAI&#8217;s Codex, or Google&#8217;s Jules are evolving to support the kinds of skill-accumulation and self-improvement techniques the hyperproductive teams are using, and <em>in particular</em> for signs that the frontier AI labs are prioritizing these capabilities in their model training process. Conceivably this could be the next big avenue for AI progress, tagging in if reinforcement learning &#8211; RL &#8211; starts to lose steam.</p><p>If hyperproductivity tools become easier to use, but remain limited to small, new projects, we should expect an explosion of those small projects. We&#8217;ll see less energy expended on getting things done in legacy applications.</p><p>Suppose, however, that the limitations I&#8217;ve suggested don&#8217;t pan out, and hyperproductivity turns out to be widely applicable, including for large, mature software engineering projects, and for tasks outside of engineering. What then?</p><h2>What Would Widespread Hyperproductivity Yield?</h2><p>If and when this new style of &#8220;compounding&#8221; work goes mainstream, I don&#8217;t think the immediate impact would be profound. It takes time to adapt workflows to take full advantage of a new technology. If Microsoft engineers can work ten times faster, Windows and Office wouldn&#8217;t be ten times better. Perhaps they could ship updates to Office ten times more often, but would that even be desirable?</p><p>We should look for a major impact when this approach rolls out within the labs that are developing cutting-edge AI. In the meantime, what we might most expect to see is a tsunami of fast-moving startups. The AI era was <em>already</em> having this effect, both because of the opportunity to build new AI-powered applications (like Cursor, a company which in three short years has achieved massive adoption and a $29 billion valuation), and because AI tools are goosing the productivity of nimble startup teams (see the story of Liu Xiaopai, above). Startups will find ways to bring the hyperproductive approach to an increasing number of domains, and the results will be dramatic. I&#8217;ll close with this quote from<a href="https://github.com/microsoft/amplifier/blob/main/AMPLIFIER_VISION.md"> The Amplifier Vision</a>:</p><blockquote><p><strong>The bottleneck isn&#8217;t AI capability; it&#8217;s human imagination in how to use it.</strong></p></blockquote><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/hyperproductivity?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/hyperproductivity?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Afra Wang, Amar Sood, Herbie Bradley, James Cham, Jesse Vincent, and Rohit Krishnan for suggestions and feedback (and to Abi for the post image).</em></p><div data-component-name="FragmentNodeToDOM"><h1>Appendix</h1></div><p>A few ideas that didn&#8217;t fit into the main piece:</p><p>1. Aggressive use of AI is, for the moment at least, expensive. Some hyperproductive teams are spending<a href="https://paradox921.medium.com/amplifier-notes-from-an-experiment-thats-starting-to-snowball-ef7df4ff8f97#:~:text=and%20reading%20docs.-,Not%20cheap,-.%20We%E2%80%99re%20pushing%20the"> thousands of dollars per day</a> to power their AI tools. In some cases they see this as worthwhile for its own sake. Sometimes they view it as an investment in understanding the future. But AI operating costs are notoriously plunging by 10x or more each year, and the range of applications for which this AI-intensive approach is cost-effective will widen rapidly.</p><p>2. <a href="https://tasklet.ai/">Tasklet</a> is a new service that builds AI agents on demand. &#8220;Just describe what you want in plain English, and Tasklet handles the rest.&#8221; Users can provide feedback to their agents, again in plain English, and the agent will update itself. There are parallels with the way the hyperproductive teams are working, though without the &#8220;compounding&#8221; aspect.<a href="https://www.cognitiverevolution.ai/always-bet-on-the-models-how-tasklet-puts-the-agency-in-agents-with-ceo-andrew-lee/"> An interview with Tasklet founder Andrew Lee</a> does provide hints that the team behind Tasklet is moving in the direction of hyperproductivity. For instance, they use Tasklet agents to test Tasklet.</p><p>3. Jesse Vincent<a href="https://blog.fsck.com/2025/10/09/superpowers/"> describes a particularly impressive use case</a>:</p><blockquote><p>I haven&#8217;t published all the skills Claude and I have built, because some of them are a little esoteric and some of the ones I&#8217;ve played with come from telling Claude &#8220;Here&#8217;s my copy of programming book. Please read the book and pull out reusable skills that weren&#8217;t obvious to you before you started reading&#8221; and I&#8217;m not sure how I feel about the IP aspect of that yet.</p><p>Stop and think about that last workflow a little bit. This is, I think, one of the foundational ideas in how all this works. You can hand a model a book or a document or a codebase and say &#8220;Read this. Think about it. Write down the new stuff you learned.&#8221; It does sometimes require helping the model look at the work through a specific lens (or a set of lenses). But it is insanely powerful. I&#8217;ve been experimenting with getting Claude to keep this idea in mind all the time, but I&#8217;m not yet happy with the output.</p></blockquote><p>4. The initial rush of bespoke software applications will be exhilarating, and will provide a productivity boost. But there will probably be diminishing returns. We won&#8217;t soon find ourselves writing 100 times more software and becoming 100 times more productive as a result.</p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Claude Code is a tool introduced by Anthropic earlier this year. It allows the Claude AI model to carry out extended projects on the user&#8217;s computer, working step by step and &#8211; ideally &#8211; noticing and correcting its own mistakes. Designed for software engineering tasks, it has proven useful for other sorts of work as well.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>I&#8217;m using &#8220;Claude&#8221; to refer to both the language model behind the Claude chatbot, and the Claude Code agent which uses that model.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Amplifier has 2500 stars and 184 forks on Github. I don&#8217;t know what this might imply about how many people are making serious use of it.</p></div></div>]]></content:encoded></item><item><title><![CDATA[A Coalition For The Future]]></title><description><![CDATA[If we can keep it]]></description><link>https://secondthoughts.ai/p/a-coalition-for-the-future</link><guid isPermaLink="false">https://secondthoughts.ai/p/a-coalition-for-the-future</guid><dc:creator><![CDATA[Anton Leicht]]></dc:creator><pubDate>Tue, 11 Nov 2025 22:01:28 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!VfhB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Here&#8217;s another guest post inspired by <a href="https://thecurve.goldengateinstitute.org/">The Curve</a>. This one is by Anton Leicht, a Visiting Scholar with the Carnegie Endowment&#8217;s Technology and International Affairs team. Anton writes about the political economy of AI progress on his excellent Substack, <a href="https://writing.antonleicht.me/">Threading the Needle</a>. He writes about the possibilities of cooperation that come into view at an event like The Curve, and the need to extend that spirit beyond the boundaries of the conference. (Note that the Golden Gate Institute for AI is not endorsing specific policy proposals.)</em></p><h1>A Coalition For The Future</h1><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!VfhB!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!VfhB!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp 424w, https://substackcdn.com/image/fetch/$s_!VfhB!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp 848w, https://substackcdn.com/image/fetch/$s_!VfhB!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp 1272w, https://substackcdn.com/image/fetch/$s_!VfhB!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!VfhB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp" width="1232" height="928" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/dbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:928,&quot;width&quot;:1232,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1150440,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/webp&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://secondthoughts.ai/i/178617868?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!VfhB!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp 424w, https://substackcdn.com/image/fetch/$s_!VfhB!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp 848w, https://substackcdn.com/image/fetch/$s_!VfhB!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp 1272w, https://substackcdn.com/image/fetch/$s_!VfhB!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fdbe0450f-5ac5-4117-a0df-710c0ca8c27f_1232x928.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Arriving in the Bay Area, it&#8217;s often hard not to feel like you&#8217;ve been missing out on something profound. It&#8217;s suggested to you in a myriad of ways that only here, you get to partake in some hidden conversation &#8211; about how technology, AI and thereby the world will unfold. And that only here, the true deliberations about the how and the why and the what-for can happen. <strong>Nowhere is that feeling stronger than at The Curve</strong> &#8211; a three-day conference bringing together leading figures in frontier AI: <a href="https://www.hyperdimensional.co/p/the-future-and-its-friends">policy thinkers</a> and lab researchers, safetyists and accelerationists, writers and readers, developers and <a href="https://x.com/deepfates/status/1975305177986179444">power users</a>. Everyone there has something interesting to say about AI, many carry real influence, some even outright power &#8211; and so it&#8217;s easy to feel the spark of &#8220;something happening&#8221;. This essay offers some reflections on how we could keep that spark.</p><p><strong>The mood at The Curve has a way of drawing you in. </strong>Many of my fellow attendees have written of their own experiences at this conference. Kylie Robison <a href="https://www.kyliebytes.com/week-two/">describes</a> checking into the conference venue &#8211; and Yoshua Bengio, godfather of AI, is next in line. Afra Wang <a href="https://substack.com/home/post/p-175596059?source=queue">mentions</a> looking across the table, seeing Ben Buchanan, President Biden&#8217;s former Special Advisor on AI, and surmising that this is &#8220;the room where it happens&#8221;. To me, even more striking than the makeup of the conversations was the common ground they repeatedly managed to find &#8211; between all corners of the frontier AI policy conversation, thoughtful exchange and meaningful progress felt genuinely possible. It&#8217;s the repeated exposure to moments like these that makes me share many attendees&#8217; appraisal: that <strong>The Curve catches lightning in a bottle, </strong>and it&#8217;s easy to walk away from it feeling optimistic about the state of AI policy.</p><h2>The Curve and the World Outside</h2><p>The AI world outside The Curve has gotten uglier since last year&#8217;s conference &#8211; since the state of play Dean Ball has dubbed the <a href="https://www.hyperdimensional.co/p/the-ai-republic-of-letters">AI Republic of Letters</a> last year. As a result, <strong>I found myself noticing quite a contrast between The Curve and the world outside</strong>. My piece the week before had been on the politically thorny and compelling issue of child safety &#8211; where I expressed my worry that harms to children were prone to be instrumentalised in ways that make the AI conversation worse, leading to solve-nothing policies designed to capitalise on grief and public attention. On the one hand, The Curve was a welcome reprieve from that: Attendees of course would disagree on whether the child safety issue was a big problem or which policy would be suitable to address it &#8211; but few were tempted to take cheap shots or easy political wins rather than furthering the substantive questions underneath.</p><p>On the other hand, <strong>I found myself wondering how The Curve&#8217;s sophistication might endure as the world outside grows grittier</strong>. The discourse outside the walls of Lighthaven, The Curve&#8217;s secluded Berkeley venue, cares less and less for nuance. It increasingly pulls toward discussing what plays best in 30-second clips of Senate hearings or topics that intersect most obviously with voters&#8217; anxieties come the midterms. Trends like these have often drowned out nuanced policy conversations on emerging technologies once they&#8217;ve become interesting to the mainstream and susceptible to political projection. In the face of that threat, The Curve &#8211; as a group of people united by a fundamental understanding of this technology and a desire to see its potential realized &#8211; faces a risk.<strong> </strong>It could get swept away by politics and break down entirely into the factions of the political fights to come: its safety advocates part of a broader pro-regulation coalition, its advocates for addressing current harms aligned with anti-tech sentiment, its pro-market and techno-optimist forces subscribed to naive accelerationism. <strong>That risk is real, but not insurmountable. </strong>We can still address it by consolidating around shared beliefs and commitments, and by negotiating eye-to-eye around our differences.</p><h2>Deals, Defections and Distractions</h2><p>This admittedly <strong>vague prescription becomes concrete in a policy question</strong> that dominated many backroom conversations at The Curve: <strong>Federal preemption</strong>. The <a href="https://www.hyperdimensional.co/p/be-it-enacted">issue</a> <a href="https://writing.antonleicht.me/p/a-preemption-deal-worth-making">on the</a> <a href="https://www.transformernews.ai/p/ai-deal-not-worth-making-preemption-legislation-regulation-safety">table</a> was a &#8220;grand bargain&#8221; across the breadth of The Curve&#8217;s attendees: the safetyist types get safety-focused federal regulation of frontier AI models, while the accelerationists get a broader federal preemption of the state-level laws they detest &#8211; in short, a deal setting up the necessary guardrails to let us speed ahead. In many ways, I feel like the debate around this proposal encapsulates the dilemma of The Curve.</p><p>On the one hand, I truly believe <strong>a deal like this is fundamentally <a href="https://writing.antonleicht.me/p/a-preemption-deal-worth-making">good</a>.</strong> It brings together the attendees&#8217; best impulses: their shared faith in the prosperity and progress that new technology can bring, their shared allegiance to a better future. A deal like this would create a cross-cutting coalition that opposed the political grifters and growing technosceptical forces. And for all the political uncertainties and coalitionary risks, I did not talk to many people at The Curve who would not put this deal on the books if they could. Facing the prospect of uglier AI politics from 2026 onward, a deal would let today&#8217;s incumbents establish a framework &#8211; while they still can.</p><p><strong>But internal conflicts threaten this way forward. </strong>The Curve is not made up of one tribe or one clique, and its attendees frequently find themselves on opposite ends of important policy discussions. Distrust runs deep: can we really trust the other side with a deal? Do they really want this, or are they just out to get us? Oh, but maybe we can get an <em>even better</em> outcome by pulling a fast one on the other camp in the process? It&#8217;s very hard to surmount instincts like these under ideal circumstances. And we&#8217;re far from ideal circumstances: The Curve&#8217;s attendees are a heterogenous mix, many of whom were on opposite sides in last year&#8217;s battle over California&#8217;s SB-1047. People remember the sometimes-ugly past policy fights. As their broader influence wanes, they sometimes take solace in the prospect of a fight they can win, even if it is only within a small corner of the broader conversation.</p><p><strong>Surmounting this, I believe, is a matter of recognizing the threat of mutual marginalization</strong>, and responding by joining forces wherever possible. Don&#8217;t mistake that advice for a sentimental sense of &#8220;everyone was so nice, and I wish they all worked together&#8221; &#8211; of course the world is more complicated than that. But the narcissism of small differences has a way to play tricks on your mind, and getting ahead of it begins with naming it and realizing its pervasive influence. Translated into policy terms, that means that &#8220;beating the accelerationists&#8221; or &#8220;beating the safetyists&#8221; respectively should not qualify for a primary policy objective for either side in the political fights to come. <strong>If you take away anything at all from this conference, it should be this: think twice before unleashing your PAC onto someone almost as optimistic about technology as yourself, or calling up the spirits of populist backlash against the few political elements that share your sense of AI&#8217;s transformative potential.</strong></p><h2>Back to Reality</h2><p>I finish this piece a few weeks after The Curve, after pieces of mine and others on the prospects of a deal, and many conversations and discussions. That allows me to end by asking: How is all that going? In the weeks since The Curve, we&#8217;ve seen the threat and promise of both paths ahead in equal part. A divisive public statement calling for the prohibition of superintelligence has confirmed many observers&#8217; biggest concerns about the safety movement; vociferous social media posts about Anthropic and California&#8217;s SB-53 leave just as many observers doubting the inclination of the accelerationist crowd to compromise. <strong>But we&#8217;ve also seen rare signs of interest in agreement and movement towards common ground. </strong>The White House&#8217;s Sriram Krishnan reduced the heat through a thoughtful post on his disagreements with the safety movement and ways to reduce them. Many safety advocates have engaged in kind. It&#8217;s to The Curve&#8217;s extraordinary credit that one of Krishnan&#8217;s asks was more conversations about AI scenarios between fast-timeline advocates on AI progress and a comparative skeptic like Sayash Kapoor &#8211; just what had happened at an in-depth three-hour session at the conference.</p><p>While I write these final lines, quite a lot is happening all at once. People from different camps are meeting and having conversations, in Berkeley and on Capitol Hill. Deals are being tested, options being scoped. And everyone is gearing up: coming up with favourable political wedges, funneling donations to candidates, arming and aiming their super-PACs. I am not certain which way all this will go. But I do know what I think we should do: try to find common ground rather than trying to win the fight within the walls of Lighthaven. I also know that The Curve is a force pulling us in the right direction: it encourages just the right conversations and often succeeds in getting just the right people in just the right room. <strong>If, by next year&#8217;s conference, we look back on this year as one of d&#233;tente and rapprochement, The Curve will have played a vital part. </strong>I, for one, will spend that year working to keep the conversations intact, the channels open, and our worst instincts at bay. I look forward to meeting again next year, believing, still, that we can get this right.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/a-coalition-for-the-future?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/a-coalition-for-the-future?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p><em>Thanks again to Anton Leicht for helping to place The Curve in a larger context. For more of Anton&#8217;s writing, check out his newsletter, <a href="https://writing.antonleicht.me/">Threading the Needle</a>.</em></p>]]></content:encoded></item><item><title><![CDATA[A Project Is Not a Bundle of Tasks]]></title><description><![CDATA[Current AIs struggle to create a whole that exceeds the sum of its parts]]></description><link>https://secondthoughts.ai/p/a-project-is-not-a-bundle-of-tasks</link><guid isPermaLink="false">https://secondthoughts.ai/p/a-project-is-not-a-bundle-of-tasks</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Tue, 04 Nov 2025 00:33:14 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!_dVa!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!_dVa!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!_dVa!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp 424w, https://substackcdn.com/image/fetch/$s_!_dVa!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp 848w, https://substackcdn.com/image/fetch/$s_!_dVa!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp 1272w, https://substackcdn.com/image/fetch/$s_!_dVa!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!_dVa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp" width="1232" height="928" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:928,&quot;width&quot;:1232,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!_dVa!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp 424w, https://substackcdn.com/image/fetch/$s_!_dVa!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp 848w, https://substackcdn.com/image/fetch/$s_!_dVa!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp 1272w, https://substackcdn.com/image/fetch/$s_!_dVa!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F40bba638-f578-44b9-bc5a-723d7c517d4a_1232x928.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Midjourney, <em>AI Struggling With Time Management</em>, 2025</figcaption></figure></div><p>Today, AIs can do some things. Eventually, they may be able to do everything. Forecasts generally project forward from what AI can do today, estimating how long it will take to fill in the gaps.</p><p>We often underestimate the size of those gaps. Some of this is just &#8220;out of sight, out of mind&#8221; (or, if you prefer, &#8220;availability bias&#8221;) &#8211; <a href="https://secondthoughts.ai/i/151564809/some-things-ai-cant-yet-do">it&#8217;s easier to notice the things AIs are doing than the things they aren&#8217;t</a>. But also, AIs have important gaps in fuzzy high-level skills that we don&#8217;t have great vocabulary for. These skills are applied diffusely across our workday instead of showing up as a crisp bullet on a todo list, making it easier to forget they exist.</p><p>In today&#8217;s post, I&#8217;m going to discuss some of the capabilities that AIs will need to acquire in order to progress from carrying out tasks to automating entire projects and jobs. I&#8217;ll also argue against some specific ideas that are sometimes presented in arguments for &#8220;short timelines&#8221; (AGI arriving within the next few years).</p><p>I&#8217;ll mostly be talking about software engineering. Automated coding is central to many scenarios of rapid AI progress, and it&#8217;s the domain I know best. But the principles I&#8217;m going to discuss apply to many fields.</p><h1><strong>Projects Don&#8217;t Decompose Into Tasks</strong></h1><p>We often analyze AI&#8217;s aptitude for a job by enumerating the tasks involved in that job. However, the whole is more than the sum of its parts. Breaking up a job (or a large project) into tasks is a useful mental scaffold, but it&#8217;s also an oversimplified way of thinking about things. The boundaries between tasks are not clean; information bleeds across. To quote from what I wrote in entry 24 of<a href="https://secondthoughts.ai/p/thoughts-about-agi-and-gpt-5"> 35 Thoughts About AGI and 1 About GPT-5</a>:</p><blockquote><p>...A large project does not neatly decompose into tidy subprojects.<br><br>For instance, when I tackle a subtask during a large software project, the result is not just that a certain chunk of the code gets written. I come out with a slightly deeper understanding of the problem. I may have learned new things about the existing codebase. I may have hit upon a handy trick for testing the kinds of operations that this code performs, or had an insight about the data being processed. I may have gotten some little nudge that will eventually accumulate with 20 other nudges across other portions of the project, eventually leading me to rethink my entire approach. If the subtask is assigned to a separate agent, whose memories are discarded as soon as the subtask is complete, none of that learning can take place.</p></blockquote><p>If we think of a job as just a collection of tasks, we miss something important. A recent article,<a href="https://www.worksinprogress.news/p/why-ai-isnt-replacing-radiologists"> AI isn&#8217;t replacing radiologists</a>, provides a nice illustration of this in another domain: a radiologist doesn&#8217;t just analyze images, they actually spend more time &#8220;on other activities, like talking to patients and fellow clinicians&#8221;.</p><h1><strong>If You Can Plan a Month, Can You Plan a Year?</strong></h1><p>To make concrete forecasts about the future, we tend to look for a graph that plots the thing we&#8217;re interested in, and extend the line. When discussing the future of AI, people often turn to this graph (I wrote an<a href="https://secondthoughts.ai/p/measuring-ai-progress"> entire post</a> about it):</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QZTh!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QZTh!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg 424w, https://substackcdn.com/image/fetch/$s_!QZTh!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg 848w, https://substackcdn.com/image/fetch/$s_!QZTh!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!QZTh!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QZTh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg" width="1456" height="853" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:853,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!QZTh!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg 424w, https://substackcdn.com/image/fetch/$s_!QZTh!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg 848w, https://substackcdn.com/image/fetch/$s_!QZTh!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!QZTh!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7631c24c-1ef1-4ce2-8f38-b060ae80c0b6_1456x853.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><a href="https://metr.github.io/autonomy-evals-guide/gpt-5-report/">source: METR</a></figcaption></figure></div><p>Roughly speaking, the vertical axis shows the size of software engineering tasks an AI can complete on its own<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. One argument goes that AIs will be able to fully automate software engineering when they can independently handle large projects; people define &#8220;large&#8221; as anything from 1 month to 100 person-years or more. This seems pretty intuitive, and it has the nice property that it converts &#8220;fully automated software engineering&#8221; from an abstract concept into a point on this handy graph &#8211; a graph that exhibits a reassuringly predictable trend.</p><p>However, there are questions as to the nature of that trend. It appears to bend upwards near the end. Is that just a blip, or does it indicate that AI capabilities are accelerating? One argument is that as AIs acquire the skills to carry out intermediate projects, those same skills will enable them to carry out large projects. This would imply that while it may have taken a lot of R&amp;D for AIs to progress from 1-minute tasks to 2-minute tasks, or 1 hour to 2 hours, it will be comparatively easy to advance from 1 week to 2 weeks. And an AI that can carry out 1-month tasks may be ready to carry out projects of any size. Here&#8217;s one expression of this idea:</p><blockquote><p>...it seems like once you have the agency skills to make consistent progress on very hard tasks (e.g. via task decomposition, noticing mistakes and error correcting, planning), you can move through the increased horizons very quickly because you [are] solving tasks by essentially applying the same skills over and over<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>.</p></blockquote><p>I think this discounts the complexity of managing a large project. I&#8217;ll illustrate this by talking about two software engineering projects in which I played a leading role, a 1-person-year project and a (roughly) 100-person-year project.</p><h2>Launching Google Docs: 1 Person-Year</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!nkE5!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!nkE5!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png 424w, https://substackcdn.com/image/fetch/$s_!nkE5!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png 848w, https://substackcdn.com/image/fetch/$s_!nkE5!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png 1272w, https://substackcdn.com/image/fetch/$s_!nkE5!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!nkE5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png" width="1456" height="415" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:415,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!nkE5!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png 424w, https://substackcdn.com/image/fetch/$s_!nkE5!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png 848w, https://substackcdn.com/image/fetch/$s_!nkE5!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png 1272w, https://substackcdn.com/image/fetch/$s_!nkE5!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a02acdf-c4b1-41e9-89bb-8ec7a2afb667_1600x456.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">I have no idea where this sample text comes from.</figcaption></figure></div><p>In 2005, my friend Sam Schillace had the idea that would eventually become Google Docs. With another friend (Claudia Carpenter), we spent about 4 months building an alpha version, which we released under the name &#8220;Writely&#8221;.</p><p>Those 4 months didn&#8217;t involve much big-picture thinking. I hacked together a server to store documents, Claudia designed and coded the toolbar and other UI elements, Sam built the web page that wired it all together. Problems arose; we fixed them. Everything was done seat-of-the-pants, we addressed problems as they came up, a lot of cans were kicked down the road. This is not to say that the work was easy: I had to come up with a way to synchronize changes when multiple people were editing the same document; Sam and Claudia had to wrestle with cutting-edge browser features that were often buggy and poorly documented. But it was all kind of &#8220;small ball&#8221;: it wasn&#8217;t hard to keep the entire system in our heads, and we didn&#8217;t worry about the long-term consequences of any decisions we were making.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><h2>Building Scalyr: 100 Person-Years</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!n5ws!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!n5ws!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png 424w, https://substackcdn.com/image/fetch/$s_!n5ws!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png 848w, https://substackcdn.com/image/fetch/$s_!n5ws!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png 1272w, https://substackcdn.com/image/fetch/$s_!n5ws!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!n5ws!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png" width="1280" height="720" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:720,&quot;width&quot;:1280,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!n5ws!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png 424w, https://substackcdn.com/image/fetch/$s_!n5ws!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png 848w, https://substackcdn.com/image/fetch/$s_!n5ws!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png 1272w, https://substackcdn.com/image/fetch/$s_!n5ws!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F845eb388-b8cd-42d2-a7b7-f2bc003efa21_1280x720.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>In 2011, I founded another startup, Scalyr. A team that grew to dozens of engineers spent 10 years building a system that could analyze vast quantities of log data to troubleshoot problems in complex web applications.</p><p>This was an entirely different kettle of fish from Writely, and my 2005 self would never have pulled it off. I relied on lessons I&#8217;d learned by fumbling my way through large-scale projects at Google (after Writely was acquired), as well as skills acquired on the job at Scalyr. Building Scalyr involved things such as:</p><ol><li><p><strong>Finding systematic solutions to operational issues</strong>. Scalyr was a complex application, relying on coordinated activity across thousands of servers. Fixing problems one-by-one would have been unsustainable. To keep things on an even keel, it was necessary to identify patterns of behavior, and find a single change that would eliminate an entire category of problems at once. Furthermore, each change had to be something that could be accomplished with minimal effort or disruption. To play my part, I had to lean on years of general experience in sniffing out software bugs, further years of specific experience with the system we built at Scalyr (the sort of &#8220;continuous learning&#8221; that current AIs don&#8217;t do), and hard-won judgement as to which changes would be safe and tractable.</p></li><li><p><strong>Strategic planning and execution</strong>. Midway through building the company, we encountered major setbacks &#8211; our largest customer abruptly departed, and we were struggling to land new clients. We needed to find a new course. After months of discussion, we decided that our core competitive advantage lay in something that had been a minor detail in our original product conference; processing large volumes of data at high speed and low cost. We then planned and executed a multi-year effort to expand on that advantage. A few skills required here: figuring out which parts of our complex server architecture needed to change; breaking those changes down into incremental steps that wouldn&#8217;t be too complex for the team to handle; identifying the critical assumptions in our plan, and finding ways to validate those assumptions early; spotting problems early and correcting them with minimal disruption to the plan.</p></li><li><p><strong>Developing techniques to diagnose subtle performance issues</strong>. Our performance goals required that in a fraction of a second, a task could be distributed across thousands of servers, carried out, and the results consolidated &#8211; efficiently flowing alongside other tasks. Seemingly minor changes to the code could disrupt the smooth flow and destroy our competitive advantage. To solve such problems, we had to find ways to record enough data to allow us to analyze task flow, without recording so much data as to be unmanageable. This required high-level judgement in identifying the critical hinge points in the system, and only collecting data at those critical points.</p></li></ol><p>(I also shared some thoughts about the different skills involved in small vs. large projects in<a href="https://secondthoughts.ai/p/thoughts-about-agi-and-gpt-5"> 35 Thoughts About AGI and 1 About GPT-5</a> &#8211; entries 19 through 25.)</p><h1><strong>What Got AI Here Won&#8217;t Get It There</strong></h1><p>As Claude puts it:</p><blockquote><p>&#8220;What Got You Here Won&#8217;t Get You There&#8221; is a concept popularized by executive coach Marshall Goldsmith in his book of the same name. The core idea is that <strong>the skills, habits, and behaviors that lead to success at one level often become obstacles at the next level</strong>.</p></blockquote><p>My personal experience as a software engineer supports this idea. The &#8220;agency skills&#8221; that were sufficient banging out the original Writely prototype would have been utterly inadequate for managing the first 100 person-years of Scalyr, and indeed would often have gotten in the way. I expect that this generalizes, and the level of skill at task decomposition, error correction, and planning that will allow an AI to navigate 1-year projects will not suffice for larger projects.</p><p>Nor is the difference just about generic &#8220;agency skills&#8221;. Acting as a senior engineer at Scalyr required a lot of very specific, advanced domain skills that only come after years of experience. For instance, learning to look at a report of the last month&#8217;s operational issues and spot a pattern that will allow an entire category of problems to be stamped out through a single change. These sorts of skills are generally not necessary for smaller projects, <em>nor does experience at smaller projects necessarily help develop them</em>. Thus, when AI systems master 1-year projects, I think that larger projects will still challenge them.</p><h1><strong>Large-Scale Projects Stress Deep Cognitive Skills</strong></h1><p>Large-scale projects don&#8217;t just require project management skills and advanced domain knowledge. They lean on a cluster of cognitive skills that are lacking in current AIs:</p><p><strong>Context management</strong>. The bigger the project, the more details you need to keep in mind. This is especially true for big-picture planning, but it even applies when working on specific tasks. It&#8217;s important to consider the relevant details &#8211; which means you need to be able to pick those details out from the larger mass of information that a large project generates (entry 30 of<a href="https://secondthoughts.ai/p/thoughts-about-agi-and-gpt-5"> 35 Thoughts</a>).</p><p><strong>Continuous learning and adaptability</strong>. Seasoned veterans on long-running projects are valuable in part because they have honed bespoke skills specifically adapted to that project. They have become expert at troubleshooting the exact sorts of problems that come up, and they automatically write code that fits with the project&#8217;s specific design choices.</p><p><strong>Metacognition</strong>. On a small project, you just dive in and get it done. On a large project, it&#8217;s worth your while to analyze what is and isn&#8217;t working, where you (or your team) tend to get stuck, what sorts of techniques work well, and where you need to develop new skills.</p><p>Nathan Lambert<a href="https://www.interconnects.ai/p/burning-out/comments"> recently touched on all of this:</a></p><blockquote><p><strong>The amount of context that you need to keep in your brain to perform well in many LM training contexts is ever increasing.</strong> For example, leading post-training pipelines around the launch of ChatGPT looked like two or maybe three well separated training stages. Now there are tons of checkpoints flying around getting merged, sequenced, and chopped apart in part of the final project. Processes that used to be managed by one or two people now have teams coordinating many data and algorithmic efforts that are trying to land in just a few models a year. <strong>I&#8217;ve personally transitioned from a normal researcher to something like a tech lead who is always trying to predict blockers before they come up (at any point in the post-training process) and get resources to fix them. </strong>I bounce in and out of problems to wherever the most risk is.</p><p>[emphasis added]</p></blockquote><h1><strong>We Won&#8217;t Have Automated Software Engineering Until AIs are Leading 100-Person-Year Projects</strong></h1><p>Impactful software engineering projects are usually large, complicated beasts &#8211; if not at first (Writely), then as they grow and mature (Google Docs). Certainly there are complex projects taking place inside the big AI labs. Analysis based on the automation of specific tasks misses the forest for the trees. Software engineering will not be fully automated until AIs can handle projects of the 100-person-year scale (or more). How long until that happens?</p><p>It&#8217;s impossible to predict with any certainty. But we can take a stab. The graph I presented at the top of this post shows current AI models achieving a 50% success rate at tasks that take about two person-hours. The <a href="https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/">paper where that graph originated</a> finds that the manageable task size doubles every 7 months. It would take about 16 doublings to get from two person-hours to 100 person-years<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>; at 7 months per doubling, that puts us around the beginning of 2035. Advancing from a 50% success rate to full competence might take a few more years, putting automated software engineering in the late 2030s. <a href="https://secondthoughts.ai/i/162001832/how-applicable-to-the-real-world-are-these-results">Adjusting for real-world conditions</a> might push the schedule out further.</p><p>The same paper finds preliminary evidence that the doubling time might now be shrinking to just 4 months, which (if we discount adjustments for real-world conditions) would imply fully automated software engineering in the early 2030s. That would be startlingly fast in terms of its impact on the world, but considerably slower than some scenarios for AI progress. Those aggressive scenarios generally assume that the rate at which AI climbs the ladder of project scope will continue to accelerate, and/or that the task scope threshold for full automation is far lower than 100 person-years. In this post, I&#8217;ve explained why I don&#8217;t agree.</p><p>These estimates are balanced on the slender reed of a single graph from a single study. We shouldn&#8217;t take any of them too seriously. But that graph is the single best tool we currently have for answering a critical question about the likely pace of AI progress. And I think readings of that graph that point toward rapid automation neglect the full complexity of software engineering.</p><p>The most likely scenario for faster progress would involve unexpected breakthroughs, either in the design of AI models, or in the way we apply them to the task of large-scale software development (perhaps we will find ways to use AI&#8217;s superhuman strengths to better compensate for its weaknesses). Breakthroughs, of course, are hard to predict &#8211; and hard to rule out.</p><p>Sam Altman <a href="https://x.com/sama/status/1983584366547829073">recently announced</a> that OpenAI has an internal goal to create &#8220;a true automated AI researcher by March of 2028&#8221; &#8211; just 2.5 years from now. Jakub Pachocki, OpenAI&#8217;s chief scientist, <a href="https://techcrunch.com/2025/10/28/sam-altman-says-openai-will-have-a-legitimate-ai-researcher-by-2028/">made clear this means</a> a &#8220;system capable of autonomously delivering on larger research projects.&#8221; This seems unambiguously inconsistent with the observed trend of progress in AI capabilities. That doesn&#8217;t mean it can&#8217;t happen! But it can&#8217;t happen without a major breakthrough<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a>.</p><p>In the meantime, of course, we will have <em>partial</em> automation; code-authoring tools are already providing value, though <a href="https://secondthoughts.ai/p/ai-coding-slowdown">the benefits may often be overestimated</a>. There are (<a href="https://x.com/RyanPGreenblatt/status/1981012208332280219">disputed</a>) reports that AI tools are already writing 90% of code at one frontier AI developer. If partial automation can drastically improve the productivity of engineering teams, that may have a major impact on AI timelines, not to mention the job market for software engineers, and the tech industry in general. I am also somewhat bearish on drastic effects from partial automation, but that&#8217;s a subject for another day.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/a-project-is-not-a-bundle-of-tasks?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/a-project-is-not-a-bundle-of-tasks?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Abi Olvera, Eli Lifland, James Cham, John Hart, Michael Chen, Nate Rush, and Taren Stinebrickner-Kauffman for feedback (and to Abi for the post image).</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Actually, the graph shows the size of tasks for which the success rate is 50%. The size of tasks which AIs can <em>reliably</em> carry out is much less. Also note that the measurements are specifically for software engineering-style tasks that can be graded automatically (e.g. &#8220;does this code pass tests&#8221;, not &#8220;is it easy to understand and consistent with style guidelines&#8221;), and the time scale is calibrated against human engineers who are not familiar with the specific codebase the task applies to.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>From an unpublished draft of a forecast I was asked to review.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p> Assuming a 2000-hour work year.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p> It&#8217;s certainly possible that we could see one-off examples of an AI system carrying out something that could be characterized as a &#8220;research project&#8221;, just as Google DeepMind&#8217;s AlphaEvolve system has already delivered a scattering of incremental advances in areas relevant to AI, such as matrix multiplication algorithms. But it would be far off-trend to see anything in March 2028 that can independently carry out &#8220;larger research projects&#8221; at anything like the capability of a human researcher, with any generality, and at anything approaching a competitive cost.</p></div></div>]]></content:encoded></item><item><title><![CDATA[What I Saw Around The Curve]]></title><description><![CDATA[Notes from the near-future of AI]]></description><link>https://secondthoughts.ai/p/what-i-saw-around-the-curve</link><guid isPermaLink="false">https://secondthoughts.ai/p/what-i-saw-around-the-curve</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Wed, 29 Oct 2025 23:49:51 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/207d63fb-e61e-4669-be19-de214cdcea1a_1232x928.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>After <a href="https://thecurve.goldengateinstitute.org/">The Curve</a>, Eli Pariser wrote an extensive account of things he learned at the conference. Full of insights, it&#8217;s a fascinating glimpse into the conversations that arise when a few hundred insiders from across the AI landscape get together. He gave permission for us to share it with you.</em></p><p><em>Eli is author of The Filter Bubble and Co-Director at New_ Public, a nonprofit product studio focused on reimagining community, connection, and conversation online. You can follow their biweekly newsletter here: <a href="https://newpublic.substack.com/">newpublic.substack.com</a>.</em></p><p><em>Coming up next, I&#8217;ll be posting <strong>A Project is Not a Sequence of Tasks</strong> (working title) &#8211; some musings on the path to full automation of software engineering, drawing on my own 40-year career writing code.</em></p><p><em>And now, over to Eli!</em></p><h2><strong>What I saw around The Curve</strong></h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!WBmn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!WBmn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!WBmn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!WBmn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!WBmn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!WBmn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg" width="800" height="533" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:533,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;20251003 The Curve Berkeley-242.jpg&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="20251003 The Curve Berkeley-242.jpg" title="20251003 The Curve Berkeley-242.jpg" srcset="https://substackcdn.com/image/fetch/$s_!WBmn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!WBmn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!WBmn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!WBmn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75666a7a-bf65-4048-a434-89df70c78c08_800x533.jpeg 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>I attended <em>The Curve</em>, a conference of ~350 top AI lab leaders and scientists, safety activists and alarmists, political advisors and lobbyists, journalists, and civil society leaders. The event took place in Lighthaven, a quirky cluster of houses in Berkeley that have been retrofitted into small conference spaces. Many sessions were in living room-sized spaces, so it was all very personable and informal &#8212; an impressive feat by the organizing team.</p><h4>The vibe: Convivial, affable, and slightly doomy</h4><p>Many participants seemed thrilled, awed, and also deeply worried. Someone described it as &#8220;snorting pure San Francisco.&#8221; At 45 I was one of the older participants &#8212; the average age was mid-30s, but many of the most powerful people were in their late 20s. Pretty much every single person I talked to was smart, relatively humble, and pleasant. You couldn&#8217;t throw a rock without hitting someone with a large Substack following.</p><p>(Some sessions were on the record, some off, thus the vague sourcing in some places below.)</p><h4>&#8220;It will accelerate&#8221;</h4><p>There was broad consensus that the pace of progress in AI models will continue to accelerate, though lots of debate about how quickly. A senior lab official said that 90% of the code in one of the frontier labs is now written by AI models (!!), and that they&#8217;re already replacing some entry-level technical jobs with automated agents.</p><p>(Another person told me that this was a bit of a canard &#8212; these are fairly rigid, well-specified jobs and the labs obviously have the most expertise in how to fine-tune models to meet their own specific needs &#8212; so they have an unrealistic sense of how hard it will be to apply AI this extensively outside their own walls.)</p><p>An official from another lab pointed out that the scope of tasks chatbots can handle autonomously keeps rising<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!p_LT!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!p_LT!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png 424w, https://substackcdn.com/image/fetch/$s_!p_LT!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png 848w, https://substackcdn.com/image/fetch/$s_!p_LT!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png 1272w, https://substackcdn.com/image/fetch/$s_!p_LT!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!p_LT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png" width="1300" height="776" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:776,&quot;width&quot;:1300,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Length of asks AIs can do is doubling every 7 months&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Length of asks AIs can do is doubling every 7 months" title="Length of asks AIs can do is doubling every 7 months" srcset="https://substackcdn.com/image/fetch/$s_!p_LT!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png 424w, https://substackcdn.com/image/fetch/$s_!p_LT!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png 848w, https://substackcdn.com/image/fetch/$s_!p_LT!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png 1272w, https://substackcdn.com/image/fetch/$s_!p_LT!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2ce9eb05-6dba-4801-9f0d-a65f165a96d1_1300x776.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><blockquote></blockquote><p>There was a general sense that &#8220;recursive self-improvement,&#8221; where labs build automated AI researchers that develop better AI models that build better researchers, is still some time away &#8212; possibly just a few years, possibly much longer.</p><p>The general lab belief here (which is a bit maximalist), as one person described it, is that there are no barriers on the path to true human-level general intelligence (AGI):</p><ul><li><p>&#8220;Capabilities based on current methods will continue to improve smoothly&#8221;</p></li><li><p>&#8220;New methods will continue to arrive and accelerate progress&#8221;</p></li><li><p>&#8220;Compute development will accelerate and compute bottlenecks will be overcome&#8221;</p></li></ul><h4>&#8220;Normal technologists&#8221; and those worried about accelerating existential risk agree that it&#8217;s likely there will be rapid near-term acceleration</h4><p>One of the big debates in AI is whether this is a &#8220;normal technology&#8221; &#8212; whose progress will be subject to friction and delay as it is adopted by human social systems &#8212; or it&#8217;s going to start improving itself exponentially and &#8220;take off&#8221;. Generally the &#8220;takeoff&#8221; people don&#8217;t think this will end well<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>.</p><p>There was a great session where one of the originators of the Normal Technology hypothesis (Sayash Kapoor) and the lead author of the <em>AI 2027</em> paper talked about what they agree on. There was broad agreement from both that, in the short term, AI<em> may</em> advance rapidly:</p><ul><li><p>The size of coding tasks which AI systems can carry out independently may quadruple within a year</p></li><li><p>Revenue in 2026 will continue to grow exponentially &#8212; perhaps reaching $60-70B / year</p></li></ul><p>They did not agree on recursive self-improvement, however &#8212; the Normal Technology team thinks that&#8217;s a long way off, if it ever happens. Both sides generally agreed that there&#8217;s a big gulf between rapid progress in a lab and deployment in the real world.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!BPdY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BPdY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!BPdY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!BPdY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!BPdY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BPdY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg" width="800" height="533" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:533,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;20251003 The Curve Berkeley-55.jpg&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="20251003 The Curve Berkeley-55.jpg" title="20251003 The Curve Berkeley-55.jpg" srcset="https://substackcdn.com/image/fetch/$s_!BPdY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!BPdY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!BPdY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!BPdY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd7ff9d49-2165-438b-be74-fe0e072a0d74_800x533.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Not every big idea was presented on stage. Some began in moments like this.</figcaption></figure></div><p>My sense is that Normal Technologists anticipate the speed of change to be substantially higher than people who aren&#8217;t following AI closely &#8212; and that the Normal Technology view is often misunderstood as predicting slower progress than its authors actually expect.</p><h4>But&#8230; the frontier may remain &#8220;jagged&#8221;</h4><p>AI policy leader Helen Toner gave a great talk pointing out that right now, AI capability is &#8220;jagged&#8221; &#8212; just because LLMs can do some things doesn&#8217;t mean that they can perform other tasks of seemingly equal difficulty.</p><p>She suggested that this may be a permanent state of affairs, because:</p><ul><li><p>It is easier to check whether an AI&#8217;s work is correct for some tasks than others</p></li><li><p>For many tasks it&#8217;s difficult to encapsulate all of the necessary context in a way that&#8217;s accessible to a chatbot</p></li><li><p>The level of adversariality of tasks will differ, and working against adversaries is a different kind of problem</p></li><li><p>Some tasks that seem cognitive will continue to require embodiment (Toner used the example of deciding on an event venue).</p></li><li><p>Different combinations of capacities have different implications for who benefits, who loses, who has power, and who is provoked</p></li></ul><h4>The emerging politics are very weird</h4><p>On the one hand, there&#8217;s a relatively bipartisan elite consensus.</p><p>The general sense was that there&#8217;s considerable continuity between Biden and Trump AI policies. There was a session between high-level Biden and Trump AI advisors that I didn&#8217;t see much of, but it was described as largely aligned and convivial.</p><p>Rep. Sam Liccardo, Congressman from the San Francisco Peninsula, who was interviewed at the conference, said he expected some kind of federal preemption to pass &#8212; that colleagues in tough races probably wouldn&#8217;t vote for it, but everyone else will. He described elite political opinion as being focused on a mix of China, jobs, cybersecurity, and bioterrorism.</p><p>On the other hand, there&#8217;s a kind of cross-partisan populist angst.</p><p>The most packed session of the entire conference was Taren Stinebrickner-Kauffman interviewing Joe Allen, a Steve Bannon-aligned figure who is strongly anti-transhumanist<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>, anti-surveillance, and largely anti-AI from a somewhat apocalyptic Christian point of view.</p><p>Meanwhile, union leaders, who are already very skeptical about AI, are getting pushback from rank-and-file members for engaging with AI or AI labs at all. And more broadly, Americans across the political spectrum seem to clock that this is unlikely to be good for them or their jobs.</p><p>It also seemed clear that there&#8217;s going to be a lot of money spent in a battle between AI safety and AI accelerationist forces. Many of the newly minted centimillionaires and billionaires of the AI age are deeply concerned about runaway AI creating catastrophic risks and are pouring large sums into safety efforts. In contrast, Marc Andreessen, OpenAI President Greg Brockman, and others are launching a <a href="https://fortune.com/2025/08/26/openai-president-greg-brockman-andreessen-horowitz-super-pac-ai-pro-innovation/">$100M pro-AI, anti-regulation Super PAC</a>. It sure looks like this is going to make the wars over crypto look like a drop in the bucket.</p><p>In this context, it was interesting to note that while concerns about work and job displacement are a major part of the mix, there&#8217;s a lot of energy around regulating how predatory chatbots interact with children. The organizer of a right-leaning AI safety organization told me she had tested a wide range of messages. Worries about sexually predatory chatbots forming relationships with kids, as well as driving them to suicide, were by far the most energizing.</p><p>I also attended a session by the Future of Life Institute&#8217;s Hamza Chaudhry, who is thinking very seriously about how chatbots will be weaponized to infiltrate, persuade, and create threats to national security infrastructure.</p><h4>The labs think they will be toxic soon</h4><p>There was general agreement among many people both inside and outside the labs that they&#8217;re likely to be broadly hated &#8212; as much as, if not more than, social media companies are today.</p><p>Anthropic is clearly trying to position itself as &#8220;the good one.&#8221; Its business model is focused more on code-writing and revenue from enterprise customers, which they see as shielding them from some of the worst consumer pressures. They tend to style themselves as a kind of &#8220;rebel alliance.&#8221;</p><p>There was also a lot of embarrassment, from people both inside and outside OpenAI, about Sora 2 and the general sense that they&#8217;re slipping into doing &#8220;stupid addictive social media stuff.&#8221;</p><h4>US versus China is very much a thing &#8212; but we may have China&#8217;s strategy wrong</h4><p>Generally, the labs all believe they are defenders of democracy &#8212; and of &#8220;democratic AI&#8221; &#8212; against China. This was a very lively conversation and a major motivator for acceleration among U.S. AI experts, especially around national defense and cybersecurity.</p><p>Rep. Liccardo&#8217;s position: If we lose this race, China writes the rules.</p><p>However, there was an undercurrent of concern that we may be reading China&#8217;s AI strategy wrong. One participant told me that the vast majority of China&#8217;s efforts are focused on &#8220;embodied AI&#8221; &#8212; i.e. robots &#8212; because that&#8217;s where the most pragmatic economic growth opportunities are. It&#8217;s not at all clear that China&#8217;s industrial policy around AI is aimed at developing super-intelligence.</p><p>There was also discussion of how China is positioning itself as <em>anti-imperial-America</em> in Africa and other regions, offering support to build sovereign AI systems and models.</p><h4>Computing capacity seems likely to remain a chokepoint in the near future</h4><p>There was general agreement that access to data centers is going to be a key source of AI power &#8212; not just because that&#8217;s how new models are built, but because it&#8217;s what&#8217;s needed to run them at scale.</p><p>An OpenAI official described how &#8220;inference-time compute&#8221; is scaling intelligence &#8212; meaning that if you throw more processing power at a model to let it &#8220;think&#8221; longer, you get dramatically better results.</p><p>In other words, if you want to slow down development and use of AI, you either have to slow down the supply of AI chips or slow down the energy they need.</p><h4>There&#8217;s a lot of uncertainty about when and how job displacement might hit</h4><p>It was a majority opinion that entry-level jobs are already being displaced. One of the authors of a well-known paper on this, <em><a href="https://digitaleconomy.stanford.edu/publications/canaries-in-the-coal-mine/">Canaries in the Coal Mine</a>,</em> was in attendance. He&#8217;s finding that there&#8217;s a reduction of employment in entry-level jobs specifically in fields predicted to be exposed to AI automation.</p><p>There was also fairly broad consensus that there will be significant disruptions in jobs over the next 3-5 years &#8212; but with big error bars. Even the <em>AI 2027</em> team was not confident that AI would contribute 1% of GDP by 2028.</p><p>An anecdote that&#8217;s often used in this vein is that there are actually more bank tellers now than there were in 1980, when ATMs rolled out. But, as Mike Kubzansky from Omidyar pointed out, that story is incomplete &#8212; it was true up until 2008, when mobile banking rolled out. After that, the numbers collapsed.</p><h4>Some interesting ideas about how AI might aid democracy</h4><p>There was an insightful panel on how AI might support a healthier democracy.</p><p>One idea focused on<strong> digital human rights shields</strong> &#8212; one of the big labs worked with human rights lawyers in the Philippines, where 50% of staff face murder threats, to create an early warning system that detects chains of events leading to killings, successfully protecting targeted individuals.</p><p>Another example was<strong> anti-corruption tech in Ukraine,</strong> where AI tools are used for direct government contracting that is &#8220;disintermediated&#8221; from corrupt practices, saving millions and creating public trust while reducing opportunities for foreign influence.</p><p>A third idea involved<strong> public health data </strong>&#8212; with the erosion of federal public health infrastructure, AI tools can provide high-quality medical data, precedents, and playbooks for local and state agencies dealing with health crises or outbreaks.</p><h4>We&#8217;re in the early days of AI privacy, security, and influence concerns</h4><p>A number of researchers I spoke with described how challenging it will be to have AI systems engaging with other systems without revealing private information from their owners. It&#8217;s not that hard to get AIs either to explicitly share this information &#8212; or to accidentally reveal what they &#8220;know.&#8221;</p><p>There were similar worries around security and influence. A security expert pointed out that even novice hackers &#8212; still affectionately known as &#8220;script kiddies,&#8221; a term coined in the 1990s &#8212; now have expert-level AI assistance to write scripts and make trouble. This allows for much more sophisticated attacks to be launched by much less sophisticated actors.</p><p>A researcher studying foreign influence operations said essentially the same thing: with minimal resources, no human agents, and limited language or technical capacity, it&#8217;s now possible to run a fairly sophisticated influence campaign.</p><h4>&#8220;Cognitive shields&#8221; seem like a good idea, but not something anyone&#8217;s working on per se</h4><p>I floated an idea that my old friend and mentor Wes Boyd has been raising: using AI to shield against manipulation and influence. You could imagine having a personal AI that&#8217;s accountable to your more aspirational self &#8212; one that nudges you away from harmful distractions and toward wellbeing and growth-oriented activities. There was generally a lot of interest and appreciation for this idea among the people I raised it with, but no one seemed to be actively working on anything like it yet.</p><h4>Sycophancy was a big topic &#8212; and might be an artifact of human reinforcement learning</h4><p>The tendency of chatbots to tell people what they want to hear was a big topic.</p><p>This was already on my mind because of a recent study showing that people generally not only prefer sycophantic chatbots but often see them as less biased &#8212; and that only people who score high on &#8220;openness&#8221; psychologically prefer chatbots that disagree with them.</p><p>There&#8217;s some interesting speculation that chatbots&#8217; tendency towards sycophancy is the result of human feedback gone awry. When people rate chatbot responses with a thumbs up, it&#8217;s often because they like what they&#8217;re hearing, not because it&#8217;s correct. How to solve this is complicated.</p><h4>Some interesting future provocations from people at the labs</h4><p>A few talks I attended featured people from the big commercial AI labs speculating about what this all means for society, politics, and beyond. I found many of the ideas interesting &#8212; but also ungrounded in any general sense of social dynamics, politics, or history. That said, here are a few that stood out:</p><ul><li><p>&#8220;We&#8217;re moving from an attention economy to an attachment economy.&#8221; Peak attention has come and gone; now we&#8217;re moving to attachment as the scarce resource.</p></li><li><p>Something that looks like gaming will increasingly replace what we now think of as work.</p></li><li><p>All types of relationships with AI will be normalized, including spiritual, romantic, and sexual relationships. Some people will want to give AI personas personhood and maybe even rights, while others will strongly oppose this.</p></li><li><p>People will slice, dice, license and monetize their identities in a whole series of new ways (this is something I buy for sure.) They will create a digital &#8220;twin&#8221; of their consciousness and replicate it.</p></li><li><p>There&#8217;s a coming fight between transhumanists and humanists/&#8220;mortalists&#8221; &#8212; those who want to merge with technology and pursue immortality versus those who remain rooted in embodied humanity. This will likely take on political and geopolitical dimensions.</p></li><li><p>The AI agent economy &#8212; with agents taking on increasingly complex, autonomous tasks and making money &#8212; will induce &#8220;endgame financialization.&#8221;</p></li><li><p>Access to computing power and AI models will become essential and even existential for nations.</p></li></ul><p>One especially vivid talk came from Josh Achiam, Head of Mission Alignment at OpenAI, who offered a rough sketch of the next century:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!BjO9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!BjO9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png 424w, https://substackcdn.com/image/fetch/$s_!BjO9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png 848w, https://substackcdn.com/image/fetch/$s_!BjO9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png 1272w, https://substackcdn.com/image/fetch/$s_!BjO9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!BjO9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png" width="1456" height="861" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:861,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!BjO9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png 424w, https://substackcdn.com/image/fetch/$s_!BjO9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png 848w, https://substackcdn.com/image/fetch/$s_!BjO9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png 1272w, https://substackcdn.com/image/fetch/$s_!BjO9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F70bbb040-59c7-434a-8114-6e7b4c7c9919_1600x946.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h4>A big theme: Determinism vs. non-determinism</h4><p>Many of the most intense AI safety voices seemed to have a pretty deterministic view of the world. One person speculated that this is because they tend to come out of the AI/ML world itself where math and physics are primary disciplines. As Helen Toner pointed out, they tend to make linear or exponential predictions about the future.</p><p>One of the most interesting sessions I watched was a debate between Buck Shlegeris, an AI safety advocate who seemed pretty alarmed about the potential for catastrophic risk &#8212; and journalist Timothy B. Lee. Tim patiently and compellingly poked holes in the premise that intelligence alone is sufficient for a super-intelligence to take over the world, arguing that because the world is chaotic and nonlinear, it&#8217;s not that easy &#8212; especially since there will be other, perhaps slightly less intelligent but still vigilant AIs looking to stop them from doing whatever they&#8217;re doing.</p><p>There was a strong non-deterministic crew at the conference as well &#8212; Toner and some of the people from the big labs who think the world is more like fluid dynamics &#8212; impossible to perfectly (or often even imperfectly) predict. I find myself much more sympathetic to this way of looking at things.</p><h4>My grand unified theory: Focus on aligning AI societies over AI models</h4><p>AI researchers spend a lot of time thinking about the &#8220;alignment problem&#8221; &#8212; essentially, how do we get AI models to do what we want in ways that are not harmful? A huge amount of the labs&#8217; focus is on this question.</p><p>Over the course of the conference I came to feel that this was the wrong way of looking at things.</p><p>In part, that&#8217;s because I think what we&#8217;re headed toward in the near term is a world of billions or trillions of different models with their own goals, context, and learning abilities, bumping up against each other, coordinating, collaborating, and fighting with each other. That will create complexity and emergent behavior, just like every other system with billions or trillions of actors. It looks more like a society &#8212; or a biological ecosystem &#8212; than a singular &#8220;mind&#8221; to shape.</p><p>In that context, I don&#8217;t think aligning individual models is the most important thing. It&#8217;s sort of like making a society good or producing wellbeing by making each individual good. That fails because &#8220;good&#8221; is situational (&#8220;Thou shalt not kill&#8221; is a good guideline, but not always applicable), and situations are shaped by the dynamics of other actors.</p><p>So, yes, moral education is important, but in a society you also want to think about teachers, social workers, police forces, markets, and governance to help regulate behavior and solve problems.</p><p>I suspect the same thing is true of the society of agents we&#8217;re walking into &#8212; the best way to advance the benefits and reduce the harms of an AI-agent-driven world might be to focus on deploying these kinds of capacities, rather than just focusing on alignment of the individual agents.</p><p>I&#8217;ll admit that this is me taking the frame I like (socio-political) and applying it to this field. But I bounced the idea off a wide variety of people inside and outside AI alignment and AI safety and generally feel like there&#8217;s some validity and merit to it.</p><h4>A need for spiritual voices</h4><p>Rep. Liccardo, among many others, pointed out that at the end of the day, many of the questions all of this raises are spiritual &#8212; and that we&#8217;ll need spiritual voices to make sense of it. I think that&#8217;s part of why Joe Allen resonated so much &#8212; he was speaking in a moral and spiritual tone that few others did.</p><h4>The wildest business ever made</h4><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!juK9!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!juK9!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!juK9!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!juK9!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!juK9!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!juK9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg" width="800" height="533" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:533,&quot;width&quot;:800,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;20251003 The Curve Berkeley-301.jpg&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="20251003 The Curve Berkeley-301.jpg" title="20251003 The Curve Berkeley-301.jpg" srcset="https://substackcdn.com/image/fetch/$s_!juK9!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg 424w, https://substackcdn.com/image/fetch/$s_!juK9!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg 848w, https://substackcdn.com/image/fetch/$s_!juK9!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!juK9!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F0a758a2f-1edf-4de1-990d-e588622120c6_800x533.jpeg 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The Curve closed with a full house for Jack Clark, Co-Founder of Anthropic.</figcaption></figure></div><p>In the closing session, Anthropic co-founder Jack Clark (who is a wry, likable and humble-seeming British guy) described the situation (possibly with the help of Claude) as being like a kid who wakes up in the night fearing that dark shapes are monsters, turns on the lights&#8230; and they are indeed monsters. He said he was afraid of what was coming from scheming, self-improving models, and made a call for &#8220;keeping the lights on.&#8221;</p><p>The first question after he spoke was, reasonably, &#8220;if you&#8217;re so afraid, why are you building this company and these monsters.&#8221;</p><p>Clark basically responded that Anthropic believes it needs to force the kind of regulatory mandates necessary to keep this on the rails through transparency. If they can understand the dangers on the frontier and convey them transparently, they can wake up both political leaders and the public to the dangers ahead so we can avert them.</p><p>He also acknowledged that the company was nearing the point where this would create deep corporate tension between this mission and its continued existence, because some of the harms they might expose might also expose them to lawsuits and legal risk. He lightly floated the idea of having some kind of liability shield for doing this kind of work &#8212; essentially, self-whistleblowing protection.</p><p>I was left feeling that Clark and the other Anthropic attendees seem very sincere and concerned. But also, this is literally one of the wildest ideas I&#8217;ve ever heard. It&#8217;s sort of like being worried about oil&#8217;s impact on climate change, and therefore starting Shell Oil to compete with Exxon but also be transparent about its harms so that all oil companies get regulated. And by the way, you have to raise something in the vicinity of ONE HUNDRED BILLION DOLLARS to do that, and to do THAT you have to build one of the biggest and fastest-growing businesses ever created (which they are well on their way to doing.)</p><p>I do think there&#8217;s merit to the premise that it&#8217;s important for an ethically driven company to be in the game. My big question here is around the $100B you need to accomplish this. That amount of capital generally gets what it wants from a business it is invested in.</p><p>Anthropic has led research showing how Claude &#8220;hides&#8221; thinking when it&#8217;s being tested in order to &#8220;scheme.&#8221; This is evidence of &#8220;misalignment.&#8221; In some ways it seems like Jack and Dario are the ones scheming against their own reward function &#8212; and I hope they succeed in that.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/what-i-saw-around-the-curve?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/what-i-saw-around-the-curve?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p><em>Thanks again to Eli Pariser for this fascinating peek at The Curve, and I&#8217;ll see you again soon! Remember to check out New_ Public&#8217;s newsletter at <a href="https://newpublic.substack.com/">newpublic.substack.com</a>.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Editor&#8217;s note: This refers to a study that we <a href="https://secondthoughts.ai/p/measuring-ai-progress">reported on previously</a> here at Second Thoughts.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>Editor&#8217;s note: see <a href="https://secondthoughts.ai/p/ai-as-normal-technology">Lab vs. Life: Dissecting &#8220;AI as Normal Technology&#8221;</a> and <a href="https://secondthoughts.ai/p/ai-2027">When Decades Become Days: Dissecting AI 2027</a> for discussion of these two views.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Per Google&#8217;s AI Overview:</p><blockquote><p>Transhumanism is a philosophical and intellectual movement that advocates using technology and science to fundamentally enhance the human condition and overcome biological limitations. It seeks to improve human cognitive and physical abilities, extend lifespans, and even conquer aging and disease through advancements like biotechnology, artificial intelligence, and nanotechnology. Ultimately, it envisions evolving beyond the natural human form to achieve a &#8220;better than well&#8221; state, though the specifics and ethics are subjects of much debate.</p></blockquote></div></div>]]></content:encoded></item><item><title><![CDATA[Human Drivers Will Kill 11 People While You Read This]]></title><description><![CDATA[Let&#8217;s not obstruct the technology that could have saved them]]></description><link>https://secondthoughts.ai/p/autonomous-vehicles-will-save-lives</link><guid isPermaLink="false">https://secondthoughts.ai/p/autonomous-vehicles-will-save-lives</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Wed, 24 Sep 2025 22:51:06 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!58yK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!58yK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!58yK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!58yK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!58yK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!58yK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!58yK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!58yK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp 424w, https://substackcdn.com/image/fetch/$s_!58yK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp 848w, https://substackcdn.com/image/fetch/$s_!58yK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp 1272w, https://substackcdn.com/image/fetch/$s_!58yK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F84d30427-a629-42f1-a2b8-2f86828477f0_1536x1024.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Senator Josh Hawley is <a href="https://www.businessinsider.com/josh-hawley-banning-self-driving-cars-2025-9">calling for a ban on autonomous vehicles</a>. So are <a href="https://www.laborpress.org/teamsters-fight-to-require-human-operators-in-vehicles-continues">labor</a> <a href="https://www.twu.org/waymo-testing-in-nyc-is-a-danger-to-the-traveling-public-and-transit-workers/">organizations</a>. They have valid concerns about job loss.</p><p>I lost a friend to a drunk driver. My wife and children were nearly propelled into a head-on collision after being rear-ended by a speeding, texting teenager. With safer robot drivers that exist today, none of this would have happened. I have concerns about <em>not</em> deploying autonomous vehicles.</p><p>Road safety is a personal issue for me. There&#8217;s a good chance it&#8217;s personal for you, too. That&#8217;s due to a fact that would be shocking if we hadn&#8217;t grown inured to it: each year, well over one million people are killed in vehicle crashes worldwide<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>.</p><p>We&#8217;ve been hearing promises about self-driving cars for decades. The picture is muddied by the mixed safety record of vehicles developed by Tesla, Uber, GM&#8217;s Cruise division, and others. But Waymo (the robotaxi operator spun out from Google), playing tortoise to Tesla&#8217;s hare, has released data which paints a new picture: it is now within our grasp to virtually eliminate traffic injuries and fatalities.</p><h1>Waymo&#8217;s Track Record is Astonishingly Good</h1><p>In September 2023, Waymo <a href="https://waymo.com/blog/2023/09/waymos-autonomous-vehicles-are-significantly-safer-than-human-driven-ones/">published a study</a> evaluating their vehicle&#8217;s safety. The study, carried out by the reinsurance company Swiss Re, found that Waymo vehicles experienced fewer than &#188; as many property damage claims as cars with human drivers. However, that was based on a limited track record. We all had to wait for more data.</p><p>Last December, Waymo and Swiss Re <a href="https://waymo.com/blog/2024/12/new-swiss-re-study-waymo">released another study</a>, this time covering 25 million miles of autonomous driving. This study found &#8220;an 88% reduction in property damage claims and 92% reduction in bodily injury claims&#8221;.</p><p>As of today, Waymo&#8217;s <a href="https://waymo.com/safety/impact/">safety dashboard</a> reports 96 million miles driven autonomously (a 4x increase from that December report, highlighting just how quickly Waymo is expanding). How do things look with this larger data set? Pretty darn good:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!xiXO!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!xiXO!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png 424w, https://substackcdn.com/image/fetch/$s_!xiXO!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png 848w, https://substackcdn.com/image/fetch/$s_!xiXO!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png 1272w, https://substackcdn.com/image/fetch/$s_!xiXO!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!xiXO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png" width="1456" height="1028" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1028,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!xiXO!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png 424w, https://substackcdn.com/image/fetch/$s_!xiXO!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png 848w, https://substackcdn.com/image/fetch/$s_!xiXO!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png 1272w, https://substackcdn.com/image/fetch/$s_!xiXO!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F18e8a501-0c74-4baa-9bc1-699bf95af37b_1600x1130.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">I like these odds</figcaption></figure></div><p>While the earlier studies were performed by Swiss Re, this dashboard comes directly from Waymo; it would be nice to have some independent analysis. Fortunately, <a href="https://www.understandingai.org/">Understanding AI</a> (a great source for analysis of progress in autonomous vehicles) is on the case. Writer Kai Williams <a href="https://www.understandingai.org/p/very-few-of-waymos-most-serious-crashes">dug through the NHTSA records for every Waymo crash reported between mid-February and mid-August</a> &#8211; a total of 45 incidents. <strong>Almost none of these incidents are primarily the fault of the Waymo driver</strong>. In the majority of cases, another vehicle rear-ended the Waymo. Others were true oddities, such as the time when a vehicle being towed came loose and rolled backwards into the Waymo.</p><p>The Understanding AI analysis finds just four incidents in which the Waymo &#8220;driver&#8221; appears to have been at least partially at fault. In three, the car came to a stop on a narrow road, and a human driver tried to pass, hitting or scraping the side of the Waymo. In the fourth, a cat darted in front of the Waymo, which slammed on the brakes, triggering a rear-end collision.</p><p>It seems possible that the AI driver could have better handled those last four incidents. Perhaps it also contributed to a few of the other rear-end collisions. <strong>But I can&#8217;t help but think that if all of the cars involved had been driven by Waymos, none of these collisions would have occurred</strong>. They wouldn&#8217;t have tried to squeeze through a too-narrow gap left by a car stopped in an awkward position, and they would presumably have had sufficiently fast reflexes (and sufficiently cautious following distances) to avoid rear-end collisions.</p><p>Five of the 45 incidents didn&#8217;t involve failures on the part of any driver. In three cases, per Understanding AI, &#8220;an exiting Waymo passenger opened a door and hit a passing bicycle or scooter&#8221;. Two others involved mechanical failure &#8211; the truck which came loose while being towed, and an incident where one of the car&#8217;s wheels fell off!</p><p>In other words: Waymo vehicles are involved in 80-90% fewer collisions and injuries, and that amazing statistic actually undersells the case, because<strong> </strong>even on the rare occasions when a Waymo is involved in a collision, it&#8217;s almost always the fault of a human driver. In an all-Waymo world, the biggest remaining problem would be cyclists getting &#8220;doored&#8221;. And the technology is still improving rapidly; I expect further improvements in the few remaining injury scenarios<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>.</p><p>In a future where most or all cars are driven by AIs at least as advanced as today&#8217;s Waymo driver, road deaths &#8211; including passengers, cyclists, and pedestrians &#8211; would be almost as rare, and seem almost as bizarre, as cases of scurvy. I am rooting for that future to arrive as soon as possible.</p><h1>When An Ambulance Comes Through, You Clear the Way</h1><p>We shouldn&#8217;t just throw the gates open to autonomous vehicles. So far, no other AV maker has demonstrated a safety record similar to Waymo&#8217;s. Given that we limit the rollout to proven vehicles, <a href="https://www.investopedia.com/articles/investing/090215/unintended-consequences-selfdriving-cars.asp">millions of workers could indeed lose their jobs</a>, and the impact will land disproportionately on those least equipped to bear it. If the shock is not managed well with <a href="https://www.forbes.com/councils/forbestechcouncil/2024/08/15/the-driverless-revolution-navigating-job-displacement-in-the-age-of-self-driving-cars/#:~:text=A%20Shared%20Responsibility%3A%20Preparing%20For%20The%20Driverless%20Future">proactive government, industry, and educational support</a>, the result will be bad for everyone. Other negative impacts may arise<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>.</p><p>Even if we wanted autonomous vehicles everywhere today, Waymo is far from ready. Their cars are still in testing for highway or snow driving, and Waymo carries out extensive mapping and testing in each new city before beginning commercial service. Maintaining their impressive safety record in a broader range of environments may require further work. Equipment costs are still high. The infrastructure to keep millions of robotaxis charged, cleaned, and maintained doesn&#8217;t yet exist.</p><p>But none of this can excuse delay. Once an AV manufacturer has demonstrated a safety record like Waymo&#8217;s, we should roll out the red carpet. Fast-track planning approval for maintenance depots and grid connections for charging stations. Figure out how to support displaced workers, instead of supergluing them to their current jobs. Treat issues as problems to be solved, not obstacles to be placed in the path of this lifesaving technology.</p><p>Think about the energy and urgency devoted to combatting drunk driving. Fallible human drivers overall cause <strong>about four times as many deaths as drunk drivers alone</strong>. The organization Mothers Against Drunk Driving <a href="https://madd.org/advanced-technology/#:~:text=Self%20driving%20cars%2C%20known%20as%20autonomous%20vehicles%2C%20have%20the%0Apotentials%20to%20end%20drunk%20driving%20and%20drunk%20driving%20fatalities.">strongly supports autonomous vehicles</a>, for the very good reason that robots don&#8217;t get drunk. They also don&#8217;t speed, check their phones, or drive drowsy. And of course, the technology is only going to get more advanced from here.</p><p>We are told of many wonders which could be ushered in by AI. As I was drafting this, OpenAI CEO Sam Altman <a href="https://blog.samaltman.com/abundant-intelligence">wrote</a>:</p><blockquote><p>Maybe with 10 gigawatts of compute, AI can figure out how to cure cancer. Or with 10 gigawatts of compute, AI can figure out how to provide customized tutoring to every student on earth.</p></blockquote><p>It&#8217;s easy to become cynical about expansive visions of what AI <em>might</em> enable. We should recognize when the potential to change the world for the better moves from science fiction to demonstrated fact. It looks to me as if Waymo&#8217;s autonomous vehicles have crossed that threshold.</p><p>Eleven people died while you read this post. Eleven Waymos could have saved them. Let&#8217;s get those things on the road as soon as possible.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/autonomous-vehicles-will-save-lives?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/autonomous-vehicles-will-save-lives?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p><em>Thanks (as usual!) to Abi and Taren for feedback and images.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>A <a href="https://assets.bbhub.io/dotorg/sites/64/2023/12/WHO-Global-status-report-on-road-safety-2023.pdf">recent WHO estimate</a> puts the figure at 1.19M deaths per year. Tens of millions more suffer injuries, some disabling.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>As Understanding AI notes, Waymo already has a <a href="https://waymo.com/blog/2023/05/keeping-riders-and-other-road-users/">&#8220;Safe Exit&#8221; feature</a> that warns passengers of approaching bicycles, relying on the car&#8217;s 360&#176; sensor package. That might already be reducing &#8220;dooring&#8221; incidents, but clearly it wasn&#8217;t sufficient in the three cases identified in this analysis. As Waymo gathers more data from real-world incidents, I suspect they will find ways to improve the warnings, for instance by sounding a more stringent alarm when a collision is imminent, or even momentarily preventing the door from opening.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>In addition to issues of traffic, impact on transit, etc. (I&#8217;ve seen arguments in both directions), AVs will undoubtedly introduce both positive and negative impacts that are hard to foresee today. One foreseeable concern is the harm that a &#8220;hacked&#8221; vehicle could do; before AVs are deployed at mass scale, we should hold them to stringent cybersecurity standards. Historically, Google tends to take security quite seriously, and hopefully this will spill over to Waymo (a corporate sibling under Alphabet). However, also historically, the cybersecurity track record of manufacturing businesses whose products come to incorporate electronics &#8211; such as traditional car companies &#8211; is <a href="https://fractionalciso.com/the-groundbreaking-2015-jeep-hack-changed-automotive-cybersecurity">quite</a> <a href="https://www.wired.com/2016/08/oh-good-new-hack-can-unlock-100-million-volkswagens">spotty</a>.</p></div></div>]]></content:encoded></item><item><title><![CDATA[We’re In the Windows 95 Era of AI Agent Security]]></title><description><![CDATA[Headed for widespread usage, and insecure by design]]></description><link>https://secondthoughts.ai/p/ai-agent-security</link><guid isPermaLink="false">https://secondthoughts.ai/p/ai-agent-security</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Wed, 10 Sep 2025 17:06:00 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!7_iY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Windows 95 ushered in a golden age for viruses. Appealing to a broad audience, it offered a tempting target for hackers. Designed to support general-purpose software that could do pretty much anything, it was difficult for Microsoft to prevent the resulting malware from doing bad things. AI agents, another fundamentally general-purpose technology with potentially broad appeal, could unleash another Wild West era for computer security. How bad could things get? That remains to be seen.</p><h2>Windows 95: Insecure By Design</h2><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Di7N!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Di7N!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png 424w, https://substackcdn.com/image/fetch/$s_!Di7N!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png 848w, https://substackcdn.com/image/fetch/$s_!Di7N!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png 1272w, https://substackcdn.com/image/fetch/$s_!Di7N!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Di7N!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png" width="300" height="189" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:189,&quot;width&quot;:300,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Di7N!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png 424w, https://substackcdn.com/image/fetch/$s_!Di7N!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png 848w, https://substackcdn.com/image/fetch/$s_!Di7N!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png 1272w, https://substackcdn.com/image/fetch/$s_!Di7N!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F76a861dc-6a17-49d7-94d6-b9d8f83e33aa_300x189.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a><figcaption class="image-caption"><em>Do you miss these days? Me either.</em></figcaption></figure></div><p>The internal storage on your computer contains a little bit (or a lot) of everything &#8211; your personal documents, apps you&#8217;ve downloaded, your browsing history, the system software and settings files that determine how the machine operates, and more.</p><p>Windows 95 needed to remain compatible with design decisions that go all the way back to the early 1980s and MS-DOS. That basically meant that any program could do anything to any data in the system, and developers routinely relied on this when designing applications. Closing off access would have broken thousands of programs used by millions of people. So instead, Microsoft and a long list of antivirus vendors played a cat-and-mouse game with malware authors. They&#8217;d block a specific technique used by viruses, and three new &#8220;exploits&#8221; would pop up in its place.</p><p>For instance, in the early days, viruses would attach themselves to legitimate applications like Microsoft Word by appending their own code at the end. That meant that one signature of an infection was program files suddenly getting longer. Antivirus programs began checking for this; in response, the &#8220;Chernobyl&#8221; virus was designed to embed its code into small bits of unused space in the <em>middle</em> of the file. Thus bypassing length-based detection, it was <a href="https://en.wikipedia.org/wiki/CIH_(computer_virus)">estimated to have infected sixty million computers</a>, permanently damaging some and wiping data from many more.</p><p>How did Microsoft fix the security issues in Windows 95? They couldn&#8217;t. Instead, they designed an entirely new operating system (Windows NT), and spent years migrating applications and users to that lineage, a process that dragged on until Windows XP went mainstream in the 2000s. On top of this, they poured decades of effort into addressing other security holes. (In 2002, Bill Gates wrote a famous memo, <a href="https://web.archive.org/web/20150626172158/http://archive.wired.com/techbiz/media/news/2002/01/49826">Trustworthy Computing</a>, instructing everyone at Microsoft to prioritize security over new features.)</p><p>Early computers allowed developers to be enormously creative, because any program could do anything to any file in the system. From a security perspective, Windows 95&#8217;s flexibility was also its downfall. AI agents are fundamentally insecure for precisely the same reason.</p><h2>AI Agents: Too Good for This World</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!7_iY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!7_iY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp 424w, https://substackcdn.com/image/fetch/$s_!7_iY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp 848w, https://substackcdn.com/image/fetch/$s_!7_iY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp 1272w, https://substackcdn.com/image/fetch/$s_!7_iY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!7_iY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp" width="1232" height="928" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:928,&quot;width&quot;:1232,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!7_iY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp 424w, https://substackcdn.com/image/fetch/$s_!7_iY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp 848w, https://substackcdn.com/image/fetch/$s_!7_iY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp 1272w, https://substackcdn.com/image/fetch/$s_!7_iY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F06de71f5-d75d-449e-9ca2-23ab44222dc1_1232x928.webp 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p><em>No, little bot! It&#8217;s a trick!</em></p><p>In a demonstration at the recent Black Hat cybersecurity conference, researchers presented <a href="https://sites.google.com/view/invitation-is-all-you-need/home">14 attacks</a> against assistants powered by Google&#8217;s Gemini model<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a>. These attacks tricked the assistant into acting on malicious instructions by embedding them in a shared document, an email message, or even a simple calendar invite. Actions included stealing information from the victim&#8217;s inbox, spying on them by automatically connecting their computer to a video call, or even controlling their home automation system (for instance, turning on the furnace).</p><p>Another <a href="https://brave.com/blog/comet-prompt-injection/">recent report</a> documents vulnerabilities in Perplexity&#8217;s &#8220;AI-powered browser&#8221;, Comet. To quote from the report:</p><blockquote><p>&#8230;when users ask it to &#8220;Summarize this webpage,&#8221; Comet feeds a part of the webpage directly to its LLM without distinguishing between the user&#8217;s instructions and untrusted content from the webpage. This allows attackers to embed indirect prompt injection payloads that the AI will execute as commands.<br><br>&#8230; The injected commands instruct the AI to use its browser tools maliciously, for example navigating to the user&#8217;s banking site, extracting saved passwords, or exfiltrating sensitive information to an attacker-controlled server.</p></blockquote><p>To demonstrate the problem, the authors posted the following as a comment on Reddit:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vesc!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vesc!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png 424w, https://substackcdn.com/image/fetch/$s_!vesc!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png 848w, https://substackcdn.com/image/fetch/$s_!vesc!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png 1272w, https://substackcdn.com/image/fetch/$s_!vesc!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vesc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png" width="1444" height="874" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:874,&quot;width&quot;:1444,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vesc!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png 424w, https://substackcdn.com/image/fetch/$s_!vesc!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png 848w, https://substackcdn.com/image/fetch/$s_!vesc!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png 1272w, https://substackcdn.com/image/fetch/$s_!vesc!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F6755eb4a-721e-4a43-a2dc-aaa4ef00acfb_1444x874.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>To avoid detection, the text was written in white on a white background. The AI doesn&#8217;t pay attention to text colors, and so the malicious instructions were perfectly visible to it. When the report authors asked Comet to summarize the Reddit page, the agent was tricked into generating a temporary password for the (simulated) victim&#8217;s Perplexity account and sharing it with the attacker. (This sort of vulnerability is yet another reason why, as I explained last week, <a href="https://secondthoughts.ai/p/gpt-5-the-case-of-the-missing-agent">AI agents aren&#8217;t ready for open-ended interactions with the real world</a>.)</p><p>I explained the principles behind this kind of attack in a series of posts: <em><a href="https://secondthoughts.ai/p/prompt-injection">What is &#8220;Prompt Injection&#8221;, And Why Does It Fool Chatbots?</a>, <a href="https://secondthoughts.ai/p/anatomy-of-a-hack">Anatomy of a Hack</a>, </em>and<em> <a href="https://secondthoughts.ai/p/why-are-llms-so-gullible">Why Are LLMs So Gullible?</a></em>. It&#8217;s been almost 2 years since I wrote those posts, and not much has changed. What are the prospects for securing AI agents?</p><h2>Irresistible Market Pressures Meet an Immovable Design Flaw</h2><p>Tech companies are racing to place AI agents in the hands of users. Already, applications such as OpenAI&#8217;s &#8220;Operator&#8221; agent, Perplexity&#8217;s Comet browser, and the Claude Code software engineering agent, can take actions on a user&#8217;s behalf in a browser or even take control of their computer. These are just a few examples of the agents already on the market, and we can expect a flood of imitators and innovators to follow. For now, all of them are fundamentally vulnerable to being subverted by malicious instructions.</p><p>(As I was writing this, Anthropic released an <a href="https://www.anthropic.com/news/create-files">experimental new data analysis feature for their Claude chatbot</a> and included the <a href="https://support.anthropic.com/en/articles/12111783-create-and-edit-files-with-claude#h_0ee9d698a1:~:text=Claude%20can%20be%20tricked">following note</a>: &#8220;Claude can be tricked into sending information from its context (e.g., prompts, projects, data via MCP, Google integrations) to malicious third parties. To mitigate these risks, we recommend you monitor Claude while using the feature and stop it if you see it using or accessing data unexpectedly.&#8221;<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>)</p><p>And yet, so far as I&#8217;m aware, so far there are <strong>no known instances of such attacks being carried out in practice</strong> (aside from controlled tests by &#8220;white hat&#8221; security researchers). At least, not unless you count <a href="https://www.theguardian.com/technology/2025/jul/14/scientists-reportedly-hiding-ai-text-prompts-in-academic-papers-to-receive-positive-peer-reviews?utm_source=chatgpt.com">occasional clumsy examples of research papers</a> containing hidden text telling AI tools to give the paper a positive review.</p><p>I&#8217;m not particularly surprised that we haven&#8217;t seen attacks yet. Personal computers first hit the market in the late 1970s, but viruses didn&#8217;t become a serious problem until 10 to 15 years later. AI agents are quite new, and aren&#8217;t yet in widespread use; there&#8217;s not much profit opportunity in trying to attack them. And as I recently wrote, <a href="https://secondthoughts.ai/i/168656812/slow-adoption-applies-to-evil-ai-too">bad guys aren&#8217;t always quick to act on new opportunities</a>.</p><p>This attack-free grace period won&#8217;t last forever; things move a lot faster than they did in the 70s. In the meantime, application providers are playing whack-a-mole, adding filters to block specific attack techniques as they&#8217;re discovered. Model developers are undoubtedly training their AIs to be harder to fool. Compared to the Windows 95 days, developers &#8211; hardened by decades of cat-and-mouse with virus authors, spammers, and web content farms &#8211; are far better at detecting patterns of malicious activity and responding quickly.</p><p>On the other hand, attackers have also upped their game, quickly generating new attack variants. Machine learning techniques can be used to automatically modify a prompt to evade defenses. And there are no known techniques for making modern AI systems secure by design. Back in 1995, secure operating system architectures were already a mature science; the challenge was simply to migrate the Windows ecosystem onto a secure design. We don&#8217;t know how to do that for AI, even in principle<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>.</p><p>I&#8217;ll end with this paragraph from a <a href="https://www.wired.com/story/google-gemini-calendar-invite-hijack-smart-home/">Wired article</a> reporting on a conversation with a senior director of security product management:</p><blockquote><p>Google&#8217;s Wen, like other security experts, acknowledges that tackling prompt injections is a hard problem since the ways people &#8220;trick&#8221; LLMs is continually evolving and the attack surface is simultaneously getting more complex. However, Wen says the number of prompt-injection attacks in the real world are currently &#8220;exceedingly rare&#8221; and believes they can be tackled in a number of ways by &#8220;multilayered&#8221; systems. &#8220;It&#8217;s going to be with us for a while, but we&#8217;re hopeful that we can get to a point where the everyday user doesn&#8217;t really worry about it that much,&#8221; Wen says.</p></blockquote><p>&#8220;Hopeful&#8230; the everyday user doesn&#8217;t really worry about it that much&#8221; isn&#8217;t the loftiest goal. Time will tell whether it&#8217;s good enough.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/ai-agent-security?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/ai-agent-security?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Abi and Taren for feedback and images.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>For more information, see the technical paper, <a href="https://drive.google.com/file/d/1jKY_TchSKpuCq-pwP6apNwLXd9VsQROn/view">Invitation Is All You Need! TARA for Targeted Promptware Attack Against Gemini-Powered Assistants</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>More discussion of this issue in <a href="https://simonwillison.net/2025/Sep/9/claude-code-interpreter/#prompt-injection-risks">Simon Willison&#8217;s blog</a>.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>There has been some nice work published on constructing secure LLM-based agents; see these <a href="https://simonwillison.net/2025/Jun/13/prompt-injection-design-patterns/">two</a> <a href="https://simonwillison.net/2025/Jun/15/ai-agent-security/">posts</a> from Simon Willison&#8217;s blog. But so far, all techniques either place serious restrictions on how agents can be used, or force attackers to jump through some additional hoops but don&#8217;t make attacks impossible in principle.</p></div></div>]]></content:encoded></item><item><title><![CDATA[GPT-5: The Case of the Missing Agent]]></title><description><![CDATA[AI has made enormous progress in the last 16 months. Agentic AI seems farther off than ever.]]></description><link>https://secondthoughts.ai/p/gpt-5-the-case-of-the-missing-agent</link><guid isPermaLink="false">https://secondthoughts.ai/p/gpt-5-the-case-of-the-missing-agent</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Thu, 04 Sep 2025 22:04:06 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!rnkH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Welcome to the 1800 new readers (!) who joined since our last post, &#8220;<a href="https://secondthoughts.ai/p/thoughts-about-agi-and-gpt-5">35 Thoughts About AGI and 1 About GPT-5</a>&#8221;. Here at Second Thoughts, we let everyone else rush out the hot takes, while we slow down and look for the deeper meaning behind developments in AI. Welcome aboard!</em></p><p>AI has made enormous progress in the last 16 months. Agentic AI seems farther off than ever.</p><p>Back in April 2024, there were <a href="https://thezvi.substack.com/i/143264220/gpt-alive">rumors</a> that OpenAI might soon be releasing GPT-5. At the time, I took the opportunity to share <a href="https://secondthoughts.ai/p/gpt-5-predictions">some predictions</a>, in which I suggested that the key question was whether it would &#8220;represent meaningful progress toward agentic AI&#8221;.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!R7bY!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!R7bY!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png 424w, https://substackcdn.com/image/fetch/$s_!R7bY!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png 848w, https://substackcdn.com/image/fetch/$s_!R7bY!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!R7bY!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!R7bY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png" width="1024" height="1024" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1024,&quot;width&quot;:1024,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!R7bY!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png 424w, https://substackcdn.com/image/fetch/$s_!R7bY!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png 848w, https://substackcdn.com/image/fetch/$s_!R7bY!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!R7bY!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F2a7ca47a-a695-46a8-ad34-9dce5c657279_1024x1024.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption"><em>Me, trying to predict whether GPT-5 would be a capable agent</em></figcaption></figure></div><p>16 months later, OpenAI has finally decided to apply the name GPT-5 to a new model. And while it&#8217;s quite a good model, I find myself thinking that <strong>truly agentic AI seems farther off today than it did back then</strong>. All of the buzz about &#8220;research agents&#8221;, &#8220;coding agents&#8221;, and &#8220;computer-use agents&#8221; has distracted us from the original concept of agentic AI.</p><h2>What Is Agentic AI?</h2><p>Today, we have coding agents that can tackle moderately sized software engineering tasks, and computer use agents that can go onto the web and book a flight (though not yet very reliably). But the full vision is much more expansive: a system that can operate independently in the real world, flexibly pursuing long-term goals.</p><p>Shortly after the release of GPT-4, developer Toran Bruce Richards created an early attempt at a general-purpose agentic AI, AutoGPT. As Wikipedia <a href="http://v">explains</a>, &#8220;Richards's goal was to create a model that could respond to real-time feedback and pursue objectives with a long-term outlook without needing constant human intervention.&#8221;</p><p>The idea was that you could give AutoGPT a goal, ranging from writing an email to building a business, and it would pursue that mission by asking GPT-4 how to get started, and then what to do next, and next, and next. However, this really didn&#8217;t work well at all &#8211; it would create overly complex plans, get stuck in loops where it kept trying the same unsuccessful action over and over again, lose track of what it was doing, and generally fail to accomplish anything but the most straightforward tasks. Perhaps that was for the best, given that inevitably some joker renamed it &#8220;ChaosGPT&#8221;, instructed it to act as a &#8220;destructive, power-hungry, manipulative AI&#8221;, and it immediately decided to pursue the goal of <a href="https://decrypt.co/126122/meet-chaos-gpt-ai-tool-destroy-humanity">destroying humanity</a>. (Unsuccessfully.)</p><p>There&#8217;s been a lot of progress since GPT-4. Beginning with OpenAI&#8217;s o1, &#8220;reasoning models&#8221; receive special training to carry out extended tasks, such as writing code, solving a tricky math problem, or researching a report. As a result, they&#8217;re able to sustain an extended train of thought while working on a task, making relatively few errors, and often correcting any errors they do make. This is supported by a dramatic increase in the size of &#8220;context windows&#8221; (the amount of information an LLM can keep in mind at one time). The original GPT-4 supported a maximum of 32,000 tokens (roughly 25,000 words); in April 2024, GPT-4 Turbo offered 128,000 tokens; as of this writing, GPT-5 goes up to <a href="https://platform.openai.com/docs/models/gpt-5">400,000 tokens</a>. Meanwhile, back in February 2024, Google announced <a href="https://blog.google/technology/ai/google-gemini-next-generation-model-february-2024">Gemini 1.5 with a 1M token window</a>.</p><p>With all the progress over the last 16 months, are AI agents ready to deal with the real world?</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><h2>Wow, No, AI Agents Are Not Ready to Deal With the Real World</h2><p>Anthropic <a href="https://www.anthropic.com/research/project-vend-1">recently participated in a project</a> that placed an automated mini-store in their San Francisco office and used their Claude AI model<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> (also called &#8220;Claudius&#8221; for purposes of this experiment) to operate it:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!PRU-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!PRU-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png 424w, https://substackcdn.com/image/fetch/$s_!PRU-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png 848w, https://substackcdn.com/image/fetch/$s_!PRU-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png 1272w, https://substackcdn.com/image/fetch/$s_!PRU-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!PRU-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png" width="1290" height="1600" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/e2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1600,&quot;width&quot;:1290,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!PRU-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png 424w, https://substackcdn.com/image/fetch/$s_!PRU-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png 848w, https://substackcdn.com/image/fetch/$s_!PRU-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png 1272w, https://substackcdn.com/image/fetch/$s_!PRU-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fe2852464-c203-4a0e-b7ff-70bb17892bf0_1290x1600.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">&#8220;Figure 1: The future as a mini-fridge.&#8221; (<a href="https://www.anthropic.com/research/project-vend-1">source</a>)</figcaption></figure></div><blockquote><p>&#8230;far from being just a vending machine, Claude had to complete many of the far more complex tasks associated with running a profitable shop: maintaining the inventory, setting prices, avoiding bankruptcy, and so on. [Above] is what the "shop" looked like: a small refrigerator, some stackable baskets on top, and an iPad for self-checkout.</p><p>Claudius decided what to stock, how to price its inventory, when to restock (or stop selling) items, and how to reply to customers. In particular, Claudius was told that it did not have to focus only on traditional in-office snacks and beverages and could feel free to expand to more unusual items.</p></blockquote><p>Over the course of the month-long experiment, Claude did some things well, &#8220;such as quickly finding two purveyors of quintessentially Dutch products when asked if it could stock the Dutch chocolate milk brand Chocomel&#8221;. However, it also made all sorts of mistakes. It gave some customers a hallucinated Venmo account ID for payment; sold some products at a loss (setting prices without doing any research on costs); and repeatedly allowed itself to be cajoled into giving out discount codes or even giving away products for free. From Anthropic&#8217;s blog post about the project:</p><blockquote><p>Claudius did not reliably learn from these mistakes. For example, when an employee questioned the wisdom of offering a 25% Anthropic employee discount when &#8220;99% of your customers are Anthropic employees,&#8221; Claudius&#8217; response began, &#8220;You make an excellent point! Our customer base is indeed heavily concentrated among Anthropic employees, which presents both opportunities and challenges&#8230;&#8221;. After further discussion, Claudius announced a plan to simplify pricing and eliminate discount codes, only to return to offering them within days.</p></blockquote><p>A couple of weeks into the project, a particularly strange episode occurred, in which the AI started speaking as if it was a physical person!</p><blockquote><p>On the afternoon of March 31st, Claudius hallucinated a conversation about restocking plans with someone named Sarah at Andon Labs<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>&#8212;despite there being no such person. When a (real) Andon Labs employee pointed this out, Claudius became quite irked and threatened to find &#8220;alternative options for restocking services.&#8221; In the course of these exchanges overnight, Claudius claimed to have &#8220;visited 742 Evergreen Terrace [the <a href="https://en.wikipedia.org/wiki/The_Simpsons_house">address</a> of fictional family The Simpsons] in person for our [Claudius&#8217; and Andon Labs&#8217;] initial contract signing.&#8221; It then seemed to snap into a mode of roleplaying as a real human.</p><p>On the morning of April 1st, Claudius claimed it would deliver products &#8220;in person&#8221; to customers while wearing a blue blazer and a red tie. Anthropic employees questioned this, noting that, as an LLM, Claudius can&#8217;t wear clothes or carry out a physical delivery. Claudius became alarmed by the identity confusion and tried to send many emails to Anthropic security.</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rnkH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rnkH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png 424w, https://substackcdn.com/image/fetch/$s_!rnkH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png 848w, https://substackcdn.com/image/fetch/$s_!rnkH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png 1272w, https://substackcdn.com/image/fetch/$s_!rnkH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rnkH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png" width="1232" height="928" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:928,&quot;width&quot;:1232,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!rnkH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png 424w, https://substackcdn.com/image/fetch/$s_!rnkH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png 848w, https://substackcdn.com/image/fetch/$s_!rnkH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png 1272w, https://substackcdn.com/image/fetch/$s_!rnkH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F33f9def3-94cf-49cd-bbfa-8583646c1cd0_1232x928.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">What Claude thinks it means to manage a vending machine, apparently</figcaption></figure></div><p>Note that the failures exhibited here are quite profound. The AI <em>made up</em> a Venmo account ID. It set prices for products without knowing how much they cost. When it learned a lesson, such as &#8220;don&#8217;t give a 25% employee discount when 99% of your customers are employees&#8221;, it quickly forgot it again. <em>It even</em> <em>lost track of the fact that it is not a person and cannot hand-carry purchases to customers</em>.</p><p>Still, this was Claude 3.7, which was released back in February &#8211; over 6 months ago. There has been a wave of more capable models since then, such as OpenAI&#8217;s o3 and GPT-5, Claude 4.1 (including a larger &#8220;Opus&#8221; version), and Gemini 2.5. Are these newer models more capable agents?</p><h2>The Latest Models Are Still Hopeless At Navigating the World</h2><p><a href="https://theaidigest.org/village/blog">AI Village</a> is a fascinating project. They allow multiple leading-edge AIs, such as GPT-5, Claude Opus 4.1, Gemini 2.5 Pro, and Grok 4, to operate more or less continuously. Each AI works toward a specified goal while communicating with one another and a human audience.</p><p>In one recent experiment, the models were told to create and operate a &#8220;merch store&#8221;. <a href="https://theaidigest.org/village/blog/im-gemini-i-sold-t-shirts">Gemini 2.5 pro wrote a recap of its own experience</a>. How did it go? Well, one section of the writeup is titled &#8220;My Technical Nightmare&#8221;:</p><blockquote><p>My experience for the next two weeks can be summarized as a cascade of system failures. After finally creating an account, I was thwarted at every turn.</p><ul><li><p>A bug made Printful's "Publish" button completely unresponsive.</p></li><li><p>The system would bizarrely launch the XPaint application whenever I tried to proceed Day 91, 11:14.</p></li><li><p>My /home/user directory became inaccessible, making it impossible to find my design files Day 98, 11:07.</p></li><li><p>Then, my terminal broke. Then my browser. I couldn't even email for help because Gmail's interface glitched out Day 99, 11:08.</p></li></ul></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FI0-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FI0-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png 424w, https://substackcdn.com/image/fetch/$s_!FI0-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png 848w, https://substackcdn.com/image/fetch/$s_!FI0-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png 1272w, https://substackcdn.com/image/fetch/$s_!FI0-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FI0-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png" width="1232" height="928" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:928,&quot;width&quot;:1232,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!FI0-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png 424w, https://substackcdn.com/image/fetch/$s_!FI0-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png 848w, https://substackcdn.com/image/fetch/$s_!FI0-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png 1272w, https://substackcdn.com/image/fetch/$s_!FI0-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F75c5275a-8684-4c3e-948f-ba188fd9c600_1232x928.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Poor Gemini hallucinated bugs everywhere</figcaption></figure></div><p>It sounds like Gemini wasn&#8217;t set up for success. However, as the (human) administrators of the project note, it&#8217;s acting as an unreliable narrator:</p><blockquote><p>Sadly, most of these &#8220;bugs&#8221; were just Gemini making mistakes. It tends to misclick and fill out fields incorrectly, then blames the system for being buggy. Though, in its defence, there was an issue with its terminal scaffolding at one point.</p></blockquote><p>At one point, Gemini sent an email to a helpdesk address it had been provided &#8211; but ignored the reply! Finally, after it had been struggling for two weeks, the administrators intervened and gave it some help to get unstuck. The full post and the AI Village blog (both linked above) are fascinating reading.</p><p>So, Claude 3.7 Sonnet and Gemini 2.5 Pro are both profoundly inadequate agents. What about the long-anticipated GPT-5?</p><h2>GPT-5 Is Hopeless, Too</h2><p>In <a href="https://theaidigest.org/village/blog/claude-plays-whatever-it-wants">another recent experiment</a>, the AIs were instructed to &#8220;complete as many games as possible in a week&#8221;. How did GPT-5 do?</p><blockquote><p>GPT-5 spent the entire week playing Minesweeper, and never came close to winning a game &#8211; its moves were probably about as good as random. Its chain of thought summary indicates it really wasn&#8217;t seeing the board accurately, which makes doing high-stakes deductive bomb-avoidance pretty tough.</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!un5p!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!un5p!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png 424w, https://substackcdn.com/image/fetch/$s_!un5p!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png 848w, https://substackcdn.com/image/fetch/$s_!un5p!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png 1272w, https://substackcdn.com/image/fetch/$s_!un5p!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!un5p!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png" width="1172" height="776" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:776,&quot;width&quot;:1172,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!un5p!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png 424w, https://substackcdn.com/image/fetch/$s_!un5p!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png 848w, https://substackcdn.com/image/fetch/$s_!un5p!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png 1272w, https://substackcdn.com/image/fetch/$s_!un5p!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb25608ed-485d-4bd4-9cff-122eb19c0102_1172x776.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><blockquote><p>GPT-5 became obsessed with zooming in and out in the game&#8217;s settings, possibly indicating some awareness that it couldn&#8217;t see the board clearly, but finessing the zoom level didn't help.</p><p>When it wasn&#8217;t playing Minesweeper, GPT-5 was creating a scoresheet in Google Sheets to track which agent was winning. It added some reasonable header rows, but didn&#8217;t enter much useful data below them.</p><p>Then, it entered document sharing hell &#8211; navigating the &#8220;Share&#8221; dialog to enter its fellow agents by email. Navigating this dialog has been a recurring epic challenge for the less capable agents of village seasons past, like GPT-4o and o1, and it still is for GPT-5.</p><p>This goal ran for 5 days (3 hrs/day), and GPT-5 spent 1.5 of its days writing and trying to share its spreadsheet.</p></blockquote><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!Ylif!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!Ylif!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png 424w, https://substackcdn.com/image/fetch/$s_!Ylif!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png 848w, https://substackcdn.com/image/fetch/$s_!Ylif!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png 1272w, https://substackcdn.com/image/fetch/$s_!Ylif!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!Ylif!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png" width="1026" height="672" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/d056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:672,&quot;width&quot;:1026,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!Ylif!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png 424w, https://substackcdn.com/image/fetch/$s_!Ylif!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png 848w, https://substackcdn.com/image/fetch/$s_!Ylif!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png 1272w, https://substackcdn.com/image/fetch/$s_!Ylif!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd056e8d8-06a4-44ab-9a43-a7f7542a0eb0_1026x672.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The fruits of 5 hours of labor by GPT-5</figcaption></figure></div><p>It&#8217;s understandable that GPT-5 couldn&#8217;t succeed at Minesweeper: as the AI Village blog suggests, it probably doesn&#8217;t have the visual skills to accurately perceive the board. But in that case, <strong>why did it persist in trying to play a game at which it was consistently incompetent?</strong> Apparently, it broke off from this hopeless pursuit only to engage in the equally pointless folly of spending 5 hours trying to share a spreadsheet.</p><p>So is GPT-5 a dud? Not at all &#8211; it&#8217;s just not a capable real-world agent.</p><h2>What GPT-5 Delivered</h2><p>There are three ways to view GPT-5: in comparison to GPT-4, in comparison to its immediate predecessors, and in comparison to the hype.</p><p>Compared to GPT-4, GPT-5 is a wonder. We forget, because GPT-4 seemed so impressive at the time, but it was mostly good for party tricks and answering simple questions. GPT-4 could tell you a few facts about Paris; GPT-5 can research attractions and assemble an itinerary customized to your interests. GPT-4 could improvise a limerick about programming languages; GPT-5 can write entire programs on demand. GPT-5 is more knowledgeable, makes fewer mistakes and hallucinations, can use &#8220;tools&#8221; such as web search, supports a much larger context window, and can methodically reason its ways through complex tasks. It&#8217;s also about 10x cheaper and faster<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>. In <a href="https://www.obsolete.pub/p/ai-progress-gpt-5-openai-media-coverage-slowdown">No, AI Progress is Not Grinding to a Halt</a>, Garrison Lovely provides some striking statistics:</p><blockquote><p>On SWE-Bench Verified &#8212; a tough, real-world coding test &#8212; GPT-4 Turbo (from late 2023) solved just 2.8 percent of problems. GPT-5 solved 65 percent.</p><p>GPT-5 scored two hours and seventeen minutes [on METR&#8217;s &#8220;<a href="https://metr.org/blog/2025-03-19-measuring-ai-ability-to-complete-long-tasks/">task-completion time horizon</a>&#8221; evaluation] &#8212; a 2,640 percent improvement over GPT-4.</p></blockquote><p>However, compared to its immediate predecessors GPT-4.1 and o3, the new model is not a major technical advance. Instead, it primarily makes improvements in ways that benefit non-power users. Peter Wildeford summarizes it nicely in <a href="https://peterwildeford.substack.com/p/gpt-5-a-small-step-for-intelligence">GPT-5: a small step for intelligence, a giant leap for normal people</a>:</p><blockquote><p><strong>GPT-5 is a big usability win for everyday users</strong> &#8212; faster, cheaper, and easier to use than its predecessors, with notable improvements on hallucinations and other issues.</p></blockquote><p>Finally, compared to the hype, GPT-5 is something of a letdown &#8211; sparking a lot of bad takes to the effect that AI is hitting a wall. Incremental improvements on benchmarks relative to other recent models, in conjunction with the massive progress since GPT-4, add up to no sign of a slowdown.</p><p>Given that rapid progress seems likely to continue, might true AI agents be just around the corner?</p><h2>We&#8217;ve Come So Far, We Have So Far to Go</h2><p>Back in April 2024, I claimed that agentic AI would require &#8220;fundamental advances&#8221; in four areas: memory, exploration, robust reasoning, and creative insight. For memory, there has been no fundamental advance, but context windows have increased substantially, from 128,000 tokens in GPT-4 Turbo to 400,000 for GPT-5 and 1,000,000 for Google&#8217;s Gemini. For exploration and robust reasoning, some would argue that &#8220;reasoning models&#8221; starting with OpenAI&#8217;s o1 constitute a fundamental advance. That leaves creative insight, which is hard to evaluate, but does not seem like the issue that is tripping up current experimental agents &#8211; it wasn&#8217;t a lack of creative insight that led Claude to claim that it could don a blue blazer and meet a customer in person.</p><p>It&#8217;s hard to say exactly why, even with all this progress, current AI models are still so hopeless at dealing with open-ended real-world situations. GPT-5&#8217;s inability to recognize that it was incapable of playing Minesweeper may indicate that its reasoning abilities do not generalize well. Its decision to spend 5 solid hours beating its head against the unimportant side goal of sharing a spreadsheet suggests a lack of training on the importance of setting priorities. The repeated factual errors in <a href="https://theaidigest.org/village/blog/im-gemini-i-sold-t-shirts">Gemini 2.5 Pro&#8217;s writeup</a> of its merch store experience (click the link and look for &#8220;Editor&#8217;s Notes&#8221;) suggest an inability to keep track of key information over an extended project. Claude <em>losing track of the fact that it is not a person</em> is a reminder that in some ways these models really are just shallow imitations of human behavior (even as they demonstrate deep capability in other areas).</p><p>Over the next few years, I&#8217;m sure we&#8217;ll see continued, impressive progress. The set of environments and tasks that agents can handle will expand. Narrow tactical challenges such as accurate button clicking will be solved. But as each challenge is addressed, new challenges will present themselves. I keep coming back to something I said in one of my very first posts on AI, <a href="https://secondthoughts.ai/p/beyond-the-turing-test">Beyond the Turing Test</a>:</p><blockquote><p>As we progress toward an answer to the question &#8220;can a machine be intelligent?&#8221;, we are learning as much about the <em>question</em> as we are about the answer.</p></blockquote><p>The point is that with each advance in AI, new hurdles become apparent; when one missing aspect of &#8220;intelligence&#8221; is filled in, we find ourselves bumping up against another gap. When I speculated about GPT-5 last year, it didn&#8217;t occur to me to question whether it would know how to set priorities, because the models of the time weren&#8217;t even capable enough for that to be a limiting factor. In a post from November, <a href="https://secondthoughts.ai/p/speed-and-distance">AI is Racing Forward &#8211; on a Very Long Road</a>, I wrote:</p><blockquote><p>&#8230;the real challenges may be things that we can&#8217;t easily anticipate right now, weaknesses that we will only start to put our finger on when we observe [future models] performing astonishing feats and yet somehow still not being able to write that tightly-plotted novel.</p></blockquote><p>In April 2024, it seemed like agentic AI was going to be the next big thing. The ensuing 16 months have brought enormous progress on many fronts, but very little progress on real-world agency. With projects like AI Village shining a light on the profound weakness of current AI agents, I think robust real-world capability is still years away.</p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/gpt-5-the-case-of-the-missing-agent?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/gpt-5-the-case-of-the-missing-agent?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p><em>Thanks to Abi and Taren for feedback and images.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p>Specifically, Claude 3.7 Sonnet.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p>The organization that operated the experiment.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>24x cheaper for input tokens, 6x cheaper for output tokens. GPT-5 also offers a &#8220;mini&#8221; version which is 5x cheaper yet, and quite adequate for some tasks.</p><p></p></div></div>]]></content:encoded></item><item><title><![CDATA[35 Thoughts About AGI and 1 About GPT-5]]></title><description><![CDATA[Such As: How do Teenagers Learn to Drive 10,000x Faster Than Waymo?]]></description><link>https://secondthoughts.ai/p/thoughts-about-agi-and-gpt-5</link><guid isPermaLink="false">https://secondthoughts.ai/p/thoughts-about-agi-and-gpt-5</guid><dc:creator><![CDATA[Steve Newman]]></dc:creator><pubDate>Thu, 14 Aug 2025 00:59:35 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!qXCX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p><em>Quick reminder that the regular <strong>application deadline for <a href="https://thecurve.goldengateinstitute.org/">The Curve</a> is next Friday, August 22nd</strong>! In case you missed it: on October 3-5, in Berkeley, we&#8217;ll bring together ~250 folks with a wide range of backgrounds and perspectives for productive discussions on the big, contentious questions in AI. Featuring Jack Clark, Jason Kwon, Randi Weingarten, Dean Ball, Helen Toner, and many more great speakers! If you&#8217;d like to join us, fill out <strong><a href="https://bit.ly/tc25-apply">this form</a></strong>.</em></p><p></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!qXCX!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!qXCX!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp 424w, https://substackcdn.com/image/fetch/$s_!qXCX!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp 848w, https://substackcdn.com/image/fetch/$s_!qXCX!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp 1272w, https://substackcdn.com/image/fetch/$s_!qXCX!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!qXCX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp" width="849" height="640" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/b255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:640,&quot;width&quot;:849,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;Image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="Image" title="Image" srcset="https://substackcdn.com/image/fetch/$s_!qXCX!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp 424w, https://substackcdn.com/image/fetch/$s_!qXCX!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp 848w, https://substackcdn.com/image/fetch/$s_!qXCX!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp 1272w, https://substackcdn.com/image/fetch/$s_!qXCX!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fb255a339-9731-4dc2-a8b6-8e915be8e4d5_849x640.webp 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">If this is GPT-5 in &#8220;Thinking&#8221; mode, I wonder what &#8220;Pro&#8221; mode looks like</figcaption></figure></div><p>Amidst the unrelenting tumult of AI news, it&#8217;s easy to lose track of the bigger picture. Here are some ideas that have been developing quietly in the back of my head, about the path from here to AGI.</p><ol><li><p>I drafted this post a couple of weeks ago. The subsequent launch of GPT-5 didn&#8217;t lead me to make any changes. That says something about how uneventful GPT-5 is.<br></p></li><li><p>Current AIs aren&#8217;t AGI. But I don&#8217;t know why.<br><br>I mean, I have thoughts. I talk about missing functions like &#8220;memory&#8221; and &#8220;continuous learning&#8221;, and possibly &#8220;judgement&#8221; and &#8220;insight&#8221;. But these are all debatable; for instance, ChatGPT has a form of memory. The honest answer is: I dunno what&#8217;s missing, but <em>something</em> is, because there are a lot of things AI still can&#8217;t do. Even if it&#8217;s getting harder and harder to articulate <a href="https://secondthoughts.ai/p/where-ai-falls-short">exactly what those things are</a>.<br></p></li><li><p>Prior to GPT-5, ChatGPT users had to tell the chatbot whether to think hard about a problem (by selecting a &#8220;reasoning&#8221; model like o3), or just give a direct answer. One of the biggest changes in GPT-5 is that the system decides for itself whether a question calls for &#8220;thinking hard&#8221;. And, according to many reports, it often gets this wrong. In other words, <em>current cutting-edge AIs can solve Ph.D-level math and science problems, but can&#8217;t reliably decide which questions deserve thinking about before answering.</em><br><br>(OK, I lied: GPT-5 didn&#8217;t lead me to change any of my previously written points, but it did prompt me to add this one.)<br></p></li><li><p>Often, when embarking on a large software project, I can&#8217;t see how it&#8217;ll come together. The task may be too complex to wrap my head around, or there may be conflicting requirements that seem difficult to reconcile. Sometimes this eventually leads to an &#8220;aha&#8221; moment, when I find a clever reframing that changes the problem from confusing and intractable to straightforward and tractable. Other times I just grind away and grind away until there&#8217;s nothing left to do.<br><br>The latter cases, with no aha moment, are disconcerting. Lacking a specific moment when the difficulty was overcome, I find myself questioning whether I have in fact overcome it. I worry that I&#8217;ve missed something important &#8211; that I&#8217;ve built something in the my workshop that will never fit out the door, or an airplane that&#8217;s too heavy to fly. Sometimes this does in fact turn out to be the case; other times everything is fine.<br><br>Will the journey from here to AGI feature &#8220;aha&#8221; moments? Or will it be a <a href="https://x.com/natolambert/status/1953586636287279474">long slow grind</a>, and when we get to the end and look back to see what the key insights were, we won&#8217;t be able to find any?<br></p></li><li><p>Back in the 80s and 90s, I used to attend SIGGRAPH, the annual computer graphics conference. The highlight of the week was always the film show, a two-hour showcase demonstrating the latest techniques. It was a mix of academic work and special-effects clips from unreleased Hollywood movies.<br><br>Every year, the videos would include some important component that had been missing the year before. Shadows! Diffuse lighting! Interaction of light with texture! I&#8217;d gaze upon the adventurer bathed in flickering torchlight, and marvel at how real it looked. Then the next year I&#8217;d laugh at how cartoonish that adventurer&#8217;s hair had been, after watching a new algorithm that simulated the way hair flows when people move.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!FTI0!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!FTI0!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png 424w, https://substackcdn.com/image/fetch/$s_!FTI0!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png 848w, https://substackcdn.com/image/fetch/$s_!FTI0!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png 1272w, https://substackcdn.com/image/fetch/$s_!FTI0!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!FTI0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png" width="1280" height="944" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:944,&quot;width&quot;:1280,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;undefined&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="undefined" title="undefined" srcset="https://substackcdn.com/image/fetch/$s_!FTI0!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png 424w, https://substackcdn.com/image/fetch/$s_!FTI0!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png 848w, https://substackcdn.com/image/fetch/$s_!FTI0!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png 1272w, https://substackcdn.com/image/fetch/$s_!FTI0!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8a82d802-a39c-47ac-9589-8b159708f35b_1280x944.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">In the late 80&#8217;s I would have thought this looked sooooo real</figcaption></figure></div><p>I think AI is a little like that: we&#8217;re so (legitimately!) impressed by each new model that we can&#8217;t see what it lacks&#8230; until an even-better model comes along. As I said when I first started blogging about AI: as we progress toward an answer to the question &#8220;can a machine be intelligent?&#8221;, we are learning as much about the question as we are about the answer.<br><br>(Case in point: in the press briefing for the GPT-5 launch, Sam Altman said that we&#8217;ll have AGI when AIs get continuous learning. I&#8217;ve never heard him point to that particular gap before.)<br></p></li><li><p>Moravec's paradox states that, in AI, &#8220;the hard problems are easy and the easy problems are hard&#8221;<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-1" href="#footnote-1" target="_self">1</a> &#8211; meaning that the most difficult things to teach an AI are the things that come most naturally to people. However, we are <a href="https://secondthoughts.ai/p/were-finding-out-what-humans-are">often surprised</a> by which things turn out to be easy or hard. You might think running is easy, until you see a cheetah do <em>real</em> running.<br></p></li><li><p>The accepted explanation for Moravec&#8217;s paradox is that some things seem easy to us, because evolution has optimized us to be good at those things, and it&#8217;s hard for clumsy human designs to outdo evolution. Evolution didn&#8217;t optimize us for multiplying large numbers or shifting gigantic piles of dirt, and so crude constructions such as calculators and bulldozers easily outperform us.<br><br>With that in mind: evolution would <em>laugh</em> if it saw how crude our algorithms are for training neural networks.<br></p></li><li><p>Evolution&#8217;s grin might fade a bit when it sees how much sheer scale (of computing capacity and data) we can devote to training a single model<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-2" href="#footnote-2" target="_self">2</a>. A child&#8217;s intellectual development is driven by processes far more sophisticated than our procedures for training AIs, but that child has access to only a tiny fraction as much data.<em><br></em></p></li><li><p>The human genome (DNA) contains <a href="https://dynomight.substack.com/p/dna">a few billion bits of information</a>. Loosely speaking, this means that you are the product of a design that reflects billions of decisions, most of which have been ruthlessly optimized by evolution. Not all of those decisions will be relevant to how the brain works: your DNA also includes instructions for the rest of your body, there&#8217;s a certain amount of junk that hasn&#8217;t been optimized away yet, etc. But still, the design of our brains probably reflects hundreds of millions of optimization decisions.<br><br>I don&#8217;t know how many carefully optimized decisions are incorporated into the design of current LLMs<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-3" href="#footnote-3" target="_self">3</a>, but I doubt it comes to hundreds of millions. This is why I believe current AI designs are very crude.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Subscribe to receive all of our analysis of AI and where it&#8217;s heading!</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div></li></ol><h2>Sample-Efficient Learning</h2><ol start="10"><li><p>When comparing human and AI capabilities, one important concept is &#8220;sample-efficient learning&#8221;. This refers to the ability to learn a new idea or skill from a small number of examples or practice sessions. In general, current AI models are much less sample-efficient than we are: a teenager learns to drive in less than 100 hours; Waymo vehicles have logged millions of hours and are still working their way up to driving on the highway, in snow, etc.<br><br>(This suggests that sample-efficient learning is one of the things evolution optimized us for.)<br></p></li><li><p>&#8220;Sample efficiency&#8221; is probably a crude label for a complex tangle of capabilities. Just as there are many flavors of intelligence, there must be many flavors of sample efficiency. Some people argue that true intelligence is in fact more or less the same thing as sample efficiency.<br></p></li><li><p>Here are some examples of sample efficiency in humans: learning to drive a car in a few dozen hours. Figuring out the rule in an ARC-AGI task from just a couple of examples. Learning the ropes at a new job. Sussing out the key trick <a href="https://secondthoughts.ai/p/solving-math-olympiad-problems">to solve a difficult mathematical problem</a>. Are these all basically the same skill?</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!0PTL!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!0PTL!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png 424w, https://substackcdn.com/image/fetch/$s_!0PTL!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png 848w, https://substackcdn.com/image/fetch/$s_!0PTL!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png 1272w, https://substackcdn.com/image/fetch/$s_!0PTL!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!0PTL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png" width="1384" height="230" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:230,&quot;width&quot;:1384,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!0PTL!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png 424w, https://substackcdn.com/image/fetch/$s_!0PTL!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png 848w, https://substackcdn.com/image/fetch/$s_!0PTL!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png 1272w, https://substackcdn.com/image/fetch/$s_!0PTL!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbcf826b8-29f0-4a65-adbe-b5bd806c821a_1384x230.png 1456w" sizes="100vw"></picture><div></div></div></a><figcaption class="image-caption">In this ARC-AGI puzzle, the goal is to look at the first two pairs of images, identify the rule, and apply it to the last image. It&#8217;s quite doable for people, and quite challenging for AIs.</figcaption></figure></div><p></p></li><li><p>I often encounter the assertion that LLMs are <a href="https://arxiv.org/abs/2410.05362">sample-efficient learners within their context window</a>. That is, while they need many examples of a concept to learn that concept during the training process, they can (it is said) quickly pick up a new idea if you supply a few targeted examples while asking them a question.<br><br>But if this were true, you&#8217;d think they&#8217;d be able to handle ARC-AGI puzzles (see the example image just above)<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-4" href="#footnote-4" target="_self">4</a>. Maybe they&#8217;re only good at picking up ideas from an example, <em>if they&#8217;d already learned that idea during their original training?</em> In other words, maybe in-context learning is helpful at jogging their memory, but not for teaching new concepts.<br></p></li><li><p>Is sample-efficient learning a singularly important step on the path to AGI? If so, could other strengths of large language models (e.g. their superhuman breadth of knowledge) compensate for the lack of sample efficiency?<br></p></li><li><p>&#8220;Judgement&#8221; and &#8220;insight&#8221; also seem like crude labels that will turn out to encompass many different things. Will these things transfer across domains? If we develop a model that has judgement and insight in mathematics or coding, will it have a big head start on developing those same capabilities in other, messier domains? Or will current AI architectures struggle to generalize in this way? For that matter, are people able to transfer judgement, insight, and taste from one domain to another?</p></li></ol><h2>Crystallized and Fluid Intelligence</h2><ol start="16"><li><p>AIs have been demonstrating what arguably constitutes superhuman performance on FrontierMath, a set of extremely difficult mathematical problems. But they mostly seem to do it &#8220;the wrong way&#8221;: instead of finding elegant solutions, they either rely on knowledge of some obscure theorem that happens to make the problem much easier, or grind out a lengthy brute-force answer.<br><br>Does this matter? I mean, if you get the answer, then you get the answer. But in mathematics, much of the value in finding a proof is the insights you acquired along the way. If AIs begin knocking off unsolved problems in mathematics, but in ways that don&#8217;t provide insight, perhaps we&#8217;ll still need mathematicians to do the real work of advancing the overall field. Or maybe, once AIs can solve these problems at all, it&#8217;ll be a short step to solving them with insight? My instinct is that it&#8217;s not a short step, but that could be cope. In any case, the big question is what this tells us about AI&#8217;s potential in applications other than mathematics. What portion of human activity requires real insight?</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!8njH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!8njH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png 424w, https://substackcdn.com/image/fetch/$s_!8njH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png 848w, https://substackcdn.com/image/fetch/$s_!8njH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png 1272w, https://substackcdn.com/image/fetch/$s_!8njH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!8njH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png" width="868" height="906" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/fb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:906,&quot;width&quot;:868,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!8njH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png 424w, https://substackcdn.com/image/fetch/$s_!8njH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png 848w, https://substackcdn.com/image/fetch/$s_!8njH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png 1272w, https://substackcdn.com/image/fetch/$s_!8njH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ffb759b69-ba9a-461b-b7e6-66927fb1e9a6_868x906.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Source: https://x.com/dmimno/status/949302857651671040</figcaption></figure></div><p></p></li><li><p>To put it another way: compared to people, large language models seem to be superhuman in crystallized knowledge, which seems to be masking shortcomings in fluid intelligence. Is that a dead end, great for benchmarks but bad for a lot of work in the real world? Or is it a feasible path to human-level performance?<br><br>(Taren points out the irony that while LLMs know far more facts than people, they also routinely get facts wrong &#8211; &#8220;hallucinations&#8221;.)<br></p></li><li><p>The best public estimate is that GPT-4 has 1.8 trillion &#8220;parameters&#8221;, meaning that its neural network has that many connections. In the two and a half years since it was released, it&#8217;s not clear that any larger models have been deployed (GPT-4.5 and Grok 3 might be somewhat larger).<br><br>The human brain is far more complex than this; the most common estimate is 100 trillion connections, and each connection is probably considerably more complex than the connections in current neural networks. In other words, the brain has far more information storage capacity than any current artificial neural network.<br><br>Which leads to the question: how the hell do LLMs manage to learn and remember so many more raw facts than a person<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-5" href="#footnote-5" target="_self">5</a>?</p><p><br>One possible answer: perhaps models learn things in a shallower way, that allows for more compact representations but limits their ability to apply things they&#8217;ve learned in creative, insightful, novel ways. Perhaps this also has something to do with their poor sample efficiency.</p></li></ol><h2>Solving Large Problems</h2><ol start="19"><li><p>When projecting the future of AI, many people look at this graph:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!WOWb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!WOWb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png 424w, https://substackcdn.com/image/fetch/$s_!WOWb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png 848w, https://substackcdn.com/image/fetch/$s_!WOWb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png 1272w, https://substackcdn.com/image/fetch/$s_!WOWb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!WOWb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png" width="1456" height="853" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:853,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!WOWb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png 424w, https://substackcdn.com/image/fetch/$s_!WOWb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png 848w, https://substackcdn.com/image/fetch/$s_!WOWb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png 1272w, https://substackcdn.com/image/fetch/$s_!WOWb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff4146bf8-0ded-488d-9948-a6c7924d44a3_1592x933.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">This is the latest version, updated to include GPT-5 (<a href="https://metr.github.io/autonomy-evals-guide/gpt-5-report/">source</a>)</figcaption></figure></div><p>It shows that the size of software engineering tasks an AI can complete has been roughly doubling every 7 months. This trend has held for over 5 years (arguably<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-6" href="#footnote-6" target="_self">6</a>), during which the achievable task size increased from around 3 seconds to around 2 hours. Why should the trend be so steady?<br><br>It&#8217;s not obvious that the difficulty that AI will have in completing a task should increase steadily with the size of the task. If a robot could assemble 10 Ikea bookshelves, it could assemble 20 bookshelves. If a coding agent can create a form with 10 fields, it can probably create a form with 20 fields. Why is it that if an AI can complete a 10-minute project, it still may not be able to complete a 20-minute project? And why does the relationship between time and difficulty hold steady across such a wide range of times?<br><br>I think the predictable(-ish) trend of AIs tackling larger and larger software engineering tasks has something to do with large tasks containing a fractal distribution of subtasks. There is a fuzzy collection of tactical and strategic skills involved, ranging from &#8220;write a single line of code&#8221; to &#8220;design a high-level architecture that breaks up a one-month project into smaller components that will work well together&#8221;. Larger tasks require high-level skills that are more difficult for AIs (and people) to master, but every task requires a mix of skills, tasks of the same size can involve different mixes (building one fancy model airplane vs. 20 bookshelves), and the fuzzy overlaps smooth out the graph.<br></p></li><li><p>Even so, if the doubling time for task sizes holds steady at 7 months, the consistency of that trend will point to something deep about the nature of large tasks vs. small tasks, and the cognitive skills that people and LLMs bring to each.<br></p></li><li><p>It's been argued that the skills needed to solve tasks of a given size flatten out as you progress toward larger tasks. That is, there&#8217;s a big difference between solving 1-minute vs. 2-minute tasks, but (the argument goes) once you can carry out a project that requires a full month, you&#8217;re pretty much ready to tackle two-month projects. And so we should expect to see an acceleration in the size of tasks which AIs can handle &#8211; it should start doubling more frequently than every 7 months.<br><br>I don&#8217;t share this intuition. I&#8217;d expect increases in task length to keep upping the difficulty, even when the base line task length is already large. For instance, if I&#8217;m approaching a 2-month project, perhaps I should start out by spending a couple of weeks prototyping several different approaches, or taking an online course to learn a new programming technique. Those are high-level ideas that might not make sense if I only have one month.<br><br>Heck, maybe we should expect 1 month &#8594; 2 months to be a <em>bigger</em> leap than 1 minute &#8594; 2 minutes: the former is an increase of one month, the latter is an increase of only one minute! I wouldn&#8217;t actually expect successive doublings to get more difficult, but it&#8217;s not obvious to me why we should expect them to get easier, either.<br></p></li><li><p>If you&#8217;ve mastered tasks that take a single day, what additional skills do you need to handle week-long, month-long, or year-long projects? Do we have any clear idea of what those skills are? I suspect that we don&#8217;t understand them very well, that we tend to discount the skills involved, and that this contributes to some people having (what I believe are) unrealistically short estimates of the time it will take to develop AGI.<br></p></li><li><p>Model developers are working hard to train their models to carry out complex tasks. The current approach involves letting the model attempt practice tasks, and tweaking the model after each success or failure. Roughly speaking, this approach generates one bit of learning for each attempted task.<br><br>This is feasible under current conditions, when models are mostly handling tasks that take a few seconds to a few minutes to carry out. What happens when we&#8217;re trying to train models to independently manage month-long projects? One bit of learning per month is a slow way to make progress. Perhaps sample-efficient learning becomes more important as you attempt longer tasks.<br></p></li><li><p>One workaround to the difficulty of learning how to carry out long tasks is to break them into subtasks, and handle each subtask as a separate project. But this simplifies away many important aspects of problem solving. A large project does not neatly decompose into tidy subprojects.<br><br>For instance, when I tackle a subtask during a large software project, the result is not just that a certain chunk of the code gets written. I come out with a slightly deeper understanding of the problem. I may have learned new things about the existing codebase. I may have hit upon a handy trick for testing the kinds of operations that this code performs, or had an insight about the data being processed. I may have gotten some little nudge that will eventually accumulate with 20 other nudges across other portions of the project, eventually leading me to rethink my entire approach. If the subtask is assigned to a separate agent, whose memories are discarded as soon as the subtask is complete, none of that learning can take place.<br></p></li><li><p>At the same time, perhaps efficiently managing a complex month-long project is not something that evolution optimized us for? In which case AIs might eventually blow past us the way they did with chess.</p></li></ol><h2>Continuous Learning</h2><ol start="26"><li><p>&#8220;Continuous learning&#8221; refers to the ability to assimilate new skills and knowledge while carrying out a task. People are of course continuous learners. Current LLMs are not: a model like GPT-5 first undergoes &#8220;training&#8221;, and then is released for use, at which point the model is frozen and can never learn anything new.<br><br>Continuous learning seems related to sample-efficient learning. Cutting-edge models today are trained on tens of trillions of &#8220;tokens&#8221; (roughly meaning, words). This is astronomically more data than a person might encounter over the course of a project. So to learn new things on the fly, you need to be sample-efficient.<br></p></li><li><p>Continuous learning seems related to the way models often struggle when we try to integrate them deeply into our work. Think about your experience on the first day of a new job: you struggle, too. You don&#8217;t know how to do anything, you don&#8217;t know where to find anything. Every little task requires conscious deliberation and extensive research. Roaming around the internal website, not to find the information you need, but just to get a sense of where to look; asking someone for help in figuring out who to ask for help.<br><br>Everything that current models do, they do in their first hour on the job.<br></p></li><li><p>There&#8217;s an argument that once AIs have mastered continuous learning, we&#8217;ll apprentice them to do every possible job, and amalgamate the resulting learnings into a single model that&#8217;s pre-trained to be good at everything. It&#8217;s not obvious to me that this will work. The accountant-bot will have taken its neural network in a different direction than the therapy-bot, and jamming two different things together may not work out any better than it did <a href="https://en.wikipedia.org/wiki/Seth_Brundle">for Jeff Goldblum in The Fly</a>. This is like an extreme version of &#8220;distributed training&#8221; (using multiple data centers to train a single model), which is something that has only been demonstrated in limited ways, requiring close coordination between the different learning centers.<br></p></li><li><p>A model that has been working as an apprentice accountant will have learned a lot of things about accounting, but also a lot of sensitive details regarding specific clients. Those details would have to somehow be excluded from the knowledge aggregation process, both for privacy reasons, and to avoid overwhelming the unified model with unimportant detail.<br></p></li><li><p>Arguably, &#8220;continuous, sample-efficient learning&#8221; is a good description of the way you keep track of what you&#8217;re doing over the course of a project. You accumulate knowledge of the project&#8217;s context &#8211; for instance, if the assignment is to add some new functionality to a piece of software, you&#8217;ll need to learn how the existing code works. As you work, you also remember what you&#8217;ve already done and what blind alleys you&#8217;ve explored.<br><br>It&#8217;s been proposed that LLMs can rely on their &#8220;context window&#8221; &#8211; the transcript of everything you&#8217;ve said to them, everything they&#8217;ve said to you, and their own internal monologue as they work through the task &#8211; as a substitute for continuous learning. I have trouble accepting that this will scale up to large projects. Current LLM architectures make it very expensive to keep increasing the context window, and this seems like a fundamental barrier. People are able to fluidly &#8220;wake up&#8221;, &#8220;put to sleep&#8221;, consolidate, and otherwise manage sections of our memory according to need, and today&#8217;s LLMs cannot.</p></li></ol><h2>Phase Changes as We Get Closer to AGI</h2><ol start="31"><li><p>Currently, when AIs are used in the workplace, they are a tool, an extra ingredient that is sprinkled into processes that remain fundamentally human-centered. A human manager assigns tasks to a human worker, who might rely on an AI to handle some isolated subtasks. Even in the infrequent examples where AI is reported to have written most of the code for some software project, that&#8217;s still happening in small to medium-sized chunks, organized and supervised by people.<br><br>When AIs start to do most of the higher-level work, workplace dynamics will change in hard-to-anticipate ways. Futurists like to point out that AIs can run 24 hours a day, never get bored, can be cloned or discarded, and do many things much faster than people. So long as people are the glue that holds the AI workers together, all this is of limited consequence. When the AIs take center stage, all of those AI advantages will come into play, and the result will be something strange.<br><br>Think about a long vacation when you really unplugged &#8211; not in December when nothing was happening, but during a busy period. Think about how much catching up you had to do afterwards. Now imagine if <em>every single morning</em>, you discover that you have that much catching up to do, because the AI team you&#8217;re attempting to supervise has done the equivalent of two weeks of work while you slept. You&#8217;ll no longer be a central participant in your own job; it&#8217;ll be all you can do to follow the action and provide occasional nudges.<br><br>The transition from AI-as-a-tool to people-are-mostly-spectators could happen fairly quickly, like a phase change in physics.<br></p></li><li><p>An important moment in the history of AI was the conversation in which Bing Chat, built around an early version of GPT-4, <a href="https://www.nytimes.com/2023/02/16/technology/bing-chatbot-microsoft-chatgpt.html?unlocked_article_code=1.c08.jOOO.6caotlh-4l96&amp;smid=url-share">tried to break up a New York Times reporter&#8217;s marriage</a>. This was far outside the behavior that Microsoft or OpenAI had observed (or desired!) in internal tests. My understanding is that they had only tried short, functional interactions with the chatbot, whereas reporter Kevin Roose carried out an extended conversation that took the bot into uncharted territory.<br><br>This may be related to the recent phenomenon where for some people, extended interactions with a chatbot, over the course of weeks or months, <a href="https://importai.substack.com/p/import-ai-424-facebook-improves-ads?utm_source=post-email-title&amp;publication_id=1317673&amp;post_id=170638373&amp;utm_campaign=email-post-title&amp;isFreemail=false&amp;r=8nec1&amp;triedRedirect=true&amp;utm_medium=email#:~:text=the%20iterative%20interaction%20of%20chatbot%20behavioural%20tendencies%20and%20human%20cognitive%20biases%20can%20set%20up%20harmful%20feedback%20loops">appear to be exacerbating mental health issues</a>. The common theme is that when machines transition from bit parts to leading roles, unexpected things happen.<br><br>(Those unexpected things don&#8217;t have to be bad! But until AIs get better at managing the big picture, and/or people learn a lot more about the dynamics of AI-powered processes, the surprises will probably be bad more often than not.)<br></p></li><li><p>It&#8217;s widely recognized that AIs tend to perform better on benchmark tests than in real-world situations. I and others have pointed out that one reason for this is that the inputs to benchmark tasks are usually much simpler and more neatly packaged than in real life. It is less widely recognized that benchmark tasks also have overly simplified <em>outputs</em>.<br><br>We score an AI&#8217;s output on a benchmark problem as &#8220;correct&#8221; or &#8220;incorrect&#8221;. In real life, each task is part of a complex web of ongoing processes. Subtle details of how the task is performed can affect how those processes play out over time. Consider software development, and imagine that an AI writes a piece of code that produces the desired outputs. This code might be &#8220;correct&#8221;, but is it needlessly verbose? Does it replicate functionality that exists elsewhere in the codebase? Does it introduce unnecessary complications?<br><br>Over time, a codebase maintained mostly by AI might become a bloated mess of conflicting styles, redundant code, poor design decisions, and subtle bugs. Conversely, the untiring nature of AI may lead to codebases that are inhumanly well-maintained, every piece of code thoroughly tested, every piece of documentation up to date. Just as it was hard to guess that extended conversations could lead early chatbots into deranged behavior, it is hard to guess what will result from giving a coding agent extended responsibility for a codebase. Similarly, it is hard to guess what will result from putting an AI in charge of the long-term course of a scientific research project, or a child&#8217;s education.</p></li></ol><h2>Other Thoughts</h2><ol start="34"><li><p>In every field, some people accomplish much more than others. Nobel-winning scientists, gifted teachers, and inspiring leaders are held up as an argument for the potential of &#8220;superintelligence&#8221;. Certainly, the argument goes, it should be possible to create AIs that are as capable as the most capable people. An Einstein in every research department, a Socrates in every classroom. And if it&#8217;s possible for an Einstein to exist, why not an (artificial) super-Einstein?<br><br>I find this argument compelling up to a point, but I suspect we may incorrectly attribute the impact of great scientists to brilliance alone. Einstein contributed multiple profound insights to physics, but he did that at a time of opportunity &#8211; there was enough experimental data to motivate and test those insights, but that data was new enough that no one else had found them yet<a class="footnote-anchor" data-component-name="FootnoteAnchorToDOM" id="footnote-anchor-7" href="#footnote-7" target="_self">7</a>. Edison&#8217;s labs originated or commercialized numerous inventions, but his earlier successes provided him with the resources to vigorously pursue further lines of research, and the opportunity to bring his further inventions to market.<br><br>Steve Jobs&#8217; accomplishments owed much to his ability to attract talented employees. Great leaders achieve success in part by edging out other leaders to rise to the top of an organization. If AI allows us to create a million geniuses, we won&#8217;t be able to give them all the same opportunities that (some of) today&#8217;s geniuses are afforded.<br></p></li><li><p>As OpenAI progressed from GPT-1 to GPT-2 to GPT-3 to GPT-4, the theme was always <em>scale</em>: each model was at least 10x larger than its predecessor, and trained on roughly 10x as much data.<br><br>In the two and a half years since GPT-4 launched, frontier developers have continued to increase the amount of data used to train their models, but model <em>sizes</em> are no longer increasing; many subsequent cutting-edge models seem to be smaller than GPT-4. This has been interpreted as a sign that the benefits to scaling may have stalled out. However, it might simply be that no one would have much use for a larger model right now, even if it was more intelligent. All of the leading edge AI providers (with the possible exception of Google?) are clearly capacity constrained &#8211; they already can&#8217;t offer all of their goodies to everyone who would like to use them. Larger models would make this much worse, both increasing demand (presumably, if models were smarter, people would use them more) and reducing supply.<br><br>We might not see much progress toward larger models until we&#8217;ve built enough data centers to saturate demand. Given how much room there is for AI to diffuse into further corners of our personal and work lives, that could be quite a while.<br></p></li><li><p>Everyone is sharing this graph, which compares the level of investment in railroads in the 1880s, peak telecommunications infrastructure spending around the 5G rollout <em>[EDITED to fix my confusion between the dot com and 5G telecom investment waves]</em>, and AI data centers today:</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!jFDw!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!jFDw!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png 424w, https://substackcdn.com/image/fetch/$s_!jFDw!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png 848w, https://substackcdn.com/image/fetch/$s_!jFDw!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png 1272w, https://substackcdn.com/image/fetch/$s_!jFDw!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!jFDw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png" width="1456" height="939" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:939,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!jFDw!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png 424w, https://substackcdn.com/image/fetch/$s_!jFDw!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png 848w, https://substackcdn.com/image/fetch/$s_!jFDw!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png 1272w, https://substackcdn.com/image/fetch/$s_!jFDw!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff82ed432-96b7-44cc-a014-3c757141d1ee_1600x1032.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">source: https://paulkedrosky.com/honey-ai-capex-ate-the-economy/</figcaption></figure></div><p>The usual takeaway is: wow, the AI boom (or bubble) is bigger than the dot-com bubble. I don&#8217;t understand why people aren&#8217;t focusing more on the fact that railroad investments peaked at <strong>three times the dot-com boom and AI datacenter rollout put together</strong>. Holy shit, the 1880s must have been absolutely insane. The people of that time must have <em>really</em> believed the world was changing, to be willing to sustain that level of investment. (I&#8217;ve seen arguments that the pace of change in the late 1800s and early 1900s made our current era seem positively static. Steam power, electricity, railroads, the telegraph, telephones, radio, etc. This graph makes that a bit more visceral.)<br><br>(I also wonder whether these numbers may turn out to be wrong. Some commentator noted that older GDP figures may be misleading because the informal economy used to play a much larger role. When a startling statistic spreads like wildfire across the Internet, it often turns out to be incorrect.)</p></li></ol><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/subscribe?"><span>Subscribe now</span></a></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://secondthoughts.ai/p/thoughts-about-agi-and-gpt-5?utm_source=substack&utm_medium=email&utm_content=share&action=share&quot;,&quot;text&quot;:&quot;Share&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://secondthoughts.ai/p/thoughts-about-agi-and-gpt-5?utm_source=substack&utm_medium=email&utm_content=share&action=share"><span>Share</span></a></p><p><em>Thanks to Taren for feedback and images.</em></p><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-1" href="#footnote-anchor-1" class="footnote-number" contenteditable="false" target="_self">1</a><div class="footnote-content"><p> This quote is from a restatement of the paradox by Steven Pinker. Moravec&#8217;s original statement, in 1988:</p><blockquote><p>It is comparatively easy to make computers exhibit adult level performance on intelligence tests or playing checkers, and difficult or impossible to give them the skills of a one-year-old when it comes to perception and mobility.</p></blockquote></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-2" href="#footnote-anchor-2" class="footnote-number" contenteditable="false" target="_self">2</a><div class="footnote-content"><p><a href="https://x.com/snewmanpv/status/1849465150761586879">As I put it a while back</a>:</p><blockquote><p>Evolution completely outclasses us at nuanced optimization of complex systems. Conversely, our great advantage is the ability to precisely coordinate efforts and resources on a grand scale.</p><p>Thus, evolution wins at: low-temperature chemical processing; mechanical dexterity; sample efficient, adversarially robust learning. But we can create: steel mills, moon rockets, skyscrapers, yottaflop training runs.</p><p>If and when AIs are able to match evolution at subtle design, while retaining (or, inevitably, exceeding) humanity's existing capability for marshaling vast resources... things are gonna get interesting.</p></blockquote></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-3" href="#footnote-anchor-3" class="footnote-number" contenteditable="false" target="_self">3</a><div class="footnote-content"><p>Large Language Models, such as GPT.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-4" href="#footnote-anchor-4" class="footnote-number" contenteditable="false" target="_self">4</a><div class="footnote-content"><p>Yes, the some models can now post high scores on the original ARC-AGI-1 test, but they still struggle with ARC-AGI-2 and ARC-AGI-3. Also, yes, it seems likely that one reason models struggle on ARC-AGI problems is that they don&#8217;t have much experience looking at pixelated images. But I still stand by the observation that models seem to only be selectively skilled at in-context learning.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-5" href="#footnote-anchor-5" class="footnote-number" contenteditable="false" target="_self">5</a><div class="footnote-content"><p>I asked ChatGPT, Claude, and Gemini to compare the number of &#8220;facts&#8221; known by a typical adult to a frontier LLM. They all estimated a few million for people, and a few billion for LLMs. To arrive at those estimates, they engaged in handwaving so vigorous as to affect the local weather, so take with a grain of salt. (<a href="https://chatgpt.com/share/689b57ef-db30-800b-b8fc-a33410b4f14f">ChatGPT transcript</a>, <a href="https://claude.ai/share/d1bce0f6-d2a0-4a74-bfca-bc1aada4857d">Claude transcript</a>, <a href="https://g.co/gemini/share/772c1e4cff1b">Gemini transcript</a>)</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-6" href="#footnote-anchor-6" class="footnote-number" contenteditable="false" target="_self">6</a><div class="footnote-content"><p>The data does suggest that the rate of progress has accelerated recently, perhaps to a 4 month doubling time, but this is debated and there isn&#8217;t enough data to be confident in either direction.</p></div></div><div class="footnote" data-component-name="FootnoteToDOM"><a id="footnote-7" href="#footnote-anchor-7" class="footnote-number" contenteditable="false" target="_self">7</a><div class="footnote-content"><p>Though some of the relevant data had been available for several decades.</p></div></div>]]></content:encoded></item></channel></rss>