<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Lin Cong Fieldnotes]]></title><description><![CDATA[Field-based geopolitics insights on US-China-UK]]></description><link>https://www.lincongfieldnotes.com</link><image><url>https://www.lincongfieldnotes.com/img/substack.png</url><title>Lin Cong Fieldnotes</title><link>https://www.lincongfieldnotes.com</link></image><generator>Substack</generator><lastBuildDate>Wed, 03 Jun 2026 14:15:10 GMT</lastBuildDate><atom:link href="https://www.lincongfieldnotes.com/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Lin Cong]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[lincong1@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[lincong1@substack.com]]></itunes:email><itunes:name><![CDATA[Lin Cong]]></itunes:name></itunes:owner><itunes:author><![CDATA[Lin Cong]]></itunes:author><googleplay:owner><![CDATA[lincong1@substack.com]]></googleplay:owner><googleplay:email><![CDATA[lincong1@substack.com]]></googleplay:email><googleplay:author><![CDATA[Lin Cong]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Notes on the 7-hour marathon interview with Saining Xie]]></title><description><![CDATA[Some observations from the first and only podcast interview with Saining Xie.]]></description><link>https://www.lincongfieldnotes.com/p/notes-on-the-7-hour-marathon-interview</link><guid isPermaLink="false">https://www.lincongfieldnotes.com/p/notes-on-the-7-hour-marathon-interview</guid><dc:creator><![CDATA[Lin Cong]]></dc:creator><pubDate>Mon, 27 Apr 2026 00:01:45 GMT</pubDate><enclosure 
url="https://substackcdn.com/image/fetch/$s_!vIcu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8172c103-b570-417d-b6b5-0a0adfed26ec_1384x766.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Saining Xie is an Assistant Professor of Computer Science at NYU and a co-founder and CTO of AMI Labs. He argues that human-like intelligence will require moving beyond language-only systems toward models that learn from continuous signals in the real world.</p><p>Please watch the full podcast <a href="https://www.youtube.com/watch?v=rIwgZWzUKm8">here</a>, and support its channel <em>Zhang Xiaojun Podcast</em>.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vIcu!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8172c103-b570-417d-b6b5-0a0adfed26ec_1384x766.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vIcu!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8172c103-b570-417d-b6b5-0a0adfed26ec_1384x766.png 424w, https://substackcdn.com/image/fetch/$s_!vIcu!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8172c103-b570-417d-b6b5-0a0adfed26ec_1384x766.png 848w, https://substackcdn.com/image/fetch/$s_!vIcu!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8172c103-b570-417d-b6b5-0a0adfed26ec_1384x766.png 1272w, 
https://substackcdn.com/image/fetch/$s_!vIcu!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8172c103-b570-417d-b6b5-0a0adfed26ec_1384x766.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vIcu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8172c103-b570-417d-b6b5-0a0adfed26ec_1384x766.png" width="1384" height="766" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/8172c103-b570-417d-b6b5-0a0adfed26ec_1384x766.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:766,&quot;width&quot;:1384,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:1124113,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.lincongfieldnotes.com/i/195535264?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8172c103-b570-417d-b6b5-0a0adfed26ec_1384x766.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!vIcu!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8172c103-b570-417d-b6b5-0a0adfed26ec_1384x766.png 424w, https://substackcdn.com/image/fetch/$s_!vIcu!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8172c103-b570-417d-b6b5-0a0adfed26ec_1384x766.png 848w, 
https://substackcdn.com/image/fetch/$s_!vIcu!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8172c103-b570-417d-b6b5-0a0adfed26ec_1384x766.png 1272w, https://substackcdn.com/image/fetch/$s_!vIcu!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F8172c103-b570-417d-b6b5-0a0adfed26ec_1384x766.png 1456w" sizes="100vw" fetchpriority="high"></picture></div></a></figure></div><p><strong>1. 
Teaching AI How to Love</strong></p><p style="text-align: justify;">Early in the interview, Xie recalls a phone call with Ilya Sutskever. The call was meant to be about whether Xie might join Ilya&#8217;s company at the time. Instead, they spent the whole time talking about how to teach AI to love.</p><p style="text-align: justify;">There was no action plan at the end, but the anecdote goes to show the kind of question Xie thinks is worth spending time on. He sees the ability to love as basic to the future stability of AI. At the same time, he sees the other side of the coin: if a system can love, hate may follow. That is part of why he&#8217;s drawn to world models not as a purely technical goal, but as a path to safer intelligence.</p><p style="text-align: justify;"><strong>2. On the Nature of Research</strong></p><p style="text-align: justify;">Xie said he dislikes the word &#8220;influence.&#8221; He agrees with his friend Kaiming He (whom he mentioned 84 times during the conversation) that the nature of research is to share knowledge. A paper should help us understand something, which matters much more than performance numbers or academic posturing.</p><p style="text-align: justify;">Xie invokes Hannah Arendt&#8217;s well-known distinction between influence and understanding&#8212;in a 1964 interview she called the desire to be influential &#8216;a masculine question&#8217; and said she preferred to understand. As Arendt put it, the purpose of this pursuit is &#8220;understanding&#8221;: living beings need to be understood.</p><p style="text-align: justify;">That is the version of research Xie wants to do. You arrive at some understanding, you write it down, you put it out into the world. If it resonates, the understanding spreads &#8212; and somewhere along the way, you feel understood too. 
He described that loop as something close to family.</p><p style="text-align: justify;">Xie added: influence is self-centred; mutual understanding raises the collective intelligence of the whole planet. And more intelligence benefits everyone.</p><p style="text-align: justify;"><strong>3. Research Taste</strong></p><p style="text-align: justify;">Xie said good research taste is hard to define directly. But he was clear on what bad taste looks like.</p><p style="text-align: justify;">Bad taste, in his view, comes from chasing appearances: conference acceptances, praise, hype, quick results, status&#8212;all the external signs that make something look successful.</p><p style="text-align: justify;">He linked this to a line from the <em>Diamond Sutra</em>: all appearances are illusory; what you see on the surface is often not what matters most.</p><p style="text-align: justify;">That idea applies far beyond research&#8212;to products, marketing, writing, and life in general. It is easy to do things because they look right or because society rewards them. 
It is much harder to keep asking what is fundamental.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!iPbx!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbdd72e1-caeb-480e-b319-4e227ab1fa09_1200x1108.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!iPbx!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbdd72e1-caeb-480e-b319-4e227ab1fa09_1200x1108.png 424w, https://substackcdn.com/image/fetch/$s_!iPbx!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbdd72e1-caeb-480e-b319-4e227ab1fa09_1200x1108.png 848w, https://substackcdn.com/image/fetch/$s_!iPbx!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbdd72e1-caeb-480e-b319-4e227ab1fa09_1200x1108.png 1272w, https://substackcdn.com/image/fetch/$s_!iPbx!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbdd72e1-caeb-480e-b319-4e227ab1fa09_1200x1108.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!iPbx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbdd72e1-caeb-480e-b319-4e227ab1fa09_1200x1108.png" width="1200" height="1108" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/bbdd72e1-caeb-480e-b319-4e227ab1fa09_1200x1108.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1108,&quot;width&quot;:1200,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!iPbx!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbdd72e1-caeb-480e-b319-4e227ab1fa09_1200x1108.png 424w, https://substackcdn.com/image/fetch/$s_!iPbx!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbdd72e1-caeb-480e-b319-4e227ab1fa09_1200x1108.png 848w, https://substackcdn.com/image/fetch/$s_!iPbx!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbdd72e1-caeb-480e-b319-4e227ab1fa09_1200x1108.png 1272w, https://substackcdn.com/image/fetch/$s_!iPbx!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fbbdd72e1-caeb-480e-b319-4e227ab1fa09_1200x1108.png 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p style="text-align: justify;">Xie pointed to the non-linear research career of Professor Bill Freeman. Strong researchers are often driven by long-term passion and pure curiosity. They care less about short-term reception and more about whether their work eventually reaches its fullest expression.</p><p style="text-align: justify;"><strong>4. Doing Research and Filmmaking</strong></p><p style="text-align: justify;">Xie mentioned many times that he loves movies. His childhood dream was to become a director, though it quickly faded.</p><p style="text-align: justify;">As an adult, he realised that doing research is no different from making a film.</p><p style="text-align: justify;">A good paper is not just a pile of methods and results. 
The techniques matter, but what matters just as much is the story behind the paper: what decisions were made, why they were made, what changed, what the researchers saw that others missed, and whether the reader leaves with a new path to explore.</p><p style="text-align: justify;">He puts this in personal terms as well. Quoting Martin Scorsese&#8217;s idea that the most creative things are the most personal, Xie argues that long-term work needs an inner force behind it. Technique matters. But so does finding &#8220;the fire in your heart&#8221; and using your own perspective to guide the work.</p><p style="text-align: justify;"><strong>5. Video as Non-Static</strong></p><p style="text-align: justify;">On studying world models through video, Xie mentioned his admiration for Chinese directors Jia Zhangke and Bi Gan, both known for long takes.</p><p style="text-align: justify;">Xie summarised Jia Zhangke&#8217;s idea: what makes cinema interesting is that every frame on the timeline can be expanded through cinematic space, which Jia called &#8220;blank space (&#30041;&#30333;)&#8221;. Although we see a sequence of frames, what lies behind it is the state of the world and global information across space.</p><p style="text-align: justify;">At the same time, this raises a harder question. Different people could infer different things from the same scene. What exactly should a universal world model reconstruct? Is there one shared world behind perception, or is part of the difficulty that interpretation is never fully identical?</p><p style="text-align: justify;">Xie did not offer a final answer. But that may be part of the point. World models are difficult not only because they must recognise what is visible, but also because they must learn the hidden structure behind what we see.</p><p style="text-align: justify;"><strong>6. What Problem Does Computer Vision Solve?</strong></p><p style="text-align: justify;">This is where video becomes interesting. 
A still image gives us a scene, but video gives us continuity, motion, and clues about cause and effect. Xie draws on cinema to argue that each frame points beyond itself to a larger world behind the image. This matters because it shows what he thinks current AI lacks. Humans process the world by inferring space, objects, continuity, and hidden structure across time. In a sense, seeing is a form of modelling. A system that only labels pixels is doing something far narrower than perception as humans experience it.</p><p style="text-align: justify;">In <em>Cambrian-S</em>, Xie and his co-authors argue that multimodal AI must move beyond simple image description toward what they call &#8220;spatial supersensing.&#8221; The key idea is that real progress will require models that go beyond labelling what they see and build internal structure from continuous experience.</p><p style="text-align: justify;">Xie uses an analogy with the levels of autonomous driving to explain the path from language-only models to systems that can perceive continuous visual streams, infer spatial structure, and eventually predict the world. The exact labels in the interview are informal, but the underlying argument is consistent with <em>Cambrian-S</em>:</p><ul><li><p><strong>L0</strong>: Pure LLM. It cannot see images or video and knows the world only through language, like the prisoners in Plato&#8217;s cave.</p></li><li><p><strong>L1</strong>: Current multimodal systems. Show them an image, and they can answer questions about it.</p></li><li><p><strong>L2</strong>: Systems that can handle continuous visual streams, not just still images, and understand what is happening.</p></li><li><p><strong>L3</strong>: Spatial cognition. 
At every point in time, the system can infer the real 3D space behind the pixels.</p></li><li><p><strong>L4</strong>: A predictive world model&#8212;an agent that truly lives in the real world.</p></li></ul><p style="text-align: justify;">In Xie&#8217;s view, computer vision tackles a basic problem that intelligence must solve: to build lasting internal models of the world, use them to organise incoming information, and predict what happens next.</p><p style="text-align: justify;"><strong>7. Representation Learning Sits Underneath Everything Else</strong></p><p style="text-align: justify;">A related theme is representation learning. Xie keeps returning to how a system learns the right internal representations in the first place. On his personal site, he describes his work as pushing the boundaries of multimodal intelligence and &#8220;spatial supersensing&#8221; across images and videos. The emphasis is not only on outputs but also on the underlying structure that enables a model to perceive and reason well.</p><p style="text-align: justify;">This is also why he worries that language can become a crutch. If vision only serves language, the model may never learn the deeper abstractions that perception should provide on its own. In his view, good intelligence may depend less on piling up more text and more on learning stronger internal representations from the world itself.</p><p style="text-align: justify;"><strong>8. LLMs Are Anti&#8211;Bitter Lesson</strong></p><p style="text-align: justify;">The Bitter Lesson (Rich Sutton) says that, across the history of AI, general-purpose methods that exploit large amounts of computing power have ended up beating human-designed &#8220;clever&#8221; solutions in the long run. In other words, don&#8217;t overvalue human prior knowledge; let machines learn for themselves.</p><p style="text-align: justify;">Many see LLMs as a perfect example of the Bitter Lesson. But Xie argues the opposite. 
He thinks LLMs are anti&#8211;Bitter Lesson.</p><p style="text-align: justify;">Language itself is one of humanity&#8217;s most refined designs, shaped over thousands of years. It has syntax, structure, and logic. From day one, LLMs have remained within human prior frameworks. As Xie put it: &#8220;Language is an extremely refined human structure.&#8221; Leaning too heavily on language risks distorting the core of vision research.</p><p style="text-align: justify;">In Xie&#8217;s view, the way forward might be to abandon language&#8217;s structure and learn from something more basic: pixels, or even something other than pixels (more on this in a later section).</p><p style="text-align: justify;">This points to a deeper question: what really counts as intelligence? If a system only ever works within rules humans laid down for it, in what sense is it thinking? Real intelligence, the argument goes, would be a system that breaks past our categories and works out the basic logic of the world on its own.</p><p style="text-align: justify;"><strong>9. 
Robotics as a Problem of Intelligence</strong></p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!YRXV!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5243ebee-e612-4169-8cdd-45ff78c1f033_2096x1179.jpeg" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!YRXV!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5243ebee-e612-4169-8cdd-45ff78c1f033_2096x1179.jpeg 424w, https://substackcdn.com/image/fetch/$s_!YRXV!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5243ebee-e612-4169-8cdd-45ff78c1f033_2096x1179.jpeg 848w, https://substackcdn.com/image/fetch/$s_!YRXV!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5243ebee-e612-4169-8cdd-45ff78c1f033_2096x1179.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!YRXV!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5243ebee-e612-4169-8cdd-45ff78c1f033_2096x1179.jpeg 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!YRXV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5243ebee-e612-4169-8cdd-45ff78c1f033_2096x1179.jpeg" width="1456" height="819" 
data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/5243ebee-e612-4169-8cdd-45ff78c1f033_2096x1179.jpeg&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:819,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!YRXV!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5243ebee-e612-4169-8cdd-45ff78c1f033_2096x1179.jpeg 424w, https://substackcdn.com/image/fetch/$s_!YRXV!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5243ebee-e612-4169-8cdd-45ff78c1f033_2096x1179.jpeg 848w, https://substackcdn.com/image/fetch/$s_!YRXV!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5243ebee-e612-4169-8cdd-45ff78c1f033_2096x1179.jpeg 1272w, https://substackcdn.com/image/fetch/$s_!YRXV!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F5243ebee-e612-4169-8cdd-45ff78c1f033_2096x1179.jpeg 1456w" sizes="100vw" loading="lazy"></picture></div></a></figure></div><p style="text-align: justify;">The same logic appears in the interview&#8217;s discussion of robotics. Xie&#8217;s view is that robot hardware has advanced much faster than robot intelligence. The dancing robots at the Chinese New Year gala were spectacular, but according to Xie&#8217;s private conversation with the research team behind them, the team itself was worried about the missing piece: the &#8220;brains&#8221;.</p><p style="text-align: justify;">That fits the broader public direction of AMI Labs, which aims to build AI systems that understand the real world and apply them in robotics, automation, healthcare, and wearable devices. The emphasis is on systems that are reliable and controllable in messy real environments, not only impressive in demos.</p><p style="text-align: justify;"><strong>10. 
Goal, Not a Route</strong></p><p style="text-align: justify;">Xie&#8217;s definition of a world model is simple: given the current state of a system and an action, the AI predicts what comes next. That prediction is what guides the decision.</p><p style="text-align: justify;">The idea isn&#8217;t new. Psychologist Kenneth Craik proposed it in 1943. Engineers used it to fly moon probes in the 1960s and 70s. Reinforcement learning uses it too.</p><p style="text-align: justify;">Xie&#8217;s point is that a world model isn&#8217;t a particular algorithm or technique. It&#8217;s a destination. LLM researchers, video researchers, and robotics researchers are all walking toward it from different directions.</p><p style="text-align: justify;">Video generation gets you closer than language alone, because to make a believable video, a model has to know things about the physical world &#8212; for example, that a cat has four legs, not three.</p><p style="text-align: justify;">Still, Xie doesn&#8217;t think pixels are the final answer. Pixels are a grid we built for human eyes &#8212; a simulator made for watching, not for predicting. The point of a world model is to predict the world, not to render a nice-looking video.</p><p style="text-align: justify;"><strong>11. How to Train a World Model: Download Humanity</strong></p><p style="text-align: justify;">If the LLM era meant downloading the whole internet, then the world model era means downloading humanity.</p><p style="text-align: justify;">That sounds like science fiction, because the data scale is terrifying. LeCun gave an example: a four-year-old child has seen roughly 50 times more visual data than the largest LLMs are trained on.</p><p style="text-align: justify;">But starting from internet videos is a workable path, and Xie agrees. YouTube has huge amounts of data. The current issue is copyright and access. 
(The interviewer commented, &#8220;ByteDance has a real advantage,&#8221; and Xie agreed.)</p><p style="text-align: justify;">Another interesting question: what is the product of a world model?</p><p style="text-align: justify;">LLMs have chatbots&#8212;a huge success. Xie mentioned two directions: AI glasses (always-on real-world perception and smart decision-making) and robots (but the brain problem remains unsolved).</p><p style="text-align: justify;"><strong>12. AI Companies Are in an Arms Race; AMI Labs Wants a &#8220;Grassroots Alliance&#8221;</strong></p><p style="text-align: justify;">On the endless hot topics and competition in AI, Xie noted that there is a huge value chain:</p><p style="text-align: justify;">At the top are stories like AGI, scaling laws, LLMs. These stories define which benchmarks to compete on. Benchmarks decide where resources go. Resource allocation forces everyone onto the same track.</p><p style="text-align: justify;">The result: everyone loses the ability to define the problem.</p><p style="text-align: justify;">He gave an example: at Google, a researcher wanted to do representation learning work but was stopped after two weeks because the product cycle had to finish. It is not that people lack the desire or ability&#8212;it is that the value chain leaves no room.</p><p style="text-align: justify;">What AMI Labs wants to do, Xie calls &#8220;reverse OpenAI.&#8221; Instead of taking shortcuts by downloading internet data, they work with real-world people who have specific problems and data to co&#8209;build a world model. He compares it to how Mastercard once fought Visa: one small bank cannot beat Bank of America, but if many join forces to launch a credit card, they can compete. Hence a &#8220;grassroots alliance.&#8221;</p><p style="text-align: justify;"><strong>13. 
LeCun&#8217;s Magnetism and the &#8220;Metaphysical&#8221; Side of Research</strong></p><p style="text-align: justify;">When talking about starting a company, Xie shared many details about working with Yann LeCun.</p><p style="text-align: justify;">He admired LeCun&#8217;s many hobbies&#8212;astrophotography, building model aeroplanes, electronic music, sailing, watching films. LeCun came across as a deeply passionate and many&#8209;sided person. Xie said that was part of why he wanted to work with him.</p><p style="text-align: justify;">Beyond academics, LeCun also seems to have a personal pull for Xie. In Xie&#8217;s words, like Jobs and Musk, LeCun has a kind of &#8220;reality distortion field&#8221; that makes it easy to believe in what he is saying.</p><p style="text-align: justify;">In a sense, Xie&#8217;s choice to work with LeCun reflects his broader view of world models: that they emerge through contact with the real world and through human relationships. The path is hard, but the problem is worth solving, and worth taking risks for.</p><p style="text-align: justify;">This outlook shapes his research taste, his community, and even his career and startup decisions. He describes part of this as &#8220;metaphysical (&#29572;&#23398;)&#8221;.</p><p style="text-align: justify;"><strong>14. What Is True Intelligence?</strong></p><p style="text-align: justify;">Xie prefers not to define intelligence. Different animals have different kinds of it, and humans are just one kind.</p><p style="text-align: justify;">He agrees with a point Rich Sutton has made: we treat LLMs that write code, win IMO gold medals, and help us reach Mars as impressive &#8212; but the truly hard thing is building a squirrel&#8217;s intelligence.</p><p style="text-align: justify;">It is a deliberately non-anthropocentric view. 
If you can build a squirrel &#8212; give it its own goals, emotions, hunger, and ability to survive in the real world &#8212; then writing code or going to Mars becomes trivial by comparison.</p><p style="text-align: justify;">The same shift applies to world models. Instead of asking when AGI will arrive, ask whether any robot today can do all the chores a 12-year-old can. In Xie&#8217;s view, that is the question every robotics company should be asking.</p><p style="text-align: justify;"><strong>15. &#8220;I Am the Normal One&#8221;</strong></p><p style="text-align: justify;">Near the end of the interview, the interviewer asked Xie about something he had said: &#8220;I am not the chosen one, I am the normal one.&#8221;</p><p style="text-align: justify;">Xie explained the line came from his favourite football coach, J&#252;rgen Klopp. Klopp may dress like a punk and project confidence, but he still believes he is not the chosen one&#8212;just an ordinary person.</p><p style="text-align: justify;">To me, this echoes Xie&#8217;s broader life philosophy. It is an attempt to step outside egocentrism and stay self&#8209;aware. That may be why his research feels both grounded and otherworldly.</p><p style="text-align: justify;"></p><p style="text-align: justify;"><em>Appendix: Most influential AI papers mentioned by Xie in the interview</em></p><blockquote><p>&#183; Sutton, R. S. (1991). <em>Dyna, an integrated architecture for learning, planning, and reacting</em>. <strong>(Dyna)</strong></p><p>&#183; LeCun, Y., Bottou, L., Bengio, Y., &amp; Haffner, P. (1998). <em>Gradient-based learning applied to document recognition</em>. <strong>(LeNet)</strong></p><p>&#183; Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., &amp; Fei-Fei, L. (2009). <em>ImageNet: A large-scale hierarchical image database</em>. <strong>(ImageNet)</strong></p><p>&#183; Krizhevsky, A., Sutskever, I., &amp; Hinton, G. E. (2012). <em>ImageNet classification with deep convolutional neural networks</em>. 
<strong>(AlexNet)</strong></p><p>&#183; Goodfellow, I., et al. (2014). <em>Generative adversarial nets</em>. <strong>(GAN)</strong></p><p>&#183; Girshick, R., Donahue, J., Darrell, T., &amp; Malik, J. (2014). <em>Rich feature hierarchies for accurate object detection and semantic segmentation</em>. <strong>(R-CNN)</strong></p><p>&#183; Ren, S., He, K., Girshick, R., &amp; Sun, J. (2015). <em>Faster R-CNN: Towards real-time object detection with region proposal networks</em>. <strong>(Faster R-CNN)</strong></p><p>&#183; He, K., Zhang, X., Ren, S., &amp; Sun, J. (2016). <em>Deep residual learning for image recognition</em>. <strong>(ResNet)</strong></p><p>&#183; Vaswani, A., et al. (2017). <em>Attention is all you need</em>. <strong>(Transformer)</strong></p><p>&#183; Devlin, J., Chang, M.-W., Lee, K., &amp; Toutanova, K. (2019). <em>BERT: Pre-training of deep bidirectional transformers for language understanding</em>. <strong>(BERT)</strong></p><p>&#183; Ho, J., Jain, A., &amp; Abbeel, P. (2020). <em>Denoising diffusion probabilistic models</em>. <strong>(DDPM)</strong></p><p>&#183; Brown, T. B., et al. (2020). <em>Language models are few-shot learners</em>. <strong>(GPT-3)</strong></p><p>&#183; Mildenhall, B., et al. (2020). <em>NeRF: Representing scenes as neural radiance fields for view synthesis</em>. <strong>(NeRF)</strong></p><p>&#183; Dosovitskiy, A., et al. (2021). <em>An image is worth 16x16 words: Transformers for image recognition at scale</em>. <strong>(ViT)</strong></p><p>&#183; Radford, A., et al. (2021). <em>Learning transferable visual models from natural language supervision</em>. <strong>(CLIP)</strong></p><p>&#183; Rombach, R., Blattmann, A., Lorenz, D., Esser, P., &amp; Ommer, B. (2022). <em>High-resolution image synthesis with latent diffusion models</em>. <strong>(LDM)</strong></p><p>&#183; Kerbl, B., Kopanas, G., Leimk&#252;hler, T., &amp; Drettakis, G. (2023). <em>3D Gaussian splatting for real-time radiance field rendering</em>. 
<strong>(3DGS)</strong></p></blockquote>]]></content:encoded></item><item><title><![CDATA[Overheard as a Chinese Employee Working in Western Think Tanks Looking at China]]></title><description><![CDATA[Last summer marked the end of a six-year chapter: working for Western think tanks as a China programme specialist.]]></description><link>https://www.lincongfieldnotes.com/p/overheard-as-a-chinese-employee-working</link><guid isPermaLink="false">https://www.lincongfieldnotes.com/p/overheard-as-a-chinese-employee-working</guid><dc:creator><![CDATA[Lin Cong]]></dc:creator><pubDate>Wed, 04 Mar 2026 21:38:43 GMT</pubDate><enclosure url="https://substackcdn.com/image/fetch/$s_!9QCS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef9dc0cf-7bfa-4833-ba37-8a9225487d27_1536x1024.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!9QCS!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef9dc0cf-7bfa-4833-ba37-8a9225487d27_1536x1024.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!9QCS!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef9dc0cf-7bfa-4833-ba37-8a9225487d27_1536x1024.png 424w, https://substackcdn.com/image/fetch/$s_!9QCS!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef9dc0cf-7bfa-4833-ba37-8a9225487d27_1536x1024.png 848w, 
https://substackcdn.com/image/fetch/$s_!9QCS!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef9dc0cf-7bfa-4833-ba37-8a9225487d27_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!9QCS!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef9dc0cf-7bfa-4833-ba37-8a9225487d27_1536x1024.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!9QCS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef9dc0cf-7bfa-4833-ba37-8a9225487d27_1536x1024.png" width="1456" height="971" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ef9dc0cf-7bfa-4833-ba37-8a9225487d27_1536x1024.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:971,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:2360903,&quot;alt&quot;:null,&quot;title&quot;:null,&quot;type&quot;:&quot;image/png&quot;,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:&quot;https://www.lincongfieldnotes.com/i/189918616?img=https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef9dc0cf-7bfa-4833-ba37-8a9225487d27_1536x1024.png&quot;,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" srcset="https://substackcdn.com/image/fetch/$s_!9QCS!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef9dc0cf-7bfa-4833-ba37-8a9225487d27_1536x1024.png 424w, 
https://substackcdn.com/image/fetch/$s_!9QCS!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef9dc0cf-7bfa-4833-ba37-8a9225487d27_1536x1024.png 848w, https://substackcdn.com/image/fetch/$s_!9QCS!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef9dc0cf-7bfa-4833-ba37-8a9225487d27_1536x1024.png 1272w, https://substackcdn.com/image/fetch/$s_!9QCS!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fef9dc0cf-7bfa-4833-ba37-8a9225487d27_1536x1024.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" 
y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p style="text-align: justify;">Last summer marked the end of a six-year chapter: working for Western think tanks as a China programme specialist. I studied international affairs at a policy school in the US and built my career around one question: how to narrow the gap between China and the world, without flattening either side into caricature. My CV is full of organisations with &#8220;Asia&#8221; or &#8220;China&#8221; in their names.</p><p style="text-align: justify;">A few months have passed since I left my last role at Asia House in London. This is my attempt to share what I overheard on that lonely yet rewarding path, working at the intersection of China and policy.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.lincongfieldnotes.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Lin Cong Fieldnotes is a reader-supported publication. To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p style="text-align: justify;">Often, I found myself caught between competing interests. Often, I had to navigate prejudice from peers and, at times, colleagues. 
Often, as the only Chinese person in the room, I envied the solidarity I saw in the adjacent worlds of journalism, law, finance, and so on.</p><p style="text-align: justify;">So what DID I overhear?</p><div><hr></div><h3>Security risks</h3><p style="text-align: justify;">The UK has been repeatedly unsettled by China-related espionage allegations since the previous administration. From the &#8220;two Christophers&#8221; cases to the latest reporting involving the spouse of a Labour MP, it is hardly surprising that anyone with China-based ties or China-facing work is quickly pulled into what can feel like &#8220;<a href="https://substack.com/home/post/p-176283090">spy territory</a>&#8221;, against a backdrop of heightened national security scrutiny.</p><p style="text-align: justify;">A friend once suggested an analogy with insider trading, not because the offences are the same, but because both turn on how societies police sensitive information and proximity. Imagine you have a partner who works in finance and shares an offhand view about the markets over dinner: have they crossed a legal line, or simply talked about their day? Similarly, working close to foreign governments, let alone the &#8220;enemy&#8221; ones, can feel like walking an even tighter rope given the high stakes. You can be as meticulous as you want about what you say, but trust can remain very fragile. I have crossed paths professionally with people later described as suspected spies, without any awareness at the time. What I remember most is not recklessness, but their restraint and carefulness. It brings to mind a darker logic familiar from Xi&#8217;s anti-corruption campaign: in any system, few are spotless under a microscope. The difference is who gets examined, when, and to what end.</p><p style="text-align: justify;">To make this less abstract, I&#8217;ve worked in foreign think tanks inside China (RIP Carnegie-Tsinghua Center), where scrutiny feels immediate and personal. 
Local staff at foreign organisations are always on the radar and are sometimes &#8220;invited to tea&#8221;, a euphemism for an interview with public security. What stayed with me was the speed with which surveillance could become intervention. After a brief phone call about an incoming delegation, I received a follow-up call asking, in detail, what I was working on and who I would meet. It is not uncommon for local staff to feel caught between official pressure and a basic professional duty to protect the organisation&#8217;s privacy.</p><p style="text-align: justify;">Yet suspicion also runs in the other direction. More than once, I waited outside foreign embassies in Beijing with the Chinese driver while my expat colleagues went in, a small ritual that made the hierarchy of trust visible. Learning to move between these systems with empathy, discretion, and respect for real constraints became one of the most valuable lessons I took from working inside foreign policy institutions.</p><div><hr></div><h3>Identity and positioning</h3><p style="text-align: justify;">When I moved to London in early 2023, I was struck by how limited the city&#8217;s &#8220;China capacity&#8221; felt compared with Washington. At the risk of minimising my credentials, I often became the &#8220;China expert&#8221; by default, simply because of how I look. The label could be useful, but it also narrowed me, turning a whole career into a single identity tag.</p><p style="text-align: justify;">At an event sponsored by a Chinese company at the think tank where I was working, an attendee questioned whether I, the only Chinese employee, had influenced the event&#8217;s &#8220;friendly&#8221; take on China, even though I hadn&#8217;t been involved in running the programme. 
Another time, when a journalist was turned away because the venue was over capacity, they asked to speak to me specifically &#8212; &#8220;the Chinese staff&#8221; &#8212; assuming my nationality had something to do with the decision. Moments like that reminded me that my expertise and my identity were entangled in how I was perceived professionally: I often felt like both a bridge between cultures and an outsider within them.</p><p style="text-align: justify;">In a more extreme incident, a Westminster special adviser once joked to my face that I had been sent by Beijing as a &#8220;honey trap&#8221;. Even though the claim was clearly untrue, I was shaken in the moment, unsure what I could say that would not make things worse. There remains a strain of paranoia in parts of the China rhetoric abroad, shaped by history and politics that reinforce each other. I feel for the people whose genuine efforts to build relationships and support constructive engagement are recast as something malign.</p><div><hr></div><h3>China through the lens of DC and London</h3><p style="text-align: justify;">Living in San Francisco at the moment, I have been struck by the mood of Chinamaxxing and &#8220;you met me at a very Chinese time in my life&#8221;. People pass around books like Dan Wang&#8217;s <em>Breakneck</em>, trade jokes about Trump admiring Xi&#8217;s authority, and scroll TikToks that turn China&#8217;s infrastructure into a kind of abundance spectacle.</p><p style="text-align: justify;">London feels different. Perhaps it&#8217;s the city&#8217;s colonial legacy that still lends certain circles an old-boy-club air, a faint nostalgia for its historic connection to China through its dominance of Hong Kong. I&#8217;ve met retired UK business executives who fondly recall the bicycle-filled streets of 1990s Beijing yet show little curiosity about the electric vehicles that have replaced them. 
Still, London brings together a wide range of voices, though fewer with recent on-the-ground experience in political and policy circles. In <em><a href="https://www.thechinaweek.com/p/very-british-china-problems">Very British China problems</a></em>, Jeremy Goldkorn argues that the UK is stuck in a weird position. It wants to stay on good terms with the United States and needs trade and investment after Brexit. That leaves Britain dependent on China, which now holds far more leverage. Chinese voices have made it plain that the UK no longer carries the weight it once did and can be replaced in China&#8217;s wider strategy.</p><div><hr></div><h3>What matters</h3><p style="text-align: justify;">I often get messages from students asking for advice on pursuing a career in China policy studies. Here is what I think would help if I were starting again.</p><ol><li><p style="text-align: justify;"><strong>Stay current</strong><br>Pick two or three reliable newsletters and stick with them. Add sector-specific reads that match your interests: energy, climate, finance, tech.</p></li><li><p style="text-align: justify;"><strong>Learn how policy is made</strong><br>Surprisingly few people, even in the field, can explain it clearly. Learn how bills move through committees, appropriations, and conference stages in the jurisdiction that interests you. Learn how agencies draft and finalise regulations. Then practise explaining those processes out loud.</p></li><li><p style="text-align: justify;"><strong>Master your specialisation</strong><br>&#8220;Proximity to power deludes some into thinking they wield it.&#8221; Frank Underwood was right. Being in the room is not the same as being heard, but you can still learn from everything. Pair China expertise with another skillset: data, energy, climate, tech, transport. Find a topic that keeps you up at night and make it yours.</p></li></ol><p style="text-align: justify;">What has your experience been like working in policy? 
Which place shaped your view of China most? Any moments that changed how you see the whole China-watching world? I would love to hear them.</p><p style="text-align: justify;">[I would like to thank my two Substack spirit animals, <span class="mention-wrap" data-attrs="{&quot;name&quot;:&quot;afra&quot;,&quot;id&quot;:2227115,&quot;type&quot;:&quot;user&quot;,&quot;url&quot;:null,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!p8sZ!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9e7c3c6d-a2e3-412d-b2b6-e62097d444af_800x800.png&quot;,&quot;uuid&quot;:&quot;0a1bb48a-4145-465e-a9df-8107a471a6f4&quot;}" data-component-name="MentionToDOM"></span> and <span class="mention-wrap" data-attrs="{&quot;name&quot;:&quot;Yaling Jiang&quot;,&quot;id&quot;:37186412,&quot;type&quot;:&quot;user&quot;,&quot;url&quot;:null,&quot;photo_url&quot;:&quot;https://substackcdn.com/image/fetch/$s_!dV_Q!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1271277b-8fff-43b9-9ab0-77f560a0452a_596x596.png&quot;,&quot;uuid&quot;:&quot;da2fdfec-971f-4815-a571-3ac90f31e78f&quot;}" data-component-name="MentionToDOM"></span> for being the star examples who encouraged me to start writing. And to the mentors who guided me through deep waters: Jan Berris, Paul Haenle, Dan Russell, Michael Hart, and Geoffrey Jackson, thank you for showing me what diplomacy looks like in practice.]</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.lincongfieldnotes.com/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Lin Cong Fieldnotes is a reader-supported publication. 
To receive new posts and support my work, consider becoming a free or paid subscriber.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item></channel></rss>