<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:atom="http://www.w3.org/2005/Atom" xmlns:sy="http://purl.org/rss/1.0/modules/syndication/" xmlns:media="http://search.yahoo.com/mrss/"><channel><title>Jepa on k4i's blog</title><link>https://k4i.top/tags/jepa/</link><description>Recent content in Jepa on k4i's blog</description><generator>Hugo -- gohugo.io</generator><language>en</language><managingEditor>sky_io@outlook.com (K4i)</managingEditor><webMaster>sky_io@outlook.com (K4i)</webMaster><copyright>All content is subject to the license of &lt;a rel="license noopener" href="https://creativecommons.org/licenses/by-nc-sa/4.0/" target="_blank"&gt;CC BY-NC-SA 4.0&lt;/a&gt; .</copyright><lastBuildDate>Thu, 18 Jun 2026 10:00:00 +0800</lastBuildDate><atom:link href="https://k4i.top/tags/jepa/index.xml" rel="self" type="application/rss+xml"/><item><title>Three Routes For Embodied Models: VLA, World Models, And WAM</title><link>https://k4i.top/posts/embodied-models-vla-jepa-wam/</link><pubDate>Thu, 18 Jun 2026 10:00:00 +0800</pubDate><author>sky_io@outlook.com (K4i)</author><atom:modified>Thu, 18 Jun 2026 10:00:00 +0800</atom:modified><guid>https://k4i.top/posts/embodied-models-vla-jepa-wam/</guid><description>&lt;p&gt;If a language model only has to answer with text, an embodied model has to answer one extra question: &lt;strong&gt;what should this sentence become as an action?&lt;/strong&gt;&lt;/p&gt;
&lt;p&gt;Suppose you tell a tabletop robot: “push the red cup next to the plate.” The model must identify the cup, understand “next to,” decide how the arm should move, close or release the gripper at the right moment, and recover if the cup slips. The hard part is not multimodality alone. It is the closed loop between language, vision, physical state, and continuous action: the action changes the world, and the new world changes the next action.&lt;/p&gt;</description><dc:creator>K4i</dc:creator><media:content url="https://k4i.top//images/posts/embodied-models-vla-jepa-wam/embodied-models-cover.svg" medium="image"><media:title type="html">featured image</media:title></media:content><category>embodied-ai</category><category>robotics</category><category>vla</category><category>world-model</category><category>jepa</category><category>notes</category></item></channel></rss>