<?xml version="1.0" encoding="UTF-8"?><rss version="2.0" xmlns:content="http://purl.org/rss/1.0/modules/content/"><channel><title>SOTA&apos;s Seminar</title><description>SOTA&apos;s Seminar</description><link>https://verse-astro.vercel.app/</link><language>en-US</language><item><title>Native unified model</title><link>https://verse-astro.vercel.app/posts/Weiyang%20Jin/</link><guid isPermaLink="true">https://verse-astro.vercel.app/posts/Weiyang%20Jin/</guid><description>Think about the pros and cons of native mllm and SD backone from the perspective of the current strongest closed-source model.</description><pubDate>Sat, 05 Apr 2025 00:00:00 GMT</pubDate><content:encoded>&lt;!-- ![](./images/3-31-ninggao.png) --&gt;

**Talker**: [Weiyang Jin](https://github.com/WayneJin0918)

**Bio**: Weiyang Jin is now the intern at New York University advised by Prof. Saining Xie. His research interests are about MLLMs/VLMs and visual representation learning

**Key note**: 
- Introdues the defination of native MLLMs and current works.
- Analyze the possible architecture of the current strongest closed-source model.
- Introducing insights from recent papers about revealing the model principle.
- Discuss the pros and cons of both systems and possible future development directions.

[Bilibili Link](https://b23.tv/sR9g9co)</content:encoded></item><item><title>Dual system embodied intelligence</title><link>https://verse-astro.vercel.app/posts/Ning%20Gao/</link><guid isPermaLink="true">https://verse-astro.vercel.app/posts/Ning%20Gao/</guid><description>Analyze model applications of single and dual systems. Discuss the differences between work in contemporary periods such as Gemini-robotics and GR00T.</description><pubDate>Mon, 31 Mar 2025 00:00:00 GMT</pubDate><content:encoded>&lt;!-- ![](./images/3-31-ninggao.png) --&gt;

**Talker**: [Ning Gao](https://axi404.top/)

**Bio**: Ning Gao is a 3rd year student from XJTU. He is conducting a scientific internship in the Embodied Intelligence Center of Shanghai Artificial Intelligence Laboratory (SHAILAB), and is engaged in research in the field of Embodied Intelligence Manipulation

**Key note**: 
- Paper sharing and discussion on the dual-system model in Gemini-Robotics, HiRobot, and other relevant frameworks.
- Provide a personal definition of the dual-system framework in embodied AI.
- Comparative analysis of end-to-end, dual-system, and prompt-based models. Highlight the advantages and limitations of each approach in embodied tasks.
- Define the function in VLA and VLM for embodied system
- Discussion future trend in embodied AI</content:encoded></item><item><title>Proof (LLM) robustness is to solve the 0-1 backpack problem</title><link>https://verse-astro.vercel.app/posts/Huanran%20Chen/</link><guid isPermaLink="true">https://verse-astro.vercel.app/posts/Huanran%20Chen/</guid><description>Explore the robustness of autoregressive models and diffusion models, and think about the analysis of the upper and lower boundaries of robustness and its applications.</description><pubDate>Sat, 29 Mar 2025 00:00:00 GMT</pubDate><content:encoded>&lt;!-- ![](./images/3-29-huanran.png) --&gt;

**Talker**: [Huanran Chen](https://huanranchen.github.io/)

**Bio**: Huanran Chen is a PhD student from TSAIL (Fall 2025), advised by Prof. Jun Zhu, and closely collaborate with Prof. [Yinpeng Dong]. He has a keen interest in the physics of machine learning. His unattainable yet motivating dream is to elevate AI to the realm of science, making every phenomenon explainable and predictable.

**Key note**: 
- What is worst-case robustness?
- Why worst-case robustness?
- Upper bounding worst-case robustness
- Lower bounding worst-case robustness (knapsack problem)
- Specific application to text domain
- Theoretical insights (e.g., diffusion&gt;MaskGen&gt;ARM)
- Applications

[slides](https://drive.google.com/file/d/19QrJOxkKdknQkQiKYeoZmYVP5C4OGCKO/view)</content:encoded></item></channel></rss>