Update README.md

Bohan Hou 2025-02-04 10:21:00 +08:00 committed by GitHub
parent b79b4e9a66
commit bdee82da8b
No known key found for this signature in database
GPG Key ID: B5690EEEBB952194
1 changed file with 3 additions and 4 deletions

@@ -30,8 +30,7 @@
<li><a href="#4dv">3.5.3 4D Vision - 四维视觉</a></li>
</ul>
</li>
-<li>><a href="#mm">3.6 Multimodal Models - 多模态模型</a>
-</li>
+<li><a href="#mm"> 3.6 Multimodal Models - 多模态模型</a></li>
<li><a href="#embodied-ai-4-x">3.7 Embodied AI for X - 具身智能+X</a>
<ul>
<li><a href="#medical">3.7.1 Embodied AI for Healthcare - 具身智能+医疗</a></li>
@@ -260,8 +259,8 @@ CS231n (Stanford computer vision course): [website](https://cs231n.stanford.edu/s
## 3.6 Multimodal Models - 多模态模型
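> Multimodal learning aims to unify representations of information from different modalities. In embodied AI, an agent must handle information from several modalities at once, such as visual input perceived by the machine and guidance given in human natural language, so multimodal techniques are becoming increasingly important.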
-* The most classic work, CLIP [Zhihu](https://zhuanlan.zhihu.com/p/493489688)<br>
-* A classic work on multimodal large language models, LLaVA[website](https://llava-vl.github.io/)<br>
+* The most classic work, CLIP: [Zhihu](https://zhuanlan.zhihu.com/p/493489688)<br>
+* A classic work on multimodal large language models, LLaVA: [website](https://llava-vl.github.io/)<br>
<section id="embodied-ai-4-x"></section>
## 3.7 Embodied AI for X - 具身智能+X