https://github.com/jjcao-school/cv/
Focus on the real problems
- embodied intelligence: https://www.vincentsitzmann.com/blog/bitter_lesson_of_cv/
- In the past, learning perception-action loops directly was intractable.
- computer vision: map images to intermediate representations for perception.
- robot learning and control: ingest these specific representations—point clouds, bounding boxes, and masks—and map them to actions.
- This factorization was a necessary compromise for the time.
- The historical boundaries between computer vision, robot learning, and control will dissolve. Frontier research will no longer draw a boundary between “seeing” and “learning to act.”
- Tesla;
- human-machine interface
- What about engineering tasks such as architecture, CAD, or manufacturing? Surely, we need explicit 3D representations to build a house or 3D-print an engine part.
- tasks just for perception: Digital cultural relics, Digital watermark, …
- In the very long run, also should be done end-to-end.
https://hanlab.mit.edu/courses/2023-fall-65940 好课。
这项研究的作者共有四位, 其中一位是深度强化学习大牛、UC 伯克利教授 Pieter Abbeel 。Abbeel 在业余时间还出了很多课程,其中 Intro to AI 课程在 edX 上吸引了 10 万多名学生学习,他的深度强化学习和深度无监督学习教材是 AI 研究者的经典学习资料,包括 CS294-158(Deep Unsupervised Learning)、CS188(Introduction to Artificial Intelligence)、CS287(Advanced Robotics)等。
Research report topics - 26
01_basics
02_features and matching
03_machine_Learning
04_Recognition_Detection
05_motion & tracking
06_Sensors
07 Segmentation
08 Object Detection
09 Anomaly detection & FOD