- 🏫 A CS Ph.D. student at the University of Rochester (UR).
- 🎓 Obtained bachelor's degree from the Southern University of Science and Technology (SUSTech).
- 🎯 My research interests include Multimodal Learning, especially Video Understanding & Generation.
- 🎮 Welcome to my personal homepage: yunlong10.github.io
- 😜
YOLO
is a soramimi/mondegreen forYunlong
. Similarly, inyunlong10
: Tang → Ten → 10.
🕹️
Focusing
Ph.D. Student @ UR CS
-
University of Rochester
- Rochester, NY
-
01:30
(UTC -05:00) - https://yunlong10.github.io
- in/yunlong-yolo-tang
Pinned Loading
-
Awesome-LLMs-for-Video-Understanding
Awesome-LLMs-for-Video-Understanding Public🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
-
ttengwang/Caption-Anything
ttengwang/Caption-Anything PublicCaption-Anything is a versatile tool combining image segmentation, visual captioning, and ChatGPT, generating tailored captions with diverse controls for user preferences. https://huggingface.co/sp…
-
-
zjr2000/LLMVA-GEBC
zjr2000/LLMVA-GEBC PublicWinner solution to Generic Event Boundary Captioning task in LOVEU Challenge (CVPR 2023 workshop)
-
LaunchpadGPT
LaunchpadGPT PublicRepo for ICMC 2023 paper: LaunchpadGPT: Language Model as Music Visualization Designer on Launchpad
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.