I'm Yin Xie (yiyexy), a deep learning algorithm engineer at DeepGlint specializing in computer vision, large-scale vision-language models, model compression and acceleration, and distributed training. My recent work focuses on visual representation learning, end-to-end facial feature pretraining, and pretraining techniques for vision-language models, with several papers published at top conferences and active contributions to open-source projects. Open to collaboration and discussion!
Pinned
- deepglint/Victor (Public): ViCToR: Improving Visual Comprehension via Token Reconstruction for Pretraining LMMs