🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
-
Updated
Jan 23, 2024 - Python
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
a state-of-the-art-level open visual language model | 多模态预训练模型
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
Codebase for ECCV18 "The Sound of Pixels"
[CVPR2023] The code for 《Position-guided Text Prompt for Vision-Language Pre-training》
[CVPR 2023] Diverse Embedding Expansion Network and Low-Light Cross-Modality Benchmark for Visible-Infrared Person Re-identification
[CVPR-2024] The First High Definition (HD) Event based Visual Object Tracking Benchmark Dataset
PyTorch implementation of the paper "Semantically Tied Paired Cycle Consistency for Zero-Shot Sketch-based Image Retrieval", CVPR 2019.
Co-Separating Sounds of Visual Objects (ICCV 2019)
Demo code for visible thermal (cross-modality) person re-identification
CM-NAS: Cross-Modality Neural Architecture Search for Visible-Infrared Person Re-Identification (ICCV2021)
[CVPR2024]Day-Night Cross-domain Vehicle Re-identification
[IJCAI 2025] Official implementation of "T2S: High-resolution Time Series Generation with Text-to-Series Diffusion Models"
A New Strong and Simple Baseline Method for VI-ReID (Bridging the Gap: Multi-level Cross-modality Joint Alignment for Visible-infrared Person Re-identification)
Pytorch code for Towards a Unified Middle Modality Learning for Visible-Infrared Person Re-Identification
[ICME 2024] VRHCF: Cross-Source Point Cloud Registration via Voxel Representation and Hierarchical Correspondence Filtering
An alternative EQA paradigm and informative benchmark + models (BMVC 2019, ViGIL 2019 spotlight)
Code for the WWW'20 paper "Nowhere to Hide: Cross-modal Identity Leakage between Biometrics and Devices"
[WACV2024] HalluciDet: Hallucinating RGB Modality for Person Detection Through Privileged Information (Accepted at WACV 2024 and LatinX@CVPR2024 Extended Abstract)
[AAAI 2022] Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics.
Add a description, image, and links to the cross-modality topic page so that developers can more easily learn about it.
To associate your repository with the cross-modality topic, visit your repo's landing page and select "manage topics."