발행물
컨퍼런스
IEEE/CVF Conference on Computer Vision and Pattern Recognition
2022
,
MSTR: Mutli-Scale Transformer for End-to-End Human-Object Interaction Detection
Video-Text Representation Learning via Differentiable Weak Temporal Alignment
Annual Meeting of the Association for Computational Linguistics
Hypergraph Transformer: Weakly-supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering
Findings of the Association for Computational Linguistics: EMNLP
2021
Semantic Alignment with Calibrated Similarity for Multilingual Sentence Embedding
HOTR: End-to-End Human-Object Interaction Detection with Transformers