BiFormer: Vision transformer with bi-level routing attention. CVPR 2023. [code]
Vision transformer with progressive sampling. ICCV 2021. [code]
Data-driven neuron allocation for scale aggregation networks. CVPR 2019. [paper] [github] [blog (in Chinese)] Learnable scale-adaptive building blocks (an alternative to Residual Blocks) for deep convolutional neural networks.
Scale-equalizing pyramid convolution for object detection. CVPR 2020. [blog (in Chinese)] [code] Pyramid convolution (a kind of 3D convolution) that extracts multiscale features, designed as a better alternative to Feature Pyramid Networks (FPN).
Fashion retrieval via graph reasoning networks on a similarity pyramid. TPAMI 2023. [doi]
Fashion retrieval via graph reasoning networks on a similarity pyramid. ICCV 2019. (Oral presentation)
Learning local similarity with spatial relations for object retrieval. ACM MM 2019.
Aggregated deep feature from activation clusters for particular object retrieval. ACM MM Thematic Workshops 2017.
Copyright © 2023 Wayne Zhang