Qihang Yu

Orcid: 0009-0009-4685-3598

According to our database, Qihang Yu authored at least 50 papers between 2015 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
A Simple Video Segmenter by Tracking Objects Along Axial Trajectories.
Trans. Mach. Learn. Res., 2024

TransUNet: Rethinking the U-Net architecture design for medical image segmentation through the lens of transformers.
Medical Image Anal., 2024

FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching.
CoRR, 2024

Randomized Autoregressive Visual Generation.
CoRR, 2024

MaskBit: Embedding-free Image Generation via Bit Tokens.
CoRR, 2024

Alleviating Distortion in Image Generation via Multi-Resolution Diffusion Models.
CoRR, 2024

An Image is Worth 32 Tokens for Reconstruction and Generation.
CoRR, 2024

Enhancing Temporal Consistency in Video Editing by Reconstructing Videos with 3D Gaussian Splatting.
CoRR, 2024

Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

GaussianTalker: Speaker-specific Talking Head Synthesis via 3D Gaussian Splatting.
Proceedings of the 32nd ACM International Conference on Multimedia, MM 2024, Melbourne, VIC, Australia, 28 October 2024, 2024

An intelligent detection method for assembly based on multi-model cascade.
Proceedings of the 2024 16th International Conference on Machine Learning and Computing, 2024

Towards Open-Ended Visual Recognition with Large Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

COCONut: Modernizing COCO Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

ViTamin: Designing Scalable Vision Models in the Vision-Language Era.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
MaXTron: Mask Transformer with Trajectory Attention for Video Panoptic Segmentation.
CoRR, 2023

Towards Open-Ended Visual Recognition with Large Language Model.
CoRR, 2023

3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers.
CoRR, 2023

A Study of Autoregressive Decoders for Multi-Tasking in Computer Vision.
CoRR, 2023

Towards a Single Unified Model for Effective Detection, Segmentation, and Diagnosis of Eight Major Cancers Using a Large Collection of CT Scans.
CoRR, 2023

Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

ReMaX: Relaxing for Better Training on Efficient Panoptic Segmentation.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

MOAT: Alternating Mobile Convolution and Attention Brings Strong Vision Models.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

CancerUniT: Towards a Single Unified Model for Effective Detection, Segmentation, and Diagnosis of Eight Major Cancers Using a Large Collection of CT Scans.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Compositor: Bottom-Up Clustering and Compositing for Robust Part and Object Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
k-means Mask Transformer.
Proceedings of the Computer Vision - ECCV 2022, 2022

PartImageNet: A Large, High-Quality Dataset of Parts.
Proceedings of the Computer Vision - ECCV 2022, 2022

CMT-DeepLab: Clustering Mask Transformers for Panoptic Segmentation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

TubeFormer-DeepLab: Video Mask Transformer.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
DeepLab2: A TensorFlow Library for Deep Labeling.
CoRR, 2021

TransUNet: Transformers Make Strong Encoders for Medical Image Segmentation.
CoRR, 2021

Glance-and-Gaze Vision Transformer.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Shape-Texture Debiased Neural Network Training.
Proceedings of the 9th International Conference on Learning Representations, 2021

Mask Guided Matting via Progressive Refinement Network.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Networks.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
Recurrent Saliency Transformation Network for Tiny Target Segmentation in Abdominal CT Scans.
IEEE Trans. Medical Imaging, 2020

Can Temporal Information Help with Contrastive Self-Supervised Learning?
CoRR, 2020

CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Network.
CoRR, 2020

Detecting Pancreatic Adenocarcinoma in Multi-phase CT Scans via Alignment Ensemble.
CoRR, 2020

Detecting Pancreatic Ductal Adenocarcinoma in Multi-phase CT Scans via Alignment Ensemble.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020

C2FNAS: Coarse-to-Fine Neural Architecture Search for 3D Medical Image Segmentation.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Neural Architecture Search for Lightweight Non-Local Networks.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

When Radiology Report Generation Meets Knowledge Graph.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
Thickened 2D Networks for 3D Medical Image Segmentation.
CoRR, 2019

2D-Based Coarse-to-Fine Approaches for Small Target Segmentation in Abdominal CT Scans.
Proceedings of the Deep Learning and Convolutional Neural Networks for Medical Imaging and Clinical Informatics, 2019

2018
Recurrent Saliency Transformation Network: Incorporating Multi-Stage Visual Cues for Small Organ Segmentation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

2017
Saliency Transformation Network: Incorporating Multi-stage Visual Cues for Pancreas Segmentation.
CoRR, 2017

2016
A reconfigurable parallel FPGA accelerator for the adapt-then-combine diffusion LMS algorithm.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2016

2015
A reconfigurable parallel FPGA accelerator for the kernel affine projection algorithm.
Proceedings of the 2015 IEEE International Conference on Digital Signal Processing, 2015

A real-time permutation entropy computation for EEG signals.
Proceedings of the 20th Asia and South Pacific Design Automation Conference, 2015

A 128-way FPGA platform for the acceleration of KLMS algorithm.
Proceedings of the 20th Asia and South Pacific Design Automation Conference, 2015

