Zhaowen Wang

Orcid: 0000-0002-8475-850X

According to our database1, Zhaowen Wang authored at least 117 papers between 2008 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Generating chord progression from melody with flexible harmonic rhythm and controllable harmonic density.
EURASIP J. Audio Speech Music. Process., December, 2024

Improving Diffusion Models for Scene Text Editing with Dual Encoders.
Trans. Mach. Learn. Res., 2024

A 64 Gb/s NRZ O-Band Ring Modulator with 3.2 THz FSR for DWDM Applications.
Proceedings of the Optical Fiber Communications Conference and Exhibition, 2024

Multi-modal Video Topic Segmentation with Dual-Contrastive Domain Adaptation.
Proceedings of the MultiMedia Modeling - 30th International Conference, 2024

A Digital Pre-Distortion Technique for High-Linearity, Low-Power, Compact, Phase Interpolators.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2024

PB-DVAE: A Performance Bottleneck Location Model Based On Deep Variational Autoencoder.
Proceedings of the International Joint Conference on Neural Networks, 2024

RegGPT: A Tool for Cross-Domain Service Regulation Language Conversion.
Proceedings of the IEEE International Conference on Web Services, 2024

ICDAR 2024 Competition on Artistic Text Recognition.
Proceedings of the Document Analysis and Recognition - ICDAR 2024 - 18th International Conference, Athens, Greece, August 30, 2024

WAS: Dataset and Methods for Artistic Text Segmentation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Visual Layout Composer: Image-Vector Dual Diffusion Model for Design Layout Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Scaling Up Video Summarization Pretraining with Large Language Models.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
A Very High Linearity Twin Phase Interpolator With a Low-Noise and Wideband Delta Quadrature DLL for High-Speed Data Link Clocking.
IEEE J. Solid State Circuits, 2023

LiveSeg: Unsupervised Multimodal Temporal Segmentation of Long Livestream Videos.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

MuGeVI: A Multi-Functional Gesture-Controlled Virtual Instrument.
Proceedings of the 23rd International Conference on New Interfaces for Musical Expression, 2023

Moment Detection in Long Tutorial Videos.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

DualVector: Unsupervised Vector Font Synthesis with Dual-Part Representation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SVGformer: Representation Learning for Continuous Vector Graphics using Transformers.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Align and Attend: Multimodal Summarization with Dual Contrastive Losses.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

SCCS: Semantics-Consistent Cross-domain Summarization via Optimal Transport Alignment.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

Layout Representation Learning with Spatial and Structural Hierarchies.
Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence, 2023

2022
Analysis of Injection-Locked Ring Oscillators for Quadrature Clock Generation in Wireline or Optical Transceivers.
IEEE Trans. Circuits Syst. I Regul. Pap., 2022

Privacy-Preserving Deep Action Recognition: An Adversarial Learning Framework and A New Dataset.
IEEE Trans. Pattern Anal. Mach. Intell., 2022

Multi-Phase Clock Generation for Phase Interpolation With a Multi-Phase, Injection-Locked Ring Oscillator and a Quadrature DLL.
IEEE J. Solid State Circuits, 2022

Semantics-Consistent Cross-domain Summarization via Optimal Transport Alignment.
CoRR, 2022

MHMS: Multimodal Hierarchical Multimedia Summarization.
CoRR, 2022

A 65nm CMOS, 3.5-to-11GHz, Less-Than-1.45LSB-INLpp, 7b Twin Phase Interpolator with a Wideband, Low-Noise Delta Quadrature Delay-Locked Loop for High-Speed Data Links.
Proceedings of the IEEE International Solid-State Circuits Conference, 2022

Automatic Chinese National Pentatonic Modes Recognition Using Convolutional Neural Network.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Toward Understanding WordArt: Corner-Guided Transformer for Scene Text Recognition.
Proceedings of the Computer Vision - ECCV 2022, 2022

Query-Aware Sequential Recommendation.
Proceedings of the 31st ACM International Conference on Information & Knowledge Management, 2022

2021
Black-Box Diagnosis and Calibration on GAN Intra-Mode Collapse: A Pilot Study.
ACM Trans. Multim. Comput. Commun. Appl., 2021

Melody Harmonization with Controllable Harmonic Rhythm.
CoRR, 2021

Font Completion and Manipulation by Cycling Between Multi-Modality Representations.
CoRR, 2021

STALP: Style Transfer with Auxiliary Limited Pairing.
Comput. Graph. Forum, 2021

A Multi-Implicit Neural Representation for Fonts.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

11.4 A High-Accuracy Multi-Phase Injection-Locked 8-Phase 7GHz Clock Generator in 65nm with 7b Phase Interpolators for High-Speed Data Links.
Proceedings of the IEEE International Solid-State Circuits Conference, 2021

Rethinking Text Segmentation: A Novel Dataset and a Text-Specific Refinement Approach.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

Locker: Locally Constrained Self-Attentive Sequential Recommendation.
Proceedings of the CIKM '21: The 30th ACM International Conference on Information and Knowledge Management, Virtual Event, Queensland, Australia, November 1, 2021

2020
Visual Font Pairing.
IEEE Trans. Multim., 2020

G-DARTS-A: Groups of Channel Parallel Sampling with Attention.
CoRR, 2020

A deep learning method based on an attention mechanism for wireless network traffic prediction.
Ad Hoc Networks, 2020

Texture Hallucination for Large-Factor Painting Super-Resolution.
Proceedings of the Computer Vision - ECCV 2020, 2020

Disentangled Image Generation for Unsupervised Domain Adaptation.
Proceedings of the Computer Vision - ECCV 2020 Workshops, 2020

Screencast Tutorial Video Understanding.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Spatial Class Distribution Shift in Unsupervised Domain Adaptation: Local Alignment Comes to Rescue.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

2019
Texture Hallucination for Large-Scale Painting Super-Resolution.
CoRR, 2019

Privacy-Preserving Deep Visual Recognition: An Adversarial Learning Framework and A New Dataset.
CoRR, 2019

Log2Intent: Towards Interpretable User Modeling via Recurrent Semantics Memory Unit.
Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2019

A Calibration-Free Triple-Loop Bang-Bang PLL Achieving 131fsrms Jitter and-70dBc Fractional Spurs.
Proceedings of the IEEE International Solid- State Circuits Conference, 2019

Adversarial Graph Embedding for Ensemble Clustering.
Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, 2019

Dance Dance Generation: Motion Transfer for Internet Videos.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision Workshops, 2019

An Internal Learning Approach to Video Inpainting.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Multimodal Style Transfer via Graph Cuts.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Controllable Artistic Text Style Transfer via Shape-Matching GAN.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Large-Scale Tag-Based Font Retrieval With Generative Feature Learning.
Proceedings of the 2019 IEEE/CVF International Conference on Computer Vision, 2019

Image Super-Resolution by Neural Texture Transfer.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

2018
Perception-driven semi-structured boundary vectorization.
ACM Trans. Graph., 2018

Learning Temporal Dynamics for Video Super-Resolution: A Deep Learning Approach.
IEEE Trans. Image Process., 2018

Learning to Sketch with Deep Q Networks and Demonstrated Strokes.
CoRR, 2018

Wide Activation for Efficient and Accurate Image Super-Resolution.
CoRR, 2018

Reference-Conditioned Super-Resolution by Neural Texture Transfer.
CoRR, 2018

Speeding up Context-based Sentence Representation Learning with Non-autoregressive Convolutional Decoding.
Proceedings of The Third Workshop on Representation Learning for NLP, 2018

Brush stroke synthesis with a generative adversarial network driven by physically based simulation.
Proceedings of the 7th Joint Symposium on Computational Aesthetics, 2018

Towards Privacy-Preserving Visual Recognition via Adversarial Training: A Pilot Study.
Proceedings of the Computer Vision - ECCV 2018, 2018

Synthetically Supervised Feature Learning for Scene Text Recognition.
Proceedings of the Computer Vision - ECCV 2018, 2018

Flow-Grounded Spatial-Temporal Video Prediction from Still Images.
Proceedings of the Computer Vision - ECCV 2018, 2018

"Factual" or "Emotional": Stylized Image Captioning with Adaptive Learning and Attention.
Proceedings of the Computer Vision - ECCV 2018, 2018

Visually Indicated Sound Generation by Perceptually Optimized Classification.
Proceedings of the Computer Vision - ECCV 2018 Workshops, 2018

Visual to Sound: Generating Natural Sound for Videos in the Wild.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2018

Multi-Task Adversarial Network for Disentangled Feature Learning.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Re-Weighted Adversarial Adaptation Network for Unsupervised Domain Adaptation.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Multi-Content GAN for Few-Shot Font Style Transfer.
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018

Learning to Doodle with Stroke Demonstrations and Deep Q-Networks.
Proceedings of the British Machine Vision Conference 2018, 2018

2017
Exploring Asymmetric Encoder-Decoder Structure for Context-based Sentence Representation Learning.
CoRR, 2017

Trimming and Improving Skip-thought Vectors.
CoRR, 2017

Robust Lane Tracking with Multi-mode Observation Model and Particle Filtering.
CoRR, 2017

AMC: Attention guided Multi-modal Correlation Learning for Image Search.
CoRR, 2017

Photometric Stabilization for Fast-forward Videos.
Comput. Graph. Forum, 2017

Rethinking Skip-thought: A Neighborhood based Approach.
Proceedings of the 2nd Workshop on Representation Learning for NLP, 2017

Universal Style Transfer via Feature Transforms.
Proceedings of the Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, 2017

Visually-Aware Fashion Recommendation and Design with Generative Image Models.
Proceedings of the 2017 IEEE International Conference on Data Mining, 2017

Robust Video Super-Resolution with Learned Temporal Dynamics.
Proceedings of the IEEE International Conference on Computer Vision, 2017

Diversified Texture Synthesis with Feed-Forward Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Temporal Domain Neural Encoder for Video Representation Learning.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2017

AMC: Attention Guided Multi-modal Correlation Learning for Image Search.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Robust Single Image Super-Resolution via Deep Networks With Sparse Prior.
IEEE Trans. Image Process., 2016

Vista: A Visually, Socially, and Temporally-aware Model for Artistic Recommendation.
Proceedings of the 10th ACM Conference on Recommender Systems, 2016

Image Captioning with Semantic Attention.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Learning a Mixture of Deep Networks for Single Image Super-Resolution.
Proceedings of the Computer Vision - ACCV 2016, 2016

Epitomic Image Super-Resolution.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016

2015
Learning Super-Resolution Jointly From External and Internal Examples.
IEEE Trans. Image Process., 2015

Deeply Improved Sparse Coding for Image Super-Resolution.
CoRR, 2015

Scalable Similarity Learning Using Large Margin Neighborhood Embedding.
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015

DeepFont: A System for Font Recognition and Similarity.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Deep Networks for Image Super-Resolution with Sparse Prior.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

Self-tuned deep super resolution.
Proceedings of the 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2015

Sparse Coding and its Applications in Computer Vision
WorldScientific, ISBN: 9789814725064, 2015

2014
Learning sparse representation for image signals
PhD thesis, 2014

Spatial-Spectral Classification of Hyperspectral Images Using Discriminative Dictionary Designed by Learning Vector Quantization.
IEEE Trans. Geosci. Remote. Sens., 2014

A joint perspective towards image super-resolution: Unifying external- and self-examples.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014

An ontological bagging approach for image classification of crowdsourced data.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo Workshops, 2014

Active Planning, Sensing, and Recognition Using a Resource-Constrained Discriminant POMDP.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2014

2013
Opportunistic sensing for object recognition - A unified formulation for dynamic sensor selection and feature extraction.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

A Max-Margin Perspective on Sparse Representation-Based Classification.
Proceedings of the IEEE International Conference on Computer Vision, 2013

Discriminative and compact dictionary design for Hyperspectral Image classification using learning VQ framework.
Proceedings of the IEEE International Conference on Acoustics, 2013

Research on Basic Problems of Cognitive Network Intrusion Prevention.
Proceedings of the Ninth International Conference on Computational Intelligence and Security, 2013

2012
Recognizing Emotions From an Ensemble of Features.
IEEE Trans. Syst. Man Cybern. Part B, 2012

Coupled Dictionary Training for Image Super-Resolution.
IEEE Trans. Image Process., 2012

Bilevel sparse coding for coupled feature spaces.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

Substructure and boundary modeling for continuous action recognition.
Proceedings of the 2012 IEEE Conference on Computer Vision and Pattern Recognition, 2012

2011
Time Varying Dynamic Bayesian Network for Nonstationary Events Modeling and Online Inference.
IEEE Trans. Signal Process., 2011

Modelling and analyses of WSN-based pursuit-evasion strategies for multi-pursuers to multi-evaders.
Int. J. Model. Identif. Control., 2011

Emotion recognition from an ensemble of features.
Proceedings of the Ninth IEEE International Conference on Automatic Face and Gesture Recognition (FG 2011), 2011

2009
CamShift guided particle filter for visual tracking.
Pattern Recognit. Lett., 2009

Event recognition with time varying Hidden Markov Model.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
Influence of anteroposterior shifting of trunk mass centroid on vibrational configuration of human spine.
Comput. Biol. Medicine, 2008

Shanghai Jiao Tong University participation in high-level feature extraction, automatic search and surveillance event detectionat TRECVID 2008.
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008

Fuzzy Logic Control for Vehicle Suspension Systems.
Proceedings of the Intelligent Robotics and Applications, First International Conference, 2008


  Loading...