Wei Li

Orcid: 0000-0002-4486-8341

Affiliations:
  • Fudan University, School of Computer Science, Shanghai Key Laboratory of Intelligent Information Processing, China (PhD 2004)


According to our database1, Wei Li authored at least 77 papers between 1999 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

Online presence:

On csauthors.net:

Bibliography

2024
Technical Report for ActivityNet Challenge 2022 - Temporal Action Localization.
CoRR, 2024

Technical Report for Soccernet 2023 - Dense Video Captioning.
CoRR, 2024

Technical Report for SoccerNet Challenge 2022 - Replay Grounding Task.
CoRR, 2024

Semi-Supervised Self-Learning Enhanced Music Emotion Recognition.
CoRR, 2024

Harmonic Frequency-Separable Transformer for Instrument-Agnostic Music Transcription.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

Improving Drum Source Separation with Temporal-Frequency Statistical Descriptors.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2024

A Scalable Sparse Transformer Model for Singing Melody Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2024

Mertech: Instrument Playing Technique Detection Using Self-Supervised Pretrained Model with Multi-Task Finetuning.
Proceedings of the IEEE International Conference on Acoustics, 2024

2023
Stripe-Transformer: deep stripe feature learning for music source separation.
EURASIP J. Audio Speech Music. Process., December, 2023

The capacity of k-connectivity d-dimensional wireless networks with node failure.
Sci. China Inf. Sci., October, 2023

Multi-scale network with shared cross-attention for audio-visual correlation learning.
Neural Comput. Appl., September, 2023

Melody Generation from Lyrics with Local Interpretability.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Variational Autoencoder with CCA for Audio-Visual Cross-modal Retrieval.
ACM Trans. Multim. Comput. Commun. Appl., 2023

Intelligent Channel Prediction and Power Adaptation in LEO Constellation for 6G.
IEEE Netw., 2023

A neural harmonic-aware network with gated attentive fusion for singing melody extraction.
Neurocomputing, 2023

MFAE: Masked frame-level autoencoder with hybrid-supervision for low-resource music transcription.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

LC-Beating: An Online System for Beat and Downbeat Tracking using Latency-Controlled Mechanism.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2023

Frame-Level Multi-Label Playing Technique Detection Using Multi-Scale Network and Self-Attention Mechanism.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022
A personality-guided affective brain - computer interface for implementation of emotional intelligence in machines.
Frontiers Inf. Technol. Electron. Eng., 2022

SEAL: A Large-scale Video Dataset of Multi-grained Spatio-temporally Action Localization.
CoRR, 2022

Faster-TAD: Towards Temporal Action Detection with Proposal Generation and Classification in a Unified Network.
CoRR, 2022

Melody Generation from Lyrics Using Three Branch Conditional LSTM-GAN.
Proceedings of the MultiMedia Modeling - 28th International Conference, 2022

Singing Voice Detection via Similarity-Based Semi-Supervised Learning.
Proceedings of the 4th ACM International Conference on Multimedia in Asia, 2022


HPPNet: Modeling the Harmonic Structure and Pitch Invariance in Piano Transcription.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Automatic Chinese National Pentatonic Modes Recognition Using Convolutional Neural Network.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Playing Technique Detection by Fusing Note Onset Information in Guzheng Performance.
Proceedings of the 23rd International Society for Music Information Retrieval Conference, 2022

Multimodal Music Emotion Recognition with Hierarchical Cross-Modal Attention Network.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

HarmoF0: Logarithmic Scale Dilated Convolution for Pitch Estimation.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2022

A Glance-and-Gaze Network for Respiratory Sound Classification.
Proceedings of the IEEE International Conference on Acoustics, 2022

Hierarchical Graph-Based Neural Network for Singing Melody Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2022

Deepchorus: A Hybrid Model of Multi-Scale Convolution And Self-Attention for Chorus Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

Tonet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music.
Proceedings of the IEEE International Conference on Acoustics, 2022

Robust Capacity of Wireless Networks Under Cascading Failures.
Proceedings of the IEEE Global Communications Conference, 2022

MV-TAL: Mulit-view Temporal Action Localization in Naturalistic Driving.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, 2022

2021
HANME: Hierarchical Attention Network for Singing Melody Extraction.
IEEE Signal Process. Lett., 2021

Musical Tempo Estimation Using a Multi-scale Network.
Proceedings of the 22nd International Society for Music Information Retrieval Conference, 2021

Singer Identification Using Deep Timbre Feature Learning with KNN-NET.
Proceedings of the IEEE International Conference on Acoustics, 2021

Frequency-Temporal Attention Network for Singing Melody Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2021

An Hrnet-Blstm Model With Two-Stage Training For Singing Melody Extraction.
Proceedings of the IEEE International Conference on Acoustics, 2021

2020
Computer Audition for Healthcare: Opportunities and Challenges.
Frontiers Digit. Health, 2020

Music Artist Classification with WaveNet Classifier for Raw Waveform Audio Data.
CoRR, 2020

Comparison for Improvements of Singing Voice Detection System Based on Vocal Separation.
CoRR, 2020

Residual Attention Based Network for Automatic Classification of Phonation Modes.
Proceedings of the IEEE International Conference on Multimedia and Expo, 2020

2019
Automatic Audio Chord Recognition With MIDI-Trained Deep Feature and BLSTM-CRF Sequence Decoding Model.
IEEE ACM Trans. Audio Speech Lang. Process., 2019

Large-vocabulary Chord Transcription Via Chord Structure Decomposition.
Proceedings of the 20th International Society for Music Information Retrieval Conference, 2019

Vocal Melody Extraction via DNN-based Pitch Estimation and Salience-based Pitch Refinement.
Proceedings of the IEEE International Conference on Acoustics, 2019

2018
Music Chord Recognition Based on Midi-Trained Deep Feature and BLSTM-CRF Hybird Decoding.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

2017
流行音乐主旋律提取技术综述 (Review on Main Melody Extraction from Pop Music).
计算机科学, 2017

2015
SIFT-based local spectrogram image descriptor: a novel feature for robust music identification.
EURASIP J. Audio Speech Music. Process., 2015

Towards Solving the Bottleneck of Pitch-based Singing Voice Separation.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Latent time-frequency component analysis: A novel pitch-based approach for singing voice separation.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

2013
Multi-Stage Non-Negative Matrix Factorization for Monaural Singing Voice Separation.
IEEE Trans. Speech Audio Process., 2013

Low-order auditory Zernike moment: a novel approach for robust music identification in the compressed domain.
EURASIP J. Adv. Signal Process., 2013

Music content authentication based on beat segmentation and fuzzy classification.
EURASIP J. Audio Speech Music. Process., 2013

2012
A Double-Ranking Strategy for Long-Tail Product Recommendation.
Proceedings of the 2012 IEEE/WIC/ACM International Conferences on Web Intelligence, 2012

On the music content authentication.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012

2011
Towards content-based audio fragment authentication.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011

2010
Robust music identification based on low-order zernike moment in the compressed domain.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

Robust audio identification for MP3 popular music.
Proceedings of the Proceeding of the 33rd International ACM SIGIR Conference on Research and Development in Information Retrieval, 2010

A novel audio fingerprinting method robust to time scale modification and pitch shifting.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

Robust hashing for music copyright protection by combining beat segmentation and chroma.
Proceedings of the 18th International Conference on Multimedia 2010, 2010

2009
A Robust Mesh Watermarking Scheme Based on PCA.
Proceedings of the Fifth International Conference on Image and Graphics, 2009

2008
Audio Quality-Based Authentication Using Wavelet Packet Decomposition and Best Tree Selection.
Proceedings of the 4th International Conference on Intelligent Information Hiding and Multimedia Signal Processing (IIH-MSP 2008), 2008

2006
Localized audio watermarking technique robust against time-scale modification.
IEEE Trans. Multim., 2006

2004
Multilingual Collection Retrieving Via Ontology Alignment.
Proceedings of the Digital Libraries: International Collaboration and Cross-Fertilization, 2004

2003
An Audio Watermarking Technique That Is Robust Against Random Cropping.
Comput. Music. J., 2003

Audio Watermarking Based on Statistical Feature in Wavelet Domain.
Proceedings of the Twelfth International World Wide Web Conference - Posters, 2003

Content Based Localized Robust Audio Watermarking.
Proceedings of the Interactive Multimedia on Next Generation Networks, 2003

Audio Watermarking Based on Music Content Analysis: Robust against Time Scale Modification.
Proceedings of the Digital Watermarking, Second International Workshop, 2003

A Novel Feature-Based Robust Audio Watermarking for Copyright Protection.
Proceedings of the 2003 International Symposium on Information Technology (ITCC 2003), 2003

Multi-channel Data Hiding Scheme for Color Images.
Proceedings of the 2003 International Symposium on Information Technology (ITCC 2003), 2003

An Optimized Multi-bits Blind Watermarking Scheme.
Proceedings of the Information and Communications Security, 5th International Conference, 2003

Robust Spatial Data Hiding for Color Images.
Proceedings of the Communications and Multimedia Security, 2003

2000
New approaches without postprocessing to FIR system identification using selected order cumulants.
IEEE Trans. Signal Process., 2000

Speech enhancement using the constrained-optimization technique.
IEEE Signal Process. Lett., 2000

1999
Recovery of single source signal from noisy and reverberant environments using second-order statistics.
Signal Process., 1999


  Loading...