Naoya Takahashi

Orcid: 0000-0001-8553-4797

According to our database1, Naoya Takahashi authored at least 58 papers between 1987 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
The whole is greater than the sum of its parts: improving music source separation by bridging networks.
EURASIP J. Audio Speech Music. Process., December, 2024

LOCKEY: A Novel Approach to Model Authentication and Deepfake Tracking.
CoRR, 2024

SilentCipher: Deep Audio Watermarking.
CoRR, 2024

Effects of modulation method and presentation part on distantly-presented bone-conducted ultrasonic perception.
Proceedings of the 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2024

2023


The Whole Is Greater than the Sum of Its Parts: Improving DNN-based Music Source Separation.
CoRR, 2023

Cross-modal Face- and Voice-style Transfer.
CoRR, 2023

STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Iteratively Improving Speech Recognition and Voice Conversion.
Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Multi-Person Tracking Method Robust to Dynamic Viewport Changes for AR apps.
Proceedings of the 19th International Conference on Intelligent Environments, 2023

CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

Numerical Simulation of the Octorotor Flying Car in Sudden Rotor Stop.
Proceedings of the Computational Science - ICCS 2023, 2023

Hierarchical Diffusion Models for Singing Voice Neural Vocoder.
Proceedings of the IEEE International Conference on Acoustics, 2023

Nonparallel Emotional Voice Conversion for Unseen Speaker-Emotion Pairs Using Dual Domain Adversarial Network & Virtual Domain Pairing.
Proceedings of the IEEE International Conference on Acoustics, 2023

Diffroll: Diffusion-Based Generative Music Transcription with Unsupervised Pretraining Capability.
Proceedings of the IEEE International Conference on Acoustics, 2023

2022


Robust One-Shot Singing Voice Conversion.
CoRR, 2022

Leveraging Symmetrical Convolutional Transformer Networks for Speech to Singing Voice Style Transfer.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Amicable Examples for Informed Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2022

Multi-ACCDOA: Localizing And Detecting Overlapping Sounds From The Same Class With Auxiliary Duplicating Permutation Invariant Training.
Proceedings of the IEEE International Conference on Acoustics, 2022

Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection.
Proceedings of the IEEE International Conference on Acoustics, 2022

STARSS22: A Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.
Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021
Source Mixing and Separation Robust Audio Steganography.
CoRR, 2021

Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection.
CoRR, 2021

Hierarchical disentangled representation learning for singing voice conversion.
Proceedings of the International Joint Conference on Neural Networks, 2021

Adversarial Attacks on Audio Source Separation.
Proceedings of the IEEE International Conference on Acoustics, 2021

Accdoa: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization And Detection.
Proceedings of the IEEE International Conference on Acoustics, 2021

End-to-End Lyrics Recognition with Voice to Singing Style Transfer.
Proceedings of the IEEE International Conference on Acoustics, 2021

Densely Connected Multi-Dilated Convolutional Networks for Dense Prediction Tasks.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020
Densely connected multidilated convolutional networks for dense prediction tasks.
CoRR, 2020

D3Net: Densely connected multidilated DenseNet for music source separation.
CoRR, 2020

Sound Event Localization and Detection Using Activity-Coupled Cartesian DOA Vector and RD3net.
CoRR, 2020

Improving Voice Separation by Incorporating End-To-End Speech Recognition.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Electric Field Communication using a Wide Metal Plate as the Transmission Path.
Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020

2019
Recursive Speech Separation for Unknown Number of Speakers.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Electric Field Communication using a Car Body as a Transmission Medium.
Proceedings of the 13th International Conference on Sensing Technology, 2019

A Knowledge Chunk Reuse Support Tool based on Heterogeneous Ontologies.
Proceedings of the 11th International Joint Conference on Knowledge Discovery, 2019

A Teaching Assistant Robot Design Tool Based on Knowledge Chunks Reuse.
Proceedings of the 43rd IEEE Annual Computer Software and Applications Conference, 2019

2018
AENet: Learning Deep Audio Features for Video Analysis.
IEEE Trans. Multim., 2018

A Practical Teacher-Robot Collaboration Lesson Application Based on PRINTEPS.
Rev. Socionetwork Strateg., 2018

Mmdenselstm: An Efficient Combination of Convolutional and Recurrent Neural Networks for Audio Source Separation.
Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

PhaseNet: Discretized Phase Modeling with Deep Neural Networks for Audio Source Separation.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017
Development of applications for teaching assistant robots with teachers in PRINTEPS.
Proceedings of the International Conference on Web Intelligence, 2017

Multi-Scale multi-band densenets for audio source separation.
Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Implementation of Teacher-Robot Collaboration Lesson Application in PRINTEPS.
Proceedings of the Knowledge-Based and Intelligent Information & Engineering Systems: Proceedings of the 21st International Conference KES-2017, 2017

Improving music source separation based on deep neural networks through data augmentation and network blending.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016
Noise reduction combining microphone and piezoelectric device.
CoRR, 2016

Deep Convolutional Neural Networks and Data Augmentation for Acoustic Event Detection.
CoRR, 2016

Automatic Pronunciation Generation by Utilizing a Semi-Supervised Deep Neural Networks.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Deep Convolutional Neural Networks and Data Augmentation for Acoustic Event Recognition.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2014
Chronological states of viewer's intentions using hidden Markov models and features of eye movement.
EAI Endorsed Trans. Context aware Syst. Appl., 2014

2010
Fluorescent pipettes for optically targeted patch-clamp recordings.
Neural Networks, 2010

2007
Electric Koto by vibrating Body.
Proceedings of the 2007 International Computer Music Conference, 2007

1995
Masked Trnsferring Method of Discontinuous Sectors in Disk Cache System.
IEICE Trans. Inf. Syst., 1995

1994
Performance improvement of jukebox-type optical disk file system.
Syst. Comput. Jpn., 1994

1987
Disjoint disjunctive form of boolean functions and its applications.
Syst. Comput. Jpn., 1987


  Loading...