Naoya Takahashi

Sho Otsuka

Seiji Nakagawa

Proceedings of the 46th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, 2024

2023

STARSS23: Sony-TAu Realistic Spatial Soundscapes 2023.

[BibT_eX]

[DOI]

Aapo Hakala

Shusuke Takahashi

Dataset, March, 2023

STARSS23: Sony-TAu Realistic Spatial Soundscapes 2023.

[BibT_eX]

[DOI]

Aapo Hakala

Shusuke Takahashi

Dataset, March, 2023

The Whole Is Greater than the Sum of Its Parts: Improving DNN-based Music Source Separation.

[BibT_eX]

[DOI]

CoRR, 2023

Cross-modal Face- and Voice-style Transfer.

[BibT_eX]

[DOI]

CoRR, 2023

STARSS23: An Audio-Visual Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.

[BibT_eX]

[DOI]

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Iteratively Improving Speech Recognition and Voice Conversion.

[BibT_eX]

[DOI]

Naoyuki Onoe

Proceedings of the 24th Annual Conference of the International Speech Communication Association, 2023

Multi-Person Tracking Method Robust to Dynamic Viewport Changes for AR apps.

[BibT_eX]

[DOI]

Tatsuya Amano

Hirozumi Yamaguchi

Proceedings of the 19th International Conference on Intelligent Environments, 2023

CLIPSep: Learning Text-queried Sound Separation with Noisy Unlabeled Videos.

[BibT_eX]

[DOI]

Taylor Berg-Kirkpatrick

Proceedings of the Eleventh International Conference on Learning Representations, 2023

Numerical Simulation of the Octorotor Flying Car in Sudden Rotor Stop.

[BibT_eX]

[DOI]

Proceedings of the Computational Science - ICCS 2023, 2023

Hierarchical Diffusion Models for Singing Voice Neural Vocoder.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Nonparallel Emotional Voice Conversion for Unseen Speaker-Emotion Pairs Using Dual Domain Adversarial Network & Virtual Domain Pairing.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

Diffroll: Diffusion-Based Generative Music Transcription with Unsupervised Pretraining Capability.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2023

2022

STARSS22: Sony-TAu Realistic Spatial Soundscapes 2022 dataset.

[BibT_eX]

[DOI]

Sharath Adavanne

Yuichiro Koyama

Shusuke Takahashi

Tuomas Virtanen

Dataset, May, 2022

STARSS22: Sony-TAu Realistic Spatial Soundscapes 2022 dataset.

[BibT_eX]

[DOI]

Adavanne Politis

Dataset, March, 2022

Robust One-Shot Singing Voice Conversion.

[BibT_eX]

[DOI]

CoRR, 2022

Leveraging Symmetrical Convolutional Transformer Networks for Speech to Singing Voice Style Transfer.

[BibT_eX]

[DOI]

Shrutina Agarwal

Sriram Ganapathy

Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Amicable Examples for Informed Source Separation.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Multi-ACCDOA: Localizing And Detecting Overlapping Sounds From The Same Class With Auxiliary Duplicating Permutation Invariant Training.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

Spatial Data Augmentation with Simulated Room Impulse Responses for Sound Event Localization and Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2022

STARSS22: A Dataset of Spatial Recordings of Real Scenes with Spatiotemporal Annotations of Sound Events.

[BibT_eX]

[DOI]

Proceedings of the 7th Workshop on Detection and Classification of Acoustic Scenes and Events 2022, 2022

2021

Source Mixing and Separation Robust Audio Steganography.

[BibT_eX]

[DOI]

CoRR, 2021

Ensemble of ACCDOA- and EINV2-based Systems with D3Nets and Impulse Response Simulation for Sound Event Localization and Detection.

[BibT_eX]

[DOI]

CoRR, 2021

Hierarchical disentangled representation learning for singing voice conversion.

[BibT_eX]

[DOI]

Proceedings of the International Joint Conference on Neural Networks, 2021

Adversarial Attacks on Audio Source Separation.

[BibT_eX]

[DOI]

Shota Inoue

Proceedings of the IEEE International Conference on Acoustics, 2021

Accdoa: Activity-Coupled Cartesian Direction of Arrival Representation for Sound Event Localization And Detection.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

End-to-End Lyrics Recognition with Voice to Singing Style Transfer.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2021

Densely Connected Multi-Dilated Convolutional Networks for Dense Prediction Tasks.

[BibT_eX]

[DOI]

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

2020

Densely connected multidilated convolutional networks for dense prediction tasks.

[BibT_eX]

[DOI]

CoRR, 2020

D3Net: Densely connected multidilated DenseNet for music source separation.

[BibT_eX]

[DOI]

CoRR, 2020

Sound Event Localization and Detection Using Activity-Coupled Cartesian DOA Vector and RD3net.

[BibT_eX]

[DOI]

CoRR, 2020

Improving Voice Separation by Incorporating End-To-End Speech Recognition.

[BibT_eX]

[DOI]

Sudarsanam Parthasaarathy

Sakya Basak

Sriram Ganapathy

Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Electric Field Communication using a Wide Metal Plate as the Transmission Path.

[BibT_eX]

[DOI]

Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020

2019

Recursive Speech Separation for Unknown Number of Speakers.

[BibT_eX]

[DOI]

Sudarsanam Parthasaarathy

Nabarun Goswami

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Electric Field Communication using a Car Body as a Transmission Medium.

[BibT_eX]

[DOI]

Proceedings of the 13th International Conference on Sensing Technology, 2019

A Knowledge Chunk Reuse Support Tool based on Heterogeneous Ontologies.

[BibT_eX]

[DOI]

Proceedings of the 11th International Joint Conference on Knowledge Discovery, 2019

A Teaching Assistant Robot Design Tool Based on Knowledge Chunks Reuse.

[BibT_eX]

[DOI]

Proceedings of the 43rd IEEE Annual Computer Software and Applications Conference, 2019

2018

AENet: Learning Deep Audio Features for Video Analysis.

[BibT_eX]

[DOI]

Michael Gygli

Luc Van Gool

IEEE Trans. Multim., 2018

A Practical Teacher-Robot Collaboration Lesson Application Based on PRINTEPS.

[BibT_eX]

[DOI]

Rev. Socionetwork Strateg., 2018

Mmdenselstm: An Efficient Combination of Convolutional and Recurrent Neural Networks for Audio Source Separation.

[BibT_eX]

[DOI]

Nabarun Goswami

Proceedings of the 16th International Workshop on Acoustic Signal Enhancement, 2018

PhaseNet: Discretized Phase Modeling with Deep Neural Networks for Audio Source Separation.

[BibT_eX]

[DOI]

Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

2017

Development of applications for teaching assistant robots with teachers in PRINTEPS.

[BibT_eX]

[DOI]

Proceedings of the International Conference on Web Intelligence, 2017

Multi-Scale multi-band densenets for audio source separation.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, 2017

Implementation of Teacher-Robot Collaboration Lesson Application in PRINTEPS.

[BibT_eX]

[DOI]

Proceedings of the Knowledge-Based and Intelligent Information & Engineering Systems: Proceedings of the 21st International Conference KES-2017, 2017

Improving music source separation based on deep neural networks through data augmentation and network blending.

[BibT_eX]

[DOI]

Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

2016

Noise reduction combining microphone and piezoelectric device.

[BibT_eX]

[DOI]

Mitsuharu Matsumoto

Shuji Hashimoto

CoRR, 2016

Deep Convolutional Neural Networks and Data Augmentation for Acoustic Event Detection.

[BibT_eX]

[DOI]

CoRR, 2016

Automatic Pronunciation Generation by Utilizing a Semi-Supervised Deep Neural Networks.

[BibT_eX]

[DOI]

Tofigh Naghibi

Beat Pfister

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Deep Convolutional Neural Networks and Data Augmentation for Acoustic Event Recognition.

[BibT_eX]

[DOI]

Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

2014

Chronological states of viewer's intentions using hidden Markov models and features of eye movement.

[BibT_eX]

[DOI]

Minoru Nakayama

EAI Endorsed Trans. Context aware Syst. Appl., 2014

2010

Fluorescent pipettes for optically targeted patch-clamp recordings.

[BibT_eX]

[DOI]

Neural Networks, 2010

2007

Electric Koto by vibrating Body.

[BibT_eX]

[DOI]

Mitsuharu Matsumoto

Shuji Hashimoto

Proceedings of the 2007 International Computer Music Conference, 2007

1995

Masked Trnsferring Method of Discontinuous Sectors in Disk Cache System.

[BibT_eX]

[DOI]

IEICE Trans. Inf. Syst., 1995

1994

Performance improvement of jukebox-type optical disk file system.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 1994

1987

Disjoint disjunctive form of boolean functions and its applications.

[BibT_eX]

[DOI]