Kazuya Takeda

Orcid: 0000-0002-0330-1787

Affiliations:
  • Nagoya University


According to our database, Kazuya Takeda authored at least 390 papers between 1987 and 2024.

Bibliography

2024
Runner re-identification from single-view running video in the open-world setting.
Multim. Tools Appl., November, 2024

Open-Vocabulary Predictive World Models from Sensor Observations.
Sensors, July, 2024

LiDAR Point Cloud Augmentation for Adverse Conditions Using Conditional Generative Model.
Remote. Sens., June, 2024

A Survey on Testbench-Based Vehicle-in-the-Loop Simulation Testing for Autonomous Vehicles: Architecture, Principle, and Equipment.
Adv. Intell. Syst., June, 2024

R-Cut: Enhancing Explainability in Vision Transformers with Relationship Weighted Out and Cut.
Sensors, May, 2024

Decentralized policy learning with partial observation and mechanical constraints for multiperson modeling.
Neural Networks, 2024

Estimation of control area in badminton doubles with pose information from top and back view drone videos.
Multim. Tools Appl., 2024

Automatic Detection of Faults in Simulated Race Walking from a Fixed Smartphone Camera.
Int. J. Comput. Sci. Sport, 2024

Sparse Prototype Network for Explainable Pedestrian Behavior Prediction.
CoRR, 2024

MulCPred: Learning Multi-modal Concepts for Explainable Pedestrian Action Prediction.
CoRR, 2024

DRUformer: Enhancing Driving Scene Important Object Detection With Driving Scene Relationship Understanding.
IEEE Access, 2024

Evaluation of a Virtual Experience System for Restaurants.
Proceedings of the 29th International ACM Conference on 3D Web Technology, 2024

360 LiDAR + 360 RGB + 360 Thermal: Multimodal Targetless Calibration.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2024

RSG-Search Plus: An Advanced Traffic Scene Retrieval Methods based on Road Scene Graph.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2024

Contrasting Disentangled Partial Observations for Pedestrian Action Prediction.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2024

Audio Difference Learning for Audio Captioning.
Proceedings of the IEEE International Conference on Acoustics, 2024

Pseudo-label based unsupervised fine-tuning of a monocular 3D pose estimation model for sports motions.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

2023
L-DIG: A GAN-Based Method for LiDAR Point Cloud Processing under Snow Driving Conditions.
Sensors, October, 2023

Controllable Unsupervised Snow Synthesis by Latent Style Space Manipulation.
Sensors, October, 2023

Learning to Predict Navigational Patterns From Partial Observations.
IEEE Robotics Autom. Lett., September, 2023

Localization System for Vehicle Navigation Based on GNSS/IMU Using Time-Series Optimization with Road Gradient Constraint.
J. Robotics Mechatronics, April, 2023

Estimating the effect of hitting strategies in baseball using counterfactual virtual simulation with deep learning.
Int. J. Comput. Sci. Sport, March, 2023

Framework for Generation and Removal of Multiple Types of Adverse Weather from Driving Scene Images.
Sensors, February, 2023

Distracted driving detection based on the fusion of deep learning and causal reasoning.
Inf. Fusion, 2023

DRUformer: Enhancing the driving scene Important object detection with driving relationship self-understanding.
CoRR, 2023

Runner re-identification from single-view video in the open-world setting.
CoRR, 2023

Compositional Semantics for Open Vocabulary Spatio-semantic Representations.
CoRR, 2023

R-Cut: Enhancing Explainability in Vision Transformers with Relationship Weighted Out and Cut.
CoRR, 2023

Multi-Agent Deep-Learning Based Comparative Analysis of Team Sport Trajectories.
IEEE Access, 2023

Action Valuation of On- and Off-Ball Soccer Players Based on Multi-Agent Deep Reinforcement Learning.
IEEE Access, 2023

Intervention Request Planning with Operator Capability Model for Human-Automation Cooperative Recognition.
Proceedings of the IEEE International Conference on Mobility, 2023

Predictive World Models from Real-World Partial Observations.
Proceedings of the IEEE International Conference on Mobility, 2023

Automatic Edge Error Judgment in Figure Skating Using 3D Pose Estimation from a Monocular Camera and IMUs.
Proceedings of the 6th International Workshop on Multimedia Content Analysis in Sports, 2023

LiDAR Point Cloud Translation Between Snow and Clear Conditions Using Depth Images and GANs.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2023

Synthesizing Realistic Snow Effects in Driving Images Using GANs and Real Data with Semantic Guidance.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2023

RSG-Search: Semantic Traffic Scene Retrieval Using Graph-Based Scene Representation.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2023

Real-Time Graph-Based Optimization for GNSS-Doppler Integrated RTK-GNSS/IMU/DR Positioning System in Urban Area.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2023

Open-world driving scene segmentation via multi-stage and multi-modality fusion of vision-language embedding.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2023

Expert-driven Rule-based Refinement of Semantic Segmentation Maps for Autonomous Vehicles.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2023

Uncertainty Aware Task Allocation for Human-Automation Cooperative Recognition in Autonomous Driving Systems.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2023

Personalized Causal Factor Generalization for Subjective Risky Scene Understanding with Vision Transformer.
Proceedings of the 25th IEEE International Conference on Intelligent Transportation Systems, 2023

Sequence-to-Sequence Network Training Methods for Automatic Guitar Transcription With Tokenized Outputs.
Proceedings of the 24th International Society for Music Information Retrieval Conference, 2023

Automatic Edge Error Judgment in Figure Skating Using 3D Pose Estimation from Inertial Sensors.
Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023

Score prediction using multiple object tracking for analyzing movements in 2-vs-2 Handball.
Proceedings of the 12th IEEE Global Conference on Consumer Electronics, 2023

2022
SecretSign: A Method of Finding a Specific Vehicle Privately and Quickly Using Flashing Lights.
IEEE Intell. Transp. Syst. Mag., 2022

Cooperative play classification in team sports via semi-supervised learning.
Int. J. Comput. Sci. Sport, 2022

Automatic detection of faults in race walking from a smartphone camera: a comparison of an Olympic medalist and university athletes.
CoRR, 2022

Estimating counterfactual treatment outcomes over time in complex multi-agent scenarios.
CoRR, 2022

Estimating the Effect of Team Hitting Strategies Using Counterfactual Virtual Simulation in Baseball.
CoRR, 2022

Deepware: An Open-Source Toolkit for Developing and Evaluating Learning-Based and Model-Based Autonomous Driving Models.
IEEE Access, 2022

Occlusion-Aware Motion Planning With Visibility Maximization via Active Lateral Position Adjustment.
IEEE Access, 2022

Deep Reinforcement Learning in a Racket Sport for Player Evaluation With Technical and Tactical Contexts.
IEEE Access, 2022

Data-Driven Risk-Sensitive Control for Personalized Lane Change Maneuvers.
IEEE Access, 2022

Disentangled Bad Weather Removal GAN for Pedestrian Detection.
Proceedings of the 95th IEEE Vehicular Technology Conference, 2022

Methods of Gently Notifying Pedestrians of Approaching Objects when Listening to Music.
Proceedings of the Adjunct Publication of the 35th Annual ACM Symposium on User Interface Software and Technology, 2022

Efficient Training Method for Point Cloud-Based Object Detection Models by Combining Environmental Transitions and Active Learning.
Proceedings of the Robot Intelligence Technology and Applications 7, 2022

Evaluation of Creating Scoring Opportunities for Teammates in Soccer via Trajectory Prediction.
Proceedings of the Machine Learning and Data Mining for Sports Analytics, 2022

Real-to-Synthetic: Generating Simulator Friendly Traffic Scenes from Graph Representation.
Proceedings of the 2022 IEEE Intelligent Vehicles Symposium, 2022

An enhanced driver's risk perception modeling based on gate recurrent unit network.
Proceedings of the 2022 IEEE Intelligent Vehicles Symposium, 2022

Auditory and visual warning information generation of the risk object in driving scenes based on weakly supervised learning.
Proceedings of the 2022 IEEE Intelligent Vehicles Symposium, 2022

Driving Risk and Intervention: Subjective Risk Lane Change Dataset.
Proceedings of the 2022 IEEE Intelligent Vehicles Symposium, 2022

Emergence of Collaborative Hunting via Multi-Agent Deep Reinforcement Learning.
Proceedings of the Pattern Recognition, Computer Vision, and Image Processing. ICPR 2022 International Workshops and Challenges, 2022

Estimating counterfactual treatment outcomes over time in multi-vehicle simulation.
Proceedings of the 30th International Conference on Advances in Geographic Information Systems, 2022

Automatic fault detection in race walking from a smartphone camera via fine-tuning pose estimation.
Proceedings of the 11th IEEE Global Conference on Consumer Electronics, 2022

Improvement of Serial Approach to Anomalous Sound Detection by Incorporating Two Binary Cross-Entropies for Outlier Exposure.
Proceedings of the 30th European Signal Processing Conference, 2022

Improving Dense Representation Learning by Superpixelization and Contrasting Cluster Assignment.
Proceedings of the 33rd British Machine Vision Conference 2022, 2022

2021
Motion Analysis and Performance Improved Method for 3D LiDAR Sensor Data Compression.
IEEE Trans. Intell. Transp. Syst., 2021

Autonomous Driving in Adverse Weather Conditions: A Survey.
CoRR, 2021

ViCE: Self-Supervised Visual Concept Embeddings as Contextual and Pixel Appearance Invariant Semantic Representations.
CoRR, 2021

Flexible prediction of opponent motion with internal representation in interception behavior.
Biol. Cybern., 2021

RSG-Net: Towards Rich Sematic Relationship Prediction for Intelligent Vehicle in Complex Environments.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2021

Eagleye: A Lane-Level Localization Using Low-Cost GNSS/IMU.
Proceedings of the IEEE Intelligent Vehicles Symposium Workshops, 2021

A Comparison of Methods for Sharing Recognition Information and Interventions to Assist Recognition in Autonomous Driving System.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2021

How to monitor multiple autonomous vehicles remotely with few observers: An active management method.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2021

OpenPlanner 2.0: The Portable Open Source Planner for Autonomous Driving Applications.
Proceedings of the IEEE Intelligent Vehicles Symposium Workshops, 2021

Characterization of Multiple 3D LiDARs for Localization and Mapping Performance using the NDT Algorithm.
Proceedings of the IEEE Intelligent Vehicles Symposium Workshops, 2021

Prediction of Personalized Driving Behaviors via Driver-Adaptive Deep Generative Models.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2021

Visibility Estimation in Complex, Real-World Driving Environments Using High Definition Maps.
Proceedings of the 24th IEEE International Intelligent Transportation Systems Conference, 2021

A recognition phase Intervention Interface to Improve Naturalness of Autonomous Driving for Distracted Drivers.
Proceedings of the 24th IEEE International Intelligent Transportation Systems Conference, 2021

Learning a Model for Inferring a Spatial Road Lane Network Graph using Self-Supervision.
Proceedings of the 24th IEEE International Intelligent Transportation Systems Conference, 2021

Anomalous Sound Detection Using a Binary Classification Model and Class Centroids.
Proceedings of the 29th European Signal Processing Conference, 2021

Leveraging State-of-the-art ASR Techniques to Audio Captioning.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

An Ensemble Approach to Anomalous Sound Detection Based on Conformer-Based Autoencoder and Binary Classifier Incorporated with Metric Learning.
Proceedings of the 6th Workshop on Detection and Classification of Acoustic Scenes and Events 2021 (DCASE 2021), 2021

A Method for Location Initialization of Handheld Devices using Autonomous Driving Vehicles for Interactive Systems.
Proceedings of the AutomotiveUI '21: 13th International Conference on Automotive User Interfaces and Interactive Vehicular Applications, Leeds, United Kingdom, September 9-14, 2021, 2021

Automatic Generation of Road Trip Summary Video for Reminiscence and Entertainment using Dashcam Video.
Proceedings of the AutomotiveUI '21: 13th International Conference on Automotive User Interfaces and Interactive Vehicular Applications, 2021

2020
Extracting Human-Like Driving Behaviors From Expert Driver Data Using Deep Learning.
IEEE Trans. Veh. Technol., 2020

Personalized Subjective Driving Risk: Analysis and Prediction.
J. Robotics Mechatronics, 2020

Road Scene Graph: A Semantic Graph-Based Scene Representation Dataset for Intelligent Vehicles.
CoRR, 2020

Policy learning with partial observation and mechanical constraints for multi-person modeling.
CoRR, 2020

Characterization of Multiple 3D LiDARs for Localization and Mapping using Normal Distributions Transform.
CoRR, 2020

A Survey of Autonomous Driving: Common Practices and Emerging Technologies.
IEEE Access, 2020

Performance Analysis of 10 Models of 3D LiDARs for Automated Driving.
IEEE Access, 2020

Generation of Origami Folding Animations from 3D Point Cloud Using Latent Space Interpolation.
Proceedings of the SIGGRAPH Asia 2020 Posters, 2020

Point Grid Map-Based Mid-To-Mid Driving without Object Detection.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2020

LIBRE: The Multiple 3D LiDAR Dataset.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2020

Intelligibility Enhancement Based on Speech Waveform Modification Using Hearing Impairment.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

End-to-End Automatic Speech Recognition Integrated with CTC-Based Voice Activity Detection.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Weakly-Supervised Sound Event Detection with Self-Attention.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Espnet-TTS: Unified, Reproducible, and Integratable Open Source End-to-End Text-to-Speech Toolkit.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

Trajectory prediction with imitation learning reflecting defensive evaluation in team sports.
Proceedings of the 9th IEEE Global Conference on Consumer Electronics, 2020

Conformer-Based Sound Event Detection with Semi-Supervised Learning and Data Augmentation.
Proceedings of the 5th Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

Intervention Force-based Imitation Learning for Autonomous Navigation in Dynamic Environments.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

Cross-Lingual Voice Conversion using a Cyclic Variational Auto-encoder and a WaveNet Vocoder.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2020

2019
ITS+DM Hackathon (ITSC 2017): Lane Departure Prediction With Naturalistic Driving Data.
IEEE Intell. Transp. Syst. Mag., 2019

Real-Time Streaming Point Cloud Compression for 3D LiDAR Sensor Using U-Net.
IEEE Access, 2019

Underdetermined Source Separation Based on Generalized Multichannel Variational Autoencoder.
IEEE Access, 2019

Impact of Driver Behavior on Fuel Consumption: Classification, Evaluation and Prediction Using Machine Learning.
IEEE Access, 2019

Risky Action Recognition in Lane Change Video Clips using Deep Spatiotemporal Networks with Segmentation Mask Transfer.
Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference, 2019

Crossing Blind Intersections from a Full Stop Using Estimated Visibility of Approaching Vehicles.
Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference, 2019

Training Engineers in Autonomous Driving Technologies using Autoware.
Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference, 2019

Personalized Safety-focused Control by Minimizing Subjective Risk.
Proceedings of the 2019 IEEE Intelligent Transportation Systems Conference, 2019

Robustness of Statistical Voice Conversion Based on Direct Waveform Modification Against Background Sounds.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Pre-Trained Text Embeddings for Enhanced Text-to-Speech Synthesis.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Point Cloud Compression for 3D LiDAR Sensor using Recurrent Neural Network with Residual Blocks.
Proceedings of the International Conference on Robotics and Automation, 2019

A Predictive Reward Function for Human-Like Driving Based on a Transition Model of Surrounding Environment.
Proceedings of the International Conference on Robotics and Automation, 2019

Scene-dependent Anomalous Acoustic-event Detection Based on Conditional Wavenet and I-vector.
Proceedings of the IEEE International Conference on Acoustics, 2019

Generalized Multichannel Variational Autoencoder for Underdetermined Source Separation.
Proceedings of the 27th European Signal Processing Conference, 2019

Improving target selection accuracy for vehicle touch screens.
Proceedings of the Adjunct Proceedings of the 11th International Conference on Automotive User Interfaces and Interactive Vehicular Applications, 2019

LeadingDisplay: a versatile, robotic display for infotainment in autonomous vehicles.
Proceedings of the Adjunct Proceedings of the 11th International Conference on Automotive User Interfaces and Interactive Vehicular Applications, 2019

Effects on user perception of a 'modified' speed experience through in-vehicle virtual reality.
Proceedings of the Adjunct Proceedings of the 11th International Conference on Automotive User Interfaces and Interactive Vehicular Applications, 2019

Attention-Based Speech Recognition Using Gaze Information.
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2019

Optimizing Learned Object Detection on Point Clouds from 3D Lidars Through Range and Sparsity Information.
Proceedings of the 2019 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2019

2018
Integrating Driving Behavior and Traffic Context Through Signal Symbolization for Data Reduction and Risky Lane Change Detection.
IEEE Trans. Intell. Veh., 2018

Tsukuba Challenge 2017 Dynamic Object Tracks Dataset for Pedestrian Behavior Analysis.
J. Robotics Mechatronics, 2018

End-to-End Autonomous Mobile Robot Navigation with Model-Based System Support.
J. Robotics Mechatronics, 2018

ITSS Technical Activities Spotlight: Getting to Know the Naturalistic Driving Data Analytics Technical Committee [Technical Activities].
IEEE Intell. Transp. Syst. Mag., 2018

Stereophonic Music Separation Based on Non-Negative Tensor Factorization with Cepstral Distance Regularization.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2018

Daily Activity Recognition with Large-Scaled Real-Life Recording Datasets Based on Deep Neural Network Using Multi-Modal Signals.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2018

Modeling Driver Risk Perception on City Roads Using Deep Learning.
IEEE Access, 2018

Learning How to Drive in Blind Intersections from Human Data.
Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, 2018

Back-Translation-Style Data Augmentation for end-to-end ASR.
Proceedings of the 2018 IEEE Spoken Language Technology Workshop, 2018

End-to-End Navigation with Branch Turning Support Using Convolutional Neural Network.
Proceedings of the IEEE International Conference on Robotics and Biomimetics, 2018

Driving Feature Extraction and Behavior Classification Using an Autoencoder to Reproduce the Velocity Styles of Experts.
Proceedings of the 21st International Conference on Intelligent Transportation Systems, 2018

SecretSign: A Method of Finding an Off-Line Target Object without Revealing the Target to Observers.
Proceedings of the 21st International Conference on Intelligent Transportation Systems, 2018

High Density Ground Maps using Low Boundary Height Estimation for Autonomous Vehicles.
Proceedings of the 21st International Conference on Intelligent Transportation Systems, 2018

Multi-Head Decoder for End-to-End Speech Recognition.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018

Spectral clustering based approach for evaluating the effect of driving behavior on fuel economy.
Proceedings of the IEEE International Instrumentation and Measurement Technology Conference, 2018

Connectionist Temporal Classification-based Sound Event Encoder for Converting Sound Events into Onomatopoeic Representations.
Proceedings of the 26th European Signal Processing Conference, 2018

Anomalous Sound Event Detection Based on WaveNet.
Proceedings of the 26th European Signal Processing Conference, 2018

2017
Duration-Controlled LSTM for Polyphonic Sound Event Detection.
IEEE ACM Trans. Audio Speech Lang. Process., 2017

Signal Processing for Smart Vehicle Technologies: Part 2 [From the Guest Editors].
IEEE Signal Process. Mag., 2017

Open Source Integrated Planner for Autonomous Navigation in Highly Dynamic Environments.
J. Robotics Mechatronics, 2017

A Single-Dimensional Interface for Arranging Multiple Audio Sources in Three-Dimensional Space.
IEICE Trans. Inf. Syst., 2017

Missing component restoration for masked speech signals based on time-domain spectrogram factorization.
Proceedings of the 27th IEEE International Workshop on Machine Learning for Signal Processing, 2017

Continuous point cloud data compression using SLAM based prediction.
Proceedings of the IEEE Intelligent Vehicles Symposium, 2017

Speaker-Dependent WaveNet Vocoder.
Proceedings of the 18th Annual Conference of the International Speech Communication Association, 2017

Music staging AI.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

BLSTM-HMM hybrid system combined with sound activity detection network for polyphonic Sound Event Detection.
Proceedings of the 2017 IEEE International Conference on Acoustics, 2017

Stereophonic music separation based on non-negative tensor factorization with cepstrum regularization.
Proceedings of the 25th European Signal Processing Conference, 2017

An investigation of multi-speaker training for wavenet vocoder.
Proceedings of the 2017 IEEE Automatic Speech Recognition and Understanding Workshop, 2017

An investigation of recurrent neural network for daily activity recognition using multi-modal signals.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017

2016
Accelerated Deformable Part Models on GPUs.
IEEE Trans. Parallel Distributed Syst., 2016

Classification of Driver's Neutral and Cognitive Distraction States Based on Peripheral Vehicle Behavior in Driver's Gaze Transition.
IEEE Trans. Intell. Veh., 2016

Driver-Behavior Modeling Using On-Road Driving Data: A new application for behavior signal processing.
IEEE Signal Process. Mag., 2016

Signal Processing for Smart Vehicle Technologies [From the Guest Editors].
IEEE Signal Process. Mag., 2016

Investigation of DNN-Based Audio-Visual Speech Recognition.
IEICE Trans. Inf. Syst., 2016

Impact of acoustic similarity on efficiency of verbal information transmission via subtle prosodic cues.
EURASIP J. Audio Speech Music. Process., 2016

Integrating driving behavior and traffic context through signal symbolization.
Proceedings of the 2016 IEEE Intelligent Vehicles Symposium, 2016

Compressing continuous point cloud data using image compression methods.
Proceedings of the 19th IEEE International Conference on Intelligent Transportation Systems, 2016

Robust Example Search Using Bottleneck Features for Example-Based Speech Enhancement.
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016

Evaluation on Required Time and Arrival Order of Blocks for Segmented File Downloading Methods Using PR-SCTP and Unordered Delivery.
Proceedings of the 36th IEEE International Conference on Distributed Computing Systems Workshops, 2016

Bidirectional LSTM-HMM Hybrid System for Polyphonic Sound Event Detection.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events, 2016

Modeling and Detecting Excessive Trust from Behavior Signals: Overview of Research Project and Results.
Proceedings of the Human-Harmonized Information Technology, Volume 1 - Vertical Impact, 2016

2015
Modeling of Physical Characteristics of Speech under Stress.
IEEE Signal Process. Lett., 2015

An Open Approach to Autonomous Vehicles.
IEEE Micro, 2015

Tracking driver signage observation using local feature matching and optical flow.
Proceedings of the 2015 IEEE/SICE International Symposium on System Integration, 2015

Traffic trajectory history and drive path generation using GPS data cloud.
Proceedings of the 2015 IEEE Intelligent Vehicles Symposium, 2015

Automatic lane change extraction based on temporal patterns of symbolized driving behavioral data.
Proceedings of the 2015 IEEE Intelligent Vehicles Symposium, 2015

Analyzing driver gaze behavior and consistency of decision making during automated driving.
Proceedings of the 2015 IEEE Intelligent Vehicles Symposium, 2015

Integration of deep bottleneck features for audio-visual speech recognition.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Exploring multi-channel features for denoising-autoencoder-based speech enhancement.
Proceedings of the 2015 IEEE International Conference on Acoustics, 2015

Performance Evaluation on Control Parameters for Segmented File Download Method Using PR-SCTP.
Proceedings of the Third International Symposium on Computing and Networking, 2015

Daily activity recognition based on DNN using environmental sound and acceleration signals.
Proceedings of the 23rd European Signal Processing Conference, 2015

Audio-visual speech recognition using deep bottleneck features and high-performance lipreading.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Daily activity recognition based on acoustic signals and acceleration signals estimated with Gaussian process.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

Relationship between speaker/listener similarity and information transmission quality in speech communication.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2015

2014
A Graph-Based Spoken Dialog Strategy Utilizing Multiple Understanding Hypotheses.
Inf. Media Technol., 2014

Effective Frame Selection for Blind Source Separation Based on Frequency Domain Independent Component Analysis.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2014

Improvement of multimodal gesture and speech recognition performance using time intervals between gestures and accompanying speech.
EURASIP J. Audio Speech Music. Process., 2014

Nagoya University at TRECVID 2014: the Instance Search Task.
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014

STD Method Based on Hash Function for NTCIR11 SpokenQuery&Doc Task.
Proceedings of the 11th NTCIR Conference on Evaluation of Information Access Technologies, 2014

Measuring aggressive driving behavior using signals from drive recorders.
Proceedings of the 17th International IEEE Conference on Intelligent Transportation Systems, 2014

Analysis of peripheral vehicular behavior in driver's gaze transition: Differences between driver's neutral and cognitive distraction states.
Proceedings of the 17th International IEEE Conference on Intelligent Transportation Systems, 2014

Stochastic modeling and disaggregation of energy-consumption behavior.
Proceedings of the IEEE International Conference on Acoustics, 2014

Development and preliminary analysis of sensor signal database of continuous daily living activity over the long term.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Noisy speech recognition using blind spatial subtraction array technique and deep bottleneck features.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2014

Content-Based Driving Scene Retrieval Using Driving Behavior and Environmental Driving Signals.
Proceedings of the Smart Mobile In-Vehicle Systems, Next Generation Advancements, 2014

2013
Modeling and Analysis of Driving Behavior Based on a Probability-Weighted ARX Model.
IEEE Trans. Intell. Transp. Syst., 2013

Stochastic Mixture Modeling of Driving Behavior During Car Following.
J. Inform. and Commun. Convergence Engineering, 2013

Classification of speech under stress based on modeling of the vocal folds and vocal tract.
EURASIP J. Audio Speech Music. Process., 2013

Spoken Content Retrieval Using Distance Combination and Spoken Term Detection Using Hash Function for NTCIR10 SpokenDoc2 Task.
Proceedings of the 10th NTCIR Conference on Evaluation of Information Access Technologies, 2013

Integrated modeling of driver gaze and vehicle operation behavior to estimate risk level during lane changes.
Proceedings of the 16th International IEEE Conference on Intelligent Transportation Systems, 2013

Classification of speech under stress by modeling the aerodynamics of the laryngeal ventricle.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013

Estimation of vocal tract parameters for the classification of speech under stress.
Proceedings of the IEEE International Conference on Acoustics, 2013

Computationally efficient single channel dereverberation based on complementary wiener filter.
Proceedings of the IEEE International Conference on Acoustics, 2013

Modeling head-related transfer functions via spatial-temporal Gaussian process.
Proceedings of the IEEE International Conference on Acoustics, 2013

Analysis and modeling of entrainment in chorus singing.
Proceedings of the IEEE International Conference on Acoustics, 2013

Modeling subjective evaluation of music similarity using tolerance.
Proceedings of the 21st European Signal Processing Conference, 2013

GPU implementations of object detection using HOG features and deformable models.
Proceedings of the 1st IEEE International Conference on Cyber-Physical Systems, 2013

A Discussion on the Consistency of Driving Behavior across Laboratory and Real Situational Studies.
Proceedings of the 35th Annual Meeting of the Cognitive Science Society, 2013

Spoken document retrieval using both word-based and syllable-based document spaces with latent semantic indexing.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2013

2012
Self-Coaching System Based on Recorded Driving Data: Learning From One's Experiences.
IEEE Trans. Intell. Transp. Syst., 2012

Practically Efficient Blind Speech Separation Using Frequency Band Selection Based on Magnitude Squared Coherence and a Small Dodecahedral Microphone Array.
J. Electr. Comput. Eng., 2012

Acoustic Model Training Using Pseudo-Speaker Features Generated by MLLR Transformations for Robust Speaker-Independent Speech Recognition.
IEICE Trans. Inf. Syst., 2012

Causal analysis of task completion errors in spoken music retrieval interactions.
Proceedings of the Eighth International Conference on Language Resources and Evaluation, 2012

An improved driver-behavior model with combined individual and general driving characteristics.
Proceedings of the 2012 IEEE Intelligent Vehicles Symposium, 2012

Measuring driver awareness based on correlation between gaze behavior and risks of surrounding vehicles.
Proceedings of the 15th International IEEE Conference on Intelligent Transportation Systems, 2012

Detection of driver distraction based on temporal relationship between eye-gaze and peripheral vehicle behavior.
Proceedings of the 15th International IEEE Conference on Intelligent Transportation Systems, 2012

Analysis and prediction of deceleration behavior during car following using stochastic driver-behavior model.
Proceedings of the 15th International IEEE Conference on Intelligent Transportation Systems, 2012

Classification of Stressed Speech Using Physical Parameters Derived from Two-Mass Model.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

Impact of driving context on stochastic driver-behavior model: Quantitative analysis of car following task.
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2012

Physical characteristics of vocal folds during speech under stress.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Estimating sound source depth using a small-size array.
Proceedings of the 2012 IEEE International Conference on Acoustics, 2012

Fast source separation based on selection of effective temporal frames.
Proceedings of the 20th European Signal Processing Conference, 2012

Multi-platform Experiment to Discuss Behavioral Consistency across Laboratory and Real Situational Studies.
Proceedings of the 34th Annual Meeting of the Cognitive Science Society, 2012

Acoustic model training using feature vectors generated by manipulating speech parameters of real speakers.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

Subjective similarity of music: Data collection for individuality analysis.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012

2011
International Large-Scale Vehicle Corpora for Research on Driver Behavior on the Road.
IEEE Trans. Intell. Transp. Syst., 2011

Analysis of Real-World Driver's Frustration.
IEEE Trans. Intell. Transp. Syst., 2011

Blind Source Separation Using Dodecahedral Microphone Array under Reverberant Conditions.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2011

Spoken document retrieval method combining query expansion with continuous syllable recognition for NTCIR-SpokenDoc.
Proceedings of the 9th NTCIR Workshop Meeting on Evaluation of Information Access Technologies: Information Retrieval, 2011

An Analysis of the Speech Under Stress Using the Two-Mass Vocal Fold Model.
Proceedings of the Paralinguistic Information and its Integration in Spoken Dialogue Systems, 2011

On-line detection of task incompletion for spoken dialog systems using utterance and behavior tag N-gram vectors.
Proceedings of the Paralinguistic Information and its Integration in Spoken Dialogue Systems, 2011

Modeling and adaptation of stochastic driver-behavior model with application to car following.
Proceedings of the IEEE Intelligent Vehicles Symposium (IV), 2011

Improving driving behavior by allowing drivers to browse their own recorded driving data.
Proceedings of the 14th International IEEE Conference on Intelligent Transportation Systems, 2011

Online parameter estimation of driving behavior using probability-weighted ARX models.
Proceedings of the 14th International IEEE Conference on Intelligent Transportation Systems, 2011

Detection of Task-Incomplete Dialogs Based on Utterance-and-Behavior Tag N-Gram for Spoken Dialog Systems.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

Music Recommendation System Based on Human-to-human Conversation Recognition.
Proceedings of the Workshop Proceedings of the 7th International Conference on Intelligent Environments, 2011

Improving head-related impulse response measured in noisy environments with spatio-temporal frequency analysis.
Proceedings of the IEEE International Conference on Acoustics, 2011

Driver risk evaluation based on acceleration, deceleration, and steering behavior.
Proceedings of the IEEE International Conference on Acoustics, 2011

Efficient blind speech separation suitable for embedded devices.
Proceedings of the 19th European Signal Processing Conference, 2011

Robust seed model training for speaker adaptation using pseudo-speaker features generated by inverse CMLLR transformation.
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011

2010
Acoustic Feature Transformation Combining Average and Maximum Classification Error Minimization Criteria.
IEICE Trans. Inf. Syst., 2010

Acoustic Feature Transformation Based on Discriminant Analysis Preserving Local Structure for Speech Recognition.
IEICE Trans. Inf. Syst., 2010

Evaluation of Combinational Use of Discriminant Analysis-Based Acoustic Feature Transformation and Discriminative Training.
IEICE Trans. Inf. Syst., 2010

Estimation Method of User Satisfaction Using N-gram-based Dialog History Model for Spoken Dialog System.
Proceedings of the International Conference on Language Resources and Evaluation, 2010

A browsing and retrieval system for driving data.
Proceedings of the IEEE Intelligent Vehicles Symposium (IV), 2010

Automatic detection of task-incompleted dialog for spoken dialog system based on dialog act n-gram.
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010

Analyzing grasping for inferring cognitive states of users.
Proceedings of the IEEE International Conference on Acoustics, 2010

A small dodecahedral microphone array for blind source separation.
Proceedings of the IEEE International Conference on Acoustics, 2010

Development of a Multilingual Translation Service for Interpretations and Usage Examples of Mobile Phone Pictograms.
Proceedings of the Culture and Computing, 2010

CENSREC-1-AV: an audio-visual corpus for noisy bimodal speech recognition.
Proceedings of the Auditory-Visual Speech Processing, 2010

2009
Driving Profile Modeling and Recognition Based on Soft Computing Approach.
IEEE Trans. Neural Networks, 2009

A Study of Driver Behavior Under Potential Threats in Vehicle Traffic.
IEEE Trans. Intell. Transp. Syst., 2009

Selective Listening Point Audio Based on Blind Signal Separation and Stereophonic Technology.
IEICE Trans. Inf. Syst., 2009

A multimedia corpus of driving behaviors.
Proceedings of the 2009 IEEE International Workshop on Multimedia Signal Processing, 2009

Representation and comparison of HRTF in spatio-temporal frequency domain.
Proceedings of the 3rd International Universal Communication Symposium, 2009

Prediction model of driving behavior based on traffic conditions and driver types.
Proceedings of the 12th International IEEE Conference on Intelligent Transportation Systems, 2009

Automatic Identification for Singing Style based on Sung Melodic Contour Characterized in Phase Plane.
Proceedings of the 10th International Society for Music Information Retrieval Conference, 2009

Driver evaluation based on classification of rapid decelerating patterns.
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2009

Evaluation of driver-behavior models in real-world car-following task.
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2009

Feature transformation based on discriminant analysis preserving local structure for speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

Stochastic modeling of vehicle trajectory during lane-changing.
Proceedings of the IEEE International Conference on Acoustics, 2009

Spoken dialog strategy based on understanding graph search.
Proceedings of the IEEE International Conference on Acoustics, 2009

Blind source separation based on acoustic pressure distribution and normalized relative phase using dodecahedral microphone array.
Proceedings of the 17th European Signal Processing Conference, 2009

2008
Special Section on Robust Speech Processing in Realistic Environments.
IEICE Trans. Inf. Syst., 2008

Multichannel Speech Enhancement Based on Generalized Gamma Prior Distribution with Its Online Adaptive Estimation.
IEICE Trans. Inf. Syst., 2008

3DAV integrated system featuring arbitrary listening-point and viewpoint generation.
Proceedings of the International Workshop on Multimedia Signal Processing, 2008

In-car Speech Data Collection along with Various Multimodal Signals.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Evaluation Framework for Distant-talking Speech Recognition under Reverberant Environments: newest Part of the CENSREC Series -.
Proceedings of the International Conference on Language Resources and Evaluation, 2008

Building and combining document and music spaces for music query-by-webpage system.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Parameter estimation method of F0 control model for singing voices.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

CENSREC-4: development of evaluation framework for distant-talking speech recognition under reverberant environments.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008

Generating lane-change trajectories of individual drivers.
Proceedings of the IEEE International Conference on Vehicular Electronics and Safety, 2008

An integrative recognition method for speech and gestures.
Proceedings of the 10th International Conference on Multimodal Interfaces, 2008

Encoding large array signals into a 3D sound field representation for selective listening point audio based on blind source separation.
Proceedings of the IEEE International Conference on Acoustics, 2008

Binaural sound localization for untrained directions based on a Gaussian mixture model.
Proceedings of the 2008 16th European Signal Processing Conference, 2008

CENSREC-AV: evaluation frameworks for audio-visual speech recognition.
Proceedings of the International Conference on Auditory-Visual Speech Processing 2008, 2008

2007
Driver Modeling Based on Driving Behavior and Its Evaluation in Driver Identification.
Proc. IEEE, 2007

Robust In-Car Speech Recognition Based on Nonlinear Multiple Regressions.
EURASIP J. Adv. Signal Process., 2007

A Stochastic Representation of the Dynamics of Sung Melody.
Proceedings of the 8th International Conference on Music Information Retrieval, 2007

Statistical segmentation and recognition of fingertip trajectories for a gesture interface.
Proceedings of the 9th International Conference on Multimodal Interfaces, 2007

Development of VAD evaluation framework CENSREC-1-C and investigation of relationship between VAD and speech recognition performance.
Proceedings of the IEEE Workshop on Automatic Speech Recognition & Understanding, 2007

2006
On-line Gaussian mixture modeling in the log-power domain for signal-to-noise ratio estimation and speech enhancement.
Speech Commun., 2006

Driver Identification Using Driving Behavior Signals.
IEICE Trans. Inf. Syst., 2006

Single-Channel Multiple Regression for In-Car Speech Enhancement.
IEICE Trans. Inf. Syst., 2006

CENSREC-3: An Evaluation Framework for Japanese Speech Recognition in Real Car-Driving Environments.
IEICE Trans. Inf. Syst., 2006

Gamma Modeling of Speech Power and Its On-Line Estimation for Statistical Speech Enhancement.
IEICE Trans. Inf. Syst., 2006

Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus.
Proceedings of the Fifth International Conference on Language Resources and Evaluation, 2006

CENSREC2: corpus and evaluation environments for in car continuous digit speech recognition.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Multipoint Measuring System for Video and Sound - 100-camera and microphone system.
Proceedings of the 2006 IEEE International Conference on Multimedia and Expo, 2006

Characterizing in-Car Conversational Speech of Different Dialogue Modes.
Proceedings of the First International Conference on Innovative Computing, Information and Control (ICICIC 2006), 30 August, 2006

Arbitrary Listening-Point Generation Using Sub-Band Representation of Sound Wave Ray-Space.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Cepstral Analysis of Driving Behavioral Signals for Driver Identification.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Adaptive Regression Based Framework for In-Car Speech Recognition.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Development of Micro-Dodecahedral Loudspeaker for Measuring Head-Related Transfer Functions in the Proximal Region.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

Multichannel Speech Enhancement Based on Speech Spectral Magnitude Estimation Using Generalized Gamma Prior Distribution.
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006

2005
Adaptive log-spectral regression for in-car speech recognition using multiple distributed microphones.
IEEE Signal Process. Lett., 2005

Analysis and recognition of whispered speech.
Speech Commun., 2005

Construction and Evaluation of a Large In-Car Speech Corpus.
IEICE Trans. Inf. Syst., 2005

AURORA-2J: An Evaluation Framework for Japanese Noisy Speech Recognition.
IEICE Trans. Inf. Syst., 2005

Multiple Regression of Log Spectra for In-Car Speech Recognition Using Multiple Distributed Microphones.
IEICE Trans. Inf. Syst., 2005

Adaptive Nonlinear Regression Using Multiple Distributed Microphones for In-Car Speech Recognition.
IEICE Trans. Fundam. Electron. Commun. Comput. Sci., 2005

CIAIR In-Car Speech Corpus - Influence of Driving Status.
IEICE Trans. Inf. Syst., 2005

Speech Recognition Using Finger Tapping Timings.
IEICE Trans. Inf. Syst., 2005

Performance Evaluation of H.264 Video Streaming over Inter-Vehicular 802.11 Ad Hoc Networks.
Proceedings of the IEEE 16th International Symposium on Personal, 2005

Maximum a Posterior Probability and Cumulative Distribution Function Equalization Methods for Speech Spectral Estimation with Application in Noise Suppression Filtering.
Proceedings of the Nonlinear Analyses and Algorithms for Speech Processing, 2005

Modeling of individualities in driving through spectral analysis of behavioral signals.
Proceedings of the Eighth International Symposium on Signal Processing and Its Applications, 2005

Speaker verification using Gaussian mixture models within changing real car environments.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Data collection and evaluation of speech recognition for motorbike riders.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Discrimination between singing and speaking voices.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

Subjective and objective quality assessment of regression-enhanced speech in real car environments.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

The sound wave ray-space.
Proceedings of the 2005 IEEE International Conference on Multimedia and Expo, 2005

Improved Noise Spectra Estimation and Log-spectral Regression for In-car Speech Recognition.
Proceedings of the 21st International Conference on Data Engineering Workshops, 2005

Analysis of a large in-car speech corpus.
Proceedings of the 21st International Conference on Data Engineering Workshops, 2005

CENSREC-3: Data Collection for In-Car Speech Recognition and Its Common Evaluation Framework.
Proceedings of the 21st International Conference on Data Engineering Workshops, 2005

A speech enhancement system based on data clustering and cumulative histogram equalization.
Proceedings of the 21st International Conference on Data Engineering Workshops, 2005

SNR and Local Noise Power Estimations Based on Gaussian Mixture Modeling on the Log-Power Domain.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Two-stage Noise Spectra Estimation and Regression based In-car Speech Recognition using Single Distant Microphone.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Spatial coding based on the extraction of moving sound sources in wavefield synthesis.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Analysis of a large in-car speech corpus and its application to the multimodel ASR.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Generalized gamma modeling of speech and its online estimation for speech enhancement.
Proceedings of the 2005 IEEE International Conference on Acoustics, 2005

Parametric Versus Non-parametric Models of Driving Behavior Signals for Driver Identification.
Proceedings of the Audio- and Video-Based Biometric Person Authentication, 2005

2004
Multimedia Corpus of In-Car Speech Communication.
J. VLSI Signal Process., 2004

In-Car Speech Recognition Using Distributed Multiple Microphones.
Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Audio-visual speaker localization for car navigation systems.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Example-based spoken dialogue system with online example augmentation.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Speech enhancement based on magnitude estimation using the gamma prior.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Optimizing regression for in-car speech recognition using multiple distributed microphones.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Recent progress of open-source LVCSR engine Julius and Japanese model repository.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

CIAIR in-car speech database.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Analysis of in-car speech recognition experiments using a large-scale multi-mode dialogue corpus.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Speech recognition using synchronization between speech and finger tapping.
Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Biometric identification using driving behavioral signals.
Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

2003
Blind Source Separation Combining Independent Component Analysis and Beamforming.
EURASIP J. Adv. Signal Process., 2003

Integration of noise reduction algorithms for Aurora2 task.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

A study on domain recognition of spoken dialogue systems.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

In-car speech recognition using distributed microphones-adapting to automatically detected driving conditions.
Proceedings of the 2003 IEEE International Conference on Acoustics, 2003

2002
Recognition of Consonant-Vowel (CV) Units of Speech in a Broadcast News Corpus Using Support Vector Machines.
Proceedings of the Pattern Recognition with Support Vector Machines, 2002

Continuous Speech Recognition Consortium an Open Repository for CSR Tools and Models.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

The Present Status of Speech Database in Japan: Development, Management, and Application to Speech Research.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

Multi-Dimensional Data Acquisition for Integrated Acoustic Information Research.
Proceedings of the Third International Conference on Language Resources and Evaluation, 2002

Experiments on recognition of lavalier microphone speech and whispered speech in real world environments.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Multiple regression of log-spectra for in-car speech recognition.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Recognition of continuous speech segments of monophone units using support vector machines.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Spectral Modification and MIDI Control for Improved Quality of Violin Sound Synthesis.
Proceedings of the 2002 International Computer Music Conference, 2002

Spatial compression of multi-channel audio signals using inverse filters.
Proceedings of the IEEE International Conference on Acoustics, 2002

Acoustic analysis and recognition of whispered speech.
Proceedings of the IEEE International Conference on Acoustics, 2002

Synthesis of car noise based on a composition of engine noise and friction noise.
Proceedings of the IEEE International Conference on Acoustics, 2002

2001
Robust speech recognition based on selective use of missing frequency band HMMs.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Multimedia data collection of in-car speech communication.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Continuous speech recognition without end-point detection.
Proceedings of the IEEE International Conference on Acoustics, 2001

Blind source separation combining frequency-domain ICA and beamforming.
Proceedings of the IEEE International Conference on Acoustics, 2001

Direction of arrival estimation based on nonlinear microphone array.
Proceedings of the IEEE International Conference on Acoustics, 2001

A study on perceptual distance measure for phase spectrum of stimuli.
Proceedings of the IEEE International Conference on Acoustics, 2001

Close-Class-Set Discrimination Method for Recognition of Stop_Consonant-Vowel Utterances Using Support Vector Machines.
Proceedings of the Artificial Neural Networks, 2001

Recognition of consonant-vowel utterances using Support Vector Machines.
Proceedings of the 9th European Symposium on Artificial Neural Networks, 2001

2000
IPA Japanese Dictation Free Software Project.
Proceedings of the Second International Conference on Language Resources and Evaluation, 2000

Vector space representation of language probabilities through SVD of n-gram matrix.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Blind source separation based on subband ICA and beamforming.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Free software toolkit for Japanese large vocabulary continuous speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Construction of speech corpus in moving car environment.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

An acoustic measure for predicting recognition performance degradation.
Proceedings of the IEEE International Conference on Acoustics, 2000

Speech recognition based on space diversity using distributed multi-microphone.
Proceedings of the IEEE International Conference on Acoustics, 2000

Speech enhancement using nonlinear microphone array with noise adaptive complementary beamforming.
Proceedings of the IEEE International Conference on Acoustics, 2000

A new phonetic tied-mixture model for efficient decoding.
Proceedings of the IEEE International Conference on Acoustics, 2000

Evaluation of blind signal separation method using directivity pattern under reverberant conditions.
Proceedings of the IEEE International Conference on Acoustics, 2000

Speech enhancement based on noise adaptive nonlinear microphone array.
Proceedings of the 10th European Signal Processing Conference, 2000

1999
Speech enhancement using nonlinear microphone array under nonstationary noise conditions.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Speaker conversion through non-linear frequency warping of straight spectrum.
Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Speech enhancement using nonlinear microphone array with complementary beamforming.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Compensating of room acoustic transfer functions affected by change of room temperature.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

Audio data hiding by use of band-limited random sequences.
Proceedings of the 1999 IEEE International Conference on Acoustics, 1999

1998
Estimating entropy of a language from optimal word insertion penalty.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Sharable software repository for Japanese large vocabulary continuous speech recognition.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

The design of the newspaper-based Japanese large vocabulary continuous speech recognition corpus.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Balancing acoustic and linguistic probabilities.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Spectral weighting of SBCOR for noise robust speech recognition.
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

1997
Voice activity detection using source separation techniques.
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997

A binaural speech processing method using subband-cross correlation analysis for noise robust recognition.
Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

Speaker Identification Using Harmonic Structure of LP-residual Spectrum.
Proceedings of the Audio- and Video-Based Biometric Person Authentication, 1997

1996
Variability of Lombard effects under different noise conditions.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Extracting speech features from human speech-like noise.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Subband-crosscorrelation analysis for robust speech recognition.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996

1995
Error Analysis of Field Trial Results of a Spoken Dialogue System for Telecommunications Applications.
IEICE Trans. Inf. Syst., 1995

Top-down speech detection and n-best meaning search in a voice activated telephone extension system.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

A prototype of a Japanese-Korean realtime speech translation system.
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995

1994
A trellis-based implementation of minimum error rate training.
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

1993
Improving robustness of network grammar by using class HMM.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

A voice-activated extension telephone exchange system.
Proceedings of the Third European Conference on Speech Communication and Technology, 1993

1992
Architecture and algorithms of a real-time word recognizer for telephone input.
Proceedings of the Second International Conference on Spoken Language Processing, 1992

1990
ATR Japanese speech database as a tool of speech recognition and synthesis.
Speech Commun., 1990

On unit selection algorithms and their evaluation in non-uniform unit speech synthesis.
Proceedings of the ESCA Workshop on Speech Synthesis, 1990

The control of segmental duration in speech synthesis using linguistic properties.
Proceedings of the ESCA Workshop on Speech Synthesis, 1990

On the unit search criteria and algorithms for speech synthesis using non-uniform units.
Proceedings of the First International Conference on Spoken Language Processing, 1990

A large-scale Japanese speech database.
Proceedings of the First International Conference on Spoken Language Processing, 1990

Statistical analysis for segmental duration rules in Japanese speech synthesis.
Proceedings of the First International Conference on Spoken Language Processing, 1990

1989
Adaptive manipulation of non-uniform synthesis units using multi-level unit transcription.
Proceedings of the First European Conference on Speech Communication and Technology, 1989

Construction of a large-scale Japanese speech database and its management system.
Proceedings of the IEEE International Conference on Acoustics, 1989

1987
Acoustic-phonetic labels in a Japanese speech database.
Proceedings of the European Conference on Speech Technology, 1987

