Koichi Shinoda
Orcid: 0000-0003-1095-3203
According to our database1,
Koichi Shinoda
authored at least 147 papers
between 1990 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Pyramid Coder: Hierarchical Code Generator for Compositional Visual Question Answering.
CoRR, 2024
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024
Proceedings of the MultiMedia Modeling - 30th International Conference, 2024
Proceedings of the 2024 6th International Conference on Image, Video and Signal Processing, 2024
2023
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023
EvIs-Kitchen: Egocentric Human Activities Recognition with Video and Inertial Sensor Data.
Proceedings of the MultiMedia Modeling - 29th International Conference, 2023
Synthesizing Speech from ECoG with a Combination of Transformer-Based Encoder and Neural Vocoder.
Proceedings of the IEEE International Conference on Acoustics, 2023
Sensor Data Representation with Transformer-Based Contrastive Learning for Human Action Recognition and Detection.
Proceedings of the 31st European Signal Processing Conference, 2023
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023
2022
Proceedings of the International Joint Conference on Neural Networks, 2022
Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2022
Proceedings of the IEEE International Conference on Acoustics, 2022
Proceedings of the Computer Vision - ECCV 2022, 2022
2021
Speech Paralinguistic Approach for Detecting Dementia Using Gated Convolutional Neural Network.
IEICE Trans. Inf. Syst., 2021
Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, 2021
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2021
2020
Comput. Speech Lang., 2020
Proceedings of the 2020 TREC Video Retrieval Evaluation, 2020
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020
Estimation of Leaf Angle Distribution Based on Statistical Properties of Leaf Shading Distribution.
Proceedings of the IEEE International Geoscience and Remote Sensing Symposium, 2020
Proceedings of the Companion Publication of the 2020 International Conference on Multimodal Interaction, 2020
2019
Comput. Speech Lang., 2019
CoRR, 2019
Multimodal Fusion of BERT-CNN and Gated CNN Representations for Depression Detection.
Proceedings of the 9th International on Audio/Visual Emotion Challenge and Workshop, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019
Proceedings of the 2019 IEEE International Geoscience and Remote Sensing Symposium, 2019
Sequence-level Knowledge Distillation for Model Compression of Attention-based Sequence-to-sequence Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2019
2018
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018
I-vector Transformation Using Conditional Generative Adversarial Networks for Short Utterance Speaker Verification.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Detecting Alzheimer's Disease Using Gated Convolutional Neural Network from Audio Data.
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Proceedings of the 19th Annual Conference of the International Speech Communication Association, 2018
Deep Learning Based Multi-modal Addressee Recognition in Visual Scenes with Utterances.
Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the British Machine Vision Conference 2018, 2018
2017
Comput. Vis. Image Underst., 2017
TokyoTech-AIST at TRECVID 2017: Multimedia Event Detection Using Deep CNNs and Zero-Shot Classiers.
Proceedings of the 2017 TREC Video Retrieval Evaluation, 2017
Boredom Recognition Based on Users' Spontaneous Behaviors in Multiparty Human-Robot Interactions.
Proceedings of the MultiMedia Modeling - 23rd International Conference, 2017
CTC Network with Statistical Language Modeling for Action Sequence Recognition in Videos.
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017
Proceedings of the 25th European Signal Processing Conference, 2017
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
A unified network for multi-speaker speech recognition with multi-channel recordings.
Proceedings of the 2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2017
2016
IEEE Trans. Pattern Anal. Mach. Intell., 2016
EURASIP J. Audio Speech Music. Process., 2016
Robust discriminative training against data insufficiency in PLDA-based speaker verification.
Comput. Speech Lang., 2016
Proceedings of the 2016 TREC Video Retrieval Evaluation, 2016
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the Working Notes Proceedings of the MediaEval 2016 Workshop, 2016
Proceedings of the 17th Annual Conference of the International Speech Communication Association, 2016
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2016
2015
Speech Commun., 2015
IEICE Trans. Inf. Syst., 2015
Proceedings of the 2015 TREC Video Retrieval Evaluation, 2015
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015
Combining Audio Features and Visual I-Vector @ MediaEval 2015 Multimodal Person Discovery in Broadcast TV.
Proceedings of the Working Notes Proceedings of the MediaEval 2015 Workshop, 2015
2014
Proceedings of the 2014 TREC Video Retrieval Evaluation, 2014
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
An efficient error correction interface for speech recognition on mobile touchscreen devices.
Proceedings of the 2014 IEEE Spoken Language Technology Workshop, 2014
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014
Discriminative PLDA training with application-specific loss functions for speaker verification.
Proceedings of the Odyssey 2014: The Speaker and Language Recognition Workshop, 2014
Proceedings of the MultiMedia Modeling - 20th Anniversary International Conference, 2014
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Proceedings of the 15th Annual Conference of the International Speech Communication Association, 2014
Proceedings of the IEEE International Conference on Acoustics, 2014
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 2014
Proceedings of the Computer Vision - ACCV 2014, 2014
2013
IEEE Signal Process. Mag., 2013
Speech Commun., 2013
Speech Commun., 2013
J. Vis. Commun. Image Represent., 2013
IEICE Trans. Inf. Syst., 2013
EURASIP J. Image Video Process., 2013
EURASIP J. Image Video Process., 2013
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013
Combining deep speaker specific representations with GMM-SVM for speaker verification.
Proceedings of the 14th Annual Conference of the International Speech Communication Association, 2013
Proceedings of the New Trends in Image Analysis and Processing - ICIAP 2013, 2013
Proceedings of the IEEE International Conference on Computer Vision, 2013
2012
A Fast and Accurate Video Semantic-Indexing System Using Fast MAP Adaptation and GMM Supervectors.
IEEE Trans. Multim., 2012
IEICE Trans. Inf. Syst., 2012
Online Speaker Clustering Using Incremental Learning of an Ergodic Hidden Markov Model.
IEICE Trans. Inf. Syst., 2012
IEICE Trans. Inf. Syst., 2012
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012
Overlapped Speech Detection in Meeting Using Cross-Channel Spectral Subtraction and Spectrum Similarity.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012
Proceedings of the 19th IEEE International Conference on Image Processing, 2012
Acoustic model training using committee-based active and semi-supervised learning for speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2012
q-Gaussian Mixture Models Based on Non-extensive Statistics for Image and Video Semantic Indexing.
Proceedings of the Computer Vision, 2012
2011
Speech Commun., 2011
IEICE Trans. Inf. Syst., 2011
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011
A fast MAP adaptation technique for gmm-supervector-based video semantic indexing systems.
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011
Proceedings of the 2011 joint ACM workshop on Human gesture and behavior understanding, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the IEEE International Conference on Acoustics, 2011
Proceedings of the 2011 IEEE Workshop on Automatic Speech Recognition & Understanding, 2011
2010
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010
Proceedings of the 11th Annual Conference of the International Speech Communication Association, 2010
Proceedings of the 20th International Conference on Pattern Recognition, 2010
Proceedings of the 20th International Conference on Pattern Recognition, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
2009
Proceedings of the TRECVID 2009 workshop participants notebook papers, 2009
Proceedings of the Human Language Technology. Challenges for Computer Science and Linguistics, 2009
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009
Proceedings of the IEEE International Conference on Acoustics, 2009
2008
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008
Proceedings of the 2nd ACM Workshop on Video Summarization, 2008
Proceedings of the Large-Scale Knowledge Resources. Construction and Application, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
Robust spoken term detection using combination of phone-based and word-based recognition.
Proceedings of the 9th Annual Conference of the International Speech Communication Association, 2008
2007
EURASIP J. Adv. Signal Process., 2007
Proceedings of the TRECVID 2007 workshop participants notebook papers, 2007
Dynamic language model adaptation using presentation slides for lecture speech recognition.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Automatic estimation of scaling factors among probabilistic models in speech recognition.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the IEEE International Conference on Acoustics, 2007
Proceedings of the 15th European Signal Processing Conference, 2007
Proceedings of the 6th ACM International Conference on Image and Video Retrieval, 2007
2006
IEICE Trans. Inf. Syst., 2006
Proceedings of the 2006 TREC Video Retrieval Evaluation, 2006
Proceedings of the 8th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2006
Proceedings of the 2006 IEEE International Conference on Acoustics Speech and Signal Processing, 2006
2005
Robust highlight extraction using multi-stream hidden Markov models for baseball video.
Proceedings of the 2005 International Conference on Image Processing, 2005
2002
Vocal tract length normalization using rapid maximum-likelihood estimation for speech recognition.
Syst. Comput. Jpn., 2002
Efficient reduction of Gaussian components using MDL criterion for HMM-based speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2002
2001
IEEE Trans. Speech Audio Process., 2001
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001
2000
1998
Proceedings of the 1998 IEEE International Conference on Acoustics, 1998
1997
Proceedings of the Fifth European Conference on Speech Communication and Technology, 1997
1996
Unsupervised and incremental speaker adaptation under adverse environmental conditions.
Proceedings of the 4th International Conference on Spoken Language Processing, 1996
Proceedings of the 1996 IEEE International Conference on Acoustics, 1996
1995
Proceedings of the Fourth European Conference on Speech Communication and Technology, 1995
Proceedings of the 1995 International Conference on Acoustics, 1995
1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
Proceedings of the 3rd International Conference on Spoken Language Processing, 1994
1991
Proceedings of the 1991 International Conference on Acoustics, 1991
1990
Proceedings of the First International Conference on Spoken Language Processing, 1990