Jun Ogata

Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation, 2021

AI robustness analysis with consideration of corner cases.

Proceedings of the 2021 IEEE International Conference on Artificial Intelligence Testing, 2021

2020

Piecewise Linear Regression under Noise Level Variation via Convex Optimization.

Hiroki Kuroda

Proceedings of the 28th European Signal Processing Conference, 2020

2019

Knowledge Distillation for Throat Microphone Speech Recognition.

Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Visual explanation of neural network based rotation machinery anomaly detection system.

Proceedings of the 2019 IEEE International Conference on Prognostics and Health Management, 2019

Effects of Mounting Position on Throat Microphone Speech Recognition.

Proceedings of the IEEE 8th Global Conference on Consumer Electronics, 2019

Our Neural Machine Translation Systems for WAT 2019.

Wei Yang

Proceedings of the 6th Workshop on Asian Translation, 2019

2018

Fast Intra Mode Decision Method Based on Outliers of DCT Coefficients and Neighboring Block Information for H.265/HEVC.

Koichi Ichige

Proceedings of the IEEE International Symposium on Circuits and Systems, 2018

Tandem Connectionist Anomaly Detection: Use of Faulty Vibration Signals in Feature Representation Learning.

Proceedings of the 2018 IEEE International Conference on Prognostics and Health Management, 2018

Bottleneck feature-mediated DNN-based feature mapping for throat microphone speech recognition.

Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2015

Non-iterative coding tree depth estimation for H.265/HEVC using neighboring block information.

Koichi Ichige

Proceedings of the 10th International Conference on Information, 2015

2014

Two-level fast-forwarding using speech detection for rapidly perusing video.

Proceedings of the 5th Augmented Human International Conference, 2014

2012

PodCastle and Songle: Crowdsourcing-Based Web Services for Retrieval and Browsing of Speech and Music Content.

Proceedings of the First International Workshop on Crowdsourcing Web Search, 2012

PodCastle and songle: crowdsourcing-based web services for spoken content retrieval and active music listening.

Proceedings of the ACM multimedia 2012 workshop on Crowdsourcing for multimedia, 2012

PodCastle and songle: Crowdsourcing-based web services for spoken document retrieval and active music listening.

Proceedings of the 2012 Information Theory and Applications Workshop, 2012

PodCastle: Collaborative Training of Language Models on the Basis of Wisdom of Crowds.

Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011

LyricSynchronizer: Automatic Synchronization System Between Musical Audio Signals and Lyrics.

IEEE J. Sel. Top. Signal Process., 2011

PodCastle: Recent Advances of a Spoken Document Retrieval Service Improved by Anonymous User Contributions.

Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010

PodCastle: A Spoken Document Retrieval Service Improved by Anonymous User Contributions.

Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation, 2010

2009

PodCastle: a spoken document retrieval system for podcasts and its performance improvement by anonymous user contributions.

Proceedings of the third workshop on Searching spontaneous conversational speech, 2009

Acoustic event detection for spotting "hot spots" in podcasts.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Podcastle: collaborative training of acoustic models on the basis of wisdom of crowds for podcast transcription.

Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

The use of acoustically detected filled and silent pauses in spontaneous speech recognition.

Katunobu Itou

Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, 2009

2008

A similar content retrieval method for podcast episodes.

Junta Mizuno

Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Hyperlinking Lyrics: A Method for Creating Hyperlinks Between Phrases in Song Lyrics.

Hiromasa Fujihara

Proceedings of the ISMIR 2008, 2008

2007

Detection and Separation of Speech Events in Meeting Recordings Using a Microphone Array.

EURASIP J. Audio Speech Music. Process., 2007

Automatic transcription for a web 2.0 service to search podcasts.

Kouichirou Eto

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Podcastle: a web 2.0 approach to speech recognition research.

Kouichirou Eto

Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Presentation sensei: a presentation training system using speech and image processing.

Proceedings of the 9th International Conference on Multimodal Interfaces, 2007

2006

Stream-Based Classification and Segmentation of Speech Events in Meeting Recordings.

Futoshi Asano

Proceedings of the Multimedia Content Representation, 2006

Automatic Synchronization between Lyrics and Music CD Recordings Based on Viterbi Alignment of Segregated Vocal Signals.

Proceedings of the Eighth IEEE International Symposium on Multimedia (ISM 2006), 2006

Detection and separation of speech events in meeting recordings.

Futoshi Asano

Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Speech pen: predictive handwriting based on ambient multimodal recognition.

Proceedings of the 2006 Conference on Human Factors in Computing Systems, 2006

2005

Recognition of speech from live sports coverage using acoustic and language model adaptation.

Syst. Comput. Jpn., 2005

Speech repair: quick error correction just by using selection operation for speech input interfaces.

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

State estimation of meetings by information fusion using Bayesian network.

Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004

Detection and Separation of Speech Event Using Audio and Video Information Fusion and Its Application to Robust Speech Interface.

EURASIP J. Adv. Signal Process., 2004

A Drum Pattern Retrieval Method by Voice Percussion.

Proceedings of the ISMIR 2004, 2004

Robust speech interface based on audio and video information fusion for humanoid HRP-2.

Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28, 2004

2003

Topic segmentation and retrieval system for lecture videos based on spontaneous speech recognition.

Natsuo Yamamoto

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Syllable-based acoustic modeling for Japanese spontaneous speech recognition.

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Live speech recognition in sports games by adaptation of acoustic model and language model.

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002

Unsupervised acoustic model adaptation based on phoneme error minimization.

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

English CALL system with functions of speech segmentation and pronunciation evaluation using speech recognition technology.

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001

Improved speech recognition using iterative decoding based on confidence measures.

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000

Study on New Term Weighting Method and New Vector Space Model Based on Word Space in Spoken Document Retrieval.

Seiichi Takao

Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications), 2000

Topic segmentation of news speech using word similarity.

Seiichi Takao

Proceedings of the 8th ACM International Conference on Multimedia 2000, Los Angeles, CA, USA, October 30, 2000

Expanded vector space model based on word space in cross media retrieval of news speech data.

Seiichi Takao

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

An efficient lexical tree search for large vocabulary continuous speech recognition.

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Large vocabulary continuous speech recognition under real environments using adaptive sub-band spectral subtraction.

Masahiro Fujimoto

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1998

Indexing and classification of TV news articles based on speech dictation using word bigram.

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

News Dictation and Article Classification Using Automatically Extracted Announcer Utterance.

Masafumi Nishida

Proceedings of the Advanced Multimedia Content Processing, First International Conference, 1998

1995

Allele-specific methylation and expression of an imprinted U2af1-rs1 (SP2) gene.

Nucleic Acids Res., 1995

1992

Neural Network Approaches for Attractive Area Extraction from Video Images.
