Yasuo Ariki

Kiyoshi Tsukada

Proceedings of the Advances in Multimedia Information Processing - PCM 2004, 5th Pacific Rim Conference on Multimedia, Tokyo, Japan, November 30, 2004

Structuring of baseball live games based on speech recognition using task dependant knowledge.

[BibT_eX]

[DOI]

Atsushi Sako

Proceedings of the 8th International Conference on Spoken Language Processing, 2004

Video shooting navigation system by real-time useful shot discrimination based on video grammar.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Automatic extraction of PC scenes based on feature mining for a real time delivery system of baseball highlight scenes.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Multimedia and Expo, 2004

Robust speech recognition in additive and channel noise environments using GMM and EM algorithm.

[BibT_eX]

[DOI]

Proceedings of the 2004 IEEE International Conference on Acoustics, 2004

2003

Highlight scene extraction in real time from baseball live video.

[BibT_eX]

[DOI]

Masahito Kumano

Kiyoshi Tsukada

Proceedings of the 5th ACM SIGMM International Workshop on Multimedia Information Retrieval, 2003

Topic segmentation and retrieval system for lecture videos based on spontaneous speech recognition.

[BibT_eX]

[DOI]

Natsuo Yamamoto

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Syllable-based acoustic modeling for Japanese spontaneous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Combination of temporal domain SVD based speech enhancement and GMM based speech estimation for ASR in noise - evaluation on the AURORA2 task -.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Live speech recognition in sports games by adaptation of acoustic model and language model.

[BibT_eX]

[DOI]

Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002

Automatic Useful Shot Extraction for a Video Editing Support System.

[BibT_eX]

[DOI]

Masahito Kumano

Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2002), 2002

Unsupervised acoustic model adaptation based on phoneme error minimization.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Evaluation of noisy speech recognition based on noise reduction and acoustic model adaptation on the Aurora2 tasks.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

English call system with functions of speech segmentation and pronunciation evaluation using speech recognition technology.

[BibT_eX]

[DOI]

Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

Video Editing Support System Based on Video Grammar and Content Analysis.

[BibT_eX]

[DOI]

Proceedings of the 16th International Conference on Pattern Recognition, 2002

Noise robust hands-free speech recognition using microphone array and Kalman filter as front-end system of conversational TV.

[BibT_eX]

[DOI]

Proceedings of the IEEE 5th Workshop on Multimedia Signal Processing, 2002

2001

Segmentation of goods catalog video based on video caption.

[BibT_eX]

[DOI]

Hiroshi Matsumoto

Proceedings of the 2001 ACM workshops on Multimedia: multimedia information retrieval, Ottawa, ON, Canada, September 30, 2001

Improved speech recognition using iterative decoding based on confidence measures.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Speaker recognition by separating phonetic space and speaker space.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Speech recognition under musical environments using kalman filter and iterative MLLR adaptation.

[BibT_eX]

[DOI]

Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

Summarization Of News Speech With Unknown Topic Boundary.

[BibT_eX]

[DOI]

T. Haru

Proceedings of the 2001 IEEE International Conference on Multimedia and Expo, 2001

Continuous speech recognition under non-stationary musical environments based on speech state transition model.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2001

2000

Automatic classification of TV sports news video by multiple subspace method.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 2000

Multimedia Technologies for Structuring and Retrieval of TV News.

[BibT_eX]

[DOI]

New Gener. Comput., 2000

Study on New Term Weighting Method and New Vector Space Model Based on Word Space in Spoken Document Retrieval.

[BibT_eX]

[DOI]

Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications), 2000

Topic segmentation of news speech using word similarity.

[BibT_eX]

[DOI]

Proceedings of the 8th ACM International Conference on Multimedia 2000, Los Angeles, CA, USA, October 30, 2000

Organization and retrieval of continuous media.

[BibT_eX]

[DOI]

Proceedings of the ACM Multimedia 2000 Workshops, Los Angeles, CA, USA, October 30, 2000

Expanded vector space model based on word space in cross media retrieval of news speech data.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

An efficient lexical tree search for large vocabulary continuous speech recognition.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Speaker verification by integrating dynamic and static features using subspace method.

[BibT_eX]

[DOI]

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Large vocabulary continuous speech recognition under real environments using adaptive sub-band spectral subtraction.

[BibT_eX]

[DOI]

Masahiro Fujimoto

Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Noisy speech recognition using noise reduction method based on Kalman filter.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Acoustics, 2000

An Advanced Processing Environment for Managing the Continuous and Semistructured Features of Multimedia Content.

[BibT_eX]

[DOI]

Proceedings of the Current Issues in Databases and Information Systems, 2000

1999

Effectiveness of KL-transformation in spectral delta expansion.

[BibT_eX]

[DOI]

M. Tokuhira

Proceedings of the Sixth European Conference on Speech Communication and Technology, 1999

Speaker Indexing for News Articles, Debates and Drama in Broadcasted TV Programs.

[BibT_eX]

[DOI]

Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1999

Automatic Classification of TV News Articles Based on Telop Character Recognition.

[BibT_eX]

[DOI]

Katsumi Matsuura

Proceedings of the IEEE International Conference on Multimedia Computing and Systems, 1999

Telop and Flip Frame Detection and Character Extraction from TV News Articles.

[BibT_eX]

[DOI]

Katsumi Matsuura

Proceedings of the Fifth International Conference on Document Analysis and Recognition, 1999

1998

Scene cut detection and article extraction in news video based on clustering of DCT features.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 1998

Indexing and classification of TV news articles based on speech dictation using word bigram.

[BibT_eX]

[DOI]

Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

Real time speaker indexing based on subspace method - application to TV news articles and debate.

[BibT_eX]

[DOI]

Classification of TV sports news by DCT features using multiple subspace method.

[BibT_eX]

[DOI]

Proceedings of the Fourteenth International Conference on Pattern Recognition, 1998

Unsupervised speaker normalization using canonical correlation analysis.

[BibT_eX]

[DOI]

Miharu Sakuragi

Proceedings of the 1998 IEEE International Conference on Acoustics, 1998

Face Indexing on Video Data - Extraction, Recognition, Tracking and Modeling.

[BibT_eX]

Noriyuki Ishikawa

Proceedings of the 3rd International Conference on Face & Gesture Recognition (FG '98), 1998

News Dictation and Article Classification Using Automatically Extracted Announcer Utterance.

[BibT_eX]

[DOI]

Proceedings of the Advanced Multimedia Content Processing, First International Conference, 1998

Human Information Retrieval by Face Extraction and Recognition on TV News Images Using Subspace Method.

[BibT_eX]

[DOI]

Noriyuki Ishikawa

Proceedings of the Computer Vision, 1998

1997

Indexing and Classification of TV News Articles Based on Telop Recognition.

[BibT_eX]

[DOI]

T. Teranishi

Proceedings of the 4th International Conference Document Analysis and Recognition (ICDAR '97), 1997

Effectiveness of speaker normalized HMM by projection to speaker subspace.

[BibT_eX]

[DOI]

Proceedings of the 1997 IEEE International Conference on Acoustics, 1997

A TV News Retrieval System with Interactive Query Function.

[BibT_eX]

[DOI]

Proceedings of the Second IFCIS International Conference on Cooperative Information Systems, 1997

1996

An enquiring system of unknown words in TV news by spontaneous repetition (application of speaker normalization by speaker subspace projection).

[BibT_eX]

[DOI]

Shigeaki Tagashira

Proceedings of the 4th International Conference on Spoken Language Processing, 1996

Integration of face and speaker recognition by subspace method.

[BibT_eX]

[DOI]

Noriyuki Ishikawa

Proceedings of the 13th International Conference on Pattern Recognition, 1996

Extraction of TV news articles based on scene cut detection using DCT clustering.

[BibT_eX]

[DOI]

Y. Saito

Proceedings of the Proceedings 1996 International Conference on Image Processing, 1996

Speaker recognition and speaker normalization by projection to speaker subspace.

[BibT_eX]

[DOI]

Shigeaki Tagashira

Masayuki Nishijima

Proceedings of the 1996 IEEE International Conference on Acoustics, 1996

Article Extraction and Classification of TV News Using Image and Speech Processing.

[BibT_eX]

M. Sakurai

Proceedings of the International Symposium on Cooperative Database Systems for Advanced Applications, 1996

1995

Segmentation and recognition of handwritten characters using subspace method.

[BibT_eX]

[DOI]

Y. Motegi

Proceedings of the Third International Conference on Document Analysis and Recognition, 1995

1994

Simultaneous spotting of phonemes and words in continuous speech.

[BibT_eX]

[DOI]

T. Kawamura

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Speaker recognition based on subspace methods.

[BibT_eX]

[DOI]

Keisuke Doi

Proceedings of the 3rd International Conference on Spoken Language Processing, 1994

Phoneme recognition improvement by restricting training section in concatenated HMM training.

[BibT_eX]

[DOI]

Keisuke Doi

Proceedings of ICASSP '94: IEEE International Conference on Acoustics, 1994

1990

Optimisation of English phoneme recognition based on HMM.

[BibT_eX]

[DOI]

Andrew M. Sutherland

Mervyn A. Jack

Proceedings of the First International Conference on Spoken Language Processing, 1990

Phoneme probability presentation of continuous speech.

[BibT_eX]

[DOI]

Mervyn A. Jack

Proceedings of the First International Conference on Spoken Language Processing, 1990

OSPREY: a transputer based continuous speech recognition system.

[BibT_eX]

[DOI]

Proceedings of the 1990 International Conference on Acoustics, 1990

1989

Word and monosyllable recognition using lifters on two-dimensional cepstrum.

[BibT_eX]

[DOI]

Masaaki Nagata

Syst. Comput. Jpn., 1989

Enhancement and optimisation of a speech recognition front end based on hidden Markov models.

[BibT_eX]

[DOI]

Fergus R. McInnes

Alan Wrench

Proceedings of the First European Conference on Speech Communication and Technology, 1989

Hierarchical phoneme discrimination by hidden Markov modelling using cepstrum and formant information.

[BibT_eX]

[DOI]

Fergus R. McInnes

Mervyn A. Jack

Proceedings of the IEEE International Conference on Acoustics, 1989

1987

High-speed transformation of drawing images based on structure description.

[BibT_eX]

[DOI]

Syst. Comput. Jpn., 1987

Continuous speech understanding by keyword extraction in a voice mail system.

[BibT_eX]

[DOI]

H. Ohkawa

Proceedings of the European Conference on Speech Technology, 1987

Spoken word recognition using statistic and dynamic information obtained by two-dimensional cepstrum analysis.

[BibT_eX]

[DOI]

Proceedings of the European Conference on Speech Technology, 1987

Uncertainty Reduction Paradigm Using Structural Knowledge in Line-Drawing Understanding.

[BibT_eX]

[DOI]

Masashi Morimoto

Proceedings of the 10th International Joint Conference on Artificial Intelligence. Milan, 1987

1986

Acoustic noise reduction by two dimensional spectral smoothing and spectral amplitude transformation.

[BibT_eX]

[DOI]

K. Kajimoto

Proceedings of the IEEE International Conference on Acoustics, 1986

1984

Speaker-independent word recognition in connected speech on the basis of phoneme recognition.

[BibT_eX]

[DOI]

Kiyoshi Maenobu