Jun Ogata

According to our database, Jun Ogata authored at least 61 papers between 1992 and 2024.

Collaborative distances:
  • Dijkstra number of four.
  • Erdős number of four.

Bibliography

2024
FruitsMusic: A Real-World Corpus of Japanese Idol-Group Songs.
CoRR, 2024

Normal with Occasional Anomalies: Feature Extraction for Detecting Non-Stationary Abnormal Events in Wind Turbines.
Proceedings of the 32nd European Signal Processing Conference, 2024

2023
Learning Discriminative Feature Representations via Metric Learning for Early Operation of Wind Turbine Anomaly Detection Systems.
Proceedings of the International Conference on Machine Learning and Applications, 2023

Multi-Self-Supervised Learning Model-Based Throat Microphone Speech Recognition.
Proceedings of the Asia Pacific Signal and Information Processing Association Annual Summit and Conference, 2023

2022
Exploiting Fine-tuning of Self-supervised Learning Models for Improving Bi-modal Sentiment Analysis and Emotion Recognition.
Proceedings of the 23rd Annual Conference of the International Speech Communication Association, 2022

Applying Generative Adversarial Networks and Vision Transformers in Speech Emotion Recognition.
Proceedings of the HCI International 2022 - Late Breaking Papers. Multimodality in Advanced Interaction Environments, 2022

Throat microphone speech recognition using wav2vec 2.0 and feature mapping.
Proceedings of the 11th IEEE Global Conference on Consumer Electronics, 2022

2021
Stronger Baseline for Robust Results in Multimodal Sentiment Analysis.
Proceedings of the 35th Pacific Asia Conference on Language, Information and Computation, 2021

AI robustness analysis with consideration of corner cases.
Proceedings of the 2021 IEEE International Conference on Artificial Intelligence Testing, 2021

2020
Piecewise Linear Regression under Noise Level Variation via Convex Optimization.
Proceedings of the 28th European Signal Processing Conference, 2020

2019
Knowledge Distillation for Throat Microphone Speech Recognition.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Visual explanation of neural network based rotation machinery anomaly detection system.
Proceedings of the 2019 IEEE International Conference on Prognostics and Health Management, 2019

Effects of Mounting Position on Throat Microphone Speech Recognition.
Proceedings of the IEEE 8th Global Conference on Consumer Electronics, 2019

Our Neural Machine Translation Systems for WAT 2019.
Proceedings of the 6th Workshop on Asian Translation, 2019

2018
Fast Intra Mode Decision Method Based on Outliers of DCT Coefficients and Neighboring Block Information for H.265/HEVC.
Proceedings of the IEEE International Symposium on Circuits and Systems, 2018

Tandem Connectionist Anomaly Detection: Use of Faulty Vibration Signals in Feature Representation Learning.
Proceedings of the 2018 IEEE International Conference on Prognostics and Health Management, 2018

Bottleneck feature-mediated DNN-based feature mapping for throat microphone speech recognition.
Proceedings of the Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, 2018

2015
Non-iterative coding tree depth estimation for H.265/HEVC using neighboring block information.
Proceedings of the 10th International Conference on Information, 2015

2014
Two-level fast-forwarding using speech detection for rapidly perusing video.
Proceedings of the 5th Augmented Human International Conference, 2014

2012
PodCastle and Songle: Crowdsourcing-Based Web Services for Retrieval and Browsing of Speech and Music Content.
Proceedings of the First International Workshop on Crowdsourcing Web Search, 2012

PodCastle and songle: crowdsourcing-based web services for spoken content retrieval and active music listening.
Proceedings of the ACM multimedia 2012 workshop on Crowdsourcing for multimedia, 2012

PodCastle and songle: Crowdsourcing-based web services for spoken document retrieval and active music listening.
Proceedings of the 2012 Information Theory and Applications Workshop, 2012

PodCastle: Collaborative Training of Language Models on the Basis of Wisdom of Crowds.
Proceedings of the 13th Annual Conference of the International Speech Communication Association, 2012

2011
LyricSynchronizer: Automatic Synchronization System Between Musical Audio Signals and Lyrics.
IEEE J. Sel. Top. Signal Process., 2011

PodCastle: Recent Advances of a Spoken Document Retrieval Service Improved by Anonymous User Contributions.
Proceedings of the 12th Annual Conference of the International Speech Communication Association, 2011

2010
PodCastle: A Spoken Document Retrieval Service Improved by Anonymous User Contributions.
Proceedings of the 24th Pacific Asia Conference on Language, Information and Computation, 2010

2009
PodCastle: a spoken document retrieval system for podcasts and its performance improvement by anonymous user contributions.
Proceedings of the third workshop on Searching spontaneous conversational speech, 2009

Acoustic event detection for spotting "hot spots" in podcasts.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

Podcastle: collaborative training of acoustic models on the basis of wisdom of crowds for podcast transcription.
Proceedings of the 10th Annual Conference of the International Speech Communication Association, 2009

The use of acoustically detected filled and silent pauses in spontaneous speech recognition.
Proceedings of the IEEE International Conference on Acoustics, 2009

2008
A similar content retrieval method for podcast episodes.
Proceedings of the 2008 IEEE Spoken Language Technology Workshop, 2008

Hyperlinking Lyrics: A Method for Creating Hyperlinks Between Phrases in Song Lyrics.
Proceedings of the ISMIR 2008, 2008

2007
Detection and Separation of Speech Events in Meeting Recordings Using a Microphone Array.
EURASIP J. Audio Speech Music. Process., 2007

Automatic transcription for a web 2.0 service to search podcasts.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Podcastle: a web 2.0 approach to speech recognition research.
Proceedings of the 8th Annual Conference of the International Speech Communication Association, 2007

Presentation sensei: a presentation training system using speech and image processing.
Proceedings of the 9th International Conference on Multimodal Interfaces, 2007

2006
Stream-Based Classification and Segmentation of Speech Events in Meeting Recordings.
Proceedings of the Multimedia Content Representation, 2006

Automatic Synchronization between Lyrics and Music CD Recordings Based on Viterbi Alignment of Segregated Vocal Signals.
Proceedings of the Eighth IEEE International Symposium on Multimedia (ISM 2006), 2006

Detection and separation of speech events in meeting recordings.
Proceedings of the Ninth International Conference on Spoken Language Processing, 2006

Speech pen: predictive handwriting based on ambient multimodal recognition.
Proceedings of the 2006 Conference on Human Factors in Computing Systems, 2006

2005
Recognition of speech from live sports coverage using acoustic and language model adaptation.
Syst. Comput. Jpn., 2005

Speech repair: quick error correction just by using selection operation for speech input interfaces.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

State estimation of meetings by information fusion using Bayesian network.
Proceedings of the 9th European Conference on Speech Communication and Technology, 2005

2004
Detection and Separation of Speech Event Using Audio and Video Information Fusion and Its Application to Robust Speech Interface.
EURASIP J. Adv. Signal Process., 2004

A Drum Pattern Retrieval Method by Voice Percussion.
Proceedings of the ISMIR 2004, 2004

Robust speech interface based on audio and video information fusion for humanoid HRP-2.
Proceedings of the 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, September 28, 2004

2003
Topic segmentation and retrieval system for lecture videos based on spontaneous speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Syllable-based acoustic modeling for Japanese spontaneous speech recognition.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

Live speech recognition in sports games by adaptation of acoustic model and language model.
Proceedings of the 8th European Conference on Speech Communication and Technology, EUROSPEECH 2003, 2003

2002
Unsupervised acoustic model adaptation based on phoneme error minimization.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

English call system with functions of speech segmentation and pronunciation evaluation using speech recognition technology.
Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP2002, 2002

2001
Improved speech recognition using iterative decoding based on confidence measures.
Proceedings of the EUROSPEECH 2001 Scandinavia, 2001

2000
Study on New Term Weighting Method and New Vector Space Model Based on Word Space in Spoken Document Retrieval.
Proceedings of the Computer-Assisted Information Retrieval (Recherche d'Information et ses Applications), 2000

Topic segmentation of news speech using word similarity.
Proceedings of the 8th ACM International Conference on Multimedia 2000, Los Angeles, CA, USA, October 30, 2000

Expanded vector space model based on word space in cross media retrieval of news speech data.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

An efficient lexical tree search for large vocabulary continuous speech recognition.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

Large vocabulary continuous speech recognition under real environments using adaptive sub-band spectral subtraction.
Proceedings of the Sixth International Conference on Spoken Language Processing, 2000

1998
Indexing and classification of TV news articles based on speech dictation using word bigram.
Proceedings of the 5th International Conference on Spoken Language Processing, Incorporating The 7th Australian International Speech Science and Technology Conference, Sydney Convention Centre, Sydney, Australia, 30th November, 1998

News Dictation and Article Classification Using Automatically Extracted Announcer Utterance.
Proceedings of the Advanced Multimedia Content Processing, First International Conference, 1998

1995
Allele-specific methylation and expression of an imprinted U2af1-rs1 (SP2) gene.
Nucleic Acids Res., 1995

1992
Neural Network Approaches for Attractive Area Extraction from Video Images.
Proceedings of IAPR Workshop on Machine Vision Applications, 1992

