Hideki Nakayama

Orcid: 0000-0001-8726-2780

According to our database1, Hideki Nakayama authored at least 129 papers between 2003 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Enhanced Data Transfer Cooperating with Artificial Triplets for Scene Graph Generation.
IEICE Trans. Inf. Syst., 2024

BrainCodec: Neural fMRI codec for the decoding of cognitive brain states.
CoRR, 2024

HALL-E: Hierarchical Neural Codec Language Model for Minute-Long Zero-Shot Text-to-Speech Synthesis.
CoRR, 2024

Harnessing the Latent Diffusion Model for Training-Free Image Style Transfer.
CoRR, 2024

Cohesive Conversations: Enhancing Authenticity in Multi-Agent Simulated Dialogues.
CoRR, 2024

A Better LLM Evaluator for Text Generation: The Impact of Prompt Output Sequencing and Optimization.
CoRR, 2024

LLM as a Scorer: The Impact of Output Order on Dialogue Evaluation.
CoRR, 2024

MILE: Memory-Interactive Learning Engine for Neuro-Symbolic Solutions to Mathematical Problems.
IEEE Access, 2024

FedFit: Server Aggregation Through Linear Regression in Federated Learning.
IEEE Access, 2024

Label Augmentation as Inter-class Data Augmentation for Conditional Image Synthesis with Imbalanced Data.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Revisiting Latent Space of GAN Inversion for Robust Real Image Editing.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

Soft Curriculum for Learning Conditional GANs with Noisy-Labeled and Uncurated Unlabeled Data.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024

A Compact Dynamic 3D Gaussian Representation for Real-Time Dynamic View Synthesis.
Proceedings of the Computer Vision - ECCV 2024, 2024

LayoutFlow: Flow Matching for Layout Generation.
Proceedings of the Computer Vision - ECCV 2024, 2024

Evcap: Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Recent Trends in Personalized Dialogue Generation: A Review of Datasets, Methodologies, and Evaluations.
Proceedings of the 2024 Joint International Conference on Computational Linguistics, 2024

2023
Two-Path Object Knowledge Injection for Detecting Novel Objects With Single-Stage Dense Detector.
IEICE Trans. Inf. Syst., November, 2023

An Efficient 3D Gaussian Representation for Monocular/Multi-view Dynamic Scenes.
CoRR, 2023

Revisiting Latent Space of GAN Inversion for Real Image Editing.
CoRR, 2023

Balancing Reconstruction and Editing Quality of GAN Inversion for Real Image Editing with StyleGAN Prior Latent Space.
CoRR, 2023

Indirect Adversarial Losses via an Intermediate Distribution for Training GANs.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2023

Handwritten Text Generation with Character-Specific Encoding for Style Imitation.
Proceedings of the Document Analysis and Recognition - ICDAR 2023, 2023

Partition-and-Debias: Agnostic Biases Mitigation via A Mixture of Biases-Specific Experts.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

LED: A Dataset for Life Event Extraction from Dialogs.
Proceedings of the Findings of the Association for Computational Linguistics: EACL 2023, 2023

A-CAP: Anticipation Captioning with Commonsense Knowledge.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Towards Parameter-Efficient Integration of Pre-Trained Language Models In Temporal Video Grounding.
Proceedings of the Findings of the Association for Computational Linguistics: ACL 2023, 2023

2022
Improving Noised Gradient Penalty with Synchronized Activation Function for Generative Adversarial Networks.
IEICE Trans. Inf. Syst., September, 2022

Stochastically Flipping Labels of Discriminator's Outputs for Training Generative Adversarial Networks.
IEEE Access, 2022

PPCD-GAN: Progressive Pruning and Class-Aware Distillation for Large-Scale Conditional GANs Compression.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Meta Approach to Data Augmentation Optimization.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

Neural Networks in a Product of Hyperbolic Spaces.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop, 2022

How do people talk about images? A study on open-domain conversations with images.
Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop, 2022

DJMix: Unsupervised Task-agnostic Image Augmentation for Improving Robustness of Convolutional Neural Networks.
Proceedings of the International Joint Conference on Neural Networks, 2022

Pixel to Binary Embedding Towards Robustness for CNNs.
Proceedings of the 26th International Conference on Pattern Recognition, 2022

StoryER: Automatic Story Evaluation via Ranking, Rating and Reasoning.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

Character-centric Story Visualization via Visual Planning and Token Alignment.
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, 2022

NOC-REK: Novel Object Captioning with Retrieved Vocabulary from External Knowledge.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

OSSGAN: Open-Set Semi-Supervised Image Generation.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

Weakly Supervised Formula Learner for Solving Mathematical Problems.
Proceedings of the 29th International Conference on Computational Linguistics, 2022

RNSum: A Large-Scale Dataset for Automatic Release Note Generation via Commit Logs Summarization.
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2022

2021
Graph Energy-based Model for Substructure Preserving Molecular Design.
CoRR, 2021

MADGAN: unsupervised medical anomaly detection GAN using multiple adjacent brain MRI slice reconstruction.
BMC Bioinform., 2021

Object Recognition with Continual Open Set Domain Adaptation for Home Robot.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2021

JokerGAN: Memory-Efficient Model for Handwritten Text Generation with Text Line Awareness.
Proceedings of the MM '21: ACM Multimedia Conference, Virtual Event, China, October 20, 2021

Stochastic Observation Prediction for Efficient Reinforcement Learning in Robotics.
Proceedings of the 4th IEEE International Conference on Multimedia Information Processing and Retrieval, 2021

GraphPlan: Story Generation by Planning with Event Graph.
Proceedings of the 14th International Conference on Natural Language Generation, 2021

DCT-based Fast Spectral Convolution for Deep Convolutional Neural Networks.
Proceedings of the International Joint Conference on Neural Networks, 2021

Open-Set Domain Generalization VIA Metric Learning.
Proceedings of the 2021 IEEE International Conference on Image Processing, 2021

Semantic Image Synthesis from Inaccurate and Coarse Masks.
Proceedings of the IEEE International Conference on Acoustics, 2021

Visualizing Association in Exemplar-Based Classification.
Proceedings of the IEEE International Conference on Acoustics, 2021

SciXGen: A Scientific Paper Dataset for Context-Aware Text Generation.
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2021, 2021


Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling.
Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence, 2021

2020
CNN-Based Prostate Zonal Segmentation on T2-Weighted MR Images: A Cross-Dataset Study.
Proceedings of the Neural Approaches to Dynamics of Signal Exchanges, 2020

Infinite Brain MR Images: PGGAN-Based Data Augmentation for Tumor Detection.
Proceedings of the Neural Approaches to Dynamics of Signal Exchanges, 2020

Unsupervised Discourse Constituency Parsing Using Viterbi EM.
Trans. Assoc. Comput. Linguistics, 2020

Erasing Scene Text with Weak Supervision.
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2020

Efficient Base Class Selection Algorithms for Few-Shot Classification.
Proceedings of the 2020 on International Conference on Multimedia Retrieval, 2020

A Visually-Grounded Parallel Corpus with Phrase-to-Region Linking.
Proceedings of The 12th Language Resources and Evaluation Conference, 2020

Bridging the Gap Between AI and Healthcare Sides: Towards Developing Clinically Relevant AI-Powered Diagnosis Systems.
Proceedings of the Artificial Intelligence Applications and Innovations, 2020

Unsupervised Visual Relationship Inference.
Proceedings of the IEEE International Conference on Image Processing, 2020

A Visually-grounded First-person Dialogue Dataset with Verbal and Non-verbal Responses.
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing, 2020

Faster AutoAugment: Learning Augmentation Strategies Using Backpropagation.
Proceedings of the Computer Vision - ECCV 2020, 2020

Supervised Visual Attention for Multimodal Neural Machine Translation.
Proceedings of the 28th International Conference on Computational Linguistics, 2020

Overview of the 7th Workshop on Asian Translation.
Proceedings of the 7th Workshop on Asian Translation, 2020

Single Model Ensemble using Pseudo-Tags and Distinct Vectors.
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, 2020

Graph-Based Heuristic Search for Module Selection Procedure in Neural Module Network.
Proceedings of the Computer Vision - ACCV 2020 - 15th Asian Conference on Computer Vision, Kyoto, Japan, November 30, 2020

Latent-Variable Non-Autoregressive Neural Machine Translation with Deterministic Inference Using a Delta Posterior.
Proceedings of the Thirty-Fourth AAAI Conference on Artificial Intelligence, 2020

2019
USE-Net: Incorporating Squeeze-and-Excitation blocks into U-Net for prostate zonal segmentation of multi-institutional MRI datasets.
Neurocomputing, 2019

GAN-based Multiple Adjacent Brain MRI Slice Reconstruction for Unsupervised Alzheimer's Disease Diagnosis.
CoRR, 2019

Learning More with Less: GAN-based Medical Image Augmentation.
CoRR, 2019

CNN-based Prostate Zonal Segmentation on T2-weighted MR Images: A Cross-dataset Study.
CoRR, 2019

Infinite Brain MR Images: PGGAN-based Data Augmentation for Tumor Detection.
CoRR, 2019

Combining Noise-to-Image and Image-to-Image GANs: Brain MR Image Augmentation for Tumor Detection.
IEEE Access, 2019

Enabling Real-time Neural IME with Incremental Vocabulary Selection.
Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2019

Shifted Spatial-Spectral Convolution for Deep Neural Networks.
Proceedings of the MMAsia '19: ACM Multimedia Asia, Beijing, China, December 16-18, 2019, 2019

Empirical Study of Easy and Hard Examples in CNN Training.
Proceedings of the Neural Information Processing - 26th International Conference, 2019

Bipolar Gan: Double Check the Solution Space and Lighten False Positive Errors in Generative Adversarial Nets.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

DCT Based Information-Preserving Pooling for Deep Neural Networks.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

LOL: Learning To Optimize Loss Switching Under Label Noise.
Proceedings of the 2019 IEEE International Conference on Image Processing, 2019

Learning More with Less: Conditional PGGAN-based Data Augmentation for Brain Metastases Detection Using Highly-Rough Annotation on MR Images.
Proceedings of the 28th ACM International Conference on Information and Knowledge Management, 2019

GAN-Based Multiple Adjacent Brain MRI Slice Reconstruction for Unsupervised Alzheimer's Disease Diagnosis.
Proceedings of the Computational Intelligence Methods for Bioinformatics and Biostatistics, 2019

Generating Diverse Translations with Sentence Codes.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

Synthesizing Diverse Lung Nodules Wherever Massively: 3D Multi-Conditional GAN-Based CT Image Augmentation for Object Detection.
Proceedings of the 2019 International Conference on 3D Vision, 2019

2018
Recurrent Visual Relationship Recognition with Triplet Unit for Diversity.
Int. J. Semantic Comput., 2018

Real-time Neural-based Input Method.
CoRR, 2018

Discrete Structural Planning for Neural Machine Translation.
CoRR, 2018

Coherence Modeling Improves Implicit Discourse Relation Recognition.
Proceedings of the 19th Annual SIGdial Meeting on Discourse and Dialogue, 2018

Deep Learning for Forecasting Stock Returns in the Cross-Section.
Proceedings of the Advances in Knowledge Discovery and Data Mining, 2018

PoB: Toward Reasoning Patterns of Beauty in Image Data.
Proceedings of the 2018 ACM Multimedia Conference on Multimedia Conference, 2018

Augmenting Image Question Answering Dataset by Exploiting Image Captions.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

Incorporating Semantic Attention in Video Description Generation.
Proceedings of the Eleventh International Conference on Language Resources and Evaluation, 2018

GAN-based synthetic brain MR image generation.
Proceedings of the 15th IEEE International Symposium on Biomedical Imaging, 2018

Compressing Word Embeddings via Deep Compositional Code Learning.
Proceedings of the 6th International Conference on Learning Representations, 2018

Improving Beam Search by Removing Monotonic Constraint for Neural Machine Translation.
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics, 2018

Semantic Aware Attention Based Deep Object Co-segmentation.
Proceedings of the Computer Vision - ACCV 2018, 2018

2017
Zero-resource machine translation by multimodal encoder-decoder network with multimedia pivot.
Mach. Transl., 2017

Parameter Reference Loss for Unsupervised Domain Adaptation.
CoRR, 2017

Single-Queue Decoding for Neural Machine Translation.
CoRR, 2017

Later-stage Minimum Bayes-Risk Decoding for Neural Machine Translation.
CoRR, 2017

Recurrent Visual Relationship Recognition with Triplet Unit.
Proceedings of the 19th IEEE International Symposium on Multimedia, 2017

Word Ordering as Unsupervised Learning Towards Syntactically Plausible Word Representations.
Proceedings of the Eighth International Joint Conference on Natural Language Processing, 2017

Bag of Local Convolutional Triplets for Script Identification in Scene Text.
Proceedings of the 14th IAPR International Conference on Document Analysis and Recognition, 2017

An Empirical Study of Adequate Vision Span for Attention-Based Neural Machine Translation.
Proceedings of the First Workshop on Neural Machine Translation, 2017

2016
Efficient Two-Step Middle-Level Part Feature Extraction for Fine-Grained Visual Categorization.
IEICE Trans. Inf. Syst., 2016

Reducing Redundant Computations with Flexible Attention.
CoRR, 2016

Multimodal Content-Aware Image Thumbnailing.
Proceedings of the 25th International Conference on World Wide Web, 2016

Annotation order matters: Recurrent Image Annotator for arbitrary length image tagging.
Proceedings of the 23rd International Conference on Pattern Recognition, 2016

Generating Video Description using Sequence-to-sequence Model with Temporal Attention.
Proceedings of the COLING 2016, 2016

2015
Multimodal Gesture Recognition Using Multi-stream Recurrent Neural Network.
Proceedings of the Image and Video Technology - 7th Pacific-Rim Symposium, 2015

Unsupervised Cosegmentation based on Global Graph Matching.
Proceedings of the 23rd Annual ACM Conference on Multimedia Conference, MM '15, Brisbane, Australia, October 26, 2015

Image-Mediated Learning for Zero-Shot Cross-Lingual Document Retrieval.
Proceedings of the 2015 Conference on Empirical Methods in Natural Language Processing, 2015

2014
Unsupervised Visual Domain Adaptation Using Auxiliary Information in Target Domain.
Proceedings of the 2014 IEEE International Symposium on Multimedia, 2014

2013
Augmenting descriptors for fine-grained visual categorization using polynomial embedding.
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013

NLab-UTokyo at ImageCLEF 2013 Plant Identification Task.
Proceedings of the Working Notes for CLEF 2013 Conference , 2013

Efficient Discriminative Convolution Using Fisher Weight Map.
Proceedings of the British Machine Vision Conference, 2013

2010
Image Annotation and Retrieval for Weakly Labeled Images Using Conceptual Learning.
New Gener. Comput., 2010

Dense Sampling Low-Level Statistics of Local Features.
IEICE Trans. Inf. Syst., 2010

High-speed 3D object recognition using additive features in a linear subspace.
Proceedings of the IEEE International Conference on Robotics and Automation, 2010

Improving Local Descriptors by Embedding Global and Local Spatial Information.
Proceedings of the Computer Vision, 2010

Global Gaussian approach for scene categorization using information geometry.
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010

Evaluation of dimensionality reduction methods for image auto-annotation.
Proceedings of the British Machine Vision Conference, 2010

2009
Scene Classification Using Generalized Local Correlation.
Proceedings of the IAPR Conference on Machine Vision Applications (IAPR MVA 2009), 2009

Canonical contextual distance for large-scale image annotation and retrieval.
Proceedings of the First ACM workshop on Large-scale multimedia retrieval and mining, 2009

Image annotation and retrieval based on efficient learning of contextual latent space.
Proceedings of the 2009 IEEE International Conference on Multimedia and Expo, 2009

AI Goggles: Real-time Description and Retrieval in the Real World with Online Learning.
Proceedings of the Sixth Canadian Conference on Computer and Robot Vision, 2009

2008
High-Performance Image Annotation and Retrieval for Weakly Labeled Images Using Latent Space Learning.
Proceedings of the Advances in Multimedia Information Processing, 2008

2007
Journalist robot: robot system making news articles from real world.
Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, October 29, 2007

2003
Towards Implicit Invocation of Web Services Functions.
Proceedings of the Information Modelling and Knowledge Bases XV, 2003


  Loading...