Jonathan Huang

According to our database1, Jonathan Huang authored at least 85 papers between 1992 and 2024.

Collaborative distances:

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think.
CoRR, 2024

Learning Hierarchical Semantic Classification by Grounding on Consistent Image Segmentations.
CoRR, 2024


Tree-D Fusion: Simulation-Ready Tree Dataset from Single Images with Diffusion Priors.
Proceedings of the Computer Vision - ECCV 2024, 2024

Optimizing Factorized Encoder Models: Time and Memory Reduction for Scalable and Efficient Action Recognition.
Proceedings of the Computer Vision - ECCV 2024, 2024

2023
How is Fatherhood Framed Online in Singapore?
CoRR, 2023

Optimizing ViViT Training: Time and Memory Reduction for Action Recognition.
CoRR, 2023

DaTaSeg: Taming a Universal Multi-Dataset Multi-Task Segmentation Model.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Partisan US News Media Representations of Syrian Refugees.
Proceedings of the Seventeenth International AAAI Conference on Web and Social Media, 2023

Learning to Detect Novel and Fine-Grained Acoustic Sequences Using Pretrained Audio Representations.
Proceedings of the IEEE International Conference on Acoustics, 2023

Text and Click inputs for unambiguous open vocabulary instance segmentation.
Proceedings of the 34th British Machine Vision Conference 2023, 2023

2022
Partisan US News Media Representations of Syrian Refugees.
CoRR, 2022

PERF-Net: Pose Empowered RGB-Flow Net.
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2022

The Auto Arborist Dataset: A Large-Scale Benchmark for Multiview Urban Forest Monitoring Under Domain Shift.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022

2021
Local Metrics for Multi-Object Tracking.
CoRR, 2021

The surprising impact of mask-head architecture on novel class segmentation.
Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision, 2021

2020
Investigating topics, audio representations and attention for multimodal scene-aware dialog.
Comput. Speech Lang., 2020

Utterance-level Intent Recognition from Keywords.
CoRR, 2020

Modern Architectures for Core Computer Vision on Videos.
Proceedings of the 2020 International Conference on Systems, Signals and Image Processing, 2020

Compact Speaker Embedding: lrx-Vector.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Length- and Noise-Aware Training Techniques for Short-Utterance Speaker Recognition.
Proceedings of the 21st Annual Conference of the International Speech Communication Association, 2020

Structural Sparsification for Far-Field Speaker Recognition with Intel® Gna.
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020

A Speaker Recognition Approach to Anomaly Detection.
Proceedings of 5th the Workshop on Detection and Classification of Acoustic Scenes and Events 2020 (DCASE 2020), 2020

RetinaTrack: Online Single Stage Joint Detection and Tracking.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

Context R-CNN: Long Term Temporal Context for Per-Camera Object Detection.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Exploring Context, Attention and Audio Features for Audio Visual Scene-Aware Dialog.
CoRR, 2019

Long Term Temporal Context for Per-Camera Object Detection.
CoRR, 2019

Structural sparsification for Far-field Speaker Recognition with GNA.
CoRR, 2019

Leveraging Topics and Audio Features with Multimodal Attention for Audio Visual Scene-Aware Dialog.
Proceedings of the Visually Grounded Interaction and Language (ViGIL), 2019

Intel Far-Field Speaker Recognition System for VOiCES Challenge 2019.
Proceedings of the 20th Annual Conference of the International Speech Communication Association, 2019

Acoustic Scene Classification Using Deep Learning-based Ensemble Averaging.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

GCC-PHAT Cross-Correlation Audio Features for Simultaneous Sound Event Localization and Detection (SELD) on Multiple Rooms.
Proceedings of the Workshop on Detection and Classification of Acoustic Scenes and Events 2019 (DCASE 2019), 2019

Diverse Generation for Multi-Agent Sports Games.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019

Uncertainty aware audiovisual activity recognition using deep Bayesian variational inference.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2019

2018
V-Speech: Noise-Robust Speech Capturing Glasses Using Vibration Sensors.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2018

Context, Attention and Audio Feature Explorations for Audio Visual Scene-Aware Dialog.
CoRR, 2018

Uncertainty aware multimodal activity recognition with Bayesian inference.
CoRR, 2018

Multimodal Relational Tensor Network for Sentiment and Emotion Classification.
CoRR, 2018

Generative Models of Visually Grounded Imagination.
Proceedings of the 6th International Conference on Learning Representations, 2018

Sufficiency Quantification for Seamless Text-Independent Speaker Enrollment.
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018

Rethinking Spatiotemporal Feature Learning: Speed-Accuracy Trade-offs in Video Classification.
Proceedings of the Computer Vision - ECCV 2018, 2018

Learning to Segment via Cut-and-Paste.
Proceedings of the Computer Vision - ECCV 2018, 2018

Progressive Neural Architecture Search.
Proceedings of the Computer Vision - ECCV 2018, 2018

2017
Introduction to the Issue on Signal Processing and Machine Learning.
IEEE J. Sel. Top. Signal Process., 2017

Mago: Mode of Transport Inference Using the Hall-Effect Magnetic Sensor and Accelerometer.
Proc. ACM Interact. Mob. Wearable Ubiquitous Technol., 2017

Rethinking Spatiotemporal Feature Learning For Video Understanding.
CoRR, 2017

Progressive Neural Architecture Search.
CoRR, 2017

Motion Prediction Under Multimodality with Conditional Stochastic Networks.
CoRR, 2017

Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

Spatially Adaptive Computation Time for Residual Networks.
Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition, 2017

2016
Detecting Events and Key Actors in Multi-person Videos.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

Generation and Comprehension of Unambiguous Object Descriptions.
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016

2015
Efficient inference in occlusion-aware generative models of images.
CoRR, 2015

Deep Knowledge Tracing.
Proceedings of the Advances in Neural Information Processing Systems 28: Annual Conference on Neural Information Processing Systems 2015, 2015

What's Cookin'? Interpreting Cooking Videos using Text, Speech and Vision.
Proceedings of the NAACL HLT 2015, The 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Denver, Colorado, USA, May 31, 2015

Autonomously Generating Hints by Inferring Problem Solving Policies.
Proceedings of the Second ACM Conference on Learning @ Scale, 2015

Multiple Orderings of Events in Disease Progression.
Proceedings of the Information Processing in Medical Imaging, 2015

Meeting assistant application.
Proceedings of the 16th Annual Conference of the International Speech Communication Association, 2015

Learning Program Embeddings to Propagate Feedback on Student Code.
Proceedings of the 32nd International Conference on Machine Learning, 2015

Im2Calories: Towards an Automated Mobile Vision Food Diary.
Proceedings of the 2015 IEEE International Conference on Computer Vision, 2015

2014
Codewebs: scalable homework search for massive open online programming courses.
Proceedings of the 23rd International World Wide Web Conference, 2014

Superposter behavior in MOOC forums.
Proceedings of the First (2014) ACM Conference on Learning @ Scale, 2014

Unobtrusive gait verification for mobile phones.
Proceedings of the ISWC'14, 2014

2013
Tuned Models of Peer Assessment in MOOCs.
Proceedings of the 6th International Conference on Educational Data Mining, 2013

Syntactic and Functional Variability of a Million Code Submissions in a Machine Learning MOOC.
Proceedings of the Workshops at the 16th International Conference on Artificial Intelligence in Education AIED 2013, 2013

2012
Riffled Independence for Efficient Inference with Partial Rankings.
J. Artif. Intell. Res., 2012

Probabilistic Event Cascades for Alzheimer's disease.
Proceedings of the Advances in Neural Information Processing Systems 25: 26th Annual Conference on Neural Information Processing Systems 2012. Proceedings of a meeting held December 3-6, 2012

2011
Efficient Probabilistic Inference with Partial Ranking Queries.
Proceedings of the UAI 2011, 2011

Fourier-Information Duality in the Identity Management Problem.
Proceedings of the Machine Learning and Knowledge Discovery in Databases, 2011

2010
Uncovering the Riffled Independence Structure of Rankings
CoRR, 2010

Learning Hierarchical Riffle Independent Groupings from Rankings.
Proceedings of the 27th International Conference on Machine Learning (ICML-10), 2010

An Extensible Sensor based Inferencing Framework for Context Aware Applications.
Proceedings of the 10th IEEE International Conference on Computer and Information Technology, 2010

2009
Exploiting Probabilistic Independence for Permutations.
Proceedings of the Twelfth International Conference on Artificial Intelligence and Statistics, 2009

Fourier Theoretic Probabilistic Inference over Permutations.
J. Mach. Learn. Res., 2009

Riffled Independence for Ranked Data.
Proceedings of the Advances in Neural Information Processing Systems 22: 23rd Annual Conference on Neural Information Processing Systems 2009. Proceedings of a meeting held 7-10 December 2009, 2009

Hilbert space embeddings of conditional distributions with applications to dynamical systems.
Proceedings of the 26th Annual International Conference on Machine Learning, 2009

2007
Edge Processing and Enterprise Integration: Closing the Gap on Deployable Industrial Sensor Networks.
Proceedings of the Fourth Annual IEEE Communications Society Conference on Sensor, 2007

Efficient Inference for Distributions on Permutations.
Proceedings of the Advances in Neural Information Processing Systems 20, 2007

2005
Intel mote 2: an advanced platform for demanding sensor network applications.
Proceedings of the 3rd International Conference on Embedded Networked Sensor Systems, 2005

The Intel Mote platform: a bluetooth-based sensor network for industrial monitoring.
Proceedings of the Fourth International Symposium on Information Processing in Sensor Networks, 2005

2004
Intel Mote: using bluetooth in sensor networks.
Proceedings of the 2nd International Conference on Embedded Networked Sensor Systems, 2004

2000
Subband-based adaptive decorrelation filtering for co-channel speech separation.
IEEE Trans. Speech Audio Process., 2000

Exploring property-based document organization in a collaborative note-sharing system.
Proceedings of the CHI '00 Extended Abstracts on Human Factors in Computing Systems, 2000

1999
NotePals: Light Weight Note Sharing by the Group, for the Group.
Proceedings of the Proceeding of the CHI '99 Conference on Human Factors in Computing Systems: The CHI is the Limit, 1999

1992
Inductive techniques for formal verification of systolic array designs in DSP applications.
Proceedings of the 1992 IEEE International Conference on Acoustics, 1992


  Loading...