We stand with Ukraine

We stand with Ukraine

Zhenfang Chen

Orcid: 0000-0002-8470-2709

According to our database¹, Zhenfang Chen authored at least 46 papers between 2017 and 2024.

Collaborative distances:

Dijkstra number² of four.
Erdős number³ of four.

Timeline

Legend:

Book

In proceedings

Article

PhD thesis

Dataset

Other

Links

On csauthors.net:

Bibliography

2024

Compositional Physical Reasoning of Objects and Events from Videos.

[BibT_eX]

[DOI]

,

,

,

,

,

Antonio Torralba

,

Joshua B. Tenenbaum

,

CoRR, 2024

Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2024

Augmenting Embodied Learning in Welding Training: The Co-Design of an XR- and tinyML-Enabled Welding System for Creative Arts and Manufacturing Training.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Dina El-Zanfaly

Proceedings of the Eighteenth International Conference on Tangible, 2024

XRweld: An In-Situ Extended Reality Platform for Welding Education.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Dina El-Zanfaly

Proceedings of the ACM SIGGRAPH 2024 Immersive Pavilion, 2024

ContPhy: Continuum Physical Concept Learning and Reasoning from Videos.

[BibT_eX]

[DOI]

,

,

,

,

Qin Zhi Eddie Lim

,

Joshua B. Tenenbaum

,

Proceedings of the Forty-first International Conference on Machine Learning, 2024

SALMON: Self-Alignment with Instructable Reward Models.

[BibT_eX]

[DOI]

,

,

,

,

,

David Daniel Cox

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

GENOME: Generative Neuro-Symbolic Visual Reasoning by Growing and Reusing Modules.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Twelfth International Conference on Learning Representations, 2024

FlexAttention for Efficient High-Resolution Vision-Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Computer Vision - ECCV 2024, 2024

SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Visual Chain-of-Thought Prompting for Knowledge-Based Visual Reasoning.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023

Deep Face Video Inpainting via UV Mapping.

[BibT_eX]

[DOI]

,

,

,

,

Kwan-Yee K. Wong

IEEE Trans. Image Process., 2023

SALMON: Self-Alignment with Principle-Following Reward Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

CoRR, 2023

ModuleFormer: Learning Modular Large Language Models From Uncurated Data.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2023

See, Think, Confirm: Interactive Prompting Between Vision and Language Models for Knowledge-based Visual Reasoning.

[BibT_eX]

[DOI]

,

,

,

,

,

CoRR, 2023

Augmenting Welding Training: An XR Platform to Foster Muscle Memory and Mindfulness for Skills Development.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Dina El-Zanfaly

,

Proceedings of the Companion Proceedings of the 2023 Conference on Interactive Surfaces and Spaces, 2023

Physion++: Evaluating Physical Scene Understanding that Requires Online Inference of Different Physical Properties.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision.

[BibT_eX]

[DOI]

,

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

3D-LLM: Injecting the 3D World into Large Language Models.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Planning with Large Language Models for Code Generation.

[BibT_eX]

[DOI]

,

,

,

,

Joshua B. Tenenbaum

,

Proceedings of the Eleventh International Conference on Learning Representations, 2023

TextPSG: Panoptic Scene Graph Generation from Textual Descriptions.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Sparse Universal Transformer.

[BibT_eX]

[DOI]

,

,

,

Aaron C. Courville

,

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

3D Concept Learning and Reasoning from Multi-View Images.

[BibT_eX]

[DOI]

,

,

,

,

Joshua B. Tenenbaum

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Joshua B. Tenenbaum

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners.

[BibT_eX]

[DOI]

,

,

,

,

Hengshuang Zhao

,

Erik G. Learned-Miller

,

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022

Mod-Squad: Designing Mixture of Experts As Modular Multi-Task Learners.

[BibT_eX]

[DOI]

,

,

,

,

Hengshuang Zhao

,

Erik G. Learned-Miller

,

CoRR, 2022

S<sup>3</sup>-NeRF: Neural Reflectance Field from Shading and Shadow under a Single Viewpoint.

[BibT_eX]

[DOI]

,

,

,

,

Kwan-Yee K. Wong

Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ComPhy: Compositional Physical Reasoning of Objects and Events from Videos.

[BibT_eX]

[DOI]

,

,

,

,

Antonio Torralba

,

Joshua B. Tenenbaum

,

Proceedings of the Tenth International Conference on Learning Representations, 2022

A Unified Framework for Masked and Mask-Free Face Recognition Via Feature Rectification.

[BibT_eX]

[DOI]

,

,

,

Kwan-Yee K. Wong

Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

PS-NeRF: Neural Inverse Rendering for Multi-view Photometric Stereo.

[BibT_eX]

[DOI]

,

,

,

,

Kwan-Yee K. Wong

Proceedings of the Computer Vision - ECCV 2022, 2022

Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following.

[BibT_eX]

[DOI]

,

,

,

David Daniel Cox

,

,

Joshua B. Tenenbaum

,

Proceedings of the Conference on Robot Learning, 2022

Memory Portal: Investigating Data Meaning-making through Spatio-temporal Experiences.

[BibT_eX]

[DOI]

Dina EL-Zanfaly

,

,

,

Rachel Ann Arredondo

,

,

Christianne Francovich

Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

Google Home, Listen: Building Helper Intelligences for Non-Verbal Sound.

[BibT_eX]

[DOI]

,

,

Dina EL-Zanfaly

Proceedings of the C&C '22: Creativity and Cognition, Venice, Italy, June 20 - 23, 2022, 2022

Spooky Technology: The ethereal and otherworldly as a resource for design.

[BibT_eX]

[DOI]

,

,

,

,

Anuprita Ranade

,

,

Katherine Giesa

,

,

Catherine Yochum

,

Gordon Robertson

,

Lisa (Yip Yan) Yeung

,

,

,

,

,

,

Alexander Heyison

,

Proceedings of the DIS '22: Designing Interactive Systems Conference, Virtual Event, Australia, June 13, 2022

2021

STAR: A Benchmark for Situated Reasoning in Real-World Videos.

[BibT_eX]

[DOI]

,

,

,

,

Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning.

[BibT_eX]

[DOI]

,

,

,

Kwan-Yee Kenneth Wong

,

Joshua B. Tenenbaum

,

Proceedings of the 9th International Conference on Learning Representations, 2021

The Blessings of Unlabeled Background in Untrimmed Videos.

[BibT_eX]

[DOI]

,

,

,

,

Jianqiang Huang

,

Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

SneezeLove: Embodying Cultural Superstitions in Connected Devices.

[BibT_eX]

[DOI]

,

,

,

Proceedings of the DIS '21: Designing Interactive Systems Conference 2021, 2021

2020

Look Closer to Ground Better: Weakly-Supervised Temporal Grounding of Sentence in Video.

[BibT_eX]

[DOI]

,

,

,

,

Kwan-Yee K. Wong

CoRR, 2020

Cops-Ref: A New Dataset and Task on Compositional Referring Expression Comprehension.

[BibT_eX]

[DOI]

,

,

,

Kwan-Yee K. Wong

,

Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019

Learning Local Similarity with Spatial Relations for Object Retrieval.

[BibT_eX]

[DOI]

,

,

,

Kwan-Yee K. Wong

Proceedings of the 27th ACM International Conference on Multimedia, 2019

Weakly-Supervised Spatio-Temporally Grounding Natural Sentence in Video.

[BibT_eX]

[DOI]

,

,

,

Kwan-Yee Kenneth Wong

Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018

Design and Fabrication of a Piezoelectric Micromachined Ultrasonic Transducer Array Based on Ceramic PZT.

[BibT_eX]

[DOI]

,

,

,

,

,

Proceedings of the 2018 IEEE SENSORS, New Delhi, India, October 28-31, 2018, 2018

Boosting up Scene Text Detectors with Guided CNN.

[BibT_eX]

[DOI]

,

,

,

,

,

,

Proceedings of the British Machine Vision Conference 2018, 2018

2017

Aggregated Deep Feature from Activation Clusters for Particular Object Retrieval.

[BibT_eX]

[DOI]

,

,

Kwan-Yee K. Wong

,

Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017

Loading...