Zhenfang Chen

Orcid: 0009-0005-1619-932X

According to our database1, Zhenfang Chen authored at least 46 papers between 2017 and 2024.

Collaborative distances:
  • Dijkstra number2 of four.
  • Erdős number3 of four.

Timeline

Legend:

Book 
In proceedings 
Article 
PhD thesis 
Dataset
Other 

Links

On csauthors.net:

Bibliography

2024
Compositional Physical Reasoning of Objects and Events from Videos.
CoRR, 2024

Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble.
CoRR, 2024

Augmenting Embodied Learning in Welding Training: The Co-Design of an XR- and tinyML-Enabled Welding System for Creative Arts and Manufacturing Training.
Proceedings of the Eighteenth International Conference on Tangible, 2024

XRweld: An In-Situ Extended Reality Platform for Welding Education.
Proceedings of the ACM SIGGRAPH 2024 Immersive Pavilion, 2024

ContPhy: Continuum Physical Concept Learning and Reasoning from Videos.
Proceedings of the Forty-first International Conference on Machine Learning, 2024

SALMON: Self-Alignment with Instructable Reward Models.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

GENOME: Generative Neuro-Symbolic Visual Reasoning by Growing and Reusing Modules.
Proceedings of the Twelfth International Conference on Learning Representations, 2024

FlexAttention for Efficient High-Resolution Vision-Language Models.
Proceedings of the Computer Vision - ECCV 2024, 2024

SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024

Visual Chain-of-Thought Prompting for Knowledge-Based Visual Reasoning.
Proceedings of the Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024

2023
Deep Face Video Inpainting via UV Mapping.
IEEE Trans. Image Process., 2023

SALMON: Self-Alignment with Principle-Following Reward Models.
CoRR, 2023

ModuleFormer: Learning Modular Large Language Models From Uncurated Data.
CoRR, 2023

See, Think, Confirm: Interactive Prompting Between Vision and Language Models for Knowledge-based Visual Reasoning.
CoRR, 2023

Augmenting Welding Training: An XR Platform to Foster Muscle Memory and Mindfulness for Skills Development.
Proceedings of the Companion Proceedings of the 2023 Conference on Interactive Surfaces and Spaces, 2023

Physion++: Evaluating Physical Scene Understanding that Requires Online Inference of Different Physical Properties.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

3D-LLM: Injecting the 3D World into Large Language Models.
Proceedings of the Advances in Neural Information Processing Systems 36: Annual Conference on Neural Information Processing Systems 2023, 2023

Planning with Large Language Models for Code Generation.
Proceedings of the Eleventh International Conference on Learning Representations, 2023

TextPSG: Panoptic Scene Graph Generation from Textual Descriptions.
Proceedings of the IEEE/CVF International Conference on Computer Vision, 2023

Sparse Universal Transformer.
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023

3D Concept Learning and Reasoning from Multi-View Images.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners.
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023

2022
Mod-Squad: Designing Mixture of Experts As Modular Multi-Task Learners.
CoRR, 2022

S<sup>3</sup>-NeRF: Neural Reflectance Field from Shading and Shadow under a Single Viewpoint.
Proceedings of the Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, 2022

ComPhy: Compositional Physical Reasoning of Objects and Events from Videos.
Proceedings of the Tenth International Conference on Learning Representations, 2022

A Unified Framework for Masked and Mask-Free Face Recognition Via Feature Rectification.
Proceedings of the 2022 IEEE International Conference on Image Processing, 2022

PS-NeRF: Neural Inverse Rendering for Multi-view Photometric Stereo.
Proceedings of the Computer Vision - ECCV 2022, 2022

Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following.
Proceedings of the Conference on Robot Learning, 2022

Memory Portal: Investigating Data Meaning-making through Spatio-temporal Experiences.
Proceedings of the CHI '22: CHI Conference on Human Factors in Computing Systems, New Orleans, LA, USA, 29 April 2022, 2022

Google Home, Listen: Building Helper Intelligences for Non-Verbal Sound.
Proceedings of the C&C '22: Creativity and Cognition, Venice, Italy, June 20 - 23, 2022, 2022

Spooky Technology: The ethereal and otherworldly as a resource for design.
Proceedings of the DIS '22: Designing Interactive Systems Conference, Virtual Event, Australia, June 13, 2022

2021
STAR: A Benchmark for Situated Reasoning in Real-World Videos.
Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks 1, 2021

Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language.
Proceedings of the Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, 2021

Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning.
Proceedings of the 9th International Conference on Learning Representations, 2021

The Blessings of Unlabeled Background in Untrimmed Videos.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2021

SneezeLove: Embodying Cultural Superstitions in Connected Devices.
Proceedings of the DIS '21: Designing Interactive Systems Conference 2021, 2021

2020
Look Closer to Ground Better: Weakly-Supervised Temporal Grounding of Sentence in Video.
CoRR, 2020

Cops-Ref: A New Dataset and Task on Compositional Referring Expression Comprehension.
Proceedings of the 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2020

2019
Learning Local Similarity with Spatial Relations for Object Retrieval.
Proceedings of the 27th ACM International Conference on Multimedia, 2019

Weakly-Supervised Spatio-Temporally Grounding Natural Sentence in Video.
Proceedings of the 57th Conference of the Association for Computational Linguistics, 2019

2018
Design and Fabrication of a Piezoelectric Micromachined Ultrasonic Transducer Array Based on Ceramic PZT.
Proceedings of the 2018 IEEE SENSORS, New Delhi, India, October 28-31, 2018, 2018

Boosting up Scene Text Detectors with Guided CNN.
Proceedings of the British Machine Vision Conference 2018, 2018

2017
Aggregated Deep Feature from Activation Clusters for Particular Object Retrieval.
Proceedings of the on Thematic Workshops of ACM Multimedia 2017, Mountain View, CA, USA, October 23, 2017


  Loading...