Liangliang Cao
Orcid: 0000-0003-0900-1512
According to our database1,
Liangliang Cao
authored at least 156 papers
between 2005 and 2024.
Collaborative distances:
Collaborative distances:
Timeline
Legend:
Book In proceedings Article PhD thesis Dataset OtherLinks
On csauthors.net:
Bibliography
2024
Cavia: Camera-controllable Multi-view Video Diffusion with View-Integrated Attention.
CoRR, 2024
MMCOMPOSITION: Revisiting the Compositionality of Pre-trained Vision-Language Models.
CoRR, 2024
Proceedings of the Twelfth International Conference on Learning Representations, 2024
Efficient-3Dim: Learning a Generalizable Single-image Novel-view Synthesizer in One Day.
Proceedings of the Twelfth International Conference on Learning Representations, 2024
2023
Efficient-NeRF2NeRF: Streamlining Text-Driven 3D Editing with Multiview Correspondence-Enhanced Diffusion Models.
CoRR, 2023
RoomDreamer: Text-Driven 3D Indoor Scene Synthesis with Coherent Geometry and Texture.
CoRR, 2023
Less is More: Removing Text-regions Improves CLIP Training Efficiency and Robustness.
CoRR, 2023
RoomDreamer: Text-Driven 3D Indoor Scene Synthesis with Coherent Geometry and Texture.
Proceedings of the 31st ACM International Conference on Multimedia, 2023
Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, 2023
2022
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition.
IEEE J. Sel. Top. Signal Process., 2022
CoRR, 2022
Comput. Graph. Forum, 2022
Improving Confidence Estimation on Out-of-Domain Data for End-to-End Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2022
2021
CoRR, 2021
Input Length Matters: An Empirical Study Of RNN-T And MWER Training For Long-form Telephony Speech Recognition.
CoRR, 2021
Bridging the gap between streaming and non-streaming ASR systems bydistilling ensembles of CTC and RNN-T models.
CoRR, 2021
Proceedings of the IEEE Spoken Language Technology Workshop, 2021
Multi-Task Learning for End-to-End ASR Word and Utterance Confidence with Deletion Prediction.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Bridging the Gap Between Streaming and Non-Streaming ASR Systems by Distilling Ensembles of CTC and RNN-T Models.
Proceedings of the 22nd Annual Conference of the International Speech Communication Association, Interspeech 2021, Brno, Czechia, August 30, 2021
Proceedings of the IEEE International Conference on Acoustics, 2021
Confidence Estimation for Attention-Based Sequence-to-Sequence Models for Speech Recognition.
Proceedings of the IEEE International Conference on Acoustics, 2021
Improving Streaming Automatic Speech Recognition with Non-Streaming Model Distillation on Unsupervised Data.
Proceedings of the IEEE International Conference on Acoustics, 2021
2020
Comput. Vis. Image Underst., 2020
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2020, 2020
Proceedings of The 12th Language Resources and Evaluation Conference, 2020
Proceedings of the 2020 IEEE International Conference on Acoustics, 2020
Proceedings of the Findings of the Association for Computational Linguistics: EMNLP 2020, 2020
Proceedings of the Computer Vision - ECCV 2020, 2020
2019
IEEE Trans. Pattern Anal. Mach. Intell., 2019
Accurate and Robust Pulmonary Nodule Detection by 3D Feature Pyramid Network with Self-supervised Feature Learning.
CoRR, 2019
3DFPN-HS<sup>2</sup>: 3D Feature Pyramid Network Based High Sensitivity and Specificity Pulmonary Nodule Detection.
CoRR, 2019
3DFPN-HS ^2 2 : 3D Feature Pyramid Network Based High Sensitivity and Specificity Pulmonary Nodule Detection.
Proceedings of the Medical Image Computing and Computer Assisted Intervention - MICCAI 2019, 2019
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2019
Proceedings of the 30th British Machine Vision Conference 2019, 2019
2018
Proceedings of the 47th International Conference on Parallel Processing, 2018
Proceedings of the 2018 IEEE International Conference on Acoustics, 2018
Proceedings of the IEEE Global Communications Conference, 2018
Proceedings of the 2018 IEEE Conference on Computer Vision and Pattern Recognition, 2018
2017
Context-Associative Hierarchical Memory Model for Human Activity Recognition and Prediction.
IEEE Trans. Multim., 2017
Mining Fashion Outfit Composition Using an End-to-End Deep Learning Approach on Set Data.
IEEE Trans. Multim., 2017
CoRR, 2017
Proceedings of the Tenth ACM International Conference on Web Search and Data Mining, 2017
Proceedings of the 2017 ACM on Multimedia Conference, 2017
Proceedings of the IEEE International Conference on Computer Vision, 2017
Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, 2017
2016
Proceedings of the Deep Learning and Convolutional Neural Networks for Medical Image Computing, 2016
Knowl. Based Syst., 2016
Robust Visual-Textual Sentiment Analysis: When Attention meets Tree-structured Recursive Neural Networks.
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the 2016 ACM Conference on Multimedia Conference, 2016
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016
Proceedings of the 2016 ACM on International Conference on Multimedia Retrieval, 2016
Proceedings of the Twenty-Fifth International Joint Conference on Artificial Intelligence, 2016
Proceedings of the 25th ACM International Symposium on High-Performance Parallel and Distributed Computing, 2016
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016
Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition, 2016
Proceedings of the British Machine Vision Conference 2016, 2016
Poker-CNN: A Pattern Learning Strategy for Making Draws and Bets in Poker Games Using Convolutional Networks.
Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, 2016
2015
IEEE Trans. Multim., 2015
IEEE Trans. Image Process., 2015
CoRR, 2015
LSIF: A System for Large-Scale Information Flow Detection Based on Topic-Related Semantic Similarity Measurement.
Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology, 2015
Proceedings of the 2015 IEEE Winter Conference on Applications of Computer Vision, 2015
Proceedings of the 15th Non-Volatile Memory Technology Symposium, 2015
Multi-facet Learning using Deep Convolutional Neural Network for Person-Related Categories in Photos.
Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, 2015
Proceedings of the Twenty-Fourth International Joint Conference on Artificial Intelligence, 2015
You are what you tweet...pic! gender prediction based on semantic analysis of social media images.
Proceedings of the 2015 IEEE International Conference on Multimedia and Expo, 2015
2014
Comput. Vis. Image Underst., 2014
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014
Proceedings of the IEEE Winter Conference on Applications of Computer Vision, 2014
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
The Placing Task: A Large-Scale Geo-Estimation Challenge for Social-Media Videos and Images.
Proceedings of the 3rd ACM Multimedia Workshop on Geotagging and Its Applications in Multimedia, 2014
GeoMM 2014: the third ACM multimedia workshop ongeotagging and its applications in multimedia.
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
Proceedings of the ACM International Conference on Multimedia, MM '14, Orlando, FL, USA, November 03, 2014
2013
ACM Trans. Multim. Comput. Commun. Appl., 2013
IBM Research and Columbia University TRECVID-2013 Multimedia Event Detection (MED), Multimedia Event Recounting (MER), Surveillance Event Detection (SED), and Semantic Indexing (SIN) Systems.
Proceedings of the 2013 TREC Video Retrieval Evaluation, 2013
Proceedings of the Advances in Multimedia Modeling, 19th International Conference, 2013
Proceedings of the ACM Multimedia Conference, 2013
Proceedings of the ACM Multimedia Conference, 2013
Second ACM multimedia workshop on geotagging and its applications in multimedia (GeoMM 2013).
Proceedings of the ACM Multimedia Conference, 2013
Proceedings of the 2013 IEEE International Conference on Multimedia and Expo, 2013
Large-scale video event classification using dynamic temporal pyramid matching of visual semantics.
Proceedings of the IEEE International Conference on Image Processing, 2013
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013
Hierarchical Feature Pooling with Structure Learning: A New Method for Pedestrian Detection.
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2013
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013
Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition, 2013
Action Detection by Fusing Hierarchically Filtered Motion with Spatiotemporal Interest Point Features.
Proceedings of the Human Behavior Recognition Technologies, 2013
2012
IEEE Trans. Syst. Man Cybern. Part C, 2012
Latent Community Topic Analysis: Integration of Community Discovery with Topic Modeling.
ACM Trans. Intell. Syst. Technol., 2012
Neurocomputing, 2012
Proceedings of the 21st World Wide Web Conference, 2012
IBM Research and Columbia University TRECVID-2012 Multimedia Event Detection (MED), Multimedia Event Recounting (MER), and Semantic Indexing (SIN) Systems.
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012
Proceedings of the 2012 TREC Video Retrieval Evaluation, 2012
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
GeoMM'12: ACM international workshop on geotagging and its applications in multimedia.
Proceedings of the 20th ACM Multimedia Conference, MM '12, Nara, Japan, October 29, 2012
Proceedings of the 1st International Workshop on Big Data, 2012
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo Workshops, 2012
Video Event Detection Using Temporal Pyramids of Visual Semantics with Kernel Optimization and Model Subspace Boosting.
Proceedings of the 2012 IEEE International Conference on Multimedia and Expo, 2012
Proceedings of the Computer Vision - ECCV 2012, 2012
Beyond Mahalanobis distance: Learning second-order discriminant function for people verification.
Proceedings of the 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, 2012
IBM T.J. Watson Research Center, Multimedia Analytics: Modality Classification and Case-Based Retrieval Tasks of ImageCLEF2012.
Proceedings of the CLEF 2012 Evaluation Labs and Workshop, 2012
2011
A general framework for efficient clustering of large datasets based on activity detection.
Stat. Anal. Data Min., 2011
Proceedings of the 20th International Conference on World Wide Web, 2011
IBM Research and Columbia University TRECVID-2011 Multimedia Event Detection (MED) System.
Proceedings of the 2011 TREC Video Retrieval Evaluation, 2011
Proceedings of the Eleventh SIAM International Conference on Data Mining, 2011
Proceedings of the Advances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011. Proceedings of a meeting held 12-14 December 2011, 2011
Proceedings of the 19th International Conference on Multimedia 2011, Scottsdale, AZ, USA, November 28, 2011
Proceedings of the 11th IEEE International Conference on Data Mining, 2011
Proceedings of the 24th IEEE Conference on Computer Vision and Pattern Recognition, 2011
Proceedings of the Social Network Data Analytics, 2011
2010
Proceedings of the 19th International Conference on World Wide Web, 2010
Proceedings of the TRECVID 2010 workshop participants notebook papers, 2010
Proceedings of the Advances in Multimedia Modeling, 2010
Proceedings of the 18th International Conference on Multimedia 2010, 2010
Proceedings of the 2010 IEEE International Conference on Multimedia and Expo, 2010
Proceedings of the International Conference on Image Processing, 2010
Proceedings of the IEEE International Conference on Acoustics, 2010
Proceedings of the Twenty-Third IEEE Conference on Computer Vision and Pattern Recognition, 2010
Proceedings of the 19th ACM Conference on Information and Knowledge Management, 2010
2009
Image Annotation Within the Context of Personal Photo Collections Using Hierarchical Event and Scene Models.
IEEE Trans. Multim., 2009
Responses to the Comments on "Plane-Based Optimization for 3D Object Reconstruction from Single Line Drawings".
IEEE Trans. Pattern Anal. Mach. Intell., 2009
Responses to the Comments on "What the Back of the Object Looks Like: 3D Reconstruction from Line Drawings without Hidden Lines".
IEEE Trans. Pattern Anal. Mach. Intell., 2009
Proceedings of the SIAM International Conference on Data Mining, 2009
Enhancing semantic and geographic annotation of web images via logistic canonical correlation regression.
Proceedings of the 17th International Conference on Multimedia 2009, 2009
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009
Proceedings of the IEEE 12th International Conference on Computer Vision, ICCV 2009, Kyoto, Japan, September 27, 2009
2008
IEEE Trans. Pattern Anal. Mach. Intell., 2008
What the Back of the Object Looks Like: 3D Reconstruction from Line Drawings without Hidden Lines.
IEEE Trans. Pattern Anal. Mach. Intell., 2008
Proceedings of the TRECVID 2008 workshop participants notebook papers, 2008
Proceedings of the 16th International Conference on Multimedia 2008, 2008
Annotating photo collections by label propagation according to multiple similarity cues.
Proceedings of the 16th International Conference on Multimedia 2008, 2008
Proceedings of the 16th International Conference on Multimedia 2008, 2008
Proceedings of the 2008 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2008), 2008
Proceedings of the 7th ACM International Conference on Image and Video Retrieval, 2008
2007
Spatially Coherent Latent Topic Model for Concurrent Segmentation and Classification of Objects and Scenes.
Proceedings of the IEEE 11th International Conference on Computer Vision, 2007
Proceedings of the 2007 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR 2007), 2007
2006
Proceedings of the 14th ACM International Conference on Multimedia, 2006
Automatic Segmentation of Lung Fields from Radiographic Images of SARS Patients Using a New Graph Cuts Algorithm.
Proceedings of the 18th International Conference on Pattern Recognition (ICPR 2006), 2006
Proceedings of the Computer Vision, 2006
2005
Proceedings of the 10th IEEE International Conference on Computer Vision (ICCV 2005), 2005