Hi, I am Caiming Xiong, VP of AI Research and Applied AI at Salesforce. I got my Ph.D. in the department of Computer Science and Engineering, University at Buffalo, SUNY and worked as a Postdoctoral Researcher Scholar at the University of California, Los Angeles (UCLA).

Research Interests

ML/DL, NLP, Computer Vision, Multimedia, Recommendation and AI for Good.

Institutions

         

Links

Google Scholar

Contact

cxiong [at] salesforce.com

Publications

See Google scholar for up-to-date list of papers

2021

Robustness Gym: Unifying the NLP Evaluation Landscape, Karan Goel, Nazneen Rajani, Jesse Vig, Samson Tan, Jason Wu, Stephan Zheng, Caiming Xiong, Mohit Bansal, Christopher Ré.[ arxiv link, code, website ]

SCRIPT: Self-Critic Pre-Training of Transformers, Erik Nijkamp, Bo Pang, Ying Nian Wu and Caiming Xiong.
The 2021 Conference of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL-HLT 2021). [ pdf ]

Learning to Synthesize Data for Semantic Parsing, Bailin Wang, Wenpeng Yin, Xi Victoria Lin and Caiming Xiong.
The 2021 Conference of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL-HLT 2021). [ pdf ]

DART: Open-Domain Structured Data Record to Text Generation, Linyong Nan, Dragomir Radev, Rui Zhang, Amrit Rau, Abhinand Sivaprasad, Chiachun Hsieh, Xiangru Tang, Aadit Vyas, Neha Verma, Pranav Krishna, YANGXIAOKANG LIU, Nadia Irwanto, Jessica Pan, Faiaz Rahman, Ahmad Zaidi, Mutethia Mutuma, Yasin Tarabar, Ankit Gupta, Tao Yu, Yi Chern Tan, Xi Victoria Lin, Caiming Xiong, Richard Socher and Nazneen Fatema Rajani.
The 2021 Conference of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL-HLT 2021). [ pdf ]

Structured Scene Memory for Vision-Language Navigation. Hanqing Wang, Wenguan Wang, Wei Liang, Caiming Xiong, Jianbing Shen.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021). [ pdf]

WOAD: Weakly Supervised Online Action Detection in Untrimmed Videos. Mingfei Gao, Yingbo Zhou, Ran Xu, Richard Socher, Caiming Xiong.
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2021). [ pdf]

MoPro: Webly Supervised Learning with Momentum Prototypes. Junnan Li, Caiming Xiong, Steven Hoi.
International Conference on Learning Representations (ICLR 2021). [ pdf]

Representation Learning for Sequence Data with Deep Autoencoding Predictive Components. Junwen Bai, Weiran Wang, Yingbo Zhou, Caiming Xiong.
International Conference on Learning Representations (ICLR 2021) [ pdf]

BERTology Meets Biology: Interpreting Attention in Protein Language Models. Jesse Vig, Ali Madani, Lav R. Varshney, Caiming Xiong, richard socher, Nazneen Rajani.
International Conference on Learning Representations (ICLR 2021). [ pdf]

Prototypical Contrastive Learning of Unsupervised Representations. Junnan Li, Pan Zhou, Caiming Xiong, Steven Hoi.
International Conference on Learning Representations (ICLR 2021). [ pdf]

CoCo: Controllable Counterfactuals for Evaluating Dialogue State Trackers. Shiyang Li, Semih Yavuz, Kazuma Hashimoto, Jia Li, Tong Niu, Nazneen Rajani, Xifeng Yan, Yingbo Zhou, Caiming Xiong.
International Conference on Learning Representations (ICLR 2021). [ pdf]

GraPPa: Grammar-Augmented Pre-Training for Table Semantic Parsing. Tao Yu, Chien-Sheng Wu, Xi Victoria Lin, Bailin Wang, Yi Chern Tan, Xinyi Yang, Dragomir Radev, Richard Socher, Caiming Xiong.
International Conference on Learning Representations (ICLR 2021). [ pdf]

Deep Verifier Networks: Verification of Deep Discriminative Models with Deep Generative Models. Tong Che, Xiaofeng Liu, Site Li, Yubin Ge, Ruixiang Zhang, Caiming Xiong and Yoshua Bengio
.
Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI 2021). [ pdf]

Proposal Learning for Semi-Supervised Object Detection. Peng Tang, Chetan Ramaiah, Yan Wang, Ran Xu and Caiming Xiong
.
Winter Conference on Applications of Computer Vision 2021 (WACV 2021). [ pdf]

SummEval: Re-evaluating Summarization Evaluation, Alexander R. Fabbri, Wojciech Kryściński, Bryan McCann, Caiming Xiong, Richard Socher, Dragomir Radev.
Transactions of the Association for Computational Linguistics (TACL). [ pdf ]

A Dynamic Frame Selection Framework for Fast Video Recognition, Zuxuan Wu, Hengduo Li, Caiming Xiong, Yu-Gang Jiang, Larry Steven Davis.
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI). [ pdf ]

2020

Online Structured Meta-learning
. Huaxiu Yao, Yingbo Zhou, Mehrdad Mahdavi, Zhenhui (Jessie) Li, Richard Socher and Caiming Xiong
.
The 2020 Conference on Neural Information Processing Systems (NeurIPS 2020). [ pdf]

Towards Understanding Hierarchical Learning: Benefits of Neural Representations
. Minshuo Chen, Yu Bai, Jason Lee, Tuo Zhao, Huan Wang, Caiming Xiong and Richard Socher.
The 2020 Conference on Neural Information Processing Systems (NeurIPS 2020). [ pdf]

Theory-Inspired Path-Regularized Differential Network Architecture Search. Pan Zhou, Caiming Xiong, Richard Socher and Steven Hoi
.
The 2020 Conference on Neural Information Processing Systems (NeurIPS 2020). [ pdf] (Oral)

Towards Theoretically Understanding Why Sgd Generalizes Better Than Adam in Deep Learning. Pan Zhou, Jiashi Feng, Chao Ma, Caiming Xiong, Steven Hoi and Weinan E
.
The 2020 Conference on Neural Information Processing Systems (NeurIPS 2020). [ pdf]

Universal Natural Language Processing with Limited Annotations: Try Few-shot Textual Entailment as a Start
. Wenpeng Yin, Nazneen Fatema Rajani, Dragomir Radev, Richard Socher and Caiming Xiong
.
The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020). [ pdf]

TOD-BERT: Pre-trained Natural Language Understanding for Task-Oriented Dialogue. Chien-Sheng Wu, Steven C.H. Hoi, Richard Socher and Caiming Xiong
.
The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020). [ pdf]

Probing Task-Oriented Dialogue Representation from Language Models. Chien-Sheng Wu and Caiming Xiong
.
The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020). [ pdf]

VD-BERT: A Unified Vision and Dialog Transformer with BERT. Yue Wang, Shafiq Joty, Michael R., Irwin King, Caiming Xiong and Steven C.H. Hoi.
The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020). [ pdf]

Discern: Discourse-Aware Entailment Reasoning Network for Conversational Machine Reading
. Yifan Gao, Chien-Sheng Wu, Jingjing Li, Shafiq Joty, Steven C.H. Hoi, Caiming Xiong, Irwin King, and Michael Lyu.
The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020). [ pdf]

Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference
. Jianguo Zhang, Kazuma Hashimoto, Wenhao Liu, Chien-Sheng Wu, Yao Wan, Philip Yu, Richard Socher and Caiming Xiong.
The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020). [ pdf]

Evaluating the Factual Consistency of Abstractive Text Summarization. Wojciech Kryscinski, Bryan McCann, Caiming Xiong, Richard Socher
.
The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020). [ pdf]

The Thieves on Sesame Street are Polyglots: Extracting Multilingual Models from Monolingual APIs
. Nitish Shirish Keskar, Bryan McCann, Caiming Xiong and Richard Socher
.
The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020). [ pdf]

Simple Data Augmentation with the Mask Token Improves Domain Adaptation for Dialog Act Tagging. Semih Yavuz, Kazuma Hashimoto, Wenhao Liu, Nitish Shirish Keskar, Richard Socher, Caiming Xiong.
The 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020). [ pdf]

Bridging Textual and Tabular Data for Cross-Domain Text-to-SQL Semantic Parsing. Victoria Lin, Richard Socher, Caiming Xiong.
(EMNLP-Findings 2020). [ pdf]

Improving Limited Labeled Dialogue State Tracking with Self-Supervision
. Chien-Sheng Wu, Steven C.H. Hoi,andCaiming Xiong
.
(EMNLP-Findings 2020). [ pdf]

Composed Variational Natural Language Generation for Few-shot Intents. Congying Xia,Caiming Xiong, Philip Yu and Richard Socher.
(EMNLP-Findings 2020). [ pdf]

Double-Hard Debias: Tailoring Word Embeddings for Gender Bias Mitigation. T Wang, X. Lin, N. F. Rajani, B. McCann, V. Ordonez and C. Xiong.
The 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020). [ pdf]

Explicit Memory Tracker with Coarse-to-Fine Reasoning for Conversational Machine Reading. Y. Gao, C. Wu, S. Joty, C. Xiong, R. Socher, I. King, M. Lyu and S. Hoi.
The 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020). [ pdf]

ERASER: A Benchmark to Evaluate Rationalized NLP Models. J. DeYoung, S. Jain, N. F. Rajani, E. Lehman, C. Xiong, R. Socher and B. C. Wallace.
The 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020). [ pdf]

ESPRIT: Explaining Solutions to Physical ReasonIng Tasks. N. F. Rajani, R. Zhang, Y. Tan, S. Zheng, J. Weiss, A. Vyas, A. Gupta, C. Xiong, R. Socher and D. Radev.
The 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020). [ pdf]

Photon: A Robust Cross-Domain Text-to-SQL System. J. Zeng, X. Lin, S. Hoi, C. Xiong, R. Socher, M. Lyu, I. King.
The 58th Annual Meeting of the Association for Computational Linguistics (ACL 2020). [ pdf]

Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering , Akari Asai, Kazuma Hashimoto, Hannaneh Hajishirzi, Richard Socher, Caiming Xiong.
International Conference on Learning Representations (ICLR 2020). [ pdf]

Assessing Local Generalization Capability in Deep Models, Huan Wang, Nitish Shirish Keskar, Caiming Xiong, Richard Socher.
The 23rd International Conference on Artificial Intelligence and Statistics (AISTATS 2020). [ pdf]

Learning from Noisy Anchors for One-stage Object Detection, Hengduo Li, Zuxuan Wu, Chen Zhu, Caiming Xiong, Richard Socher, Larry S. Davis.
Conference on Computer Vision and Pattern Recognition (CVPR 2020). [ pdf]

2019

CTRL: A Conditional Transformer Language Model for Controllable Generation, Nitish Shirish Keskar, Bryan McCann, Lav R. Varshney, Caiming Xiong, Richard Socher.
[ arxiv link, code (pre-trained and fine-tuning), blog ]

Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards, Alex Trott, Stephan Zheng, Caiming Xiong and Richard Socher.
Thirty-third Conference on Neural Information Processing Systems (NeurIPS 2019). [ pdf]

LiteEval: A Coarse-to-Fine Framework for Resource Efficient Video Recognition, Zuxuan Wu, Caiming Xiong, Yu-Gang Jiang and Larry Davis.
Thirty-third Conference on Neural Information Processing Systems (NeurIPS 2019). [ pdf]

The State of Text Summarization: A Critical Evaluation, Wojciech Kryscinski, Caiming Xiong and Richard Socher.
2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019). [ pdf]

WSLLN: Weakly Supervised Natural Language Localization Networks, Mingfei Gao, Larry Davis, Richard Socher and Caiming Xiong.
2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019). [ pdf]

Editing-based SQL Query Generation for Cross-Domain Context-Dependent Questions, Rui Zhang, Tao Yu, Heyang Er, Sungrok Shim, Eric Xue, Xi Victoria Lin, Tianze Shi, Caiming Xiong, Richard Socher and Dragomir Radev .
2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019). [ pdf]

CoSQL: A Conversational Text-to-SQL Challenge Towards Cross-Domain Natural Language Interfaces to Databases, Tao Yu, Rui Zhang, Heyang Er, Suyi Li, Eric Xue, Bo Pang, Xi Victoria Lin, Yi Chern Tan, Tianze Shi, Zihan Li, Youxuan Jiang, Michihiro Yasunaga, Sungrok Shim, Tao Chen, Alexander Fabbri, Zifan Li, Luyao Chen, Yuwen Zhang, Shreya Dixit, Vincent Zhang, Caiming Xiong, Richard Socher, Walter Lasecki and Dragomir Radev.
2019 Conference on Empirical Methods in Natural Language Processing and 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP 2019). [ pdf]

Explain Yourself! Leveraging Language Models for Commonsense Reasoning, Nazneen Fatema Rajani, Bryan McCann, Caiming Xiong and Richard Socher.
The 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019). [ arxiv pdf, Blog Post, Github, Press: VentureBeat, Silicon Angle, ZDNet ]

SParC: Cross-Domain Semantic Parsing in Context, Tao Yu, Rui Zhang, Michihiro Yasunaga, Yi Chern Tan, Xi Victoria Lin, Suyi Li, Heyang Er, Irene Li, Bo Pang, Tao Chen, Emily Ji, Shreya Dixit, David Proctor, Sungrok Shim, Jonathan Kraft, Vincent Zhang, Caiming Xiong, Richard Socher and Dragomir Radev.
The 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019). [ pdf]

Transferable Multi-Domain State Generator for Task-Oriented Dialogue Systems, Chien-Sheng Wu, Andrea Madotto, Ehsan Hosseini-Asl, Caiming Xiong, Richard Socher and Pascale Fung.
The 57th Annual Meeting of the Association for Computational Linguistics (ACL 2019). [ pdf] (Outstanding Paper Award)

Learn to Grow: A Continual Structure Learning Framework for Catastrophic Forgetting, Xilai Li, Yingbo Zhou, Caiming Xiong, Richard Socher.
The 36th International Conference on Machine Learning (ICML 2019). [ pdf]

Taming MAML: Control variates for unbiased meta-reinforcement learning gradient estimation, Hao Liu, Richard Socher, Caiming Xiong.
The 36th International Conference on Machine Learning (ICML 2019). [ pdf]

On the Generalization Gap in Reparameterizable Reinforcement Learning, Huan Wang, Stephan Zheng, Caiming Xiong, Richard Socher.
The 36th International Conference on Machine Learning (ICML 2019). [ pdf]

The Regretful Agent: Heuristic-Aided Navigation through Progress Estimation, Chih-Yao Ma†, Zuxuan Wu, Ghassan AlRegib, Caiming Xiong, Zsolt Kira.
Conference on Computer Vision and Pattern Recognition (CVPR 2019). [ pdf]

AdaFrame: Adaptive Frame Selection for Fast Video Recognition, Zuxuan Wu, Caiming Xiong, Chih-Yao Ma, Richard Socher, Larry S Davis.
Conference on Computer Vision and Pattern Recognition (CVPR 2019). [ pdf]

Global-to -local Memory Pointer Networks for Task-Oriented Dialogue, Chien-Sheng Wu, Richard Socher, Caiming Xiong.
International Conference on Learning Representations (ICLR 2019). [ pdf]

Self-Monitoring Navigation Agent via Auxiliary Progress Estimation, Chih-Yao Ma, Jiasen Lu, Zuxuan Wu, Ghassan AlRegib, Zsolt Kira, Richard Socher, Caiming Xiong.
International Conference on Learning Representations (ICLR 2019). [ pdf]

Coarse-grain Fine-grain Coattention Network for Multi-evidence Question Answering, Victor Zhong, Caiming Xiong, Nitish Shirish Keskar, Richard Socher.
International Conference on Learning Representations (ICLR 2019). [ pdf]

Competitive experience replay, Hao Liu, Alexander Trott, Richard Socher, Caiming Xiong.
International Conference on Learning Representations (ICLR 2019). [ pdf]

Augmented Cyclic Adversarial Learning for Low Resource Domain Adaptation, Ehsan Hosseini-Asl, Yingbo Zhou, Caiming Xiong, Richard Socher.
International Conference on Learning Representations (ICLR 2019). [ pdf]

A Closer Look at Deep Learning Heuristics: Learning rate restarts, Warmup and Distillation, Akhilesh Gotmare, Nitish Shirish Keskar, Caiming Xiong, Richard Socher.
International Conference on Learning Representations (ICLR 2019). [ pdf]

Neural Abstract Style Transfer for Chinese Traditional Painting, Bo Li, Caiming Xiong, Tianfu Wu, Yu Zhou, Lun Zhang, Rufeng Chu.
Asian Conference on Computer Vision (ACCV 2019). [ pdf]

2018

The Natural Language Decathlon: Multitask Learning as Question Answering, Bryan McCann, Nitish Shirish Keskar, Caiming Xiong, Richard Socher.
[ arxiv pdf, code and leaderboard, blog post, Press: zdnet, Venturebeat, SiliconAngle ]

Multi-Hop Knowledge Graph Reasoning with Reward Shaping, Xi Victoria Lin, Richard Socher, Caiming Xiong.
The 2018 Conference on Empirical Methods on Natural Language Processing (EMNLP 2018). [ pdf]

Improving Abstraction in Text Summarization, Wojciech Kryściński, Romain Paulus, Caiming Xiong, Richard Socher.
The 2018 Conference on Empirical Methods on Natural Language Processing (EMNLP 2018). [ pdf]

Global-Locally Self-Attentive Encoder for Dialogue State Tracking, Victor Zhong, Caiming Xiong and Richard Socher.
Association for Computational Linguistics 2018 Conference (ACL 2018). [ pdf]

Efficient and Robust Question Answering from Minimal Context over Documents, Sewon Min, Victor Zhong, Richard Socher, Caiming Xiong.
Association for Computational Linguistics 2018 Conference (ACL 2018). [ pdf]

End-to-End Dense Video Captioning with Masked Transformer, Luowei Zhou, Yingbo Zhou, Jason J. Corso, Richard Socher, Caiming Xiong.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2018). [ pdf ] (Spotlight)

Interpretable Counting for Visual Question Answering, Alexander Trott, Caiming Xiong* and Richard Socher.
International Conference on Learning Representations (ICLR 2018). [ pdf, blog post]

Hierarchical and Interpretable Skill Acquisition in Multi-task Reinforcement Learning, Tianmin Shu, Caiming Xiong*, Richard Socher.
International Conference on Learning Representations (ICLR 2018).[ pdf, blog post]

Non-Autoregressive Neural Machine Translation, Jiatao Gu, James Bradbury, Caiming Xiong, Victor O.K. Li, Richard Socher.
International Conference on Learning Representations (ICLR 2018). [ pdf, blog post, dataset, Press: CNBC, Venturebeat, Slator ]

DCN+: Mixed Objective and Deep Residual Coattention for Question Answering, Caiming Xiong, Victor Zhong and Richard Socher.
International Conference on Learning Representations (ICLR 2018). [ pdf]

A Deep Reinforced Model for Abstractive Summarization, Romain Paulus, Caiming Xiong*, Richard Socher.
International Conference on Learning Representations (ICLR 2018).[ pdf, blog post, Press: Forbes, MIT Tech Review, TechCrunch]

A Multi-Discriminator CycleGAN for Unsupervised Non-Parallel Speech Domain Adaptation, Ehsan Hosseini-Asl, Yingbo Zhou, Caiming Xiong, Richard Socher.
Interspeech 2018.[ pdf, blog post]

Improving End-to-End Speech Recognition with Policy Learning, Yingbo Zhou, Caiming Xiong, Richard Socher.
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2018).[ pdf, blog post]

2017

Improved Regularization Techniques for End-to-End Speech Recognition, Yingbo Zhou, Caiming Xiong, and Richard Socher.
[ pdf, blog post]

Seq2SQL: Generating Structured Queries from Natural Language using Reinforcement Learning, Victor Zhong, Caiming Xiong, and Richard Socher.
[ pdf, blog post, dataset, Press: TechCrunch, Venturebeat ]

Learned in Translation: Contextualized Word Vectors, Bryan McCann, James Bradbury, Caiming Xiong, Richard Socher.
Advances in Neural Information Processing Systems (NIPS 2017). [ pdf, blog post, code, Press: MIT Tech Review ]

A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks, Kazuma Hashimoto, Caiming Xiong*, Yoshimasa Tsuruoka, Richard Socher.
The 2017 Conference on Empirical Methods on Natural Language Processing (EMNLP 2017). [ pdf, blog post]

Knowing When to Look: Adaptive Attention via A Visual Sentinel for Image Captioning, Jiasen Lu*, Caiming Xiong*, Devi Parikh, Richard Socher.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2017). (* equal contribution). [ pdf] (Spotlight)

Dynamic Coattention Networks For Question Answering, Caiming Xiong, Victor Zhong, Richard Socher.
International Conference on Learning Representations (ICLR 2017).[ pdf, blog post]

Quasi-Recurrent Neural Networks, James Bradbury, Stephen Merity, Caiming Xiong, Richard Socher.
International Conference on Learning Representations (ICLR 2017).[ pdf, blog post]

Pointer Sentinel Mixture Models, Stephen Merity, Caiming Xiong, James Bradbury, Richard Socher.
International Conference on Learning Representations (ICLR 2017).[ pdf, new dataset]

2016

Dynamic Memory Networks for Visual and Textual Question Answering, Caiming Xiong, Stephen Merity, Richard Socher.
The 33rd International Conference on Machine Learning (ICML 2016). [ pdf, New York Times]

Active Clustering with Model-Based Uncertainty Reduction, Caiming Xiong, David M. Johnson, Jason Corso.
IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI). [ pdf]

Recognizing Car Fluents from Videos, Bo Li, Tianfu Wu, Caiming Xiong, Song-Chun Zhu.
IEEE Computer Vision and Pattern Recognition (CVPR 2016). [ pdf ] (Oral)

Grounded Semantic Role Labeling, Shaohua Yang, Qiaozi Gao, Changsong Liu, Caiming Xiong, Joyce Y. Chai, Song-Chun Zhu.
The 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2016). [ pdf ]

Robot Learning with a Spatial, Temporal, and Causal And-Or Graph, Caiming Xiong, Nishant Shukla, Wenlong Xiong, Song-Chun Zhu.
IEEE International Conference on Robotics and Automation (ICRA 2016).[ pdf ]

Maximum Margin Dirichlet Process Mixtures for Clustering, Gang Chen, Haiying Zhang, Caiming Xiong.
AAAI Conference on Artificial Intelligence (AAAI 2016). [ pdf ]

Semi-Supervised Nonlinear Distance Metric Learning via Forests of Max-Margin Cluster Hierarchies, David M. Johnson, Caiming Xiong, Jason Corso.
IEEE Transactions on Knowledge and Data Engineering (TKDE). [ pdf ]

A Way out of the Odyssey: Analyzing and Combining Recent Insights for LSTMs, Shayne Longpre, Sabeek Pradhan, Caiming Xiong, Richard Socher.
[ pdf]

2015

A Unified Framework for Human-Robot Knowledge Transfer, Nishant Shukla, Caiming Xiong, Song-Chun Zhu.
AAAI Fall Symposium on AI for Human-Robot Interaction (AI-HRI 2015).[ pdf ]

Joint Action Recognition and Pose Estimation From Video, Xiaohan Nie, Caiming Xiong, Song-Chun Zhu.
IEEE Computer Vision and Pattern Recognition (CVPR 2015).[ pdf ]

Can humans fly? Action understanding with multiple classes of actors, Chenliang Xu, Shao-Hang Hsieh, Caiming Xiong, Jason Corso.
IEEE Computer Vision and Pattern Recognition (CVPR 2015).[ pdf ]

Jointly modeling deep video and compositional text to bridge vision and language in a unified framework, Ran Xu, Caiming Xiong, Wei Chen, Jason Corso.
AAAI Conference on Artificial Intelligence (AAAI 2015).[ pdf ]

2014

Seeing is worse than believing: Reading people's minds better than computer vision methods recognize actions, Andrei Barbu, Daniel P. Barrett, Wei Chen, N. Siddharth, Caiming Xiong , Jason Corso, Christiane D. Fellbaum, Catherine Hanson, Stephen Jos´e Hanson, S´ebastien H´elie, Evguenia Malaia, Barak A. Pearlmutter, Jeffrey Mark Siskind, Thomas Michael Talavage, Ronnie B. Wilbur.
European Conference on Computer Vision (ECCV 2014).[ pdf ]

Latent Domains Modeling for Visual Domain Adaptation, Caiming Xiong, Scott McCloskey, Shao-Hang Hsieh, Jason Corso.
AAAI Conference on Artificial Intelligence (AAAI 2014).[ pdf ]

Actionness Ranking with Lattice Conditional Ordinal Random Fields, Wei Chen, Caiming Xiong, Jason Corso
IEEE Computer Vision and Pattern Recognition (CVPR 2014).[ pdf , code. ]

Adaptive Quantization for Hashing: An Information-Based Approach to Learning Binary Codes, Caiming Xiong, Wei Chen, Gang Chen, David M. Johnson, Jason Corso
SIAM International Conference on Data Mining (SDM 2014).[ pdf, code. ]

Spectral Active Clustering of Remote Sensing Images, Zifeng Wang, Gui-Song Xia, Caiming Xiong, Liangpei Zhang.
IEEE International Geoscience and Remote Sensing Symposium (IGARSS 2014).[pdf]

2013

Uncertainty reduction for active image clustering via a hybrid global-local uncertainty model, Caiming Xiong, David M. Johnson, and Jason Corso.
AAAI Conference on Artificial Intelligence (Late-Breaking Papers Track) (AAAI 2013). [ pdf, code. ]

Comprehensive cross-hierarchy cluster agreement evaluation, David M. Johnson, Caiming Xiong, Jason Corso.
AAAI Conference on Artificial Intelligence (Late-Breaking Papers Track) (AAAI 2013). [ pdf, code. ]

2012

Streaming hierarchical video segmentation, Chenliang Xu*, Caiming Xiong*, Jason Corso.
European Conference on Computer Vision (ECCV 2012). [ pdf, code ] (Oral)(* equal contribution).

Random forests for metric learning with implicit pairwise position dependence, Caiming Xiong, David M. Johnson, R. Xu, Jason Corso.
ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2012). [ pdf, code ](Oral)

Coaction discovery: Segmentation of common actions across multiple videos, Caiming Xiong, David M. Johnson, R. Xu, Jason Corso.
Multimedia Data Mining Workshop in Conjunction with the ACM SIGKDD Conference on Knowledge Discovery and Data Mining (MDMKDD 2012). [ pdf, code ]

Dictionary transfer for image denoising via domain adaptation, Gang Chen, Caiming Xiong, Jason Corso.
IEEE International Conference on Image Processing (ICIP 2012). [ pdf ]

Efficient max-margin metric learning, Caiming Xiong, David M. Johnson, Jason Corso.
European Conference on Data Mining (ECDM 2012). [ pdf, code ](Best Paper Award)

Spectral active clustering via purification of the k-nearest neighbor graph, Caiming Xiong, David M. Johnson, Jason Corso.
European Conference on Data Mining (ECDM 2012). [ pdf, code ]

Online Active Constraint Selection For Semi-Supervised Clustering, Caiming Xiong, David M. Johnson, Jason Corso.
ECAI Active and Incremental Workshop, 2012. [ pdf, code ]

2011

AirTouch: Interacting With Computer Systems At A Distance., Daniel R. Schlegel, Albert Y. C. Chen, Caiming Xiong, Jeffery A. Delmerico and Jason Corso.
IEEE Winter Vision Meetings: Workshop on Applications of Computer Vision (WACV 2011). [ pdf]

Towards a parts-based approach to sub-cortical brain structure parsing, Digvijay Gagneja, Caiming Xiong, Jason Corso.
SPIE Conference on Medical Imaging, 2011. [ pdf ]

2009

From image parsing to painterly rendering, Kun Zeng, Mingtian Zhao, Caiming Xiong, Song-Chun Zhu.
ACM Transaction on Graphics, 2009 (TOG). [ pdf ]

Marker-less registration based on template tracking for augmented reality, Liang Lin, Yongtian Wang, Yue Liu, Caiming Xiong, Kun Zeng.
Multimedia Tools Applications, 2009 (MTA). [ pdf ]

Professional Services

Conference and Workshop Organization

  • Organizing Committee, Workshop on Interactive Executable Semantic Parsing (at EMNLP 2020).
  • Organizing Committee, Workshop on language and vision (at CVPR 2015).
Conference Area Chair
  • NLPCC 2019-2020, EMNLP 2019-2021, ACL 2020
Journal and Conference Reviewer
  • TPAMI, TACL, IJCV, TIP, PR, NIPS, ICML, ICLR, EMNLP, ACL, CVPR, ICCV, etc.