2024
- Skin-in-the-Game: Decision Making via Multi-Stakeholder Alignment in LLMs [arXiv]
Bilgehan Sel, Priya Shanmugasundaram, Mohammad Kachuee, Kun Zhou, Ruoxi Jia, Ming Jin
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
- How Johnny Can Persuade LLMs to Jailbreak Them: Rethinking Persuasion to Challenge AI Safety by Humanizing LLMs [arXiv]
Yi Zeng, Hongpeng Lin, Jingwen Zhang, Diyi Yang, Ruoxi Jia, Weiyan Shi
Annual Meeting of the Association for Computational Linguistics (ACL), 2024
- Rethinking Data Shapley for Data Selection Tasks: Misleads and Merits [arXiv]
Jiachen T. Wang, Tianji Yang, James Zou, Yongchan Kwon, Ruoxi Jia
International Conference on Machine Learning (ICML), 2024
Oral presentation
- RigorLLM: Resilient Guardrails for Large Language Models against Undesired Content [arXiv]
Zhuowen Yuan, Zidi Xiong, Yi Zeng, Ning Yu, Ruoxi Jia, Dawn Song, Bo Li
International Conference on Machine Learning (ICML), 2024
- A Safe Harbor for AI Evaluation and Red Teaming [arXiv]
Shayne Longpre, Sayash Kapoor, Kevin Klyman, Ashwin Ramaswami, Rishi Bommasani, Borhane Blili-Hamelin, Yangsibo Huang, Aviya Skowron, Zheng-Xin Yong, Suhas Kotha, Yi Zeng, Weiyan Shi, Xianjun Yang, Reid Southen, Alexander Robey, Patrick Chao, Diyi Yang, Ruoxi Jia, Daniel Kang, Sandy Pentland, Arvind Narayanan, Percy Liang, Peter Henderson
International Conference on Machine Learning (ICML), 2024
Oral presentation
- Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models [arXiv]
Bilgehan Sel, Ahmad Al-Tawaha, Vanshaj Khattar, Ruoxi Jia, Ming Jin
International Conference on Machine Learning (ICML), 2024
- Learning to Rank for Active Learning via Multi-Task Bilevel Optimization [arXiv]
Zixin Ding, Si Chen, Ruoxi Jia, Yuxin Chen
Conference on Uncertainty in Artificial Intelligence (UAI), 2024
- The Mirrored Influence Hypothesis: Efficient Data Influence Estimation by Harnessing Forward Passes [arXiv]
Myeongseob Ko, Feiyang Kang, Weiyan Shi, Ming Jin, Zhou Yu, Ruoxi Jia
Conference on Computer Vision and Pattern Recognition (CVPR), 2024
- Efficient Data Valuation for Weighted Nearest Neighbor Algorithms [arXiv]
Jiachen T. Wang, Prateek Mittal, Ruoxi Jia
International Conference on Artificial Intelligence and Statistics (AISTATS), 2024
Oral presentation
- Get more for less: Principled Data Selection for Warming Up Fine-Tuning in LLMs [openreview]
Feiyang Kang, Hoang Anh Just*, Yifan Sun*, Himanshu Jahagirdar*, Yuanzhi Zhang, Rongxing Du, Anit Kumar Sahu, Ruoxi Jia
International Conference on Learning Representations (ICLR), 2024
- Fine-tuning Aligned Language Models Compromises Safety, Even When Users Do Not Intend To! [openreview]
Xiangyu Qi*, Yi Zeng*, Tinghao Xie*, Pin-Yu Chen, Ruoxi Jia, Prateek Mittal, Peter Henderson
International Conference on Learning Representations (ICLR), 2024
Oral presentation
Featured in the New York Times
2023
- Performance Scaling via Optimal Transport: Enabling Data Selection from Partially Revealed Sources [arXiv]
Feiyang Kang*, Hoang Anh Just*, Anit Kumar Sahu, Ruoxi Jia
Conference on Neural Information Processing Systems (NeurIPS), 2023
Adopted by Amazon for data selection
- Threshold KNN-Shapley: A Linear-Time and Privacy-Friendly Approach to Data Valuation [arXiv]
Jiachen T. Wang, Yuqing Zhu, Yu-Xiang Wang, Ruoxi Jia, Prateek Mittal
Conference on Neural Information Processing Systems (NeurIPS), 2023
Spotlight presentation
- A Randomized Approach for Tight Privacy Accounting [arXiv]
Jiachen T. Wang, Saeed Mahloujifar, Tong Wu, Ruoxi Jia, Prateek Mittal
Conference on Neural Information Processing Systems (NeurIPS), 2023
- Practical Membership Inference Attacks Against Large-Scale Multi-Modal Models: A Pilot Study
Myeongseob Ko, Ming Jin, Chenguang Wang, Ruoxi Jia
International Conference on Computer Vision (ICCV), 2023
- One-Round Active Learning through Data Utility Learning and Proxy Models [openreview]
Jiachen T. Wang, Si Chen, Ruoxi Jia
Transactions on Machine Learning Research, 2023
- Turning a Curse into a Blessing: Enabling In-Distribution-Data-Free Backdoor Removal via Stabilized Model Inversion [openreview]
Si Chen, Yi Zeng, Won Park, Jiachen T. Wang, Xun Chen, Lingjuan Lyu, Zhuoqing Mao, Ruoxi Jia
Transactions on Machine Learning Research, 2023
- PrivMon: A Stream-Based System for Real-Time Privacy Attack Detection for Machine Learning Models
Myeongseob Ko*, Xinyu Yang*, Zhengjie Ji, Hoang Anh Just, Peng Gao, Anoop Kumar, Ruoxi Jia
International Symposium on Research in Attacks, Intrusions, and Defenses (RAID), 2023
- ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep Learning Paradigms [arXiv]
Minzhou Pan*, Yi Zeng*, Lingjuan Lyu, Xue Lin, Ruoxi Jia
USENIX Security, 2023
- Narcissus: A Practical Clean-Label Backdoor Attack with Limited Information [arXiv]
Yi Zeng*, Minzhou Pan*, Hoang Anh Just, Lingjuan Lyu, Meikang Qiu, Ruoxi Jia
ACM Conference on Computer and Communications Security (CCS), 2023
- 2D-Shapley: A Framework for Fragmented Data Valuation [arXiv]
Liu Zhihong*, Hoang Anh Just*, Xiangyu Chang, Xi Chen, Ruoxi Jia
International Conference on Machine Learning (ICML), 2023
- Revisiting Data-Free Knowledge Distillation with Poisoned Teachers [arXiv]
Junyuan Hong*, Yi Zeng*, Shuyang Yu, Lingjuan Lyu, Ruoxi Jia, Jiayu Zhou
International Conference on Machine Learning (ICML), 2023
- How to Sift Out a Clean Data Subset in the Presence of Data Poisoning? [pdf]
Yi Zeng*, Minzhou Pan*, Himanshu Jahagirdar, Ming Jin, Lingjuan Lyu, Ruoxi Jia
USENIX Security, 2023
- Learning-to-Learn to Guide Random Search: Derivative-Free Meta Blackbox Optimization on Manifold
Bilgehan Sel, Ahmad Al-Tawaha, Yuhao Ding, Ruoxi Jia, Bo Ji, Javad Lavaei, Ming Jin
Learning for Dynamics and Control Conference (L4DC), 2023
Oral presentation
- LAVA: Data Valuation without Pre-Specified Learning Algorithms [openreview]
Hoang Anh Just*, Feiyang Kang*, Jiachen T. Wang, Yi Zeng, Myeongseob Ko, Ming Jin, Ruoxi Jia
International Conference on Learning Representations (ICLR), 2023
Spotlight presentation
- Towards Robustness Certification Against Universal Perturbations [openreview]
Yi Zeng*, Zhouxing Shi*, Ming Jin, Feiyang Kang, Lingjuan Lyu, Cho-Jui Hsieh, Ruoxi Jia
International Conference on Learning Representations (ICLR), 2023
- Data Banzhaf: A Robust Data Valuation Framework for Machine Learning [arXiv]
Jiachen T. Wang, Ruoxi Jia
International Conference on Artificial Intelligence and Statistics (AISTATS), 2023
Oral presentation
- On Solution Functions of Optimization: Universal Approximation and Covering Number Bounds [arXiv]
Ming Jin, Vanshaj Khattar, Bilgehan Sel, Harshal Kaushik, Ruoxi Jia
Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI), 2023
Oral presentation
- Certifiably Robust Neural ODE with Learning-based Barrier Function [link]
Runing Yang, Ruoxi Jia, Xiangyu Zhang, Ming Jin
IEEE Control Systems Letters, 2023
- ModelPred: A Framework for Predicting Trained Model from Training Data [arXiv]
Yingyan Zeng, Jiachen T. Wang, Si Chen, Hoang Anh Just, Ran Jin, Ruoxi Jia
IEEE Conference on Secure and Trustworthy Machine Learning (SaTML), 2023
- Variance Reduced Shapley Value Estimation for Trustworthy Data Valuation [arXiv]
Mengmeng Wu, Ruoxi Jia, Changle Lin, Wei Huang, Xiangyu Chang
Computers and Operations Research, 2023
- Decision-Focused Learning for Inverse Noncooperative Games: Generalization Bounds and Convergence Analysis
Ahmad Al-Tawaha, Harshal Kaushik, Bilgehan Sel, Ruoxi Jia, Ming Jin
IFAC World Congress, 2023
- A Theoretical Analysis of Using Gradient Data for Sobolev Training in RKHS
Zain ul Abdeen, Ruoxi Jia, Vassilis Kekatos, Ming Jin
IFAC World Congress, 2023
2022
- Renyi Differential Privacy of Propose-Test-Release and Applications to Private and Robust Machine Learning [arXiv]
Jiachen T. Wang, Saeed Mahloujifar, Shouda Wang, Ruoxi Jia, Prateek Mittal
Thirty-Sixth Conference on Neural Information Processing Systems (NeurIPS), 2022
- CATER: Intellectual Property Protection on Text Generation APIs via Conditional Watermarks [arXiv]
Xuanli He, Qiongkai Xu, Yi Zeng, Lingjuan Lyu, Fangzhao Wu, Jiwei Li, Ruoxi Jia
Thirty-Sixth Conference on Neural Information Processing Systems (NeurIPS), 2022
- Just Fine-tune Twice: Selective Differential Privacy for Large Language Models [arXiv]
Weiyan Shi, Si Chen, Chiyuan Zhang, Ruoxi Jia, Zhou Yu
Conference on Empirical Methods in Natural Language Processing (EMNLP), 2022
- Selective Differential Privacy for Language Modeling [arXiv]
Weiyan Shi, Aiqi Cui, Evan Li, Ruoxi Jia, Zhou Yu
North American Chapter of the Association for Computational Linguistics (NAACL), 2022
Oral presentation
- Label-Only Model Inversion Attacks via Boundary Repulsion [arXiv]
Mostafa Kahla, Si Chen, Hoang Anh Just, Ruoxi Jia
Conference on Computer Vision and Pattern Recognition (CVPR), 2022
- Adversarial Unlearning of Backdoors via Implicit Hypergradient [arXiv]
Yi Zeng, Si Chen, Won Park, Z. Morley Mao, Ming Jin, Ruoxi Jia
International Conference on Learning Representations (ICLR), 2022
2021
- Knowledge-Enriched Distributional Model Inversion Attacks [arXiv]
Si Chen, Mostafa Kahla, Ruoxi Jia, Guo-Jun Qi
International Conference on Computer Vision (ICCV), 2021
- Rethinking the Backdoor Attacks’ Triggers: A Frequency Perspective [arXiv]
Yi Zeng*, Won Park*, Z. Morley Mao, Ruoxi Jia
International Conference on Computer Vision (ICCV), 2021
- DPlis: Boosting Utility of Differentially Private Deep Learning via Randomized Smoothing [arXiv]
Wenxiao Wang, Tianhao Wang, Lun Wang, Nanqing Luo, Pan Zhou, Dawn Song, Ruoxi Jia
The 21st Privacy Enhancing Technologies Symposium (PETS), 2021
- Scalability vs. Utility: Do We Have to Sacrifice One for the Other in Data Importance Quantification? [arXiv]
Ruoxi Jia, Fan Wu, Xuehui Sun, Jiacen Xu, David Dao, Bhavya Kailkhura, Ce Zhang, Bo Li, Dawn Song
Conference on Computer Vision and Pattern Recognition (CVPR), 2021
- InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective [arXiv]
Boxin Wang, Shuohang Wang, Yu Cheng, Zhe Gan, Ruoxi Jia, Bo Li, Jingjing Liu
International Conference on Learning Representations (ICLR), 2021
- Improving Robustness to Model Inversion Attacks via Mutual Information Regularization [arXiv]
Tianhao Wang, Yuheng Zhang, Ruoxi Jia
Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI), 2021
- REFIT: a Unified Watermark Removal Framework for Deep Learning Systems with Limited Data [arXiv]
Xinyun Chen, Wenxiao Wang, Chris Bender, Yiming Ding, Ruoxi Jia, Bo Li, Dawn Song
ASIACCS, 2021
- Stability-Based Analysis and Defense against Backdoor Attacks on Edge Computing Services [paper]
Yi Zhao, Ke Xu, Haiyang Wang, Bo Li, Ruoxi Jia
IEEE Network Magazine, 2021
- Adaptive Backdoor Trigger Detection in Edge-Deployed DNNs in 5G-Enabled IIoT Systems
Yi Zeng, Ruoxi Jia, Meikang Qiu
IEEE Transactions on Industrial Informatics, 2021
2020
- A Principled Approach to Data Valuation for Federated Learning [arXiv]
Tianhao Wang, Johannes Rausch, Ce Zhang, Ruoxi Jia, Dawn Song
Book Chapter in Federated Learning: Privacy and Incentive, 2020
- The Secret Revealer: Generative Model-Inversion Attacks Against Deep Neural Networks [arXiv]
Yuheng Zhang*, Ruoxi Jia*, Hengzhi Pei, Wenxiao Wang, Bo Li, Dawn Song
Conference on Computer Vision and Pattern Recognition (CVPR), 2020
Oral presentation
- Robust Anomaly Detection and Backdoor Attack Detection via Differential Privacy [arXiv]
Min Du, Ruoxi Jia, Dawn Song
International Conference on Learning Representations (ICLR), 2020
- On the Impact of Perceptual Compression on Deep Learning [paper]
Gerald Friedland, Ruoxi Jia, Jingkang Wang, Bo Li, Nathan Mundhenk
IEEE 3rd International Conference on Multimedia Information Processing and Retrieval, 2020
Before 2020
- Efficient Task-Specific Data Valuation for Nearest Neighbor Algorithms [arXiv]
Ruoxi Jia, David Dao, Boxin Wang, Frances Ann Hubis, Nezihe Merve Gurel, Bo Li, Ce Zhang, Costas J. Spanos, Dawn Song
International Conference on Very Large Data Bases (VLDB), 2019
- Towards Efficient Data Valuation Based on the Shapley Value [arXiv]
Ruoxi Jia*, David Dao*, Boxin Wang, Frances Ann Hubis, Nick Hynes, Nezihe Merve Gurel, Bo Li, Ce Zhang, Dawn Song, Costas Spanos
International Conference on Artificial Intelligence and Statistics (AISTATS), 2019
- Delving into Bootstrapping for Differential Privacy [paper]
Ruoxi Jia, Bo Li, Chaowei Xiao, Dawn Song
ICML Workshop on Security and Privacy of Machine Learning, 2019
- Leveraging Unlabeled Data for Watermark Removal of Deep Neural Networks [paper]
Xinyun Chen, Wenxiao Wang, Yiming Ding, Chris Bender, Ruoxi Jia, Bo Li, Dawn Song
ICML Workshop on Security and Privacy of Machine Learning, 2019
- On the Weak Neural Dependence Phenomenon in Deep Learning [paper]
Jiayao Zhang, Ruoxi Jia, Bo Li, Dawn Song
NeurIPS Workshop on Deep Learning Theory, 2018
- Poisoning Attacks on Data-Driven Utility Learning in Games [paper]
Ruoxi Jia*, Ioannis Konstantakopoulos*, Bo Li, Costas Spanos
American Control Conference, 2018
- A Framework for Privacy-Preserving Data Publishing with Enhanced Utility for Cyber-Physical Systems [paper]
Fisayo Caleb Sangogboye*, Ruoxi Jia*, Tianzhen Hong, Costas Spanos, Mikkel Baun Kjaergaard
ACM Transactions on Sensor Networks, 2018
- Design Automation for Smart Building Systems [paper]
Ruoxi Jia*, Baihong Jin*, Ming Jin, Yuxun Zhou, Ioannis Konstantakopoulos, Han Zou, Joyce Kim, Dan Li, Weixi Gu, Reza Arghandeh, Pierluigi Nuzzo, Stefano Schiavon, Alberto L. Sangiovanni-Vincentelli, Costas Spanos
Proceedings of the IEEE, 2018
- Advanced Building Control via Deep Reinforcement Learning [paper]
Ruoxi Jia, Ming Jin, Kaiyu Sun, Tianzhen Hong, Costas Spanos
International Conference on Applied Energy, 2018
- Buildings.Occupants: a Modelica package for modeling occupant behaviour in buildings [paper]
Zhe Wang, Tianzhen Hong, Ruoxi Jia
Journal of Building Performance Simulation, 2018
- BISCUIT: Building Intelligent System Customer Investment Tools [paper]
Ming Jin, Ruoxi Jia, Hari Prasanna Das, Wei Feng, Costas Spanos
International Conference on Applied Energy, 2018
- Virtual Occupancy Sensing: Your Energy Can Tell Whether You Are Present [paper]
Ming Jin, Ruoxi Jia, Costas Spanos
IEEE Transactions on Mobile Computing, 2017
- Data Analytics and Optimization of an Ice-Based Energy Storage System for Commercial Buildings [paper]
Na Luo, Tianzhen Hong, Hui Li, Ruoxi Jia, Wenguo Weng
Applied Energy, 2017
- Towards a Theory of Free-Lunch Privacy in Cyber-Physical Systems [paper]
Ruoxi Jia, Roy Dong, Prashanth Ganesh, Shankar Sastry, Costas Spanos
Annual Allerton Conference on Communication, Control, and Computing, 2017
- PAD: Protecting Anonymity in Publishing Building Related Datasets [paper]
Ruoxi Jia*, Fisayo Caleb Sangogboye*, Tianzhen Hong, Costas Spanos, Mikkel Baun Kjaergaard
ACM International Conference on Systems for Energy-Efficient Built Environments, 2017
- Optimal Sensor-Controller Codesign for Privacy in Dynamical Systems [paper]
Ruoxi Jia, Roy Dong, Shankar Sastry, Costas Spanos
IEEE Conference on Decision and Control, 2017
- Privacy-Enhanced Architecture for Occupancy-based HVAC Control [paper]
Ruoxi Jia, Roy Dong, Shankar Sastry, Costas Spanos
ACM/IEEE International Conference on Cyber-Physical Systems, 2017
- Occupancy Modeling in Shared Spaces of Buildings: A Queueing Approach [paper]
Ruoxi Jia, Costas Spanos
Journal of Building Performance Simulation, 2016
- MapSentinel: Can the Knowledge of Space Use Improve Indoor Tracking Further? [paper]
Ruoxi Jia, Ming Jin, Han Zou, Yigitcan Yesilata, Lihua Xie, Costas Spanos
Sensors, 2016
- A Fully Unsupervised Nonintrusive Load Monitoring Framework [paper]
Ruoxi Jia, Yang Gao, Costas Spanos
IEEE International Conference on Smart Grid Communications, 2015
- APEC: Auto Planner for Efficient Configuration of Indoor Positioning System [paper]
Ming Jin, Ruoxi Jia, Costas Spanos
International Conference on Mobile Ubiquitous Computing, Systems, Services and Technologies, 2015
- SoundLoc: Accurate Room-level Indoor Localization Using Acoustic Signatures [paper]
Ruoxi Jia, Ming Jin, Zilong Chen, Costas Spanos
IEEE International Conference on Automation Science and Engineering, 2015
- An iBeacon Assisted Indoor Localization and Tracking System [paper]
Zhenghua Chen, Qingchang Zhu, Hao Jiang, Han Zou, Yeng Chai Soh, Lihua Xie, Ruoxi Jia, Costas Spanos
International Conference on Information Processing in Sensor Networks, 2015
- Presencesense: Zero-training Algorithm for Individual Presence Detection Based on Power Monitoring [paper]
Ming Jin, Ruoxi Jia, Zhaoyi Kang, Ioannis Konstantakopoulos, Costas Spanos
ACM International Conference on Systems for Energy-Efficient Built Environments, 2014
- Environmental Sensing by Wearable Device for Indoor Activity and Location Estimation [paper]
Ming Jin, Han Zou, Kevin Weekly, Ruoxi Jia, Alexandre M Bayen, Costas Spanos
Annual Conference of the IEEE Industrial Electronics Society, 2014