Trustworthy Machine Learning

We study how machine learning systems can be made robust, reliable, and secure. Our work covers adversarial examples, data poisoning attacks and defenses, backdoor attacks, and hardware fault attacks on neural networks.

Privacy in Machine Learning

We investigate privacy risks in foundation models and machine learning pipelines, including membership inference attacks, data extraction from LLMs, and privacy-preserving training methods.

LLM and Agentic AI Security

As large language models and AI agents become more capable and widely deployed, we study how they can be attacked and how to build defenses. Topics include jailbreaking, indirect prompt injection, and TOCTOU vulnerabilities in agents.

Efficient and Reliable AI

We explore the intersection of efficiency and reliability in AI systems, studying how model compression (quantization, early-exit, pruning) affects robustness, and how AI systems behave under hardware faults.