zhang

Machine Learning and Data Privacy

Our group conducts research in the intersection of machine learning and data privacy. On the one hand, we use machine learning models to assess and mitigate the privacy risks stemming from various kinds of data, such as social network data and biomedical data. On the other hand, we investigate the privacy risks of machine learning models.

Members

Most Recent Publications

Year 2026

2026-03-24

Defeating Cerberus:
Privacy-Leakage Mitigation in Vision Language Models

Conference / Medium

European Association for Computational Linguistics (EACL)
Defeating Cerberus: Privacy-Leakage Mitigation in Vision Language Models

Tags

Trustworthy Information Processing

Authors

Boyang Zhang
Istemi Ekin Akkus
Ruichuan Chen
Alice Dethise
Klaus Satzke
Ivica Rimac
Yang Zhang

Full Paper Visit Detail Page

2026-01-28

Backdoor Complications:
A Comprehensive Analysis and Mitigation of the Unforeseen Consequences of Backdoor Attacks

Article

IEEE Transactions on Dependable and Secure Computing Backdoor Complications: A Comprehensive Analysis and Mitigation of the Unforeseen Consequences of Backdoor Attacks

Tags

Trustworthy Information Processing

Authors

Rui Zhang
Yun Shen
Hongwei Li
Wenbo Jiang
Hanxiao Chen
Yuan Zhang
Guowen Xu
Yang Zhang

Full Paper Visit Detail Page

2026-01-21

SL-CBM:
Enhancing Concept Bottleneck Models with Semantic Locality for Better Interpretability

Conference / Medium

National Conference of the American Association for Artificial Intelligence (AAAI)
SL-CBM: Enhancing Concept Bottleneck Models with Semantic Locality for Better Interpretability

Tags

Trustworthy Information Processing

Authors

Hanwei Zhang
Luo Chen
Rui Wen
Yang Zhang
Lijun Zhang
Holger Hermanns

Full Paper Visit Detail Page

Year 2025

2025-12-05

Adjacent Words, Divergent Intents:
Jailbreaking Large Language Models via Task Concurrency

Conference / Medium

Conference on Neural Information Processing Systems (NeurIPS)
Adjacent Words, Divergent Intents: Jailbreaking Large Language Models via Task Concurrency

Tags

Trustworthy Information Processing

Authors

Full Paper Visit Detail Page

2025-12-03

Finding and Reactivating Post-Trained LLMs’ Hidden Safety Mechanisms

Conference / Medium

Conference on Neural Information Processing Systems (NeurIPS)
Finding and Reactivating Post-Trained LLMs’ Hidden Safety Mechanisms

Tags

Trustworthy Information Processing

Authors

Full Paper Visit Detail Page

Yang Zhang

Machine Learning and Data Privacy

Head of Group

Email

Address

Members

Boyang Zhang

Yicong Tan

Chi Cui

Tianze Chang

Zeyuan Chen

Ye Leng

Mengfei Liang

Bo Shao

Xinyu Zhang

Zihan Wang

Shengyun Si

Most Recent Publications

Year 2026

Year 2025