ICLR 2025

CipherPrune: Efficient and Scalable Private Transformer Inference

Yancheng Zhang, Jiaqi Xue, Mengxin Zheng, Mimi Xie, Mingzhe Zhang, Lei Jiang, Qian Lou

Abstract

Transformer-based models are widely used to process highly confidential information.

User queries often contain private information such as name, age, address, phone number, and email.

Example query*: "Hi, I'm Jamie Thompson, a 42-year-old software developer living at 835 Oakwood Drive, Portland, Oregon 97214. You can reach me at (503) 555-2187 if needed. I've been having trouble with my email account jamie.thompson98@email.com and would appreciate if you could help me troubleshoot the issue."

Figure labels: Query, Untrusted Server, Inference result.

Direct data sharing leads to privacy breaches.

*Fake content generated by ChatGPT for presentation purposes only.