
A. Differential Privacy
Differential privacy is a field pioneered by researchers at Microsoft, alongside a handful of academics, and later adopted by companies such as Apple. The animating principle behind differential privacy, as articulated by its original proponent Cynthia Dwork, is that responses to dataset queries should not provide enough information to identify any individual included in the dataset.12
Differential privacy is ultimately a mathematical definition of privacy that considers whether a particular person’s data has a significant impact on the answer to a dataset query; if it does not, then the data will not identify the person it describes.13 The identifiability of information is (as we have undoubtedly discovered)14 not a binary question, but a probabilistic one. How much of an impact the data must have on the query to be excluded — and by extension how likely it is that a query would lead to personal identification — depends on a “privacy budget” set by the holder of the data, which defines how much information leakage is considered acceptable.15
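For readers who want the formal statement behind this intuition, a minimal sketch of the standard definition (following Dwork, supra note 12) is the following, where the parameter ε is the quantity that the privacy budget caps:

\Pr[\mathcal{M}(D) \in S] \;\le\; e^{\varepsilon} \cdot \Pr[\mathcal{M}(D') \in S]

Here \mathcal{M} is the randomized query-answering mechanism, D and D' are any two datasets that differ in a single person’s record, and S is any set of possible answers. The smaller ε is, the less any one person’s data can shift the distribution of answers, and the harder it is to tell from a query result whether that person was in the dataset at all.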
Setting an appropriate privacy budget is therefore crucial to the proper use of differential privacy techniques. And because of the way that differential privacy works, there is an inherent tradeoff between the level of privacy afforded to data subjects and the accuracy of the query results. This is because differential privacy is performed primarily by injecting noise (randomness) into a dataset in such a way that the outputs or conclusions generated by the data are minimally impacted while privacy protection is enhanced.16 The amount of noise introduced will depend on the specified amount of acceptable data leakage and the way the data will be used. Just as data leakage will never reach zero, neither will the amount of error introduced by the noise.
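To make that tradeoff concrete, the sketch below illustrates the Laplace mechanism, one common way of injecting noise. It is a simplified illustration, not any particular company’s implementation; the function name and example figures are invented for exposition. The noise added to a true count is drawn from a Laplace distribution whose scale is the query’s sensitivity divided by the privacy budget ε, so a tighter budget yields noisier, less accurate answers.

# A minimal sketch of the Laplace mechanism for a counting query.
# Assumption (not from the article): a counting query has sensitivity 1,
# because adding or removing one person changes the count by at most 1.
import numpy as np

def laplace_count(true_count, epsilon, sensitivity=1.0):
    # The noise scale grows as the privacy budget epsilon shrinks.
    scale = sensitivity / epsilon
    noise = np.random.laplace(loc=0.0, scale=scale)
    return true_count + noise

# The same query under a loose and a tight privacy budget.
true_count = 1000
print(laplace_count(true_count, epsilon=1.0))    # typically within a few units of 1000
print(laplace_count(true_count, epsilon=0.01))   # routinely off by 100 or more

The expected size of the error is the sensitivity divided by ε, which is why, as noted above, the error never reaches zero: pushing ε toward zero for ever-stronger privacy pushes the noise, and therefore the error, up without bound.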
Apple has developed more sophisticated differential privacy techniques that incorporate hashing and subsampling into its methodology as
12. See Cynthia Dwork, Differential Privacy, 33 INT’L COLLOQUIUM ON AUTOMATA, LANGUAGES AND PROGRAMMING 1 (2006).
13. Matthew Green, What Is Differential Privacy?, A FEW THOUGHTS ON CRYPTOGRAPHIC ENGINEERING (June 15, 2016), https://blog.cryptographyengineering.com/2016/06/15/what-is-differential-privacy/ [https://perma.cc/73YU-RKJZ].
14. For example, Netflix’s publicly released viewing dataset for an algorithmic design contest turned out to be insufficiently anonymized because researchers discovered that the dataset could be used to re-identify certain viewers when combined with publicly-available data. This led to inquiries by the FTC and a California class-action lawsuit against Netflix. See Andrew Chin & Anne Klinefelter, Differential Privacy as a Response to the Reidentification Threat: The Facebook Advertiser Case Study, 90 N.C. L. REV. 1417, 1424 (2012). In another case, Latanya Sweeney published a study in which she merged supposedly anonymized Massachusetts worker hospital records with easily acquired voter registration records, and found she was able to identify the health records of then-Governor William Weld; she later published “a broader study finding that 87% of the 1990 U.S. Census population could be identified using only gender, zip code, and full date of birth.” Id. at 1425.
15. Green, supra note 13.
16. Id.