<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments on: An open letter to Netflix from the authors of the de-anonymization paper</title>
	<atom:link href="http://33bits.org/2010/03/15/open-letter-to-netflix/feed/" rel="self" type="application/rss+xml" />
	<link>http://33bits.org/2010/03/15/open-letter-to-netflix/</link>
	<description>The End of Anonymized Data and What to Do About It</description>
	<lastBuildDate>Tue, 06 Dec 2011 23:45:54 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
	<item>
		<title>By: Arvind</title>
		<link>http://33bits.org/2010/03/15/open-letter-to-netflix/#comment-1437</link>
		<dc:creator><![CDATA[Arvind]]></dc:creator>
		<pubDate>Thu, 27 May 2010 17:59:42 +0000</pubDate>
		<guid isPermaLink="false">http://33bits.org/?p=452#comment-1437</guid>
		<description><![CDATA[Yes, there is no agreed-upon value of epsilon. That is indeed one stumbling block when implementing differential privacy.

However that&#039;s a different problem than the difficulty with other definitions. Not having a magic value of epsilon does not make differential privacy any difficult to enforce once you do pick epsilon.

It would seem that a discussion group for everyone implementing or thinking of implementing differential privacy would be valuable.]]></description>
		<content:encoded><![CDATA[<p>Yes, there is no agreed-upon value of epsilon. That is indeed one stumbling block when implementing differential privacy.</p>
<p>However that&#8217;s a different problem than the difficulty with other definitions. Not having a magic value of epsilon does not make differential privacy any difficult to enforce once you do pick epsilon.</p>
<p>It would seem that a discussion group for everyone implementing or thinking of implementing differential privacy would be valuable.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mimi Yin</title>
		<link>http://33bits.org/2010/03/15/open-letter-to-netflix/#comment-1435</link>
		<dc:creator><![CDATA[Mimi Yin]]></dc:creator>
		<pubDate>Wed, 26 May 2010 16:57:06 +0000</pubDate>
		<guid isPermaLink="false">http://33bits.org/?p=452#comment-1435</guid>
		<description><![CDATA[Theoretical and practical evaluations of differential privacy aside, has anyone tried to put a number to the differential privacy guarantee? Meaning, has anyone tried to quantify your record will be almost indiscernible. Or the presence of your record will have negligible impact on answers given.

The organization I’m working with (The Common Data Project) is experimenting with using differential privacy to “automate” the release of sensitive data to the public.

One hurdle we’ve run up against is that there doesn’t seem to be an agreed upon “magic number” to quantify the “negligable impact” or “almost indiscernible” language in the guarantee?

Without such a quantity, we’re concerned that the differential privacy guarantee runs into some of the same problems that plague current definitions? (ie. vagueness which translates into difficult to enforce)

Does anyone here happen to know of any papers or discussions on this particular aspect of differential privacy? Any leads would be greatly appreciated!

We’ve documented our quest for the quantifiable guarantee in excruciating detail here: http://bit.ly/bj8gWF]]></description>
		<content:encoded><![CDATA[<p>Theoretical and practical evaluations of differential privacy aside, has anyone tried to put a number to the differential privacy guarantee? Meaning, has anyone tried to quantify your record will be almost indiscernible. Or the presence of your record will have negligible impact on answers given.</p>
<p>The organization I’m working with (The Common Data Project) is experimenting with using differential privacy to “automate” the release of sensitive data to the public.</p>
<p>One hurdle we’ve run up against is that there doesn’t seem to be an agreed upon “magic number” to quantify the “negligable impact” or “almost indiscernible” language in the guarantee?</p>
<p>Without such a quantity, we’re concerned that the differential privacy guarantee runs into some of the same problems that plague current definitions? (ie. vagueness which translates into difficult to enforce)</p>
<p>Does anyone here happen to know of any papers or discussions on this particular aspect of differential privacy? Any leads would be greatly appreciated!</p>
<p>We’ve documented our quest for the quantifiable guarantee in excruciating detail here: <a href="http://bit.ly/bj8gWF" rel="nofollow">http://bit.ly/bj8gWF</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Is Making Public Data &#8220;More Public&#8221; a Privacy Violation? &#171; 33 Bits of Entropy</title>
		<link>http://33bits.org/2010/03/15/open-letter-to-netflix/#comment-1321</link>
		<dc:creator><![CDATA[Is Making Public Data &#8220;More Public&#8221; a Privacy Violation? &#171; 33 Bits of Entropy]]></dc:creator>
		<pubDate>Mon, 05 Apr 2010 18:12:28 +0000</pubDate>
		<guid isPermaLink="false">http://33bits.org/?p=452#comment-1321</guid>
		<description><![CDATA[[...] third reason is the &#8220;greater good.&#8221; I&#8217;ve opposed that line of reasoning when used to justify reneging on an explicit privacy promise. But when it [...]]]></description>
		<content:encoded><![CDATA[<p>[...] third reason is the &#8220;greater good.&#8221; I&#8217;ve opposed that line of reasoning when used to justify reneging on an explicit privacy promise. But when it [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: AnonCryptographer</title>
		<link>http://33bits.org/2010/03/15/open-letter-to-netflix/#comment-1316</link>
		<dc:creator><![CDATA[AnonCryptographer]]></dc:creator>
		<pubDate>Sun, 04 Apr 2010 02:16:41 +0000</pubDate>
		<guid isPermaLink="false">http://33bits.org/?p=452#comment-1316</guid>
		<description><![CDATA[I&#039;m with Dan Kaminsky.  I don&#039;t have the guts to use my real name, but I&#039;ve read about differential privacy, and I&#039;m quite skeptical that it&#039;ll work for this.  I don&#039;t think the problem is that I fail to understand differential privacy; I think the problem is that anonymization is darn hard.

But whatever.  The burden of proof is not on me; the burden of proof is on you, to prove convincingly that it is possible to anonymize/privatize this kind of data, without loss of utility.  So far, you haven&#039;t done so. The ball&#039;s in your court.  Let&#039;s see the proof.

For instance, how about demonstrating how you would have anonymized the first Netflix data set.  Prove that your method preserves anonymity, and doesn&#039;t eliminate utility.  Then we can talk.]]></description>
		<content:encoded><![CDATA[<p>I&#8217;m with Dan Kaminsky.  I don&#8217;t have the guts to use my real name, but I&#8217;ve read about differential privacy, and I&#8217;m quite skeptical that it&#8217;ll work for this.  I don&#8217;t think the problem is that I fail to understand differential privacy; I think the problem is that anonymization is darn hard.</p>
<p>But whatever.  The burden of proof is not on me; the burden of proof is on you, to prove convincingly that it is possible to anonymize/privatize this kind of data, without loss of utility.  So far, you haven&#8217;t done so. The ball&#8217;s in your court.  Let&#8217;s see the proof.</p>
<p>For instance, how about demonstrating how you would have anonymized the first Netflix data set.  Prove that your method preserves anonymity, and doesn&#8217;t eliminate utility.  Then we can talk.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Thomas</title>
		<link>http://33bits.org/2010/03/15/open-letter-to-netflix/#comment-1306</link>
		<dc:creator><![CDATA[Thomas]]></dc:creator>
		<pubDate>Fri, 02 Apr 2010 04:04:52 +0000</pubDate>
		<guid isPermaLink="false">http://33bits.org/?p=452#comment-1306</guid>
		<description><![CDATA[Hi Dan,

You seem to misunderstand differential privacy -- I recommend sitting down and reading one of these papers. I don&#039;t know what you mean when you say it &quot;won&#039;t actually work&quot;. If you read the mathematical definition, you will see that it offers quite a strong privacy guarantee. And there are many papers showing that for a surprisingly large number of tasks, it does work -- you get useful algorithms (including for netflix recommender systems) that -provably- preserve differential privacy.]]></description>
		<content:encoded><![CDATA[<p>Hi Dan,</p>
<p>You seem to misunderstand differential privacy &#8212; I recommend sitting down and reading one of these papers. I don&#8217;t know what you mean when you say it &#8220;won&#8217;t actually work&#8221;. If you read the mathematical definition, you will see that it offers quite a strong privacy guarantee. And there are many papers showing that for a surprisingly large number of tasks, it does work &#8212; you get useful algorithms (including for netflix recommender systems) that -provably- preserve differential privacy.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Arvind</title>
		<link>http://33bits.org/2010/03/15/open-letter-to-netflix/#comment-1289</link>
		<dc:creator><![CDATA[Arvind]]></dc:creator>
		<pubDate>Wed, 24 Mar 2010 16:16:44 +0000</pubDate>
		<guid isPermaLink="false">http://33bits.org/?p=452#comment-1289</guid>
		<description><![CDATA[That is very interesting. Thanks for letting us know!]]></description>
		<content:encoded><![CDATA[<p>That is very interesting. Thanks for letting us know!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Marcin Wojnarski &#124; TunedIT</title>
		<link>http://33bits.org/2010/03/15/open-letter-to-netflix/#comment-1288</link>
		<dc:creator><![CDATA[Marcin Wojnarski &#124; TunedIT]]></dc:creator>
		<pubDate>Wed, 24 Mar 2010 10:03:23 +0000</pubDate>
		<guid isPermaLink="false">http://33bits.org/?p=452#comment-1288</guid>
		<description><![CDATA[... protecting privacy will likely require setting up an online system for data analysis rather than an “anonymize and release” approach ...

TunedIT (http://tunedit.org) is a challenge platform where online data analysis is possible, without releasing the data. Participants may submit executable code of their learning algorithm, which is then trained and tested on a dataset that&#039;s kept secret on the server.

This approach was used recently in Advanced Track of RSCTC Discovery Challenge that concerned DNA microarray data analysis (http://tunedit.org/challenge/RSCTC-2010-A) - online scoring of algorithms allowed for more precise evaluation, because every algorithm could have been trained multiple times on the same dataset, using different splits - something like cross-validation or leave-one-out approach. This was important because microarray data are very sparse by nature and single train+test evaluation doesn&#039;t work pretty well.]]></description>
		<content:encoded><![CDATA[<p>&#8230; protecting privacy will likely require setting up an online system for data analysis rather than an “anonymize and release” approach &#8230;</p>
<p>TunedIT (<a href="http://tunedit.org" rel="nofollow">http://tunedit.org</a>) is a challenge platform where online data analysis is possible, without releasing the data. Participants may submit executable code of their learning algorithm, which is then trained and tested on a dataset that&#8217;s kept secret on the server.</p>
<p>This approach was used recently in Advanced Track of RSCTC Discovery Challenge that concerned DNA microarray data analysis (<a href="http://tunedit.org/challenge/RSCTC-2010-A" rel="nofollow">http://tunedit.org/challenge/RSCTC-2010-A</a>) &#8211; online scoring of algorithms allowed for more precise evaluation, because every algorithm could have been trained multiple times on the same dataset, using different splits &#8211; something like cross-validation or leave-one-out approach. This was important because microarray data are very sparse by nature and single train+test evaluation doesn&#8217;t work pretty well.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Six Lines &#124; Netflix Privacy Research</title>
		<link>http://33bits.org/2010/03/15/open-letter-to-netflix/#comment-1285</link>
		<dc:creator><![CDATA[Six Lines &#124; Netflix Privacy Research]]></dc:creator>
		<pubDate>Tue, 23 Mar 2010 15:39:13 +0000</pubDate>
		<guid isPermaLink="false">http://33bits.org/?p=452#comment-1285</guid>
		<description><![CDATA[[...] Narayanan and Vitaly Shmatikov, the authors of the Netflix dataset de-anonymization paper, wrote an open letter to Netflix about their recent cancellation of their second contest due to privacy concerns. It&#8217;s worth [...]]]></description>
		<content:encoded><![CDATA[<p>[...] Narayanan and Vitaly Shmatikov, the authors of the Netflix dataset de-anonymization paper, wrote an open letter to Netflix about their recent cancellation of their second contest due to privacy concerns. It&#8217;s worth [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dan Kaminsky</title>
		<link>http://33bits.org/2010/03/15/open-letter-to-netflix/#comment-1255</link>
		<dc:creator><![CDATA[Dan Kaminsky]]></dc:creator>
		<pubDate>Thu, 18 Mar 2010 09:50:37 +0000</pubDate>
		<guid isPermaLink="false">http://33bits.org/?p=452#comment-1255</guid>
		<description><![CDATA[Arvind,

   Oh come on.  The first thing I saw when I clicked that link was:

===
Unlike prior privacy work concerned with cryptographically securing the computation of recommendations, differential privacy constrains a computation in a way that precludes any inference about the underlying records from its output.
===

   In other words, a better way of anonymizing data, such that its release does not impact privacy -- something we both know won&#039;t actually work.

   Look.  I&#039;m totally sympathetic to your position.  I&#039;ll even grant that, if you hadn&#039;t done all this, someone else would have, and people probably shouldn&#039;t be angry at you for this inevitable happening.  But, it&#039;s where we are.]]></description>
		<content:encoded><![CDATA[<p>Arvind,</p>
<p>   Oh come on.  The first thing I saw when I clicked that link was:</p>
<p>===<br />
Unlike prior privacy work concerned with cryptographically securing the computation of recommendations, differential privacy constrains a computation in a way that precludes any inference about the underlying records from its output.<br />
===</p>
<p>   In other words, a better way of anonymizing data, such that its release does not impact privacy &#8212; something we both know won&#8217;t actually work.</p>
<p>   Look.  I&#8217;m totally sympathetic to your position.  I&#8217;ll even grant that, if you hadn&#8217;t done all this, someone else would have, and people probably shouldn&#8217;t be angry at you for this inevitable happening.  But, it&#8217;s where we are.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Arvind</title>
		<link>http://33bits.org/2010/03/15/open-letter-to-netflix/#comment-1254</link>
		<dc:creator><![CDATA[Arvind]]></dc:creator>
		<pubDate>Thu, 18 Mar 2010 09:11:24 +0000</pubDate>
		<guid isPermaLink="false">http://33bits.org/?p=452#comment-1254</guid>
		<description><![CDATA[Privacy is not the same as anonymization, and differential privacy is not a better way of anonymizing data. Rather, I see it as a way of bypassing the need for anonymization. For example, see &lt;a href=&quot;http://research.microsoft.com/apps/pubs/default.aspx?id=80511&quot; rel=&quot;nofollow&quot;&gt;this paper.&lt;/a&gt;]]></description>
		<content:encoded><![CDATA[<p>Privacy is not the same as anonymization, and differential privacy is not a better way of anonymizing data. Rather, I see it as a way of bypassing the need for anonymization. For example, see <a href="http://research.microsoft.com/apps/pubs/default.aspx?id=80511" rel="nofollow">this paper.</a></p>
]]></content:encoded>
	</item>
</channel>
</rss>

