<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Where Should The Data Reside?</title>
	<atom:link href="http://www.centernetworks.com/where-should-the-data-reside/feed" rel="self" type="application/rss+xml" />
	<link>http://www.centernetworks.com/where-should-the-data-reside</link>
	<description>Web 2 and Social Media News and Reviews</description>
	<lastBuildDate>Sat, 21 Nov 2009 01:50:56 -0600</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: anon</title>
		<link>http://www.centernetworks.com/where-should-the-data-reside/comment-page-1#comment-30923</link>
		<dc:creator>anon</dc:creator>
		<pubDate>Sat, 30 May 2009 07:51:50 +0000</pubDate>
		<guid isPermaLink="false">http://www.centernetworks.com/?p=15788#comment-30923</guid>
		<description>can you kill the gravatar on that last post? i wrote it on a cookie-less comp, but appreantly it follows ip&#039;s or something...it&#039;s not good writing

...or kill the whole post?

thanks.</description>
		<content:encoded><![CDATA[<p>can you kill the gravatar on that last post? i wrote it on a cookie-less comp, but appreantly it follows ip&#8217;s or something&#8230;it&#8217;s not good writing</p>
<p>&#8230;or kill the whole post?</p>
<p>thanks.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: anon</title>
		<link>http://www.centernetworks.com/where-should-the-data-reside/comment-page-1#comment-30920</link>
		<dc:creator>anon</dc:creator>
		<pubDate>Sat, 30 May 2009 07:48:52 +0000</pubDate>
		<guid isPermaLink="false">http://www.centernetworks.com/?p=15788#comment-30920</guid>
		<description>Couple disparate thoughts (it&#039;s late so I&#039;m a bit loopy):
--Google&#039;s Wave will make this...is it worse? or better? However you look at it...I THINK they&#039;ve got tools to embed Waves into your blog, his blog, her blog, everyone&#039;s blog can have Allens Wave...
--the general motive of these re-re-et-al-re-publishers is to get google ad cash. They dont know google&#039;s already filtering them out and/or they are a dime-a-dozen (search for some freeware and you&#039;ll find hundreds of sites with same content - its the same idea with these blog/tweat re-publishers)..so they get lost in the crowd. Sure Mahalo (etc) has a bigger footprint, but it doesn&#039;t matter...only sources matter (SEO) long term.
--@epc - good post...also, no way to FORCE an update on a remote site. And, no way to FORCE proper implementation of a protocol (say, if the protocol includes methods to force updates). eg. I did an email gateway years ago following the exact protocol. In the wild, it needed massive modding to accommodate everyone&#039;s goofy/broken protocol implementations (even big names like IBM/Novell/MS/etc)
--Twitter to FF to FB to X then Y then Z and back to Twitter...infinite loop? Sweeeet...
--We got one post on another blog and saw it copied - exact copies - on hundreds of sites. Didn&#039;t do anything - 99.99% of traffic came from original source (see above: re-publishers get few hits - I know someone&#039;s going to argue this point...&quot;I made $10 in one month...!!!&quot;, big deal.).
--once its on the inets and google has it, you&#039;re hosed. cat&#039;s out of bag
--this reflects in the so called &quot;Real Time&quot; web (a misnomer). It&#039;s only real-time as fast as the consumers are designed to re-consume...and as fast as the publishers can push. And as correct as consumers care to make it correct. Only the source is &quot;real time&quot; (even then, not really).
--&quot;My take is that it’s fine to display content from other social services but it should be a display only — not/never a store and retain.&quot; In theory I agree that&#039;s how it should work. But it goes back to caring - by BOTH parties. If the publisher doesn&#039;t care to display the data in a format people want (see: Twitter), a second party can come and suck it all into their system to format and re-display how users want to see it (see: Summize). And, if the &quot;real time&quot; publisher crashes, they don&#039;t want to be pulled down too if their data is a mixture.
--you mention &quot;data portability&quot;. Is this about &quot;data portability&quot;? Isn&#039;t &quot;data portability&quot; moving your private/semi-private account data from one site to another? Off topic, yeah, &quot;data portability&quot; is scary. Ever delve into FB Connect? Wow. Watch out what you wish for.</description>
		<content:encoded><![CDATA[<p>Couple disparate thoughts (it&#8217;s late so I&#8217;m a bit loopy):<br />
&#8211;Google&#8217;s Wave will make this&#8230;is it worse? or better? However you look at it&#8230;I THINK they&#8217;ve got tools to embed Waves into your blog, his blog, her blog, everyone&#8217;s blog can have Allens Wave&#8230;<br />
&#8211;the general motive of these re-re-et-al-re-publishers is to get google ad cash. They dont know google&#8217;s already filtering them out and/or they are a dime-a-dozen (search for some freeware and you&#8217;ll find hundreds of sites with same content &#8211; its the same idea with these blog/tweat re-publishers)..so they get lost in the crowd. Sure Mahalo (etc) has a bigger footprint, but it doesn&#8217;t matter&#8230;only sources matter (SEO) long term.<br />
&#8211;@epc &#8211; good post&#8230;also, no way to FORCE an update on a remote site. And, no way to FORCE proper implementation of a protocol (say, if the protocol includes methods to force updates). eg. I did an email gateway years ago following the exact protocol. In the wild, it needed massive modding to accommodate everyone&#8217;s goofy/broken protocol implementations (even big names like IBM/Novell/MS/etc)<br />
&#8211;Twitter to FF to FB to X then Y then Z and back to Twitter&#8230;infinite loop? Sweeeet&#8230;<br />
&#8211;We got one post on another blog and saw it copied &#8211; exact copies &#8211; on hundreds of sites. Didn&#8217;t do anything &#8211; 99.99% of traffic came from original source (see above: re-publishers get few hits &#8211; I know someone&#8217;s going to argue this point&#8230;&#8221;I made $10 in one month&#8230;!!!&#8221;, big deal.).<br />
&#8211;once its on the inets and google has it, you&#8217;re hosed. cat&#8217;s out of bag<br />
&#8211;this reflects in the so called &#8220;Real Time&#8221; web (a misnomer). It&#8217;s only real-time as fast as the consumers are designed to re-consume&#8230;and as fast as the publishers can push. And as correct as consumers care to make it correct. Only the source is &#8220;real time&#8221; (even then, not really).<br />
&#8211;&#8221;My take is that it’s fine to display content from other social services but it should be a display only — not/never a store and retain.&#8221; In theory I agree that&#8217;s how it should work. But it goes back to caring &#8211; by BOTH parties. If the publisher doesn&#8217;t care to display the data in a format people want (see: Twitter), a second party can come and suck it all into their system to format and re-display how users want to see it (see: Summize). And, if the &#8220;real time&#8221; publisher crashes, they don&#8217;t want to be pulled down too if their data is a mixture.<br />
&#8211;you mention &#8220;data portability&#8221;. Is this about &#8220;data portability&#8221;? Isn&#8217;t &#8220;data portability&#8221; moving your private/semi-private account data from one site to another? Off topic, yeah, &#8220;data portability&#8221; is scary. Ever delve into FB Connect? Wow. Watch out what you wish for.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Allen Stern</title>
		<link>http://www.centernetworks.com/where-should-the-data-reside/comment-page-1#comment-30808</link>
		<dc:creator>Allen Stern</dc:creator>
		<pubDate>Sat, 30 May 2009 03:18:29 +0000</pubDate>
		<guid isPermaLink="false">http://www.centernetworks.com/?p=15788#comment-30808</guid>
		<description>thanks for the great reply epc!</description>
		<content:encoded><![CDATA[<p>thanks for the great reply epc!</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: epc</title>
		<link>http://www.centernetworks.com/where-should-the-data-reside/comment-page-1#comment-30793</link>
		<dc:creator>epc</dc:creator>
		<pubDate>Sat, 30 May 2009 02:43:54 +0000</pubDate>
		<guid isPermaLink="false">http://www.centernetworks.com/?p=15788#comment-30793</guid>
		<description>There was some great discussion about this at the recent “Glue” conference but no clearcut answers.

At a technical lever there&#039;s a couple of problems: it&#039;s trivial to syndicate the data, but non-trivial to synchronize actions on the data.  If the feed is an Atom feed there&#039;s a notion of stubs to reflect that a given bit of content has been unpublished, but this concept doesn&#039;t exist in RSS which is what most sites use for data exchange.

There&#039;s also no notion of what I&#039;m going to call &quot;contractual use of data&quot;.  There&#039;s no way to obligate a subscribing party to either update a given element of data (maybe I published something in error and I want to push out the correction) or remove it (for whatever reason).

An author/publisher I know had a hell of a time getting bad data out of &quot;the system&quot; for a book he wrote.  Initially (years ago) he&#039;d talked to O&#039;Reilly about getting it published.  For whatever reason that didn&#039;t go through.  For reasons even O&#039;Reilly admits were in error, the book appeared in a database update of upcoming titles.  For the next several years the title showed up as an O&#039;Reilly title complete with erroneous ISBN even though the author and O&#039;Reilly quickly cleaned up the original bad data source.  It flowed out to Amazon, then other sites and even to this day resurfaces years later.

The problem with establishing some sort of contractual obligation on data flow is …isn&#039;t that DRM?  And it is in a way I guess, but not in the sense of preventing copies or use but in requiring some sense of fidelity to the original data.

Atom tried to achieve a first cut at this both with the stub idea for deletions as well as the requirement of a unique identifier for each chunk of content — the idea being that even if you republish my blog post from my personal site over here on CN, the original id is maintained, but in practice no one does this and the tools don&#039;t really support or enforce it.</description>
		<content:encoded><![CDATA[<p>There was some great discussion about this at the recent “Glue” conference but no clearcut answers.</p>
<p>At a technical lever there&#8217;s a couple of problems: it&#8217;s trivial to syndicate the data, but non-trivial to synchronize actions on the data.  If the feed is an Atom feed there&#8217;s a notion of stubs to reflect that a given bit of content has been unpublished, but this concept doesn&#8217;t exist in RSS which is what most sites use for data exchange.</p>
<p>There&#8217;s also no notion of what I&#8217;m going to call &#8220;contractual use of data&#8221;.  There&#8217;s no way to obligate a subscribing party to either update a given element of data (maybe I published something in error and I want to push out the correction) or remove it (for whatever reason).</p>
<p>An author/publisher I know had a hell of a time getting bad data out of &#8220;the system&#8221; for a book he wrote.  Initially (years ago) he&#8217;d talked to O&#8217;Reilly about getting it published.  For whatever reason that didn&#8217;t go through.  For reasons even O&#8217;Reilly admits were in error, the book appeared in a database update of upcoming titles.  For the next several years the title showed up as an O&#8217;Reilly title complete with erroneous ISBN even though the author and O&#8217;Reilly quickly cleaned up the original bad data source.  It flowed out to Amazon, then other sites and even to this day resurfaces years later.</p>
<p>The problem with establishing some sort of contractual obligation on data flow is …isn&#8217;t that DRM?  And it is in a way I guess, but not in the sense of preventing copies or use but in requiring some sense of fidelity to the original data.</p>
<p>Atom tried to achieve a first cut at this both with the stub idea for deletions as well as the requirement of a unique identifier for each chunk of content — the idea being that even if you republish my blog post from my personal site over here on CN, the original id is maintained, but in practice no one does this and the tools don&#8217;t really support or enforce it.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
