Slashdot Log In
RSS & BT Together?
Posted by
CmdrTaco
on Tue Dec 16, 2003 10:07 AM
from the duct-taping-together-the-next-generation-of-p2p dept.
from the duct-taping-together-the-next-generation-of-p2p dept.
AntiPasto writes "According to this Yahoo! News article, RSS and BitTorrent could be set to join in a best-of-both-worlds content management system for the net. Possible?" Update: 03/17 21:39 GMT by T : Thanks to Steve Gillmor, here's the original story on eWeek to replace the now-dead Yahoo! link.
This discussion has been archived.
No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
RSS polling intervals (Score:5, Interesting)
Hm. That's interesting. The RubyForge [rubyforge.org] RSS feeds get polled every
half hour by a couple folks, i.e.:Hasn't caused problems yet, but maybe that's because RubyForge only gets about
30K-40K hits per day, and the feeds get just a fraction of that.
Re:RSS polling intervals (Score:5, Insightful)
Still, I'm not seeing anything beyond the "duh" factor here. All that needs to happen is for browsers to handle torrent links. Not some souped up napster app, a browser, so that I can type in a torrent link and get any web page (or other mime doc) for the browser to handle. Change the RSS to use the new URL scheme, and there you go. You could also do it as a proxy, but you run into worse cache coherency issues than with direct support of the protocol; who's to say who has the correct mapping of the content url to the torrent url?
Good luck, mind you, on getting anything but blogs, download sites, and perhaps hobby news sites to jump on board. This issue has been beaten to death in the IETF and many other circles, and it all boils down to content control -- the NY Times simply doesn't want its content mirrored like that.
Parent
Re:RSS polling intervals (Score:5, Informative)
My newsbot (in my
Parent
Re:RSS polling intervals (Score:2, Interesting)
Re:RSS polling intervals (Score:2)
That's a good point, but it's trivial to serve up "broken" or empty RSS if the requests are coming too often.. limit by IP. Would cost SLIGHTLY more in processing, but would save much in bandwidth, especially if the feed is large.
S
Re:RSS polling intervals (Score:3, Informative)
Re:RSS polling intervals (Score:5, Informative)
Parent
Re:RSS polling intervals (Score:3, Insightful)
A well behaved program won't go GETs on every RSS page, but will do HEADS, compare them to what it already has, and decide from there to get or not get the new page.
A HEAD request is very small, and unless you're doing millions of them, this shouldn't be an issue.
- Serge
Re:RSS polling intervals (Score:2)
Sure, well-behaved clients would do "HEAD" at a moderate interval. But, clients just can't be trusted; most users are "gimme gimme gimme"..
S
Re:RSS polling intervals (Score:4, Insightful)
> compare them to what it already has, and decide from there
> to get or not get the new page.
An even more behaved program will issue a GET with the "If-Modified-Since: " header, which will mean the server will return a "304 - not modified" if the file hasn't changed, or the actual file if it has.. Thus doing in one operation what a combined HEAD and followup GET would take 2 to do.
Parent
Use conditional GET, not HEAD (Score:5, Informative)
Parent
Re:RSS polling intervals (Score:2, Informative)
Neat idea. (Score:5, Interesting)
This could be carried further into a whole indymedia [indymedia.org] via BT. It would be even harder for governments and industry to silent dissident voices.
Re:Neat idea. (Score:4, Funny)
A couple weeks back, Indymedia had an article saying that the Protocols of Zion were created by the Illuminati to throw blame on the Jews while they take over the world.
There's a fine line between being a dissident and wearing a tin-foil hat, and many of the guys at Indymedia are squarely on the wrong side.
Parent
Re:Neat idea. (Score:3, Funny)
Awww....shit......
[BY THE POWER OF THE ILLUMINATED LIGHT: IMPLEMENT PLAN BETA. PLAN APLHA HAS BEEN SPOTTED BY THE MASSES]
Good concept (Score:2, Interesting)
I highly doubt it. (Score:3, Insightful)
I'd rather see BitTorrent improved in more... (Score:4, Insightful)
And setting up a server isn't quite easy.
It really could be a lot better with some work.
Re:I'd rather see BitTorrent improved in more... (Score:2, Informative)
create
put
run bttracker.py telling it which torrents are allowed (the location of that folder) and a folder to use for tmp files.
run btdownload.py like you normally would when resuming a download.
send the
(for more details RTFM! or STFW!)
Re:I'd rather see BitTorrent improved in more... (Score:4, Insightful)
The other problem being the relative difficulty of actually finding those 'random' websites that contain links to the things you'd actually want to download.
Parent
Re:I'd rather see BitTorrent improved in more... (Score:3, Informative)
http://pdtp.org/ [pdtp.org]
Ummm... (Score:5, Funny)
Re:Ummm... (Score:2)
BitTorrent is no-go for small files.. (Score:5, Interesting)
The tracker keeps, well, uhm, track, of the available pieces of the file, and every client reports in every time has got, or failed to get, a piece. So, using BitTorrent to distribute RSS feeds won't work, because the tracker will take up as much bandwidth, if not more, as a HTTP request, resulting in the "Not changed since your version" request.
Apart from that, well, yes, BitTorrent is great to distribute large files
Re:BitTorrent is no-go for small files.. (Score:2, Informative)
So you send out a new torrent through RSS referencing your new page instead of the regular RSS content, and your viewers use BitTorrent to work together to get the content from you without putting all the strain on your server. A .torrent file would be a lot smaller than a full RSS feed with images like he was using in the example.
Makes more s
Re:BitTorrent is no-go for small files.. (Score:2)
If he wants to save bandwidth. (Score:2, Interesting)
but i guess bumming off bittorrent/p2p bandwidth is not a bad idea either.
Re:If he wants to save bandwidth. (Score:4, Informative)
Akamai is great for offloading bandwidth and speeding up customer's page load times, but you're paying for the bandwidth one way or another.
Parent
Re:If he wants to save bandwidth. (Score:2)
It's not cheap, but for us it was cheaper than addin
Re:If he wants to save bandwidth. (Score:2)
Fun stuff. Good riddance, I say.
S
Konspire2b (Score:5, Informative)
why not nntp for syndication? (Score:5, Interesting)
Re:why not nntp for syndication? (Score:2)
Re:why not nntp for syndication? (Score:2)
Pushing checksums via the web does indeed reduce bandwidth in a best-case scenario, but if someone floods the newsgroup with fake updates all the aggregators will slam the website like mad trying (and failing) to verify the MD5 hash.
IRC (Score:4, Interesting)
fidonet (Score:5, Interesting)
The key to usefulness however, is enabling technology to prioritize and authenticate the RSS feeds in some way.
Re:fidonet (Score:3, Interesting)
The idea goes like this:
If you want to host a RSS feed, you run a program that is basically a peer cache. People hit your IP and "subscribe" to the feed. You give them a list of other subscribers' IPs and the public key for the feed. The client then hits these peers and checks to see who has faster bandwidth. If the pe
Whoah. </keanu> (Score:5, Informative)
Anyway, FidoNet was not without its share of problems. The killing bullet, I'd say today, was the social factor - there were too conservative forces clinging to backwards compatibility at the cost of anything. Anything had to work with the most basic piece of software; this effectively shot progress and evolution dead.
Not that there weren't attempts. There were. They just weren't successful.
Anyway, setting up echoes would have the same problems as FidoNet echoes. The number one problem was typical for Slashdot: DUPES!
Echoes were set up so that one node relayed a message in an echomail forum to its other connected nodes for a particular echo, effectively creating a star topology, different for each forum. However, since each sysop just wanted the echo linked, he would just hook up to somewhere, and forget about it. Then, others would hook up from him, and all of a sudden somebody had hooked up to two different valid uplinks.
The result? The star topology all of a sudden had a loop in it. Messages would keep circling (since FidoNet used dedicated dialup lines, latency between nodes was typically in the hours range) and dupe filters were created.
All of those filters and filter-enabling tags were optional, of course. After all, you couldn't mandate an operational node to change its behavior, you could just ask nicely.
Political play to no ends.
Anyway, there were many other funny effects with EchoMail. Crosslinking was another - when one echo got linked to another at a node, so that all messages in echo X would enter echo Y at that node and vice versa. The most exotic of these was when a religious echo got crosslinked with a fantasy humor one -- through crosslinked physical directories at a node (the FAT pointers for the different directories hosting the two echoes pointed to the same location on the disk). Anyway, much hilarious discussion ensued, and not many understood much what people were trying to say in the crosslinked echo.
/ former sysop and NEC in FidoNet
Parent
Content management system ? (Score:5, Interesting)
Sounds more like a (possibly improved) content delivery system.
Too bad the article didn't indicate anything about content management.
WebTorrent (Score:4, Insightful)
The drawback of the WebTorrent idea is that you need some way to group all the images, text and stylesheets together, otherwise you have to make a n inefficient P2P request for each one. RSS is a great way of doing that.
There aren't many details online at the moment of the work we did on the WebTorrent idea; it was mainly an e-mail thread -- get in touch if you'd like details. The project page [seldo.com] is available, but I stopped updating it so it doesn't have all the work that was eventually done.
Re:WebTorrent (Score:3, Insightful)
Next logical step... (Score:3, Interesting)
These problems are not insurmountable, but they are not insignificant, either. Thus, I don't think that RSS+BT is the instant-gratification, no-risk paradise that the Yahoo article makes it out to be.
A notification server protocol? cf. NTP/NNTP/SIP (Score:3, Interesting)
A) Sounds nice, but even without a torrent, using an open source hash algorithm (client and server agree on how to calculate the hash) would provide a way for the client to only download the hash value itself to check for freshness.
This way,
1 the author knows how many people have consumed the data and their general geographic distribution.
2 the author can make a decision to stop publication, which problematic but at any rate easier to enforce than if he or she starts out authorizing a torrent.
3 the author is free to pay for bandwidth if he or she will happily serve one per user just not a zillion per poller.
B) To be sure, it is easy to see who publishes an RSS feed / incites a Torrent download over somebody's infrastructure, whereas it is not so easy to discover the identity of an anonymous coward. You could also publish a pseudo-RSS feed itself exclusively over the torrent network using sequential filenames for more anonymity maybe..um.
C) Personally I have a current need for frequently updated RSS for a certain application and I'd set up a server that my internal network clients would poll frequently. But I'd still need for one machine to know the instant things change on the web too.
D) I'm wondering if a hierarchical network of servers might be useful here to publish event notifications. UDP is lossy, and we don't want to lose any events so that's out I guess. In NTP, various strata of time servers are used and clients try to sync to Greenwich time (light data) by the best route available. In NNTP, a client usually uses only one news server to get a fat feed, and different server owners often choose to handle only a subset of what's available in the whole world, which might also be the case (try serving every event of importance to someone in the world.. what is the bandwidth needed for that? How many bits to describe it in ip-like dot format?)
Probably there is another service that does what I'd like and it just flew out my left ear, but it just seemed to me that the best thing would be to combine the lightweight NTP network which lets clients synchronize their understanding of time despite general flakiness, and the NNTP network which allows different servers to decide to serve only certain segments of the worldwide aggregated feed.
And SIP does a lot of things that might be useful. And there is MDS (metacomputing directory service for the "semantic grid" - pdf [ic.ac.uk] / google's html [216.239.57.104]). And there's Jini ..
Anyway we do want to know some things with at least one minute resolution. (A storm watch? A news headline so we can turn on the TV or video stream?) At TV stations I know people constantly are watching the TV out of the corner of their eye to see if something earth-shattering comes up. How about a chime to tell you to look instead? How else to people get First Post? ;) I'd just like to beat normal notification systems for current events and website updates, for starters, based on a relatively robust and timely mechanism.
Maybe a low bandwidth network with some of these characteristics would be useful to distribute update event notifications that filter down to everyone's devces. A big company could have one or two machines consuming a global event notification thread, add events only it knows about, and serve this information on a push or pull basis to all its employees. Hmmm, tasty. Come to think of it I want something like that for another project too, Does anything already do this?
One interesting paper (2001) I found is on an emergency notification network based on subscribe/notify messages over SIP, a widespread voice over ip prot
A better solution is IP multicasting (Score:3, Insightful)
A better solution would be eliminating the need for syndicators to constnantly poll waiting for RSS updates by using IP multicasting to notify syndicators of when the content of a particular RSS feed has changed. Multicast protocols which provide such announcements already exist, such as the Session Announcement Protocol [faqs.org] which would notify those curious of updated RSS feeds. A URL to the updated feed would be provided, and afterwards whatever file transfer protocol you wish could be used to fetch the RSS feed itself, even BitTorrent.
Re:genius (Score:2)
Re:To me, (Score:3, Funny)
Re:RSS + BT = USENET + NNTP (Score:3, Insightful)
Does your usenet reader serve news articles to other users?
No, you need a costly usenet servers architecture. Not only machines, but also huuuge bandwith. Today's usenet servers that want to serve large portion of world hierarchies can only get it via dedicated satellite usenet-only feeds.
RSS+BT on the other hand is poor server and rich clients that exchange articles between themselves via p2p network only supervised by a BT tracker.
Robert
Re:RSS + BT = USENET + NNTP (Score:2)
Things RSS has been struggling with like character encoding, attachments (enclosures), scaling, and other issues are things that NNTP solved long ago.
RSS + Torrent would be an excellent replacement for binaries newsgroups though.
Re:RSS + BT = USENET + NNTP (Score:4, Insightful)
Yes: the way people traditionally read USENET news is by becoming a USENET node, downloading articles to the directory hierarchy of the local machine, and then redistributing them to neighboring sites. Reading news by connecting to centralized news servers via a network client happened many years later.
No, you need a costly usenet servers architecture.
There is nothing intrinsically "costly" about it: it's something a PDP-11 used to handle and that regularly ran over dial-up.
Not only machines, but also huuuge bandwith. Today's usenet servers that want to serve large portion of world hierarchies can only get it via dedicated satellite usenet-only feeds.
Just like a BT solution, you only redistribute those articles that you yourself are interested in.
The reason why we got a USENET infrastructure with a small number of backbone sites (compared to the readership) that carried everything is simply because a bunch of sites took on that role and carry everything. There is nothing in the protocol or design of USENET that requires it.
RSS+BT on the other hand is poor server and rich clients that exchange articles between themselves via p2p network only supervised by a BT tracker.
And you believe that BT and the BT tracker scales up to many billions of files on millions of nodes by sheer magic? BT would probably need a lot of work to scale up. And at least USENET doesn't need any supervision by anything--it's completely asynchronous and unsupervised.
Note that I did not claim that USENET would work any better than RSS+BT--I have no idea whether it would--simply that people are basically reinventing USENET when they combine RSS and BT.
I actually suspect that there are intrinsic properties of large peer-to-peer news networks that people don't like because that's why USENET became more and more centralized over the years.
What morron modded parent as insightful?
That's what I would ask about your posting. In fact, I would ask what moron wrote it.
Parent
Re:Stupid article (Score:2, Interesting)