RSS & BT Together? 161
AntiPasto writes "According to this Yahoo! News article, RSS and BitTorrent could be set to join in a best-of-both-worlds content management system for the net. Possible?" Update: 03/17 21:39 GMT by T : Thanks to Steve Gillmor, here's the original story on eWeek to replace the now-dead Yahoo! link.
RSS polling intervals (Score:5, Interesting)
Hm. That's interesting. The RubyForge [rubyforge.org] RSS feeds get polled every
half hour by a couple folks, i.e.: Hasn't caused problems yet, but maybe that's because RubyForge only gets about
30K-40K hits per day, and the feeds get just a fraction of that.
Re:RSS polling intervals (Score:1, Insightful)
Re:RSS polling intervals (Score:1)
S
Re:RSS polling intervals (Score:2, Interesting)
Re:RSS polling intervals (Score:2)
That's a good point, but it's trivial to serve up "broken" or empty RSS if the requests are coming too often.. limit by IP. Would cost SLIGHTLY more in processing, but would save much in bandwidth, especially if the feed is large.
S
Re:RSS polling intervals (Score:3, Informative)
Re:RSS polling intervals (Score:5, Informative)
Re:RSS polling intervals (Score:1)
So explain how I get 230K/s download with only 10K/s upload using the ABC client?
graspee
Re:RSS polling intervals (Score:1)
Re:RSS polling intervals (Score:2, Informative)
That isn't unusual. If there are plenty of uploaders with plenty of upstream capacity, you can expect fast downloads pretty much without uploading anything. The BitTorrent idea really comes into play when everyone is trying to download at the same time. It guarantees some level of fairness since other clients will give you faster downloads if you are being generous in uploading to them.
Strange things do still happen though. S
Re:RSS polling intervals (Score:1)
Re:RSS polling intervals (Score:3, Insightful)
A well behaved program won't go GETs on every RSS page, but will do HEADS, compare them to what it already has, and decide from there to get or not get the new page.
A HEAD request is very small, and unless you're doing millions of them, this shouldn't be an issue.
- Serge
Re:RSS polling intervals (Score:2)
Sure, well-behaved clients would do "HEAD" at a moderate interval. But, clients just can't be trusted; most users are "gimme gimme gimme"..
S
Re:RSS polling intervals (Score:4, Insightful)
> compare them to what it already has, and decide from there
> to get or not get the new page.
An even more behaved program will issue a GET with the "If-Modified-Since: " header, which will mean the server will return a "304 - not modified" if the file hasn't changed, or the actual file if it has.. Thus doing in one operation what a combined HEAD and followup GET would take 2 to do.
Use conditional GET, not HEAD (Score:5, Informative)
Re:RSS polling intervals (Score:1)
Re:RSS polling intervals (Score:2, Informative)
Re:RSS polling intervals (Score:1)
If slashdot cared so much about their bandwidth they would fix the html so that it was html 4.0 strict and using css for all of the layout. The bandwidth savings would be average 14Gb a day.
In fact there is an interesting article on A List Apart [alistapart.com] citing all of the changes.
Re:RSS polling intervals (Score:2)
Duh!
Re:RSS polling intervals (Score:5, Insightful)
Still, I'm not seeing anything beyond the "duh" factor here. All that needs to happen is for browsers to handle torrent links. Not some souped up napster app, a browser, so that I can type in a torrent link and get any web page (or other mime doc) for the browser to handle. Change the RSS to use the new URL scheme, and there you go. You could also do it as a proxy, but you run into worse cache coherency issues than with direct support of the protocol; who's to say who has the correct mapping of the content url to the torrent url?
Good luck, mind you, on getting anything but blogs, download sites, and perhaps hobby news sites to jump on board. This issue has been beaten to death in the IETF and many other circles, and it all boils down to content control -- the NY Times simply doesn't want its content mirrored like that.
Re:RSS polling intervals (Score:2)
I don't see how this is supposed to help. The problem that BitTorrent addresses is different from that faced by a popular RSS service.
BitTorrent has proven useful so far to preserve bandwidth. It's handy when distributing files of greater than 1 megabyte in size- usually much larger, such as 650 meg ISO images. It's effectiveness comes from the fact that indivudual download
Re:RSS polling intervals (Score:5, Informative)
My newsbot (in my
Re:RSS polling intervals (Score:2)
Re:RSS polling intervals (Score:2)
Solution (Score:2)
A simple RSS feed to the tune of, "Go bugger yourself, and either don't hit my site so often or use an RSS spider that respects 304s," would probably work wonders.
b.c
Re:RSS polling intervals (Score:2)
Re:RSS polling intervals (Score:1)
Re:Ok.. (Score:1)
It's a free hosting site for open source Ruby projects.
genius (Score:1, Funny)
Re:genius (Score:2)
Re:genius (Score:1)
Neat idea. (Score:5, Interesting)
This could be carried further into a whole indymedia [indymedia.org] via BT. It would be even harder for governments and industry to silent dissident voices.
Re:Neat idea. (Score:1)
Re:Neat idea. (Score:4, Funny)
A couple weeks back, Indymedia had an article saying that the Protocols of Zion were created by the Illuminati to throw blame on the Jews while they take over the world.
There's a fine line between being a dissident and wearing a tin-foil hat, and many of the guys at Indymedia are squarely on the wrong side.
Re:Neat idea. (Score:3, Funny)
Awww....shit......
[BY THE POWER OF THE ILLUMINATED LIGHT: IMPLEMENT PLAN BETA. PLAN APLHA HAS BEEN SPOTTED BY THE MASSES]
Re:Neat idea. (Score:2)
Good concept (Score:2, Interesting)
I highly doubt it. (Score:3, Insightful)
I'd rather see BitTorrent improved in more... (Score:4, Insightful)
And setting up a server isn't quite easy.
It really could be a lot better with some work.
Re:I'd rather see BitTorrent improved in more... (Score:2, Informative)
create
put
run bttracker.py telling it which torrents are allowed (the location of that folder) and a folder to use for tmp files.
run btdownload.py like you normally would when resuming a download.
send the
(for more details RTFM! or STFW!)
Re:I'd rather see BitTorrent improved in more... (Score:4, Insightful)
The other problem being the relative difficulty of actually finding those 'random' websites that contain links to the things you'd actually want to download.
Re:I'd rather see BitTorrent improved in more... (Score:1)
Setting up sharing for Bittorrent (Score:1, Interesting)
It has a simple option --share that automatically shares a file or directory through bittorrent by creating the metainfo file on the fly, launching a mini webserver to serve the .torrent file and acts as the bittorent clients that acts as the initial seed.
And it is a nice and simple commandline tool.
Re:I'd rather see BitTorrent improved in more... (Score:3, Informative)
http://pdtp.org/ [pdtp.org]
Ummm... (Score:5, Funny)
Re:Ummm... (Score:1)
No, I don't think blogs themselves (with an average of 12 readers each) are very powerful. But combined with massive syndication and micro content searchable with keywords, a needle in the haystack could be felt.
You're right though, most bloggers (er... perhaps myself included) have a very high opinion of their content.
Re:Ummm... (Score:2)
BitTorrent is no-go for small files.. (Score:5, Interesting)
The tracker keeps, well, uhm, track, of the available pieces of the file, and every client reports in every time has got, or failed to get, a piece. So, using BitTorrent to distribute RSS feeds won't work, because the tracker will take up as much bandwidth, if not more, as a HTTP request, resulting in the "Not changed since your version" request.
Apart from that, well, yes, BitTorrent is great to distribute large files
Re:BitTorrent is no-go for small files.. (Score:2, Informative)
So you send out a new torrent through RSS referencing your new page instead of the regular RSS content, and your viewers use BitTorrent to work together to get the content from you without putting all the strain on your server. A .torrent file would be a lot smaller than a full RSS feed with images like he was using in the example.
Makes more s
Re:BitTorrent is no-go for small files.. (Score:2)
Re:BitTorrent is no-go for small files.. (Score:1)
Re:BitTorrent is no-go for small files.. (Score:2)
The difference between "reports in when it gets, or fails to get, a piece" (what he said) and "percentage complete that the client reports" (what you said) is insigificant, and his conclusion is perfectly valid.
A normal RSS download is not really more work to serve than to maintain a tracker. (Less, actually, because the HTTP server code is probably more optimized than the BT tracker)
Merely openin
If he wants to save bandwidth. (Score:2, Interesting)
but i guess bumming off bittorrent/p2p bandwidth is not a bad idea either.
Re:If he wants to save bandwidth. (Score:4, Informative)
Akamai is great for offloading bandwidth and speeding up customer's page load times, but you're paying for the bandwidth one way or another.
Re:If he wants to save bandwidth. (Score:1)
(yes, I placed that decimal correctly)
S
Re:If he wants to save bandwidth. (Score:2)
It's not cheap, but for us it was cheaper than addin
Re:If he wants to save bandwidth. (Score:2)
Fun stuff. Good riddance, I say.
S
Konspire2b (Score:5, Informative)
RSS + BT = USENET + NNTP (Score:1, Insightful)
Re:RSS + BT = USENET + NNTP (Score:3, Insightful)
Does your usenet reader serve news articles to other users?
No, you need a costly usenet servers architecture. Not only machines, but also huuuge bandwith. Today's usenet servers that want to serve large portion of world hierarchies can only get it via dedicated satellite usenet-only feeds.
RSS+BT on the other hand is poor server and rich clients that exchange articles between themselves via p2p network only supervised by a BT tracker.
Robert
Re:RSS + BT = USENET + NNTP (Score:2)
Things RSS has been struggling with like character encoding, attachments (enclosures), scaling, and other issues are things that NNTP solved long ago.
RSS + Torrent would be an excellent replacement for binaries newsgroups though.
Re:RSS + BT = USENET + NNTP (Score:4, Insightful)
Yes: the way people traditionally read USENET news is by becoming a USENET node, downloading articles to the directory hierarchy of the local machine, and then redistributing them to neighboring sites. Reading news by connecting to centralized news servers via a network client happened many years later.
No, you need a costly usenet servers architecture.
There is nothing intrinsically "costly" about it: it's something a PDP-11 used to handle and that regularly ran over dial-up.
Not only machines, but also huuuge bandwith. Today's usenet servers that want to serve large portion of world hierarchies can only get it via dedicated satellite usenet-only feeds.
Just like a BT solution, you only redistribute those articles that you yourself are interested in.
The reason why we got a USENET infrastructure with a small number of backbone sites (compared to the readership) that carried everything is simply because a bunch of sites took on that role and carry everything. There is nothing in the protocol or design of USENET that requires it.
RSS+BT on the other hand is poor server and rich clients that exchange articles between themselves via p2p network only supervised by a BT tracker.
And you believe that BT and the BT tracker scales up to many billions of files on millions of nodes by sheer magic? BT would probably need a lot of work to scale up. And at least USENET doesn't need any supervision by anything--it's completely asynchronous and unsupervised.
Note that I did not claim that USENET would work any better than RSS+BT--I have no idea whether it would--simply that people are basically reinventing USENET when they combine RSS and BT.
I actually suspect that there are intrinsic properties of large peer-to-peer news networks that people don't like because that's why USENET became more and more centralized over the years.
What morron modded parent as insightful?
That's what I would ask about your posting. In fact, I would ask what moron wrote it.
Re:RSS + BT = USENET + NNTP (Score:1, Funny)
bttracker: default port 80 (though its often on a higher port)
rss: just an xml document on a webserver, default port 80
there probably is a way to proxy the peer to peer connection to port 80 as well.
in the future everything will be on port 80, and the OS will have everything built into it. and The Windows OS will drop the name 'Windows', much like people seem to forget The MSSQL Server isn't the same The SQL Server. the file system will be just a database, the os
Re:RSS + BT = USENET + NNTP (Score:2)
why not nntp for syndication? (Score:5, Interesting)
Re:why not nntp for syndication? (Score:1)
Re:why not nntp for syndication? (Score:2)
Re:why not nntp for syndication? (Score:1)
Sounds like a perfect application for PGP/GPG. It'll guarantee that the person you think wrote it did, or not, and whether that content has been modified at all.
Re:why not nntp for syndication? (Score:2)
Pushing checksums via the web does indeed reduce bandwidth in a best-case scenario, but if someone floods the newsgroup with fake updates all the aggregators will slam the website like mad trying (and failing) to verify the MD5 hash.
Re:why not nntp for syndication? (Score:1)
IRC (Score:4, Interesting)
fidonet (Score:5, Interesting)
The key to usefulness however, is enabling technology to prioritize and authenticate the RSS feeds in some way.
Re:fidonet (Score:3, Interesting)
The idea goes like this:
If you want to host a RSS feed, you run a program that is basically a peer cache. People hit your IP and "subscribe" to the feed. You give them a list of other subscribers' IPs and the public key for the feed. The client then hits these peers and checks to see who has faster bandwidth. If the pe
Whoah. </keanu> (Score:5, Informative)
Anyway, FidoNet was not without its share of problems. The killing bullet, I'd say today, was the social factor - there were too conservative forces clinging to backwards compatibility at the cost of anything. Anything had to work with the most basic piece of software; this effectively shot progress and evolution dead.
Not that there weren't attempts. There were. They just weren't successful.
Anyway, setting up echoes would have the same problems as FidoNet echoes. The number one problem was typical for Slashdot: DUPES!
Echoes were set up so that one node relayed a message in an echomail forum to its other connected nodes for a particular echo, effectively creating a star topology, different for each forum. However, since each sysop just wanted the echo linked, he would just hook up to somewhere, and forget about it. Then, others would hook up from him, and all of a sudden somebody had hooked up to two different valid uplinks.
The result? The star topology all of a sudden had a loop in it. Messages would keep circling (since FidoNet used dedicated dialup lines, latency between nodes was typically in the hours range) and dupe filters were created.
All of those filters and filter-enabling tags were optional, of course. After all, you couldn't mandate an operational node to change its behavior, you could just ask nicely.
Political play to no ends.
Anyway, there were many other funny effects with EchoMail. Crosslinking was another - when one echo got linked to another at a node, so that all messages in echo X would enter echo Y at that node and vice versa. The most exotic of these was when a religious echo got crosslinked with a fantasy humor one -- through crosslinked physical directories at a node (the FAT pointers for the different directories hosting the two echoes pointed to the same location on the disk). Anyway, much hilarious discussion ensued, and not many understood much what people were trying to say in the crosslinked echo.
/ former sysop and NEC in FidoNet
Re:Whoah. (Score:1)
Re:Whoah. (Score:2)
Content management system ? (Score:5, Interesting)
Sounds more like a (possibly improved) content delivery system.
Too bad the article didn't indicate anything about content management.
That was weird. (Score:1, Funny)
Like I said, weird.
WebTorrent (Score:4, Insightful)
The drawback of the WebTorrent idea is that you need some way to group all the images, text and stylesheets together, otherwise you have to make a n inefficient P2P request for each one. RSS is a great way of doing that.
There aren't many details online at the moment of the work we did on the WebTorrent idea; it was mainly an e-mail thread -- get in touch if you'd like details. The project page [seldo.com] is available, but I stopped updating it so it doesn't have all the work that was eventually done.
Re:WebTorrent (Score:2)
Re:WebTorrent (Score:2)
Re:WebTorrent (Score:1)
Which could be simpler than one might think:
Before satisfying a request for the URL "http://www.mysite.com/a/b/foo.jpg" the "hard wa
Re:WebTorrent (Score:3, Insightful)
Re:WebTorrent (Score:2)
A lot of your summary seems already available in HTTP, if crudely. HTTP can:
Re:WebTorrent (Score:1)
Next logical step... (Score:3, Interesting)
These problems are not insurmountable, but they are not insignificant, either. Thus, I don't think that RSS+BT is the instant-gratification, no-risk paradise that the Yahoo article makes it out to be.
Re:Next logical step... (Score:2)
You don't need to crack anything, just download the source and re-write it best you want... bittorrent is kinda open source.
Furthermore, corrupted files being sent out is already happening today, which is why some clients (I use the shadow bittorrent client) have an option of banning any user that sends corrupted data. Every now and then, my client will have banned a user or two, so this is already happenin
Re:Next logical step... (Score:2)
A notification server protocol? cf. NTP/NNTP/SIP (Score:3, Interesting)
A) Sounds nice, but even without a torrent, using an open source hash algorithm (client and server agree on how to calculate the hash) would provide a way for the client to only download the hash value itself to check for freshness.
This way,
1 the author knows how many people have consumed the data and their general geographic distribution.
2 the author can make a decision to stop publication, which problematic but at any rate easier to enforce than if he or she starts out authorizing a torrent.
3 the author is free to pay for bandwidth if he or she will happily serve one per user just not a zillion per poller.
B) To be sure, it is easy to see who publishes an RSS feed / incites a Torrent download over somebody's infrastructure, whereas it is not so easy to discover the identity of an anonymous coward. You could also publish a pseudo-RSS feed itself exclusively over the torrent network using sequential filenames for more anonymity maybe..um.
C) Personally I have a current need for frequently updated RSS for a certain application and I'd set up a server that my internal network clients would poll frequently. But I'd still need for one machine to know the instant things change on the web too.
D) I'm wondering if a hierarchical network of servers might be useful here to publish event notifications. UDP is lossy, and we don't want to lose any events so that's out I guess. In NTP, various strata of time servers are used and clients try to sync to Greenwich time (light data) by the best route available. In NNTP, a client usually uses only one news server to get a fat feed, and different server owners often choose to handle only a subset of what's available in the whole world, which might also be the case (try serving every event of importance to someone in the world.. what is the bandwidth needed for that? How many bits to describe it in ip-like dot format?)
Probably there is another service that does what I'd like and it just flew out my left ear, but it just seemed to me that the best thing would be to combine the lightweight NTP network which lets clients synchronize their understanding of time despite general flakiness, and the NNTP network which allows different servers to decide to serve only certain segments of the worldwide aggregated feed.
And SIP does a lot of things that might be useful. And there is MDS (metacomputing directory service for the "semantic grid" - pdf [ic.ac.uk] / google's html [216.239.57.104]). And there's Jini ..
Anyway we do want to know some things with at least one minute resolution. (A storm watch? A news headline so we can turn on the TV or video stream?) At TV stations I know people constantly are watching the TV out of the corner of their eye to see if something earth-shattering comes up. How about a chime to tell you to look instead? How else to people get First Post? ;) I'd just like to beat normal notification systems for current events and website updates, for starters, based on a relatively robust and timely mechanism.
Maybe a low bandwidth network with some of these characteristics would be useful to distribute update event notifications that filter down to everyone's devces. A big company could have one or two machines consuming a global event notification thread, add events only it knows about, and serve this information on a push or pull basis to all its employees. Hmmm, tasty. Come to think of it I want something like that for another project too, Does anything already do this?
One interesting paper (2001) I found is on an emergency notification network based on subscribe/notify messages over SIP, a widespread voice over ip prot
Why not use pgp-style digital signatures? (Score:2, Interesting)
(Note I don't know details of how BT works, just general idea - fell free to take this idea and run with it however it makes more sense.)
I like the notion of this happening at the web-server level, which allows it to be generalized to other forms of content distrib
Whatever happened to usenet? (Score:2)
Aggregator = news reader
Bittorrent = RAR+PAR binaries
And best of all.. no polling! Well, between usenet servers it's mostly a broadcast kind of affair these days..
Has anyone made a rss2nntp bot yet?
Of course, IRC is also a remarkably cool medium for timely distribution of small ASCII messages.. The nick/channel bullshit sucks (though usenet "channel"/group takeovers/spams suck even more), but surely it's not beyond the realm of possibilities to build an IRC server that requires people
Re:Whatever happened to usenet? (Score:1)
modtorrent (Score:2, Insightful)
Re:modtorrent (Score:2, Interesting)
Hi,
They've already started a project for this at http://mod-torrent.sourceforge.net/ [sourceforge.net]
RegardselFarto
jabber with pub-sub (Score:1)
A better solution is IP multicasting (Score:3, Insightful)
A better solution would be eliminating the need for syndicators to constnantly poll waiting for RSS updates by using IP multicasting to notify syndicators of when the content of a particular RSS feed has changed. Multicast protocols which provide such announcements already exist, such as the Session Announcement Protocol [faqs.org] which would notify those curious of updated RSS feeds. A URL to the updated feed would be provided, and afterwards whatever file transfer protocol you wish could be used to fetch the RSS feed itself, even BitTorrent.
Gillmor has it backwards (Score:1)
The real potential of combining BT & RSS is in the reverse: use RSS to distribute
Re:Gillmor has it backwards (Score:2)
Re:To me, (Score:3, Funny)
Re:Stupid article (Score:2, Interesting)