User Journal

goldfndr's Journal: Google filetype:txt has <HTML> entities?!?

Lately, I've noticed that Google results for plain-text (.txt) files frequently show what looks like raw HTML in the page title and excerpt. The raw HTML also shows up in the Cached link, even though it arguably shouldn't be there.

For example, searching for google filetype:txt brings up several such entries.

In particular, the search site:interconnected.org google pyra filetype:txt returns two text files from around the same time but with different formatting.

I hate to compare Google with Microsoft, but it's somewhat reminiscent of Microsoft Internet Explorer claiming that a zero-length (empty) page has html content when you View Source.

Any idea what's causing this? Perhaps they're still debugging it and that's why they don't list it on their File Format pulldown in the Advanced Search?

