Follow Slashdot blog updates by subscribing to our blog RSS feed

U.S. House of Representatives Makes Resolutions in XML 164

Posted by timothy on Thursday July 04, 2002 @02:28PM from the about-time dept.

RennieScum writes: "The House of Representatives is turning to technology with their test of XML for use with resolutions according to this article. It reports that the HR has made 100 DTDs and uses Microsoft Word and a special converter to do the job. Testing has begun and their goal is to start using it in January of next year. See also http://xml.house.gov/ And it looks like the DTDs will be free to use and distribute!"

This discussion has been archived. No new comments can be posted.

U.S. House of Representatives Makes Resolutions in XML

Load All Comments

Search 164 Comments Log In/Create an Account

Comments Filter:

Yee haw! Crappy laws in better format! (Score:2)

by shodson ( 179450 ) writes:

Now we can all make our own crappy laws using XML! More downloads for Xerces.
Was there any doubt they wouldn't be free? (Score:1)

by smcavoy ( 114157 ) writes:

If the government creates something original for it's use how can there be any arguement as to if it should be availible to the people..? (top secret, national security stuff aside)??
- Re:Was there any doubt they wouldn't be free? (Score:2)
  
  by Ivan Raikov ( 521143 ) writes:
  
  If the government creates something original for it's use how can there be any arguement as to if it should be availible to the people..?
  
  Considering the current government's flirtations with Big Business (not to be confused with Big Brother), I'm actually surprised that they didn't just publish their bills as Word documents.
  
  And looking at the XML documents, it does appear that they're using some non-W3C, Microsoft-like XML stylesheet format. I'd argue that this is favoring one commercial product (Internet Exploder) at the expense of all others.
  - Re:Was there any doubt they wouldn't be free? (Score:2, Informative)
    
    by feronti ( 413011 ) writes:
    
    Um, did you read the source? Or did you just open it up in IE? Because the source is clean (though not prettily formatted:), pure, 100% XML. In fact, there's only one namespace declaration in the entire thing (XLink, which they use to embed hyperlinks between various parts of the documents). All in all, this is some of the cleanest XML I've ever seen (including XML I've written myself by hand:)
    
    But if you opened it up in IE, IE applies a stylesheet to all xml documents which gives you a nice collapsible view of the document tree (which is often easier to read than the source:)
Ugh. DTDs?!? (Score:2, Insightful)

by Aquaman616 ( 131268 ) writes:

I guess that's the government for ya... why in the *hell* would you use DTDs when XML Schemas are so much better???

Oh well... at least it's a step forward - I'll applaud them for that.
- They DO use schemas... (Score:2)
  
  by jaaron ( 551839 ) writes:
  
  Check out the source for http://xml.house.gov/Members/mbr107.xml [house.gov] and then the corresponding schema: http://xml.house.gov/Members/member-schema.xml [house.gov]
  - Re:They DO use schemas... (Score:2)
    
    by jaaron ( 551839 ) writes:
    
    Good point, I didn't notice that when I first posted. Still though, they're using namespaces which isn't part of the DTD definition. So the issue isn't that they're using outdated technology, it's that they're using proprietary extentions.
- Re:Ugh. DTDs?!? (Score:1, Insightful)
  
  by Anonymous Coward writes:
  
  Well, DTDs are just a less-expressive form of Schemas, correct?
  
  Why couldn't you just take all of their DTDs and rewrite them as schemas? You could then donate that back to them, and i'm sure they'd be happy to offer it as a download option.
  
  Hell, maybe someone could make an XSL stylesheet to turn DTDs into schemas :)
  
  -- super ugly ultraman
- Schema war is not over...W3C XML-Schema is bloated (Score:3, Insightful)
  
  by ClarkEvans ( 102211 ) writes:
  
  Why use DTDs?
  
  Have you ever tried to use XML Schema? It's a bloated peice of ****. Relax is tons better. And for the government's purposes, DTDs work much better and are an ISO standard.
  - Re:Schema war is not over...W3C XML-Schema is bloa (Score:1)
    
    by malakai ( 136531 ) writes:
    
    Using XML to describe XML simply makes sense. DTD's are antiquated, and I can't even transform against them for meta-meta-data tasks.
    - Re:Schema war is not over...W3C XML-Schema is bloa (Score:2)
      
      by ClarkEvans ( 102211 ) writes:
      
      Using XML to describe XML simply makes sense.
      
      In this case RELAX [oasis-open.org] is far superior, it has both an XML and a non-XML represenatation and is build on top of a clean model by some brilliant fellas.
      
      XML Schema, OTOH, is just a bloated mess.
      
      DTD's are antiquated
      
      Perhaps, but they are readable. XML Schema is anything but readable.
      
      and I can't even transform against them for meta-meta-data tasks
      
      Oh, now that's something you do every day. Using XML syntax for everything is just plain stupid. IF you have to do transforms, use RELAX, it has a cleaner model anyway... doing transforms on XML Schema is like pulling teeth.
DTD is sooo 1999. (Score:3, Insightful)

by km790816 ( 78280 ) writes: <wqhq3gx02NO@SPAMsneakemail.com> on Thursday July 04, 2002 @02:32PM (#3823148)

This is the government for you.

When every tool under the sun is using XML schemas, the House is announcing their support for DTDs.

I guess it's still a step forward.

Share
twitter facebook
- Re:DTD is sooo 1999. (Score:2)
  
  by ftobin ( 48814 ) writes:
  
  When every tool under the sun is using XML schemas, the House is announcing their support for DTDs.
  
  Jeezus, why would you even consider using Schemas when there is there is Relax-NG [thaiopensource.com], a much better, simply, and based on theory system. Note the author of that document I gave; it's James Clark; if you are using an XML parser, chances are good it was written by him (expat). Heck, there is not even any normative spec for XML-Scheme!
- DTD may be old (Score:1)
  
  by bsDaemon ( 87307 ) writes:
  
  But so is the constitution and noone much complains about upgrading that to version 2.0
  - Re:DTD may be old (Score:2)
    
    by foniksonik ( 573572 ) writes:
    
    Well that is if you don't count the Bill of Rights and the rest of the AMENDMENTS to the Constitution.
    
    Seems to me like it's been at 2.0 RC X.x for quite some time.
- Re:DTD is sooo 1999. (Score:5, Insightful)
  
  by SirSlud ( 67381 ) writes: on Thursday July 04, 2002 @03:41PM (#3823438) Homepage
  
  Your government must make an attempt to stick to standards when they are dealing with accessibility. They have to use technologies that have had some time to settle. By virtue of you pointing out that DTDs are 3 years old and you consider them obsolete, you reinforce the point that by selecting bleeding-edge formats/technologies/etc, they might be investing time and some of your money into something that wont be around in a year or two.
  
  And then in a year or two, you'd just complain how the government cant choose their technologies right.
  
  Start thinking about where you're getting this 'government is stupid/terrible/lazy/blah/blah' message from - alot of it is from private interests that enjoy the freedom and lack of public accountability to select their technological infrastructure based on higher demoninators than your government should. While the 'saavy' factor will always be higher in the private sector, dont *always* take this as an indication that government must be technologically inept (although, like anybody who's core competancy isn't technology, they frequently are) ... often they are doing something much smarter than private interests give them credit for. All of this is moot, of course, when discussing moves the government makes on _behalf_ of powerful private interests, but thats another argument and does not apply in this situation.
  
  It's like being a private teacher vs public. Private teachers can probably be more 'progressive', but at the cost of maybe teaching in ways that might soon be proven to be ineffectual or bad, while public systems generally must move slower in order to ensure that the ideas have been vetted and that everyone has a moderately equal opportunity to access the fruits of the system.
  
  Like parents, sysadmins, anybody who has an onus to cater to the greater good rather than the richer good, sometimes you have to make decisions that are going to be publicly derided even if its for the common good. Sometimes you have to just give the benifit of the doubt, though I realize this kind of attitude is in short supply these days.
  
  Ok, rant off.
  
  Parent Share
  twitter facebook
  - MOD PARENT UP (Score:1)
    
    by GeckoX ( 259575 ) writes:
    
    A rare piece of insight indeed.
    Listen up kiddies.
  - I second that. (Score:1, Redundant)
    
    by Futurepower(R) ( 558542 ) writes:
    
    I agree. Mod parent up.
  - Not the real issue (Score:1)
    
    by Jedi Creed ( 590140 ) writes:
    
    The real problem is that XML itself is too new. DTDs turned out to be too clumsy and limited, so schemas replaced them. What Congress really needs to do is wait 2-5 years for XML to settle down. By jumping in prematurely, Congress is running into pitfalls like the use of DTDs.
    - Re:Not the real issue (Score:1)
      
      by Prof.Phreak ( 584152 ) writes:
      
      XML is unlikely to go away (you'll still be able to read XML docs 50 years from now, even if basic formats like JPEG, etc., are totally replaced).
      
      Not to mention in case of any major changes, it doesn't take long to create an XSLT script to convert your XML into anything.
- - Even HTML would be a HUGE improvemt (Score:2)
    
    by ahfoo ( 223186 ) writes:
    
    --aything with links is essential to reforming legal texts into something useful. In the US, the laws are written in English. It should be the case that anybody with a high school education could read them and understand them with ease. The main reason lawyers get so involved in anything that has the slightest concern with the law is the twisted textual markup that is currently used makes the documents incomprehensible and extremely difficult to understand in full because of the need to obtain the hundreds of essential external references. This is wonderful news.
    Even the stilted style of language referred to as legalese is partly a product of the need for a meta context within legal writing. This is long overdo, but awesome nonetheless.
Uhhh.... (Score:4, Interesting)

by Verizon Guy ( 585358 ) writes: on Thursday July 04, 2002 @02:37PM (#3823162) Homepage

Going to http://xml.house.gov/Members/mbr107.xml [house.gov] renders a perfectly viewable directory of representatives in Internet Explorer, but Mozilla dumps it all as raw text in one giant paragraph. What gives?!?

Share
twitter facebook
- Re:Uhhh.... (Score:2)
  
  by josh crawley ( 537561 ) writes:
  
  Maybe because IE supports the xml STANDARD more than mozilla.
  - Re:Uhhh.... (Score:2)
    
    by jaaron ( 551839 ) writes:
    
    No, it's because of the way they use the XSL stylesheet. IE does not support the XML "standard" any more than Mozilla. Quit posting FUD.
- Re:Uhhh.... (Score:2)
  
  by llamalicious ( 448215 ) writes:
  
  <?xml version="1.0" encoding="utf-8"?>
  <?xml:stylesheet type="text/xsl" href="member-sorter-vb.xsl"?>
  <?xm-well_formed path="m:\xmltech\billres1\00-11-01\Members\mbr107. dtd"?>
  <ushousemembers xmlns="x-schema:member-schema.xml">
- Stylesheet issues... (Score:5, Informative)
  
  by jaaron ( 551839 ) writes: on Thursday July 04, 2002 @02:44PM (#3823198) Homepage
  
  It's because of the XSL style sheet they use. You can find it at http://xml.house.gov/Members/member-sorter-vb.xsl [house.gov]. (Use view source to see the actual XSLT). Notice that they use VBScript!
  
  Parent Share
  twitter facebook
- Re:Uhhh.... (Score:1)
  
  by evalhalla ( 581819 ) writes:
  
  I think that's because IE uses a default stylesheet for xml documents, while Mozilla strictly complies to the standard and just shows the contents of the tags, without any style.
- Re:Uhhh.... (Score:5, Informative)
  
  by MiTEG ( 234467 ) writes: on Thursday July 04, 2002 @02:51PM (#3823232) Homepage Journal
  
  It's all screwed up with Opera 6.01 also.
  
  Parent Share
  twitter facebook
- It's the XSLT (Score:1, Informative)
  
  by Anonymous Coward writes:
  
  in the second line of the xml:
  <?xml:stylesheet type="text/xsl" href="member-sorter-vb.xsl"?>
  in the 6th line of the above-referenced xsl document being used to transform the xml:
  <xsl:stylesheet xmlns:xsl="http://www.w3.org/TR/WD-xsl" language="VBScript">
  basically, they're using the MSXML parser to do their XSLT on the client-side. I've been working with this stuff for a while, and there are a lot of advantages to doing this. The MSXML parser is a lot more mature & well documented than whatever comes built into NS6 & Mozilla(if you know better, please point me to some good resources for working with client-side XSLT on these browsers-- i've looked everywhere).
  But it seems to me that public accessibility to to these documents should preclude this, and demand that the parsing be done on the server-side.
  Beyond that, the fact that they're using VBScript instead of JavaScript for their scripting is indicative of the fact that the people in charge of this initiative are hardcore MS-Heads -- ther's no reason for it, you can do some extremely complex stuff with the MSXML parser and JavaScript.
  I know this is paranoid, but my past experience has been that even people inside MS use JScript if they can avoid VBScript... unless they're forced to use it for marketing reasons. Wonder who's in charge of this initiative.
  - Re:It's the XSLT (Score:1)
    
    by Verizon Guy ( 585358 ) writes:
    
    I know this is paranoid, but my past experience has been that even people inside MS use JScript if they can avoid VBScript... unless they're forced to use it for marketing reasons. Wonder who's in charge of this initiative.
    
    IIRC, the ASP pages on microsoft.com use JScript; VBScript is great because if you know VB, you can learn VBScript in an hour.
  - Re:It's the XSLT (Score:2)
    
    by Abcd1234 ( 188840 ) writes:
    
    Ummm... what about Transformiix? That would be the Mozilla XSLT engine, which is built right into Moz 1.0. Check out the project website here [mozilla.org].
- Re:Uhhh.... (Score:2, Informative)
  
  by perlfool ( 102637 ) writes:
  
  The main reason it doesn't render in Mozilla is they used an old XSLT Working draft namespace "http://www.w3.org/TR/WD-xsl". The XLST 1.0 namespace should be: "http://www.w3.org/1999/XSL/Transform"
  See Unofficial MSXML XSLT FAQ" [netcrucible.com] for some info about the old Working Draft, XSLT 1.0 and Internet Explorer.
- I get this in Netscape 7 Preview: (Score:2)
  
  by ImaLamer ( 260199 ) writes:
  
  I get seperate paragraphs (yet mashed together), yet I can paste the data to notepad or this text box and it looks even worse.
  
  I can't post it because of this error:
  
  Your comment has too few characters per line (currently 6.2)
  - Check this with IE though: (Score:2)
    
    by ImaLamer ( 260199 ) writes:
    
    http://xml.house.gov/hr100_eh.xml
    http://xml.hous e.gov/hr6_ath.xml
    http://xml.house.gov/hr10.xml
    
    all just code
    - Re:Check this with IE though: (Score:1)
      
      by Verizon Guy ( 585358 ) writes:
      
      At least IE has the decency to delimit and color code it in a collapsible tree, unlike moz which mashes it all together.
      - Re:Check this with IE though: (Score:2)
        
        by ImaLamer ( 260199 ) writes:
        
        No, IE shows code which is just ghey.... who wants to go surfing the net reading HTML the whole time?
- - Re:Just use IE6 (Score:2)
    
    by DunbarTheInept ( 764 ) writes:
    
    Why not just use IE? Because it only works if you are using a shitty Operating System underneath it, and the OS you use affects a lot more stuff than just your web browser. There are reasons completely unrelated to web browsing that make me want to be running Linux most of the time except for the occasional game. I think that this is the primary reason for the IE hostility a lot of geeks have. To use it we have to dumb-down *everything* we use (which is what happens it feels like to use Windows after being used to using Unix), just to get a particular web browser. If I.E. was produced by a company other than the one that has a vested interest in keeping the Windows monopoly in place, it wouldn't be a problem because they would make a Linux version.
    - Re:Just use IE6 (Score:1)
      
      by Verizon Guy ( 585358 ) writes:
      
      Please go suck a putrid, herpes-infected dick you zealot cocksucker.
      
      IE runs on real Unixes, like Solaris and HP-UX. Grow some pubes, take a shower, and get a life.
      - Re:Just use IE6 (Score:2)
        
        by DunbarTheInept ( 764 ) writes:
        
        1. The only poeple who give a flying fuck about the fact that linux isn't technically legallt allowed to be called unix are lawyers and trolls like you and that "Rev Don Cool" idiot on usenet.
        
        2. IE support on the few unixen where it does run is awful and the thing is too bloated to be practical (since instead of porting IE to unix APIs they ported parts of the Windows API and put IE on top of that, the executable is gigantic on unix.)
        
        3. You did say "IE 6", which even on the few unixes where IE 6 exists, it doesn't go up to that version number, so clearly you are lying.
        
        Re:Just use IE6 (Score:2)
        
        by DunbarTheInept ( 764 ) writes:
        
        Err, delete that "6" from the second "IE 6". The dangers of cutting and pasting.
  - Re:Just use IE6 (Score:1)
    
    by Verizon Guy ( 585358 ) writes:
    
    I use IE6... =) I was just curious what it would look like in moz... I only use moz for browsing for pr0n... all those tabs!
How Slashdot-like (Score:5, Funny)

by jaaron ( 551839 ) writes: on Thursday July 04, 2002 @02:39PM (#3823178) Homepage

So the government tries to update their use of technology to use an open format like XML and publish the DTD's and inevitably the first 10 slashdot posts complain that the government is too behind the times because that don't use new (and better) XML schemas! Talk about geeks! :)

Share
twitter facebook
- Re:How Slashdot-like (Score:1)
  
  by idletask ( 588926 ) writes:
  
  > the government is too behind the times because that don't use new (and better) XML schemas!
  Well, this is an administration, you know... So actually they can be credited for having been aware of XML at least a year ago. Had they been aware of XML schemas that it'd have taken another 6 months before the site got up, don't you think?
  I'm quite confident that nowadays the average PHB doesn't even know what XML stands for and is used for...
DTDs (Score:2)

by Citizen of Earth ( 569446 ) writes:

It reports that the HR has made 100 DTDs and uses Microsoft Word and a special converter to do the job.

But if they really want an intractible problem, they should use XML/Schema!
Oh Boy! (Score:1, Offtopic)

by rbeattie ( 43187 ) writes:

Free DTDs!!! I LOVE DTDs! Wooohoo! We definitely don't have enough of those already!

And who says a Republican government is only out to help the big guys. Free DTDs for all!

Happy 4th everyone! Damn I'm proud to be an American today. Free DTDs!!

-Russ
- Re:Oh Boy! (Score:1)
  
  by p3d0 ( 42270 ) writes:
  
  Oh man, this is the funniest thing I have read in a while. You almost made me burst out laughing out loud here at work, which would have been very embarrassing...
Lawmakers who don't understand the law (Score:4, Interesting)

by kuroth ( 11147 ) writes: on Thursday July 04, 2002 @02:49PM (#3823224)

From the cited page [house.gov]...

Pursuant to Title 17 Section 105 of the United States Code, these DTDs are not subject to copyright protection and are in the public domain.
...
These DTDs can be redistributed and/or modified freely provided that any derivative works bear some notice that they are derived from it, and any modified versions bear some notice that they have been modified.

Sorry, cupcakes, that's not how the public domain works. If you release it into the public domain, you no longer have *any* control whatsoever upon the modification, reuse, or redistribution of the work. The required notice clause listed above in invalid.

Cite [stanford.edu], cite (#3) [templetons.com], cite [ufl.edu].

Kuroth

Share
twitter facebook
- Re:Lawmakers who don't understand the law (Score:1)
  
  by user32.ExitWindowsEx ( 250475 ) writes:
  
  Well, hearing about the above clause makes me think of the BSD license. Same principle.
  - Re:Lawmakers who don't understand the law (Score:1)
    
    by jordan_a ( 139457 ) writes:
    
    No since the BSD license doesn't say "This is public domain" anywheres in it. Very diffrent principles
  - Re:Lawmakers who don't understand the law (Score:2)
    
    by foniksonik ( 573572 ) writes:
    
    I was thinking GPL myself... public domain with copyright. Wouldn't that be interesting if the US Gov starting using GPL for all documents? Just put it in the metadata and a quick notice at bottom.
    
    hmmm makes me think I want to do that with all my documents. Is there a license attribute for meta-data tags in html... if not I'll make one.
I say... (Score:2)

by numbuscus ( 466708 ) writes:

...even if they are using a what some on this site would consider 'suboptimal' technology, the government's incorporation of ANY technology is better than none at all. Hell, the Senate doesn't allow laptops on the Senate floor! Hopefully, as the 'mainstream' government begins to use more open-standards technology and technology in general, they will be more willing to defend it against M$ and any other company that tries to 'embrace and extend' it.

My $0.02
Example of the new markup (Score:5, Funny)

by crucini ( 98210 ) writes: on Thursday July 04, 2002 @03:00PM (#3823269)

<bill status="proposed" name="CBDTPA">
<sponsor name="Fritz Hollings" constituency="Disney">
<violatesAmendment number="1">
<violatesAmendment number="4">
<contribution donor="Disney" amount="24500.00">
<contribution donor="AOL" amount="33000.00">
<contribution donor="National Association of Broadcasters" amount="25000.00">
<excuse>Promote broadband adoption</excuse>
<excuse>Save the arts from extinction</excuse>
</bill>

Share
twitter facebook
- Re:Example of the new markup (Score:2)
  
  by SirSlud ( 67381 ) writes:
  
  > Save the arts from extinction
  
  Thats the best part! I always hated that excuse, especially considering how insulting it should be to artists.
  
  Stop and think about this - claiming the arts will die if hollywood dies is like saying the habit of breathing oxygen will die if the SCUBA industry goes belly up.
- Indeed, it's not free (Score:3, Informative)
  
  by twitter ( 104583 ) writes:
  
  The mention of M$ Word put me on alert, as have previous stories here which have demostrated that XML will simply be a container for propriatory data formats like M$ Word. Closer examination, however, reveals a much more horrible arangement.
  XML is dependent on unicode, as the US Government site's reference states. Follow the W3C [w3.org] to unicode ,
  Unicode is required by modern standards such as XML, Java, ECMAScript (JavaScript), LDAP, CORBA 3.0, WML, etc., and is the official way to implement ISO/IEC 10646.
  Unicode is owned by Unicode Incorporated [unicode.org] and all of it's documents and standarts are issued under a restrictive license [unicode.org] with a unilaeral change clause:
  Modification by Unicode Unicode shall have the right to modify this Agreement at any time by posting it to this site. The user may not assign any part of this Agreement without Unicodes prior written consent.
  Dare I compare this evil arangement to ASCII and other predecesors? To have IBM, M$, Sun and other OWN the very format your data takes and to be able to change it and break previous implimentations at whim, and YOU may not? Who wants to be a plump nickle that any thing vaugly resembling unicode in the future will be called a "derivative" and it's distribution halted? Is this not a collusion of comercial software vendors to control information at it's most basic representation? Does anyone else here see this as the ultimate extention of copyright? Evil, Evil, Evil.
  I'd rather see the US government continue to publish in the American Standard for Information Interchange. This extensible standard is no standard at all.
  - Why didn't they just use standard HTML? (Score:2)
    
    by moncyb ( 456490 ) writes:
    
    Standard HTML is just as searchable as long as you use the tags properly. One does have to wonder if M$ "encouraged" them to use this format.
    - Re:Why didn't they just use standard HTML? (Score:2)
      
      by Ravagin ( 100668 ) writes:
      
      Why not html? Because they're not just describing text here. There're all sorts of data associated with a piece of legislation, and an extensible - not a hyptertext - markup language is the best way to do it.
      - Re:Why didn't they just use standard HTML? (Score:2)
        
        by moncyb ( 456490 ) writes:
        
        What is this mysterious data that can't be expressed in HTML???? Blipverts [techtv.com]!!!??!!?? Maybe they'll put [w3.org] cartoons [w3.org] into the bill--to help explain why they passed it. Oooo...maybe they can put in complex equations [w3.org] so everyone will think they are smart [imdb.com].
        I think some people just believe XML is some sort of magical file format that should be used no matter what. I expect MPEG 5 will be in XML, then they'll wonder why the files are so much larger and takes 10x the processing time and memory to decode.
        XML may be useful in some places, but not everywhere. Replacing it with binary formats is bad because it will unnecessarily increase the filesize and resources to decode them. Using it for config files will require all programs to run an XML parser and make the config files less human readable. Using it to express laws will just make them inaccessible to the common person by requiring them to have expensive proprietary software (or software made by an illegal monopoly) to even view them.
        If they want bills to be searchable, they should be designing database tables for them, and allow the public to export the database (or subsets of it) in a standard database format. For online viewing, they could easily export the data into HTML (or XML) using PHP.
        Using "Microsoft Word and a special converter to do the job" is just stupid. Creating a program that allows some intern to key the data into the database would probably be easier and more effective in the long run.
        
        Re:Why didn't they just use standard HTML? (Score:2)
        
        by moncyb ( 456490 ) writes:
        
        Oh yeah, just make up some contrived obviously biased answer! Do you make infomercials???? Or maybe you just don't know anything about html.
        The html version of your "example" would probably look more like this:
        <p><a name="para1">(1)</a> blah, blah, blah
        ...and for you information, browsers already search that way--the paragraph in question can be referenced by appending a #para1 to the document's url.
  - Re:Indeed, it's not free (Score:1)
    
    by Maserati ( 8679 ) writes:
    
    The article mentions WordPerfect as well. And so long as the DTD is available, anything else that reads and writes XML will work fine.
  - Re:Indeed, it's not free (Score:2)
    
    by smallpaul ( 65919 ) writes:
    
    Unicode is owned by Unicode Incorporated [unicode.org] and all of it's documents and standarts are issued under a restrictive license [unicode.org] with a unilaeral change clause:
    
    Have you looked at the copyrights for most standards? Try to get a free copy of the SGML or EDI standards? Unicode is wide open comparitively. Plus, if you're going to complain about vendor-owned consortia, you might as well whine about the W3C itself.
  - Re:Indeed, it's not free (Score:3, Insightful)
    
    by RennieScum ( 118197 ) writes:
    
    Paranoia.
    
    It shows how each line, name and term has an identifying tag, created by exporting the document from a word processor such as Microsoft Word or Corel WordPerfect into a
    
    special XML template
    
    They're usign a *tool* to help convert .doc and .wpd files to XML. They're just leveraging their assets (MSW*rd being an, ahem, asset) so that secretaries and regular folk can do the work of text entry in tools they are familiar with, which then gets converted into a useable format.
    
    Settle down, they're not trying to use MSXML engines to do the work. Sheesh.
- Re:Example of the new markup (Score:2)
  
  by Guppy06 ( 410832 ) writes:
  
  You forgot the default value of "Save the children" in your <excuse> tags there...
- Re:Example of the new markup (Score:2)
  
  by Megane ( 129182 ) writes:
  
  Don't forget the usefulness of the <pork> tag.
- Re:Example of the new markup (Score:1)
  
  by zrodney ( 253699 ) writes:
  
  this is meant as a joke, but it's a good idea!
  
  someone could have a metaserver which puts these
  additional tags into the offical descriptions of
  the bills.
  
  each could have links to the sponsoring groups of
  lobbists or grassroots which in turn could be
  crosslinked to show which bills are obviously
  just kickbacks, and which are really concerned with
  issues.
  - Re:Example of the new markup (Score:3, Interesting)
    
    by danro ( 544913 ) writes:
    
    Neat idea...
    Just write a http proxy that applies an XSLT to the document. Generate the tag-values from the opensecrets.org database (if they have one).
    Could probably be done by one person in a week or two, if opensecrets keep a reasonable usable database, and are willing to cooperate.
    
    If I were an american I would be tempted to write the thing myself...
    It would be great to just go to a website and see all bills with a header that indicated which elected officials was involved, and their voting record and ties to special interests.
    
    Hell, if anyone wants to do this, I am willing to contribute just because it's cool...
They are using WordPerfect Too (Score:3, Informative)

by frank249 ( 100528 ) writes: on Thursday July 04, 2002 @03:06PM (#3823293)

It reports that the HR has made 100 DTDs and uses Microsoft Word and a special converter to do the job.
The article actualy says It shows how each line, name and term has an identifying tag, created by exporting the document from a word processor such as Microsoft Word or Corel WordPerfect into a special XML template.

That would make sense since most of the US government still uses WordPerfect [corel.com]. WordPerfect comes with extensive XML publishing functions including making your own DTDs.

BTW Corel just announced that a new version of Ventura Publisher is coming out in the fall with cross platform XML publishing built in. The next version of WordPerfect is also going to have a much better XML publisher now that they bought XMetaL [corel.com].

Share
twitter facebook
don't even validate (Score:2, Interesting)

by Steve X ( 11964 ) writes:

heh, their XML documents don't even come close to validating. they say it's all beta, but wow, that's impressive. good to know my taxes are being put to good use - high-quality design. i think nsgmls says it best about their design:

value of attribute "regeneration" cannot be "yes"; must be one of "yes-regeneration", "no-regeneration"
save a buck or two (Score:1)

by jrs 1 ( 536357 ) writes:

i think that they could have saved a buck or two by using open office. although, if it's not their money that they're spending, i doubt they care.
- - Re:save a buck or two (Score:1)
    
    by jrs 1 ( 536357 ) writes:
    
    right, but are they aware that open office is free as in software?
DTDs, Schema, and XDR (Score:4, Informative)

by jaaron ( 551839 ) writes: on Thursday July 04, 2002 @03:11PM (#3823323) Homepage
Actually, if you check the source, you'll see that they are using XML namespaces and schemas. Actually, they're using something called XDR (XML-Data-Reduced) which was developed by Microsoft and is upwards compatable with XML schema. I'm familiar with schema but not XDR. For more information, you may want to check out these links:
- http://www.schemavalid.com/faq/xml-schema.html#a4 [schemavalid.com]
- http://www.netcrucible.com/xslt/msxml-faq.htm#Q13 [netcrucible.com]
- http://www.ltg.ed.ac.uk/~ht/XMLData-Reduced.htm [ed.ac.uk]
- http://www.w3.org/TR/1998/NOTE-XML-data/ [w3.org]
And thanks to this poster [slashdot.org] for pointing it out.
Share
twitter facebook
- Re:DTDs, Schema, and XDR (Score:2)
  
  by smallpaul ( 65919 ) writes:
  
  In what sense is XDR "forwards compatible" with XML Schema? In the sense that you can rewrite all of that Microsoft-proprietary stuff into XML Schema if you care to put in the effort?
  - Re:DTDs, Schema, and XDR (Score:2)
    
    by vidarh ( 309115 ) writes:
    
    No, in the sense that there is a publicly available XSL stylesheet that will do the conversion for you. XDR was a stopgap thing for Microsoft to get schema support out the door before the XML schema spec was finished.
    - Re:DTDs, Schema, and XDR (Score:2)
      
      by smallpaul ( 65919 ) writes:
      
      I was speaking with a Microsoft employee on the Schema team today. He reacted in horror to the view that XDR is "upwards compatible" with XML Schema.
- Re:DTDs, Schema, and XDR (Score:2)
  
  by deblau ( 68023 ) writes:
  
  Just so no one is confused, that's Microsoft's XDR, not the real XDR [faqs.org].
Great! (Score:5, Funny)

by Rombuu ( 22914 ) writes: on Thursday July 04, 2002 @03:12PM (#3823324)

And it looks like the DTDs will be free to use and distribute!

Great, now I can make my own crazy laws! Yipee!

Share
twitter facebook
- Re:Great! (Score:2)
  
  by mdemeny ( 35326 ) writes:
  
  Great, now I can make my own crazy laws! Yipee!
  Actually it's so that lobbyists [riaa.org] can make their own crazy laws [eff.org]. Yipee, indeed.
  - Re:Great! (Score:1, Funny)
    
    by Anonymous Coward writes:
    
    No problem. My crazy law is that no-one (especially not the RIAA) can make crazy laws except me.
Another Use for Microsoft crap (Score:3, Insightful)

by codeguy007 ( 179016 ) writes: on Thursday July 04, 2002 @03:18PM (#3823360)

I thought the US Government was starting to learn that Microsoft software was to be avoided. By finding more uses for it, I am afraid that it is obviously not true.

Share
twitter facebook
- Re:Another Use for Microsoft crap (Score:3, Funny)
  
  by DunbarTheInept ( 764 ) writes:
  
  Yes, they are using MS software, but this once they are using it to export things into a well documented, open format that could be made to work with anything (unlike a Word document). Sure, maybe different browsers aren't good at reading the XML the government is putting it out in the way that makes IE most comfortable, but at least it is in a DOCUMENTED format this time, one that the open source community can respond to and implement fairly quickly if there's incentive to (and I think having all major US government stuff in that format would be a big enough incentive.)
  
  Is it still biased in favor of IE users right now? Absolutely, I won't deny that. But if it is actually a properly documented format for once then that bias won't last. This isn't a perfect situation, but it's a major step up from publishing things in proprietary binary word processor formats like they did in the past.
What part about public domain don't they get? (Score:5, Insightful)

by ClarkEvans ( 102211 ) writes: on Thursday July 04, 2002 @03:19PM (#3823364) Homepage

Dig the notice at xml.house.gov -- The document type definitions (DTDs) presented on this site were developed at the U.S. House of Representatives by employees of the Federal Government in the course of their official duties. Pursuant to Title 17 Section 105 of the United States Code, these DTDs are not subject to copyright protection and are in the public domain. These DTDs are in draft form. The U.S. House of Representatives assumes no responsibility whatsoever for their use by other parties, and makes no guarantees, expressed or implied, about their quality, reliability, or any other characteristic. These DTDs can be redistributed and/or modified freely provided that any derivative works bear some notice that they are derived from it, and any modified versions bear some notice that they have been modified. (emphasis mine)

Either these DTDs are copyrighted and they can place restrictions upon distribution or they arn't. This need people have to control everything is just driving me crazy. The whole reason for Title 17 Section 105 is so that the Government can't put restrictions on this kind of stuff (bills, laws, etc.) ...

Share
twitter facebook
- Finish the job. (Score:1)
  
  by Futurepower(R) ( 558542 ) writes:
  
  Give the parent post that 5th point. He's right.
- Re:What part about public domain don't they get? (Score:2)
  
  by Maserati ( 8679 ) writes:
  
  They can't enforece it, but they can ask, preferrably nicely. I can't think of any reason to steal it and distribute it without attribution (not that someone else couldn't) so I'm not real worried at this point. Besides, stealing from COngress torques them off, they hate the competition.
  - Re:What part about public domain don't they get? (Score:2)
    
    by ClarkEvans ( 102211 ) writes:
    
    I can't think of any reason to steal it and distribute it without attribution (not that someone else couldn't) so I'm not real worried at this point. (emphasis mine)
    
    And how could I possibly steal something that is in the public domain? Just beacuse they wrote it they own it? The framers of the consitution rejected natural-rights thought with regard to intellectual property. Who owns it anyway? The public of the U.S. paid for it, so don't we own it? If I copy it and use it for my own purposes why would this make me a thief?
    
    I think you have fallen into the group-think that the RIAA wants everyone to succumb to.
    - Re:What part about public domain don't they get? (Score:1)
      
      by Maserati ( 8679 ) writes:
      
      Steal it may have been an overbroad statement. If anyone has fallen into the RIAA groupthink it's the HoReps who put the notice in in the first place.
- Re:What part about public domain don't they get? (Score:1)
  
  by spotter ( 5662 ) writes:
  
  IIRC the law only applies in the US (or for US citizens, forget which). Outside the US, they are still copyrighted.
So does this mean... (Score:2, Funny)

by neonzebra ( 33639 ) writes:

.... that the president can use an XSLT to make a bill into law?
ddt free to use? huh??? (Score:3, Insightful)

by CProgrammer98 ( 240351 ) writes: on Thursday July 04, 2002 @03:38PM (#3823430) Homepage

"And it looks like the DTDs will be free to use and distribute"

Ummmmm if you're using a validating xml parser, you HAVE to have access to the dtd!!! All DTDs have to be free to use!

Share
twitter facebook
Happy 4th! (Score:1, Offtopic)

by Pinball Wizard ( 161942 ) writes:

To recognize our great country on its birthday, I present you with an XML representation of the American flag:
<?xml version="1.0" encoding="ISO-8859-1" >
-<Flags>
-<Flag type="American">
<symbol type="Stars">
<count>50</count>
<background>navy</background>
<color>white</color>
</symbol>
<symbol type="Stripes">
<stripeno=1>
<stripeval>Deleware</stripeval>
<color>red</color>
</stripeno>
<stripeno=2>
<stripeval>Pennsylvania</stripeval>
<color>white</color>
</stripeno>
<stripeno=3>
<stripeval>New Jersey</stripeval>
<color>red</color>
</stripeno>
<stripeno=4>4</stripeno>
<stripeval>Georgia</stripeval>
<color>white</color>
</stripeno>
<stripeno=5>
<stripeval>Connecticut</stripeval>
<color>red</color>
</stripeno>
<stripeno=6>
<stripeval>Massachusetts</stripeval>
<color>white</color>
</stripeno>
<stripeno=7>
<stripeval>Maryland</stripeval>
<color>red</color>
</stripeno>
<stripeno=8>
<stripeval>South Carolina</stripeval>
<color>white</color>
</stripeno>
<stripeno=9>
<stripeval>New Hampshire</stripeval>
<color>red</color>
</stripeno>
<stripeno=10>
<stripeval>Virginia</stripeval>
<color>white</color>
</stripeno>
<stripeno=11>
<stripeval>New York</stripeval>
<color>red</color>
</stripeno>
<stripeno=12>
<stripeval>North Carolina</stripeval>
<color>white</color>
</stripeno>
<stripeno=13>
<stripeval>Rhode Island</stripeval>
<color>red</color>
</stripeno>
</symbol>
</flag>
</flags>
Note: I'm from New Mexico, so I know what it feels like when a state gets left out. Rest assurred, my flag includes Deleware!
- Re:Happy 4th! (Score:1)
  
  by csguy314 ( 559705 ) writes:
  
  Jeez, what a way to honour your country...
  by misspelling *Delaware*!
HR has made 100 DTDs (Score:5, Funny)

by Ilan Volow ( 539597 ) writes: on Thursday July 04, 2002 @05:45PM (#3823919) Homepage

Congress has always been full of lyahs and chetahs. That it's now full of schemas is really no surpise.

Share
twitter facebook
yep (Score:1)

by Mikkel_bob ( 538800 ) writes:

And it looks like the DTDs will be free to use and distribute!

No, this doesn't mean you can make your own laws. =P
we need open source software (Score:1)

by Trailer Trash ( 60756 ) writes:

They're already using vb-script in their xsl stylesheet, I can see Microsoft trying to weasel their way in here (or some Microsoft-based consulting company). We need to get some open source software that can be of use to them, and hopefully to state governments as well. Anyone game?
U.S. Senate Responds... (Score:1)

by cburley ( 105664 ) writes:

...by making resolutions in CommonLISP S-expressions.
XML creaps in another place (Score:2)

by thogard ( 43403 ) writes:

Didn't any of the XML supporters every study parsing in their CS classes? Or are they just web control freaks that didn't bother with anything past highschool. Oh wait, I'm talking about w3c so of course they are contorl freaks. At least most people ingored them.

The problem with XML is that it diverges into two dinstict worst cases. One requires and infinite amount of memory, the other and infinite amount of time. Both of these are bad things and much study of algorithms is about avoiding both of these conditions. Odd thing is most people in the IT field today have no clue about why this happens or even that it can happen. Of course these are the same programmers that coudn't describe a quicksort if they had to or descibe something in BNF grammar. And we wonder why most programmers today just produce garbage.
- Re:XML creaps in another place (Score:2)
  
  by vidarh ( 309115 ) writes:
  
  Can you elaborate? I can't see what part of parsing XML you are referring to - parsing XML for the most part seems relatively simple, though I haven't written a complete XML parser or spent the time to read through the complete specification.
  - - Re:XML creaps in another place (Score:2)
      
      by vidarh ( 309115 ) writes:
      
      I don't see the problem. If the closing tag is missing and you are using a Sax parser the only effect is one more scope indicator, and the parser will plod along happily until you try to close the surrounding tag at which point it will know right away that it should give an error.
      Whether it will allow you to try to recover or not at that stage would be up to the parser.
      Recovering from malformed input is regardless a difficult task, and typically you don't want to go there - that's not a parsing issue, but an issue of trying to predict how an error should be recovered.
      For a DOM parser, the parser would do the same thing, and just fail and free the tree once it found the surrounding tag (or the end of the file). However using a DOM parser with a scenario like the one you suggested would be plain stupid.
      In either case, handling a missing closing tag is trivial with XML, and I certainly can't see any justification for the claim that you'd either need unlimited memory or unlimited time based on that
      Anyway, you've just given an example of a case where ANY grammar based on nested blocks will have to have thought put into it when it is fed bad data, with no justification for why it should make XML bad from a parsing standpoint.
      Do you have a better example?
      - Re:XML creaps in another place (Score:2)
        
        by vidarh ( 309115 ) writes:
        
        This is a very different problem from what you suggested in the other message, and is can be just as real with REAL documents.
        So what you are really saying is that your problem is with ANY system that allow scoping, and where state is required for each scope until the scope is closed?
        The problem with that is that scoping is useful and makes it a lot easier to represent a whole lot of data in a structured form that seems natural to humans.
        In other words, an XML parser may require more resources than a parser for a grammar without scoping. But the scoping is allowed for a reason - it provides structure that is hard to provide without it.
        The reason you can't make a file that breaks grep is that grep doesn't care about structure. You can easily work on XML files withouth running into the problem as well if you ignore structure. But then you are also losing a whole lot of advantages.
        I still don't see this as a problem. You need to handle resource limits regardless. If you have 1MB available, as you originally used in your example, then when you have used that 1MB then you have to fail gracefully. If the only case where you use the whole 1MB is a broken document, then whether you fail because the parser detects it or fail because you don't have more memory is irellevant - the parse failed.
        If you need to give more specific error messages, you can do that fairly easily, by, when you've filled memory scanning the remainder of the document to determine whether any of the outer tags will EVER get closed.
        If you want to recover from unclosed tags, the standard way of doing that for HTML and XML is to define which start tags you want to autoclose which types of open tags for.
        This is a straightforward mechanism that works well, in particular in the presence of a schema or DTD where you can easily determine where leaving a tag open means the document is malformed where it may possibly be wellformed if the tag is closed.
        I haven't implemented it for XML, but I have implemented in an HTML filter that needed to handle particularly broken HTML.
        In the real world this is a problem only if you don't think about it and design your software to handle it, just as not thinking your design through in general leads to broken software.
        
        Re:XML creaps in another place (Score:2)
        
        by vidarh ( 309115 ) writes:
        
        The scoping issue and the stack depth issue are the same, and the solutions I described are solutions in common use.
        And I'm used to dealing with users on the input side. The company I work for operate the .name TLD. Registrars interact with us via XML. Our subcontractors interact with us via XML. We're dealing with far from perfect XML and errors needs to be communicated.
        We did use to have an ASCII based format, and we had more problems with that. The advantage of XML is that the users can validate the XML generated pretty well on their side by running it through an XML parser with schema validation support.
- Re:And now ged rid of the legacy (Score:1)
  
  by cifey ( 583942 ) writes:
  
  Schemas can be developed with backwards compatability to the dtd's. When implemented they would just find errors in the existing documents to be corrected.
- The Importance of DTDs (Score:2)
  
  by The Monster ( 227884 ) writes:
  
  DTDs are obselete by now
  
  They may not be bleeding edge, but what's important about this is that the House is making
  a commitment to open data formats. Even where we don't get open source code, this guar-
  antees that we don't get the most virulent form of 'vendor lock-in', where failure to pay the
  latest rent increase means we can't even access our own data [slashdot.org] anymore.
  ---
  Fight Page Widening! Make your own line <br>:reaks.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

Yee haw! Crappy laws in better format! (Score:2)

Was there any doubt they wouldn't be free? (Score:1)

Re:Was there any doubt they wouldn't be free? (Score:2)

Re:Was there any doubt they wouldn't be free? (Score:2, Informative)

Ugh. DTDs?!? (Score:2, Insightful)

They DO use schemas... (Score:2)

Re:They DO use schemas... (Score:2)

Re:Ugh. DTDs?!? (Score:1, Insightful)

Schema war is not over...W3C XML-Schema is bloated (Score:3, Insightful)

Re:Schema war is not over...W3C XML-Schema is bloa (Score:1)

Re:Schema war is not over...W3C XML-Schema is bloa (Score:2)

DTD is sooo 1999. (Score:3, Insightful)

Re:DTD is sooo 1999. (Score:2)

DTD may be old (Score:1)

Re:DTD may be old (Score:2)

Re:DTD is sooo 1999. (Score:5, Insightful)

MOD PARENT UP (Score:1)

I second that. (Score:1, Redundant)

Not the real issue (Score:1)

Re:Not the real issue (Score:1)

Even HTML would be a HUGE improvemt (Score:2)

Uhhh.... (Score:4, Interesting)

Re:Uhhh.... (Score:2)

Re:Uhhh.... (Score:2)

Re:Uhhh.... (Score:2)

Stylesheet issues... (Score:5, Informative)

Re:Uhhh.... (Score:1)

Re:Uhhh.... (Score:5, Informative)

It's the XSLT (Score:1, Informative)

Re:It's the XSLT (Score:1)

Re:It's the XSLT (Score:2)

Re:Uhhh.... (Score:2, Informative)

I get this in Netscape 7 Preview: (Score:2)

Check this with IE though: (Score:2)

Re:Check this with IE though: (Score:1)

Re:Check this with IE though: (Score:2)

Re:Just use IE6 (Score:2)

Re:Just use IE6 (Score:1)

Re:Just use IE6 (Score:2)

Re:Just use IE6 (Score:2)

Re:Just use IE6 (Score:1)

How Slashdot-like (Score:5, Funny)

Re:How Slashdot-like (Score:1)

DTDs (Score:2)

Oh Boy! (Score:1, Offtopic)

Re:Oh Boy! (Score:1)

Lawmakers who don't understand the law (Score:4, Interesting)

Re:Lawmakers who don't understand the law (Score:1)

Re:Lawmakers who don't understand the law (Score:1)

Re:Lawmakers who don't understand the law (Score:2)

I say... (Score:2)

Example of the new markup (Score:5, Funny)

Re:Example of the new markup (Score:2)

Indeed, it's not free (Score:3, Informative)

Why didn't they just use standard HTML? (Score:2)

Re:Why didn't they just use standard HTML? (Score:2)

Re:Why didn't they just use standard HTML? (Score:2)

Re:Why didn't they just use standard HTML? (Score:2)

Re:Indeed, it's not free (Score:1)

Re:Indeed, it's not free (Score:2)

Re:Indeed, it's not free (Score:3, Insightful)

Re:Example of the new markup (Score:2)

Re:Example of the new markup (Score:2)

Re:Example of the new markup (Score:1)

Re:Example of the new markup (Score:3, Interesting)

They are using WordPerfect Too (Score:3, Informative)

don't even validate (Score:2, Interesting)

save a buck or two (Score:1)

Re:save a buck or two (Score:1)

DTDs, Schema, and XDR (Score:4, Informative)

Re:DTDs, Schema, and XDR (Score:2)

Re:DTDs, Schema, and XDR (Score:2)

Re:DTDs, Schema, and XDR (Score:2)

Re:DTDs, Schema, and XDR (Score:2)

Great! (Score:5, Funny)

Re:Great! (Score:2)

Re:Great! (Score:1, Funny)

Another Use for Microsoft crap (Score:3, Insightful)

Re:Another Use for Microsoft crap (Score:3, Funny)

What part about public domain don't they get? (Score:5, Insightful)