Catch up on stories from the past week (and beyond) at the Slashdot story archive


Forgot your password?
User Journal

Journal Journal: Blog this week: slashdot, joelonsoftware, perlmonks
'not impressed' [1]: acvh [2] - '... The games available at release and soon
  after do not look very interesting ...'

yes the Japanese games are different but there is a good reason. Firstly
the system is not capable of running a full blown title as per PS2.
So the designers are really forced to rethink their game style (and do
they think) to fit the limitations of the hardware format. Take for
instance konami with MG. They release MG but not *solid*, but Acid [3].
Same franchise, different game style (with a card twist) [4].

Konami already make a slew of GBA games [5] so they pretty well undersand
their market. Its a different market to PS/XB.

Remember Digimon [6], small animals, monsters fighting, computers and
CARDS ... it's a bit like a digital D&D for those old enough /young
enough to remember. All appeal to a very specific market.

After attending a ACMI game time symposium [1] in Melbourne this year
I had the chance to hear/see *Tetsuya Mizuguchi* [7], Gamespot
interview [8]. He talked for about an hour about game design and a
bit about some upcoming titles for PSP [9], [10]. I now understand a
bit more about *Japanese* game style and take my hat off to true innovators.
No cheesy ports of your [insert your top 10 title here] PS2, PC games here.
All *new* original ideas.

Joel on Software forum:

        Christopher Baus / Microsoft in Trouble: [12], [13]

        Overall not a bad article. I wish it was a bit longer and expaned on
themes of *growth*. (IBM still grows but why cant MS?). Maybe thats
the idea behind the book - highlight short essays and see if there is
a market to expand?

But 2 things to me dont ring true to me after reading the article:

'... Consumers want their data to be securely managed, and they want
it everywhere, all the time. They don't want it locked up on their PCs. ...'

Access not distribution: I don't hear the outcry for data everywhere,
just access to data fullstop. Applications everywhere lock up your
data into boxes. You have to work hard to get at it. Getting access
to your data on your PC. Then selective data access to the Web. MS
works hard to lock your data up (Outlook Express for example). But
Linux, Mac they are all as guilty.

'... Microsoft can't force relevance into the desktop. This is also
why WinFS on the desktop is a waste of time. Consumers don't want to
manage their contact database schemas on the desktop. They want google
to do it for them. ...'

Ability to search: This statement ('WinFS on the desktop is a waste of
time') is pure bollocks. Walk into any office and you will see the
nightmare of data management. "Where is it" is the catchcry. Imagine
the revolution of a search engine on the desktop integrated into
applications but not via a network connection. It is no *silver bullet*
but can you remember the Internet before Yahoo, Alta-Vista and Google?

I often think MS missed entirely the archive and retreival (only just
getting into search) of data paradigm just like they missed networking.

        Out of Sight, Out of a Job, [14]

        Not a bad article for mainstream. But does it perpetuate the myth of
the atypical programmer cliche` of *pizza eating, softdrink guzzling
male youth - a prime candidate for the *business machine* to exploit ...
oh I mean facilitate? (read the Sean Carroll's and Napters of the world)?

Also I'd challenge the view that:
'... other type of programmer that is doomed is the developer with poor
social skills ... traditionally been a haven for the eccentric and socially
awkward. ...'

    Social skills? What about online social skills. Many a software developer
may be articulate in the written word, yet shun face time? When it boils
down to it, 'talk is cheap'. Developers who are skilled in some *desirable*
area who demonstrate 'operacy', even in the face of competition is going to
have an opportunity to work. I see a lot of people with *soft* skills who
now boast of their *non-technical* expertise (accountants, PM's, MBA's).
But if you cant do it in code, all the talking and com skills wont help you
(though it may help you organise a team to do it). Maybe the reason for
the shift is something other than just *traditional* com skills?

'... The great advantage that local resources have over distant ones is
presence: Face-to-face meetings and talks remain vastly more productive
than phone calls, and phone calls are vastly more productive than instant
messages, or, worst-of-all, e-mail ...'

        Face time to who? Other developers, internal company, clients? In fact
the Internet and POTS (plain old telephone system) which predates the
Internet actaully extend presence. Is it 100% critical for *face time* for
all occasions? No. Face time can be optomised to where its required most.
Negotiating deals with clients, external meets with other technologists

What I would have liked to have read more about is the *business acumen* of
developers and the relationship between ability to build, assemble, then
*SELL* software (and make money).

I don't believe that the *socially challenged* developer is doomed but
the *entrepreneur* challenged one is. For example India is taking away
jobs from US companies (or US companies are feeding them) because of the
(necessary) entrepreneur skills that India builds though intense competition.
They simply have exploiting cost, law of diminishing firms, lowering
transaction costs.

Free software and news groups also pose a challenge. In the face of free
and open code how do the successful rise to the challenge? Through
communication skills, entrepreneurship?

        Why Is Usability So Hard [15]
        this is a great article and certainly one I would like included.
what separates this one from others:

        *describes real problem (usability) - real and not going away
        *article is generic - will be useful in 1 month as it will be in
          5 years
        *easy to read - (practiced what is preached and data is chunked)
        *good ideas - presented ooze *thought* behind them
        *framework -outlines steps to good design

I dont want to suggest that it needs any real changes but if additions
are made be sure to include:

        *a few 'real world examples' may enforce the points made.
        *change 'websites' to 'internet enabled applications, devices'

Read in conjunction with Paul Grahams (Made in the USA, [16] and Taste [17]) the
flipside of usability they form powerful insights into the contemporary
software malaise.
        The Vim advantage [18]
                gvim, vim cover 4 of the major wants you listed above:

        * syntax
        * resource (only reason emacs sucks)
        * folding
        * X platform features

        the kicker I've usually found is when you are working on a
        machine w/o enough resources [19] for emacs OR some foreign machine
        that only has vi (elvis). Then most of the editing attributes
        you found in vim, gvim are still there .. just degraded.

        Plus vim has reference book [19] and other docs [20] for those who want to
        get it and an active developer [21] community (scripts [22] etc).

        All hail *Bram Moolenaar*
        Apollo 12 at 35, [24]
        read and drool: AGC, DSKY and more (Score:5, Informative) [25]
        by goon (2774) on 11-18-04 23:15 (#10861798)
        ( | Last Journal: 09-24-04 9:26 )

        for those who where not around here's some links to the AGC, DSKY and more:
        *Apollo Guidance Computer (AGC) [26]
        *slash article with source code listing [27]
        *Simulation of Apollo Guidance Computer [28]
        *DSKY [29]

        [1] not impressed, acvh
        [3] Solid Gear Acid,
        [4] TGS 2004: Metal Gear Acid Hands-On: Turn based Metal Gear turns out to be quite a blas
        [6] Digimon World 2, Walkthrough/FAQ

        [7] ACMI game time symposium, Melbourne, Oct 2004,
        [8] Testsuya Mizuguchi, software developer Sega Rally (Sega) 1994, Rez (PS2)

        [9] Q&A: Tetsuya Mizuguchi, Rez creator tells what's holding the Japanese
                game industry back.
        [10] PSP, Playstation Personal,
        [11] PSP Site Launches, Launch Titles Confirmed,
        [12] Best Software Essays, 2004,
        [13] Christopher Baus / Microsoft in Trouble,
        [14] Out of Sight, Out of a Job, Larry O'Brien,
        [15] Why Is Usability So Hard,
        [16] Made in USA [16], Paul Graham,
        [17] Taste for Makers, Paul Graham,
        [18] the vim advantage,
        [19] Vi IMproved - Vim, by Steve Oualline, English,
        [20] Vim Documentation,
        [21] Vim News,
        [22] Vim scripts,
        [23] Bram Moolenar,
        [24] Apollo 12 at 35,
        [25] read and drool: AGC, DSKY and more,
        [26] Apollo Guidance Computer (AGC)
        [27] slash article with source code listing
        [28] Simulation of Apollo Guidance Computer
        [29] DSKY,


Journal Journal: good guys, bad guys. dvd rce. gnome saves the day

good guys
for the year and a bit I've been able to regularly catch up with an old show that I hadn't watched for some 20yrs. every friday afternoon, 1400 I could watch, pretty much add free on Channel 31 (which transmits from deep within the bowels of story hall). At the time it was last on tv I was at uni where I remember the 'absolutely positivly' t-shirts for sale in the story hall lobby.

bad guys
That was until the ABA decided to stop commercial transmissions. Well this was pretty much the end of friday afternoon episodes. Well so I thought.

  • ... all this mucking around is because of rce ...

dvd & rce
I noticed my favourite haunt, minatour, you can get the box set for $450AUD which is a bit steep. Oh well. That's until amazon uk discounted it for £45 delivered. Hmm looking better. But then what do I play it on. Could try it on my ps2 scee3000 but hacks only apply to 1000 early models. (remember that ps2 are regionalised much like dvd players). That leaves either trying to see if DVD region X works. I could try to mod the machine but purchasing such chips is illegal in AUS. Didn't feel like laying out another 14 pounds so back to the drawing board.

I got the ps2 back in 2000 with the intention of using it for games, CD's and DVD's. But rce means I can't play anything but region 4. What happens if I want to get a region 2 dvd not released in aus?

This is where gnome, linux, some fancy hardware come to the rescue

gnome and xine: the good guys win
so what's left? Gnome. I recently had a hardware and software upgrade on the linux desktop machine. It has a dvd player. Why didn't I think of it before. Whacked in the disc. Loaded up xine, played around with the vision controls and whammo, no more rce.

The board has a late model NVidia graphics card so I'm now looking out for a suitable cable to run the signal to the tv. But cripes now I can do some coding while I watch my weekly episode. This week, 'Old Dog with New Tricks'. Only 2700min left to go.

This is why I like linux :)

User Journal

Journal Journal: Pub interaction model

goon discussed at joel on software ~ social interface design forum

        Pub interaction model

        The Pub interaction model aids scanning of topic information. Is it good
        enough to copy existing models?

        In my many travels into SIC (Social interactive community) sites that I
        have noticed a re-occuring interaction model that takes advantage of
        everyday interaction models most would be familiar with. I call it the
        'Pub interaction model'. It probably has some fancy technical name [0].
        But for the sake of argument, Pub [1] will do.

        What was the last time you went to a Pub (party, class or lunchroom are
        equally good alternatives)? Imagine walking into the Pub for the first
        time. Think of the questions you ask yourself as you enter the room?

                *How do you choose which group you want to join?
                *How do you strike up a conversation with complete newcomers?
                *How do you splice a question into the discussion?

        The list goes on. Can you see the similiarity between entering a Pub,
        finding a suitable group, topic and engauging in conversation AND
        choosing what group, topic to reply in a JOSSID forum? [2]

        But how does this relate to software design and SID? [3] Well I actively
        look for 'Pub' like interaction models as a way to quickly choose what
        forum I might be able to interact. I can quickly scan the room visually
        (whos in the room, what topic groups are operating) and aurally (whos
        saying what) then dive in.

        In terms of software lets look at some examples.

              Email clients are a common example. In Thunderbird [4] and Outlook
              Express for example you can filter your inboxes (walk into the pub and
              label social groups), then scan the number of topics (listen to the
              conversation) to see number of emails.

              FogBUGZ [5] also uses the Pub model when showing a customer problem.
              Allowing the person(s) responsible to see the whole discussion at a
              glance (though in the past) to resolve a customer support issue.

              Social interaction sites such as Slashdot or Perlmonks the Pub model I
              look for here is the slashdot front page [6] or Perlmonk Monastry gates [7].
              Both mechanisms allow me to see what groups are discusssing what threads
              and allowing me to make decisions.

              Another example where this model is used is blogs (soap box sites).
              The example here is Monologue [8]. I can quickly scan the site with the Pub
              model to see if theres a worthy topic to read. There is no feedback on this
              site (hence the term soap box) but I can see the person talking, scan the
              topics and skip the noise [9].

        But is copying existing models good enough?

                '... a particular state of the neurones in the brain is more likely
        to be followed by another specific state than by a random state ...'

        Edward de Bono argues in his book, 'New thinking for the new millennium' [10],
        that patterns or paths that are built you are less likely to change.


                '... the downside of patterns is that once we have set them up we are
        trapped by them ... we also need the ability to challenge these
        patterns from time to time in order to set up better patterns ...' [11]

        This may be a new field of software development so avoid the trap of
        simply replacing existing social models, roles or modes of interaction.
        While you have the chance, don't be constrained by 'old' patterns or paths
        we observe. Experiment and develop better group interaction models instead.

        While I find the Pub model advantagous to use and software developers
        continue to implement them, are we are missing out on designing better social
        interactions instead of accepting some possibly inferior alternative(s)?


        [0] Aggregator (eg: news aggregator)
        [1] Public house or Pub.
        [2] 'Joel On Software Social Interface Design' forum.
        [3] Social Interface design
        [4] Mozilla Thunderbird email client
        [5] FogBugz
        [6] Slashdot front page
        [7] Current monastic discussions on Perl
        [8] Mono blog is Monologue
        [9] For example, Todd Berman: Responding to a Response
        [10] New Thinking for the New Millennium, Edward de Bono, P14
                  Viking Press, 1999.
        [11] New Thinking for the New Millennium, et.,al. PP14-16.

Linux Business

Journal Journal: tale of 2 technologists

goon in response to #10094868

'... wouldn't you agree Linus got a measly sum compared to Steve Jobs ...'

the salesman
different generation, different locale. Jobs was around at the birth of the PC revolution. He's had plenty of time to create products (hardware + software), make mistakes and sell to a large domestic then international market. I dont think Jobs has ever given away code. Jobs has a knack (and the appropriate reality distortion field ) to foster an ideas environment, root out the better ones (for good or bad: read Insanly Great and think Andy Herzfeldt (And another thing)), take a punt and back the idea to the hilt.

For that Apple, Next, Pixar have delivered big bucks.

the engineer
Compared to Jobs and Apple, Linus and Linux are babies. Linux is a product of it's time. Just like in Victorian England where amateur gentleman had the time (and money) to ponder, think, question and execute their way into the industrial revolution, Linus tucked away in his bedroom with a donated '386 and copy of GCC gave heart and life to the GNU suite of tools in the form of the Linux kernal.

This is one big block in the Information revolution that is now occuring. And while Linus maybe currently *worth less* than Jobs the potential for Linux to generate new wealth is staggering.

In Killer App, Downes and Mui argue that moore + metcalf = law of disruption + coase . Linux and the birth of the Internet has in a way directly influenced this. Anyone who can exploit these effects and sell products stand to make $$$.

Linux is a product. How Linus utilises his time, programming and creating or selling: Its up to him.

'... Beyond that, make $$$ by selling some commercial software that people are not willing to write for their own enjoyment or use. ...'

think diesel not ford
or to put in a different light ... produce software that others have yet to think of or cannot do for themselves. Not everyone want's to sit behind a keyboard and have to understand computers. But to think you can make a living the old way, ignoring mr more and metcalf and hope that distruption and coase go away is shere lunacy.

I like to think of Linux as revolutionary as the Diesel engine (which by the way was not patented and possibly led to the early death of Rudolf Diesel) and Jobs as a Ford like figure. Though the analogy would probably be better with Gates.

To me Linus has unleashed the software equivalent of the Diesel engine. I can leave it to your imagination what legacy Gates and Jobs will leave.


Journal Journal: IronPython 0.6 released as CPL 6

Update: how alpha is IronPython?

reading from the readme with the download you can see it's pretty bare.

  • The only two Python programs that I know for sure run on IronPython are and the parrotbench benchmark/test-suite. Of course, IronPython is the only Python implementation other than CPython-2.3 that can successfully run the parrotbench benchmark/test-suite, so this shows that the implementation can handle the majority and trickiest parts of Python's core semantics successfully.

and ....

  • Most of the standard Python library is not currently implemented, so it is unlikely that many existing Python scripts will run successfully under this early alpha release of IronPython-0.6.

and ...

  • it's an implementation of the Python language, you should be able to use all of the language constructs that you're familiar with. By default, IronPython doesn't ship with any of the standard Python library, so if you wish to use these libraries you'll have to provide them yourself.

Ed Dumbill whacked up a comment on IP. Come on Ed. If your going to show a bit of code what about showing the steps to reproduce it? Seems Miguel could run it out of the box. Must be my crappy path setup.

mine for what it's worth is something along the lines of ....

  • d:\IronPython-0.6\bin\d:\mono\mono-1.0\bin\mono IronPythonConsole.exe

but my python paths are out so I cant load pyGtk. So I'll have to fiddle around with path environment variables. Shouldn't have this problem with Linux. I've got installed Python 2.3.n, Gtk, pyGtk, Mono-1.0, IronPython-0.6 on a win2k box. Looks like I'll have to reproduce this on the linux box.

IronPython 0.6 released as CPL
Hackers coding only in Python. Gimme a break. What we're largely seeing is *script* hackers coding in Python. cgi-bin. shell crap. webbots. It's where Python shines. But there's an awfully big collection of code projects that need to straddle the speed of C++ and the dynamicism (to some degree) of higher-level languages. And there's a lot of hacking opportunity there. Java does that region very well, thank you.

might be time to try IronPython (~1.7 times faster than standard Py2.3). Here's some performance benchmarks comparing cpython to ironpython for those interested in the numbers. interestingly, IronPythons creator Jim Hugunin designed Jython.

Thanks for prompting me to look as Jim just completed a talk at OSCON, 28 ~ IronPython: A fast Python implementation for .NET and Mono and has released the source.

btw it's released under CPL


Journal Journal: rewriting, 'making the news'

...It's not enough for journalists to be mere messengers without understanding the hidden agendas of the messages and the myths that surround it ... john pilger

Dan Gilmour is writing a new book, "Making the News" [Oreilly yet to be published] and asking for RFC. Added some /. comments.

Subject: Ideas for Making the News ~ Ch 2
From: goon
Reply-To: goon
Date: Fri, 21 May 2004 11:34:15 +1000
X-Evolution-Format: text/plain
Hi Dan, I've been reading through the preview of "making the news" and have a few comments..

re: Chapter 2: The Technology that Makes Making the News Possible

I've been on slashdot for a while (uid 2774) and there's a few thingsthat I can observe wrt /. and how *news* can be portrayed. Just like mainstream media, alternative media is still subject to problems. With slashdot problems occur in

*moderation - moderation judgement and bias (caveat emptor)
High moderation gets eyeballs. Moderators can make an arbitrary decision (not necessarily based on how technically/factually accurate a post is) on what they think is a good answer.

So answers to Microsoft questions/posts tend to garner negative moderation for valid responses.

Moderators also gain moderator status due to *experience* not knowledge so the sig/noise ratio can be high depending on what topic a question is asked.

(see for a better example of factual moderation. Talk to merlin aka randall schwartz. read to see what I mean about ~ signal to noise ratio over time.)

*story volume - more is (not) better
story volume dilutes the *talent pool*. commenting on stories. I notice that in areas thin in people who actually understand the topic result ina lot of meta discussion on the topic and not facts. I miss reading posts from *experts* in field.

*immediacy - (lack of systematic) follow up and sense of history
/. follows up on stories but nothing like newspaper journalists look for stories. /. tends to cover what others (mainstream media) have covered,but with feedback.

*editors - comment on the story (but don't have insight)
/. editors are pretty green. they get away with murder (sometimes) matching a quick headline and comment to a story and apply 180 to what the linked story actually says. /. ed's are seasoned in some ways more than any journo trained editor for the *tech* insight, implications and technical context.

*story selection - how stories are chosen (far from transparent, can be manipulated)
what is not reported can sometimes be as interesting as what is reported. /. gets the majority of stories from existing media, books or questions from readers.

Stories not reported in mainstream media have less chance being reported on /. unless asked by a lone reader.

*summary (why)
/. is in some way a electronic re-incarnation of the *letters page* in traditional print media. Rowdy, opinionated, funny and sometimes wrong /. is infinitely a more interesting than mainstream press. /. has grown through the cracks of mainstream media expressing it's multicast voice in an increasingly unicast press.

regs PR


Journal Journal: new format journal - longer articles of interest - p1

Upcoming articles of interest
  • Rather than just add links, topics and code without focus. This year I've decided to delve deeper into a particular areas of interest. I tend to post most stuff at slashdot sometimes at my site for perl stuff and occasionally my home page. This year I want to do much the same but filter the topics intelligently depending on the audience.

    Of course I could add an rss link and you roll your own ... one step at a time ... here's some monthly upcoming topics I'm currently writing up ... in no particular order

      • powershift - and how it effects software developers. can it be exploited?
        chefs and development - how cooking shows shed insight into software development.
        desktop, server integration - python on the desktop, perl at the server.
        networks and connections - slashdot friends, foe and foaf.
        stubby - automatic test unit generation.
        slither - simple email and sending complex markup. how POD/YAML saves the day.
        hack the block 1. - why I like (and my kiddies) to use blocks, not just toys. some surprising results.
        hack the block 2. - code visualisation with LEGO
        compiling and aggregating data - what can we learn from map making when compiling data (rss, xml, etc)?


    • building components in high level language python (getting sick of writing business logic in low level code, object pascal, c/cpp).

      • the delphi/python for delphi combination allows you to code business logic in python and the GUI in Delphi/cpp builder.

        andy bulka has a nice explanation explaining the why's and how patterns fit in.

        perl fits into this picture because it strategically gloves apache, CPAN is mature, the language is mature and evolving in a stable manner.

        python integrates into g's better. something that perl sucks at (being embedded). python also runs on windows. something perl via active state tries but has yet to pull off. (see past posts on using ActiveState and CPAN).

        python on the desktop, perl at the server.


    • installed and using sqlite instead of mysql
      • replacing ms access as a generic data bucket.
        not have to worry about running a heavier than necessary server on a client machine
        but now having to think about sqlite restrictions.

    • installed firefox 0.8
      • and thinking about the first firefox in thunderbird, episode 1 ... when I should be thinking of (clints russian flying epic!)
        using firefox->tools->DOM inspector to tackle regexes.
        DOM inspector does that parsing so you can see what tags you need to extract
        why aren't regexes used in mozilla firefox->bookmark manager->manage manager to search for saved links?
        unstable, had a hang with a download and takes longer to save complete pages
        quick mozilla hack - type in, about:mozilla in nav toolbar


The Internet

Journal Journal: 10 years on the Internet 2

10 years on the Internet
    • At the time I was working with a bunch of psychopaths - running around in boiler suits, heavy boots and lugging 50Kg loads in dangerous environments. Taking orders off idiots, having to ask permission to take a drink - but not doing what you are probably thinking. Certainly not enjoying the experience. I had to get out. I was hungry for something and somewhere else to work :)

    Not long ago I noted the 10 year anniversary of the first web page. Well this year marks my 10th year on the Internet. 10 years ago this month, January or February some time, I forget.

    • "... When is not import. Where is ..."

    I had just enrolled to do some some post grad studies. It was at a place of learning that I stumbled into the Internet.

Early Internet - what information can I find?

  • I had to get out of the current job I was in. Doing some extra studies was a good way learn more about computing and software development. The first thing I noticed about the computer centre was the crapy Windows 3.1 machines. Remember this was before the rah rah rah of Windows '95 understanding networking, browsers and IP stack (Trumpet filled this gap). Microsoft had yet to realise the importance of the Internet. Some say they still don't. It took me about one afternoon.

    • "... The biggest change was networking ... We had unlimited access to the web through our student accounts ..."

    Things had changed a bit from my undergraduate years. Gone where the mainframes, punch cards, deck writers. It was light years away from doing mainframe (similiar to this one) , batch card (info for purposes of understanding how batch works only) processing in Fortran, editing mainframe text files in Ed and writing desktop GIS graphical and console applications in pascal during undergrad.

    The biggest change was networking. The computer centre consisted of banks of networked, generic PC's via Banyan Vines to the *nix mainframes hidden deep in the bowels of applied science building. I went there once, to get a new password and was acknowledged by the cliched bearded systems admin who emerged only from their glass-bowled data centre, to quickly read out my 16 digit hashed password. We had unlimited access to the web through our student account.

    Getting onto the Internet involved a choice of CLI or graphical applications. Graphical Internet access was available through a client program called Cello an early browser interface. Mosaic had yet to be installed - had it been released? (yeah by about 2 yrs). I remember spending a bit of time working out how to search with a web search engine, and it's less successful predecessor, gopher and landing in a place called doomgate to grab Doom wad files.

First taste of *nix & internet apps

    • "...access through a mainframe Unix shell dialup account gave me my first real taste of *nix ..."

    Home access through a mainframe Unix dialup shell account gave me my first real taste of *nix. Ftp for downloads, Pine for email, Lynx for internet, vi for editing and c for programming applications.

    I think there was about 50K machines on the Internet by this time. The information space was huge and more importantly, growing exponentially.

    It would a be almost a year and a bit before I started at my first computing job at sausage. But by the end of the first day playing around on the Internet and Web I knew something big was going on.


Journal Journal: m$word-pdf, maestro and australia day

doc->pdf cont.

looks like I have found a possible quicker path to get word docs to pdf. (wonder if it can handle complex tables and embedded images?).

matts (axkit) just left a message on my use.perl journal reporting that he recommends using HTMLDOC for converting html to pdf. It also has an Axkit plugin. I checked out the man and faq. GNU licensing, perl bindings and end user support available.


downloading maestro (38.5Mb) to check out mars rover data. reminds me of the 3d pathfinder back in 97. posted detailed comparison of rover ground images of opportunity and spirit.

australia day

25 Jan. all I can say is crack a tube! (Sound of cans opening) and Australia, Australia, Australia, Australia, we love you amen!.

philosophers song - monty pythons flying circus - Episode 22
Immanuel Kant was a real piss-ant who was very rarely stable.
Heideggar, Heideggar was a boozy beggar who could think you under the table.
David Hume could out-consume Wilhelm Freidrich Hegel.
And Whittgenstein was a beery swine who was just as sloshed as Schlegel.
There's nothing Nieizsche couldn't teach 'ya 'bout the raising of the wrist.
Socrates, himself, was permanently pissed.
John Stewart Mill, of his own free will, after half a pint of shanty was particularly ill.
Plato, they say, could stick it away, half a crate of whiskey every day!
Aristotle, Aristotle was a bugger for the bottle,
And Hobbes was fond of his Dram.
And Rene Descartes was a drunken fart:
"I drink, therefore I am."
Yes, Socrates himself is particularly missed;
A lovely little thinker, but a bugger when he's pissed.


Journal Journal: complex HTML-->???-->PDF? 4

the problem

  • but the current Word doc (the catalogue) has tables, graphics and was 'built' with Word templates, so I have no idea how ell it would all convert.

the site that got me interested in pdf was Stas Beckmans site, He gave a talk to the melbourne pm last year. Through the course of his talk on mod_perl 2 he showed the notes from his site in html with pdf downloads of the site.

So I tried to re-create this html->ps->pdf so that I too could have a printable version of a project I'm working on called Ratpile (make a directory that has *stuff* stored in it searchable by stuffing information about it into a relational database - data mining some may call it.) using perl+DBI+TT2. The template I created is a *bare bones* html page sans images. This is the technique Stas is using with his docset.

the point I guess I'm trying to make is I've used text only and not images. I've done a bit of research and this is what I've come up with...

  • graphics are supported in postscript (3?)
  • others better (ybiC) than I, have hacked together html->PS->PDF code and appears to handle images via html2ps but not html tables (Create PostScript and PDF versions of all HTML files in given directory )
  • one approach could be to use Matt Sergeants, PDFLib (load_image method) a oo wrapper around pdflib by but I seem to remember has restrictions for use under OSI (has to be opensource, private use or researcher).
  • or use Alfred Reibenschuhs - Text::PDF::API where I found via an old page PDF-API2-0 which has some image (jpg,png,handleing capabilities
  • logreport has an interesting set of observations about html->PDF generation. Namely problems with html formatting and tables

building html->PDF with images and troublesome html tables

now given what we have found above I would suggest the following (unless anyone has a better idea) of using:

  • extract word document to html
  • extract table data (word document via OLE) or (via html via Html-TableExtract - like latter better.)
  • remove html tables in html documents
  • reinsert data into a simple table using <pre> tags for layout and html tags for bolding, emphasis. Or find some other method by experimentation in html for representing tables (text)
  • PDF-API2 as the PDF renderer. This can all be done in code.

the real problem maybe rendering the tables generated from word. complicated layout in word (re-rendered to html) will have to be modified to the postscript syntax then rendered to PDF. The problem is defined by converting the html tables to pdf.

it is not rocket science to create a bit of code to extract the data from the table, re-create a table using PDF-API (and its child modules).

but is there a shorcut?

of course you could forget all the above and take your chances with Michael Frankl's HTML-HTMLDOC and convert you html files directly to PDF :)


damn I love cpan.


Journal Journal: Iain Truskett RIP 2003-12-29

update: 13JAN2003 Andy Lester (oreillynet) writes ... I never met him, or even spoke to him except through email and IRC, but I feel like he was a friend. He was always willing to help out when possible, and at least discuss an idea when not.

perlmonks today -

Iain Campbell Truskett ("Spoon"/"Koschei") RIP 2003-12-29

According to his girlfriend's blog, Koschei passed away from a cardiac arrest at the age of 24. He was a significant contributor to the CPAN, and frequently interacted with us on the Perl IRC channels. His wiki is still up for perusal. He will be missed.

Rest In Peace, my friend.

-- Randal L. Schwartz, Perl hacker

Iain will be missed. It was only last November that he gave up some time to present a talk on DateTime at the PM.

Iain talked in detail how the development was proceeding on the Datetime modules and how he spent the last 8 months hard work crafting ("hacking" is a poor choice of words. Conceptually DateTime is not for the weak of mind). The talk was well presented, informative and most of all amusing (especially the bit on the evolution of the French Revolutionary Calander)

I will certainly be thinking of him when I'm using any of his modules. His legacy is more than mere code.


Journal Journal: cop this! 3

should rtfm the /. faq pages before I post.

reading rdf stuff at xmlhack

downloading Kings Quest for the young bloke and playing Hogs of War.

remember to update shipshape stuff. especially since I indexed the ratpile and the rest of the hard drive. created perl code to query all htm* and pdf files used template toolkit to output page and used ghostscript, ps2html and docset to generate pdf of printed page much like

found why the delay in longhorn and possibly the journalling file system - dependency on yukon (pt 7 managing yukon dependencies). yukon is ms-sql server database in development.


Journal Journal: 'LinuxGazette' + (lawsuit ? '.net' : '.com')

update: 07DEC2003 - did not take too long. Check out SSC vs Continued. Look out for ricks comments.

I'm always on the hunt for online books and references. One site I've thumbed through a lot is LinuxGazette. Filled with a broad range of Linux related topics, programming, sys admin etc.

Then doing some meta-moderation I came across this article, SSC Trademark Threats vs, 12-02-03 9:05.

I originally became aware of this tussle after link harvesting for later use and checking out their author faq to see if it was worth writing an article or two. Now you know something fishy is going on when the shutters go down, then come back up again sans original crew plus heated boards.

But have ssc bit off more than they can chew? I'm a member of the Victorian Linux Users Group, LUV among others and so is Rick Moen. He actively pops up (even though he's on the flip side of the world) on the LUV mailing list answering technical questions etc, berating those who don't do their homework and helping others who do. I've never really found out why? but myself and others are grateful or at least amused. (I wish LUV's email list was online you could read the posts) The Age newspaper did article on him a while back (SSC used this as a publicity puff peice on their site for issue JAN2003 (see linux links).

stop sniffing my wallet!

In a world of creators and distributors, traditionally it's the distributors who make the cash. Go ahead make your money distributing CD's of documentation, printing Journals but don't take what is not yours - user generated documentation try to spin it into your own cashcow. You're not going to get any documentation from moi at your site.

Sick 'em rick

whats a ternary conditional operator, '?:'.

Slashdot Top Deals

Work is the crab grass in the lawn of life. -- Schulz