Comment Re:Watching systemd evolve (Score 1) 765

What I am saying is that what systemd calls "log database" is no such thing and massively inferior to the real thing.

Please don't confuse the term database with a RDBMS or similar. A database can be a simple text file with structured data like /etc/passwd.
http://en.wikipedia.org/wiki/F...
systemd's journal is by any definition a database file, and since it contains logging info, I fail to see the problem in calling it a "log database".

I also _know_ that people that put logs into databases uses real databases, you know the ones that come with ACID.

Some use ACID RDBMS, some use non-ACID NoSQL databases, etc. It all depends on needs and to why they collect the logs in the first place.

What you do _not_ do is change the on-machine logs into a wannabe-database. That is about the worst thing you can do.

You are of course wrong about this. Enterprise log analysers like Splunk use simple files and indexes to store events etc., pretty much the same way that systemd does.

There is considerable overhead in using an ACID RDBMS, so when things need to be really fast you do what systemd and Splunk do. Logstash (probably one of the more popular log analysers) doesn't use an ACID-compliant RDBMS as a backend either (though it can if you want to).

The point is that the world of logging and databases has moved on considerably in the last decade. The primary reason is more and more data that needs to be analysed (fast) for one reason or another (business, real-time security etc.).

systemd's journal is a pretty significant upgrade to the otherwise rather fossilised world of Linux logging. Don't get me wrong, I have tremendous respect for the Rsyslog team, but they have been struggling for over a decade to solve just some of the problems systemd's journal has now solved.

And I have never really seen any good argument against using binary, structured and indexed log files:

They can be read by all standard Linux text tools with piping.
There exist multiple independent readers for them.
The logs can be programmatically accessed through a myriad of languages.
They provide functionality that can't be matched with legacy syslog files.
There is no non-contrived scenario where they can't be read one way or another.
They can be exported in any format and have default export options for all relevant industry standards.
Unlike syslog output, they have a stable and documented API.

So there isn't any real downside to using binary journal log files, while there are considerable advantages.

Comment Re:Watching systemd evolve (Score 1) 765

So now you are going from saying that people do not use DBs for logging unless they are clueless (let me quote you on this):

And yes, I have extensive experience with log analysis. You do not turn the logs into a database unless you are fully clueless.

...to saying that DBs can be a good thing?
And please remember that ACID doesn't give any protection against file corruption, nor does it prevent user-space programs corrupting the logs by generating obviously impossible field values. But then again, neither do the journal or plain text logs.

Again, for those who need to run a full DB for their logs, systemd's binary file format is great, because it allows the journal to be exported in industry-standard formats like JSON. That way the remote log-sink database can receive and store rich meta-data in a totally stable and structured way; changing hostnames, IPs, different wordings, or even different and changing languages used by daemon log output aren't a problem with the journal, since it is based on field values, not complex regexing of unstructured, undocumented, unstable, language-specific words.

Oh, and tamper-proof, cryptographically "sealed" logs too (FSS) if you want that.

If your remote log-sink solution isn't a full DB, you still get all the benefits of receiving structured log entries with e.g. full microsecond-precision timestamps _and_ monotonic timestamps. It is trivial to convert e.g. JSON output with defined field names to any other structured format, while converting and aggregating unstructured text files is a pain.
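To illustrate the point, here is a minimal Python sketch of consuming one entry the way "journalctl -o json" emits it. The entry itself is a made-up sample, but the field names follow the documented journal conventions, and note how the timestamps are microseconds kept as strings, so no precision is lost in transit:

```python
import json
from datetime import datetime, timezone

# Hypothetical single entry as "journalctl -o json" might emit it;
# field names follow the journal's documented field conventions.
line = ('{"__REALTIME_TIMESTAMP": "1394203576123456", '
        '"__MONOTONIC_TIMESTAMP": "5123456", '
        '"PRIORITY": "3", "_HOSTNAME": "localhost", '
        '"MESSAGE": "New session 1 of user Peter H.S."}')

entry = json.loads(line)

# Journal timestamps are microseconds since the epoch, serialised as
# strings; converting to any other structured format is trivial.
usec = int(entry["__REALTIME_TIMESTAMP"])
when = datetime.fromtimestamp(usec / 1e6, tz=timezone.utc)

print(when.isoformat(), entry["_HOSTNAME"], entry["MESSAGE"])
```

Compare that with regexing a timestamp out of a free-form syslog line and guessing its year and timezone.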

It is also great for those who only need local logging; the journal has many of the advantages of a full DB, but without the complexity and overhead. Its append-based file format is also much more robust against file-level corruption than databases. Since the log files are structured and indexed with field values, it is easy to perform powerful yet simple queries on them.

How do you find all syslog entries with the priority level "error" generated by the previous boot only?
With the journal it is: "journalctl -b -1 -p err"

And how do you generate a full list of every executable, including their path?

Since you can "tab" through the values in the journal, this can easily be done: "journalctl -F _EXE"

And with the -x switch, the help database is activated, giving further explanation of what the log entry means and direct links to upstream support, perhaps linking directly to a page that explains the error code etc.:

Example:
# journalctl -b -x -u systemd-logind.service

mar 07 16:46:16 localhost systemd-logind[546]: New session 1 of user Peter H.S.
(log entry above, help database output below:)
-- Subject: A new session 1 has been created for user Peter H.S
-- Defined-By: systemd
-- Support: http://lists.freedesktop.org/m...
-- Documentation: http://www.freedesktop.org/wik...

There are seriously many people who just trawl through long log files (even using vi, apparently), because it is hard to grep for anything unless you already know what to grep for, and because regexes are, well, regexes: difficult to use, understand and remember in all their many variations. Basically, newbies can't read or filter Linux syslog files by any means other than trawling through them.
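As a contrast, here is a small Python sketch (the log line and record are made-up examples) of the difference between grepping unstructured text and filtering on a structured field:

```python
import re

# The same hypothetical event, once as an unstructured syslog line and
# once as a structured record with explicit fields.
syslog_line = "Mar  7 16:46:16 localhost sshd[123]: error: kex protocol mismatch"
record = {"PRIORITY": 3, "SYSLOG_IDENTIFIER": "sshd",
          "MESSAGE": "kex protocol mismatch"}

# Unstructured: you must already know the wording to grep for, and the
# regex silently breaks if the daemon ever rephrases its output.
found = bool(re.search(r"\berror\b", syslog_line))

# Structured: filter on a field value; the wording is irrelevant.
# 3 is "err" on the standard syslog severity scale (0=emerg .. 7=debug).
is_error = record["PRIORITY"] <= 3

print(found, is_error)  # True True
```

The regex approach only "works" here because we got lucky with the wording; the field comparison works regardless.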

They can't have a useful GUI either, because it is impossible to make a distro-agnostic syslog GUI. Again, a problem that systemd's journal solves.

The bottom line is that the only virtue syslog files have, namely that they are human-readable, is a serious hindrance to their use too: they can't add monotonic timestamps, microsecond-precision timestamps and other meta-data without becoming excessively difficult for humans to read.

With the journal you can give machine parsers exactly the structured log info they need and can benefit from, while still allowing easily readable log output: a simple command-line switch determines the format. So you get all the benefits of legacy text logs without their severe limitations. It is a win-win situation.

These days, using simple, unstructured text log files is simply obsolete. The market and various industries have already decided this issue. Logs are meant to be analysed, and the sheer volume of them means they need to be structured, indexed and carry rich logging info, just like systemd's journal. That way they can be aggregated and machine-parsed with minimal effort.

Comment Re:Question from a non-Linux user (Score 1) 765

I am not twisting your words. Let me quote you verbatim:

That does not make systemd better. It makes it worse as it is so complex is not had dependencies on tools. Of course this also means external contributors can easily be fended off, as nobody can afford the licenses on their private budget.

Isn't it straight up obvious that you claim that external contributors can be fended off because they can't afford licenses? Isn't this exactly what you are claiming?

Since this claim is totally untrue, isn't it then obvious that you are utterly confused about how Coverity and Jenkins work, and have no clue whatsoever about how systemd development is done?

Comment Re:Watching systemd evolve (Score 0) 765

No, I'm not misunderstanding. You will need a repair option to make the logs readable.

Oh yes you were, let me quote you verbatim on this:

They aren't meant to be corrupted in this manner either, but apparently journald has a fsck option in an attempt to fix it. Crazy.

You thought that journald apparently had a fsck option; it doesn't, which just goes to show how little you know about systemd.

As a sys admin I'm not interested in what it is doing.

As a sysadmin you should know how your logging system works and the limitations it has. This is just basic SA stuff.
In this case you should be able to discern between malformed log entries and actual file corruption of the logs. This is apparently something that is really hard for you to understand.

Neither you or the tin pot systemd crew have any idea what logging actually means.

Yeah right, like you were the smart one. I am sure you are a hot-shot Linux developer with a CS degree who has to fend off job offers every week from leading firms, because that pretty much describes the leading systemd developers. They include, among many others, Greg KH, the kernel developer who maintains the stable Linux kernels (probably number two after Linus T. in the kernel group), and Poettering, who has over a decade of Linux development experience.

I could go on, but the CVs of the systemd developers are really impressive, and many of them work for leading Linux distros like Red Hat, Canonical, Suse, Debian etc.; if these Linux distros know nothing about logging, please write a peer-reviewed paper that sets them all straight.

Comment Re:Question from a non-Linux user (Score 0) 765

Good luck with your BSD adventure. Be sure to tell them that BSD is all about choice, so they should support several init systems simultaneously, and, by the way, break up their source repos into many smaller independent groups, or else they are "bloated" and "monolithic".

And don't cry when your BSD fork makes a systemd clone and throws away its old obsolete legacy script-based init system, because this is exactly what is going to happen, and yes, the BSDs will get binary logging too, it is only a matter of time.
All these changes will be "forced down your throat" no matter how much you whine about it.

BSD is for people who hate Linux, so you will feel right at home there. It is just a matter of time before you kowtow to the party line about how the GPL is bad because it doesn't allow BSD sponsors to close-source the code etc.

Comment Re:Question from a non-Linux user (Score 1) 765

Yes, you are confused about several things, including licensing payment.
Your claim that people are discouraged from contributing because Coverity/Jenkins may discover that their patch has security issues or breaks the build is laughable; of course such patches should be rewritten, and the submitter would be glad such problems were discovered. Asking the submitter to rewrite the patch, or having a developer with commit access fix it, is everyday work in FOSS land. It happens all the time and it discourages nobody; it is something the submitter learns from.
Really, should the alternative be to accept every broken patch just so as not to hurt the submitter's feelings? I don't think anybody wants that.

Regarding your claim that the systemd developers don't play nice with others and suffer from a "God complex" (talk about hyperbole), this is plain wrong:

You can find lots of statements from people actually contributing, even small patches, that the systemd developers were really friendly and helpful. The proof is also in the numbers; there are hundreds of such minor contributors to systemd, something that strongly indicates a good and welcoming developer community. Compare that to the almost non-existent non-systemd developer ecosystems.

Sure, the systemd developers sometimes say no to certain patches or features, but again, this isn't a "God complex"; it is project leadership and the technical know-how to reject things that are bad for the project. This happens in all FOSS projects.

Comment Re:Question from a non-Linux user (Score 1) 765

Seriously? Try again. I am not confused at all. But you are blind to what is happening.

But you really seem to be very confused, since you erroneously talk about systemd contributors needing to pay licensing fees: "Of course this also means external contributors can easily be fended off, as nobody can afford the licenses on their private budget."

That is just a total misunderstanding of how things work, so it really seems like you are very confused about even the basic facts of how systemd is developed.

Comment Re:Watching systemd evolve (Score 1) 765

Disagree or not, logging directly to a full DB is very common, which is why it is a selling point for every commercial syslog implementation I know of.

Regarding the indexing and structuring of log files, this is exactly what journald does. It dramatically reduces the work needed to analyse them, even if you re-index them and convert them to another structured file format like JSON.
Working with indexed and structured files with defined field values is always better than working with unstructured, un-indexed text files.

systemd's journal log files will always have an advantage over the poorly defined syslogd text logs, no matter how you spin it.

Comment Re:Watching systemd evolve (Score 1) 765

contain rich meta-data....

What a load of claptrap. People log to log files for reasons of the lowest common denominator. We have things called 'databases' for this kind of stuff and there's perfectly good reasons we don't use them for logging which should be obvious to anyone with an ounce of sense.

Really, are you unaware of how common it is to aggregate log files in databases? The ability to do so is a major selling point for Rsyslog.
In fact, Rainer started Rsyslog exactly in order to overcome the many deficiencies of syslog(3), including the severe limitations of unstructured text files.

systemd's journal is pretty much a stroke of genius in that it overcomes all the limitations of unstructured, unindexed text logs, while not being a full blown DB either (the journal files are basically appended text files with a different line delimiter and an index in front).

Log files keep on growing every year, because people log more and more, and systems are running more and more daemons. Analysing such logs means machine parsing, and structured and indexed log formats like the journal have a huge advantage for this kind of work.

Since the journald binary logging file format is stable and fully documented....

I sincerely hope Red Hat and Lennart is paying for this piece of contrived PR.

Ah, so now facts and reality are propaganda. Take a look here
http://www.freedesktop.org/wik...
and here (about the journal file format)
http://www.freedesktop.org/wik...

The arguments against it are based on contrived scenarios

Corrupted logs on a perfectly running system isn't a contrived scenario.

It is when the corruption means squat for the ability to actually read the logs.

For some reason you seem to think that "corruption" discovered by "journalctl" means the logs are unreadable, but as explained, they are often marked "corrupted" just because a single field value in a single log entry was found to be impossible. This doesn't mean that the log file can't be read.

....through the Linux/Unix concept of piping.

Piping eh? Wow. You give the impression you haven't heard of it before ;-).

I don't care what you think, as long as you agree that the whole notion of not being able to use standard Linux text tools like grep together with systemd's journal is just plain wrong and a total non-issue.

There just aren't any good arguments against structured, indexed log files that can be programmatically accessed and have rich meta-data.

Comment Re:Watching systemd evolve (Score 1) 765

They aren't meant to be corrupted in this manner either, but apparently journald has a fsck option in an attempt to fix it. Crazy. I don't know where editing files was mentioned.

You misunderstand; there _isn't_ a "fsck" option for systemd journals, and there won't be, because that would be the equivalent of "editing the log files".

Ahhhh, the whole 'disk corruption' misdirection. Yes, corruption could also be caused by solar flares, but this isn't, nor is it caused by a failing disk. It is caused by the journalling system itself. No investigation of this has happened, according to the maintainer 'it happens', we rotate and move on. Lunacy of the top order.

You simply don't understand how journald works. Let me explain again: systemd marks certain log entries as "corrupted" if e.g. they are plain wrong, that is, the client sent obviously wrong or malformed log entries. That way you avoid impossible field values that might otherwise have screwed up log-watching or log-analysing programs; think of it as discarding bad data. This is the most common "journal" corruption that people discover when using journalctl's "verify" option. It is usually totally harmless and usually limited to a single log entry. It happens all the time with syslogd text logs too; people just don't discover it because syslogd doesn't have log file integrity checks.
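A toy Python sketch of the principle (this is not journald's actual code, just the idea of flagging impossible field values instead of declaring the whole log unreadable):

```python
# Toy sketch: separate entries with impossible field values from good
# ones, rather than refusing to read the whole log.
def verify(entries):
    good, flagged = [], []
    for entry in entries:
        prio = entry.get("PRIORITY")
        # syslog priorities run 0 (emerg) .. 7 (debug); anything else is
        # an obviously impossible value from a misbehaving client.
        if isinstance(prio, int) and 0 <= prio <= 7:
            good.append(entry)
        else:
            flagged.append(entry)
    return good, flagged

entries = [{"PRIORITY": 6, "MESSAGE": "started"},
           {"PRIORITY": 99, "MESSAGE": "bogus client value"},
           {"PRIORITY": 3, "MESSAGE": "disk error"}]
good, flagged = verify(entries)
print(len(good), len(flagged))  # 2 1
```

One bad entry gets flagged; the other two stay perfectly readable, which is the point.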

When journald discovers such malformed entries, it rotates the log as a simple preventive security measure.

Sure, the journald log files are sensitive to real file corruption too (but so are syslogd log files), so clean unmounts of file systems are a good idea (they always are). But the whole issue is massively overblown by people who don't use systemd anyway.

Comment Re:Question from a non-Linux user (Score 1) 765

You seem to be somewhat confused here:

Coverity is a static code analyser that finds potential stability and security problems. https://scan.coverity.com/
Think "Unix lint".

Many FOSS projects like the Linux kernel and LibreOffice use Coverity. It isn't a magic wand, but it plainly works nevertheless. No systemd contributor needs a licence in order to contribute to systemd, even if they get commit access. Coverity can trivially be replaced or complemented by other static code analysers, or even be omitted; it is something helpful, not something systemd depends upon.

The Jenkins builder provides "continuous integration with automated test execution", something that really improves code stability since it ensures that e.g. a patch isn't committed if it breaks the build.

Again, no contributor needs any kind of licence for this, and again, this is a nice-to-have, not something systemd depends upon.

You seem to be arguing against code discipline and the use of automated tools, just because systemd has them (while SysVinit doesn't). That is just bizarre.

Anyway, systemd's developer community is extremely vibrant and growing all the time; hundreds of people have contributed already, with new ones coming every month. So there doesn't seem to be anything that scares contributors away, unlike, sorry for putting the boot in, the tiny and deteriorating non-systemd developer ecosystem.

Comment Re:Watching systemd evolve (Score 1) 765

The major point of systemd's binary logs is that they are structured, indexed, and contain rich meta-data, something plain syslog logs aren't.

That again means that when you export your logs to a log sink, you can do so in a format that fits directly into the log sink's database, like JSON, and preserve rich meta-data like the "_MACHINE_ID" field, which uniquely ties every log entry to a certain machine. You can also be sure that the program claiming to generate an entry actually did so, since journald provides a kernel guarantee for this.

If you choose to keep the logs in journald's binary format on the log-sink server, you gain the benefit of indexed fields when analysing data; a huge win when it comes to random access, and so much faster than trawling through text logs (O(n) complexity and all that).
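The random-access win is easy to sketch in Python: build an index over a field once, and every later lookup is a dict hit instead of a full scan (the unit names and entries below are just illustrative):

```python
# Ten made-up entries from two units.
entries = []
for i in range(10):
    unit = "sshd.service" if i % 2 else "cron.service"
    entries.append({"_SYSTEMD_UNIT": unit, "MESSAGE": "msg %d" % i})

# One O(n) pass builds the index...
index = {}
for pos, entry in enumerate(entries):
    index.setdefault(entry["_SYSTEMD_UNIT"], []).append(pos)

# ...after which every lookup by field value avoids scanning all entries.
print(index["sshd.service"])  # [1, 3, 5, 7, 9]
```

With unstructured text logs there is nothing to index on; every query is another full grep over every line.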

Every log entry can be traced back to a certain machine (not just a hostname), many log fields have a kernel guarantee for what they say, and there is integrity checking and even strong cryptographic "forward secure sealing" (FSS) security against tampering.
The journald logs are simply designed from the ground up to be read and analysed on other computers than the one they were generated on, either in their native format or as exported logs.

Since the journal can be accessed programmatically, it can aggregate and analyse logs across different languages and has strong immunity against changes in the wording of daemon log output (it operates on fields, not words).

And since the journal logs are structured identically by a documented standard, they are trivial to aggregate, unlike the output of many different syslog implementations.

Since the journald binary logging file format is stable and fully documented, and can be accessed programmatically with language bindings, you don't need a specific binary like "journalctl" to read them. In fact, there is an Rsyslog module that can directly read (and export with metadata) systemd journals. There are also Python modules etc. that act as journal readers.
In fact, it is quite possible to make a systemd journal reader that works on non-systemd platforms like MS Windows or OS X.
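As a taste of how simple such a reader can be, here is a Python sketch that parses the journal's plain-text export format (the blank-line-separated NAME=value stream that "journalctl -o export" produces). The sample data is made up, and binary-valued fields, which use a length-prefixed encoding in the real format, are ignored in this sketch:

```python
# Made-up sample in the journal's text export format: one field per
# NAME=value line, entries separated by an empty line.
sample = (
    "MESSAGE=New session 1 of user Peter H.S.\n"
    "PRIORITY=6\n"
    "_HOSTNAME=localhost\n"
    "\n"
    "MESSAGE=Session 1 logged out\n"
    "PRIORITY=6\n"
    "_HOSTNAME=localhost\n"
    "\n"
)

def parse_export(text):
    entries, current = [], {}
    for line in text.splitlines():
        if not line:           # a blank line ends the current entry
            if current:
                entries.append(current)
                current = {}
        elif "=" in line:      # a simple NAME=value field
            name, _, value = line.partition("=")
            current[name] = value
    if current:                # flush a trailing entry, if any
        entries.append(current)
    return entries

for entry in parse_export(sample):
    print(entry["_HOSTNAME"], entry["MESSAGE"])
```

No systemd needed to run it, which is the whole point: the format is documented, so anyone can write a reader on any platform.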

journald collates all logging on the Linux machine, which means everything can be trivially exported to a remote log sink, including the kernel ring buffer and the binary utmp and wtmp log files that syslog doesn't know about, and it can include early-boot and late-shutdown log info because it can work in the initramfs, something syslog can't.

How about journald being designed to be signal-safe from the ground up (unlike syslog), and not silently dropping messages under load, etc. etc.?

systemd's binary log format is simply a massive win for both the enterprise user and the Linux newbie and everything in between.

The arguments against it are based on contrived scenarios, like professional admins who don't have (systemd) boot media and lack access to a PC, a USB stick and an internet connection so they can make one in five minutes.

All the standard Linux text tools like grep, tee, sed, sort etc. work with systemd's journal through the Linux/Unix concept of piping.

Sure, systemd's journal still doesn't have as many tool sets and projects around it as syslog. But that is simply a matter of time.
