weblog.mattdorn.com

Generously funded by Matt Dorn

Kill Your Word Processor: Popularizing Lightweight Markup

with one comment

As a part of my tentatively titled Kill Your Word Processor project, I’ve been trying to think of ways to expand the audience for it, particularly because I’ve more than once been frustrated by collaborators’ insistence on the use of clunky word processors and browser-based WYSIWYG editors which I’ve long since given up on.

I even posted a note on the Backpack forum, where users have been exposed to the Textile lightweight markup syntax, trying to get others’ opinions on the matter, but it seems I’m preaching to the choir over there.

I have, however, been encouraged by what appears to be an increased interest in lightweight markup that seems to be part of a broader paradigm shift led by the spread of "Web 2.0."

But ultimately tangible results will be the most effective means of persuasion–e.g., a beautifully formatted LaTeX document converted to PDF, and a crisp, clean Web page springing from the same source; a demonstration of the tools that make such results effortless, etc. Still, I wanted to at least lay out some of the rational arguments here, hopefully with the end result of compiling a concise list of the benefits of the moving to lightweight markup.

There are at least two technologies that have preceded the current wave of interest in alternatives to bloated word processing software–one rooted in the requirements of collaboration on document composition, the other in the benefits of separating the production of content from its presentation.

Wikis emerged on the strength of their usefulness for collaboration on documents. The limited utility of passing Word documents back and forth as attachments via email should be apparent to most by now to anyone with enough familiarity with the Internet to know that there are tools (e.g., Wikis) that make that technique unnecessary. Among other benefits, most Wiki software packages have built-in versioning features as well as warning features to deal with simultaneous edits.

Wikis in fact sprang from the "Extreme Programming" (XP) community and from one of the guiding principles of that community: "Use the simplest thing that could possibly work." For that reason, Wikis did in fact incorporate and promote the concept of separation of content from presentation through the use of exactly the kind of simplified markup languages that are under discussion here. From c2.com, the site of the original Wiki:

Wiki’s emphasis is on content, not presentation. The simple markup rules make people focus on expressing their ideas, not making them pretty. Some people have found that working in this minimal medium improves their writing.

Unfortunately, in many cases, more technically inclined users demanded the ability to include HTML formatting by typing angle brackets which make the raw input of a document difficult to read, while the less technically inclined demanded the inclusion of word processor-esque WYSIWYG editors. The degree of accommodation of such requests varies from package to package, but clearly not all Wiki users were convinced of the benefits of simplified markup. This may be because those markup languages were not particularly well thought-through.

The concept of separating content from presentation has a more venerable heritage, however: It’s embodied in the LaTeX typesetting system, which was derived from TeX, which in turn was developed by Donald Knuth in the 1970s. Ubergeek protestations notwithstanding, composing a document in LaTeX’s own syntax is far more painful than composing in a word processor, but the content/presentation concept is firmly embedded here, as is the benefit of being able to compose in any text editor.

LyX is an editor that takes a lot of the pain out of composing LaTeX documents. While I don’t consider it very useful for my purposes, its documentation does make some interesting points about the importance of content/presentation separation:

Part of the initial challenge of using LyX comes from the change in thinking that you, the user, must make. At one time, all we had for creating documents were typewriters, so we all learned certain tricks to get around their limitations. Underlining, which is little more than overstriking with the "_" character, became a way to emphasize text. You were forced to figure out column sizes and tab stops, and set them, before creating a table. The same applied for letters and other right justified text. Hyphenation at the end of a line required a careful eye and a lot of foresight.

In other words, we’ve all been trained to worry about the little details of which character goes where. Consequently, almost all word processors have this mentality. They still use tab stops for adding whitespace. You still need to worry about exactly where on the page something will appear. Emphasizing text means changing a font, similar to changing the typewriter wheel. This is the underlying philosophy of a WYSIWYG word processor: "What You See Is What You Get". Unfortunately, that paradigm often results in "What You See Is All You Get".

This is where LyX differs from an ordinary word processor. You don’t concern yourself with what character goes where. You tell LyX what you’re doing and LyX takes care of the rest, following a set of rules called a style. …

So, the basic idea behind LyX is: specify what you’re doing, not how to do it. Instead of "What You See Is What You Get," the LyX model is "What You See Is What You Mean" or "WYSIWYM." It’s a powerful idea that greatly simplifies the mechanics of writing documents. This is also why LyX isn’t so good for creating posters and flyers—in this case, you do want to specify exactly where everything goes, because there are no functional units like paragraphs, sections, etc.

(To be fair, most word processors do in fact have a similar "style" feature to help you concentrate more on the content and impose a uniform look on your documents, but almost nobody uses it, probably because the nature of the word processor discourages its use.)

In spite of the fact that LyX is an application and not a markup syntax, you could easily replace the word "LyX" in several parts of this passage with "lightweight markup" and gain a firmer understanding of the benefits of the approach I’m describing. This passage is also interesting for indicating the circumstances under which this approach is not appropriate (e.g., posters and flyers).

Finally, in a recent thread on David Heinemeier Hansson’s site titled "Why do people still use word processors?", Hansson responds to a challenge to his rhetorical question and makes us aware of the false dichotomy between "technical" and "nontechnical" users vis-a-vis the difficulty of using a lightweight markup like Textile:

Reader: "People" are generally not IT professionals. It’s hard enough for some to even understand the concept of a table. It’s very hard for quite a few to understand the simple concepts of HTML. LaTeX is harder still to learn and use. … Textile is a good start and leverages some users, but probably not my mother, anyone else in her generation, or a significant part of the entire private sector.

Hansson: I was puzzled that students attending HA.dat (computer science and business administration) would suffer to Word when there were plenty of alternatives that didn’t require a bachelor’s degree in CS. … Your mother is excused, they were not.

I.e., there’s no reason why lightweight markup shouldn’t be appealing at least to computer literate users, even if non-computer literate can’t reasonably be expected to live without the word processor’s conceptual link back to the pre-digital world of typewriters.

Written by mdorn

December 22nd, 2005 at 3:33 pm

Posted in technology

Tagged with ,

One Response to 'Kill Your Word Processor: Popularizing Lightweight Markup'

Subscribe to comments with RSS or TrackBack to 'Kill Your Word Processor: Popularizing Lightweight Markup'.

  1. Good review.

    I’ll add three points:

    1) Another huge advantage of LyX is integration with BibTeX for citations and bibliographies. This is something that does not exist for free, in a form useful for professionals and academics, in the Microsoft Word world. There, you have to pay good money for a system like Endnote that locks you in even worse than Word does — your documents simply won’t show any citations unless you have Endnote installed!

    LyX + a BibTeX reference manager (I like JabRef) = freedom from Word + Endnote.

    2) LaTeX produces higher quality printouts because it’s not limited by WYSIWYG. Screen fonts are an order of magnitude smaller in resolution, so if you want something to look identically on screen and on paper, you will have to adhere to the screen’s lowest common denominator. In fact, it’s so poor that many printing features, such as font kerning, aren’t even bothered with in the WYSIWYG world.

    I can spot a printed Word document from across the room. A LaTeX document is indistinguishable from a book — in fact, many books you own were probably typeset in TeX.

    3) To be fair, Word has incorporated some post-WYSIWYG thinking. WYSIWYG was terrific for the office letter environment — a secretary could quickly produce a letter and send it to the printers while being able to proof-read it on-screen. But, with longer things, like technical documentation, it presented problems. Re-formatting was a nightmare of meticulous work. So, Word introduced “styles” — which is really a WYSIWYM concept. You can set your “style” to heading without worrying to much about how it looks. You “mean” it to be a heading, and you can format it when you’re done writing.

    To an extent, Word does a better job than LyX in letting you change the formatting of your document. LyX lets you change things like margins, fonts and spacing, but the need to preview as PDF makes it slower to tweak.

    I hope this will get better in future versions of LyX. Live PDF previewing, perhaps?

    Tal

    19 May 10 at 7:28 pm

Leave a Reply