From Postscript to XML and Beyond …

By Neil Perlin (with thanks to Spiderman)

In this article, I’ll discuss how changes in the coding used to create online documentation and help (“online content” from now on) have changed how technical communicators work. From that perspective, I’ll then look at the changes we can expect to see as XML and XML-based authoring tools enter the market and how to react to those changes.

Let’s start by reviewing the past.

From Word Processors to Online Help

In the early ‘80s, the technical writer’s main tool changed from the typewriter to the word processor. Technical writers suddenly had to deal with issues like file formats and format conversion among different word processors. But the focus was still the writing; the code that controlled the documents was hidden.

In the late ‘80s, Microsoft Word’s Rich Text format (RTF) spread as Word gained market share. But the RTF code would still have stayed hidden except for one twist – Microsoft used RTF as the code for WinHelp, Windows’ first native help format. Help authors who were creating WinHelp couldn’t use authoring tools like RoboHelp; they didn’t yet exist. So help authors had to hand-code using tools like Windows Notepad.

Effects?  The process of technical communication had jumped in complexity. Technical communicators/help authors suddenly had to worry about code syntax as well as content. Hand-coding ability became an indicator of an author’s machismo (or machisma, in many cases). Much of today’s hand-coding mystique dates from this era.

Hand-coding RTF is slow and typo-prone, so help authoring tools, or HATs, appeared in the early ‘90s. Today’s best-known HAT, RoboHelp, dates from that time.

Effects?  The HATs physically inserted the codes so help authors could, theoretically, go back to focusing on content. But early HATs were crude enough that authors still needed coding skills to fix problems or add features that their HAT didn’t support. So coding ability was still important.

By the late ‘90s, HATs had improved enough that authors no longer needed to know what was going on at the code level. WYSIWYG interfaces hid most of the code detail. Then Microsoft released HTML Help…

HTML Arrives

HTML Help set the stage for today’s development environment. It also reprised some problems of the RTF era, including:

  • HTML was new and unknown to most help authors, just as RTF had been.
     
  • The HATs had to catch up to the new technology. This forced early HTML Help authors to learn HTML and to create and edit files in tools like Notepad. Coding ability became crucial again.

Effects?  The need for coding ability returned. So did the perception that coding was the indicator of an author’s machismo/machisma. This again diverted authors’ focus from the content.

But the move to HTML added new problems:

  • More technical complexity. Authors had to determine whether users had the right browser (if they had one at all) and the help mini-browser. This depended on the version of Windows that users had. In effect, help authors were getting involved with system configuration, normally IT’s job.

    The shift to HTML also meant that help began to resemble web sites. This meant that authors had to go beyond the usual Microsoft-centric, C: drive world to think about Linux users, Mac users, Netscape users, and others, who might be running help systems from servers instead of floppies or CDs. Get this information wrong and authors might pick the wrong help format or the wrong HAT.
     
  • Greater development pressure. Online help had become accepted, even expected, in many products. This meant authors suddenly had to deal with time-to-market and ROI (return on investment) pressures. The unhurried freedom to experiment and play that authors had enjoyed only a few years earlier was gone.

Effects?  Technical complexity skyrocketed. The new HTML-based output formats were often confusing. (Was “Web Help” the same as “WebHelp?”) Standards and consistency were becoming important. Plus, as technical communication became still more technical, the nature of the writing began to change away from creativity. Authors who still viewed themselves primarily as writers began to dislike the increasing emphasis on technology. Some began to leave technical communication.

XML Arrives

HTML’s limits were clear by the mid-90s, and the W3C (WorldWide Web Consortium) released a replacement, XML (Extensible Markup Language), in the late 90s. XML has been used for eCommerce since the late ‘90s but has only hit technical communication in the last few years. In other words, we’re now moving into the next new era.

XML-based help is reprising the problems we experienced with HTML and RTF but with some new twists:

  • XML is more confusing than HTML because XML is not a language. Instead, it’s a “meta-language” - a set of rules for creating languages. So authors may have to learn different XML languages depending on the markets they support. And, as with HTML, XML is new and unknown to many technical communicators.
     
  • The HATs are trying to catch up with the technology, so early authors may again have to work at the code level. This situation won’t last long; WYSIWYG tools are already appearing. But the early tools won’t offer all the features that authors need, or have become accustomed to from their HTML work.
     
  • More technical complexity. The different types of files involved in XML require a level of rigor that will be very new to authors accustomed to HTML. What’s the difference between well-formed and valid and how is that expressed in a DTD or XSD, for example?  How does a CSS render an XML file?  What’s a transform and how does it relate to the XML files?  And so on.

    Just to confuse things, the technology is changing rapidly. For example, schemas are replacing DTDs for syntax control. It’s important to understand the technical aspect of these changes, but also their effects on your strategy and documentation processes.
     
  • Time-to-market and ROI pressures continue to rise, but with a new angle. Online content is moving into low-tech - hospitals, insurance companies, banks, etc. – as such companies start to abandon hard-copy. Authors in such companies are making the same mistakes that authors in high-tech did years ago. The difference is that low-tech authors don’t have the latitude that high-tech authors once did. The market pressures are too great. Yet the low-tech authors often don’t know where to turn for help.

And XML adds some new problems:

  • The need for more rigorous writing methodologies, like structured writing. This will be a difficult change for companies whose culture doesn’t support rigorous writing standards, but it’s necessary to take full advantage of XML’s single sourcing power.
     
  • More complex single sourcing. HATs have claimed single sourcing capability for years, but it’s largely been limited to hard-copy and online help. “True” single sourcing might require supporting multiple mobile device formats, for example, or hard-copy plus multiple online outputs targeted to multiple audiences. Few authors need this level of flexibility yet, but market changes will eventually make it necessary. Such single sourcing will call for tools that offer variables, reusable chunks of content, conditionality, selectable CSS chunks, and so on. The price will be a need for rigorous planning and enforcement of standards new to many authors.

Effects?  Further increases in complexity, a further need for rigor and standardization, and shorter turnaround times. For non-techie authors who survived the shift to HTML, XML may be the last straw.

The Mistakes and the Response

This history has seen several consistent mistakes, the primary ones being: 

  • Continued hand-coding. In the past, authors hand-coded because they had to due to HAT limitations. Today, most HATs are good enough that authors now hand-code because they want to. Fewer and fewer do, but it’s still common.
     
  • No authoring standards. In the past, projects and development teams were small enough, lead time long enough, technical requirements simple enough, and user expectations low enough that inconsistent authoring wasn’t a major problem. That’s changing today.
     
  • Not knowing how to use your HAT. Many HAT users have never been formally trained in the mechanical, design, or management aspects of their HAT. They were just told to “go learn it”. A surprising number of users have down just that. They’re working inefficiently but they are working. But many untrained HAT users are just struggling, producing hard-to-manage output, and just feeling out of control.

In my opinion, here’s what needs to happen to fix these mistakes and get the power and flexibility to make full, effective use of the tools.

  • Define a formal policy to ban hand-coding in all but exceptional cases. There will be cases where a HAT can’t perform some task, but those cases will become rarer as HATs get better. Today’s authors should understand the codes and concepts in order to understand what their HAT is doing, but the authors should rarely if ever find themselves actually typing codes.
     
  • Define, promulgate, and enforce standards in technical areas like style sheets and templates. The result will be a more disciplined development culture that creates more consistently coded, structured, and formatted content. Another benefit will be greater development speed as the style sheets and templates automate decisions that authors once had to mull over.
     
  • Define, promulgate, and enforce standards for development best practices across the documentation group or even across the company. The result will be a more effective development culture that efficiently captures and shares the experience, skill, and contributions of all the developers.
     
  • Budget to train authors on how to use their HAT and any supporting technologies. Not doing so – just buying a HAT and telling authors to “go learn it” – may save money in the short-run but is tremendously wasteful beyond that.

The beauty of XML is that it offers tremendous power and flexibility, but only if we use the tools properly in the first place. A character in one of the Spiderman movies put it nicely“With great power comes great responsibility.”

About the Author

Neil has 27 years experience in technical communication, with 21 in training, consulting, and development for various types of online formats and tools such as WinHelp, HTML Help, CE Help, JavaHelp, RoboHelp, and some now forgotten. Neil is a columnist and frequent speaker for the STC and other professional groups, a senior member of the STC’s Boston chapter, the creator and manager of the Beyond the Bleeding Edge stem at the STC’s annual conference, and an Associate Fellow of the STC.

                                              

Neil is a Madcap Certified Instructor for Flare, and a Macromedia-Certified Instructor for RoboHelp and Captivate. He provides training, consulting, and development for online help and documentation, Flare, RoboHelp, Captivate, XML, and single-sourcing through Hyper/Word Services of Tewksbury, MA. He can be reached at nperlin@concentric.net, www.hyperword.com.


 

  

 




54th Annual Conference


Make your plans for
Minneapolis! 
May 13
16, 2007 
Preconference Workshops
May 12
13

STC Conference page 
Minneapolis Guides 

 

Registration Deadlines

  Speakers:
  
All others: (Advanced)

April n

April nn

(On-site registration
continues after April nn)

 

 


ET SIG Home | Newsletter Home | About The Edge | Contacts | Membership | Resources
Copyright Society for Technical Communication
For copyright statement and other administrative matters, see About The Edge.

Contact the newsletter editor