Selections for Web Publishing
By Don Dugdale
[Chart]:
Creating Web Documents
[Sidebar]:
Where to Learn More
As the market booms for tools to create HTML pages, it can be difficult
to choose one that suits your taste. Plenty of options are available, and
they're changing with every passing moment.
Ironically, the devices that make it easy to compose HTML
pages also limit the power of the language.
The act of creating and publishing pages on the World Wide Web is fast outgrowing
the domain of the programmer and is arriving full-blown on the screens of
millions of computer users around the world. Formerly restricted to those
who had learned the somewhat arcane Hypertext Markup Language (HTML) that
defines Web page formats, page creation now can be handled with more intuitive
graphical tools. Some of the newest tools offer drag-and-drop functions
and integration with the Web browsers used to access those pages. Many of
them make the insertion of HTML tags transparent, so the author never has
to deal with HTML syntax.
"The creation of Web sites has been left to a techie person,"
says Mary Beth Smartt, product manager for Interleaf's Cyberleaf document
conversion tool in Waltham, MA. "Now other people are saying this stuff
is not that hard to do if you have the right tools." In particular,
the integration of HTML editor and Web browser is in demand, for reasons
of convenience and transparency of use.
The automated tools aren't for everyone. Larger organizations that want
an extensive Web presence still require various tools suited to software
engineers who understand the mechanics of HTML and Web server functions.
This is especially the case when continuous updating of content is required.
"There is no magic tool that does everything I need it to do,"
says Mike Knowles, senior technical staff member for corporate information
development at IT vendor Amdahl Corp. in Sunnyvale, CA. "We use a combination
of Cyberleaf, QuarkXpress extensions, FrameMaker 5, a couple of freeware
HTML editors and basic text editors. None alone is adequate to give us Web-ready
output."
Smaller organizations may be the best customers for the sophisticated what-you-see-is-what-you-get
(WYSIWYG) tools, because they are less likely to have trained personnel
to devote to their Web site, according to Lydia Dobbins, vice president
and general manager of Navisoft, a Vienna, VA, company that makes Web server
and publishing software. The larger the company, the more likely it is to
hire a firm to do the work or to have programmers who will put together
their own toolset to create pages and administer the site. Companies' motivations
also have evolved, Dobbins says.
"The early adopters were looking to develop commercial activities,
thinking everybody's going to get rich [by selling on the Web]. That's gone
past its first bloom, because there are a host of issues around commerce
that are going to take time to work out. The more likely early drivers now
are those who want a corporate presence on the Web as an information center
for customers and prospects."
For whatever reasons, users can feel like kids in a candy store when they
get into Web publishing and its associated activities. Rather than sampling
every one, however, it is better to try to determine which choices are good
for you. The following is a survey of some available brands. Keep in mind,
though, that this market is in rapid flux.
Ways to Create Documents
HTML documents are created using three basic methods: a simple text editor
to write the words and HTML tags manually; HTML converters and filters to
convert existing documents to HTML format; and a program or script to create
the HTML file.
Many Web page authors get started simply by copying someone else's formatted
HTML file into their own text editor, changing the words to suit their own
purpose and sending that file to be loaded onto the Web server. It's a quick,
clean way to get a page on the Web but not very flexible. If you have taken
the time and trouble to learn HTML tags, however, a Unix-based text editor
such as vi, elm, or emacs will allow you to create the same product as do
the more advanced WYSIWYG authoring tools.
The obvious disadvantage is having to type in all the tags by hand. It's
easy to get a tag wrong or misspell a word, because most text editors lack
automatic spelling or tag checkers (although separate HTML syntax checkers
are available). Also, there's no mechanism for enforcing HTML integrity
and structure, so getting it right may involve a lot of trial and error.
A text editor works for creating a few pages, but almost everyone who does
more than that acquires tools that make the job easier and faster.
Some Web authors power up their favorite text editor into a customized HTML
text editor. For example, an editor configuration file, .exrc (available
free at http://www.cosy.sbg.ac.at/www-doku/html/dot.exrc.bin),
turns vi into an HTML editor, employing abbreviations and macros that generate
the markup tags.
The task of converting existing documents into HTML format can be simplified
by using a converter application. These utilities take a file formatted
in some word processing or publishing program and automatically insert the
appropriate HTML tags. They also provide access to the convenience of the
formatting and spell-checking features of the word processor.
One of the first converters came out all of a year ago. Cyberleaf converts
documents in FrameMaker, Microsoft Word, and WordPerfect formats, as well
as Interleaf documents, to HTML format and CompuServe's Graphics Interchange
Format (GIF). The user doesn't have to know HTML and gets to use a graphical
user interface (GUI).
Some companies that have a lot of documents to convert have opted for Cyberleaf.
"Once you set it up and match styles to HTML tags, it's pretty much
a self-eating watermelon," says Amdahl's Knowles. "Cyberleaf is
good for us because we have hundreds of thousands of pages in Interleaf."
Amdahl not only maintains product information for the public on its external
Web server but, like many other companies, keeps extensive documentation
on internal Web sites for its own employees. One of its projects is to make
its technical manuals available on the internal Web.
Another converter is WebWorks Publisher from Quadralay of Austin, TX, which
works only with FrameMaker (which is now owned by Adobe Systems of Mountain
View, CA). In fact, version 5 of FrameMaker itself, released last summer,
contains a trimmed-down version of WebWorks Publisher called WebWorks HTML
Lite. The lite version does not contain features such as handling of book
files and graphics conversion, which are included in the full-blown product.
Like Cyberleaf, WebWorks translates formatted files into HTML and GIF, handling
text, graphics and hyperlinks through a GUI. It also supports input file
formats such as WordPerfect, Microsoft Word, and Microsoft's Rich Text Format
(RTF).
The European Laboratory for Particle Physics (known by its French acronym,
CERN), birthplace of the Web, distributes a shareware converter of FrameMaker
documents called WebMaker. It requires installation of four other public-domain
programs (gawk, ghostscript, pbmplus and xlisp), which are included with
the distribution. WebMaker runs on most Unix platforms, as do Cyberleaf
and WebWorks Publisher, which also support Windows. Quadralay promises a
Macintosh version of WebWorks.
Another public-domain tool is rtftohtml, which, as the name implies, converts
RTF documents to HTML. RTF is a format for interchanging documents between
Microsoft Word and other word processing packages and is supported by FrameMaker,
Interleaf and other packages on PC, Macintosh, Next and Unix platforms.
An extension called rtftoweb adds extra formatting features.
Programmed Updates
The third way to create an HTML file for the Web is to let an application
or a script do the work--once you have found the application and learned
to use it. These applications can be used in two ways to create dynamic
content for Web pages. The first is by programming the Web server to do
automatic, periodic updates of Web pages. For example, if you keep product
information in a database and want your Web pages always to reflect the
latest database changes, you could write a script that would go into the
database, pull out certain data and regenerate the Web pages periodically
(daily, weekly or quarterly). The script retrieves exactly the information
it needs, then uses it to create fresh Web pages. This is the method UniForum
uses to create periodic updates of the online version of its Open Systems
Products Directory.
Perhaps the most popular scripting language is Perl, a public-domain language
available from a number of FTP sites. Designed for database management and
especially good for handling text files, Perl scripts search for strings
in the database, then perform some function on the strings. Most public-domain
programs for generating HTML documents are written in Perl.
The second way to make Web pages dynamic is to use Common Gateway Interface
(CGI) programs to generate pages on the fly as requests come into the server
from browsers. CGI programs are often used to respond to information entered
on an HTML form in a browser. More dramatically, they can take a browser
request for some Web page, enter a relational database through a gateway,
pull out up-to-the-second information and supply it to the browser on the
fly for every hit that comes into the server. In that way, updates must
be posted only to the database, and the Web page stays as current as the
database without actually changing the HTML page. Each of the major RDBMS
vendors maintains a set of toolkits to handle such requests, so programmers
don't have to write the source code from scratch. These libraries of routines
usually are available via FTP.
One company using this method is the Internet Shopping Network (ISN) of
Palo Alto, CA, which sells everything from disk drives to daisies via the
Web, though most of its business is computer-related. Branching out from
the ISN home page (http://shop.internet.net),
shoppers can retrieve product information and prices, then place an order.
In most cases, changeable data such as pricing comes directly from a database
as the browser contacts the server.
To construct their Web pages, ISN engineers use mostly text editors such
as BBEdit, along with Perl scripts and C scripts to integrate database information,
Wide Area Information Server (WAIS) indexing and other resources. "If
you know HTML, it's just as fast to type in the tags as it is to do it in
some text editors," says Robert Boucher, until recently a senior software
engineer with ISN.
Ironically, the principal disadvantage of most Web publishing tools is that
the devices that make it easy to compose HTML pages also limit the power
of the language by restricting its flexibility. For example, ISN likes to
use several columns of varying widths on its Web pages--something not allowed
by automated tools. In addition, an authoring tool that came out only months
ago probably lacks some advance in HTML or the state of the art that an
engineer would want to employ right now.
Authoring and Publishing Tools
Three basic types of Web authoring and publishing tools are available: shareware
in the public domain, which is available free or for a nominal charge; professional
versions marketed by software companies, ranging in price from $50 to over
$1,000; and word processor add-ons, made available by the software vendors.
Each has its pros and cons.
Numerous shareware programs are available now. (For a rundown of
them, see Mag's Big List of HTML Editors, http://sdg.ncsa.uiuc.edu/~mag/work/HTMLEditors.)
Some of the most popular include:
Arachnid HTML. This editor for Windows is in version 0.5a and includes
advanced features for use by Netscape browsers.
BBEdit HTML Tools and BBEdit HTML Extensions. These are free
extensions to BBEdit, a text editor that already contains support for HTML
editing.
HotMetal and HotMetal Pro from Softquad Corp. of Toronto are
WYSIWYG editors for Unix, Windows and Macintosh. HotMetal is the unsupported
freeware version, now in version 1.0+. It features document templates, forms
support, rules checking and cleanup of existing documents that have bad
markup. The commercial version, HotMetal Pro ($195), supports most Netscape
and HTML 3.0 extensions.
HTML Assistant. This editor for Windows also comes in free and commercial
versions.
HTML Writer. Web authors like this editor because it allows editing
of any size file, unlike many Windows HTML editors. It's available for a
donation.
Web Wizard: The Duke of URL. This home page builder for Windows 3.1
and Windows 95 appeals to authors who want a user-friendly interface. Those
who want advanced HTML features should look for another tool. It's available
for $10. Professional Web publishing tools include, in addition to
those already mentioned:
PageMill. From Adobe Systems, this authoring software ($99) enables
WYSIWYG Web page writing and editing in a word processor-like environment,
though the author can apply only styles available in HTML. An integrated
preview browser, image handling and forms creation are included.
Home Page Wizard and Publishing Wizard. Since last November,
CompuServe makes these tools available to its subscribers as part of their
membership. Home Page Wizard offers drag-and-drop editing, templates, and
image insertion to help design personal home pages. Publishing Wizard sends
the completed page to CompuServe's Web server.
NaviPress and NaviServer. NaviSoft, a subsidiary of America
Online, released its first two Web publishing and maintenance products in
mid-1995. NaviPress ($99) is a WYSIWYG authoring and browsing tool for Windows,
Macintosh and Motif. It allows the author to publish on a server maintained
by NaviSoft and test and review the page as content is developed. Version
1.1 was expected to be released early this year. NaviServer is a Web site
management system that sells for $5,000 on Unix platforms and $1,495 on
Windows NT.
Navigator Gold. This product from Netscape of Mountain View, CA (in
beta test near the end of 1995), combines version 2.0 of Netscape Navigator,
the most popular browser on the Web, with a WYSIWYG editor for creating
HTML pages. New features include the display of multiple, independently
scrollable frames on a single browser page; seamless viewing of multimedia
content with plug-ins such as Adobe Acrobat; support of Java applets, allowing
secure, two-way, real-time interaction over the net; and Netscape Livescript,
a programmable application programming interface that allows the page designer
to program events such as sounds and image changes in response to mouse
clicks and other actions in the browser.
WebAuthor 2.0. Released by Quarterdeck of Marina Del Rey, CA, in
November ($49.95), it supports HTML 3.0 and Netscape formatting extensions,
so Web page developers can create pages that use columns, tables and flow
text. It's both an authoring tool and a converter, keeping word processing
files intact as they are converted to HTML documents. It supports backgrounds,
centering, image maps, direct keyboard access and direct formatting from
the MS Word toolbar. With the addition of WebImage from Group 42, Inc.,
a multimedia company based in Milford, OH, the user can convert images to
and from popular Web formats, including GIF, JPEG and PNG.
FrontPage. This first product ($695) from Vermeer Technologies of
Cambridge, MA, was introduced in October. It runs on Windows and Macintosh
systems for creating and maintaining interactive Web sites. Software extensions
are available that enable FrontPage-developed sites to reside on Web servers
running Unix, Windows NT, Windows 95 and Windows 3.1. FrontPage's technical
innovations include Web Wizards, which are smart templates that guide authors
and designers through the development process; and WebBots, drop-in objects
that add interactive functions to a Web site without programming. The application
comes with 12 WebBots for adding such functions as forms, discussion groups,
surveys and text searches. FrontPage also includes Web site management functions
and support for geographically dispersed, multiple-user Web development
teams.
In the category of add-ons to word processors, the best-known examples
are Microsoft Internet Assistant and Novell's WordPerfect Internet Publisher.
Internet Assistant is companion software for Word 6.0 for Windows. Internet
Publisher is a free add-on to WordPerfect 6.1 and includes Netscape Navigator.
Both are reported to create functional HTML pages without many frills.
In considering the world of Web publishing, it's useful to remember that
the only thing that can be counted on is constant change. New tools and
updates of old ones are ever in the works--so much so that to be on the
leading edge, a Web publisher has to make consistent use of products in
their beta-test state. Taking full advantage of the latest offerings in
the world of Web technology requires technical acumen and knowledge of the
market. But a place in that world is becoming available for ordinary computer
users whose purposes go beyond the technology for its own sake.
Don Dugdale is a technology writer based in San Jose, CA.
He can be reached at dugdale@netgate.net.