Using the Web Instead of a Window System

James Rice, Adam Farquhar, Philippe Piernot, and Thomas Gruber
Knowledge Systems Laboratory
Gates Building 2A, M/C 9020
Stanford, CA, 94305
+1 415 723 3444
{rice, axf, piernot, gruber}@ksl.stanford.edu

Permission to copy without fee all or part of this material is granted provided that copies are not made or distributed for direct commercial advantage, the ACM copyright notice and the title of the publication and its date appear, and notice is given that copying is by permission of ACM. To copy otherwise, or to republish, requires a fee and/or permission.
CHI '96 Vancouver, B.C. Canada
(c) 1996 ACM
Images used herein copyright and other HTML pages (c) 1996 Stanford University
The most up-to-date verion of this paper can be found here. A linearized version more suitable for printing can be found here.

Abstract

Keywords

1. Introduction

1.1. The Application

1.2. Providing Access to Applications Over the Web

1.3. Other Web-Based Applications at the KSL

2. Problem Analysis and Design

2.1. Objectives

2.1.1. Support for Naïve Users

2.1.2. Support for Power Users

2.1.3. User Support

2.1.4. Trustworthiness

2.1.5. Overcome Previous Difficulties

2.2. Constraints Imposed by HTML and HTTP

2.2.1. Multiple Browsers

2.2.2. Time-Travel in the Interface

2.3. Choices and Design Principles

2.3.1. Minimize Intermediate Pages and Number of Clicks

2.3.2. Edit-In-Place Metaphor

2.3.3. Pages Represent Objects

2.3.4. Five Types of Interface Widgets

2.3.5. Consistency Across Browsers

2.3.6. Native Window System's Submit Buttons

3. Evolution of Design

3.1. Early Design and Mockup

3.2. Early Implementation

3.3. Problems of Scale

3.4. Breakdown of the Edit-in-place Model

3.5. Observations on User Interaction

3.5.1. Reluctance to Click

3.5.2. Logging Out

3.5.3. The Usage of User Preferences

4. The Future

4.1. Java and Client-side Scripting

4.2. HTML: Problems and Solutions

5. Conclusions

Acknowledgments

References

Abstract

We show how to deliver a sophisticated, yet intuitive, interactive application over the web using off-the-shelf web browsers as the interaction medium. This attracts a large user community, improves the rate of user acceptance, and avoids many of the pitfalls of software distribution.

Web delivery imposes a novel set of constraints on user interface design. We outline the tradeoffs in this design space, motivate the choices necessary to deliver an application, and detail the lessons learned in the process.

These issues are crucial because the growing popularity of the web guarantees that software delivery over the web will become ever more wide-spread.

This application is publicly available at: http://www-ksl-svc.stanford.edu:5915/

Keywords

Internet application, remote user interface, active document, CSCW, World Wide Web, Hypertext, HTML, HTTP, Java.

1. Introduction

The recent explosion in the popularity of the world-wide web and its associated hypertext markup language (HTML) and hypertext transfer protocol ( HTTP) presents an exciting new opportunity to provide widely distributed access to sophisticated software applications [16]. The design of HTTP and HTML, however, place a number of novel and often severe constraints on the design of interfaces that use them. This paper outlines our approach to providing a rich user interface to a sophisticated application over the web, specifies our design objectives, the constraints imposed by HTML and HTTP, and the trade-offs that we made in order to meet these objectives whilst satisfying these constraints. We further document the evolution of the design and implementation of our application, and some of the lessons learned. In addition, we also cover a number of topics which are not mentioned in the CHI proceedings version of this paper. These are the use of user preferences, the impact of Java, and ways in which HTML could easily be changed to support this sort of application better.

1.1. The Application

Our application [1] is a system for browsing, creating, and editing Ontologies. An ontology is a formal specification that defines the representational vocabulary in some domain of discourse [3]. While our ontologies are used for knowledge sharing and agent interoperability [2], they share many properties in common with class libraries, object-oriented specifications, object-oriented simulation components, database schema definitions, and knowledge bases. We expect that many of our design decisions will apply to systems for browsing and editing these types of objects.

For example, an ontology about physical measurement [5] defines logical relations and functions that specify concepts such as unit of measure (e.g., meters), physical dimension (e.g., length), and algebraic operators for unit conversion and dimensional analysis. Such ontologies are used to enable the sharing and exchange of data and models among distributed and heterogeneous applications.

Ontology construction is, by nature, a distributed, collaborative activity [7]. Since an ontology is a specification of a domain that is common to several applications, defining an ontology is analogous to defining a technical standard where effective communication and collaboration is required. Not only must there be some medium for forming concensus on the meaning of terms, but the expertise needed to define and review ontologies is distributed. For example, an ontology of therapeutic drugs is the product of a consensus among practitioners in the field and other potential stakeholders - all of whom are distributed across geographical and organizational boundaries (e.g., hospitals, insurers, government agencies). These properties of the user population motivate the development of distributed collaboration and the ontology editor tool.

Our application is unusual among web services because it allows users to create and edit objects, rather than simply retrieve them by following hypertext links or by making simple database queries.

The implemented application provides a full, distributed, collaborative editing environment with over a hundred user commands, context sensitive help [14], context sensitive user feedback and bug-report collection, multi-level undo/redo, multi-user sessions. It has been publicly available since February 1995 and (as of 1/96) supports about 900 users, about 150 of whom we would classify as "serious," who average over 2000 requests per day. Feedback from users and data collected online support the claim that it is possible to deploy richly interactive applications over the web. Based on our experience in developing the ontology editor, we offer an approach to developing and delivering web-based applications that instantiates this claim.

1.2. Providing Access to Applications Over the Web

We came to the web approach out of frustration with conventional approaches to software development and distribution. The costs of ensuring that software was portable over multiple hardware and software platforms together with the cost of distributing software releases, patches, and upgrades were simply too high for a small research institution. Indeed, these costs often dominated our efforts to develop and test new functionality. Providing a system to support distributed collaborative work would only make the situation worse.

The web appeared to be a reasonable alternative. We had previously developed experience in providing automatically generated HTML documents to describe ontologies and other structured objects [9]. People found that using their native hypertext browsing systems to examine these documents was both valuable and appealing.

The ontology editor is a good candidate for this approach because (1) it requires distributed access from heterogeneous environments; and (2) think-time dominates computation time. This makes it possible to provide a useful shared computational resource for a community of users from a centralized server without the hardware costs being too onerous.

1.3. Other Web-Based Applications at the KSL

The ontology editing environment described in this paper is one out of a number of applications that we have built using the web. To put this work into context, we list some of these applications here.

The compilation of Ontolingua ontologies into static hyperwebs. Before we developed our ontology editor application which we show in this paper, we had a compiler that compiled the ontology data structures into static hyperwebs. A library of ontologies produced by this method can be found here. In this example, we show the appearance of the three types of pages produced by this compilation process.
An inspector for arbitrary data structures. This was the first application that we developed that dynamically generated specific HTML in response to a user's requests. This is in contrast to the static compilation of ontologies, which was done offline. Unlike the case of the static compilation of the ontology library, this virtual document is unbounded in size and is not exhaustively generated. Indeed, the HTML generated for any given data structure will change over time as the content of that data structure itself changes. This is an example of the use of this inspector to look at an ontology. The real inspector itself is accessible as one of the KSL Network Services.
The Device Modelling Environment (DME) [4]. This is an example of a semi-static application. DME simulates complex physical device models and is able to explain the behavior of the device. For this application, we ran the DME simulator on a number of devices, the simulation results were cached. The DME system was then run in with these simulation results to give a virtual document able to answer a number of interesting questions about the behavior of the device that was simulated. A discussion of how this application was put on the web and of the design can be found in [6].
Before the advent of web technologies, the DME system had a window-based interface that allowed the user to control the course of the simulation, and a hypertext browser that allowed the user to get explanations of the device's behavior. This window-based user interface used a proprietory window substrate. (Here are some screen snapshots of the DME user interface when running a model of the Space Shuttle's Reaction Control System). Example DME simulation scenarios can be accessed here.

2. Problem Analysis and Design

In this section we begin to describe how we addressed the problem of delivering our network-based application, and how we designed it.

2.1. Objectives

Our goal was to create a general environment to facilitate the development and sharing of ontologies. Such an environment must assist the user in the basic development tasks of browsing, creating, maintaining, sharing, and using ontologies. We also realized that many of our users want to develop ontologies through a consensus process; therefore, we needed to provide tools to help people collaborate during development.

From the outset, we wanted to make our potential user community as large as possible. The target community is diverse and includes academics, industrial researchers, and government employees. Once we had established our web-based system, our user community grew from a handful to hundreds in the space of a few months.

Our burgeoning user community was an additional incentive to use the web. Furthermore, distributed access eliminates the need for our users to have high-end hardware systems or expensive licenses for the proprietary software systems that we used to develop our application more efficiently. A centralized server model also means that we can make changes and upgrades to the server at a single site (or small number of controlled sites); the new, improved software is instantly accessible to all users. In short, we decided that it would be cheaper to provide free computational resources to our entire user community than to develop shrink-wrap software and distribute it to them.

Our challenge was to design a user interface that operates under the constraints of the web, remains intuitive and natural to users, and provides full access to the application.

2.1.1. Support for Naïve Users

In order to achieve our goal of broad impact, we worked to make our application intuitive and usable to anyone familiar with web browsers (e.g., Netscape Navigator or Mosaic) so that a new user could pick up and productively use our application without any formal training or preparation. We explicitly addressed this by the following means:

HTML used for documentation: Our application has no manual at all per se. All documentation is provided on-line using the same browser that the user already employs to operate the application.

Guided tour: One important aspect of the documentation is a guided tour [11, 15], which takes the new user through a credible editing scenario, motivating the different features of the system.

In-context, automatically-generated help: Unlike applications such as Microsoft Word 6.0, in which documentation is available only in an idiosyncratic hypertext browser, our application uses the same hypertext browser uniformly for all of its interface. This means that help and documentation are seamlessly integrated with the rest of the system. Indeed, the help facility builds help pages in real-time so as to give the most focused and helpful response in the user's context. A novel feature of the help facility is that the commands available to the user are echoed as widgets on the help page (figure 1 right). If the user clicks on the widget in the help page which has accompanying explanatory text and links to worked examples, then the command associated with that widget is activated just as if it were invoked on the page from which the user made the help request.

Familiar interface: There was no existing interface metaphor for ontology editing. We selected a document metaphor (as opposed to a two dimensional representation as a directed graph, for example) so that the application could be smoothly integrated into the HTML world. This allows users who are familiar with hypertext documents and browsers to make an easy transition into our interactive application.

2.1.2. Support for Power Users

Although we were keen to expand our user community as much as possible, we were aware that experienced ontology developers would not want to be held back by an interface that was optimized for naïve users. It is very important to us to support power users. The vast majority of meaningful ontologies are developed by such users, even if these ontologies are then used by the less experienced as building blocks. The issue of power user support was eventually resolved by providing a powerful user preference facility. This allows the experienced user to tailor the look and feel as well as the set of commands exposed at any given time.

2.1.3. User Support

In order to be responsive to our users, we wanted to provide an obvious mechanism that they could use to make suggestions and report bugs. We resolved this by providing feedback buttons on every page for comments, questions and bug reports. The application captures the user's context automatically so as to make the user's bug report as useful to the developers as possible.

2.1.4. Trustworthiness

Because we were intending to provide a multi-user shared resource, it was essential for the user community to find our application both reliable and trustworthy. Otherwise, they would not use it to develop confidential or proprietary ontologies. To this end, our application supports a user and group model similar to that provided by operating systems and password protection to ensure privacy and security. Ontologies are protected by maintaining multiple versions to which the user can revert if necessary. At a finer granularity, a multi-level Undo/Redo feature allows the user to repair any mistakes. A sophisticated compare facility, which compares semantic units rather than text, allows the user to see exactly what has changed; an analysis facility tells the user what ought to be changed.

2.1.5. Overcome Previous Difficulties

Our previous software delivery attempts had been plagued by long release cycles, installation difficulties, and problems with patch distribution. Users had to acquire expensive proprietary software and high-end hardware to run the system. This was very unpopular, and substantially reduced the impact of our work by excluding PC and Mac users from the user community. We needed an approach that would overcome all of these problems.

There are a number of possible ways to deliver applications to a user community. In the past, we had always built our software to be as portable as possible, testing it on a number of platforms and with a number of different compilers from different vendors. This approach was necessary because our user community always wanted us to ship source code to them so that they could modify and experiment with our software. The cost of testing and ensuring cross-platform portability proved to be very high. Users would often employ versions of operating systems, platforms or compilers to which we had no access. One way to address this problem would have been to offer only shrink-wrap software releases to our users. This was not a tenable option because of the diversity of platforms in our user community. We simply could not build a "PC-only", or "SUN/OS-only" application. Our new approach of keeping control of the software, and shipping a network service seemed the best way for us to overcome the difficulties of shipping software, while being able to impact a wide user community.

Figure 1. The final design (left) used a row of icons (top) for important or frequently used operations, and pop-up menus preceeded by submit buttons (e.g., "File" top left) as labels. Selecting the `?' help command (top center left) takes the user to a page explaining all applicable commands, and with a labeled explanation of the page (right). The command widgets on the help page are real. Selecting one of these widgets will execute the command in the context of the page that the user was on. Tears in the figure elide material extraneous to this discussion. Note: this figure is very wide and might require horizontal scrolling to see the right-hand part of the figure. A slide show examplifying the Help feature can be found here.

2.2. Constraints Imposed by HTML and HTTP

There are only two ways for the browser to transmit information to the server (i.e., the application). Selecting an anchor transmits a request to follow a hypertext link. Pressing a submit button transmits the widget state. Until a submit button is pushed, it is not possible for the server to determine anything about intermediate activities that a user might perform such as typing text into an input field, toggling radio buttons or check boxes, selecting items from menus, moving the mouse, and so on.

The only way for the server to transmit information to a browser is in response to a submit or select action initiated by the user. Moreover, there is a fundamental difference between the type of interaction supported by HTML and the forms of interaction with which we are all accustomed in graphical user interfaces. In GUIs, operations typically take the form of the user selecting an operand or operands through direct manipulation and then applying an operator by means of a menu selection or keyboard accelerator. In HTML-based user interfaces, there is no notion of selecting objects pre se. There are, in effect, two disjoint forms of interaction:

The page that the user is on is viewed as an implicit operand, and therefore the user can select a command to apply to that operand. The user can change the current operand by clicking on an object and going to the page for that object, where the object the user selected will become the current implicit operand for further commands.
The semantics of a simple link can be locally changed so that a particular command is applied to the operand named in the link. This has the consequence of potentially confusing the user because web users are very strongly conditioned to expect that a normal link will take the user to the object selected.

In effect, an HTML interface can allow the user to apply a number of different commands to a single object, or a single command to a one of a number of different objects. Commands that take multiple operands are much harder to implement.

Consequently, it is currently impossible to implement many features of sophisticated user interfaces over the web. In particular, tightly coupled interfaces that provide immediate feedback to the user are not possible. For instance, given an initial menu selection, it is not possible for the application to gray out options that are incompatible with it. Furthermore, there can be no direct manipulation of objects such as one might find in a graphical class browser. Finally, it is not possible for the application to preempt the browser's activity or provide any asynchronous communication. For example, it is not possible to notify the user asynchronously about the results of a background task or remind the user to save work.

Delivering notifications of collaborators' work on shared data is another problem. Since several people simultaneously edit the same ontology, we need to provide some way for users to be made aware of the changes made by others. Today's HTTP makes this difficult because it does not allow a server to send unsolicited notifications to the client. We work around this limitation by presenting pending notifications at the top of the page in a dialog. The notification includes a textual description of the changes being made by collaborators on shared data, and a link is provided that will take the user to the modified object. Within a shared session, users share the same undo history so that a user can undo damage performed by a less enlightened user. A command is also provided to make announcements to users so that they can be forewarned of a change to come and be given a suitable justification. An improvement to the HTTP standard to accomodate asynchronous notifications would be preferred (some are being proposed).

Bandwidth is a key constraint. Each user action (submit or select) causes an entire new page to be transmitted back to the browser over the network. There is no method for the application to cause an incremental update of a portion of the display. Even on high-bandwidth local area networks, transmitting and rendering large pages is time consuming; for distant browsers it becomes the dominant cost.

The application has very little control over appearance. HTML explicitly cedes rendering decisions to browsers. This has many advantages for browsing hypertext documents, but it proves awkward for interface design. There is little control over the location of displayed objects on the finally rendered page. In particular, if a page is too large to fit on a single screen, there is no way of controlling how and to where the browser will scroll the page. This registration problem can be very difficult to work around and frustrating for users (see section 3.4).

The widget set available through HTML's forms capability is extremely restricted. It includes only pop-up menus, single/multiple selection scrolling lists, submit buttons, checkboxes, radio buttons, mapped images, and text type-in widgets (see figures 1 and 2). There is no way to combine either submit button or anchor behavior with a pop-up menu to provide the sort of command selection model that is present in so many user interfaces (e.g., the Macintosh menu bar). There is no way to associate a "right button" menu with items, nor is there a way to provide constraints on selection elements (e.g., toggling "show text files" causes a scrolling list of files to filter out non-text files). There is also no way to include iconic elements in menus or scrollable lists. (See here for a comparison of the widget set under different platforms)

Figure 2. In early designs, radio buttons were used to select an argument (left). Selecting an operation from a pop-up menu and clicking on a submit button (top left) inserted suitable edit widgets into the flow of the page at the selected location (right). A slide show of the actual mockup HTML pages can be found here.

2.2.1. Multiple Browsers

A growing number of web browsers from a variety of institutions and vendors make non-standard extensions to the published protocols. These browsers provide different levels of support for features in the protocols. Fortunately, the protocol provides a means for browsers to identify themselves (although they do not describe their capabilities in a meaningful way), which allows servers such as ours to be sensitive to some of the distinctions between them.

2.2.2. Time-Travel in the Interface

A unique feature of web browsers is that they cache previously seen pages and allow the user to revisit these cached pages without notifying the server. In interactive applications delivered on the web, these "previously seen pages" can represent earlier states in a dialog with the user. As a consequence, the browser allows the user to travel back in time to an earlier interaction state and attempt to execute the commands as they were presented then. Since the state of the data on the server may have changed as a result of the user's interactions with the server, incoherent transactions can result. There are two alternative ways for applications to handle this. Applications must either support time-travel, or they must always reflect the most recent state. For example, suppose that a user engages in a dialog to delete an object and then backs up to a page on which the object is still present. If the user selects the deleted object as represented on that page, the application should either undo the deletion or report an error in a graceful way. Our application chooses the latter approach.

2.3. Choices and Design Principles

The underlying principle of our design was to minimize the cognitive load on the user imposed by the interface [8]. As we worked within the constraints of HTML and HTTP to meet our objectives, we used this principle to guide our design decisions. The steps we took included: minimizing the number of intermediate states in complex command execution, minimizing the number of distinct types of pages, the number of distinct widgets, working to achieve consistent appearance across browsers, but working within the widgets provided by the native window systems. In this section we discuss these issues in more detail.

2.3.1. Minimize Intermediate Pages and Number of Clicks

The restrictions on the widget set force complex interactions to be broken down into a sequence of simple ones punctuated by submit actions and the transmission of intermediate pages. To simplify the execution of complex commands, we chose (1) to minimize the number of actions that the user must perform, and (2) to minimize the number of pages presented during the course of the interaction. It is important to minimize these because of the cognitive effort required to parse and understand a new page, because displaying a new page makes it change registration (see section 3.4), and because transmitting and rendering a page may take several seconds.

The desire to minimize the number of clicks and intermediate pages has some consequences. Because we want to minimize the number of clicks that a user performs, we are inclined to make pages richer (and potentially more confusing) to a user. Our application consists of the display and editing of a hierarchy of objects; classes exist within ontologies, classes have slots (attributes), and those slots can themselves have facets (attributes of attributes). We could have chosen to present a different page for each slot and for each facet in any given class. Instead, we display all of the slots and facets of a class on a single page. A user can therefore edit a facet value directly on a class without going through any intermediate pages. This design helps to satisfy our goal of minimizing the number of clicks and intermediate pages, but at the expense of making the pages larger and more complex. The user is presented with considerably more information than may be of interest, and the user is more likely to have to scroll in order to see what is of interest.

2.3.2. Edit-In-Place Metaphor

In order to minimize the number of different-looking pages to which the user would be exposed, we decided that the editing environment should look as much like the browsing environment as possible. This is natural in our application, because even when a user is creating new objects, most of a user's work is browsing. We preserve the browsing environment's look and feel by inserting edit widgets into the flow of the HTML page (see figures 2 and 3) in a manner similar to the protoTextExpando widget in the Newton user interface [12]. An example of this insertion of edit widgets into the flow of the HTML page can be found here in the middle of the worked example.

2.3.3. Pages Represent Objects

A page was to be a representation of a specific object. This meant that commands on the menu bar would implicitly apply to the underlying objects which may be nested (e.g., an attribute of a class in an ontology.) Commands applicable to all of these may appear on the menu bar.

2.3.4. Five Types of Interface Widgets

We decided that in the normal course of events the user should be exposed to the minimal number of distinct widgets. The final design includes only five types of widgets (see section 3.2).

2.3.5. Consistency Across Browsers

We wanted our user interface to work in the same way across all platforms and browsers. This would minimize training and documentation problems. We explicitly chose to limit use of undocumented or non-portable extensions provided by different browsers. One significant exception to this decision is HTML tables, which provide a much better way to present tabulated information than is otherwise possible. We use HTML tables when communicating with a Netscape browser [13] and use PREformatted text (a fixed-width font display mode) for other browsers.

We explicitly decided to support multiple browsers. Some user interfaces on the web are almost unusable without using Netscape. We did not want to limit our user community in this way, and so preserved a consistent look and feel across platforms and browsers.

2.3.6. Native Window System's Submit Buttons

Many developers choose to exercise artistic control over the look and feel of their web pages by using custom bitmapped graphics for buttons instead of the native look and feel that the web browsers provide. Consequently, their web pages look more uniform across platforms, but users must learn to recognize buttons anew for each of these idiosyncratic applications. Furthermore, when this mapped graphic approach is used, the browser is not able to provide feedback to the user about which regions of the bitmap will result in commands being executed.

In contrast, we chose to use the native window system's submit buttons for our interface. Our application was going to be complex enough without the user having to relearn what a button is supposed to look like.

The constraint of supporting naïve users meant that we could not allow any user interface tricks such as having extremely stylized icons that cause differing behavior depending on where on the icon the user clicks. While such approaches allow very terse and dense displays that efficiently take advantage of the available screen real-estate and bandwidth, they would be difficult for members of our target community to understand and learn.

3. Evolution of Design

As we saw in the previous section, the design for our user interface was influenced both by a number of hard constraints imposed by our need to deliver our application over the web, and a number of design choices that reflected our beliefs about the usage patterns of our system and our prospective user community. In this section we describe the evolution of our design and our software, and show how we changed our user interface as we learned more about the application and received user feedback. One of the significant advantages of our implementation approach - a central server and standard browsers, coupled with a dynamic, rapid-prototyping implementation infrastructure - was that we could rapidly receive user feedback and introduce changes in response, often within a few minutes or hours. This meant that as soon as the server was established, our design progressed very rapidly. In this section we describe several of the significant events in this evolutionary process.

3.1. Early Design and Mockup

We were fortunate to start this project with a preexisting substrate. Earlier work had enhanced Ontolingua [3], our ontology development tool so that ontologies could be compiled into static hyperwebs (see here for more on this process and examples). These hyperwebs displayed the concepts in the ontology with one concept per page. Our first step, therefore was to take pages of hypertext generated by this compilation process and manually edit them so as to mock up the editing interface. This process of developing mockups lasted about two weeks and went through multiple iterations, experimenting with different ways to handle the perceived problems of having a simple user interface that was nevertheless powerful. Early designs experimented with putting edit widgets next to each of the potentially modifiable objects in the page being displayed. Selecting one of these edit widgets would take you to a different mockup page with, for example, a text widget allowing the user to edit the selected value. These mockups can be seen here. The whole mockup process was performed by editing these HTML files to refine the design.

We tried several different approaches to provide the functionality of a right button command menu, which is not supported by HTML. For example, in one mockup, there was a radio button in front of each editable object, and at the top of the page was a pop-up menu of operations to be applied to the object selected with the radio button (figure 2). A "Do It" submit button caused the execution of the selected operation. This method requires three different widget selections which involved mouse travel from the selected object back to the top of the page and often required the user to scroll the viewport. This would have been burdensome on our users.

We addressed this problem by using a single intermediate page for edit operations, and by limiting the expressiveness of the user interface - we do not allow the user to execute all conceivably legal operations at any given point. We eventually settled on a design in which a small edit widget would be placed next to any editable object. Selecting this widget would take the user to a page just like the current page, only with suitable edit widgets replacing the value that was selected, and allowing the user to select between a number of possible edit options.

In our application, it is necessary to be able both to create new objects (e.g., classes) and also to add objects to existing objects (e.g., add a property or value to a given object). Finding a good way to represent this proved tricky, and we experimented with several methods. For example, we tried using a "dummy" entry for values, so that for every list of values there would be an extra one at the beginning (or end) in italics that was a place holder for a new value to be added (this is an example of this design mockup). This was found to be confusing, and was soon dropped in favor of the design described in the next section.

3.2. Early Implementation

Moving on from our mockups, our initial implementation of the editing environment attempted to implement the design that we had mocked up as closely as possible, but we rapidly ran into difficulties. For example, in the mockups, quite by chance, we had selected an object which had only one value for each of the attributes being displayed. This encouraged us to position a single edit widget in front of the set of attribute values in order to elicit and/or modify all of the values simultaneously in one text widget. We soon discovered that the more general case of there being several attribute values was very common and the single text widget approach was confusing to our users. Problems with eliciting multiple values in a single text widget were compounded by our desire to edit values in context, which made us want to minimize the size of the text widget. Values were sometimes scrolled out of the user's viewport, which resulted in erroneous editing operations and much confusion. Early user feedback quickly caused us to change our design in favor of eliciting or editing only one attribute value at a time. This had the side-effect of increasing screen clutter by putting individual edit widgets in front of every editable value (figure 3 left).

The next problem that arose was that users were confused by the methods for creating and adding objects and values. They often added values that were illegal or non-existent objects. The number of possible legal values for attributes in the system is typically too large to enumerate explicitly. Because of this, we developed a sophisticated, context-dependent completion facility, which space prevents us from describing here.

After a few iterations we ended up with a design which we have broadly kept since this early stage (see figure 3). In this design, we distinguish between five different types of widget:

the simple hyperlink, which always takes the user to a page showing the selected object.
edit widgets, represented by pencil icons that allow the user to edit an existing value and addition widgets, which are represented by icons such as . Only one type of object can be added at each location; the type is indicated by the icon.
a submit button and pop-up menu combination for commands.
text widget and submit button combinations for value elicitation, and
tool bar icons presenting important, general commands.

Figure 3. The final version uses edit widgets in front of all editable objects, and "+" widgets wherever an addition is possible (left). In this example, we show editing the Slot-Value-Type facet value of Thesis.University from Institution to University. Selecting an edit pencil inserts the necessary widgets to elicit or modify the value in context (middle). The user has entered University instead of Institution. The user can remove screen clutter by inhibiting all edit widgets (right). Here is a detailed worked example of an editing session using the ontology editor, showing a number of the features of the editor.

Although we went to considerable effort to make the user interface clear and intuitive, it was still novel in many ways. To familiarize new users with the meaning of the various icons and edit widgets they might see, we introduced a "Welcome" page which displays the icon images together with a brief explanation. Users can suppress this page once they are familiar with its contents.

By this point, we were able to edit values and create new objects in our system, but we were unable to save our changes. Our user interface mockups had focused exclusively on the different ways to edit objects in the system - they completely neglected the sorts of operations that are typically found on the "File" menu of a normal user interface. This presented us with a problem. We already had a row of buttons at the top of the page that would provoke the creation of new objects. When we added the options that were necessary for typical file menu operations we found that we had a glut of buttons. The number of buttons would only increase as the system became more sophisticated.

At this point, we had to make a significant compromise. Again, because HTML does not support pop-up submit buttons, we were unable to implement the obvious and familiar behavior of a menu bar. The only "correct" model for command menus in HTML is the exhaustive enumeration of the commands as submit buttons. This was untenable because of the number of commands we needed to support (typically around 40 per page). As an alternative, we partitioned the commands into broad classes and put the commands on menus. In front of each menu we placed the name of the menu as a submit button (see figure 1, top left). This meant that the user had to select an option from the menu and then click on the submit button to execute the selected operation. This is non-standard, but we found that our users got used to it fairly quickly, probably because we went to significant effort to make sure that the system would always put the most likely choice as the default option on the menu. Thus, a single pointer operation often suffices.

Because we could not predict whether our users would find the factoring of commands onto menus with submit buttons described above reasonable, we introduced a user preferences mechanism that allowed the user to control, among other things, the look and feel of these menu commands. In practice, most users ended up accepting our default setting for the command menu's look and feel. The desire for menu flexibility was the impetus for us to introduce a user preferences facility, but this feature has become ever more important in our system. Users connect to our application from all around the world from sites with widely differing quality of network connectivity. Consequently it is very important to support options that let the user trade prettiness or verbosity for bandwidth.

The issue of how much power to give to the user came up with the introduction of this command menu model. Our system is non-modal in the sense that it is always theoretically legal to jump to (say) a class creation dialog from the middle of a different object creation dialog. By default, we choose to hide this fact from users in order to simplify the appearance of the user interface. However, power users resent such restrictions, and so we have had to add extra preferences to allow our power users full access to all applicable commands, irrespective of their current context.

3.3. Problems of Scale

By the time the system had reached this level of sophistication, we had made our application available to the public, and around one hundred users had tried it out. We quickly found that users were unable to visualize large ontologies with hundreds or thousands of concepts. This is a problem analogous to browsing a flat file system with hundreds or thousands of files. We considered using a graphical rendering of the hierarchy [10], but this had several problems: transmitting large images is too slow; it is not possible to position the image so that a particular object is visible; the browser cannot tell which portions of the image are links, and the browser may not be able to display graphical data. We chose to provide a hierarchical browsing mechanism with the same sort of triangular open/close widgets that are found in the Macintosh file system. The system uses a heuristic method to compute the appropriate initial set of objects to display in the "open" state (figure 4).

Figure 4. The hierarchical class browser shows large graphs of objects in a compact form. Triangular widgets open and close subgraphs. "Focus" widgets move the selected object to the root of the displayed tree.

We found that however hard we might try, our application always produced pages that were multiple screens in height. Consequently, we replicated the menu bar at the bottom of the page so that the user could scroll to the nearest end of the document in order to select one of these more global commands.

User feedback has caused us to spend more effort on displaying large ontologies. We have recently addressed this by factoring pages showing large ontologies automatically. We have also introduced a method of focusing the hierarchical browsing tool onto subgraphs of the class lattice, and for using this same hierarchical browsing technique to look at attributes as well as classes and instances.

3.4. Breakdown of the Edit-in-place Model

Our early design criteria had called for trying to preserve the illusion that there were only a few different types of pages: one for ontologies, one for the library, and one for frame objects. Edit widgets would be inserted in-place to preserve the look and feel of these pages (see figure 3). When it came to object creation, we did not, of course, have an object to display. When the user selected a creation operation, we presented the user with a fake page representing an object of the type being created, with a text widget into which the user could enter the new object's name. This preserved a uniform look and feel for all of our pages. We rapidly discovered that this approach confused our users. Their cognitive model of object creation was sufficiently different from normal editing and browsing operations that being presented with a fake page for the object to be created was confusing. The users clearly liked to think of the creation pages as being (pseudo) modal dialogs, and didn't want to view the creation page as an ersatz, incomplete object.

Another problem with the edit-in-place model of interaction is that HTML and HTTP give no effective control over viewport positioning. This means that when the user selects an edit operation, we have no choice but to refresh the whole page (unlike a more tightly-coupled user interface in which the system might push existing text aside in order to make room for the edit widgets). Although it is possible to tell the browser where to scroll to on the new page, this is very coarse-grained control. Browsers differ significantly in their scrolling behavior, and this typically results in a loss of registration of the user's viewport, and therefore the loss of the user's cognitive focus.

In practice we suspect that it might be better not to use this edit-in-place model, but rather to use simple pages that give the illusion of modal dialogs. This is an open design question and needs further investigation. Registration of the display is only reliable when the whole HTML page is less than one screen in height. Because browsers cannot inform the server of the browser's window size or exact viewport position within a document, this problem cannot be solved in general. Furthermore, our application typically generates HTML pages more than one screen high.

3.5. Observations on User Interaction

We have conducted one informal email survey of our users and have collected over 1500 in-context comments and bug reports. Together with informal observation of local users, we have observed several surprising characteristics of system usage.

3.5.1. Reluctance to Click

We have noticed that users are reluctant to experiment freely within the application. This becomes an issue when users are unable to understand an icon or command. They do not try it to see what it will do. This surprised us. We had expected that users would experiment freely because the network connection isolates them from most problems. There are several possible explanations for their behavior: They may be avoiding wasting their time. Some browsers are difficult to interrupt if an address is not well formed, a server refuses to respond or responds slowly, or if a large document is transmitted. They may be trying to protect their work. Many web-based applications are brittle and crash unexpectedly. Few applications provide as many safeguards as ours does against irrecoverable damage. They may also be trying to protect the work of other users by following only understood paths. It would be interesting to find out why users are so reluctant to experiment, since this could have significant impact on the design of networked application user interfaces if this behavior proved to be widespread.

3.5.2. Logging Out

We have observed that our users have a powerful desire to be able to log out of our system. Because a user's connection with our application lasts only as long as each individual request from the browser - typically only a second or two - and because the user could walk away, leaving the browser at any time - we have had to engineer the system so that logging out is not necessary. The system has a set of timeouts and defaults that allow it to recover quiescent resources. However, we have noticed that users feel a need to log out, even when they are told explicitly that it is not necessary. We have therefore provided a command to give the user the illusion of logging out.

3.5.3. The Usage of User Preferences

It is often assumed that preferences in applications are not of significant utility because few users actually change them. If we look at active users, whom we define as those who have executed at least 50 commands, we find that some 41% of our users have changed at least one preference. Figure 5 plots the distribution of the number preferences modified against the number of requests performed by each user. There are 47 preference variables in the system of which 42 have been modified by at least one user (here is an example page showing the preferences dialog). Some preferences have been modified by only one user, one has been set by 47 users. From the graph we can see that although power users are changing more preferences, as one would expect, they are changing a focused set of preferences. We conclude from this that although there is a fairly large number of preferences, the number is not wildly in excess of the number needed to represent the degree of customization required by our users.

Figure 5. The average number of preference variables set by each user plotted against the total number of non-trivial requests made by each user to our server

If we look at the actual preferences that are being set, we see that a number of the selections clearly represent a particular user's personal preference for some particular look and feel, other preference selections are clustered around, for example, a number of preferences that allow the user to control the bandwidth used in communicating with our server. Indeed, we have found that the issue of bandwidth is sufficiently important to our users that we label any preference that might impact bandwidth with a splash graphic.

4. The Future

The web is an ever-changing medium, and both the technology and standards are advancing very rapidly. We must therefore consider what we would do (or would have done) if Java or some similarly capable remote scripting facility were to be reliable and powerful enough to use for our application. Certainly, had Java been ready for serious use a year ago we would have been very tempted to use it to get around some of the severe constraints of HTML. In this section, we detail some of the issues surrounding the use of client-side applets, and we then enumerate some of the simple changes that could be made to HTML to make this sort of application much easier and better.

Please note that although in this section we refer to Java, the reader should treat this as a generic reference for some facility that allows the portable specification of client-side applets. At present, it would appear that Java will win this war, but this is by no means certain.

4.1. Java and Client-side Scripting

In order to develop Java programs of any real sophistication, one needs programming tools such as debuggers. These are not currently available. We clearly expect this problem to be solved in time.
Although Java allows one to embed applications within a browser, there is currently no documented, cross-platform and cross-browser API that allows one to interface to the browser. This makes it difficult to add to the browser's menu bar, or to interface to the underlying operating system's drag-and-drop capability, both of which are clearly desirable.
Using Java would allow one to circumvent a number of the constraints in the widget set that HTML imposes. For example, one could have dialogs that could update and constrain themselves dynamically in response to user selections. However, because these widgets would not be "standard" or familiar to the user, any extra widgets that we provided would impose an extra cognitive burden on the user and a documentation burden on the implementors. We would probably only use such widget set extensions when the resulting widgets were much more obvious than the alternatives in plain HTML.
The use of Java would result in a heterogeneous implementation resulting inevitably in increased costs. Given that a significant motivation for our application was reducing cost, we would be reluctant to do anything that would increase implementation cost without a very clear benefit to our user community.
Our goal of making our application available to the widest possible audience would require that Java was entirely ubiquitous. It would have to be supported by a wide variety of browsers, from Netscape and NetCruiser to Lynx. From our point of view it is not acceptable to assume that the fact that the majority of our users currently use Netscape is enough to allow us to make our application be Netscape-only. At present, users can still access our system through a TTY-based browser such as Lynx. If we were to make Java an important part of our application, we would either exclude such users or we would have to support both plain HTML interaction and Java as well, which would result in a duplication of effort, and increased development and maintenance costs.
Any computation performed on the client side increases the computational demands on the user's machine. We want to be as inclusive as possible in our user community, so we are reluctant to run substantial code on the client side. Of course, in some cases, the reduced network traffic resulting from running applets on the client side would compensate for the extra computational burden on the client side.
Even with the use of Java, the vast majority of our computation would be performed on the server, not only because of the need to support collaborative work, but also because the user can easily build up multi-megabyte state. Shipping this much state over the net is impractical. Even for operations such as user input completion, the number of options that the user can select is much too large to transmit to the client machine. This makes it impossible to use Java to provide tightly-coupled completion, even though this would be one of the most obviously desirable benefits of a more powerful widget set.
The use of Java to do anything non-trivial would probably require establishing an extra stream between the client and server machines above and beyond the HTTP connection. This is likely to cause significant problems because a number of our users operate from behind firewalls that would not permit this sort of connection. Moreover, the ephemeral connections of HTML/HTTP provide a certain degree of resiliency that would be lost by the need to keep open persistent network connections over possibly unreliable networks.

4.2. HTML: Problems and Solutions

HTML could be extended to privide better support for more sophisticated applications, making user interaction more satisfactory and implementation cheaper and simpler. Please note that although we show some examples of how one might like to represent some of these ideas, some of these examples violate the current lexis and syntax rules of HTML. To address all of these issues, a proper rewrite of HTML would be required. Sadly, we are probably stuck with all of the warts of HTML.

HTML only supports a single mouse event. If we could also support a pop-up menu of options from any given link, we could support far more sophisticated context-specific behavior, and we could avoid a lot of intermediate pages. For example, we might like to be able to say:

          <A HREF="some-default.html"
             ALTERNATIVES="red.html",
                          "green.html",
                          "blue.html"
             ALTERNATIVE-PRINTED=
               "Red",
               "Green",
               "Blue">
          Color
          </A>

This would insert a normal link on the word "Color". On some browsers, using the right mouse button would pop-up a menu with the three options "Red", "Green" and "Blue". Selecting one of these options would take the user to red.html, green.html, or blue.html respectively. A different link color or underline could be used to indicate anchors that have alternatives. In text-based and mouseless browsers, a keystroke would be used to pop up the menu.

It is not currently possible to have a button on the screen that looks like a submit button, but is actually implemented as a link. This makes it harder to keep a uniform look-and-feel in applications that intermingle links with submit buttons, and for which the distinction between links and submit buttons may not be obvious. One can use ISMAPed images for this purpose, but one then divorces oneself from the native window system's widget set, making the application harder to learn. This problem could easily be overcome by extending the INPUT HTML element to allow the user to say:

          <INPUT NAME="My Button"
                 TYPE="URL"
                 VALUE="URL-for-my-button.html">

This would insert a button labeled "My Button". Selecting the button would have the same effect as one would expect from clicking on a normal link, taking the user to the page denoted by "URL-for-my-button.html".

The action associated with a form is restricted in two serious ways. First, there can be only one such action attribute URL - you cannot associate an overriding URL with different submit buttons. Second, it is not possible to associate a scroll position with either a form or a submit button. One could clearly overcome these restrictions by allowing the user to say:

          <INPUT TYPE="submit" NAME="My Button"
            VALUE="my-button"
            ACTION="/cgi-bin/foo#scroll-to-here">

This would insert a button labeled "My Button" in the page. Selecting this button would override the action attribute URL associated with the form, but would still transmit the values associated with all of the other widgets in the form. One would modify the specification of HTML so that scroll positions in forms would be handled correctly. The correct semantics would be for the browser to strip out the scroll position before sending the form, and scrolling as appropriate upon receipt of the resulting page. Supplying

          <INPUT TYPE="SUBMIT"
                 NAME="My Button"
                 VALUE="my-button"
                 ACTION="#scroll-to-here">

would result in the browser scrolling to the anchor named "scroll-to-here" in the page resulting from the URL specified in the action attribute of the form itself.

HTML provides very poor control over the scroll position of the browser. Not only does the browser not inform the server of the current scroll position, but HTML also only provides a very coarse-grained control of scrolling. It would be very useful if the browser were to insert a header field such as

          Scroll-position: <<name>>[+/-<<lines>>]

where <<name>> indicates the closest named anchor to the home position of the user's viewport, +/- indicates a "+" or "-" character, and <<nlines>> would be the number of lines from that anchor to the home position of the viewport. Obviously, because HTML cedes formatting decisions to the browser, this would still be only an approximation to the viewport's position, but it would still be useful. An application could increase the resolution required simply by inserting more named anchors.

A server cannot cause the browser to scroll to a particular location. This means that it is often necessary for the server to force a redirection on the browser that would otherwise not be necessary simply in order to scroll to the desired position in the page. If the server could send a

          Scroll-to: <<name>>[+/-<<lines>>]

header to the browser, these redirections would be obviated.

It is not possible to control the position (or even presence) of scroll bars on TEXTAREA widgets. This is a particular problem because the default of most widget sets is to put the vertical scroll bar on the right of the text widget and the horizontal scroll bar on the bottom. For large TEXTAREAs, this often results in the vertical scroll bar being scrolled off the right of the user's viewport, and the horizontal scroll bar being scrolled off the bottom of the viewport. Adding an attribute to the TEXTAREA and INPUT/TYPE=TEXT widget elements of the form:

          <TEXTAREA NAME="my-textarea"
             scrollbarpositions=<<positions>>>
             ....
          </TEXTAREA>

where

          <<positions>> ::= <<position>> |
                          <<position>>, <<positions>>
          <<position>>::= [vertical|horizontal]
                          = [left|right|top|
                             bottom|none]

would allow the user to specify the positions and absence of scroll bars. For some applications it may be desirable to have scroll bars on all four sides of a viewport.

There is no way to augment the global (application) menu bar. It would be very desirable to be able to say:

          <INPUT TYPE="submit" 
                 LOCATION="menubar" 
                 MENU="Ontology Editor|Create"
                 NAME="Class" 
                 VALUE="create-class">

This would insert a command called "Class" on the "Create" submenu of a menu item called "Ontology Editor" that would be added to the browser's menu bar. Selecting this command would have the effect of performing a submit on the name called "Class". In this example, a transition to a submenu is shown with a "|" character.

Clearly, one would want this facility to support the TYPE=URL attribute option mentioned above as well.

Many browsers provide feedback to the user about the link under the user's mouse by displaying the URL of the link at the bottom of the window. In the early days of HTML this was adequate because URLs typically named documents, and the names of those documents usually gave some hint as to the content. In non-trivial applications, however, URLs are automatically generated, often having names such as "item-124874638". These names do not provide adequate hints about the content of the page pointed to by the link. Submit buttons are even worse in this respect because they don't cause any URL information to appear at the bottom of the window. The anchor (A), OPTION and INPUT elements should be enhanced to support documentation. For example, it would be most useful to be able to say:

          <A HREF="item-1235897" DOCUMENTATION="Edit Class">
          <INPUT TYPE="SUBMIT" 
                 DOCUMENTATION="Delete this class and all of its subclasses"
                 NAME="Delete">
          <SELECT.....>
              <OPTION DOCUMENTATION="The color red in RGB values">
                Red
          </SELECT>

There is no way to declare the canonical URL for a page. When a user clicks on a link or submit button, the browser records on its history the URL that the user selected. However, if the user clicked on a submit button that caused the system to create a new object, for example, one would like to take the user to the page for the new object. If one does this without forcing a redirection, the browser thinks that the user is on a page named by the form's action attribute URL. This means that if the user reloads the page either the form submission will reposted, or the browser will complain that the resubmission is not allowed. Similar problems arise if the user attempts to add the page to the browser's hotlist. Although this problem can be solved by the use of a redirection, it would be much simpler if the browser could insert in the HTML text an element something like:

          <CANONICAL-URL "some-URL">

This would tell the browser that whatever it might have thought the URL for the current page was, the real URL is in fact "some-URL".

There is no way to gray-out either graphics or menu options in HTML. This means that the service implementor is forced to generate two sets of graphics; one for active commands, and one for any inapplicable commands. This doubles the network traffic associated with downloading the graphics as well as increasing the human effort required to create the service. What is more, because many browsers do not support unbordered, anchored images, one has to supply grayed out graphics that guess the width of the borders of anchored images (typically two pixels) in order to maintain registration. It would be helpful if one could tell the browser to do its own graying out of images:

          <IMG SRC="some-picture.gif" GRAYED-OUT>

A browser could easily overlay a gray stipple pattern on the image to be grayed out, or could gray the color values in the color map of the image to achieve the required effect.
Similarly, it would be useful to be able to make menu options unselectable/grayed out:

          <OPTION UNSELECTABLE>Some option

This would have the obvious benefits of alerting the user of the existence of inapplicable commands and preserving registration.

There is no control of how links get marked as visited. In a number of browsers, visited links get marked in red or purple. This is a useful mechanism, but if the page denoted by a particular link has some ephemeral or time-dependent content, it is deceptive to the user to mark the link as visited. Similarly, if a link actually takes you to a page on which you create a new object, you probably wouldn't want to make the link appear to be visited. A related problem arises with unbordered images. Current browsers do not indicate if such a link has been visited. It would be helpful, therefore, if one could add an attribute to IMG items that would tell the browser to use a suitable browser-specific way to indicate that a link has been visited:

          <A HREF="foo.html">
            <IMG SRC="graphic.gif" 
                 BORDER=0
                 MARK-IF-VISITED>
          </A>

and to prevent a link from being marked as visited:

          <A HREF="foo.html" NEVER-VISITED>
             ...
          </A>

Browsers should be sensitive to the Expires: header field so that a link would no longer be marked as visited once its expiration date had been passed. This is not a matter of HTML specification, but is rather an issue of good client implementation.

There is no way to associate keyboard accelerators with submit buttons or links. This makes it much harder for power users to use HTML-based systems effectively. Obviously, keyboards vary, as does the availability of modifier keys, but most browsers could bind an "Alt" or "Option" key sequence. This would allow:

          <INPUT TYPE="SUBMIT" NAME="Do it" 
                 KEY="D">

which would bind Option/Alt-D to the submit button named "Do it". Clearly, some canonical representation would be needed for keys such as Space, Enter and Return. The case of binding the Enter or Return key is an interesting special case of binding the default selection. Window systems typically have a way to indicate which selection will be made or operation performed if the user hits the Enter key. It would be useful if the submit button or link bound to the Enter key would be labelled in the normal way for the native window system as being the default selection.

There is no guarantee that a given browser will be able to support scrolling in pop-up menus. This is typically a problem with the widget set being used by the implementors of X-based browsers. However, this is a major restriction for application designers, since there is no way for a server to tell just how many items a client will be able to cram into a menu before it truncates them (if at all). The only obvious solutions to this would be either to specify in the standard that compliant browsers must support scrolling in pop-up menus, or failing this, that the browser would render a pop-up menu with too many options as a non pop-up scrolling menu.
A major problem with delivering a complex application over the web is that almost everything on the page is anchored. This means that the size of the document that is actually transmitted to the user in bytes is typically very much larger than would be expected for a document of that apparent size. This presents serious bandwidth problems for users who are connecting over slow lines. One can circumvent this problem to an extent by using a clever coding scheme to minimize the lengths of URLs. However, HTML is an intrinsically verbose protocol, and some aspects of the protocol actively punish users of more complex user interfaces. For example, if you use anchored images to insert widgets, you will probably want to make these images unbordered. However, this means that you have to pay an extra nine bytes for each such image to switch off the border. HTML would be much enhanced by a pair of simple extensions. First one could overload the BASE element to allow the specification of defaults for such properties as image borderedness. This would not be a major change given that BASE is already being overloaded to support mechanisms such as frames in Netscape. The second change would be to support a simple text substitution mechanism. If, for example, one could say something like the following:

          <DEFINE NAME=1>
           <IMG SRC="/image/gadget.gif" BORDER=0>
          </DEFINE>

then one could have the client replace all instances of <1> with <IMG SRC= "/images/gadget.gif" BORDER=0>. This could greatly compress the HTML generated by a complex application.

No graphics can be embedded in pop-up menus. This is an annoying non-regularity in HTML.
There is no control over client-side caching in HTML/HTTP beyond the use of the Expires:, If-Modified-Since:, and Date: header fields. These would probably be sufficient for most purposes if it were not the case that some clients have user preference options that effectively allow the user to override this mechanism. For example, in Netscape, the user can set a "Verify Document" preference to "Once per session", "Never", or "Every time". Unless the user sets this cache preference to "Every time", an application such as ours will not work correctly.
HTTP very much needs a way to solve this problem, either by requiring compliant clients to take notice of the above header fields, or by at least having a header field that would announce to the server what the user's cache preference setting is. Such a header field would at least allow a server to notify a user that a given service is likely to be unusable without a change of preferences.
There is no way in HTML to implement a pop-up submit button. The irony of this is that associating an action with a menu selection is probably the most common form of interaction that users have with menus. Applications would be much simpler if it were to be possible to have a real menu bar (properly documented as described above) with submit actions embedded within the menus.

5. Conclusions

We have shown how we were able to deliver a sophisticated, yet intuitive, interactive application as a network service. An off-the-shelf web browser provides the user's interaction medium. Leveraging standard browsers attracts a large user community, improves the rate of user acceptance, avoids software installation and distribution problems, and dramatically reduces turnaround time for software development.

The key contribution of this paper is to describe the constraints imposed on user interface design by this interaction medium, outline the tradeoffs in this design space, and motivate the choices we made in order to deliver our application over the web. We further outlined the lessons we learned and the design changes we made as the interface evolved in response to user feedback.

These issues are crucial because the growing popularity of the web ensures that this form of software delivery will become ever more wide-spread.

This application is available on the web at:

http://www-ksl-svc.stanford.edu:5915/

Acknowledgments

The authors gratefully acknowledge the support of the following funding agencies: NASA Ames Research Center under contract NCC2-537, ARPA and NASA/ARC under contract NAG2-581 (ARPA order 8607), and CommerceNet under contract CN-1094 (TRP #F33615-94-4413). Numerous people in our user community were both influential and helpful in the development of this application, particularly Wanda Pratt, Rupert Brauch, and Richard Fikes.

References

1: Farquhar, A, Fikes, R, Pratt, W, and Rice, J. Collaborative Ontology Construction for Information Integration. Technical Report KSL-95-63, Stanford University, Knowledge Systems Laboratory. 1995
2: Fikes, R., Cutkosky, M., Gruber, T., & van Baalen, J. Knowledge Sharing Technology Project Overview. Technical Report KSL 91-71, Stanford University, Knowledge Systems Laboratory. 1991
3: Gruber, T. R. A Translation Approach to Portable Ontology Specifications. Knowledge Acquisition, 5(2), 199-220. 1993
4: Gruber, T., and Gautier, P. Machine-generated Explanations of Engineering Models: A compositional modeling approach. Proceedings of the 13th International Joint Conference on Artificial Intelligence, Chambéry, France, 1502-1508. Knowledge Systems Laboratory. 1993.
5: Gruber, T., and Olsen, G. An ontology for engineering mathematics. In Jon Doyle, Piero Torasso, & Erik Sandewall, Ed., Fourth International Conference on Principles of Knowledge Representation and Reasoning, Gustav Stresemann Institut, Bonn, Germany, Morgan Kaufmann, 1994.
6: Gruber, T., Vemuri, S, and Rice, J. Model-based Virtual Document Generation. Technical Report KSL-95-80 (this is a hypertext paper best viewed here), Stanford University, Knowledge Systems Laboratory. 1995
7: Grudin, J. Groupware and Cooperative Work: Problems and Prospects, in The Art of Human-Computer Interface Design. Laurel, B., Ed. Addison-Wesley Publishing Company, Inc., Reading, Massachusetts, 1990, pp. 171-185.
8: Gygi, K. Recognizing the Symptoms of Hypertext... and What to do About it, in Laurel, B., ed., The Art of Human-Computer Interface Design. Addison-Wesley Publishing Company, Inc., Reading, Massachusetts, 1990, pp. 279-287.
9: Knowledge Sharing Technology Project. http://www-ksl.stanford.edu/kst/kst-overview.html
10: Lamping, J, Rao, R, Pirolli, P: A Focus+Context Technique Based on Hyperbolic Geometry for Visualizing Large Hierarchies. In CHI '95 Human Factors in Computing Systems. 1995. New York: ACM Press.
11: Marshall C., and Irish P. Guided tours and On-Line Presentations: How Authors Make Existing Hypertext Intelligible for Readers. Proceedings ACM Hypertext'89. Pitsburgh, PA, USA, 5-8 November 1989. ACM Press 1989.
12: Newton Programmers Guide. Apple Computer Inc. Cupertino CA. 1993
13: Netscape Navigator Handbook, Netscape Communications Corporation, Mountain View, California. Also available on the world wide web at: http://home.netscape.com/eng/mozilla/1.1/handbook/
14: Sukaviriya S, and Foley J. Coupling A UI Framework with Automatic Generation of Context-Sensitive Animated Help, Proceedings ACM UIST'90 (User Interface Software and Technology), Snowbird, Utah. October 3-5 1990. 152-166 ACM Press 1990
15: Trigg, R. Guided tours and tabletops: tools for communicating in a hypertext environment. ACM Trans. Office Info. Syst. 6, 4, October 1988, 398-414.
16: World Wide Web Consortium. http://www.w3.org/