Site hosted by Angelfire.com: Build your free website today!
THE HISTORY OF HTML

THE EARLY HISTORY OF HTML

Tell me more about HTML

HTML in 1990 - 1992 HTML is the language that powers the Web in many respects, as the lingua franca that Web browsers are expected to be able to render. HTML has had unprecedented levels of success, and the uptake is all the more surprising when you realise that it was only invented in 1990, and few people knew about it before 1993.

In fact, although HTML has changed relatively little since those early days, the history of HTML is rather cloudy. However, with a little detective work on the Web, it is possible to reconstruct most of the events that led to the creation and subsequent deployment and acception of HTML.

1990 Tim Berners-Lee first started to come up with code for his WWW project in 1990. The first mention of him working on code for processing HyperText can be found in the original HyperText.m file that Tim worked on, dated 25th September 90.

From the 27th to the 30th November 1990, Tim and Robert Cailliau attended ECHT '90 - the European HyperText Convention. After ECHT '90, it appears that he had some more ideas about the (probably as yet unnamed) World Wide Web, and in the last few months of 1990, he started to produce more code, and also the first recorded HTML documents.

In fact, the earliest HTML document on the WWW at the moment dates from 13th November - a couple of weeks before the conference - as evidenced a HTTP HEAD request, which returns "Last-Modified: Tue, 13 Nov 1990 15:17:00 GMT". The page is still functional in most modern Web browsers, and even contains a functional HyperLink!

Early HTML Markup So, what was early HTML actually like? The following is the code used in the oldest HTML document referenced above:

-

LINKS AND ANCHORS


A link is the connection between one piece of
hypertext and another.
These are the tags and attributes evident from the first five days of Dec 1990:-
A document from the 3rd December: h1 ol li a a@href
A document from the 5th December: title h2 p

Incredibly enough, it is still possible to create a decent working HTML document using just these tags.

Many more tags were set out in TimBL's early formatting test case, probably used to test the WorldWideWeb browser application on the NeXT:

Hypertext HTML formatting example - 7th December 1990 SGMLguid + HyperText = HTML? But, why these tags? Was there anything that influenced early HTML? Tim had mentioned that some of the early HTML documents were based on an old SGML language that CERN was already using:

We have included in HTML some tags from the SGML tagset used at and once supported at CERN [...] The HTML parser will ignore tags which it does not understand, and will ignore attributes which it does not understand of CERN-SGML tags. - http://www.w3.org/Test/test

I did not know that HTML had been derived from such a language until I stumbled across an interesting set of documents from the 19th December 1990, that contained the following unusual tags:- [1] had the tags bl bib bib@id hp1 [2] had the tags xmp h3 [3] had the tags box fn [4] had the tags i1 i1@ix dl dt dd dl@compact [5] had the tags bibref bibref@refid

Looking carefully at these documents, they are actually extracts from a large SGMLguid document (SGMLguid was the SGML language at CERN that Tim refered to) last modified by TimBL on the same date. In other words, they weren't actually HTML as I first thought, but Tim was in the process of converting them over: i.e. hyperlinking them together!

After further study of these documents and others in the same subdirectory, it became apparent that most of the early HTML tags were actually taken from the CERN SGMLGuid language, which itself was a variant of AAP (an early SGML language). For example, title, hn, p, ol and so on are all apparently taken from this language. The only radical change was the addition of the all important anchor () link, without which the WWW wouldn't have taken off.

Pre-1990: GMLguide and GML It took a while to find out about SGMLguid, and I had originally thought that SGML stood for "Standard Generalized Markup Langauge". However, after entering some of the tags from the example document into Google, I found the Waterloo SCRIPT GML User's Guide, dating from October 18 1988, and containing a reference to the language "GMLguide". SGMLguid is most likely a corruption of that, or possibly a pun, since the version used at CERN was truly SGML.

The User's Guide from 1988 mentions the tags ADDRESS, BODY, DL, DT, DD, H[0-6], INDEX, LI, OL, P, Q, TITLE, UL, and XMP. The main missing tag is, of course, A (for Anchor).

In fact, the tags from Script GML are largely taken from GML itself. GML was the predecessor to SGML, and was developed by IBM. There is a GML Starter Set Reference guide on the Web, and from this it appears that the language has been in existence since about 1980. This is as far back as HTML can be traced.

A sidenote of some interest may be Michael Friendly's GMLHTML: A GML to HTML Translator for Waterloo Script/GML.

It is incredible to think that most of HTML was already defined as GML and GMLguide, and that Tim wanted to show how one could HyperLink them together. That's why HTML is so basic: beacuse it's actually just a derived version of Hyperlinked SGMLguid, but leaving out much of the typographic junk, to make it easier to learn and parse - I presume.

As if we need any further convincing of HTML's roots in the GMLguide language, here is an excerpt from the SGMLguid document, with a .sgml extension, last modified (i.e. saved) by TimBL on the 19th December, but with a last revised date of April 1990:


INTRODUCTION

This manual describes how to build a distributed system using the Remote Procedure Call system developed in the Online Group of the DD Division of CERN, the European Particle Physics Laboratory.

The remote procedure call product consists of two essential parts: an RPC compiler which is used during development of an application, and the RPC run time system, which is part of the run time code.

Target systems supported are

It was actually first written in 1986, showing that the basis for HTML is probably a lot older than people may think it is.

1991 Still, although there were some basic features by the start of 1991, many of the features were yet to be added, and the language really came into its own in these years.

Towards the end of 1991, Dan Connolly came on the scene. Here is TimBL trying to describe the basics of HTML to him in October 1991:

Re: status. Re: X11 BROWSER for WWW

Here is some discussion about the tags -- where it's not in http://info.cern.ch/hypertext/WWW/MarkUp/Tags.html I have updated that document now.

Most of the tags are just style tags: this goes for the headings H1 to H6, the lists UL and OL with list elements LI, the glossary DL with elements DT and DD.

is designed to be used for putting in the top banner of a window, or using as the window name. It also is what you would use in a history list. It shouldn't be displayed in the text itself, as usually there is a heading at the top of the text anyway. A difference is that thet title is designed to make sense out of context, whereas the heading is within context. For example, a title might be "Formatting Characters for Printf -- C reference manual" whereas the heading may just be "Formatting characters".

The base address tag is not used, nor is highlighting HP1 etc.

Anchors are used! The REL attribute is NOT used.

BACK TO TOP