HTML
- REDIRECT
Originally defined as a highly simplified subset of SGML, HTML is now an international standard (ISO/IEC 15445:2000). The HTML specification is maintained by the World Wide Web Consortium (W3C).
Early versions of HTML were defined with looser syntactical rules, which helped its adoption by those unfamiliar with web publishing. Web browsers commonly made assumptions about intent, and proceeded with rendering of the page. Over time, the trend has been to create an increasingly strict language syntax. HTML 4.01 is the current version of the HTML specification, although the W3C is moving toward replacing it with XHTML, which applies the stricter rules of XML to HTML.
| Table of contents |
|
2 Version history of the standard 3 Markup elements 4 The Document Type Definition 5 Separation of style and content 6 See also 7 External links |
Introduction
HTML is a form of markup that is oriented toward the construction of single-page text documents with specialized rendering software called HTML user agents, the most common example of which is a web browser. HTML provides a means by which the document's content can be annotated with various kinds of metadata and rendering hints. The rendering cues may range from minor text decorations, such as specifying that a certain word be underlined or that an image be inserted, to sophisticated imagemaps and form definitions. The metadata may include information about the document's title and author, structural information such as headings, paragraphs, lists, etc., and crucially information that allows the document to be linked to other documents to form a hypertext web.
HTML is a text based format that is designed to be both readable and editable by humans using a text editor. However, writing and updating a large number of pages by hand in this way is time consuming, requires a good knowledge of HTML and can make consistency difficult to maintain. Visual HTML editors such as Macromedia Dreamweaver, Adobe GoLive or Microsoft FrontPage allow the creation of web pages to be treated much like word processor documents. However, the code generated by these programs is frequently of poor quality.
HTML can be generated on the fly using a server-side scripting system such as PHP, JSP or ASP. Many web applications like content management systems, wikis and web forums generate HTML pages.
HTML is also used in email. Many email clients include a GUI HTML editor for composing emails and a rendering engine for displaying them once received. Use of HTML in email is quite controversial and many mailing lists deliberately block it.
Version history of the standard
- HTML 2.0 — (RFC 1866) approved as a proposed standard September 22, 1995,
- HTML 3.2 — January 14, 1997,
- HTML 4.0 — December 18, 1997,
- HTML 4.01 (minor fixes) — December 24, 1999,
- ISO/IEC 15445:2000 ("ISO HTML", based on HTML 4.01 Strict) — May 15, 2000.
The HTML 3.0 standard was proposed by the newly formed W3C in March, 1995, and provided many new capabilities such as support for tables, text flow around figures and the display of complex math elements. Even though it was designed to be compatible with HTML 2.0, it was too complex at the time to be implemented, and when the draft expired in September 1995 work in this direction was discontinued due to lack of browser support. HTML 3.1 was never officially proposed, and the next standard proposal was HTML 3.2 (code-named 'Wilbur'), which dropped the majority of the new features in HTML 3.0 and instead adopted many browser-specific elements and attributes which had been created for the Netscape and Mosaic web browsers. Support for math as proposed by HTML 3.0 finally came about years later with a different standard, MathML.
HTML 4.0 likewise adopted many browser-specific elements and attributes, but at the same time began to try to 'clean up' the standard by marking some of them as 'deprecated', and suggesting they not be used.
Minor editorial revisions to the HTML 4.0 specification were published as HTML 4.01. Due to the advent of XHTML, there will not be any more new versions of HTML.
Markup elements
Below are the kinds of markup elements in HTML.
- Structural markup. Describes the purpose of text. For example,
<h2>Golf</h2>
- Presentational markup. Describes the appearance of the text, regardless of its function. For example,
<b>boldface</b>
- Hypertext markup. Links parts of the document to other documents. For example,
<a href="http://www.wikipedia.org/">Wikipedia</a>
The Document Type Definition
All HTML documents should start with a Document Type Definition (DTD) declaration, to specify which version of the standard they conform to. For example:
Separation of style and content
Efforts of the web development community have led to a new thinking in the way a web document should be written; XHTML epitomizes this effort. Standards stress using markup which suggests the structure of the document, like headings, paragraphs, block quoted text, and tables, instead of using markup which is written for visual purposes only, like <font>, <b> (bold), and <i> (italics). Some of these elements are not permitted in certain varieties of HTML, like HTML 4.01 Strict. CSS provides a way to separate the HTML structure from the content's presentation, by keeping all code dealing with presentation defined in a CSS file. See separation of style and content.
See also
- Character encodings in HTML
- Dynamic HTML
- HTML editor
- HTML element
- HTML scripting
- Parse
- Tim Berners-Lee
- Unicode and HTML
- Web colors
- List of document markup languages
- Comparison of document markup languages
- Comparison of layout engines (HTML)
- XHTML
External links
W3C Specifications
Validators
Selected Tutorials/Guides