A markup language is a modern system for annotating An annotation is a summary made of information in a book, document, online record, video, software code or other information, "in the margin", or perhaps just underlined or highlighted passages. Annotated bibliographies, give descriptions about how each source is useful to an author in constructing a paper or argument. Creating these a text in a way that is syntactically distinguishable from that text. The idea and terminology evolved from the "marking up" of manuscripts, i.e. the revision instructions by editors, traditionally written with a blue pencil on authors' manuscripts. Examples are typesetting instructions such as those found in troff troff is a document processing system developed by AT&T for the Unix operating system and LaTeX Latex as found in nature is a milky sap-like fluid found in 10% of all flowering plants . It is a complex emulsion consisting of proteins, alkaloids, starches, sugars, oils, tannins, resins, and gums that coagulates on exposure to air. It is usually exuded after tissue injury. In most plants, latex is white, but some have yellow, orange, or, and structural markers such as XML Extensible Markup Language is a set of rules for encoding documents in machine-readable form. It is defined in the XML 1.0 Specification produced by the W3C, and several other related specifications, all gratis open standards tags. Markup is typically omitted from the version of the text which is displayed for end-user consumption. Some markup languages, like HTML have presentation semantics In computer science, particularly in human-computer interaction, presentation semantics specify how a particular piece of a formal language is represented in a distinguished manner accessible to human senses, usually human vision. For example, saying that <bold> ... </bold> must render the text between these constructs using some bold, meaning their specification prescribes how the structured data is to be presented, but other markup languages, like XML, have no predefined semantics.

A well-known example of a markup language in widespread use today is HyperText Hypertext is text displayed on a computer or other electronic device with references to other text that the reader can immediately access, usually by a mouse click or keypress sequence. Apart from running text, hypertext may contain tables, images and other presentational devices. Hypertext is the underlying concept defining the structure of the Markup Language (HTML HTML, which stands for HyperText Markup Language, is the predominant markup language for web pages. It is written in the form of HTML elements consisting of "tags" surrounded by angle brackets within the web page content), one of the document formats of the World Wide Web The World Wide Web, abbreviated as WWW and commonly known as the Web, is a system of interlinked hypertext documents accessed via the Internet. With a web browser, one can view web pages that may contain text, images, videos, and other multimedia and navigate between them by using hyperlinks. Using concepts from earlier hypertext systems, British. HTML is mostly an instance of SGML The Standard Generalized Markup Language is an ISO-standard technology for defining generalized markup languages for documents. ISO 8879 Annex A.1 defines generalized markup: (though, strictly, it does not comply with all the rules of SGML) and follows many of the markup conventions used in the publishing industry in the communication of printed work between authors, editors, and printers.

Contents

Types

There are three general categories of electronic markup: Presentational, procedural, and descriptive.[1][2]

Presentational markup is that used by traditional word-processing systems, binary codes embedded in document text that produced the WYSIWYG WYSIWYG , is an acronym for What You See Is What You Get. The term is used in computing to describe a system in which content displayed during editing appears very similar to the final output, which might be a printed document, web page, slide presentation or even the lighting for a theatrical event.[clarification needed] effect. Such markup is usually designed to be hidden from human users, even those who are authors or editors.

Procedural markup is embedded in text and provides instructions for programs that are to process the text. Well-known examples include troff troff is a document processing system developed by AT&T for the Unix operating system, LaTeX Latex as found in nature is a milky sap-like fluid found in 10% of all flowering plants . It is a complex emulsion consisting of proteins, alkaloids, starches, sugars, oils, tannins, resins, and gums that coagulates on exposure to air. It is usually exuded after tissue injury. In most plants, latex is white, but some have yellow, orange, or, and PostScript PostScript is a dynamically typed concatenative programming language created by John Warnock and Charles Geschke in 1982. PostScript is best known for its use as a page description language in the electronic and desktop publishing areas; it is expected that the processor runs through the text from beginning to end, following the instructions as encountered. Text with such markup is often edited with the markup visible and directly manipulated by the author. Popular procedural-markup systems usually include programming constructs, such that macros or subroutines can be defined and invoked by name. An example of descriptive markup would be the troff's .bd, which instructs the processor to switch to a bold-face font.

In Descriptive markup, the markup is used to label parts of the document rather than to provide specific instructions as to how they should be processed. The objective is to decouple the inherent structure of the document from any particular treatment or rendition of it. Such markup is often described as "semantic". An example of descriptive markup would be HTML's <cite> tag, which is used to label a citation.

There is considerable blurring of the lines between the types of markup. In modern word-processing systems, presentational markup is often saved in descriptive-markup-oriented systems such as XML Extensible Markup Language is a set of rules for encoding documents in machine-readable form. It is defined in the XML 1.0 Specification produced by the W3C, and several other related specifications, all gratis open standards, and then processed procedurally by implementations. The programming constructs in descriptive-markup systems such as TeX may be used to create higher-level markup systems which are more descriptive, such as LaTeX Latex as found in nature is a milky sap-like fluid found in 10% of all flowering plants . It is a complex emulsion consisting of proteins, alkaloids, starches, sugars, oils, tannins, resins, and gums that coagulates on exposure to air. It is usually exuded after tissue injury. In most plants, latex is white, but some have yellow, orange, or.

In recent years, a number of small and largely unstandardized markup languages have been developed to allow authors to create formatted text via web browsers, for use in wikis Wikis may exist to serve a specific purpose, and in such cases, users use their editorial rights to remove material that is considered "off topic." Such is the case of the collaborative encyclopedia Wikipedia. In contrast, open purpose wikis accept content without firm rules as to how the content should be organized and web forums. The markup language used by Wikipedia Wikipedia is a free, web-based, collaborative, multilingual encyclopedia project supported by the non-profit Wikimedia Foundation. Its 16 million articles have been written collaboratively by volunteers around the world, and almost all of its articles can be edited by anyone with access to the site. Wikipedia was launched in 2001 by Jimmy Wales is one such.

History

The term markup is derived from the traditional publishing practice of "marking up"' a manuscript A manuscript or handwrit is a recording of information that has been manually created by someone or some people, such as a hand-written letter, as opposed to being printed or reproduced some other way. The term may also be used for information that is hand-recorded in other ways than writing, for example inscriptions that are chiselled upon a hard, which involves adding handwritten annotations in the form of conventional symbolic printer Printing is a process for reproducing text and image, typically with ink on paper using a printing press. It is often carried out as a large-scale industrial process, and is an essential part of publishing and transaction printing's instructions in the margins and text of a paper manuscript or printed proof Proofreading traditionally is the reading of a galley proof of text or art to detect and correct production errors. Computerization has required proofreaders to increasingly adopt skill-sets general to desktop publishing. For centuries, this task was done primarily by skilled typographers known as "markup men"[3] or "copy markers"[4] who marked up text to indicate what typeface In typography, a typeface is a set of one or more fonts, in one or more sizes, designed with stylistic unity, each comprising a coordinated set of glyphs. A typeface usually comprises an alphabet of letters, numerals, and punctuation marks; it may also include ideograms and symbols, or consist entirely of them, for example, mathematical or map-, style, and size should be applied to each part, and then passed the manuscript to others for typesetting Typesetting is the composition of text material by means of types by hand. Markup was also commonly applied by editors, proofreaders, publishers, and graphic designers, and indeed by document authors.

GenCode

The idea of using markup languages in computer text processing was probably first publicly presented by publishing executive William W. Tunnicliffe William W. Tunnicliffe is credited by Charles Goldfarb as being the first person (1967) to articulate the idea of separating the definition of formatting from the structure of content in electronic documents at a conference in 1967, although he preferred to call it "generic coding." It can be seen as a response to the emergence of programs such as RUNOFF that each used their own control notations, often specific to the target typesetting device. In the 1970s, Tunnicliffe led the development of a standard called GenCode for the publishing industry and later was the first chair of the International Organization for Standardization The International Organization for Standardization , widely known as ISO (pronounced /ˈaɪsoʊ/ EYE-soe), is an international-standard-setting body composed of representatives from various national standards organizations. Founded on 23 February 1947, the organization promulgates worldwide proprietary industrial and commercial standards. It has committee that created SGML The Standard Generalized Markup Language is an ISO-standard technology for defining generalized markup languages for documents. ISO 8879 Annex A.1 defines generalized markup:, the first standard descriptive markup language. Book designer Stanley Rice published speculation along similar lines in 1970.[5] Brian Reid Brian Keith Reid is a computer scientist most famous for developing the Scribe word processing system, the subject of his 1980 doctoral dissertation, for which he received the Association for Computing Machinery's Grace Murray Hopper Award in 1982. Scribe was a pioneer in the use of descriptive markup. Reid presented a paper describing Scribe in, in his 1980 dissertation at Carnegie Mellon University Coordinates: 40°26′36″N 79°56′37″W / 40.443322°N 79.943583°W Carnegie Mellon University is a private research university in Pittsburgh, Pennsylvania. The university began as the Carnegie Technical Schools, founded by Andrew Carnegie in 1900. In 1912, the school became Carnegie Institute of Technology and began granting four-year, developed the theory and a working implementation of descriptive markup in actual use.

However, IBM International Business Machines (NYSE: IBM) is a multinational computer, technology and IT consulting corporation headquartered in Armonk, New York, United States. IBM is the world's fourth largest technology company and the second most valuable global brand (after Coca-Cola). IBM is one of the few information technology companies with a researcher Charles Goldfarb Charles F. Goldfarb is known as the father of SGML and is a co-inventor of the concept of markup languages. In 1969 Charles Goldfarb, leading a small team at IBM, developed the first markup language, called Generalized Markup Language, or GML. In an interview with Web Techniques Magazine editor Michael Floyd, Dr. Goldfarb explains that he coined is more commonly seen today as the "father" of markup languages. Goldfarb hit upon the basic idea while working on a primitive document management system intended for law firms in 1969, and helped invent IBM GML Generalized Markup Language is a set of macros that implement intent-based markup tags for the IBM text formatter, SCRIPT/VS. SCRIPT/VS is the main component of IBM's Document Composition Facility (DCF). A starter set of tags in GML is provided with the DCF product later that same year. GML was first publicly disclosed in 1973.

In 1975, Goldfarb moved from Cambridge, Massachusetts Cambridge is a city in Middlesex County, Massachusetts, United States, in the Greater Boston area. It was named in honor of the University of Cambridge in England, a nexus of the Puritan theology embraced by the town's founders. Notably, Cambridge is home to two internationally prominent universities, Harvard University and the Massachusetts to Silicon Valley Silicon Valley is in the southern part of the San Francisco Bay Area in Northern California, United States. The region is home to many of the world's largest technology companies including Apple, Google, Facebook, HP, Intel, Cisco, eBay, Adobe, Agilent, Oracle, Yahoo, Netflix, and EA. The term originally referred to the region's large number of and became a product planner at the IBM Almaden Research Center The IBM Almaden Research Centre is in San Jose, California, and is one of IBM's eight worldwide research labs. Its scientists perform basic and applied research in computer science, services, storage systems, physical sciences, and materials science and technology. The centre opened in 1986, and continues the research started in San Jose more than. There, he convinced IBM's executives to deploy GML commercially in 1978 as part of IBM's Document Composition Facility product, and it was widely used in business within a few years.

Development informally began in 1978[citation needed] on what ultimately became the SGML standard, which was based on both GML and GenCode; Goldfarb eventually became chair of the SGML committee. SGML was first and released by ISO as the ISO 8879 standard in October 1986.

Some early examples of computer markup languages available outside the publishing industry can be found in typesetting tools on Unix Unix is a computer operating system originally developed in 1969 by a group of AT&T employees at Bell Labs, including Ken Thompson, Dennis Ritchie, Brian Kernighan, Douglas McIlroy, and Joe Ossanna. Today's Unix systems are split into various branches, developed over time by AT&T as well as various commercial vendors and non-profit systems such as troff troff is a document processing system developed by AT&T for the Unix operating system and nroff nroff is a Unix text-formatting program; it produces output suitable for simple fixed-width printers and terminal windows. It is an integral part of the Unix help system, being used to format man pages for display. In these systems, formatting commands were inserted into the document text so that typesetting software could format the text according to the editor's specifications. It was a trial and error Trial and error, or trial by error or try an error, is a general method of problem solving, fixing things, or for obtaining knowledge. "Learning doesn't happen from failure itself but rather from analyzing the failure, making a change, and then trying again." iterative process to get a document printed correctly.[citation needed] Availability of WYSIWYG WYSIWYG , is an acronym for What You See Is What You Get. The term is used in computing to describe a system in which content displayed during editing appears very similar to the final output, which might be a printed document, web page, slide presentation or even the lighting for a theatrical event.[clarification needed] ("what you see is what you get") publishing software supplanted much use of these languages among casual users, though serious publishing work still uses markup to specify the non-visual structure of texts, and WYSIWYG editors now usually save documents in a markup-language-based format.

TeX

Another major publishing standard is TeX, created and continuously refined by Donald Knuth Donald Ervin Knuth (born January 10, 1938) is a renowned computer scientist and Professor Emeritus of the Art of Computer Programming at Stanford University in the 1970s and '80s. TeX concentrated on detailed layout of text and font descriptions in order to typeset mathematical books in professional quality. This required Knuth to spend considerable time investigating the art of typesetting Typesetting is the composition of text material by means of types. However, TeX has a steep learning curve, so that it is mainly used in academia Academia, Acadème, or the Academy are collective terms for the community of students and scholars engaged in higher education and research, where it is the de facto standard in many scientific disciplines. A TeX macro package known as LaTeX Latex as found in nature is a milky sap-like fluid found in 10% of all flowering plants . It is a complex emulsion consisting of proteins, alkaloids, starches, sugars, oils, tannins, resins, and gums that coagulates on exposure to air. It is usually exuded after tissue injury. In most plants, latex is white, but some have yellow, orange, or provides a descriptive markup system on top of TeX, and is widely used.

Scribe, GML and SGML

Main articles: IBM Generalized Markup Language Generalized Markup Language is a set of macros that implement intent-based markup tags for the IBM text formatter, SCRIPT/VS. SCRIPT/VS is the main component of IBM's Document Composition Facility (DCF). A starter set of tags in GML is provided with the DCF product and Standard Generalized Markup Language The Standard Generalized Markup Language is an ISO-standard technology for defining generalized markup languages for documents. ISO 8879 Annex A.1 defines generalized markup:

The first language to make a clear and clean distinction between structure and presentation was Scribe Scribe is a markup language and word processing system which pioneered the use of descriptive markup. Scribe was revolutionary when it was proposed, because it involved for the first time a clean separation of structure and format, developed by Brian Reid Brian Keith Reid is a computer scientist most famous for developing the Scribe word processing system, the subject of his 1980 doctoral dissertation, for which he received the Association for Computing Machinery's Grace Murray Hopper Award in 1982. Scribe was a pioneer in the use of descriptive markup. Reid presented a paper describing Scribe in and described in his doctoral thesis in 1980.[6] Scribe was revolutionary in a number of ways, not least that it introduced the idea of styles separated from the marked up document, and of a grammar In linguistics, grammar is the set of structural rules that govern the composition of sentences, phrases, and words in any given natural language. The term refers also to the study of such rules, and this field includes morphology, syntax, and phonology, often complemented by phonetics, semantics, and pragmatics. Linguists do not normally use the controlling the usage of descriptive elements. Scribe influenced the development of Generalized Markup Language Generalized Markup Language is a set of macros that implement intent-based markup tags for the IBM text formatter, SCRIPT/VS. SCRIPT/VS is the main component of IBM's Document Composition Facility (DCF). A starter set of tags in GML is provided with the DCF product (later SGML) and is a direct ancestor to HTML and LaTeX Latex as found in nature is a milky sap-like fluid found in 10% of all flowering plants . It is a complex emulsion consisting of proteins, alkaloids, starches, sugars, oils, tannins, resins, and gums that coagulates on exposure to air. It is usually exuded after tissue injury. In most plants, latex is white, but some have yellow, orange, or.

In the early 1980s, the idea that markup should be focused on the structural aspects of a document and leave the visual presentation of that structure to the interpreter led to the creation of SGML. The language was developed by a committee chaired by Goldfarb. It incorporated ideas from many different sources, including Tunnicliffe's project, GenCode. Sharon Adler, Anders Berglund Anders Berglund is a Swedish organizer, composer, conductor, pianist and musician, and James A. Marke were also key members of the SGML committee.

SGML specified a syntax for including the markup in documents, as well as one for separately describing what tags were allowed, and where (the Document Type Definition (DTD) or schema). This allowed authors to create and use any markup they wished, selecting tags that made the most sense to them and were named in their own natural languages. Thus, SGML is properly a meta-language, and many particular markup languages are derived from it. From the late '80s on, most substantial new markup languages have been based on SGML system, including for example TEI and DocBook. SGML was promulgated as an International Standard by International Organization for Standardization, ISO 8879, in 1986.

SGML found wide acceptance and use in fields with very large-scale documentation requirements. However, it was generally found to be cumbersome and difficult to learn, a side effect of attempting to do too much and be too flexible. For example, SGML made end tags (or start-tags, or even both) optional in certain contexts, because it was thought that markup would be done manually by overworked support staff who would appreciate saving keystrokes[citation needed].

HTML

Main article: HTML

By 1991, it appeared to many that SGML would be limited to commercial and data-based applications while WYSIWYG tools (which stored documents in proprietary binary formats) would suffice for other document processing applications. The situation changed when Sir Tim Berners-Lee, learning of SGML from co-worker Anders Berglund and others at CERN, used SGML syntax to create HTML. HTML resembles other SGML-based tag languages, although it began as simpler than most and a formal DTD was not developed until later. Steven DeRose[7] argues that HTML's use of descriptive markup (and SGML in particular) was a major factor in the success of the Web, because of the flexibility and extensibility that it enabled (other factors include the notion of URLs and the free distribution of browsers). HTML is quite likely the most used markup language in the world today.

Some[citation needed] would restrict the term "markup language" to systems that directly support non-hierarchical structures (see Hierarchical model). By this definition HTML, XML, and even SGML (apart from its rarely-used CONCUR option) would be disqualified and called "container languages" instead. However, the term "container language" is not in widespread use, and such hierarchical languages are almost universally considered markup languages. There is active research on non-hierarchical markup models, some expressed within XML and related languages (for example, using the Text Encoding Initiative Guidelines and derivatives such as the Open Scripture Information Standard and CLIX), and some not (for example, MECS and the Layed Markup and Annotation Language or LMNL). Much of this research is published in the proceedings of the Extreme Markup and Balisage conferences, generally held in Montreal.

Show All>>

 

The above information uses material from Wikipedia and is licensed under the GNU Free Documentation License.
Some facts may not have been fully verified for accuracy. [Disclaimers]
This page was last archived by our server on Fri Sep 3 11:55:27 2010. [ refresh local cache ]
Displaying this page or its contents does not use any Wikimedia Foundation's resources.
The owners of this site proudly support the Wikimedia Foundation.


Web Hosting Library Launches at 34SP.com - HostReview.com (press release)
hostreview.com
Web Hosting Library Launches at 34SP.com - HostReview.com (press release)
Tue, 24 Aug 2010 09:50:26 GMT+00:00
HostReview.com (press release) Guide To Creating A Website With HTML - HTML is the acronym for 'hypertext markup language ' and is the core coding present behind most web pages. ...
Google News Search: Markup language,
Fri Sep 3 11:55:29 2010
Editor markuplanguage png
wiki.eclipse.org
Editor markuplanguage png
633px x 720px | 132.00kB

[source page]

Editor Context Menu

Yahoo Images Search: Markup language,
Fri Sep 3 11:55:29 2010
Django snippets: Template filter implementing the Trac wiki markup ...
djangosnippets.org
Django snippets: Template filter implementing the Trac wiki markup ...

simon

hu, 11 Sep 2008 16:08:36 GM

I don't know how robust or secure this is, but it's working for me so far. Author: simon; Posted: September 11, 2008; . Language. : Python; Django Version: 1.0; Tags: templatetag trac filter . markup. ; Score: 0 (after 0 ratings) ...

Google Blogs Search: Markup language,
Sat Sep 4 15:58:08 2010
who developed the hyper text markup language?
Q. Founder of HTML
Asked by Suresh S - Fri Aug 22 00:46:42 2008 - - 2 Answers - 0 Comments
Yahoo Answers Search: Markup language,
Fri Sep 3 11:55:30 2010