
Transformation from HTML to XML: Methodology and Tool

J. Wenny Rahayu,
Lydia Bishay,
David Taniar,

Abstract


HTML has been widely adopted by Web developers, largely because of its ease of use. Despite this popularity, HTML has reached its limits as the markup language for the Web: its predefined tags serve merely as a presentation tool, which makes the language inflexible. XML was introduced to overcome these deficiencies, especially by allowing more structural and semantic mark-up to be defined. It is therefore desirable to transform existing HTML pages into an XML format. In this paper, we present a methodology and a tool for transformation from HTML to XML. The transformation consists of three steps: (i) reformatting the HTML pages, (ii) deriving a Web-schema, and (iii) using a mapping tool to perform the transformation. We built the mapping tool used in the last step for this purpose. It is a semi-automatic tool that takes HTML pages and their Web-schemas and transforms them into an XML format.
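The three steps can be illustrated with a minimal Python sketch. It is not the authors' tool. The class `RowCollector`, the function `html_table_to_xml`, and the dictionary layout used for the schema are all hypothetical names chosen for illustration, and the sketch assumes the Web-schema is supplied by hand and covers only a simple HTML table. Only the Python standard library is used.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET


class RowCollector(HTMLParser):
    """Step (i), simplified: read loosely written HTML and collect
    the text of each table cell, row by row."""

    def __init__(self):
        super().__init__()
        self.rows = []          # one list of cell texts per <tr>
        self._in_cell = False

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.rows.append([])
        elif tag in ("td", "th") and self.rows:
            # A new cell starts even if the previous one was never closed.
            self.rows[-1].append("")
            self._in_cell = True

    def handle_endtag(self, tag):
        if tag in ("td", "th", "tr"):
            self._in_cell = False

    def handle_data(self, data):
        if self._in_cell:
            self.rows[-1][-1] += data.strip()


def html_table_to_xml(html, schema):
    """Steps (ii) and (iii), simplified: `schema` is a hand-written stand-in
    for a Web-schema (hypothetical format) that names the XML root element,
    the per-row element, and one element per table column."""
    parser = RowCollector()
    parser.feed(html)
    parser.close()

    root = ET.Element(schema["root"])
    for cells in parser.rows:
        record = ET.SubElement(root, schema["record"])
        for name, text in zip(schema["fields"], cells):
            ET.SubElement(record, name).text = text
    return ET.tostring(root, encoding="unicode")


# Hypothetical input page and schema, for illustration only.
page = "<table><tr><td>XML Basics<td>A. Author</tr></table>"
schema = {"root": "books", "record": "book", "fields": ["title", "author"]}
print(html_table_to_xml(page, schema))
```

In this sketch the presentational `<td>` cells are replaced by elements whose names carry meaning (`title`, `author`), which is the kind of structural and semantic mark-up the abstract refers to. A person still has to supply the schema, so the sketch is semi-automatic in the same loose sense.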

Keywords


HTML; XML; HTML mapping to XML; Web pages; Web schemas; XML Tags; Case Tools

Citation Format:
J. Wenny Rahayu, Lydia Bishay, David Taniar, "Transformation from HTML to XML: Methodology and Tool," Journal of Internet Technology, vol. 4, no. 4, pp. 229-236, Oct. 2003.

Published by Executive Committee, Taiwan Academic Network, Ministry of Education, Taipei, Taiwan, R.O.C
JIT Editorial Office, Office of Library and Information Services, National Dong Hwa University
No. 1, Sec. 2, Da Hsueh Rd., Shoufeng, Hualien 974301, Taiwan, R.O.C.
Tel: +886-3-931-7314  E-mail: jit.editorial@gmail.com