Web Page Cleaner

This tool removes all formatting from an HTML document, either fetched from a specified web address or uploaded from the user’s files, and leaves the text for further processing.

Please click the ? buttons at the bottom right of each set of options for more information on that set.

For further information on this tool, please see the Web Page Cleaner entry on the TADA Wiki. A glossary of terms is also available there.


To remove all HTML formatting from a web page and convert it to a plain text document:
  1. Source text
    1. Enter the page’s full web address into the ‘URL’ field.
  2. Results
    1. Select ‘Convert to plain text’ from the ‘Clean option’ drop-down list.
  3. Click the ‘Submit’ button to process the text.
» Source HTML


This section determines the source of the document you wish the tool to process. HTML can be obtained either from a web address or by uploading a file.


Source URL
To use content from a web page, enter a full web address (URL) in the field provided. Copy and paste from your browser’s address bar for best results.
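As a rough illustration of what a "full" web address means here, the sketch below checks that an address carries both a scheme and a host name. The function name `is_full_url` is hypothetical, and the restriction to `http` and `https` is an assumption; this page does not describe how the tool itself validates the field.

```python
from urllib.parse import urlparse


def is_full_url(address: str) -> bool:
    """Return True if the address has an http(s) scheme and a host name.

    Hypothetical helper: the accepted schemes are an assumption, not the
    tool's documented behaviour.
    """
    parts = urlparse(address.strip())
    return parts.scheme in ("http", "https") and bool(parts.netloc)
```

An address typed without its scheme, such as `www.example.org/page.html`, fails this check, which is why copying from the browser's address bar tends to be more reliable.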

Local file
To upload an HTML (.html) file from your computer, choose ‘Local file,’ click ‘Browse,’ and select the file you wish to use from your directory.
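For readers preparing files offline, a minimal sketch of loading a local .html file before processing might look like the following. The name `read_html_file` is hypothetical, and both the UTF-8 decoding and the replacement of undecodable bytes are assumptions; the tool's own handling of uploads is not documented on this page.

```python
from pathlib import Path


def read_html_file(path: str) -> str:
    """Read a local .html file and return its contents as text.

    Hypothetical helper: assumes UTF-8 and replaces undecodable bytes
    rather than failing.
    """
    file_path = Path(path)
    if file_path.suffix.lower() != ".html":
        raise ValueError(f"expected an .html file, got: {file_path.name}")
    return file_path.read_bytes().decode("utf-8", errors="replace")
```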
» Results

This section allows the user to choose how the tool's final results will be displayed.


Clean option
Use this drop-down list to choose between stripping all tags from the source text and converting it to plain text.
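The difference between the two options can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the tool's implementation: it assumes that stripping tags simply drops the markup and leaves entities and spacing alone, while converting to plain text also decodes entities, puts block elements on their own lines, and tidies whitespace. The names `clean_html`, `SKIP_TAGS`, and `BLOCK_TAGS` are hypothetical, and both modes are assumed to discard script and style content.

```python
import re
from html.parser import HTMLParser

# Assumed tag sets; the tool's real lists are not documented on this page.
SKIP_TAGS = {"script", "style"}
BLOCK_TAGS = {"title", "p", "div", "br", "li", "tr",
              "h1", "h2", "h3", "h4", "h5", "h6"}


class _Cleaner(HTMLParser):
    def __init__(self, plain_text):
        # convert_charrefs=True decodes entities such as &amp; into characters.
        super().__init__(convert_charrefs=plain_text)
        self.plain_text = plain_text
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in SKIP_TAGS:
            self._skip_depth += 1
        elif self.plain_text and tag in BLOCK_TAGS:
            self.parts.append("\n")

    def handle_endtag(self, tag):
        if tag in SKIP_TAGS:
            self._skip_depth = max(0, self._skip_depth - 1)
        elif self.plain_text and tag in BLOCK_TAGS:
            self.parts.append("\n")

    def handle_data(self, data):
        if not self._skip_depth:
            self.parts.append(data)

    # Only called when convert_charrefs is False: keep entities untouched.
    def handle_entityref(self, name):
        if not self._skip_depth:
            self.parts.append(f"&{name};")

    def handle_charref(self, name):
        if not self._skip_depth:
            self.parts.append(f"&#{name};")


def clean_html(html, plain_text=True):
    """Strip tags only (plain_text=False) or convert to tidy plain text."""
    cleaner = _Cleaner(plain_text)
    cleaner.feed(html)
    cleaner.close()
    text = "".join(cleaner.parts)
    if not plain_text:
        return text
    # Collapse runs of spaces and tabs, then drop empty lines.
    lines = (re.sub(r"[ \t]+", " ", line).strip() for line in text.splitlines())
    return "\n".join(line for line in lines if line)
```

Under these assumptions, tag stripping leaves text from adjacent elements run together, whereas plain-text conversion yields one readable line per block element.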

Open results in new window
Check this box to display the results in a new window or browser tab. This option is selected by default. Some pop-up blockers may prevent a new window from opening; if so, uncheck the box to open the results in the same window instead.



TAPoRware Project, McMaster University,