
List Words
Summary

This tool lists the words found within a specified tag, based on user-selected criteria. The results can be displayed alphabetically, in reverse alphabetical order, by frequency, or by order of appearance. If no tag is specified, the <body> tag is used.

Note: If the words to list come from a typed-in word list, an uploaded words file, or the Glasgow list, the results follow the order in which the words were entered in the corresponding field. In that case the sort criteria do not apply.
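
The four sort orders described above can be sketched in a few lines of Python. This is only an illustration, not the tool's own code; the function name `list_words` and the order names are hypothetical, and breaking frequency ties alphabetically is an assumption.

```python
from collections import Counter

def list_words(words, order="frequency"):
    """Return (word, count) pairs in the requested order.

    The order names are hypothetical labels for the tool's Sort options.
    """
    counts = Counter(words)  # keeps first-appearance order (Python 3.7+)
    if order == "alphabetical":
        return sorted(counts.items())
    if order == "reverse":
        return sorted(counts.items(), reverse=True)
    if order == "frequency":
        # Most frequent first; ties broken alphabetically (an assumption).
        return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    if order == "appearance":
        return list(counts.items())
    raise ValueError(f"unknown order: {order!r}")

print(list_words(["b", "a", "c", "c"], order="frequency"))
```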


Walkthrough

Example: fetch HTML from http://www.w3.org/; extract HTML between <body> and </body> tags; filter words that appear in the Glasgow stop words list; sort results by frequency.
  1. Source text
    1. Enter `http://www.w3.org/' in the URL field.
  2. Subtext limited to
    1. Enter `body' in the Elements field.
    2. Select Words not in the list below.
    3. Select Use Glasgow stop_words list.
  3. Results
    1. Select By Frequency in the Sort drop-down menu.
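
The walkthrough above amounts to a small pipeline: take the text inside one element, drop stop words, and count what remains. The Python sketch below illustrates that idea with the standard library. It is not the tool's implementation: the class and function names are hypothetical, the tokenizing rule (runs of letters and apostrophes) is an assumption, and `STOP_WORDS` is a tiny stand-in, not the Glasgow list.

```python
import re
from collections import Counter
from html.parser import HTMLParser

class ElementText(HTMLParser):
    """Collect the text that appears inside a given element."""

    def __init__(self, tag):
        super().__init__()
        self.tag = tag
        self.depth = 0      # > 0 while inside the chosen element
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag == self.tag:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag == self.tag and self.depth > 0:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth > 0:
            self.chunks.append(data)

# Stand-in stop list for illustration only (NOT the Glasgow list).
STOP_WORDS = {"the", "of", "and", "a", "to", "in", "is"}

def words_by_frequency(html, tag="body", stop_words=STOP_WORDS):
    parser = ElementText(tag)
    parser.feed(html)
    parser.close()
    text = " ".join(parser.chunks)
    words = re.findall(r"[a-z']+", text.lower())  # assumed tokenizer
    kept = [w for w in words if w not in stop_words]
    return Counter(kept).most_common()  # most frequent first

page = ("<html><head><title>The title</title></head>"
        "<body><p>The web is the web of standards</p></body></html>")
print(words_by_frequency(page))
```

Note that this sketch also counts text inside <script> and <style> elements when they fall within the chosen tag; a fuller version would skip them.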
» Source text *
  Example: http://en.wikipedia.org/wiki/Socrates

Summary

Determines the HTML source. HTML can be obtained from a URL or by uploading a file.

Fields

Source URL
HTML from the entered URL will be used as the data source for the analysis.

Local file
Use this field to upload a local HTML file for analysis.
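
A rough Python equivalent of these two source options, using only the standard library, might look like the sketch below. The function name `load_html` and its parameters are hypothetical, and the fallback to UTF-8 when no character set is declared is an assumption.

```python
from pathlib import Path
from urllib.request import urlopen

def load_html(url=None, path=None, encoding="utf-8"):
    """Return HTML text from a URL or from a local file (exactly one)."""
    if (url is None) == (path is None):
        raise ValueError("provide exactly one of url or path")
    if url is not None:
        with urlopen(url, timeout=30) as resp:
            # Prefer the charset declared in the HTTP headers, if any.
            charset = resp.headers.get_content_charset() or encoding
            return resp.read().decode(charset, errors="replace")
    return Path(path).read_text(encoding=encoding, errors="replace")
```

For example, `load_html(url="http://www.w3.org/")` would fetch the page used in the walkthrough, while `load_html(path="page.html")` would read a local copy.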
» Subtext limited to
(separate multiple elements with a `,')
(separate words with a `,')
Summary

Limits included text to text that appears between specific tags. Furthermore, words can be filtered using a variety of word filtering techniques.

Fields

All words
Words will not be filtered.

Words in the stop list below
Words that do not appear in the stop list defined below will be filtered.

Words not in the stop list below
Words that appear in the stop list defined below will be filtered.

Words matching pattern
Words matching the entered regular expression will not be filtered.

Word list typed in
Only words entered in the text field are kept; all other words are filtered.

Text file with words
Only words found in the uploaded file are kept; all other words are filtered. Words in the file should be delimited by commas.

Use Glasgow stop_words list
Words found in the Glasgow list will be filtered.
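
The filter options above reduce to a keep-or-drop decision for each word. The Python sketch below illustrates the first four. It is an illustration rather than the tool's code: the function name and mode names are hypothetical, and treating a pattern as matching anywhere in the word (`re.search`) is an assumption.

```python
import re

def filter_words(words, mode="all", word_list=(), pattern=None):
    """Return the words that survive filtering, in their original order.

    Mode names are hypothetical labels for the options described above.
    """
    listed = set(word_list)
    if mode == "all":
        return list(words)                            # nothing is filtered
    if mode == "in_list":
        return [w for w in words if w in listed]      # keep listed words
    if mode == "not_in_list":
        return [w for w in words if w not in listed]  # drop listed words
    if mode == "pattern":
        regex = re.compile(pattern)
        # Assumption: a match anywhere in the word counts as matching.
        return [w for w in words if regex.search(w)]
    raise ValueError(f"unknown mode: {mode!r}")

words = ["the", "cat", "sat", "on", "the", "mat"]
print(filter_words(words, "not_in_list", word_list=["the", "on"]))
```

A typed-in list, an uploaded file, and the Glasgow list would all feed `word_list` in this sketch; only the origin of the list differs.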
» Results

Summary

Allows the user to choose how the results will be formatted and whether they should be displayed in a new browser window.

Fields

Sort
Allows you to sort the results in one of several ways.

Display as
Determines the format in which results will be delivered.

Open results in new window
Checking this box displays the results in a new window. This option is selected by default. Pop-up blockers may prevent the new window from opening; if that happens, deselect this option.
`*' indicates a required field

 

 

TAPoRware Project, McMaster University,