Wolfram Language & System Documentation Center

Analyze the Text on a Webpage

WORKFLOW

Import text from a webpage

Get the text from a webpage as a string:
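The original input cell is not preserved here. A minimal sketch, assuming an illustrative URL (the address below is a placeholder choice, not necessarily the page used in the original workflow), imports the page with the "Plaintext" element so that markup is dropped and a single string is returned:

```wolfram
(* Assumption: any publicly reachable page works; this URL is only an example *)
url = "https://en.wikipedia.org/wiki/Moon";

(* "Plaintext" asks Import for the page's text content as one string *)
text = Import[url, "Plaintext"];

(* Confirm that a string came back and see how long it is *)
{StringQ[text], StringLength[text]}
```

The trailing semicolon on the Import line suppresses output, which is useful because the full page text can be long.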

This is the beginning of the imported text:
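The original output is not preserved here. One way to preview the start of the imported string, sketched with an assumed example URL and an arbitrary cutoff of 200 characters:

```wolfram
(* Assumption: example URL; substitute the page you want to analyze *)
text = Import["https://en.wikipedia.org/wiki/Moon", "Plaintext"];

(* Show only the first 200 characters; UpTo avoids an error on short pages *)
StringTake[text, UpTo[200]]
```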

Find common words

Find the 10 most common nontrivial words on the webpage (that is, excluding stopwords such as "the" and "of") and the number of times they occur:
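The original input cell is not preserved here. A possible sketch, assuming an example URL, removes stopwords with DeleteStopwords, tallies the remaining words with WordCounts, and keeps the 10 largest counts with TakeLargest. Lowercasing is an added choice so that capitalized and uncapitalized forms are counted together:

```wolfram
(* Assumption: example URL; substitute the page you want to analyze *)
text = Import["https://en.wikipedia.org/wiki/Moon", "Plaintext"];

(* Drop stopwords, then normalize case so "Moon" and "moon" merge *)
cleaned = ToLowerCase[DeleteStopwords[text]];

(* WordCounts gives an association word -> count; keep the 10 largest *)
TakeLargest[WordCounts[cleaned], 10]
```

The result is an association from each word to its count, ordered from most to least frequent.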

Make a word cloud

Make a word cloud of the text:
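The original input cell is not preserved here. A minimal sketch, assuming an example URL: WordCloud accepts a string directly and sizes each word by its frequency. Removing stopwords first is a choice made in this sketch to keep function words from dominating the picture:

```wolfram
(* Assumption: example URL; substitute the page you want to analyze *)
text = Import["https://en.wikipedia.org/wiki/Moon", "Plaintext"];

(* Word size in the cloud reflects how often the word occurs *)
WordCloud[DeleteStopwords[text]]
```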

Notes

The text of Wikipedia pages can be easily extracted using WikipediaData, which automatically strips page contents that are not text:
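The original input cell is not preserved here. As an illustration, with the article name "Moon" chosen only as an example, WikipediaData returns the plain text of an article without the need to import and clean the HTML page:

```wolfram
(* Assumption: "Moon" is an arbitrary example article title *)
wikiText = WikipediaData["Moon"];

(* Preview the first 200 characters of the article text *)
StringTake[wikiText, UpTo[200]]
```

The resulting string can be passed to WordCounts or WordCloud in the same way as text obtained with Import.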
