Use WebExecute to get the rendered text content of a node and its descendants.
Using JavaScript Directly
Begin the session
Use StartWebSession to begin the session:
- If no browser is supplied to StartWebSession, it will default to Google Chrome.
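As a minimal sketch of this step, the session can be started with or without naming a browser. The variable name session and the choice of "Firefox" as the alternative are illustrative assumptions, not taken from the original notebook.

```wolfram
(* Start a session with the default browser (Google Chrome, per the note above). *)
session = StartWebSession[];

(* To name the browser explicitly, pass it as a string instead, for example:
   session = StartWebSession["Firefox"]  *)

(* Close the browser again once it is no longer needed. *)
DeleteObject[session];
```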
Extract text
Open the page you would like to get text from:
Use the "JavascriptExecute" command to run JavaScript directly in the page and return the value of the innerText property of the document body:
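The steps so far might look roughly like the following. This is a sketch under assumptions: the URL is a placeholder rather than the page used in the original, and the script uses a return statement on the assumption that this is how the value is handed back to the Wolfram Language.

```wolfram
(* Placeholder URL; substitute the page you want to read. *)
url = "https://www.wolfram.com";

session = StartWebSession[];

(* Load the page in the controlled browser. *)
WebExecute[session, "OpenPage" -> url];

(* innerText of the body is the rendered text of the body node and its descendants. *)
text = WebExecute[session, "JavascriptExecute" -> "return document.body.innerText;"];

(* Close the browser once the text has been retrieved. *)
DeleteObject[session];
```

If the result is not a string, the script probably needs adjusting for the page in question before moving on to the analysis steps.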
Use Select to remove digit characters and non-English words:
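One way to write such a filter is sketched below on a short sample string, so that it can be tried without a browser. The sample text and the use of TextWords and DictionaryWordQ are assumptions; the original notebook may apply a different test inside Select.

```wolfram
(* Hypothetical sample standing in for the text extracted from the page. *)
text = "Version 12 was released in 2019 with many new functions";

(* Split the text into words. *)
words = TextWords[text];

(* Keep only tokens made entirely of letters that are also recognized dictionary words. *)
clean = Select[words, StringMatchQ[#, LetterCharacter ..] && DictionaryWordQ[#] &]
```

DictionaryWordQ may need to download dictionary data the first time it is used.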
Analyze the text
Use ToLowerCase to reduce duplication of words and DeleteStopwords to remove prepositions and other similar words from analysis:
Use WordCloud to create a word cloud of frequently used nontrivial words on the webpage:
Use StringRiffle to concatenate the words into a single string, separating them with spaces:
Use WordCounts to count the number of times each word appears in the string, and take the five most frequently used words:
Use BarChart to visualize the frequency of words:
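The analysis steps above can be sketched end to end on a small sample list. The sample words and the variable names are hypothetical, and TakeLargest is one possible way to pick the most frequent words, not necessarily the one used in the original notebook.

```wolfram
(* Hypothetical sample standing in for the cleaned word list from the page. *)
words = {"The", "Wolfram", "Language", "is", "a", "language", "for", "the",
   "web", "and", "the", "language", "of", "notebooks"};

(* Lowercase first so that "Language" and "language" are counted together,
   then drop common function words. *)
keywords = DeleteStopwords[ToLowerCase[words]];

(* Word cloud sized by how often each word occurs. *)
WordCloud[keywords]

(* Join into one space-separated string and count each word. *)
counts = WordCounts[StringRiffle[keywords, " "]];

(* Keep at most the five most frequent words. *)
top = TakeLargest[counts, UpTo[5]];

(* Bar chart of the counts, labeled with the corresponding words. *)
BarChart[Values[top], ChartLabels -> Keys[top]]
```

Using UpTo[5] avoids an error on pages that yield fewer than five distinct words.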
Close the session
Use DeleteObject to terminate the web session process:
Using WebExecute Commands Related to Elements of Webpages
Begin the session
Use StartWebSession to begin the session:
- If no browser is supplied to StartWebSession, it will default to Google Chrome.
Extract text
Open the page you would like to get text from:
Use the "LocateElements" command to locate the element whose id attribute is "content":
- An id value should be unique within a page, so this should return a single WebElementObject.
Use the "ElementText" command to get the text of that element:
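A rough sketch of the element-based approach follows. The URL is a placeholder for a page that actually contains an element with the id "content", and the variable names are illustrative.

```wolfram
(* Placeholder: use a page that has an element with id "content". *)
url = "https://en.wikipedia.org/wiki/Main_Page";

session = StartWebSession[];
WebExecute[session, "OpenPage" -> url];

(* Locate by id; the result is expected to be a list of WebElementObject expressions. *)
elements = WebExecute[session, "LocateElements" -> "Id" -> "content"];

(* Read the rendered text of the first (and normally only) match. *)
text = WebExecute[session, "ElementText" -> First[elements]];

(* Close the browser once the text has been retrieved. *)
DeleteObject[session];
```

If the page has no such element, the list is empty and First fails, so check Length[elements] first when working with unfamiliar pages.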
Use Select to remove digit characters and non-English words:
Analyze the text
Use ToLowerCase to reduce duplication of words and DeleteStopwords to remove prepositions and other similar words from analysis:
Use WordCloud to create a word cloud of frequently used nontrivial words on the webpage:
Use StringRiffle to concatenate the words into a single string, separating them with spaces:
Use WordCounts to count the number of times each word appears in the string, and take the five most frequently used words:
Use BarChart to visualize the frequency of words:
Close the session
Use DeleteObject to terminate the web session process: