gives a dataset of information about entities, dates, quantities and other content-related elements found in text.
includes the property props for each object in the dataset produced.
Details and Options
- In TextContents[text,…], text can be a string, a file with plain text represented by File[…], a ContentObject expression or a list of these text objects.
- TextContents[{text1,text2,…},…] gives cases for each texti.
- Identification type form can be:
-
"type" any text content type (e.g. "Noun", "City") Entity[…,…] a specific entity of a text content type form1form2… - form matching any of the formi
Containing[outer,inner] forms of type outer containing type inner Verbatim["string"] a specific string to be matched exactly pattern a string pattern to be matched Automatic entities, dates, quantities and other content-related elements - Possible choices for the property prop include:
-
"String" string of the identified text (default) "Position" start and end position of the string in text "Probability" estimated probability that the identification is correct "Type" type of content (entity type, …) "Interpretation" standard interpretation of the identified string "Snippet" a snippet around the identified string "HighlightedSnippet" a snippet with the identified string highlighted All all the preceding properties {prop1,prop2,…} a list of property specifications - The following options can be given:
-
AcceptanceThreshold Automatic minimum probability to accept identification TargetDevice "CPU" whether CPU or GPU computation should be used for entity detection VerifyInterpretation False whether interpretability should be verified
Examples
open allclose allBasic Examples (1)Summary of the most common use cases
https://wolfram.com/xid/0d6ftmvbbt-e90bf1
Only get the results for locations:
https://wolfram.com/xid/0d6ftmvbbt-7hz0ud
Only get the results for locations and quantities:
https://wolfram.com/xid/0d6ftmvbbt-4fep64
Get interpretations for all cases:
https://wolfram.com/xid/0d6ftmvbbt-yvw7x7
Get a specified set of properties for entities:
https://wolfram.com/xid/0d6ftmvbbt-uwjjqa
Options (2)Common values & functionality for each option
AcceptanceThreshold (1)
By default, all the detected entities have an estimated probability higher than 0.5:
https://wolfram.com/xid/0d6ftmvbbt-bt3f6g
Get only the entities that are highly probable to be correct by setting a high AcceptanceThreshold:
https://wolfram.com/xid/0d6ftmvbbt-r99ohu
VerifyInterpretation (1)
By default, some entities cannot be interpreted, either because they are not correct or because they are not yet in the knowledgebase:
https://wolfram.com/xid/0d6ftmvbbt-4kphdi
Use VerifyInterpretation to filter out the entities that cannot be interpreted:
https://wolfram.com/xid/0d6ftmvbbt-2rnck2
Properties & Relations (1)Properties of the function, and connections to other functions
TextContents handles the same types as TextPosition and TextCases and always identifies the same substrings as these functions for a given type:
https://wolfram.com/xid/0d6ftmvbbt-06jihr
https://wolfram.com/xid/0d6ftmvbbt-vlw3g0
https://wolfram.com/xid/0d6ftmvbbt-mf2vdb
A dataset that is similar to the output of TextContents can be obtained using TextCases:
https://wolfram.com/xid/0d6ftmvbbt-158xfy
https://wolfram.com/xid/0d6ftmvbbt-9g1wob
Neat Examples (1)Surprising or curious use cases
Load the text of a Wikipedia page about the Moon:
https://wolfram.com/xid/0d6ftmvbbt-yk175x
https://wolfram.com/xid/0d6ftmvbbt-zh9624
Extract notable text contents from the page:
https://wolfram.com/xid/0d6ftmvbbt-36ly2k
Visualize the frequency of content types found on the page:
https://wolfram.com/xid/0d6ftmvbbt-4bjx2y
https://wolfram.com/xid/0d6ftmvbbt-dregh4
Find potential notable persons identified on the page:
https://wolfram.com/xid/0d6ftmvbbt-z0fqea
Interpret these persons as entities:
https://wolfram.com/xid/0d6ftmvbbt-oa8av0
Visualize occupations of these persons:
https://wolfram.com/xid/0d6ftmvbbt-07a6xe
Wolfram Research (2019), TextContents, Wolfram Language function, https://reference.wolfram.com/language/ref/TextContents.html.
Text
Wolfram Research (2019), TextContents, Wolfram Language function, https://reference.wolfram.com/language/ref/TextContents.html.
Wolfram Research (2019), TextContents, Wolfram Language function, https://reference.wolfram.com/language/ref/TextContents.html.
CMS
Wolfram Language. 2019. "TextContents." Wolfram Language & System Documentation Center. Wolfram Research. https://reference.wolfram.com/language/ref/TextContents.html.
Wolfram Language. 2019. "TextContents." Wolfram Language & System Documentation Center. Wolfram Research. https://reference.wolfram.com/language/ref/TextContents.html.
APA
Wolfram Language. (2019). TextContents. Wolfram Language & System Documentation Center. Retrieved from https://reference.wolfram.com/language/ref/TextContents.html
Wolfram Language. (2019). TextContents. Wolfram Language & System Documentation Center. Retrieved from https://reference.wolfram.com/language/ref/TextContents.html
BibTeX
@misc{reference.wolfram_2024_textcontents, author="Wolfram Research", title="{TextContents}", year="2019", howpublished="\url{https://reference.wolfram.com/language/ref/TextContents.html}", note=[Accessed: 09-January-2025
]}
BibLaTeX
@online{reference.wolfram_2024_textcontents, organization={Wolfram Research}, title={TextContents}, year={2019}, url={https://reference.wolfram.com/language/ref/TextContents.html}, note=[Accessed: 09-January-2025
]}