LetterCounts

LetterCounts["string"]

gives an association whose keys are the distinct letters in string, and whose values give the number of times those letters appear in string.

LetterCounts["string",n]

gives counts of the distinct n-grams consisting of runs of n letters in string.

LetterCounts["string",n,{"c1","c2",}]

allows the characters ci to appear in n-grams, in addition to ordinary letters.

LetterCounts[{"string1","string2",},]

gives the counts for each of the stringi.

Details and Options

  • LetterCounts[string,n] considers n-grams that start at every position in string and includes them in its counts if they contain only letters.
  • LetterCounts effectively uses LetterQ to determine whether to consider a character to be a letter.
  • LetterCounts has the option IgnoreCase. With the setting IgnoreCase->True, letters are in effect all converted to lower case before being counted.
  • LetterCounts[BioSequence["type","seq"]] will give the letters of the string "seq".

Examples

open allclose all

Basic Examples  (4)

Count the distinct letters in a string:

Non-letter characters are excluded from letter counts:

Count the distinct 2-gram letter sequences in a string:

Count the distinct letters of each string in a list:

Scope  (4)

Compute all 5-grams of letters in a string:

The 5-grams start in positions 1 through 9 of the string:

Specify a list of non-letter characters to be counted:

Count the distinct 2-gram letter sequences of each string in a list:

Find the letters of a biomolecular sequence:

Options  (2)

IgnoreCase  (2)

The default setting IgnoreCase->False counts uppercase and lowercase characters independently:

Use IgnoreCase->True to count all occurrences of a letter, regardless of case:

Count n-grams regardless of case:

Applications  (1)

Count the characters in the full English names of each of the first 10 positive integers:

Merge the counts:

Properties & Relations  (1)

LetterCounts counts the distinct letter characters in a string:

CharacterCounts additionally includes numbers, punctuation and whitespace:

Neat Examples  (1)

Find the 20 most frequently occurring letters in a body of text:

Do the same for 3-letter sequences:

Wolfram Research (2015), LetterCounts, Wolfram Language function, https://reference.wolfram.com/language/ref/LetterCounts.html (updated 2024).

Text

Wolfram Research (2015), LetterCounts, Wolfram Language function, https://reference.wolfram.com/language/ref/LetterCounts.html (updated 2024).

CMS

Wolfram Language. 2015. "LetterCounts." Wolfram Language & System Documentation Center. Wolfram Research. Last Modified 2024. https://reference.wolfram.com/language/ref/LetterCounts.html.

APA

Wolfram Language. (2015). LetterCounts. Wolfram Language & System Documentation Center. Retrieved from https://reference.wolfram.com/language/ref/LetterCounts.html

BibTeX

@misc{reference.wolfram_2024_lettercounts, author="Wolfram Research", title="{LetterCounts}", year="2024", howpublished="\url{https://reference.wolfram.com/language/ref/LetterCounts.html}", note=[Accessed: 22-December-2024 ]}

BibLaTeX

@online{reference.wolfram_2024_lettercounts, organization={Wolfram Research}, title={LetterCounts}, year={2024}, url={https://reference.wolfram.com/language/ref/LetterCounts.html}, note=[Accessed: 22-December-2024 ]}