Wolfram Language & System Documentation Center

MissingValueSynthesis

is an option for functions such as Classify that specifies how missing values should be replaced.

Details

Missing value synthesis, also known as missing imputation, is done by conditioning a distribution on known values, as in SynthesizeMissingValues.
Missing values are typically represented by Missing[…].
MissingValueSynthesis can be used at training time, inference time or to update the synthesizer of an existing model.
Classify[data,…,MissingValueSynthesissynth] can be used to specify a missing synthesis method or model for training (and similarly for other training functions).
ClassifierFunction[…][example,MissingValueSynthesissynth] can be used to temporarily overwrite the synthesis method during classifier inference (and similarly for other machine learning models).
Classify[ClassifierFunction[…],MissingValueSynthesissynth] can be used to overwrite the internal missing synthesizer of the classifier (and similarly for other machine learning models).
Possible settings for MissingValueSynthesis include:

	Automatic	automatically choose distribution method and synthesis strategy
	None	do not use any missing synthesizer
	method	use the specified method
	strategy	how to synthesize from the distribution
	assoc	specify both distribution method and synthesis strategy

Possible settings for method include:

	Automatic	automatically choose the distribution method
	"Multinormal"	use a multivariate normal (Gaussian) distribution
	"ContingencyTable"	discretize data and store each possible probability
	"KernelDensityEstimation"	use a kernel mixture distribution
	"DecisionTree"	use a decision tree to compute probabilities
	"GaussianMixture"	use a mixture of Gaussian (normal) distributions
	LearnedDistribution[…]	use the specified distribution

Possible settings for strategy include:

	Automatic	automatically choose the synthesis strategy
	"RandomSampling"	randomly sample from the conditioned distribution
	"ModeFinding"	attempt to find the mode of the conditioned distribution

In the form Methodassoc, the association assoc should be of the form <|"LearningMethod"method,"EvaluationStrategy"strategy|>.

Examples

Basic Examples (2)

Train a predictor with two input features:

Wolfram Language code:

x = {{1, 3}, {2, 4}, {3, 5}, {4, 4}, {5, 8}, {6, 9}, {7, 4}, {8, 6}, {9, 12}};
y = {2, 4, 5, 4, 6, 7, 4, 5, 9};
p = Predict[x -> y]

Get the prediction for an example that has a missing value:

Wolfram Language code: p[{5, Missing[]}]

Set the missing value synthesis to replace missing variables with their most likely value given known values (which is the default behavior):

Wolfram Language code: p[{5, Missing[]}, MissingValueSynthesis -> "ModeFinding"]

Replace missing variables with random samples conditioned on known values:

Wolfram Language code: p[{5, Missing[]}, MissingValueSynthesis -> "RandomSampling"]

Averaging over many random imputations is usually the best strategy and allows obtaining the uncertainty caused by the imputation:

Wolfram Language code:

MeanAround[Table[p[{5, Missing[]}, MissingValueSynthesis -> "RandomSampling"], 10]]
MeanAround[Table[p[{5, Missing[]}, MissingValueSynthesis -> "RandomSampling"], 100]]

Specify a learning method during training to control how the distribution of data is learned:

Wolfram Language code: p = Predict[x -> y, MissingValueSynthesis -> "KernelDensityEstimation"]

Predict an example with missing values using the "KernelDensityEstimation" distribution to condition values:

Wolfram Language code: p[{5, Missing[]}]

Provide an existing LearnedDistribution at training to use it when imputing missing values during training and later evaluations:

Wolfram Language code:

dist = LearnDistribution[x, Method -> "Multinormal"];
p = Predict[x -> y, MissingValueSynthesis -> dist];
p[{5, Missing[]}]

Specify an existing LearnedDistribution to synthesize missing values for an individual evaluation:

Wolfram Language code:

dist2 = LearnDistribution[x, Method -> "KernelDensityEstimation"];
p[{5, Missing[]}, MissingValueSynthesis -> dist2]

Control both the learning method and the evaluation strategy by passing an association at training:

Wolfram Language code:

p = Predict[x -> y, MissingValueSynthesis -> 
	<|"LearningMethod" -> "Multinormal", "EvaluationStrategy" -> "RandomSampling"|>];
p[{5, Missing[]}]

Train a classifier with two input features:

Wolfram Language code:

x = {{1, 3}, {2, 4}, {3, 5}, {4, 4}, {5, 8}, {6, 9}, {7, 4}, {8, 6}, {9, 12}};
y = {"A", "B", "A", "B", "B", "B", "A", "B", "A"};
c = Classify[x -> y]

Get class probabilities for an example that has a missing value:

Wolfram Language code: c[{5, Missing[]}, "Probabilities"]

Set the missing value synthesis to replace missing variables with their most likely value given known values (which is the default behavior):

Wolfram Language code: c[{5, Missing[]}, "Probabilities", MissingValueSynthesis -> "ModeFinding"]

Replace missing variables with random samples conditioned on known values:

Wolfram Language code: c[{5, Missing[]}, "Probabilities", MissingValueSynthesis -> "RandomSampling"]

Averaging over many random imputations is usually the best strategy and allows obtaining the uncertainty caused by the imputation:

Wolfram Language code: MeanAround[Table[c[{5, Missing[]}, "Probabilities", MissingValueSynthesis -> "RandomSampling"], 100]]

Top

More Learning

Tech Support

Wolfram Solutions

Wolfram Solutions For Education

Get Started

Grow Your Skills

Work with Us

Educational Programs for Adults

Educational Programs for Youth

Read

MissingValueSynthesis

Details

Examples

Basic Examples (2)

Text

CMS

APA

BibTeX

BibLaTeX

MissingValueSynthesis

Details

Examples

Basic Examples (2)

See Also

Related Guides

History

Text

CMS

APA

BibTeX

BibLaTeX