Wolfram Language & System Documentation Center

TrainingProgressMeasurements

is an option for NetTrain that specifies measurements to make while training is in progress.

Details

In TrainingProgressMeasurements->spec, the following forms for spec are allowed:

	"measurement"	a named, built-in measurement
	NetPort["output"]	the value of an output port of the net
	NetPort["tdata"]	the value of training data for the net
	NetPort[{lspec,"output"}]	the value of an interior activation of the net
	NetPort[{lspec,"weight"}]	the value of a weight array
	<\|"Measurement"spec,…\|>	a measurement with suboptions
	<\|"Measurement"Function[…],…\|>	a custom function to measure

Setting TrainingProgressMeasurements{spec₁,spec₂,…} will result in multiple measurements being made.
With the default setting of TrainingProgressMeasurementsAutomatic, the "ErrorRate" measurement will be automatically included for nets that are trained using a single CrossEntropyLossLayer.
For nets that contain a CrossEntropyLossLayer, the following built-in measurements are available:

	"Accuracy"	fraction of correctly classified examples
	"Accuracy"n	fraction of examples with the correct result in the top n
	"AreaUnderROCCurve"	area under the ROC curve for each class
	"CohenKappa"	Cohen's kappa coefficient
	"ConfusionMatrix"	counts c_ij of class i examples classified as class j
	"ConfusionMatrixPlot"	plot of the confusion matrix
	"Entropy"	entropy measured in nats
	"ErrorRate"	fraction of incorrectly classified examples
	"ErrorRate"n	fraction of examples with the incorrect result in the top n
	"F1Score"	F₁ score for each class
	"FScore"β	F_β score for each class
	"FalseDiscoveryRate"	false discovery rate for each class
	"FalseNegativeNumber"	number of false negative examples
	"FalseNegativeRate"	false negative rate for each class
	"FalseOmissionRate"	false omission rate for each class
	"FalsePositiveNumber"	number of false positive examples
	"FalsePositiveRate"	false positive rate for each class
	"Informedness"	informedness for each class
	"Markedness"	markedness for each class
	"MatthewsCorrelationCoefficient"	Matthews correlation coefficient for each class
	"NegativePredictiveValue"	negative predictive value for each class
	"Perplexity"	exponential of the entropy
	"Precision"	precision for each class
	"Recall"	recall rate for each class
	"ROCCurve"	receiver operating characteristics (ROC) curve for each class
	"ROCCurvePlot"	plot of the ROC curve
	"ScottPi"	Scott's pi coefficient
	"Specificity"	specificity for each class
	"TrueNegativeNumber"	number of true negative examples
	"TruePositiveNumber"	number of true positive examples

For nets that contain a MeanSquaredLossLayer or MeanAbsoluteLossLayer, the following built-in measurements are available:

	"FractionVarianceUnexplained"	the fraction of output variance left unexplained by the net
	"IntersectionOverUnion"	intersection over union for bounding boxes
	"MeanDeviation"	mean absolute value of the residuals
	"MeanSquare"	mean square of the residuals
	"RSquared"	coeficient of determination
	"StandardDeviation"	root mean square of the residuals

The pure function for the custom function measurement will be supplied with the association described in TrainingProgressFunction.
Suboptions can be specified for a given measurement using the association form <|"Measurement"measurement,opt₁val₁,…|>.
When the source of a measurement is ambiguous, the suboption "Source""outname" can be used to select a specific output port of the net from which to take the measurement.
The suboption "ClassAveraging"type can be specified for measurements that are calculated per-class, such as "Precision", "Recall", "F1Score" and "ROCCurve". Possible types include:

	None	return the per-class measurements (default)
	"Macro"	return the macro-average of the per-class measurements
	"Micro"	return the micro-average of the per-class measurements

The suboption "Direction"direction specifies which direction of change in the measurement is an improvement. Possible values for directions are:

	Automatic	measurement direction is chosen to correspond to improved accuracy (default)
	"Decreasing"	measurement should decrease
	"Increasing"	measurement should increase

The "Aggregation"aggregation option can be used to aggregate non-scalar NetPort measurements. Possible aggregations are:

	None	return the array unchanged (default)
	"L1Norm"	return the L1 norm of the flattened array
	"L2Norm"	return the L2 norm of the flattened array
	"Max"	return the maximum of the flattened array
	"Mean"	return the mean of the flattened array
	"Min"	return the minimum of the flattened array
	"RootMeanSquare"	return the root mean square of the flattened array
	"StandardDeviation"	return the standard deviation of the flattened array
	"Total"	return the sum of the flattened array

The "Interval"->"interval" option specifies how often to collect a measurement. Possible intervals are:

	Automatic	automatically choose an appropriate interval (default)
	"Batch"	collect the measurement every batch
	"Round"	collect the measurement every round

If a TrainingProgressFunction is supplied to NetTrain, associations of the most recent values of the requested measurements are available using the keys "RoundMeasurements" and "ValidationMeasurements".
Associations of the final value of all measurements can be obtained after training by specifying "RoundMeasurements" and "ValidationMeasurements" as properties in NetTrain[net,data,properties] or in a NetTrainResultsObject.
The default key used to access a particular measurement is based on the measurement specification:

	"measurement"	"measurement"
	"measurement"x	"measurement"
	NetPort["name"]	"name"
	NetPort[{part₁,part₂,…,"port"}]	"part₁/part₂/…/port"

The "Key"->"key" option can be used to specify a key for a measurement. This is primarily useful to uniquely identify measurements that have the same default key.
These measurement keys are also used for accessing round and validation measurements in TrainingStoppingCriterion.
When a textual reporting form is specified via TrainingProgressReporting, some measurements may be reported in aggregated form or not reported at all. For class-based measurements like "Precision", "Recall", "F1Score", etc., the macro-average is reported. For non-scalar NetPort[…] measurements, the scalar mean is reported. For "ROCCurve" and "ConfusionMatrix", no textual summary is reported.
The "PlotType"type suboption can be used to specify how to scale the axis of the measurement plot. Possible types are:

	Automatic	adaptively choose the scale as appropriate (default)
	"Linear"	plot the measurement on a linear scale
	"Log"	plot the measurement on a log scale

Examples

open all close all

Basic Examples (1)

Examine the accuracy for LeNet trained on FashionMNIST during training:

Wolfram Language code:

NetTrain[NetModel["LeNet"], ResourceData["FashionMNIST"], All, TrainingProgressMeasurements -> "Accuracy", MaxTrainingRounds -> 1]

Scope (7)

Examine the final validation precision and recall for LeNet trained on FashionMNIST:

Wolfram Language code:

NetTrain[NetModel["LeNet"], ResourceData["FashionMNIST"], "ValidationMeasurements", ValidationSet -> Scaled[0.1], MaxTrainingRounds -> 2, TrainingProgressMeasurements -> {"Precision", "Recall"}]

Examine the final validation and training confusion matrices for LeNet trained on FashionMNIST (note that the training progress panel will show the confusion matrix for the training data by default; click the plot to toggle between the two):

Wolfram Language code:

NetTrain[NetModel["LeNet"], ResourceData["FashionMNIST"], {"RoundMeasurements", "ValidationMeasurements"}, ValidationSet -> Scaled[0.1], MaxTrainingRounds -> 2, TrainingProgressMeasurements -> "ConfusionMatrixPlot"]

Examine the final-round, macro- and micro-averaged specificity for a small toy problem, using the "Key" option to uniquely identify the measurements:

Wolfram Language code:

net = NetChain[{10, 5, SoftmaxLayer[]}];
input = RandomReal[1, {100, 10}];
output = RandomInteger[{1, 5}, {100}];
NetTrain[net, <|"Input" -> input, "Output" -> output|>, "RoundMeasurements", TrainingProgressMeasurements -> {<|"Measurement" -> "Specificity", "ClassAveraging" -> "Macro", "Key" -> "SpecMacro"|>, <|"Measurement" -> "Specificity", "ClassAveraging" -> "Micro", "Key" -> "SpecMicro"|>}]

Examine the mean round output of an internal layer of the network by creating a new output port:

Wolfram Language code: netWithFoo = NetGraph[{4, 3, 2, 1, LogisticSigmoid}, {1 -> 2 -> 3 -> 4 -> 5 , 3 -> NetPort["Foo"]}]

Wolfram Language code:

input = RandomReal[1, {100, 4}];
output = RandomInteger[{0, 1}, {100}];
NetTrain[netWithFoo, <|"Input" -> input, "Output" -> output|>, "RoundMeasurements", RandomSeeding -> 42, TrainingProgressMeasurements -> NetPort["Foo"]]

Alternatively, examine the same internal output using without creating a new output:

Wolfram Language code: netWithoutFoo = NetGraph[{4, 3, 2, 1, LogisticSigmoid}, {1 -> 2 -> 3 -> 4 -> 5 }]

Wolfram Language code:

NetTrain[netWithoutFoo, <|"Input" -> input, "Output" -> output|>, "RoundMeasurements", RandomSeeding -> 42, TrainingProgressMeasurements -> NetPort[{3, "Output"}]]

Examine the error rate of one of the outputs for a multitask trained network for the CIFAR-100 dataset:

Wolfram Language code:

trainingData = ResourceData["CIFAR-100", "TrainingDataset"];
testData = ResourceData["CIFAR-100", "TestDataset"];
labels = Union@Normal@trainingData[All, "Label"];
sublabels = Union@Normal@trainingData[All, "SubLabel"];

Wolfram Language code:

net = NetGraph[{100, Ramp, {20, SoftmaxLayer[]}, {100, SoftmaxLayer[]}}, {NetPort["Image"] -> 1 -> 2 -> 3 -> NetPort["Label"], 2 -> 4 -> NetPort["SubLabel"]}, "Image" -> NetEncoder[{"Image", {28, 28}}], "Label" -> NetDecoder[{"Class", labels}], "SubLabel" -> NetDecoder[{"Class", sublabels}]]

Wolfram Language code:

NetTrain[net, trainingData, "ValidationMeasurements", ValidationSet -> testData, TrainingProgressMeasurements -> <|"Measurement" -> "ErrorRate", "Source" -> "SubLabel"|>]

Examine data produced by the training data generator function:

Wolfram Language code:

gen = {Function[<|"Input" -> RandomReal[1, #BatchSize], "Output" -> RandomReal[1, #BatchSize], "X" -> RandomReal[10] * #Round|>], "RoundLength" -> 64};
NetTrain[LinearLayer[{}, Input -> {}], gen, All, TrainingProgressMeasurements -> NetPort["X"]]

Use the custom function measurement and ClassifierMeasurements to measure properties not yet supported by TrainingProgressMeasurements:

Wolfram Language code:

NetTrain[NetModel["LeNet Trained on MNIST Data", "UninitializedEvaluationNet"], "MNIST", All, TrainingProgressMeasurements -> <|"Measurement" -> Function[ClassifierMeasurements[#Net, Thread@Function[#Input -> Round[Normal@#Output - 1]]@#BatchData, "MeanDecisionUtility"]], "Interval" -> "Batch", "Key" -> "MDU"|>, MaxTrainingRounds -> 1]

Properties & Relations (2)

Some of the built-in measurements, such as "Precision", "Recall", "F1Score" and "AreaUnderROCCurve", are not defined for batches and will only be measured after every completed round:

Wolfram Language code:

res = NetTrain[NetModel["LeNet"], ResourceData["FashionMNIST"], All, MaxTrainingRounds -> 1, TrainingProgressMeasurements -> {"Accuracy", "Precision"}];
res["BatchMeasurements"]
res["RoundMeasurements"]

For non-batch measurements, setting "Interval"->"Batch" will have no effect:

Wolfram Language code:

res = NetTrain[NetModel["LeNet"], ResourceData["FashionMNIST"], All, MaxTrainingRounds -> 1, TrainingProgressMeasurements -> <|"Measurement" -> "Precision", "Interval" -> "Batch"|>];
res["BatchMeasurements"]

Validation measurements will only be computed if a validation set is provided and will include measurements not defined on batches:

Wolfram Language code: res["ValidationMeasurements"]

Wolfram Language code:

NetTrain[NetModel["LeNet"], ResourceData["FashionMNIST"], "ValidationMeasurements", ValidationSet -> Scaled[0.2], MaxTrainingRounds -> 1, TrainingProgressMeasurements -> {"Accuracy", "Precision"}]

The "IntersectionOverUnion" measurement expects the input and target bounding boxes to be supplied as lists with the form {x₁,y₁,x₂,y₂}, where (x₁,y₁) and (x₂,y₂) are the coordinates describing the bottom-left and top-right corners of a bounding box.

The calculation of intersection over union is equivalent to the following Mathematica function:

Wolfram Language code:

iou[{inputX1_, inputY1_, inputX2_, inputY2_}, {targetX1_, targetY1_, targetX2_, targetY2_}] := Block[{x1, y1, x2, y2, inputArea, targetArea, intersectionArea}, 
	x1 = Max[inputX1, targetX1];
	y1 = Max[inputY1, targetY1];
	x2 = Min[inputX2, targetX2];
	y2 = Min[inputY2, targetY2];
	
	intersectionArea = Max[(x2 - x1), 0] * Max[(y2 - y1), 0];
	
	inputArea = (inputX2 - inputX1) * (inputY2 - inputY1);targetArea = (targetX2 - targetX1) * (targetY2 - targetY1);
	
	intersectionArea / (inputArea + targetArea - intersectionArea + $MachineEpsilon)
	];

Wolfram Language code:

data = {{{0, 0, 10, 10}, {0, 0, 10, 10}}, {{0, 0, 5, 10}, {0, 0, 10, 10}}, {{0, 0, 10, 10}, {0, 0, 5, 5}}, {{0, 0, 5, 5}, {5, 5, 10, 10}}, {{0, 0, 0, 5}, {1, 1, 1, 10}}};

Wolfram Language code: Map[Apply[iou]]@data

Wolfram Language code:

Map[NetMeasurements[MeanAbsoluteLossLayer["Input" -> 4], <|"Input" -> {First@#}, "Target" -> {Last@#}|>, "IntersectionOverUnion"]&]@data

Possible Issues (5)

The shorthand syntax TrainingProgressMeasurementsmeasurement can only be used if there is exactly one source for measurement:

Wolfram Language code:

net = NetChain[{4, 3, 2, 1, LogisticSigmoid}, "Input" -> 2];
net2CELosses = NetGraph[{net, net}, {}]

Wolfram Language code: NetTrain[net2CELosses, "RandomData", TrainingProgressMeasurements -> "Accuracy"]

Specify which output to measure:

Wolfram Language code:

NetTrain[net2CELosses, "RandomData", TrainingProgressMeasurements -> <|"Measurement" -> "Accuracy", "Source" -> "Output2"|>]

If two or more measurements that have the same default key are requested, the "Key"->"key" suboption must be used to uniquely identify them:

Wolfram Language code:

net = NetChain[{4, 3, 2, 3, SoftmaxLayer[]}, "Input" -> 2];
NetTrain[net, "RandomData", "RoundMeasurements", TrainingProgressMeasurements -> {"Accuracy", "Accuracy" -> 2}]

Wolfram Language code:

net = NetChain[{4, 3, 2, 3, SoftmaxLayer[]}, "Input" -> 2];
NetTrain[net, "RandomData", "RoundMeasurements", TrainingProgressMeasurements -> {<|"Measurement" -> "Accuracy", "Key" -> "Acc"|>, <|"Measurement" -> "Accuracy" -> 2, "Key" -> "Top2Acc"|>}]

Custom function measurements must have the "Key" suboption specified:

Wolfram Language code:

NetTrain[NetModel["LeNet"], "MNIST", All, TrainingProgressMeasurements -> <|"Measurement" -> Function[0.5], "Interval" -> "Batch"|>]

Wolfram Language code:

NetTrain[NetModel["LeNet"], "MNIST", TrainingProgressMeasurements -> <|"Measurement" -> Function[0.5], "Interval" -> "Batch", "Key" -> "Const"|>, TimeGoal -> 0.1]

Custom function measurements must return a numeric result:

Wolfram Language code:

NetTrain[NetModel["LeNet"], "MNIST", All, TrainingProgressMeasurements -> <|"Measurement" -> Function[None], "Interval" -> "Batch", "Key" -> "None"|>]

Wolfram Language code:

NetTrain[NetModel["LeNet"], "MNIST", TrainingProgressMeasurements -> <|"Measurement" -> Function[0.5], "Interval" -> "Batch", "Key" -> "Const"|>, TimeGoal -> 0.1]

Trying to measure an output port being used for the LossFunction can fail:

Wolfram Language code: net = NetModel["LeNet Trained on MNIST Data"]

Wolfram Language code:

res = NetTrain[net, {RandomImage[] -> 1}, All, TrainingProgressMeasurements -> NetPort["Output"], LossFunction -> "Output"]

A simple workaround is to create an additional output port to measure and explicitly label the training data:

Wolfram Language code: net = NetGraph[{NetModel["LeNet Trained on MNIST Data"]}, {"1" -> NetPort["Output"], "1" -> NetPort["Measure"]}]

Wolfram Language code:

res = NetTrain[net, <|"Input" -> {RandomImage[]}, "Output" -> {1}|>, All, TrainingProgressMeasurements -> NetPort["Measure"], LossFunction -> "Output"]

Neat Examples (1)

Animate the history of the confusion matrix for a LeNet trained on a sample of FashionMNIST:

Wolfram Language code:

net = NetModel["LeNet"];
data = RandomSample[ResourceData["FashionMNIST"], 1000];
res = NetTrain[net, data, "RoundMeasurementsLists", MaxTrainingRounds -> 30, TrainingProgressMeasurements -> "ConfusionMatrixPlot"];

Wolfram Language code: ListAnimate[res["ConfusionMatrixPlot"]]

Top

More Learning

Tech Support

Wolfram Solutions

Wolfram Solutions For Education

Get Started

Grow Your Skills

Work with Us

Educational Programs for Adults

Educational Programs for Youth

Read

TrainingProgressMeasurements

Details

Examples

Basic Examples (1)

Scope (7)

Properties & Relations (2)

Possible Issues (5)

Neat Examples (1)

Text

CMS

APA

BibTeX

BibLaTeX

TrainingProgressMeasurements

Details

Examples

Basic Examples (1)

Scope (7)

Properties & Relations (2)

Possible Issues (5)

Neat Examples (1)

See Also

Tech Notes

History

Text

CMS

APA

BibTeX

BibLaTeX