CrossEntropyLossLayer
CrossEntropyLossLayer["Index"]
represents a net layer that computes the cross-entropy loss by comparing input class probability vectors with indices representing the target class.
CrossEntropyLossLayer["Probabilities"]
represents a net layer that computes the cross-entropy loss by comparing input class probability vectors with target class probability vectors.
CrossEntropyLossLayer["Binary"]
represents a net layer that computes the binary cross-entropy loss by comparing input probability scalars with target probability scalars, where each probability represents a binary choice.
Details and Options



- CrossEntropyLossLayer exposes the following ports for use in NetGraph etc.:
  "Input": real array of rank n
  "Target": real array of rank n or integer array of rank n-1
  "Loss": real number
- When operating on multidimensional inputs, CrossEntropyLossLayer effectively threads over any extra array dimensions to produce an array of losses and returns the mean of these losses.
- For CrossEntropyLossLayer["Binary"], the input and target should be scalar values between 0 and 1, or arrays of these.
- For CrossEntropyLossLayer["Index"], the input should be a vector of probabilities {p1,…,pc} that sums to 1, or an array of such vectors. The target should be an integer between 1 and c, or an array of such integers.
- For CrossEntropyLossLayer["Probabilities"], the input and target should be a vector of probabilities that sums to 1, or an array of such vectors.
- For the "Index" and "Probabilities" forms, where the input array has dimensions {d1,d2,…,dn}, the final dimension dn is used to index the class. The output loss is taken to be the mean over the remaining dimensions {d1,…,dn-1}.
- CrossEntropyLossLayer[…][<|"Input"->in,"Target"->target|>] explicitly computes the output from applying the layer.
- CrossEntropyLossLayer[…][<|"Input"->{in1,in2,…},"Target"->{target1,target2,…}|>] explicitly computes outputs for each of the ini and targeti.
- When given a NumericArray as input, the output will be a NumericArray.
- CrossEntropyLossLayer is typically used inside NetGraph to construct a training network.
- CrossEntropyLossLayer can operate on arrays that contain "Varying" dimensions.
- A CrossEntropyLossLayer[…] can be provided as the third argument to NetTrain when training a specific network.
- When appropriate, CrossEntropyLossLayer is automatically used by NetTrain if an explicit loss specification is not provided. One of "Binary", "Probabilities", or "Index" will be chosen based on the final activation used for the output port and the form of any attached NetDecoder.
- CrossEntropyLossLayer[form,"port"->shape] allows the shape of the input or target port to be specified. Possible forms for shape are:
  "Real": a single real number
  "Integer": a single integer
  n: a vector of length n
  {n1,n2,…}: an array of dimensions n1×n2×…
  "Varying": a vector whose length is variable
  {"Varying",n2,n3,…}: an array whose first dimension is variable and whose remaining dimensions are n2×n3×…
  NetEncoder[…]: an encoder
  NetEncoder[{…,"Dimensions"->{n1,…}}]: an encoder mapped over an array of dimensions n1×…
- Options[CrossEntropyLossLayer] gives the list of default options to construct the layer. Options[CrossEntropyLossLayer[…]] gives the list of default options to evaluate the layer on some data.
- Information[CrossEntropyLossLayer[…]] gives a report about the net layer.
- Information[CrossEntropyLossLayer[…],prop] gives the value of the property prop of CrossEntropyLossLayer[…]. Possible properties are the same as for NetGraph.
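The three forms compute standard cross-entropy formulas. As an illustration of the underlying math only (this is Python, not Wolfram Language), here is a small sketch assuming the natural logarithm and the mean reduction over extra dimensions described above:

```python
import math

def binary_loss(p, t):
    # "Binary" form: p and t are scalar probabilities between 0 and 1
    return -(t * math.log(p) + (1 - t) * math.log(1 - p))

def index_loss(probs, target):
    # "Index" form: probs is a probability vector summing to 1,
    # target is a 1-based class index
    return -math.log(probs[target - 1])

def probabilities_loss(probs, target):
    # "Probabilities" form: input and target are both probability vectors
    return -sum(t * math.log(p) for t, p in zip(target, probs))

def mean_loss(loss_fn, inputs, targets):
    # Threading over extra dimensions yields an array of losses,
    # and the layer returns their mean
    losses = [loss_fn(i, t) for i, t in zip(inputs, targets)]
    return sum(losses) / len(losses)
```

For example, `index_loss([0.25, 0.25, 0.5], 3)` gives log 2, the loss for assigning probability 1/2 to the correct class.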
Examples
Basic Examples (3): Summary of the most common use cases
Create a CrossEntropyLossLayer object that takes a probability vector and an index:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-9reqx3

Create a CrossEntropyLossLayer where the input is a probability vector and the target is an index:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-plcxfy

Apply it to an input and a target:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-3tu0cv

Create a CrossEntropyLossLayer that operates on vectors generated from strings:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-otb5aa

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-fru6tz

Apply it to an input and a target:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-baujnb

Completely correct predictions produce a loss of 0:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-yo5y67
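Why correct predictions give zero loss: for the "Index" form the loss is the negative log of the probability assigned to the target class, which vanishes when that probability is 1. A Python illustration of the formula (not Wolfram Language):

```python
import math

def index_loss(probs, target):
    # "Index"-form cross-entropy: negative log of the probability
    # assigned to the (1-based) target class
    return -math.log(probs[target - 1])

# A completely correct prediction puts probability 1 on the target,
# so the loss is 0; any spread-out prediction has positive loss
assert index_loss([0.0, 0.0, 1.0], 3) == 0
assert index_loss([0.5, 0.5], 1) > 0
```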

Scope (5): Survey of the scope of standard use cases
Create a CrossEntropyLossLayer where the input and target are single probabilities:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-opf9qp

Apply it to an input and a target:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-vc283n

Thread the layer over a batch of inputs:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-3lymae

Create a CrossEntropyLossLayer where the input is a probability vector and the target is an index:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-v0y3sc

Apply it to an input and a target:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-r45wuc

Create a CrossEntropyLossLayer where the input is a probability vector and the target is a probability vector:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-mpvmz1

Apply it to an input and a target:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-x8ex4u

Create a CrossEntropyLossLayer where the input and target are images representing matrices of binary class probabilities:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-4homlo

Apply the layer to an input and target:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-o3j490

Measure all possible losses from inputs and targets from a small set:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-ii4tvx

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-36jff8
The input-target pairs with the smallest losses:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-dzytq1

The input-target pairs with the largest losses:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-ml7xl1

Create a graph containing a CrossEntropyLossLayer in which the input is a 3-channel image whose color channels each represent a class, and the target is a matrix of indices giving the correct class for each pixel:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-xvpsq1

Measure the loss on a target image and matrix in which the areas that are predominantly red, green, and blue match the indices 1, 2, and 3, respectively, in the target:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-pxcg9o

Permuting the colors makes the per-pixel distributions disagree with the target matrix and increases the loss:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-ip7mc7

Applications (2): Sample problems that can be solved with this function
CrossEntropyLossLayer["Binary"] is used automatically by NetTrain when the final activation used for an output is an ElementwiseLayer[LogisticSigmoid]. Create a network that takes a pair of numbers and produces either True or False:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-rrr7ry

Train the network to decide if the first number in the pair is greater than the second:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-uda82a

The training network automatically constructed by NetTrain contains a binary-type CrossEntropyLossLayer:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-fhi8fu

Show the behavior of the trained network on the plane:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-67yzuz


https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-ralj0e

Plot the underlying probability learned by the network:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-d4uh60

CrossEntropyLossLayer["Index"] is used automatically by NetTrain when the final activation used for an output is a SoftmaxLayer. Create an artificial dataset from three normally distributed clusters:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-hhp24w

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-qddujl

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-fcsxx1

The training data consists of rules mapping the point to the cluster it is in:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-u4wtyb

Create a net to compute the probability of a point lying in each cluster, using a "Class" decoder to classify the input as either Red, Green or Blue:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-6r2iad


https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-0vjiho

The training network automatically constructed by NetTrain contains an index-type CrossEntropyLossLayer:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-r0b52c

Evaluate the net on the centers of each cluster:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-ni3s3s


https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-tdbu2l

Show the contours in feature space in which each class reaches a posterior probability of 0.75:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-1d95lt

Properties & Relations (4): Properties of the function, and connections to other functions
Here is the function computed by CrossEntropyLossLayer["Binary"]:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-5hya20
Evaluate the function on some data:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-2uzk67

This is equivalent to the following:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-kzkagb

When the target is fixed at 1, the loss is minimized as the input approaches 1:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-egkimd

In general, the loss is minimized when the target approaches the input:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-zscqcm
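These two minimization properties can be checked numerically. A Python sketch of the binary formula -(t log p + (1-t) log(1-p)) (illustrative only, not Wolfram Language), using a simple grid search to locate the minimum:

```python
import math

def binary_loss(p, t):
    # binary cross-entropy: -(t*log(p) + (1-t)*log(1-p))
    return -(t * math.log(p) + (1 - t) * math.log(1 - p))

# With the target fixed at 1, the loss falls as the input approaches 1
assert binary_loss(0.9, 1) < binary_loss(0.5, 1) < binary_loss(0.1, 1)

# More generally, for a fixed target t the loss is minimized at p == t
# (setting the derivative -t/p + (1-t)/(1-p) to zero gives p = t)
t = 0.3
candidates = [i / 100 for i in range(1, 100)]
best = min(candidates, key=lambda p: binary_loss(p, t))
assert abs(best - t) < 1e-9
```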

CrossEntropyLossLayer["Binary"] applied to scalar probabilities p is equivalent to CrossEntropyLossLayer["Probabilities"] applied to the vector of probabilities {p,1-p}:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-fo9d5z


https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-xoelhc

Demonstrate the same by substitution:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-nwc6dn
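The substitution can also be checked numerically with the formulas themselves. A Python sketch (illustrative only, not Wolfram Language): expanding the "Probabilities" loss on {p, 1-p} against target {t, 1-t} reproduces the "Binary" loss term by term:

```python
import math

def binary_loss(p, t):
    # -(t*log(p) + (1-t)*log(1-p))
    return -(t * math.log(p) + (1 - t) * math.log(1 - p))

def probabilities_loss(probs, target):
    # -sum_i t_i * log(p_i)
    return -sum(t * math.log(p) for t, p in zip(target, probs))

# Binary loss on scalar p equals the probabilities loss on {p, 1-p}
for p, t in [(0.2, 1), (0.7, 0), (0.5, 0.5)]:
    assert abs(binary_loss(p, t)
               - probabilities_loss([p, 1 - p], [t, 1 - t])) < 1e-12
```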

Here is the function computed by CrossEntropyLossLayer["Probabilities"]:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-87kvat
Evaluate the function on some data:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-ij6swv

This is equivalent to the following:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-c4q3hq

CrossEntropyLossLayer["Probabilities"] is equivalent to CrossEntropyLossLayer["Index"] with a one-hot encoding:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-5f2lur
Evaluating them on sparse label data gives the same result:

https://wolfram.com/xid/0nq8b9wrw92vp5x8aa7aduq-ex8lb8
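The one-hot equivalence follows directly from the formulas: with a one-hot target, every term of -Σ tᵢ log pᵢ vanishes except the one at the target index. A Python sketch (illustrative only, not Wolfram Language):

```python
import math

def index_loss(probs, target):
    # "Index" form: target is a 1-based class index
    return -math.log(probs[target - 1])

def probabilities_loss(probs, target):
    # "Probabilities" form: target is a probability vector
    return -sum(t * math.log(p) for t, p in zip(target, probs))

def one_hot(k, n):
    # unit vector with a 1 at the (1-based) position k
    return [1.0 if i == k - 1 else 0.0 for i in range(n)]

probs = [0.1, 0.2, 0.7]
for k in (1, 2, 3):
    assert abs(index_loss(probs, k)
               - probabilities_loss(probs, one_hot(k, 3))) < 1e-12
```

Note that the index form never materializes the one-hot vector, which is one reason it can be faster and far lighter on memory when the number of classes is very large.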

CrossEntropyLossLayer["Index"] is typically faster to evaluate than CrossEntropyLossLayer["Probabilities"] and can use significantly less memory when the number of classes is very large.
Wolfram Research (2016), CrossEntropyLossLayer, Wolfram Language function, https://reference.wolfram.com/language/ref/CrossEntropyLossLayer.html (updated 2020).