replaces missing values in each example by generated values.
uses the distribution dist to generate values.
Details and Options
- SynthesizeMissingValues can be used on many types of data, including numerical, nominal and images.
- Each examplei can be a single data element, a list of data elements or an association of data elements. Examples can also be given as a Dataset object.
- The following options can be given:
FeatureNames Automatic feature names to assign for input data FeatureTypes Automatic feature types to assume for input data Method Automatic which modeling algorithm to use MissingValuePattern _Missing|Indeterminate the pattern of the assumed missing values PerformanceGoal Automatic aspects of performance to optimize RandomSeeding 1234 what seeding of pseudorandom generators should be done internally TimeGoal Automatic how long to spend for training TrainingProgressReporting Automatic how to report progress during training ValidationSet Automatic the set of data on which to evaluate the model during training
- Possible settings for PerformanceGoal include:
"Quality" maximize the synthesis quality "Speed" maximize the synthesis speed Automatic automatic tradeoff between speed and quality
- Possible settings for Method are as given in LearnDistribution[…].
- In the form Methodassoc, the association assoc can have elements:
"LearningMethod" method for learning the probability distribution "EvaluationStrategy" how to synthesize from the distribution
- Possible settings for method element "EvaluationStrategy" include:
"RandomSampling" randomly sample from the conditioned distribution (default) "ModeFinding" attempt to find the mode of the conditioned distribution
- The following settings for TrainingProgressReporting can be used:
"Panel" show a dynamically updating graphical panel "Print" periodically report information using Print "ProgressIndicator" show a simple ProgressIndicator "SimplePanel" dynamically updating panel without learning curves None do not report any information
- Possible settings for RandomSeeding include:
Automatic automatically reseed every time the function is called Inherited use externally seeded random numbers seed use an explicit integer or strings as a seed
- SynthesizeMissingValues[…,FeatureExtractor"Minimal"] indicates that the internal preprocessing should be as simple as possible.
Examplesopen allclose all
Basic Examples (2)
Specify that the missing values to be replaced are integers using MissingValuePattern:
Replace Missing values using "Multinormal" method for computing the distribution: