Important Notice: Our web hosting provider recently started charging us for additional visits, which was unexpected. In response, we're seeking donations. Depending on the situation, we may explore different monetization options for our Community and Expert Contributors. It's crucial to provide more returns for their expertise and offer more Expert Validated Answers or AI Validated Answers. Learn more about our hosting issue here.

What are “intelligent surrogates for missing values”?

0
Posted

What are “intelligent surrogates for missing values”?

0

A9. CART handles missing values in the database by substituting “surrogate splitters,” which are back-up rules that closely mimic the action of primary splitting rules. Suppose that, in a given model, CART splits data according to household income. If a value for income is not available, CART might substitute education level as a good surrogate. The surrogate splitter contains information that is typically similar to what would be found in the primary splitter. Other products’ approaches treat all records with missing values as if the records all had the same unknown value; with that approach all such “missings” are assigned to the same bin. In CART, each record is processed using data specific to that record. This allows records with different data patterns to be handled differently, which results in a better characterization of the data. By using surrogates to stand in for missing values, CART generates robust and reliable predictive models, even when applied to very large databases

What is your question?

*Sadly, we had to bring back ads too. Hopefully more targeted.

Experts123