Showing 1 - 10 of 260
For high-dimensional data, most feature-selection methods, such as SIS and the lasso, involve ranking and selecting features individually. These methods do not require many computational resources, but they ignore feature interactions. A simple recursive approach, which, without requiring many...
Persistent link: https://www.econbiz.de/10010871404
Many contemporary classifiers are constructed to provide good performance for very high dimensional data. However, an issue that is at least as important as good classification is determining which of the many potential variables provide key information for good decisions. Responding to this...
Persistent link: https://www.econbiz.de/10004982372
In standard parametric classifiers, or classifiers based on nonparametric methods but where there is an opportunity for estimating population densities, the prior probabilities of the respective populations play a key role. However, those probabilities are largely ignored in the construction of...
Persistent link: https://www.econbiz.de/10008553412
Persistent link: https://www.econbiz.de/10008784112
The interpretation of generative, discriminative and hybrid approaches to classification is discussed, in particular for the generative-discriminative tradeoff (GDT), a hybrid approach. The asymptotic efficiency of the GDT, relative to that of its generative or discriminative counterpart, is...
Persistent link: https://www.econbiz.de/10008484580
The aims of this short note are two-fold. First, it shows that, for a random variable X, the area under the curve of its folded cumulative distribution function equals the mean absolute deviation (MAD) from the median. Such an equivalence implies that the MAD is the area between the cumulative...
Persistent link: https://www.econbiz.de/10009143278
Persistent link: https://www.econbiz.de/10005532760
Persistent link: https://www.econbiz.de/10005411960
In semiparametric models it is a common approach to under-smooth the nonparametric functions in order that estimators of the finite dimensional parameters can achieve root-n consistency. The requirement of under-smoothing may result as we show from inefficient estimation methods or technical...
Persistent link: https://www.econbiz.de/10011126315
One of the most difficult problems in applications of semiparametric generalized partially linear single-index model (GPLSIM) is the choice of pilot estimators and complexity parameters which may result in radically different estimators. Pilot estimators are often assumed to be root-n...
Persistent link: https://www.econbiz.de/10010983767