Similar Search Results

On selecting interacting features from high-dimensional data

Hall, Peter; Xue, Jing-Hao - In: Computational Statistics & Data Analysis 71 (2014) C, pp. 694-708

For high-dimensional data, most feature-selection methods, such as SIS and the lasso, involve ranking and selecting features individually. These methods do not require many computational resources, but they ignore feature interactions. A simple recursive approach, which, without requiring many...

Persistent link: https://www.econbiz.de/10010871404

Tilting methods for assessing the influence of components in a classifier

Hall, Peter; Titterington, D. M.; Xue, Jing-Hao - In: Journal of the Royal Statistical Society Series B 71 (2009) 4, pp. 783-803

Many contemporary classifiers are constructed to provide good performance for very high dimensional data. However, an issue that is at least as important as good classification is determining which of the many potential variables provide key information for good decisions. Responding to this...

Persistent link: https://www.econbiz.de/10004982372

Incorporating prior probabilities into high-dimensional classifiers

Hall, Peter; Xue, Jing-Hao - In: Biometrika 97 (2010) 1, pp. 31-48

In standard parametric classifiers, or classifiers based on nonparametric methods but where there is an opportunity for estimating population densities, the prior probabilities of the respective populations play a key role. However, those probabilities are largely ignored in the construction of...

Persistent link: https://www.econbiz.de/10008553412

Median-Based Classifiers for High-Dimensional Data

Hall, Peter; Titterington, D. M.; Xue, Jing-Hao - In: Journal of the American Statistical Association 104 (2009) 488, pp. 1597-1608

Persistent link: https://www.econbiz.de/10008784112

On the generative-discriminative tradeoff approach: Interpretation, asymptotic efficiency and classification performance

Xue, Jing-Hao; Titterington, D. Michael - In: Computational Statistics & Data Analysis 54 (2010) 2, pp. 438-451

The interpretation of generative, discriminative and hybrid approaches to classification is discussed, in particular for the generative-discriminative tradeoff (GDT), a hybrid approach. The asymptotic efficiency of the GDT, relative to that of its generative or discriminative counterpart, is...

Persistent link: https://www.econbiz.de/10008484580

The p-folded cumulative distribution function and the mean absolute deviation from the p-quantile

Xue, Jing-Hao; Titterington, D. Michael - In: Statistics & Probability Letters 81 (2011) 8, pp. 1179-1182

The aims of this short note are two-fold. First, it shows that, for a random variable X, the area under the curve of its folded cumulative distribution function equals the mean absolute deviation (MAD) from the median. Such an equivalence implies that the MAD is the area between the cumulative...

Persistent link: https://www.econbiz.de/10009143278

Sliced Regression for Dimension Reduction

Wang, Hansheng; Xia, Yingcun - In: Journal of the American Statistical Association 103 (2008) June, pp. 811-821

Persistent link: https://www.econbiz.de/10005532760

ASYMPTOTIC DISTRIBUTIONS FOR TWO ESTIMATORS OF THE SINGLE-INDEX MODEL

Xia, Yingcun - In: Econometric Theory 22 (2006) 06, pp. 1112-1137

Persistent link: https://www.econbiz.de/10005411960

Optimal smoothing for a computationally and statistically efficient single index estimator

Hardle, Wolfgang; Xia, Yingcun; Linton, Oliver - London School of Economics (LSE) - 2009

In semiparametric models it is a common approach to under-smooth the nonparametric functions in order that estimators of the finite dimensional parameters can achieve root-n consistency. The requirement of under-smoothing may result as we show from inefficient estimation methods or technical...

Persistent link: https://www.econbiz.de/10011126315

Semi-parametric estimation of generalized partially linear single-index models

Xia, Yingcun; Härdle, Wolfgang - Sonderforschungsbereich 373, Quantifikation und … - 2002

One of the most difficult problems in applications of semiparametric generalized partially linear single-index model (GPLSIM) is the choice of pilot estimators and complexity parameters which may result in radically different estimators. Pilot estimators are often assumed to be root-n...

Persistent link: https://www.econbiz.de/10010983767