The Liem Sioe Liong/First Pacific Company Professor of Statistics
Room: 471 JMHH
Office: (215) 898-8236, Fax: (215) 898-1280, Dept: (215) 898-8222
Email: click here for an image of the address
Curriculum vitae:
[.pdf]
(a more fun alternative from
the 2004/5 MBA guide)
STATISTICS 541 has switched from spring to fall.
This amounts to a switch with STATISTICS 540 (Statistical Computing),
which is now taught in spring as STATISTICS 542.
A preliminary version and a companion paper, which I keep posted because others have
started referring to them:
The reason for posting this now is that a few of us in the department are currently discussing
model diagnostics.
Local Multidimensional Scaling for Nonlinear Dimension Reduction, Graph Layout and Proximity Analysis
[.pdf]
Quasi-Darwinian Selection in Marketing Relationships
[.pdf]
Journal of Marketing, Oct 2007;
featured in a JM blog article
and a finalist for JM's 2007 Harold H. Maynard Award.
Along with the paper go a few scenario calculations that are not included in the article:
[.pdf]
Loss Functions for Binary Class Probability Estimation: Structure and Applications. (Former title: Degrees of Boosting)
[.pdf] (under revision)
Yi Shen's 2005 Ph.D. thesis on cost-weighted class probability estimation
[.pdf]
Cost-Weighted Boosting with Jittering and Over/Under-Sampling: JOUS-Boost
[.pdf]
(Journal of Machine Learning Research 8 (Mar), 409-439, 2007)
Observations on Bagging (Statistica Sinica, Special Issue on Machine Learning, 16 (2), 323-352, 2006)
[.pdf]
The Effect of Bagging on Variance, Bias, and Mean Squared Error
[.pdf]
PPT slides
Smoothing Effects of Bagging
[.pdf]
Calibration for Simultaneity: (Re)Sampling Methods for Simultaneous Inference with
Applications to Function Estimation and Functional Data
[.pdf, 1.7MB] (under revision)
Inference for Data Visualization
[.pdf]
A. Buja and D.F. Swayne; J. of Classification, 19, 7-43, 2002.
joint with Deborah Swayne, Michael Littman, Nate Dean, Heike Hofmann, and Lisha Chen.
An older version that had both papers in one should be considered out of date.
Appeared in "Handbook of Statistics", eds. E. Wegman, C. R. Rao.
Alan Gous and Andreas Buja;
Journal of Computational and Graphical Statistics, 13 (1), 1-19 (2004).
(We are permitted to post the color version of this paper. The printed version is
b/w with gray-scale figures.)
Data Mining Criteria for Tree-Based Regression and Classification
[.ps.gz]
A. Buja and Y.-S. Lee; Proceedings of KDD 2001, 27-36.