Shane T. Jensen

 Associate Professor
 Department of Statistics
 The Wharton School
 University of Pennsylvania

 463 Huntsman Hall
 3730 Walnut Street

 stjensen at

My current CV is available for download in PDF format.

A list of my publications can be found here: Publication List

My Google Scholar profile: Google Scholar Profile


 Research Interests

I enjoy developing statistical methodology for a wide variety of application areas. Some of my current interests are:

1. Genetics and Molecular Biology

Developing sophisticated statistical models for the evolution of genomic sequences. Areas of application include the response of HIV under various therapies as well as evolution during cancer progression. Developing models for combining heterogeneous data sources to refine predictions about co-regulated genes and regulatory networks in cells.

Check out this TED talk on the power of molecular biology and genetics.

2. Statistics in Sports

Developing novel statistical models for the comparison of baseball players in terms of on-field performance. Evaluation of fielding ability as well as prediction of future hitting and pitching performance. Quantifying player performance in hockey.

3. Bayesian Nonparametrics

Extensions of Dirichlet processes for grouped and ordered data. Alternative prior processes for nonparametric clustering. Tree-based approaches for high-dimensional settings.

4. Economics and Marketing

Estimating income volatility while allowing for heterogeneity over time and between individuals in the population. Exploring the relationship between income volatility and risk aversion. Modeling career choice as a function of risk aversion. Models for missing data in marketing research.

 Individual Project Pages

SAFE: Spatial Aggregate Fielding Evaluation, our methodology for measuring fielding ability in Major League Baseball players using a hierarchical probit model. Results are presented across seven seasons of high-resolution ball-in-play data.

COGRIM: Bayesian variable selection model for regulatory network inference through the integration of gene expression data, ChIP binding data and sequence motif data.

PHYLOCLUS: Suite of Perl programs for clustering co-regulated genes based on phylogenetically discovered transcription factor binding motifs.

MOTIF CLUSTERING: Perl programs and supplemental material for clustering transcription factor binding motif matrices based on a hierarchical Bayesian model.

