Ryan Riegel
The below is presently outdated, and I apologize. Anyway, publications:
SDM 2008: Massive-Scale Kernel Discriminant Analysis: Mining for Quasars
Compstat 2006: Large-Scale Kernel Discriminant Analysis with Application to Quasar Discovery
One of these days I'll make a more exciting webpage, with pictures and
other cool things. But for now, this should suffice.
Personal Information
I'm a second-year Ph.D. student associated with both ISS and CSE. I
obtained a BS in Joint Mathematics and Computer Science from Harvey
Mudd College in May 2005 and have no master's degree. I am working
with Prof. Alex Gray and my primary focus is computational statistics,
which consists of finding quick, novel, or otherwise interesting code
solutions for the analysis of data, and is especially concerned with
massive data sets or high dimensionality.
Schedule
I was fixing to put this into a snazzy table, but then I remembered
that this is html, not LaTeX. Along with the general theme of this
site, I'll get around to makeing this look nicer in a bit, for
relative sizes of "bit."
Actually, I should just link to my
In general, I'll arrive at the TSRB sometime before my scheduled
seminars there, or noonish on days I don't have scheduled obligations,
and I don't leave until 9:00 or so unless I'm being particularly
unproductive. I'm there pretty much everyday of the week and some
weekends, too. My desk is near Alex Gray's office.
Projects:
Algorithmica
While one might say that this project hasn't officially started yet,
my research group and I are at least thinking about it rather
intently. Algorithmica is intended to be a system capable of
automatically deriving and implementing algorithms to solve
user-specified problems. Its basic method will be to stitch together
highly general, pre-programmed code segments that address certain
computational tasks in a manner it deduces to be both effective and
efficient for the problem at hand. This project is based on previous
work under the name AutoBayes by Bernd Fischer, Johann Schumann, Wray
Buntine, and Alex Gray.
Implementing applications of Generalized N-body Problems
The above project will ultimately be able to reproduce a great deal of
the work encompassed here, but in the meantime, my efforts are
directed towards the class of problems known as Generalized N-body
Problems. These are characterized as computational problems that
involve comparisons between groups of data points that exist in a
metric space, and can often be solved efficiently with simultaneous
expansion upon pairs (or larger groups) of spatially-informed trees.
Notable examples are all-nearest-neighbors, kernel density estimation (KDE),
non-parametric Bayesian classification, the n-point correlation, and
many others. Currently, I am working on improving the speed of SMO
(the state-of-the-art in training support vector machines) through the
application of our methods.
Investigation of L2E/L1E as a general objective function
Minimizing integrated square error (L2E) is a common obective in
non-parametric problems (e.g., KDE), but it can be used for parametric
fitting as well. It demonstrates increased resistance to outliers and
other noise over maximum likelihood (ML), which is the defacto
objective function of EM algorithms. Further, integrated absolute
error (L1E) may offer even further robustness, though until now, it
has not been considered to be a feasible objective function due to its
inconvenient mathematical properties. It is, however, possible to
overcome L1E's difficulties with an EM-like surrogate optimzation that
converges to L1E's results while only performing L2E-like work.
Contact Information:
rriegel -at- cc.gatech.edu
College of Computing
Georgia Institute of Technology
Atlanta, GA 30332-0280
Last Modified: Sep. 2, 2006