Current Research Interests
I now work on web index selection.
Past Research Projects
In grad school, I worked on computer vision. Below you will
find papers, code, and datasets for some of the projects I have worked
on. Unfortunately, I do not have time to give support for code or provide
any additional datasets not listed here.
Video Event Detection
We developed volumetric features for event detection in crowded
videos. We used spatio-temporal 3D shape and flow features to find
human actions such as picking up an object from the ground, waving for
the bus, and sitting down. The method works despite significant
clutter and motion in the video.
Published in
ICCV 2005,
VS 2007,
ICCV 2007,
Ph.D. Thesis: CMU-CS-08-113.
Thesis Oral Slides and
Videos.
Dataset for ICCV 07 paper. Code for reading and manipulating the dataset.
Photo Quality Assessment
We developed high-level semantic features to assess the quality of
photographs. The system can classify between photographs taken by
professionals versus snapshots taken by amateurs. A web image search
engine can use the quality metric to place the best quality
photographs near the top, giving the user the most relevant results.
Published in CVPR 2006.
Lecture slides.
Sample Results.
Music Identification
We used machine learning techniques to develop an audio fingerprinting
algorithm for music identification. Given ten seconds of low-quality
music recorded through a phone, our system can robustly identify the
song in a large database of songs.
Published in CVPR 2005.
Video Demonstration.
Binary demo code and sample music: [mrdemo.tgz 16MB]
C++ server code: [musicretr-1.0.tgz]
Java GUI code: [mrgui.tgz]
Image Retrieval with PCA-SIFT
We developed a more distinctive local image descriptor for image
retrieval. We then combined PCA-SIFT with locality sensitive hashing
to enable fast and accurate sub-image retrieval over tens of thousands
of image. The application can be used for image copyright
enforcement.
Published in CVPR 2004,
ACM MM 2004.
PCA-SIFT Code
(descriptor only). May require netpbm libraries.
Matlab code for
training PCA-SIFT.
Dataset for ACM MM paper.
Yan Ke