Graduate Student, EECS Dept., MIT.
Member of the Stochastic Systems Group (SSG),
Laboratory for Information and Decision Systems (LIDS).
MIT, Room 32-D570 77 Mass. Ave Cambridge, MA02139
Phone:
(617) 253-3816
Email:
vtan at mit dot edu
After our honeymoon in Perth, Australia
(left) and back in Singapore
(right) on the brand new Singapore Flyer.
Brief Biography
Vincent Tan is a second-year graduate student at Laboratory
for Information and Decision Systems (LIDS)
in the Department of EECS at MIT.
His research interests are in the broad areas of statistical signal processing,
graphical models and convex optimization. He is affiliated to the Stochastic
Systems Group (SSG) led by Prof. Alan Willsky.
Vincent was an
undergraduate in Electrical and Information Sciences (EIST) at Sidney Sussex College in Cambridge University. He worked on his Masters
project at the Signal Processing
Laboratory in the Engineering Department (CUED)
under Dr. Cédric
Févotte and received the Charles Lamb Prize for being the top
student in EIST in 2005. He spent his junior year at the Massachusetts
Institute of Technology (MIT) on the
Cambridge-MIT (CMI) Undergraduate
Exchange Program.
·In
the summer of 2008, Vincent was an intern at the Machine Learning and
Perception (MLP) group at
Microsoft Research Cambridge (MSR-C).
·From
2006 to 2007, Vincent worked at the Institute for Infocomm Research (I2R), a research institute under A*STAR.
·From
2005 to 2006, Vincent worked as a research engineer at the Defence Science
Organisation (DSO) National Laboratories
in Singapore.
·In
2004, he spent a summer at Caltech under
the Summer Undergraduate Research Fellowship (SURF).
Vincent’s research is
funded by the Agency of Science, Technology and Research (A*STAR), Singapore.
Vincent is a Student Member
of the IEEE. Vincent is married to a
wonderful woman, Huili, who is a graduate student in biology at the Whitehead
Institute.
Research
My research lies in the broad areas of statistical signal
processing, convex optimization and machine learning.
Current Research
I have been working on the use of
convex optimization and information theory to learn probabilistic graphical
models for the specific purpose of hypothesis testing/classification (SSP
2007). This work has been extended to sequentially and jointly learn increasingly complex probability models defined on
graphical models for discriminating between two hypotheses (ICASSP 2008, ITA
2008). I am also interested in frame representations, sampling theory and
signals with finite rate of innovation (TSP 2008).
Previous Research
Previously, I was involved in
developing new algorithms for privacy-preserving data mining. I examined the
use of various and devised novel algorithms for the reconstruction of a
distribution after a generic randomization process (MLDM 2007). I examined the
accuracy and utility of using Kernel Density Resampling methods for privacy
preservation (PinKDD 2007) in the context of distributed classification.
During my
final year at Cambridge,
I examined the effect of sparsity on underdetermined blind audio source
separation (SPARS 2005). The results show that the separation performance is
indeed correlated to the sparsity of the analysis coefficients of the sources
in the transform domain. Our results also show that the use of overcomplete
transforms does not lead to significant improvement in performance, because
they fail to improve the sparsity measure.
Publications
Vincent Y. F. Tan and Vivek K.
Goyal, “Estimating Signals with Finite Rate of Innovation from Noisy
Samples: A Stochastic Algorithm,” IEEE Transactions on Signal Processing, 2008,Accepted!
Vincent Y. F. Tan, John W. Fisher
III, Alan S. Willsky, “Learning Max-Weight Discriminative Forests,”
2008 IEEE International Conference
on Acoustics, Speech, and Signal Processing (ICASSP), Las Vegas,
Nevada, Mar 30 – April 4, 2008,Link
John
W. Fisher III, Vincent Y. F. Tan,
Alan S. Willsky, “Learning Max-Weight Discriminative Forests,”
2008 Information Theory and Applications Workshop (ITA),
La Jolla, California, Jan 27 – Feb 1, 2008. (JWF Invited)
Sujay Sanghavi, Vincent Y. F. Tan and Alan S.
Willsky, “Learning Graphical Models for Hypothesis Testing”,
In IEEE Statistical Signal Processing
(SSP) Workshop (2007), Madison, WI, Aug 26 - 29, 2007. Link
Vincent Y. F. Tan and See Kiong Ng, “Privacy-Preserving Sharing of
Horizontally-Distributed Private Data for Constructing Accurate
Classifiers”, Proceedings
of the First SIGKDD International Workshop on Privacy, Security, and Trust
in KDD (PinKDD'07), Lecture Notes in Computer Science (LNCS), Volume
4890, Pages 116-137, Springer, 2008. SpringerLink
Vincent Y. F. Tan and See Kiong Ng, “Privacy-Preserving Sharing of
Horizontally-Distributed Private Data for Constructing Accurate
Classifiers”, accepted by the
First ACM SIGKDD International Workshop on Privacy, Security, and Trust in
KDD (PinKDD 2007 held in conjunction with SIGKDD), San Jose,
California, August 12-15, 2007.pdf Link
Vincent Y. F. Tan and See Kiong Ng, “Generic Probability Density Function
Reconstruction for Randomization in Privacy-Preserving Data Mining”,
In: P. Perner (Ed.): Proceedings of the 5th International Conference on
Machine Learning and Data Mining (MLDM-07), (LNAI 4571), pp. 76-90, Leipzig, Germany, July 18-20, 2007. SpringerLink
Vincent Y. F. Tan and Cédric Févotte, “A Study of the Effect of
Source Sparsity for Various Transforms on Blind Audio Source Separation
Performance”. In Proceedings Workshop on Signal
Processing with Adaptive Sparse Structured Representations
(SPARS’05), Rennes,
France,
Nov 2005. pdfsound
samples
Reports and Thesis
Vincent Y. F. Tan, “Blind Audio
Source Separation”. M.Eng. Final Report, Signal Processing Laboratory,
Cambridge University Engineering Department, Jun. 2005. pdf
Vincent Y. F. Tan, “An
Algorithm for Finding Equivalent Sources For a Wave Scattering
Problem”. Summer Undergraduate Research Fellowship (SURF) Final
Report, Applied and Computational Mathematics, Caltech, Aug. 2004. France,
Nov 2005. pdf
Term Projects
Vincent Y. F. Tan, “Information
Geometry Analysis of Learning Mixtures of Trees”. MIT 6.441 Information Theory, EECS,
MIT, May 2008.pdf
Vincent Y. F. Tan, “Learning
Graphical Models using Information Criteria and Maximum Entropy
Relaxation”. MIT 6.867 Machine
Learning, EECS, MIT, Dec 2007.pdf
Vincent Y. F. Tan, “Estimating the
Parameters of a Signal with Finite Rate of Innovation from Noisy Samples:
Deterministic and Stochastic Algorithms”. MIT6.342 Wavelets, Approximation and Compression,
EECS, MIT, May 2007. pdf
Vincent Y. F. Tan, “Stochastic
Optimization of Keane's Bump Function”. CUED 5R1 Stochastic
Optimization Coursework, EIST, Jun. 2005. pdf
Vincent Y. F. Tan, “Stochastic
Processes: The Gibbs Sampler and the Straight Line”. CUED 5R1
Stochastic Processes Coursework, EIST, Jun. 2005. pdf
Vincent Y. F. Tan,
“Newsvendors Tackle the Newsvendors Problem”. CUED 4E9:
Quantitative Techniques in Operations Management, EIST, Jun. 2005. pdf
Presentations
“Information Geometry and Mixtures
of Trees”. 6.441 Information Theory, May 2008. pdf
“Boosted Graphical Model
Classifiers”. RQE Talk,
May 2008.
“Learning Max-Weight Discriminative
Forests”. Presented at ICASSP,
Apr 2008. pdf
“Learning Max-Weight Discriminative
Forests”. Presented at LIDS
Student Conference, Jan 2008. pdf
“Learning Graphical Models and
Max-Weight Discriminative Forests for Hypothesis Testing” Presented at Lincoln Labs Seminar, Jan 8, 2008. pdf
“Learning Graphical Models for
Hypothesis Testing”. Presented
at SSG Seminar, Oct 2007. pdf
“Learning Graphical Models for
Hypothesis Testing”. Poster
presented at SSP 2007, Madison, Wisconsin, Aug 2007. pdf
“Privacy-Preserving
Sharing of Horizontally-Distributed Private Data for Constructing Accurate
Classifiers”. Presented at PinKDD
2007, San Jose, California, Aug 2007. pdf
“Generic Probability Density
Function Reconstruction for Randomization in Privacy-Preserving Data
Mining “. Presented at MLDM
2007, Leipzig, Germany, Jul 2007. pdf
“Estimating the Parameters of a
Signal with Finite Rate of Innovation from Noisy Samples: Deterministic
and Stochastic Algorithms”. 6.342 Wavelets, Approximation and
Compression, May 2007. pdf
“Blind Audio Source
Separation”. M.Eng. Project Presentation, Jun. 2005. pdf
“The Newsvendor Problem”. CUED
4M9: Quantitative Methods in Operations Management Final Presentation May.
2005. pdf
“An Algorithm For Finding Equivalent
Sources For A Wave Scattering Problem”. Summer Undergraduate
Research Fellowship (SURF) Final Presentation, Caltech, Aug. 2004. pdf