Whois

eugenewu at mit dot edu

I'm an N'th year database graduate student in CSail at mit. My advisor is Sam Madden.

I previously studied at UC Berkeley and finished Fall 2006

I am currently

Co-teaching an IAP class called Introduction to Data Literacy with my buddy Adam Marcus. It is a heavily lab based course that introduces students to many basic data cleaning, analysis, and visualization techniques.

I Enjoy

I've worked at

Current Projects

Ranked Provenance

Coming up with approaches to efficiently store, represent and prioritize provenance data

DBTruck

A tool to import your data into whatever data store you want, as painlessly as possible. See article for motivation

Qurk

A look at optimizing human computation through a database lens. Qurk is a database prototype that enables users to write queries that compute results from both machines and humans. With adam marcus.

Random Projects

VLDB Conference Trends

A quick analysis of top keywords in VLDB conference paper titles in the past 11 years

Past Projects

WebTables

A look into the properties of structured data at the internet scale. With michael cafarella, yang zhang, nodira k., daisy wang and alon halevy.

Waapsi

An experimental course scheduling system. Tries to make the user experience not suck by using JS. This was around the time google calendar came out. With sukhchander khanna

SASE

System for declaratively filtering and correlating streams of events from sensor and rfid devices. Extends YFilter's core query processing engine. With yanlei diao and daniel gyllstrom

HiFi @ Berkeley

A Cascading Stream Architecture for Large-Scale Receptor-Based Networks. With the berkeley db group and notably shawn jeffrey and shariq rizvi

Publications

  1. Eugene Wu, Sam Madden: Partitioning Techniques for Fine-grained Indexing ICDE 2011
  2. Eugene Wu, Carlo Curino, Sam Madden: No Bits Left Behind CIDR 2011
  3. Adam Marcus, Eugene Wu, Sam Madden, Robert Miller: Crowdsourced Databases: Query Processing with People CIDR 2011
  4. Carlo Curino, Evan Jones, Raluca Popa, Nirmesh Malviya, Eugene Wu, Sam Madden, Hari Balakrishnan, Nickolai Zeldovich: Relational Cloud: A Database-as-a-Service for the Cloud   CIDR 2011
  5. Carlo Curino, Evan Jones, Yang Zhang, Eugene Wu, Sam Madden: Relational Cloud: The Case for a Database Service   Technical Report
  6. Philippe Cudre-Mauroux, Eugene Wu, Sam Madden: TrajStore: An Adaptive Storage System for Very Large Trajectory Data Sets ICDE 2010
  7. Eugene Wu, Philippe Cudre-Mauroux, Sam Madden: Demonstration of the TrajStore System VLDB 2009
  8. Philippe Cudre-Mauroux, Eugene Wu, Sam Madden: The Case for RodentStore: An Adaptive, Declarative Storage System CIDR 2009
  9. Michael Cafarella, Alon Halevy, Daisy Wang, Eugene Wu, Yang Zhang: WebTables: Exploring the Power of Tables on the Web VLDB 2008
  10. Michael Cafarella, Nodira Khoussainova, Daisy Wang, Eugene Wu, Yang Zhang, Alon Halevy: Uncovering the Relational Web WebDB 2008
  11. Daniel Gyllstrom, Eugene Wu, Hee-Jin Chae, Yanlei Diao, Patrick Stahlberg, Gordon Anderson: SASE: Complex Event Processing over Streams (Demo). CIDR 2007
  12. Eugene Wu, Yanlei Diao, Shariq Rizvi: High-performance complex event processing over streams. SIGMOD Conference 2006
  13. Daniel Gyllstrom, Eugene Wu, Hee-Jin Chae, Yanlei Diao, Patrick Stahlberg, Gordon Anderson: SASE: Complex Event Processing over Streams CoRR 2006
  14. Minos N. Garofalakis, Kurt P. Brown, Michael J. Franklin, Joseph M. Hellerstein, Daisy Zhe Wang, Eirinaios Michelakis, Liviu Tancau, Eugene Wu, Shawn R. Jeffery, Ryan Aipperspach: Probabilistic Data Management for Pervasive Computing: The Data Furnace Project. IEEE Data Eng. Bull.
  15. Michael J. Franklin, Shawn R. Jeffery, Sailesh Krishnamurthy, Frederick Reiss, Shariq Rizvi, Eugene Wu, Owen Cooper, Anil Edakkunni, Wei Hong: Design Considerations for High Fan-In Systems: The HiFi Approach. CIDR 2005
  16. Owen Cooper, Anil Edakkunni, Michael J. Franklin, Wei Hong, Shawn R. Jeffery, Sailesh Krishnamurthy, Frederick Reiss, Shariq Rizvi, Eugene Wu: HiFi: A Unified Architecture for High Fan-in Systems. VLDB 2004

Some Interesting Links

TED talks
See mike draw
Edward Tufte
Datejs, robust javascript date parser
I made tea
p.o.n.p.