
Experienced Search Architect
San Francisco Bay Area

Experienced Search Architect
San Francisco Bay Area
Enjoy building large systems from the ground-up.
Active in the open source community, contributor to many open source projects.
Experienced with search technology, both unstructured and semi-structured.
A Java junkie.
full text search, semi-structured search, vertical search, faceted search, olap, lucene, java, j2ee, solr, open source, mysql
(Privately Held; Internet industry)
August 2007 — Present (2 years 4 months)
Responsible for search at LinkedIn.
Enjoying being part of an incredible team.
* Built a Lucene based real-time indexing/searching system (Zoie) that serves as the foundation for different aspects of LinkedIn search functionality. This system was generously donated by LinkedIn to the open source community:
http://code.google.com/p/zoie
* Led an amazing team and built a distributed search system to allow LinkedIn's social network based people search to scale horizontally. This project marks the beginning of LinkedIn's next-generation search capabilities.
* Working with a couple of super-smart talented engineers on Kamikaze, an open source project that extends Apache Lucene:
1) P4Delta compression set that can be iterated in compressed form.
2) Boolean set/iterators: AND/OR/NOT over abstract docid sets.
3) Learning based algorithm to determine best representation for docid sets with respect to sparsity/density.
Check it out at: http://code.google.com/p/lucene-ext
(Computer Software industry)
February 2005 — Present (4 years 10 months)
Open source project that builds faceted search technology.
http://code.google.com/p/bobo-browse
http://www.browseengine.com
Patents:
* United States Patent 6999971: Apparatus and Method for Parametric Group Processing
* United States Patent 20040103087: Method and apparatus for combining multiple search workers
* United States Patent 20060294192: Access control systems and methods using visibility tokens with automatic propagation
Publications:
* Navigating large-scale semi-structured data in business portals
VLDB Conference Program 2001