Tech Lead at Twitter
- San Francisco Bay Area
John Wang's Summary
Building the next search infrastructure @Twitter to handle hundreds of millions of updates and billions of queries a day! #jointheflock
● Creator and project lead for SenseiDB
● Co-Creator of LinkedIn Signal
● Responsible for large scale realtime faceted search technology.
● Avid contributor to the open source community:
▸ Sensei - http://www.senseidb.com
▸ Zoie - http://javasoze.github.com/zoie
▸ Bobo - http://javasoze.github.com/bobo
Some talks I gave:
▸ Keynote at QCon China 2011
▸ Speaker at QCon San Francisco 2010
▸ Speaker and Panel at Lucene Revolution 2010
▸ Guest Lecture at University of Utah 2010
▸ Speaker at SDForum 2009
Specialties: full text search, semi-structured search, vertical search, faceted search, Lucene, Java, J2EE, Solr, open source, MySQL, Python, Django
John Wang's Experience
Public Company; 1001-5000 employees; TWTR; Internet industry
June 2012 – Present (2 years 4 months) San Francisco Bay Area
Working on the next generation search infrastructure at Twitter.
John Wang's Projects
- November 2009 to Present
Open-source, distributed, realtime, semi-structured database
Powering LinkedIn homepage and LinkedIn Signal.
- September 2010 to Present
Personalized faceted view of network update stream and tweets
- August 2007 to January 2011
LinkedIn Search Products
- 2010
Contributed a chapter on search technology behind LinkedIn.com
- January 2011 to Present
Backend for the LinkedIn Homepage. Built on top of SenseiDB.
Unified Social Content Platform (USCP)
- July 2011 to Present
The next-generation activity platform for LinkedIn
John Wang's Languages
English (Native or bilingual proficiency)
Chinese (Native or bilingual proficiency)
John Wang's Patents
- United States Patent 7,461,085
- Issued February 14, 2006
A method of parametric group processing includes forming a parametric index from an indexed database. A first parametric group and a second parametric group corresponding to elements in the parametric index are specified. The first parametric group and the second parametric group are merged to produce a merged parametric group. A parametric result is extracted from the merged parametric group, where the parametric result specifies a set of documents.
- United States Patent 20040103087
- Issued May 27, 2004
A method of combining information from multiple heterogeneous workers comprises transmitting a first search request to a search worker to assist the search worker in searching a first database and returning a first results set. A second search request is directed to a peer worker to assist the peer worker in initiating a search of a second database across a network asynchronously from the search worker and returning a second results set. The first results set and second results set are then incorporated into a composite results set.
- United States Patent 20060294192
- Issued December 28, 2006
Access control systems and methods regulate access to shared content items in a corpus using visibility tokens. A user provides other users with access to a content item by associating a content token with the content item and associating a matching user token with each user who is to be granted access. A user who attempts to access the content item succeeds only if that user has a user token matching the content token associated with the content item. User tokens can be propagated automatically from one user to another, e.g., based on trust relationships among the users. Content tokens can be indexed with content items so that when a user searches the corpus, a search engine can detect matches between user tokens and content tokens and filter the search results based on whether they are visible to the querying user.
- United States Patent Application 20120072432
- Filed September 20, 2011
A network update interface is presented to a user on a network to display network updates from other users of a mutual social-networking site. The network updates shared by the other users are gathered in a stream and supplied to a facet-filtering system including a network update interface. The user controls the display of certain network update items according to facet-filter characteristics enabled in facet-filter selection panels in the network update interface. The facet-filter characteristics are used by a facet filter to select certain network updates for display to the user in the network update interface. Trending links to further articles with content corresponding to the facet-filter characteristics are displayed to the user according to greatest popularity among the other users. Links to the profiles of the users sharing the articles are also provided in the network update interface.
- United States Patent 20140059141
- Issued August 23, 2012
Techniques for identifying and organizing electronic messages are described. With some embodiments, a method receives a user interaction update from a messaging client associated with a first user. The user interaction update has data indicating that the first user interacted with an electronic message. The method may then determine that the first user is associated with a classification attribute that defines a relationship in a social network. Using the user interaction update, a social classification rule corresponding to the classification attribute associated with the first user is then updated. The social classification rule may include data usable to organize a plurality of electronic messages. Finally, the method may optionally send the updated social classification rule to a messaging client associated with a second user, where the updated social classification rule can be used to organize electronic messages displayed by the messaging client associated with the second user.
John Wang's Publications
- Very Large Data Bases (VLDB)
In this position paper, we propose social navigation as a paradigm for information access. We define social navigation as navigation through explicit manipulation of a social lens and offer examples of its application.
John Wang's Skills & Expertise
- Search engine technology
- Search algorithms
- Distributed systems
- Distributed databases
- Open Source
- Vertical Search
- Social Media
- Social Networking
- Java Enterprise Edition
- Information Retrieval
- Real Estate Investor
- Full Text Search