
Search and NLP Software Engineer; Author
Raleigh-Durham, North Carolina Area

I am a founder of Lucid Imagination and a long-time software engineer. I am a committer on the Apache Lucene Java, Solr and Mahout projects, a Lucene PMC member and an Apache Software Foundation Member. I have been programming for 11+ years and love the small-company experience, as it is often a chance to wear many hats, ranging from project management to research to hard-core development. I am the author of the "Search Smarter with Apache Solr" series at IBM developerWorks (see http://www.ibm.com/developerworks/views/java/libraryview.jsp?search_by=apache+solr).
I am mostly interested in Information Retrieval and Natural Language Processing (NLP) topics like search, text categorization, document summarization, etc. I started my career doing parallel/distributed electromagnetics simulations, did a few years in configuration management, but have spent most of my time working on NLP topics. I mostly work in Java, but have done a fair amount of Perl and C++ in my past. I wish I had more time to spend learning Ruby, as the basics that I have explored look really interesting.
In my spare time, I take my son to sports and other activities. I like hiking, photography and fly-fishing. I also manage to swim-bike-run on a regular basis. During the winter, I try to get to a Hurricanes game at least once a month.
Information Retrieval, machine learning, and question answering. Java programming. Lucene, Solr, Hadoop, Nutch, Mahout.
(Non-Profit; Computer Software industry)
February 2008 — Present (7 months)
Co-creator of the Mahout machine learning project. The goal of Mahout is to create scalable, Apache-licensed machine learning algorithms in a variety of areas, including, but not limited to: clustering, classification and categorization, feature reduction, summarization, sentiment analysis and graph importance.
http://lucene.apache.org/mahout
(Non-Profit; 201-500 employees; Computer Software industry)
January 2008 — Present (8 months)
(Privately Held; 11-50 employees; Computer Software industry)
August 2007 — Present (1 year 1 month)
Lucid Imagination: Offers a Lucene/Solr-based platform for building applications (easy install and upgrades, GUI, extra tools). The product includes business-hours support (24 x 7 will be available soon). Support contracts are also available for direct users of the open source version.
The founding team includes Erik Hatcher, Grant Ingersoll, Marc Krellenstein and Yonik Seeley.
(Non-Profit; Computer Software industry)
May 2007 — Present (1 year 4 months)
(Self-Employed; Myself Only; Computer Software industry)
January 2007 — Present (1 year 8 months)
Author
(Non-Profit; Computer Software industry)
May 2006 — Present (2 years 4 months)
Committer on the Apache Lucene Java project.
(Educational Institution; 11-50 employees; Computer Software industry)
March 2004 — August 2007 (3 years 6 months)
(Privately Held; 11-50 employees; Computer Software industry)
March 2001 — October 2003 (2 years 8 months)
Senior Java Developer on configuration management software products.
(Privately Held; 11-50 employees; Computer Software industry)
January 1999 — March 2001 (2 years 3 months)
Worked on a cross-language Information Retrieval product called CINDOR. Worked in Java, C++ and Perl.
(Privately Held; 51-200 employees; Computer Software industry)
January 1999 — March 2001 (2 years 3 months)
Same as MNIS-Textwise Labs. Development of a cross-language search application.
(Educational Institution; 5001-10,000 employees; Computer Software industry)
June 1995 — August 1995 (3 months)
Summer internship in parallel computing architectures, programming, etc.
MS, Computer Science, 1996 — 2000
BA, Math, Computer Science, 1992 — 1996
fishing, hiking, photography, triathlon, Lucene
TriJug