Consultant at Collexis
Washington D.C. Metro Area
Computer programmer and applied researcher who focuses on applying machine learning techniques to very large data sets, with an extensive background in natural language processing.
Text mining, data mining, machine learning, data visualization, statistical modeling, natural language processing (NLP)
(Government Agency; Research industry)
December 2009 — Present (1 month)
(Government Agency; Research industry)
March 2008 — Present (1 year 10 months)
I work half time on the Research, Condition, and Disease Categorization (RCDC) system at the National Institutes of Health.
I wrote software in Java, Python, SQL, and R for document classification, mining biomedical concepts from text, data clustering and visualization, statistical analysis, and automated quality control of text extracted from various electronic sources. I also wrote programs to help the RCDC thesaurus team with Collexis' core ontology-based NLP tools.
(Public Company; MOT; Telecommunications industry)
June 2005 — September 2005 (4 months)
Center for Human Interaction Research
Schaumburg, IL
(Public Company; MOT; Telecommunications industry)
June 2004 — September 2004 (4 months)
Center for Human Interaction Research
Schaumburg, IL
(Educational Institution; Higher Education industry)
August 2001 — August 2003 (2 years 1 month)
Voice and Natural Language Laboratory
Department of Computer Science
Durham, NC
PhD, Computational Linguistics, 2003 — 2008
MA, Linguistics, 1998 — 2001
Association for Computational Linguistics, NIH Judo Club, United States Judo Federation
Ohio State Graduate School Presidential Fellowship, 2007-2008