Monica Rogati
Senior Data Scientist @ LinkedIn
Location: Sunnyvale, California (San Francisco Bay Area)
Industry: Internet
Monica Rogati's Overview
Current:
- Senior Data Scientist at LinkedIn
- Advisor at Insight Data Science
Past:
- Research Assistant at Carnegie Mellon University, Computer Science Department
- Research Intern at IBM TJ Watson Research Center
- Research Intern at IBM TJ Watson Research Center
- Research Intern at AT&T Research
- Undergraduate Teaching Assistant & Software Developer at University of New Mexico
- Logic Design Assistant at NASA Microelectronics Research Center @UNM
Education:
- Carnegie Mellon University
- Carnegie Mellon University
- The University of New Mexico
- Tudor Vianu Computer Science High School / Liceul de Informatica
Recommendations: 7 people have recommended Monica
Connections: 500+ connections
Monica Rogati's Summary
Hands-on, no-nonsense data science generalist; pioneered data-driven products with multi-million-dollar business impact. Instrumental in building the LinkedIn product data science team.
Leading a team focused on building data products.
Strong background in applied machine learning (CMU CS PhD), social network analysis, recommender systems, statistical text mining and multilingual information retrieval (search) -- and a knack for branding & personal finance.
I turn data into products, actionable insights and (news) stories.
Specialties: applied machine learning, recommender systems, analytics, data science, data driven products, social network analysis, crowdsourcing, multilingual information retrieval, text classification, statistical machine translation, fraud detection, big data, large-scale applied text mining and data mining, evaluation design, medical applications, statistical natural language processing, clustering, event detection and tracking, personal finance, tomatoes.
________________
Selected Press Coverage (stories I found in the data):
✔ “Top CEO names” -- 200+ stories on NPR, ABC, CBS, CNN, The Financial Times, BloombergTV, Howard Stern :), Real Simple, US News, TechCrunch, Mashable, Time.com, FastCompany, CIO, Forbes.com. 3rd most popular LinkedIn blog post of 2011 (IPO was #10)
✔ “What’s in the DNA of an entrepreneur?” -- stories in TechCrunch, Business Insider, Mashable, San Jose Mercury News, CIO, San Francisco Chronicle, PC World, RWW, VentureBeat
✔ “Best time to get a promotion” -- 120+ stories in the WSJ, Forbes, SF Chronicle, Reuters, CNN, MarketWatch, CNBC, SJ Mercury News, HBR
✔ Forbes interview on “What is a Data Scientist?”
✔ Growth of data scientists chart -- articles in Fortune & the WSJ
✔ The Atlantic article on “The Data Driven Parent” (print edition)
✔ Wall Street Journal (front page, print edition): “In the Search for a Hot Job Title, Enter the Ninja”
________________
Selected talks, interviews and videos below.
Monica Rogati's Experience
Senior Data Scientist
Public Company; 1001-5000 employees; LNKD; Internet industry
May 2008 – Present (5 years 1 month)
How do you know Darth Vader’s LinkedIn account is not real? How do you train a recommender system *before* it goes live? What are the hottest companies this year, and more importantly, how do you leverage terabytes of data to define “hot”? What algorithms will find your next sharp CTO in the 10^8 LinkedIn profile haystack? I answer these types of questions every day - and more importantly, I ask them.
The hats: applied machine learning scientist, hands-on prototyper, product manager, data miner, talent high-pass filter, mentor, spokesperson, fraud fighter, crowdsourcer, acquisition due diligence, namer … and distributed database guinea pig.
Impact:
✔ Created and implemented the initial LinkedIn job-to-candidate matching system (Talent Match), now a multi-million dollar product.
✔ Developed the first machine learning model for LinkedIn’s People You May Know, immediately doubling invitation rates.
✔ Created and implemented the "Groups You May Like" product (v1.0) resulting in a double digit engagement rate.
✔ Coaxed stories out of terabytes of raw data -- they brought us to Good Morning America, the front page of the Wall Street Journal, The Economist and hundreds of other media outlets. This includes the top shared story in LinkedIn’s history (on overused buzzwords), job title trends, successful first names, “hot” companies and industries, labor market movements etc.
✔ Pioneer, scientist, and evangelist for LinkedIn’s recommender system line of products: Jobs You May Be Interested In, Similar Jobs, Talent Match etc. Technical contributions include: solving the cold start training problems, a generalized metrics / evaluation framework, feature engineering, model building & visualization.
✔ Crowdsourcing specialist (Mechanical Turk, Crowdflower, Samasource).
✔ Fraud detection (fake accounts, spam, TOS violations).
✔ Co-organized the LinkedIn external tech talk series & the data science summer intern program
Advisor
Insight Data Science
Privately Held; 1-10 employees; Higher Education industry
2012 – Present (1 year)
Research Assistant
Carnegie Mellon University, Computer Science Department
Educational Institution; 5001-10,000 employees; Higher Education industry
September 2000 – May 2008 (7 years 9 months)
Advisors: Yiming Yang and Jaime Carbonell.
Feature selection for text classification. Cross-lingual information retrieval using parallel corpora, including full system implementation. Multilingual search competitions, named entity detection, event detection and tracking. Thesis work on domain adaptation of translation models, as applied to Cross-Language Information Retrieval, Event Tracking and Machine Translation. Crucial role in the CMU DARPA GALE project: grant proposal co-writing, end-to-end system architecture and design, project coordinator across CMU, UPitt & IBM; user study design, evaluation dataset design.
Research Intern
IBM TJ Watson Research Center
Public Company; 10,001+ employees; IBM; Information Technology and Services industry
May 2005 – August 2005 (4 months)
Research Intern
IBM TJ Watson Research Center
Public Company; 10,001+ employees; IBM; Information Technology and Services industry
May 2003 – August 2003 (4 months)
Unsupervised learning for Arabic stemming. Work published: ACL'03.
Research Intern
AT&T Research
Public Company; 10,001+ employees; T; Telecommunications industry
May 2000 – August 2000 (4 months)
Text mining and machine learning for the DARPA Communicator. Work published: NAACL'01, ACL'01.
Undergraduate Teaching Assistant & Software Developer
University of New Mexico
Educational Institution; 10,001+ employees; Higher Education industry
August 1997 – December 1999 (2 years 5 months)
Teaching Assistant, UNM CS Department (multiple classes, starting as a sophomore)
Scheme, C++, intro to algorithms.
Developed an automatic grader application to evaluate student projects and e-mail a progress report.
Software Developer, UNM CS Department
Led a team developing a web-based grade database (SQL, C++) used by the three largest classes in the CS department.
Logic Design Assistant
NASA Microelectronics Research Center @UNM
1996 – 1999 (3 years)
Developed a software simulation of a packetizing chip and a multiplexer usage optimization for multiple-output Boolean functions; disassembled and redesigned a microcontroller test bench.
Responsible for student assignment evaluation for a logic design class.
Monica Rogati's Skills & Expertise
- Recommender Systems
- Social Networking
- Analytics
- Crowdsourcing
- Information Retrieval
- Machine Learning
- Text Mining
- Data Mining
- Text Classification
- Text Analytics
- Personal Finance
- Machine Translation
- Natural Language Processing
- Collaborative Filtering
- Information Extraction
- Clustering
- Hadoop
- Classification
- Predictive Modeling
- R
- Data Visualization
- data stories
- Python
- Data Science
- Apache Pig
- Big Data
- Data
- Public Speaking
- Social Network Analysis
Monica Rogati's Education
Carnegie Mellon University
Ph.D., Computer Science
2003 – 2008
Thesis:
Domain Adaptation of Translation Models for Multilingual Applications
Feature selection for text classification. Cross-lingual information retrieval using parallel corpora; including full system implementation. Event Tracking; Machine Translation. DARPA GALE Distillation: proposal writing, end-to-end system design and specification, user study design, evaluation dataset design.
Carnegie Mellon University
MS, Computer Science
2000 – 2003
TA'd Data Structures and Algorithms (Java). Designed exam questions and assignments, lectured to a class of 150+.
The University of New Mexico
BS, Computer Science
1996 – 2000
First in my graduating class (summa cum laude)
Undergraduate research in neural networks and NLP.
Undergraduate TA for functional programming (Scheme).
Tudor Vianu Computer Science High School / Liceul de Informatica
Computer Science
Monica Rogati's Patents
Trainable Sentence Planning System
- United States Patent 7,729,918
- Issued June 1, 2010
Inventors: Monica Rogati, Owen Christopher Rambow (2nd), Marilyn A. Walker (1st)
Methods and systems for exploring career options
- United States Patent US20120226623 A1
Inventors: Monica Rogati, Micah Alpern, Russell Jurney, Josh Fleetwood, DJ Patil, Peter Skomoroch, Chris Riccomini
Methods and systems for identifying similar people via a business networking service
- United States Patent Application 13/194,883
- Filed July 29, 2011
Techniques for identifying and presenting member profiles similar to a source member profile are described. With some embodiments, a general recommendation engine is used to extract features from member profiles, and then store the extracted features, including any computed, derived or retrieved profile features, in an enhanced member profile. In real-time, the general recommendation engine processes client requests to identify member profiles similar to a source member profile by comparing select profile features stored in the enhanced member profile with corresponding profile features of the source member profile, where the comparison results in several similarity sub-scores that are then combined in accordance with directives set forth in a configuration file. Finally, the member profiles with the highest similarity scores corresponding with the user-selected member profile are selected, and in some instances, presented to a user.
Providing Recommendations to Members of a Social Network
- United States Patent Application 13/780,116
Presenting Actionable Recommendations to Members of a Social Network
- United States Patent Application 13/780,198
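The abstract of "Methods and systems for identifying similar people via a business networking service" describes comparing profile features, producing several similarity sub-scores, and combining them according to a configuration file. A minimal sketch of that general idea follows. It is illustrative only and is not the patented or production implementation: the feature names, the comparators (`jaccard`, `exact_match`), the weighted-sum combination, and the `CONFIG` dictionary standing in for the configuration file are all assumptions made for this example.

```python
from typing import Callable, Dict, List, Tuple

Profile = Dict[str, object]


def jaccard(a: set, b: set) -> float:
    """Set overlap in [0, 1]; two empty sets score 0."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def exact_match(a: str, b: str) -> float:
    """1.0 when the two values are identical, else 0.0."""
    return 1.0 if a == b else 0.0


# Hypothetical stand-in for the configuration file:
# feature name -> (comparator producing a sub-score, weight).
CONFIG: Dict[str, Tuple[Callable, float]] = {
    "skills": (jaccard, 0.6),
    "industry": (exact_match, 0.3),
    "title": (exact_match, 0.1),
}


def similarity(source: Profile, candidate: Profile, config=CONFIG) -> float:
    """Combine per-feature sub-scores as a weighted sum (an assumed rule)."""
    total = 0.0
    for feature, (compare, weight) in config.items():
        total += weight * compare(source[feature], candidate[feature])
    return total


def most_similar(source: Profile, candidates: List[Profile], k: int = 2):
    """Return the k highest-scoring (name, score) pairs, best first."""
    scored = [(c["name"], similarity(source, c)) for c in candidates]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


# Made-up example profiles, for illustration only.
source = {"name": "src", "skills": {"python", "hadoop", "nlp"},
          "industry": "Internet", "title": "Data Scientist"}
candidates = [
    {"name": "a", "skills": {"python", "hadoop"},
     "industry": "Internet", "title": "Data Scientist"},
    {"name": "b", "skills": {"java"},
     "industry": "Internet", "title": "Engineer"},
    {"name": "c", "skills": {"nlp"},
     "industry": "Finance", "title": "Analyst"},
]
top = most_similar(source, candidates)
```

Keeping the comparators and weights in one table means the way sub-scores are combined can be changed without touching the scoring loop, which is one plausible reading of the configuration-driven design the abstract mentions.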
Monica Rogati's Publications
Bridging offline and online social graph dynamics
- CIKM 2012
- 2012
Authors: Monica Rogati, Manuel Gomez Rodriguez
Identifying similar people in professional social networks with discriminative probabilistic models
- SIGIR 2011 poster paper
- 2011
Authors: Monica Rogati, Suleyman Cetintas (1st), Luo Si (3rd), Yi Fang (4th)
Modeling relationship strength in online social networks
- ACM International Conference on World Wide Web (WWW 2010)
- 2010
Authors: Monica Rogati, Rongjing Xiang (1st), Jennifer Neville (2nd)
High-Performing Feature Selection for Text Classification
- ACM CIKM International Conference on Information and Knowledge Management (CIKM 2002)
- 2002
Authors: Monica Rogati, Yiming Yang
Cited by 220+ papers
Unsupervised Learning of Arabic Stemming using a Parallel Corpus
- Association for Computational Linguistics (ACL 2003)
- 2003
Authors: Monica Rogati, Scott McCarley, Yiming Yang
Resource Selection for Domain Specific CLIR
- ACM SIGIR Special Interest Group on Information Retrieval Conference (SIGIR 2004)
- 2004
Authors: Monica Rogati, Yiming Yang
BLANC: learning evaluation metrics for Machine Translation
- North American Chapter of the Association for Computational Linguistics - Human Language Technology (HLT 2005)
- 2005
Authors: Monica Rogati, Lucian Vlad Lita (1st), Alon Lavie (3rd)
Utility-based Information Distillation Over Temporally Sequenced Documents
- ACM SIGIR Conference on Research & Development on Information Retrieval (SIGIR 2007)
- 2007
Authors: Monica Rogati, Yiming Yang et al. (Monica Rogati is the last author)
Evaluating a Trainable Sentence Planner for a Spoken Dialogue Travel System
- The Association for Computational Linguistics (ACL 2001)
- 2001
Authors: Monica Rogati, Owen Rambow (1st), Marilyn Walker (3rd)
SPoT: A Trainable Sentence Planner
- North American Chapter of the Association for Computational Linguistics (NAACL 2001)
- 2001
Authors: Monica Rogati, Marilyn Walker (1st), Owen Rambow (2nd)
Cross-Lingual Pseudo-Relevance Feedback Using a Comparable Corpus
- Cross-Language Evaluation Forum (CLEF 2001)
- 2001
Authors: Monica Rogati, Yiming Yang
Training a Sentence Planner for Spoken Dialog: The Impact of Syntactic and Planning Features
- European Conference on Speech Processing (EUROSPEECH 2001)
- 2001
Training a Sentence Planner for Spoken Dialogue Using Boosting
- Computer Speech and Language Special Issue on Spoken Language Generation (CSL 2002)
- 2002
Authors: Monica Rogati, Marilyn Walker (1st), Owen Rambow (2nd)
Cross Lingual QA: A Modular Baseline in CLEF 2003
- Cross-Language Evaluation Forum (CLEF 2003)
- 2003
Authors: Monica Rogati, Lucian Vlad Lita (1st), Jaime Carbonell (3rd)
Multilingual Information Retrieval using Open, Transparent Resources in CLEF 2003
- CLEF 2003
- 2003
Authors: Monica Rogati, Yiming Yang
Customizing Parallel Corpora at the Document Level
- Association for Computational Linguistics (ACL 2004)
- 2004
Authors: Monica Rogati, Yiming Yang
Cross-Language Event Tracking
- Asia Information Retrieval Symposium (AIRS 2004)
- 2004
Authors: Monica Rogati, Nianli Ma (1st), Yiming Yang (2nd)
Combining Categorization-based and Corpus-based Approaches for CLIR
- The Florida Artificial Intelligence Research Society Conference (FLAIRS 2005)
- 2005
Authors: Monica Rogati, Yiming Yang (1st), Bryan Kisiel (3rd)
BLANC: Learning Evaluation Metrics for MT
- Human Language Technology and Empirical Methods in Natural Language Processing Joint Conference (HLT-EMNLP 2005)
- 2005
Authors: Monica Rogati, Lucian Lita (1st), Alon Lavie (3rd)
An Evaluation of Adaptive Filtering in the Context of Realistic Task-Based Information Exploration
- Information Processing and Management (IP&M)
- 2008
Authors: Monica Rogati, Daqing He et al. (Monica Rogati is the last author)
Corpus microsurgery: criteria optimization for medical cross-language IR
- ACM conference on Information and knowledge management (CIKM 2008)
- 2008
Authors: Monica Rogati, Yiming Yang, Jaime Carbonell
Monica Rogati's Volunteer Experience & Causes
Volunteer Interests
Organizations I support:
- Kiva.org
- Oxfam
- DonorsChoose.org
Monica Rogati's Additional Information
Honors and Awards:
- Outstanding Junior & Senior of the Year
- CRA Outstanding Undergraduate 2000 Honorable Mention
- UNM university-wide commencement speaker
- Regional ACM programming contest: 1st & 2nd place (undergrad)
- Full ride+ scholarship from the NASA Microelectronics Research Center
Contact Monica for:
- career opportunities
- consulting offers
- new ventures
- expertise requests
- getting back in touch