Technical Manager, Data Analytics & Visualization
- London, United Kingdom
Andrew Clegg's Overview
- Data Analytics Technical Manager at Pearson
- Visiting Lecturer at Birkbeck, University of London
- Information Scientist at National collaborating centre for Women's and children's health
- Senior Analyst Programmer at AstraZeneca
- Freelance Developer at Bomb Productions
- Birkbeck, U. of London
- Birkbeck, U. of London
- University College London, U. of London
Andrew Clegg's Summary
I'm a data science/engineering lead with a diverse background taking in computational linguistics, information retrieval, bioinformatics, online music, publishing, gaming and social media.
A frequent speaker at conferences and tech meetups, I'm used to engaging with technology professionals at all levels, from engineering teams to C-level executives. I am equally happy advising on enterprise data strategy, or tuning data processing algorithms for speed and scalability.
Specialties: Architecture, design and implementation of distributed analytics and search systems. Algorithms and methods for data and text mining. Visualizing and communicating the trends and patterns in data. Technical leadership of specialized data science and engineering teams.
Andrew Clegg's Experience
Data Analytics Technical Manager
Public Company; 10,001+ employees; PSON; Publishing industry
December 2011 – Present (1 year 6 months) London
My team provides analytical expertise, and data mining and visualization services, to companies across the Pearson group.
Our projects include visualizing high-throughput, loosely-structured data streams in near-real-time, modelling pricing and purchasing behaviour, and building recommendation and search engines.
We concentrate on scalable solutions using open-source tools where possible. Our engineering work is currently focused on ElasticSearch, Hadoop, HBase and D3.js, and we also employ a number of tools and languages including R, Tableau, Pig and Python for data analysis.
Public Company; 51-200 employees; Internet industry
February 2011 – December 2011 (11 months)
I worked in the Music Information Retrieval team which is responsible for automated recommendation systems, taste-based radio playlist generation, prototyping of new features and mash-ups, and mining and visualization of user and music data.
Senior Research Associate
Educational Institution; 5001-10,000 employees; Research industry
June 2008 – January 2011 (2 years 8 months)
Designing and developing software and methods for management, analysis and visualization of molecular biology data.
Educational Institution; 1001-5000 employees; Higher Education industry
August 2007 – June 2009 (1 year 11 months)
Writing and teaching Java course for MSc students. Developing training materials on software engineering for PhD students and postdoctoral scientists.
Health, Wellness and Fitness industry
November 2007 – May 2008 (7 months)
Information retrieval and document management specialist for research group developing clinical and public-health guidelines for the NHS.
Senior Analyst Programmer
Public Company; 10,001+ employees; AZN; Pharmaceuticals industry
September 2001 – September 2003 (2 years 1 month)
Deputy technical lead in a team of intranet developers.
Andrew Clegg's Honors and Awards
Pearson CIO's Award 2012-13Genevieve Shore, Chief Information Officer, Pearson
- March 2013
"The CIO award goes to Andrew Clegg for the biggest contribution to changing the engineering culture at Pearson Technology and driving innovative business engagement."
Andrew Clegg's Projects
- October 2012 to Present
This is where we occasionally release open-source components that we've built for internal projects.
- September 2009 to Present
I am the designer and webmaster for the website for Penhayl, a self-catering holiday cottage in Cornwall owned by my family.
- January 2011 to Present
My Github site, where I've published various tools, script and experiments in data-driven programming.
Andrew Clegg's Skills & Expertise
Andrew Clegg's Publications
Andrew Clegg's Education
Birkbeck, U. of London
PhD, Computational Linguistics, Text Mining
2003 – 2007
Thesis title: "Computational-Linguistic Approaches to Biological Text Mining".
Funded by the Biotechnology & Biological Sciences Research Council, and AstraZeneca PLC.
Birkbeck, U. of London
MSc, Bioinformatics (Distinction)
2001 – 2003
MSc on the application of computer science, mathematics and statistics to problems in biological science.
Final-year project: Developing a semantic search engine for biological journal abstracts.
University College London, U. of London
BSc Hons. (2:1), History & Philosophy of Science
1998 – 2001
BSc on the history, philosophy and social impact of science, technology and medicine.
Andrew Clegg's Additional Information
Data mining, text mining, information retrieval, search, machine learning, big data, bioinformatics, computational linguistics, natural language processing, social networks, scalability, distributed computing, grids, clouds, statistics, algorithms, music.
- Groups and Associations:
London BioGeeks (co-founder), London Java Community, London Clojure Dojo, Hadoop UK User Group.
- Honors and Awards:
Invited speaker at Big Data Week 2013, QCon 2013, Business Analytics 2012, Big Data Analytics 2012, Pearson Data Summit, Hadoop UK User Group, ElasticSearch London User Group, Cambridge University Computing Lab, Royal Statistical Society.
Past peer reviewer for several journals and conferences in the fields of bioinformatics and text mining.