Curriculum Vitae
QIN GAO
Language Technology Institution
Carnegie Mellon University
407 South Craig Street
Pittsburgh, PA 15213
(412) 268-5634 (Office)
(412) 567-6866 (Home)
email: q
...@cs.cmu.edu
EDUCATION
Master Student
Language Technology Institution, Carnegie Mellon University
August, 2007 – August 2009
GPA: 3.91
Master of Engineering
National Key Laboratory for Machine Perception, Peking University
September, 2004 – June, 2007
GPA: 3.82
Diploma Thesis: Research and Implementation of Chinese Spoken Document Retrieval System
Graduate with First Honor
Bachelor of Science
School of Mathematics Science, Peking University
Major: Mathematics, Scientific and Engineering Computing
September, 2000 – June 2004
GPA: 3.25
Graduate Research: Automatic Spoken English Quality Evaluation System
Second Major: Economics
PUBLICATIONS
- Qin Gao, Stephan Vogel, "Parallel Implementations of Word Alignment Tool", Software Engineering, Testing, and Quality Assurance for Natural Language Processing, pp. 49-57, June, 2008 pdf∞ bib∞
- Nguyen Bach, Qin Gao, Stephan Vogel, "Improving Word Alignment with Language Model Based Confidence Scores", Proceedings of the Third Workshop on Statistical Machine Translation, pp. 151-154, June, 2008. pdf∞ bib∞
- Almut Silja Hildebrand, Kay Rottmann, Mohamed Noamany, Qin Gao, Sanjika Hewavitharana, Nguyen Bach, Stephan Vogel, "Recent Improvements in the CMU Large Scale Chinese-English SMT System", Proceedings of ACL-08: HLT, Short Papers, pp. 77-80, June, 2008 pdf∞ bib∞
- Qin Gao, Xiaojun Lin, Xihong Wu, "Just-in-time Latent Semantic Adaptation on Language Model for Chinese Speech Recognition Using Web Data", International Workshop on Spoken Language Technology(SLT), pp.50-53, September, 2006. abstract & fulltext∞
- Runqiang Han, Pei Zhao, Qin Gao, Zhiping Zhang, Hao Wu, Xihong Wu, "CASA Based Speech Separation for Robust Speech Recognition", International Conference on Speech and Language Processing(ICSLP), pp.78-81, September, 2006. pdf∞
RESEARCH
Statistical Machine Translation:
GALE Project (2007-2008)
Participated in GALE machine translation tasks : Chinese-English Machine Translation Task in 2007, Arabic-English Machine Translation Task in 2008
Parallelized the training procedure of phrase-based statistical machine translation system
Parallelized the Word Alignment program (GIZA++)
Parallelized Phrase Extraction and Feature Estimation procedure on Hadoop platform
Improved the efficiency of SMT training procedure from than one week to less than 2 days by combining both of the new technologies.
Proposed and implemented corpus re-sampling technology using monolingual language models
NIST Machine Translation Evaluation (2005)
Implemented word based statistical machine translation decoder, added simulation annealing to the greedy algorithm.
Speech Recognition
Designed and implemented the PULSAR decoder based on Weighted Finite State Transducer and models from
SphinxTrain. (2003-2006)
Participated in building CASA-Based speech separation system, responsible for the pitch tracking and spectrum re-construction modules.(2006)
Proposed and implemented fast latent semantic adaptation for language models using automatically fetched data from the web search engine.(2006)
Built real-time TV caption alignment system, which was entitled Special Prize in the 3rd Beijing Challenge Cup Competition, and First Prize in the 9th National Challenge Cup Competition.(2004-2005)
Participated in 2003, 2004, 2005 National 863 Evaluation on Mandarin Speech Recognition.(2003-2005)
Built an oral Spoken English evaluation system using Verbal Information Verification technology.(2004)
Information Extraction
Whisper Project (2008)
Worked on automatic data labeling using Mechanic Turks service, currently working on selecting samples to be labeled, and maximize the t may boost the performance using multiple classifiers, so as to reduce the cost of obtaining labels.
Image Processing
Worked on OCR of television captions, worked on caption frame boundary detection and caption region detection. (2004-2005)
TEACHING
Teaching Assistant of the course Spoken Language Processing, taught by Prof. Huisheng Chi and Prof. Xihong Wu, in spring 2007 semester at Peking University
WORK HISTORY
Qunar.com Inc, Software engineer. In charge of implementing web content extraction system.(Aug. 2006 – Jun.e, 2007)
Hope Software, Software engineer. Worked on Television decision support system.( Jul. 2004–Nov. 2004)
Peking University, Network administrator. In charge of design and technical support of the network of School of Mathematics Science, (Jan. 2002–Jul. 2004).
Volunteer of International Conference of Mathematicians 2002, Group leader, leading the network support group. (Sep. 2002)
Chemistry Institution, Software engineer. China Academic of Science, building instrument control and data acquiring programs. (Sep. 2001–Oct. 2001)
HONORS
“May 4th Scholarship”, Peking University, Sep. 2006
“Silver Prize”, 9th Business Plan Competition of PKU. Dec. 2005
“First Prize”, 9th National Challenge Cup, with the work “Solution on Automatic Manufacture of TV Captions”. Nov. 2005
“May 4th Scholarship”, Peking University, Sep. 2005
“Special Prize”, 7th Challenge Cup of Capital College Students, with the work “Solution on Automatic Manufacture of TV Captions”, Jun. 2005
“Best Paper Award”, 8th National Conference on Human Machine Speech Communication, Sep. 2005
There are 6 comments on this page. [Display comments]