| Kim-Anh Do |
Present Title & Affiliation
Primary Appointment
Dual/Joint/Adjunct Appointment
Bio Statement
Kim-Anh Do, Ph.D., is a Professor and Chair in the Department of Biostatistics at M. D. Anderson, and a recipient of the Faculty Scholar Award at M. D. Anderson in 2003. She is a Fellow of the American Statistical Association and the Royal Statistical Society and is an Elected Member of the International Statistical Institute. She has served as a primary statistician or co-investigator on several National Institutes of Health (NIH) funded grants and clinical trials in prostate cancer, epidemiology, leukemia, upper aerodigestive cancer, breast cancer and brain cancer, including the Early Detection Research Network (EDRN) grant, the Prostate SPORE (as Director of the Biostatistics Core), the Breast SPORE, and the Brain SPORE at M. D. Anderson. She has significant publications in statistical methodology, computing, biomedical, and in other applied specialist journals. Her most recent interest is in the development of clustering and analytic methods for genomic and proteomic expressions. She has developed bioinformatics software and authored books: (i) Analyzing microarray gene expression data; (ii) Bayesian Inference for Gene Expression and Proteomics. Her extensive contribution to statistical and cancer research at M.D. Anderson has resulted in more than 150 published articles in the past years. Additional information regarding Dr. Do's educational and professional activities can be found here.
Research Interests
- Computational Statistics and Biostatistics
- Bioinformatics
- Statistical Genetics
- Non-parametric Statistical Methods
Office Address
Department of Biostatistics
1400 Pressler Street
Unit Number: 1411
Houston, TX 77030
Room Number: FCT4.6040
Phone: (713) 794-4155
Fax: (713) 563-4243
Email: kim@mdanderson.org
Education & Training
Degree-Granting Education | |
| 1990 | Stanford University, Stanford, CA, PHD, Statistics |
| 1985 | Stanford University, Stanford, CA, MS, Statistics |
| 1983 | Queensland University, Brisbane, Australia, B.Sc., First Class Honors, Mathematics and Computer Science |
Honors and Awards
| 2012 | Elected Member, International Statistical Institute |
| 2006 | Fellow, American Statistical Association |
| 2005 | Fellow, Royal Statistical Society |
| 2003 | Faculty Scholar Award, University of Texas M.D. Anderson Cancer Center |
| 1994 | Australian Academy of Science Travel Award |
| 1983 | Amy R. Hughes Award, Australian Federation of University Women, Australia |
| 1982 | Caltex Woman Graduate of the Year, Postgraduate Scholarship, University of Queensland |
Professional Memberships
| American Statistical Association, Houston Chapter (HACASA), Houston, TX President, 2003-2004 President-Elect, 2002-2003 |
Selected Publications
Peer-Reviewed Original Research Articles | ||
| 1. | Wang W, Baladandayuthapani V, Holmes C, Do K-A. Bayesian network-based integrative analysis of diverse genomics data. BMC Bioinformatics. In Press. | |
| 2. | León-Novelo LG, Müller P, Arap W, Sun J, Pasqualini R, Do KA. Bayesian decision theoretic multiple comparison procedures: an application to phage display data. Biom J 55(3):478-89, 5/2013. e-Pub 12/2012. PMID: 23281047. | |
| 3. | León-Novelo LG, Müller P, Arap W, Kolonin M, Sun J, Pasqualini R, Do KA. Semiparametric Bayesian Inference for Phage Display Data. Biometrics 69(1):174-83, 3/2013. e-Pub 1/2013. PMCID: PMC3622196. | |
| 4. | Wang W, Baladandayuthapani V, Morris JS, Broom BM, Manyam G, Do KA. Integrative Bayesian Analysis of High-dimensional Multi-platform Genomics Data. Bioinformatics 29(2):149-59, 1/2013. e-Pub 11/2012. PMCID: PMC3546799. | |
| 5. | Bonato V, Baladandayuthapani V, Broom BM, Sulman EP, Aldape KD, Do KA. Bayesian ensemble methods for survival prediction in gene expression data. Bioinformatics 27(3):359-67, 2/2011. e-Pub 12/2010. PMCID: PMC3031034. | |
| 6. | Zhang S, Müller P, Do KA. A Bayesian Semiparametric Survival Model with Longitudinal Markers. Biometrics 66(2):435-43, 6/2010. e-Pub 6/2009. PMCID: PMC3045702. | |
| 7. | Brewster AM, Do KA, Thompson PA, Hahn KM, Sahin AA, Cao Y, Stewart MM, Murray JL, Hortobagyi GN, Bondy ML. Relationship between epidemiologic risk factors and breast cancer recurrence. J Clin Oncol 25(28). e-Pub 9/2007. PMID: 17785707. | |
| 8. | Ji Y, Yin G, Tsui K-W, Kolonin MG, Sun J, Arap W, Pasqualini R, Do K-A. Bayesian mixture models for complex high dimensional count data in phage display experiments. Journal of the Royal Statistical Society: Series C (Applied Statistics) 56(2):139-52, 3/2007. | |
| 9. | Do K-A, McLachlan GJ, Bean R, Wen S. Gene shaving versus mixture models for the clustering of microarray gene expression data. Cancer Informatics 2:25-43, 2007. PMID: No PubMed. | |
| 10. | Kim SJ, Uehara H, Yazici S, Busby JE, Nakamura T, He J, Maya M, Logothetis C, Mathew P, Wang X, Do KA, Fan D, Fidler IJ. Targeting platelet-derived growth factor receptor on endothelial cells of multidrug-resistant prostate cancer. J Natl Cancer Inst 98(11):783-93, 6/2006. PMID: 16757703. | |
| 11. | Do K-A Müller P, Tang F. A Bayesian mixture model for differential gene expression. Journal of the Royal Statistical Society, Series C-Applied Statistics 54(3):1-18, 2005. PMID: No PubMed. | |
| 12. | Do KA, Johnson MM, Lee JJ, Wu XF, Dong Q, Hong WK, Khuri FR, Spitz MR. Longitudinal study of smoking patterns in relation to the development of smoking-related secondary primary tumors in patients with upper aerodigestive tract malignancies. Cancer 101(12):2837-42, 12/2004. PMID: 15536619. | |
| 13. | Do K-A, Green A, Guthrie JR, Dudley EC, Burger HG, Dennerstein L. Longitudinal study of risk factors for coronary heart disease across the menopausal transition. Am J Epidemiol 151:584-93, 3/2000. PMID: 10733040. | |
| 14. | Do K-A, Kirk K. Discriminant analysis of event-related potential curves using smoothed principal components. Biometrics 55:174-81, 3/1999. PMID: 11318152. | |
| 15. | Wood ATA, Do K-A, Broom BM. Sequential linearization of empirical likelihood constraints with application to U-statistics. Journal of Computational and Graphical Statistics 5:365-85, 1996. | |
| 16. | Booth JG, Do K-A. Simple and efficient methods for constructing bootstrap confidence intervals. Computational Statistics 8:333-46, 1994. | |
| 17. | Do K-A, Hall P. Distribution estimation using concomitants of order statistics, with application to Monte Carlo stimulation for the bootstrap. Journal of the Royal Statistical Society, Series B 54:1-14, 1992. | |
| 18. | Do K-A, Hall P. On importance sampling for the bootstrap. Biometrika 78:161-167, 1991. | |
| 19. | Do K-A, McLachlan GJ. Estimation of mixing proportions: a case study. Applied Statistics 33:134-40, 1984. | |
| 20. | Zhang L, Baladandayuthapani V, Mallick BK, Thompson PA Bondy ML, Do K-A. Bayesian hierarchical structured variable selection methods with application to MIP studies in breast cancer. JSSR-Series C. Submitted. | |
Book Chapters | ||
| 1. | Wang W, Baladandayuthapani V, Holmes C, Do K-A. Bayesian graphical models for integrating multi-platform genomics data. In: Advances in Statistical Bioinformatics: Models and Integrative Inferences for High-Throughput Data. Cambridge University Press. In Press. | |
| 2. | Broom BM, Do K-A, Bondy M, Thompson P, Coombes K. Methods for the analysis of copy number data in cancer research. In: Advances in Statistical Bioinformatics: Models and Integrative Inferences for High-Throughput Data. Cambridge University Press. In Press. | |
Books (edited and written) | ||
| 1. | Do K-A, Qin Z, Vannucci M. Advances in Statistical Bioinformatics: Models and Integrative Inferences for High-Throughput Data. Cambridge University Press, 2013. | |
| 2. | Do K-A, Müller P, Vannucci M. Bayesian Inference for Gene Expression and Proteomics. Cambridge University Press, 2006. ISBN: 052186092X. | |
| 3. | McLachlan GJ, Do K-A, Ambroise C. Analyzing Microarray Gene Expression Data. In: Wiley Series in Probability and Statistics. Wiley-Interscience: New Jersey, 2004. ISBN: 0471226165. | |
Grant & Contract Support
| Title: | Sapacitabine therapy to create synthetic lethality in DNA repair-deficient CLL |
| Funding Source: | NIH/NCI |
| Role: | Statistician |
| Principal Investigator: | William Wierda |
| Duration: | 8/1/2012 - 7/30/2015 |
| Title: | Center for Clinical and Translational Research (PP-2) |
| Funding Source: | NIH/NCI |
| Role: | Collaborator - Statistical Leader |
| Principal Investigator: | David McPherson |
| Duration: | 7/1/2012 - 6/30/2017 |
| Title: | Towards Personalized Therapy of Resistant Triple Negative Breast Cancer |
| Funding Source: | American Cancer Society (ACS) |
| Role: | Investigator |
| Principal Investigator: | Ana Gonzalez-Angulo |
| Duration: | 7/1/2011 - 6/30/2015 |
| Title: | Innovative Multidisciplinary Education: The Statistical Genetics of Addiction |
| Funding Source: | NIH/NIDA |
| Role: | Mentor |
| Principal Investigator: | Shine Chang |
| Duration: | 8/1/2010 - 3/31/2015 |
| Title: | Optimizing Akt/mTOR targeted breast cancer therapy |
| Funding Source: | Susan G. Komen Breast Cancer Foundation |
| Role: | Biostatistician |
| Principal Investigator: | Funda Meric-Bernstam |
| Duration: | 7/1/2010 - 6/30/2012 |
| Title: | M D Anderson Cancer Center Prostate SPORE (PC-B) |
| Funding Source: | NIH/NCI |
| Role: | Core Director |
| Principal Investigator: | Christopher J. Logothetis |
| Duration: | 9/2/2009 - 8/31/2014 |
| Title: | Ethnic differences in the mutational status of the P13K pathway and breast cancer outcome |
| Funding Source: | Susan G. Komen Breast Cancer Foundation |
| Role: | Statistician |
| Principal Investigator: | Abenaa Brewster |
| Duration: | 8/3/2009 - 8/2/2012 |
| Title: | Chemotherapy Resistance in Hispanic and African American Patients |
| Funding Source: | Susan G. Komen Breast Cancer Foundation |
| Role: | Investigator |
| Principal Investigator: | Ana Gonzalez-Angulo |
| Duration: | 7/29/2009 - 7/28/2012 |
| Title: | SPORE in Brain Cancer (PP-3A) |
| Funding Source: | NIH/NCI |
| Role: | Co-Investigator |
| Principal Investigator: | W. K. Alfred Yung |
| Duration: | 9/1/2008 - 8/31/2013 |
| Title: | Cancer Center Support Grant - Biostatistics Shared Resource (PPSR-21) |
| Funding Source: | NIH/NCI |
| Role: | Statistician |
| Principal Investigator: | Ronald DePinho |
| Duration: | 7/1/2008 - 6/30/2013 |
| Title: | UT Health Science Center - Center for Clinical and Translational Research - (PP-4) |
| Funding Source: | NIH/NCRR (Subcontract from The University of Texas Health Science Center) |
| Role: | Biostatistician |
| Principal Investigator: | David McPherson |
| Duration: | 7/1/2007 - 6/30/2013 |
| Title: | UTMDACC SPORE in Breast Cancer (PC-B) |
| Funding Source: | NIH/NCI |
| Role: | Investigator |
| Principal Investigator: | Gabriel Hortobagyi |
| Duration: | 9/23/2005 - 8/31/2011 |
Last updated: 5/8/2013
- Global Navigation
- About Us
- Locations
- Calendar
- Careers
- Publications
- How You Can Help
- Contact Us
- Newsroom
- Site Index
- Patient and Cancer Information
- myMDAnderson
- Cancer Information
- Patient Information
- Care Centers & Clinics
- Children's Cancer Hospital
- Services & Amenities
- Education and Research
- Departments, Programs and Labs
- PeopleFinder
- Research at MD Anderson
- Education & Training
- Resources for Professionals
- For Employees
- Employee Alert Information
- Employee Resources
- Doing Business
- Vendors & Suppliers
- Partners & Affiliates
- Legal and Policy
- Legal Statements
- Site Policies
- Reporting Fraud, Waste & Abuse

