Skip to Content

'
Kim-Anh Do

Present Title & Affiliation

Primary Appointment

Department Chair, Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX
Professor, Department of Biostatistics, The University of Texas MD Anderson Cancer Center, Houston, TX

Dual/Joint/Adjunct Appointment

Adjunct Professor, Statistics, Rice University, Houston, TX
Adjunct Professor, Statistics, Texas A&M University, College Station, TX

Bio Statement

Kim-Anh Do, Ph.D., is a Professor and Chair in the Department of Biostatistics at MD Anderson, and a recipient of the Faculty Scholar Award at MD Anderson in 2003 and the Texas 4000 Distinguished Professorship in 2013. She is a Fellow of the American Statistical Association, the American Association for the Advancement of Science (AAAS) and the Royal Statistical Society and is an Elected Member of the International Statistical Institute. She has served as a primary statistician or co-investigator on several National Institutes of Health (NIH) funded grants and clinical trials in prostate cancer, epidemiology, leukemia, upper aerodigestive cancer, breast cancer and brain cancer, including the Early Detection Research Network (EDRN) grant, the Prostate SPORE (as Director of the Biostatistics Core), the Breast SPORE, and the Brain SPORE at M. D. Anderson. She has significant publications in statistical methodology, computing, biomedical, and in other applied specialist journals. Her most recent interest is in the development of clustering and analytic methods for genomic and proteomic expressions. She has developed bioinformatics software and authored books: (i) Analyzing microarray gene expression data; (ii) Bayesian Inference for Gene Expression and Proteomics; and (iii) Advances in Statistical Bioinformatics--Models and Integrative Inference for High-Throughput Data. Her extensive contribution to statistical and cancer research at M.D. Anderson has resulted in more than 160 published articles in the past years. Additional information regarding Dr. Do's educational and professional activities can be found here.

Research Interests

  • Computational Statistics and Biostatistics
  • Bioinformatics
  • Statistical Genetics
  • Non-parametric Statistical Methods

Office Address

The University of Texas MD Anderson Cancer Center
Department of Biostatistics
1400 Pressler Street
Unit Number: 1411
Houston, TX 77030
Room Number: FCT4.6040
Phone: (713) 794-4155
Fax: (713) 563-4243
Email: kim@mdanderson.org

Education & Training

Degree-Granting Education

1990 Stanford University, Stanford, CA, PHD, Statistics
1985 Stanford University, Stanford, CA, MS, Statistics
1983 Queensland University, Brisbane, Australia, B.Sc., First Class Honors, Mathematics and Computer Science

Honors and Awards

2013 Texas 4000 Distinguished Professorship
2012 Elected Member, International Statistical Institute
2006 Fellow, American Statistical Association
2005 Fellow, Royal Statistical Society
2003 Faculty Scholar Award, University of Texas M.D. Anderson Cancer Center
1994 Australian Academy of Science Travel Award
1983 Amy R. Hughes Award, Australian Federation of University Women, Australia
1982 Caltex Woman Graduate of the Year, Postgraduate Scholarship, University of Queensland

Professional Memberships

American Statistical Association, Houston Chapter (HACASA), Houston, TX
President, 2003-2004
President-Elect, 2002-2003

Selected Publications

Peer-Reviewed Original Research Articles

1. Leon-Novelo LG, Mueller P, Arap W, Sun J, Pasqualini R, Do KA. Bayesian decision theoretic multiple comparison procedures: an application to phage display data. Biom J 55(3):478-89, 5/2013. e-Pub 12/2012. PMCID: PMC3840910.
2. Leon-Novelo LG, Mueller P, Arap W, Kolonin M, Sun J, Pasqualini R, Do KA. Semiparametric Bayesian Inference for Phage Display Data. Biometrics 69(1):174-83, 3/2013. e-Pub 1/2013. PMCID: PMC3622196.
3. Wang W, Baladandayuthapani V, Morris JS, Broom BM, Manyam G, Do KA. iBAG: integrative Bayesian analysis of high-dimensional multi-platform genomics data. Bioinformatics 29(2):149-59, 1/2013. e-Pub 11/2012. PMCID: PMC3546799.
4. Wang W, Baladandayuthapani V, Holmes CC, Do KA. Integrative network-based Bayesian analysis of diverse genomics data. BMC Bioinformatics 14 Suppl 13:S8, 2013. e-Pub 10/2013. PMCID: PMC3849715.
5. Bonato V, Baladandayuthapani V, Broom BM, Sulman EP, Aldape KD, Do KA. Bayesian ensemble methods for survival prediction in gene expression data. Bioinformatics 27(3):359-67, 2/2011. e-Pub 12/2010. PMCID: PMC3031034.
6. Zhang S, Mueller P, Do KA. A Bayesian Semiparametric Survival Model with Longitudinal Markers. Biometrics 66(2):435-43, 6/2010. e-Pub 6/2009. PMCID: PMC3045702.
7. Brewster AM, Do KA, Thompson PA, Hahn KM, Sahin AA, Cao Y, Stewart MM, Murray JL, Hortobagyi GN, Bondy ML. Relationship between epidemiologic risk factors and breast cancer recurrence. J Clin Oncol 25(28). e-Pub 9/2007. PMID: 17785707.
8. Ji Y, Yin G, Tsui K-W, Kolonin MG, Sun J, Arap W, Pasqualini R, Do K-A. Bayesian mixture models for complex high dimensional count data in phage display experiments. Journal of the Royal Statistical Society: Series C (Applied Statistics) 56(2):139-52, 3/2007.
9. Do K-A, McLachlan GJ, Bean R, Wen S. Gene shaving versus mixture models for the clustering of microarray gene expression data. Cancer Informatics 2:25-43, 2007. PMID: No PubMed.
10. Kim SJ, Uehara H, Yazici S, Busby JE, Nakamura T, He J, Maya M, Logothetis C, Mathew P, Wang X, Do KA, Fan D, Fidler IJ. Targeting platelet-derived growth factor receptor on endothelial cells of multidrug-resistant prostate cancer. J Natl Cancer Inst 98(11):783-93, 6/2006. PMID: 16757703.
11. Do K-A Mueller P, Tang F. A Bayesian mixture model for differential gene expression. Journal of the Royal Statistical Society, Series C-Applied Statistics 54(3):1-18, 2005. PMID: No PubMed.
12. Do KA, Johnson MM, Lee JJ, Wu XF, Dong Q, Hong WK, Khuri FR, Spitz MR. Longitudinal study of smoking patterns in relation to the development of smoking-related secondary primary tumors in patients with upper aerodigestive tract malignancies. Cancer 101(12):2837-42, 12/2004. PMID: 15536619.
13. Do K-A, Green A, Guthrie JR, Dudley EC, Burger HG, Dennerstein L. Longitudinal study of risk factors for coronary heart disease across the menopausal transition. Am J Epidemiol 151:584-93, 3/2000. PMID: 10733040.
14. Do K-A, Kirk K. Discriminant analysis of event-related potential curves using smoothed principal components. Biometrics 55:174-81, 3/1999. PMID: 11318152.
15. Wood ATA, Do K-A, Broom BM. Sequential linearization of empirical likelihood constraints with application to U-statistics. Journal of Computational and Graphical Statistics 5:365-85, 1996.
16. Booth JG, Do K-A. Simple and efficient methods for constructing bootstrap confidence intervals. Computational Statistics 8:333-46, 1994.
17. Do K-A, Hall P. Distribution estimation using concomitants of order statistics, with application to Monte Carlo stimulation for the bootstrap. Journal of the Royal Statistical Society, Series B 54:1-14, 1992.
18. Do K-A, Hall P. On importance sampling for the bootstrap. Biometrika 78:161-167, 1991.
19. Do K-A, McLachlan GJ. Estimation of mixing proportions: a case study. Applied Statistics 33:134-40, 1984.
20. Zhang L, Baladandayuthapani V, Mallick BK, Thompson PA Bondy ML, Do K-A. Bayesian hierarchical structured variable selection methods with application to MIP studies in breast cancer. JSSR-Series C. Submitted.

Book Chapters

1. Wang W, Baladandayuthapani V, Broom BM, Do K-A. Bayesian graphical models for integrating multi-platform genomics data. Methods for the analysis of copy number data in cancer research. In: Advances in Statistical Bioinformatics: Models and Integrative Inferences for High-Throughput Data. Cambridge University Press, 2013.
2. Broom BM, Do K-A, Bondy M, Thompson P, Coombes K. Methods for the analysis of copy number data in cancer research. In: Advances in Statistical Bioinformatics: Models and Integrative Inferences for High-Throughput Data. Cambridge University Press, 2013.

Books (edited and written)

1. Do K-A, Qin Z, Vannucci M. Advances in Statistical Bioinformatics: Models and Integrative Inferences for High-Throughput Data. Cambridge University Press, 2013.
2. Do K-A, Müller P, Vannucci M. Bayesian Inference for Gene Expression and Proteomics. Cambridge University Press, 2006. ISBN: 052186092X.
3. McLachlan GJ, Do K-A, Ambroise C. Analyzing Microarray Gene Expression Data. In: Wiley Series in Probability and Statistics. Wiley-Interscience: New Jersey, 2004. ISBN: 0471226165.

Grant & Contract Support

Title: Optimizing Akt/mTOR targeted breast cancer therapy
Funding Source: Susan G. Komen Breast Cancer Foundation
Role: Biostatistician
Principal Investigator: Funda Meric-Bernstam
Duration: 11/3/2013 - 11/2/2014
 
Title: Sapacitabine therapy to create synthetic lethality in DNA repair-deficient CLL
Funding Source: NIH/NCI
Role: Statistician
Principal Investigator: William Wierda
Duration: 8/1/2012 - 7/30/2015
 
Title: Center for Clinical and Translational Sciences (PP-2)
Funding Source: NIH/NCI (Subcontract from the UT Health Science Center - Houston)
Role: Collaborator - Statistical Leader
Principal Investigator: David McPherson
Duration: 7/1/2012 - 6/30/2017
 
Title: Towards Personalized Therapy of Resistant Triple Negative Breast Cancer
Funding Source: American Cancer Society (ACS)
Role: Investigator
Principal Investigator: Naoto Ueno
Duration: 7/1/2011 - 6/30/2015
 
Title: M D Anderson Cancer Center Prostate SPORE (PC-B)
Funding Source: NIH/NCI
Role: Core Director
Principal Investigator: Christopher J. Logothetis
Duration: 9/2/2009 - 8/31/2014
 
Title: Ethnic differences in the mutational status of the P13K pathway and breast cancer outcome
Funding Source: Susan G. Komen Breast Cancer Foundation
Role: Statistician
Principal Investigator: Abenaa Brewster
Duration: 8/3/2009 - 8/2/2012
 
Title: Chemotherapy Resistance in Hispanic and African American Patients
Funding Source: Susan G. Komen Breast Cancer Foundation
Role: Investigator
Principal Investigator: Ana Gonzalez-Angulo
Duration: 7/29/2009 - 7/28/2012
 
Title: SPORE in Brain Cancer (PP-3A)
Funding Source: NIH/NCI
Role: Co-Investigator
Principal Investigator: W. K. Alfred Yung
Duration: 9/1/2008 - 8/31/2013
 
Title: UT Health Science Center - Center for Clinical and Translational Research - (PP-4)
Funding Source: NIH/NCRR (Subcontract from The University of Texas Health Science Center)
Role: Biostatistician
Principal Investigator: David McPherson
Duration: 7/1/2007 - 6/30/2013
 
Title: UTMDACC SPORE in Breast Cancer (PC-B)
Funding Source: NIH/NCI
Role: Investigator
Principal Investigator: Gabriel Hortobagyi
Duration: 9/23/2005 - 8/31/2011
 
Title: Cancer Center Support Grant - Biostatistics Shared Resource (Biostatistics Resource Group)
Funding Source: NIH/NCI
Role: Statistician
Principal Investigator: Ronald DePinho
Duration: 9/4/1998 - 6/30/2018

Last updated: 7/16/2014