Haiyan Huang, PhD

 

 

 

 

 

Assistant Professor
Department of Statistics

Interdepartmental Group in Biostatistics

Graduate Group in Computational and Genomic Biology.

University of California, Berkeley

CA, 94720, USA

 

Tel: (510)642-6433

Fax: (510)642-7892

Email: hhuang

AT stat DOT berkeley DOT edu

 

 

Research

Teaching

My CV

News & Events

 

 

Job Openings:

Applicants are invited for a two-year postdoctoral position to work on computational biology / bioinformatics, particularly on large-scale integrative analysis of genomic data and metadata.  Interested candidates possessing a Ph.D. in related field (physical sciences or biology), with a strong computer programming and quantitative background or computational genomics experiences should forward their CV, selected reprints, a statement of research interests and goals (2-3 pages), and the contact information (phone, email, address) for three references to:  hhuang ((AT)) stat ((DOT)) berkeley ((DOT)) edu

 

 

Research interests

Development and application of mathematical/statistical methods for problems associated with various biological systems and data

1.     Integrative analysis of public repositories

2.     Analysis of SAGE, microarray and tiling array data

3.     Cellular interaction/regulatory network construction using  gene expression data

4.     Comparative Genomics; studying conserved non-coding elements in human genome

5.     Clustering/classification methods

6.     Stein’s Method and its application in biological sequence analysis

 

 

People

 

Graduate Students:

l       Daisy Yan Huang (Mar 2008 – present)

l       Kyungpil Kim (Aug 2008 – present)

l       Ling Meng (Master, Dec 2008 – present)

l       Qinghui Gao (Visiting student, Nov 2008 – present)

 

Post-doc

l       Qunhua Li (joint with Professor Peter Bickel)

 

Former Graduate Students:

l       Siew-leng Melinda Teng, PhD, 2007 Summer

Thesis Title: Statistical methods in integrative analysis of gene expression data with applications to biological pathways

Current Position: Statistician, Genentech, Inc.

l       Na Xu, PhD, 2008 summer (co-advised with Prof. Peter Bickel)

Thesis Title: Transcriptome Detection by Multiple RNA Tiling Array Analysis and Identifying Functional Conserved Non-coding Elements by Statistical Testing

Current position: statistician, Genentech, Inc.

l       Hua Chen, Master 2008

Thesis Title: Bayesian Method for Multi-Loci Association Study of Human Disease

Current Position: Research fellow, Harvard University

 

Major Collaborators:

l       Peter Bickel, Statistics Department, UC Berkeley

l       Lewis Feldman, Plant Molecular Biology, UC Berkeley

l       Yishi Jin, Molecular Biology, UC Santa Cruz

l       Lydia Sohn, Mechanical Engineering, UC Berkeley

l       Xianghong Zhou, Department of Computational and Molecular Biology, USC

 

 

Teaching

 

Undergraduate courses:

STAT 152: Survey Sampling (Fall 2003, Fall 2004, Fall 2005, Fall 2006)

BIOE/STAT C141: Statistics for Bioinformatics (Spring 2004, Spring 2005, Spring 2006, Spring 2007, Spring 2008)

STAT 131A: Statistical Inferences for Social and Life Scientists (Spring 2009: class materials can be found at bspace) 

 

Master courses:

    STAT 200B: Introduction to Probability and Statistics at an Advanced Level (Spring 2006, Spring 2007)

 

PhD courses:

    STAT 210A: Theoretical Statistics (Fall 2008)

    STAT 246: Statistical Genetics (Spring 2009: class materials can be found at the class website and bspace)

(co-teaching with Prof. Sandrine Dudoit in Biostatistics)

 

Seminars: Tuesday Statistics Colloquium

 

 

 

Publications

 

  1. Teng S, Zhou XJ, Huang H* (2008). A Statistical Framework to Infer Functional Gene Associations from Multiple Biologically Interrelated Microarray Experiments. Journal of the American Statistical Association (JASA), in press.

*corresponding author; work with PhD student

  1. Liu C, Hu J, Kalakrishnan M, Huang H*, Zhou XJ* (2008) Integrative Disease Classification Based on Cross-platform Microarray Data, Proceedings of APBC 2009, accepted. (invited for publication in the BMC Bioinformatics Supplement Issue).

*co-corresponding authors

  1. Wang F, Jiang T, Pan B, Sun Z, Teng S, Zhu Z, Gong G, Zang Y, Zhang H, Yue W, Hong N, Huang H, Blumberg H, Zhang, D (2008). Neuregulin 1 Genetic Variation and anterior cingulum integrity in schizophrenia and in health. Journal of Psychiatry & Neuroscience, accepted.
  2. Carbonaro A, Mohanty SK, Huang H, Godley LA and Sohn LL (2008) Cell Characterization Using A Protein-Functionalized Pore. Lab Chip, 8(9):1478-85.
  3. Huang Y, Li H, Hu H, Yan X, Waterman MS, Huang H, Zhou XJ (2007). Systematic Discovery of Functional Modules and Context-Specific Functional   Annotation of Human Genome. Bioinformatics (ISMB 2007), 23(13):i222-i229.
  4. ENCODE Consortium (2007).  Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature. 447, 799-816.
  5. Kim K, Zhang S, Jiang K, Cai L, Lee IB, Feldman LJ, Huang H* (2007). An Efficient Measure of Similarity between Gene Expression Profiles through Data Transformations. BMC Bioinformatics, 8:29 (highly accessed paper).

*corresponding author; work with graduate student

  1. Jiang K, Zhang S, Lee S, Tsai G, Kim K, Huang H, Zhu T, Feldman LJ (2006). Transcription Profile Analyses Identify Genes and Pathways Central to Root Cap Functions in Maize. Plant Molecular Biology, 60(3):343-63.
  2. Huang H, Kim K (2006). Unsupervised clustering analysis of gene expression, Chance, vol. 19, No.3.
  3. Zhou XJ, Kao MJ, Huang H, Wong A, Nunez-Iglesias J, Aparicio OM, Morgan TE, Wong WH (2005). Functional annotation and network reconstruction through cross-platform integration of microarray data. Nature Biotechnology, 23(2):238-43.
  4. Zhao X, Huang H, Speed T (2005). Finding short DNA motifs using permuted Markov models. Journal of Computational Biology, 12(6): 894-906 (journal version of the 2004 RECOMB paper; numbered as 14 below)
  5. Huang H, Kao MJ, Zhou X, Liu JS, Wong WH (2004). Determination of local statistical significance of patterns in Markov sequences with application to promoter element identification.” Journal of Computational Biology, 11(1):1-14.
  6. Cai L*, Huang H*, Blackshaw S, Liu JS, Cepko CL, Wong WH (2004). Clustering analysis of SAGE data using a Poisson approach. Genome Biology, 5(7):R51                                                                                                     

*Joint first authors

  1. Zhao X, Huang H, Speed T (2004). Finding short DNA motifs using permuted Markov models. Proceedings of RECOMB 2004.                                              
  2. Blackshaw S, Harpavat S, Trimarchi J, Cai L, Huang H, Kuo W, Fraioli R, Cho S, Yung R, Asch E, Wong WH, Cepko CL (2004). Genomic analysis of mouse retinal development. PLoS Biol, 2(9):E247.
  3. Allinen M, Beroukhim R, Cai L, Brennan C, Domenici CJ, Huang H, Porter D, Hu M,   Chin L, Richardson A, Schnitt S, Sellers W, Polyak K (2004). Molecular characterization of the tumor microenvironment in breast cancer. Cancer Cell, 6(1):17-32.
  4. Lippert RA, Huang H, Waterman MS (2002). Distributional regimes for the number of k-word matches between two random sequences. Proc Natl Acad Sci.USA, 99(22):13980-9.
  5. Huang H (2002). Error bounds on multivariate normal approximations for word count statistics. Advances in Applied Probability, 34(3): 559-586

 

 

Articles Submitted

 

  1. Huang H, Liu C, Zhou XJ (2009). A bayesian probabilistic approach toward transforming public microarray repositories into disease diagnosis databases, submitted. 
  2. Bickel PJ, Boley N, Brown JB, Huang H, Zhang NR (2008). Non Parametric Methods for Genomic Inference, submitted.

*authors ordered alphabetically

  1. Jiang K, Zhu T, Huang H, Feldman L (2008). The Maize Root Stem Cell Niche: A partnership between two transcriptionally distinct stem cell Populations, submitted.

 

 

Manuscripts in Preparation

 

  1. Na Xu, Bickel PJ, Huang H. Semi-parametric methods for multiple RNA tiling array analysis.
  2. Brown JB, Kechris KJ, Poulin F, Xu N, Bickel PJ, Huang H. Classification of  Transcriptional Function in Human Conserved Non-Coding Elements.

 

 

Book Chapters & Books Edited

 

  1. Huang H, Cai L, Wong WH (2007) Clustering Analysis of SAGE transcription profiles using a Poisson Approach, Chapter 14 of “Methods in Molecular Biology”, Humana Press Inc.
  2. “Research in Computational Molecular Biology” (11th Annual International Conference, RECOMB 2007), edited by Terry Speed and Haiyan Huang, Published by Springer.

 

 

News & Events

 

I am the Local Activity Committee Chair for the ICSA 2009 Applied Statistics Symposium to be held on June 21-24 in San Francisco, California, USA. Here is the link for the symposium:  http://icsa2.org/2009/

 

I am the IMS Program Chair for the 2009 WNAR/IMS annual meeting to be held on June 14-17 at Portland State University.  Here is the link for the meeting:  http://www.mth.pdx.edu/wnar