Systematic evaluation of variability in ChIP-chip experiments using predefined DNA targets

  1. David S. Johnson1,24,
  2. Wei Li2,24,25,
  3. D. Benjamin Gordon3,
  4. Arindam Bhattacharjee3,
  5. Bo Curry3,
  6. Jayati Ghosh3,
  7. Leonardo Brizuela3,
  8. Jason S. Carroll4,
  9. Myles Brown5,
  10. Paul Flicek6,
  11. Christoph M. Koch7,
  12. Ian Dunham7,
  13. Mark Bieda8,
  14. Xiaoqin Xu8,
  15. Peggy J. Farnham8,
  16. Philipp Kapranov9,
  17. David A. Nix10,
  18. Thomas R. Gingeras9,
  19. Xinmin Zhang11,
  20. Heather Holster11,
  21. Nan Jiang11,
  22. Roland D. Green11,
  23. Jun S. Song2,
  24. Scott A. McCuine12,
  25. Elizabeth Anton1,
  26. Loan Nguyen1,
  27. Nathan D. Trinklein,
  28. Zhen Ye14,
  29. Keith Ching14,
  30. David Hawkins14,
  31. Bing Ren14,
  32. Peter C. Scacheri15,
  33. Joel Rozowsky16,
  34. Alexander Karpikov16,
  35. Ghia Euskirchen17,
  36. Sherman Weissman18,
  37. Mark Gerstein16,
  38. Michael Snyder16,17,
  39. Annie Yang19,
  40. Zarmik Moqtaderi20,
  41. Heather Hirsch20,
  42. Hennady P. Shulha21,
  43. Yutao Fu22,
  44. Zhiping Weng21,22,
  45. Kevin Struhl20,26,
  46. Richard M. Myers1,26,
  47. Jason D. Lieb23,26, and
  48. X. Shirley Liu2,26
  1. 1 Department of Genetics, Stanford University Medical Center, Stanford, California 94305, USA;
  2. 2 Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute, Harvard School of Public Health, Boston, Massachusetts 02115, USA;
  3. 3 Agilent Technologies, Inc., Santa Clara, California 95051, USA;
  4. 4 Cancer Research UK, Cambridge Research Institute, Cambridge, CB2 0RE, United Kingdom;
  5. 5 Department of Medical Oncology, Dana-Farber Cancer Institute, Harvard Medical School, Boston, Massachusetts 02115, USA;
  6. 6 EMBL-European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom;
  7. 7 European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SD, United Kingdom;
  8. 8 Department of Pharmacology and the Genome Center, University of California–Davis, Davis, California 95616, USA;
  9. 9 Affymetrix, Inc., Santa Clara, California 95051, USA;
  10. 10 HCI Bio Informatics, Huntsman Cancer Institute, Salt Lake City, Utah 84112, USA;
  11. 11 Roche NimbleGen, Inc., Madison, Wisconsin 53719, USA;
  12. 12 Whitehead Institute, Cambridge, Massachusetts 02142, USA;
  13. 13 SwitchGear Genomics, Menlo Park, California 94025, USA;
  14. 14 Ludwig Institute for Cancer Research, Department of Cellular and Molecular Medicine, University of California, San Diego School of Medicine, La Jolla, California 92093-0653, USA;
  15. 15 Department of Genetics, Case Western Reserve University, Cleveland, Ohio 44106, USA;
  16. 16 Department of Molecular Biophysics and Biochemistry, Yale University, New Haven, Connecticut 06520, USA;
  17. 17 Department of Molecular, Cellular and Developmental Biology, Yale University, New Haven, Connecticut 06520, USA;
  18. 18 Department of Genetics, Yale University, New Haven, Connecticut 06520, USA;
  19. 19 Genentech Inc., South San Francisco, California 94080-4990, USA;
  20. 20 Department of Biological Chemistry & Molecular Pharmacology, Harvard Medical School, Boston, Massachusetts 02115-5730, USA;
  21. 21 Biomedical Engineering Department, Boston University, Boston, Massachusetts 02215, USA;
  22. 22 Bioinformatics Program, Boston University, Boston, Massachusetts 02215, USA;
  23. 23 Department of Biology and Carolina Center for Genome Sciences, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina 27599-3280, USA
  1. 24 These authors contributed equally to this work.

Abstract

The most widely used method for detecting genome-wide protein–DNA interactions is chromatin immunoprecipitation on tiling microarrays, commonly known as ChIP-chip. Here, we conducted the first objective analysis of tiling array platforms, amplification procedures, and signal detection algorithms in a simulated ChIP-chip experiment. Mixtures of human genomic DNA and “spike-ins” comprised of nearly 100 human sequences at various concentrations were hybridized to four tiling array platforms by eight independent groups. Blind to the number of spike-ins, their locations, and the range of concentrations, each group made predictions of the spike-in locations. We found that microarray platform choice is not the primary determinant of overall performance. In fact, variation in performance between labs, protocols, and algorithms within the same array platform was greater than the variation in performance between array platforms. However, each array platform had unique performance characteristics that varied with tiling resolution and the number of replicates, which have implications for cost versus detection power. Long oligonucleotide arrays were slightly more sensitive at detecting very low enrichment. On all platforms, simple sequence repeats and genome redundancy tended to result in false positives. LM-PCR and WGA, the most popular sample amplification techniques, reproduced relative enrichment levels with high fidelity. Performance among signal detection algorithms was heavily dependent on array platform. The spike-in DNA samples and the data presented here provide a stable benchmark against which future ChIP platforms, protocol improvements, and analysis methods can be evaluated.

Footnotes

  • 25 Present address: Division of Biostatistics, Dan L. Duncan Cancer Center, Department of Molecular and Cellular Biology, Baylor College of Medicine, Houston, TX 77030, USA.

  • 26 Corresponding authors.

    26 E-mail xsliu{at}jimmy.harvard.edu; fax (617) 632-2444.

    26 E-mail kevin_struhl{at}hms.harvard.edu; fax (617) 432-2529.

    26 E-mail Myers{at}shgc.stanford.edu; fax (650) 725-9687.

    26 E-mail jlieb{at}bio.unc.edu; fax (919) 962-1625.

  • [Supplemental material is available online at www.genome.org. The microarray data from this study have been submitted to Gene Expression Omnibus under accession no. GSE10114.]

  • Article published online before print. Article and publication date are at http://www.genome.org/cgi/doi/10.1101/gr.7080508

    • Received August 27, 2007.
    • Accepted December 12, 2007.
| Table of Contents