DiaGrid (distributed computing network) explained

DiaGrid is a large, multicampus distributed research computing network utilizing the HTCondor system and centered at Purdue University in West Lafayette, Indiana. In 2012, it included nearly 43,000 processors representing 301 teraflops of computing power. DiaGrid received a Campus Technology Innovators Award from Campus Technology magazine[1] and an IDG InfoWorld 100 Award[2] in 2009 and was employed at the SC09 supercomputing conference in Portland, Ore., to capture nearly 150 days of compute time for science jobs.[3]

Partners

DiaGrid is a partnership with Purdue, Indiana University, Indiana State University, the University of Notre Dame, the University of Louisville, the University of Nebraska, the University of Wisconsin, Purdue's Calumet and North Central campuses, and Indiana University-Purdue University Fort Wayne. It is designed to accommodate computers at other campuses as new members join. The Purdue portion of the pool, named BoilerGrid, is the largest academic system of its kind.

Management

DiaGrid is managed by Information Technology at Purdue (ITaP), the central information technology organization at Purdue's West Lafayette campus, and ITaP's research computing unit the Rosen Center for Advanced Computing, which also operates the Steele, Coates, Rossmann, Hansen and Carter cluster supercomputers.

HTCondor

Through HTCondor, developed at the University of Wisconsin, DiaGrid harvests and manages computing cycles from idle or underused high-performance computing cluster nodes, servers, machines in campus computer and other labs, and office computers. Whenever a local user or scheduled job needs a given machine, the HTCondor job is stopped and automatically sent to another HTCondor node as soon as possible. While this "opportunistic" model limits the ability to do parallel processing and communications, a HTCondor pool can provide smaller, serial jobs vast numbers of cycles in a very short amount of time. HTCondor—and by extension, DiaGrid—is designed for high-throughput computing and is excellent for parameter sweeps, Monte Carlo simulation, or nearly any serial application. Some classes of parallel jobs (master-worker) may be run effectively via HTCondor as well.

Networking

To pool computational resources spread around Indiana and the Midwest, DiaGrid takes advantage of I-Light, the high-speed fiber-optic state network connecting Indiana campuses to each other, the Internet and national research networks such as the Internet2 and National LambdaRail. DiaGrid provides computational resources to researchers on both the Open Science Grid and the U.S. National Science Foundation's Extreme Science and Engineering Discovery Environment system (formerly TeraGrid).

Uses

DiaGrid and BoilerGrid have been used by researchers at Purdue and elsewhere for a variety of purposes,[1] such as imaging the structure of viruses at near-atomic resolutions,[4] [5] simulating the early stages of the Solar System's formation, projecting the reliability of Indiana's electrical supply, modeling the spread of water pollutants, discerning the structure of protein molecules and identifying millions of potential new forms of zeolites, silicate minerals widely used to catalyze chemical reactions on an industrial scale.[6] DiaGrid also is being used to develop data processing techniques for the Large Synoptic Survey Telescope. Purdue added a Web-based portal for BLAST processing with DiaGrid in 2011.

External links

Notes and References

  1. Campus Technology . Grush . Mary . Villano . Matt . July 28, 2009 . Campus Technology Innovators Awards 2009: High-Performance Computing - Purdue University .
  2. The top 100 IT projects of 2009 . InfoWorld, November 23, 2009.
  3. Cycle Computing and Purdue University to Power Dynamic Optimized Condor Pool at SuperComputing 2009 . Nov 13, 2009 .
  4. 10.1038/nature06665 . Wen . Jiang . Nature . 451 . 7182 . 1130–1134 . Backbone structure of the infectious e15 virus capsid revealed by electron cryomicroscopy . Feb 28, 2008 . 18305544. etal. 2008Natur.451.1130J . 205212346 .
  5. Web site: Weimin . Wu . Wen . Jiang . Condor in Cryo-EM image processing . Apr 30, 2008 . January 21, 2010 . July 25, 2011 . https://web.archive.org/web/20110725232749/http://www.dia-grid.org/publications/weimin_cryo_em.ppt . dead .
  6. Ramdas . Pophale . Phillip A. . Cheeseman . Michael W. . Deem . A database of new zeolite-like materials . Physical Chemistry Chemical Physics . 13 . 27 . 2011 . 12407–12412 . 10.1039/C0CP02255A. 21423937 . 2011PCCP...1312407P .