IPlant Collaborative explained
Cyverse (formerly iPlant Collaborative) |
Commercial: | No |
Type: | Scientific support |
Language: | English |
Launch Date: | 2008 |
The iPlant Collaborative, renamed Cyverse in 2017, is a virtual organization created by a cooperative agreement funded by the US National Science Foundation (NSF) to create cyberinfrastructure for the plant sciences (botany).[1] The NSF compared cyberinfrastructure to physical infrastructure, "... the distributed computer, information and communication technologies combined with the personnel and integrating components that provide a long-term platform to empower the modern scientific research endeavor".[2] In September 2013 it was announced that the National Science Foundation had renewed iPlant's funding for a second 5-year term with an expansion of scope to all non-human life science research.[3]
The project develops computing systems and software that combine computing resources, like those of TeraGrid, and bioinformatics and computational biology software. Its goal is easier collaboration among researchers with improved data access and processing efficiency. Primarily centered in the United States, it collaborates internationally.
History
Biology is relying more and more on computers.[4] Plant biology is changing with the rise of new technologies.[5] With the advent of bioinformatics, computational biology, DNA sequencing, geographic information systems and others computers can greatly assist researchers who study plant life looking for solutions to challenges in medicine, biofuels, biodiversity, agriculture and problems like drought tolerance, plant breeding, and sustainable farming. Many of these problems cross traditional disciplines and facilitating collaboration between plant scientists of diverse backgrounds and specialties is necessary.[6]
In 2006, the NSF solicited proposals to create "a new type of organization – a cyberinfrastructure collaborative for plant science" with a program titled "Plant Science Cyberinfrastructure Collaborative" (PSCIC) with Christopher Greer as program director.[7] A proposal was accepted (adopting the convention of using the word "Collaborative" as a noun) and iPlant was officially created on February 1, 2008.[7] Funding was estimated as $10 million per year over five years.[8]
Richard Jorgensen led the team through the proposal stage and was the principal investigator (PI) from 2008 to 2009.[8] Gregory Andrews, Vicki Chandler, Sudha Ram and Lincoln Stein served as Co-Principal Investigators (Co-PIs) from 2008 to 2009. In late 2009, Stephen Goff was named PI and Daniel Stanzione was added as a Co-PI.[1] [9] [10] As of May 2014, Co-PI Stanzione was replaced by 4 new Co-PIs: Doreen Ware at Cold Spring Harbor, Nirav Merchant and Eric Lyons at the University of Arizona, and Matthew Vaughn at the Texas Advanced Computing Center.[11]
The iPlant project supports what has been called e-Science, which is a use of information systems technology that is being adopted by the research community in efforts such as the National Center for Ecological Analysis and Synthesis (NCEAS), ELIXIR,[12] and the Bamboo Technology Project that started in September 2010.[13] [14] iPlant is "designed to create the foundation to support the computational needs of the research community and facilitate progress toward solutions of major problems in plant biology."[15] [16]
The project works as a collaboration. It seeks input from the wider plant science community on what to build.[17] Based on that input, it has enabled easier use of large data sets,[18] created a community-driven research environment to share existing data collections within a research area and between research areas[19] and shares data with provenance tracking.[20] [21] One model studied for collaboration was Wikipedia.[22] [23]
Several more recent National Science Foundation awards mentioned iPlant explicitly in their descriptions, as either a design pattern to follow or a collaborator with whom the recipient will work.[24]
Institutions
The primary institution for the iPlant project is the University of Arizona, located within the BIO5 Institute in Tucson.[25] Since its inception in 2008, personnel worked at other institutions including Cold Spring Harbor Laboratory, University of North Carolina, Wilmington, and the University of Texas at Austin in the Texas Advanced Computing Center.[26] Purdue University and Arizona State University were part of the original project group.[8]
Other collaborating institutions that received support from iPlant for their work on a Grand Challenge in phylogenetics starting in March 2009 included Yale University, University of Florida, and the University of Pennsylvania.[26] A trait evolution group was led at the University of Tennessee.[27] A visualization workshop employing iPlant was run by Virginia Tech in 2011.[28]
The NSF requires that funding subcontracts stay within the United States, but international collaboration started in 2009 with the Technical University Munich[26] and University of Toronto in 2010.[28] [29] East Main Evaluation & Consulting provides external oversight, advice, and assistance.[30]
Services
The iPlant project makes its cyberinfrastructure available several different ways and offers services to make it the accessible to its primary audience. The design was meant to grow in response to needs of the research community it serves.[15]
The Discovery Environment
The Discovery Environment integrates community-recommended software tools into a system that can handle terabytes of data using high-performance supercomputers to perform these tasks much more quickly. It has an interface designed to hide the complexity needed to do this from the end user. The goal was to make the cyberinfrastructure available to non-technical end users who are not as comfortable using a command-line interface.[15] [31]
iPlant Foundational APIs
A set of application programming interfaces (APIs) for developers allow access to iPlant services, including authentication, data management, high performance supercomputing resources from custom, locally produced software.[15] [32]
Atmosphere
Atmosphere is a cloud computing platform that provides easy access to pre-configured, frequently used analysis routines, relevant algorithms, and data sets, and accommodates computationally and data-intensive bioinformatics tasks.[15] It uses the Eucalyptus virtualization platform.[33] [34]
iPlant Semantic Web
The iPlant Semantic Web effort uses an iPlant-created architecture, protocol, and platform called the Simple Semantic Web Architecture and Protocol (SSWAP) for semantic web linking using a plant science focused ontology.[15] [35] [36] SSWAP is based on the notion of RESTful web services with an ontology based on Web Ontology Language (OWL).[37] [38]
Taxonomic Name Resolution Service
The Taxonomic Name Resolution Service (TNRS) is a free utility for correcting and standardizing plant names. This is needed because plant names that are misspelled, out of date (because a newer synonym is preferred), or incomplete make it hard to use computers to process large lists.[15] [39] [40]
My-Plant
My-Plant.org is a social networking community for plant biologists, educators and others to come together to share information and research, collaborate, and track the latest developments in plant science.[15] [41] The My-Plant network uses the terminology clades to group users in a manner similar to phylogenetics of plants themselves.[41] It was implemented using Drupal as its content management system.[41]
DNA Subway
The DNA Subway website uses a graphical user interface (GUI) to generate DNA sequence annotations, explore plant genomes for members of gene and transposon families, and conduct phylogenetic analyses. It makes high-level DNA analysis available to faculty and students by simplifying annotation and comparative genomics workflows.[15] [42] It was developed for iPlant by the Dolan DNA Learning Center.[43] [44]
External links
Notes and References
- Web site: PSCIC Full Proposal: The iPlant Collaborative: A Cyberinfrastructure-Centered Community for a New Plant Biology . Award Abstract #0735191 . National Science Foundation . August 22, 2011 . September 21, 2011 .
- Book: Revolutionizing Science and Engineering Through Cyberinfrastructure: Report of the National Science Foundation Blue-Ribbon Advisory Panel on Cyberinfrastructure . January 15, 2003 . National Science Foundation . September 18, 2011 .
- Web site: Renewal announcement on iPlant news website . http://news.iplantcollaborative.org/?p=212 . May 27, 2014.
- Stein . Lincoln . Towards a cyberinfrastructure for the biological sciences: progress, visions and challenges . Nature Reviews Genetics . 9 . 678–688 . September 2008 . 10.1038/nrg2414 . 9 . 18714290. 339653 .
- Dilworth . Machi F . Machi Dilworth. Perspective: Plant biology—A quiet pioneer . Plant Biotechnology . 26 . 2 . 183–187 . 2009 . 10.5511/plantbiotechnology.26.183. free .
- Web site: Biological Sciences and Cyberinfrastructure for the 21st Century . Peter Arzberger . Coalition for Academic Scientific Computation presentation . March 24, 2010 . September 29, 2011 .
- Web site: Plant Science Cyberinfrastructure Collaborative (PSCIC) . Program Solicitation 06-594 . National Science Foundation . November 30, 2006 . September 21, 2011 .
- News: National Science Foundation Awards $50 Million for Collaborative Plant Biology Project to Tackle Greater Science Questions . News release . January 30, 2008 . National Science Foundation . September 21, 2011 .
- Web site: Stephen Goff, Ph.D. . Staff biography page from iPlant web site . September 21, 2011 . https://web.archive.org/web/20111128135054/http://www.iplantcollaborative.org/connect/staff-collaborators/stephen-goff-phd . November 28, 2011 . dead .
- Web site: Dan Stanzione, Ph.D. . Staff biography page from Texas Advanced Computing Center web site . September 21, 2011 .
- Web site: Leadership . Leadership page from iPlant website . May 27, 2014.
- Web site: ELIXIR: Data for Life . Official web site . EMBL-European Bioinformatics Institute . Wellcome Trust Genome Campus, Hinxton, Cambridgeshire, United Kingdom . September 28, 2011 .
- Web site: About Bamboo . Project Bamboo website . September 27, 2011 . https://web.archive.org/web/20111009074754/http://www.projectbamboo.org/about/ . October 9, 2011 . dead .
- Brown . Susan . Thatcher . Sherry . Factors Influencing Adoption And Non-Adoption Of Cyberinfrastructure By The Research Community . Pacific Asia Conference on Information Systems Proceedings . 2011 . 978-1-86435-644-1 .
- Goff . Stephen A. . The iPlant Collaborative: Cyberinfrastructure for Plant Biology . Frontiers in Plant Science . 2 . 34 . 2011 . 1664-462X . 10.3389/fpls.2011.00034. etal . 22645531 . 3355756. free .
- Web site: The iPlant Collaborative: A Cyberinfrastructure-Centered Community of Plant and Computing Scientists . Steve Goff . New PhytologistSymposium presentation . September 18, 2008 . September 29, 2011 . https://web.archive.org/web/20120422175116/http://www.newphytologist.org/physiological/goff.pdf . April 22, 2012 . dead .
- Cyberinfrastructure: Feed me data . The iPlant programme was designed to give plant scientists a new information infrastructure. But first they had to decide what they wanted... . Heidi Ledford . Nature . 459 . 7250 . 1047–1049 . June 24, 2009 . 10.1038/4591047a . 19553968. free .
- Book: 10.1109/CLUSTERWKSP.2010.5613093 . 978-1-4244-8395-2 . Comprehensive data infrastructure for plant bioinformatics . 2010 IEEE International Conference on Cluster Computing Workshops and Posters (CLUSTER WORKSHOPS) . 1–5 . 2010 . Jordan . Chris . Stanzione . Dan . Ware . Doreen . Lu . Jerry . Noutsos . Christos . 2278398 . http://repository.cshl.edu/15442/1/Comprehensive_Data_Infrastructure_for_Plant_Bioinformatics.pdf .
- Reagan W. . Moore . Policy-Based Distributed Data Management Systems . Georgia Institute of Technology . May 19, 2009 . 4th International Conference on Open Repositories . September 28, 2011 . etal.
- Book: Ram . Sudha . Jun Liu . Provenance Management in BioSciences . 2010 . Advances in Conceptual Modeling – Applications and Challenges . 6413 . Advances in Conceptual Modeling‚ Applications and Challenges . 54–64 . Springer Berlin / Heidelberg . 10.1007/978-3-642-16385-2_8. 978-3-642-16384-5 .
- Book: 10.1109/HICSS.2010.263. Managing Knowledge in a Changing Scientific Landscape: The Impact of Cyberinfrastructure. 2010 43rd Hawaii International Conference on System Sciences. 1–10. 2010. Brown. Susan A.. Thatcher. Sherry. Dang. Yan. 978-1-4244-5509-6. 1132763.
- Web site: Who does what on Wikipedia? . Thomas Veneklasen . e! Science News . March 11, 2010 . September 29, 2011 .
- Who Does What: Collaboration Patterns in the Wikipedia and Their Impact on Data Quality . Jun Liu . Sudha Ram . December 2009 . 19th Workshop on Information Technologies and Systems . 175–180 . 1565682 .
- Web site: Award Search—Awardee Information . NSF website search . National Science Foundation . September 28, 2011 . See abstracts for awards #0849861, #0923975, #0940841, #0953184, #1027542, #1031416, #1032105, #1126481, #1126998.
- News: UA-Led Research Team Awarded $50 Million to Solve Plant Biology's Grand Challenges . News release . January 30, 2008 . University of Arizona . September 23, 2011 .
- News: iPlant Moving Forward on Grand Challenge Collaborations . April 9, 2009 . iPlant Leaflet . 09-2 . September 25, 2011 .
- Web site: Trait Evolution . iPlant web site . September 25, 2011 .
- News: Integration and Visualization Workshop . June 8, 2011 . iPlant Leaflet . 11-2 . September 25, 2011 .
- News: Computing for Life: Scientists depend on advanced computing to better understand evolution, drug discovery and genetics . May 31, 2010 . University of Texas at Austin . September 25, 2011 . https://web.archive.org/web/20111002012404/http://www.utexas.edu/features/2010/05/31/computing_life/ . October 2, 2011 . dead .
- Web site: Current Projects . EMEC website . February 21, 2013 .
- Web site: The iPlant Discovery Environment . Discovery Environment manual 0.4 . Matthew Helmke . August 19, 2011 . September 28, 2011 .
- Rion . Dooley . An API To Feed the World . https://archive.today/20120919035959/http://www.teragrid.org/c/document_library/get_file?uuid=6004ae82-248f-4c53-8427-5570815eda25&groupId=334534 . dead . September 19, 2012 . PDF . TeraGrid 2011: Extreme Digital Discovery . July 19, 2011 . Salt Lake City, Utah . September 28, 2011 .
- Kim . Seung-jin . Facilitating access to customized computational infrastructure for plant sciences: Atmosphere cloudfront . 2nd IEEE International Conference on Cloud Computing Technology and Science . November 1, 2010 . Indianapolis, Indiana . September 28, 2011 . etal.
- Web site: Enabling Plant Sciences Research with the iPlant Discovery Environment and Condor . Juan Antonio Raygoza Garay . Sonya Lowry . John Wregglesworth . May 3, 2011 . Condor Week presentation . University of Wisconsin Computer Science Department . September 28, 2011 .
- Gessler . D. D. . SSWAP: A Simple Semantic Web Architecture and Protocol for semantic web services . BMC Bioinformatics . 10 . 309 . September 23, 2009 . 19775460 . 10.1186/1471-2105-10-309 . 2761904 . 309. etal . free .
- Nelson . R. T. . Applications and methods utilizing the Simple Semantic Web Architecture and Protocol (SSWAP) for bioinformatics resource discovery and disparate data and service integration . BioData Mining . 3 . 1 . June 4, 2010 . 20525377 . 10.1186/1756-0381-3-3 . 2894815 . 3. etal . free .
- Book: Pautasso. Cesare. Wilde. Erik. Alarcon. Rosa. REST: Advanced Research Topics and Practical Applications. Springer Science & Business Media. 9781461492993. 76–77. 27 June 2016. en. 2013-12-04.
- Book: Barros. Alistair. Oberle. Daniel. Handbook of Service Description: USDL and Its Methods. Springer Science & Business Media. 9781461418641. 169. 27 June 2016. en. 2012-03-02.
- Martha L. . Narro . The TNRS: a taxonomic name resolution service for plants . August 2011 . Plant Biology . Minneapolis . etal . September 14, 2011 . https://web.archive.org/web/20110516074847/http://abstracts.aspb.org/pb2011/public/P21/P21011.html . May 16, 2011 . dead .
- Species spellchecker fixes plant glitches: Online tool should weed out misspellings and duplications . John Whitfield . Nature . 474 . 7351 . 263 . June 13, 2011 . 10.1038/474263a . 21677719 . free .
- Book: 10.1109/GCE.2010.5676118 . 978-1-4244-9751-5 . My-Plant.org: A phylogenetically structured social network . 2010 Gateway Computing Environments Workshop (GCE) . 1–8 . 2010 . Hanlon . Matthew R. . Mock . Stephen . Nuthulapati . Praveen . Gonzales . Michael B. . Soltis . Pamela . Soltis . Douglas . Majure . Lucas C. . Payton . Adam . Mishler . Brent . Tremblay . Susan . Madsen . Thomas . Olmstead . Richard . McCourt . Richard . Wojciechowski . Martin . Merchant . Nirav . 12621375 .
- Mary . Schaeffer . MaizeGDB: curation and outreach go hand-in-hand . Oxford University Press . Database: The Journal of Biological Databases and Curation . 2011 . 10.1093/database/bar022 . 3104940 . 21624896 . 2011 . bar022 . etal.
- Web site: DNA Subway: Fast Track to Gene Annotation and Genome Analysis . Dolan DNA Learning Center website . September 29, 2011 .
- Web site: DNA Subway Places Students On Fast Track To Plant Genome Analysis and DNA BarCoding . Uwe Hilgert . Botany 2011: Healing the Planet (workshop) . July 9, 2011 . September 29, 2011.