Astroinformatics Explained

Astroinformatics is an interdisciplinary field of study involving the combination of astronomy, data science, machine learning, informatics, and information/communications technologies.[1] The field is closely related to astrostatistics.

Background

Astroinformatics is primarily focused on developing the tools, methods, and applications of computational science, data science, machine learning, and statistics for research and education in data-oriented astronomy.[2] Early efforts in this direction included data discovery, metadata standards development, data modeling, astronomical data dictionary development, data access, information retrieval,[3] data integration, and data mining[4] in the astronomical Virtual Observatory initiatives.[5] [6] Further development of the field, along with endorsement by the astronomy community, was presented to the United States National Research Council in 2009 in the astroinformatics "state of the profession" position paper for the 2010 Astronomy and Astrophysics Decadal Survey.[7] That position paper provided the basis for a subsequent, more detailed exposition of the field in the Earth Science Informatics journal paper "Astroinformatics: Data-Oriented Astronomy Research and Education".[2]

Astroinformatics as a distinct field of research was inspired by work in geoinformatics, cheminformatics, and bioinformatics, and by the eScience work[8] of the computer scientist Jim Gray at Microsoft Research, whose legacy was remembered and continued through the Jim Gray eScience Awards.[9]

Although the primary focus of astroinformatics is on the large, worldwide distributed collection of digital astronomical databases, image archives, and research tools, the field also recognizes the importance of legacy data sets, using modern technologies to preserve and analyze historical astronomical observations. Some astroinformatics practitioners help to digitize historical and recent astronomical observations and images into large databases for efficient retrieval through web-based interfaces.[1] [10] Another aim is to help develop new methods and software for astronomers, and to facilitate the processing and analysis of the rapidly growing amount of astronomical data.[11]

Astroinformatics is described as the "fourth paradigm" of astronomical research.[12] There are many research areas involved with astroinformatics, such as data mining, machine learning, statistics, visualization, scientific data management, and semantic science.[13] Data mining and machine learning play significant roles in astroinformatics as a scientific research discipline due to their focus on "knowledge discovery from data" (KDD) and "learning from data".[14] [15]
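
As a rough illustration of what "learning from data" means in practice, the sketch below labels a new object by majority vote among its nearest labeled neighbors (a k-nearest-neighbors classifier). This is a generic textbook technique chosen for brevity, not a method taken from the cited works. The function name `knn_classify`, the two numeric features, and the class names are all hypothetical, and the training values are synthetic.

```python
from collections import Counter
import math


def knn_classify(train, query, k=3):
    """Label `query` by majority vote among its k nearest training points."""
    # Sort labeled examples by Euclidean distance to the query point.
    by_distance = sorted(train, key=lambda item: math.dist(item[0], query))
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]


# Synthetic training set of (feature vector, label) pairs. The features and
# class names are placeholders, not measurements from any survey.
train = [
    ((0.2, 0.1), "class_a"), ((0.3, 0.2), "class_a"), ((0.1, 0.3), "class_a"),
    ((1.1, 0.9), "class_b"), ((1.0, 1.2), "class_b"), ((1.3, 1.0), "class_b"),
]

print(knn_classify(train, (0.25, 0.15)))
print(knn_classify(train, (1.05, 1.05)))
```

In a real survey setting the features might be measured properties of each source, and the labeled set would come from objects whose class is already known.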

The amount of data collected from astronomical sky surveys has grown from gigabytes to terabytes over the past decade, and is predicted to grow in the next decade into hundreds of petabytes with the Large Synoptic Survey Telescope and into exabytes with the Square Kilometre Array.[16] This wealth of new data both enables and challenges effective astronomical research, so new approaches are required. Partly for this reason, data-driven science is becoming a recognized academic discipline. Consequently, astronomy and other scientific disciplines are developing information-intensive and data-intensive sub-disciplines to such an extent that these sub-disciplines are becoming, or have already become, standalone research disciplines and full-fledged academic programs. Although many educational institutions do not yet offer an astroinformatics program, such programs are likely to be developed in the near future.
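
To give a feel for these scales, the following sketch estimates how long a single pass over a data set would take at a fixed read rate. The 1 GB/s rate is an assumption made purely for illustration, as are the names `scan_time_days` and `ASSUMED_RATE`; neither comes from the surveys mentioned above.

```python
# Decimal (SI) storage units, each a factor of 1000 larger than the last.
UNITS = {"GB": 10**9, "TB": 10**12, "PB": 10**15, "EB": 10**18}

SECONDS_PER_DAY = 86_400


def scan_time_days(size, unit, rate_bytes_per_s):
    """Days needed to read `size` `unit` of data once at a fixed rate."""
    total_bytes = size * UNITS[unit]
    return total_bytes / rate_bytes_per_s / SECONDS_PER_DAY


# Assumption: one sequential reader sustaining 1 GB/s (illustrative only).
ASSUMED_RATE = 1 * UNITS["GB"]

for size, unit in [(1, "TB"), (100, "PB"), (1, "EB")]:
    days = scan_time_days(size, unit, ASSUMED_RATE)
    print(f"{size} {unit}: {days:,.2f} days")
```

Under this assumed rate, one pass over 100 PB takes about 10^8 seconds, a little over three years, which is one way to see why serial, download-and-inspect workflows stop being practical at these volumes.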

Informatics has recently been defined as "the use of digital data, information, and related services for research and knowledge generation". The more commonly used definition, however, is that "informatics is the discipline of organizing, accessing, integrating, and mining data from multiple sources for discovery and decision support." The discipline of astroinformatics therefore includes many naturally related specialties, including data modeling and data organization. It may also include transformation and normalization methods for data integration and information visualization, as well as knowledge extraction, indexing techniques, information retrieval, and data mining methods. Classification schemes (e.g., taxonomies, ontologies, folksonomies, and/or collaborative tagging[17]) and astrostatistics are also heavily involved. Citizen science projects (such as Galaxy Zoo) contribute highly valued novelty discovery, feature meta-tagging, and object characterization within large astronomy data sets. All of these specialties enable scientific discovery across varied massive data collections, collaborative research, and data re-use, in both research and learning environments.
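
One concrete form of "integrating data from multiple sources" is positional cross-matching, in which records from two catalogs are paired when they lie close together on the sky. The sketch below is a minimal brute-force version under stated assumptions: the catalogs, identifiers, and coordinates are made up, the function names are hypothetical, and the 1 arcsecond match radius is an arbitrary choice.

```python
import math


def angular_separation_deg(ra1, dec1, ra2, dec2):
    """Great-circle separation in degrees between two positions given in degrees."""
    ra1, dec1, ra2, dec2 = map(math.radians, (ra1, dec1, ra2, dec2))
    # Haversine formula, which stays accurate for very small separations.
    h = (math.sin((dec2 - dec1) / 2) ** 2
         + math.cos(dec1) * math.cos(dec2) * math.sin((ra2 - ra1) / 2) ** 2)
    return math.degrees(2 * math.asin(min(1.0, math.sqrt(h))))


def cross_match(catalog_a, catalog_b, radius_arcsec=1.0):
    """Pair each catalog_a source with its nearest catalog_b source within the radius."""
    radius_deg = radius_arcsec / 3600.0
    matches = []
    for a in catalog_a:
        best, best_sep = None, radius_deg
        for b in catalog_b:
            sep = angular_separation_deg(a["ra"], a["dec"], b["ra"], b["dec"])
            if sep <= best_sep:
                best, best_sep = b, sep
        if best is not None:
            # Record the pairing as one integrated row.
            matches.append({"id_a": a["id"], "id_b": best["id"],
                            "sep_arcsec": best_sep * 3600.0})
    return matches


# Toy catalogs with made-up identifiers and coordinates (degrees).
optical = [{"id": "opt-1", "ra": 150.00000, "dec": 2.20000},
           {"id": "opt-2", "ra": 150.10000, "dec": 2.30000}]
infrared = [{"id": "ir-7", "ra": 150.00010, "dec": 2.20005},
            {"id": "ir-9", "ra": 151.00000, "dec": 3.00000}]

for row in cross_match(optical, infrared, radius_arcsec=1.0):
    print(row)
```

The nested loop compares every pair of sources, so its cost grows with the product of the catalog sizes. This is where the indexing techniques mentioned above matter: production cross-matching typically relies on a spatial index rather than exhaustive comparison.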

In 2012, two position papers[18] [19] were presented to the Council of the American Astronomical Society, leading to the establishment of formal working groups in astroinformatics and astrostatistics for the profession of astronomy within the US and elsewhere.[20]

Astroinformatics provides a natural context for the integration of education and research.[21] The experience of research can now be brought into the classroom to establish and grow data literacy through the easy re-use of data.[22] Astroinformatics also has other uses, such as repurposing archival data for new projects, linking literature to data, and intelligent retrieval of information.[23]

Conferences

| Year | Place | Link |
|------|-------|------|
| 2021 | Caltech | https://sites.astro.caltech.edu/ai21/index.html |
| 2020 | Harvard | https://www.astroinformatics2020.org/ |
| 2019 | Caltech | http://astroinformatics2019.org/ |
| 2018 | Heidelberg, Germany | https://astroinformatics2018.h-its.org |
| 2017 | Cape Town, South Africa | https://web.archive.org/web/20170606020652/http://www.astroinformatics2017.ska.ac.za/ |
| 2016 | Sorrento, Italy | http://www.iau.org/science/meetings/future/symposia/1158/ |
| 2015 | Dubrovnik, Dalmatia | http://iszd.hr/AstroInfo2015/ |
| 2014 | University of Chile | http://eventos.cmm.uchile.cl/astro2014/ |
| 2013 | Australia Telescope National Facility, CSIRO | http://www.atnf.csiro.au/research/workshops/2013/astroinformatics/ |
| 2012 | Microsoft Research | http://www.astro.caltech.edu/ai12/ |
| 2011 | Sorrento, Italy | https://web.archive.org/web/20110814063529/http://dame.dsf.unina.it/astroinformatics2011.html |
| 2010 | Caltech | http://www.astro.caltech.edu/ai10/ |

Additional conferences and conference lists:

| Item | Link |
|------|------|
| Machine Learning in Astronomy: Possibilities and Pitfalls (2022) | https://sites.astro.caltech.edu/IAUS368/ |
| The Astrostatistics and Astroinformatics Portal (ASAIP) big list of conferences | https://asaip.psu.edu/meetings |
| Astronomical Data Analysis Software and Systems (ADASS) annual conferences | http://adass.org/ |

Notes and References

  1. "Astroinformatics and digitization of astronomical heritage". http://www.math.bas.bg/~nkirov/zip/SEEDI_astro_presentation.pdf
  2. Borne, Kirk D. (12 May 2010). "Astroinformatics: data-oriented astronomy research and education". Earth Science Informatics. 3 (1–2): 5–17. doi:10.1007/s12145-010-0055-2. S2CID 207393013.
  3. Borne, Kirk (2000). "Science User Scenarios for a Virtual Observatory Design Reference Mission: Science Requirements for Data Mining". arXiv:astro-ph/0008307.
  4. Borne, Kirk (2008). "Scientific Data Mining in Astronomy". In Kargupta, Hillol; et al. (eds.). Next generation of data mining. London: CRC Press. pp. 91–114. ISBN 9781420085860.
  5. Borne, Kirk D. (2003). "Distributed data mining in the National Virtual Observatory". In Dasarathy, Belur V. (ed.). Data Mining and Knowledge Discovery: Theory, Tools, and Technology V. Vol. 5098. pp. 211–218. doi:10.1117/12.487536. S2CID 28195520.
  6. Laurino, O.; D’Abrusco, R.; Longo, G.; Riccio, G. (21 December 2011). "Astroinformatics of galaxies and quasars: a new general method for photometric redshifts estimation". Monthly Notices of the Royal Astronomical Society. 418 (4): 2165–2195. arXiv:1107.3160. Bibcode:2011MNRAS.418.2165L. doi:10.1111/j.1365-2966.2011.19416.x. S2CID 7115554.
  7. Borne, Kirk (2009). "Astroinformatics: A 21st Century Approach to Astronomy". Astro2010: The Astronomy and Astrophysics Decadal Survey. 2010: P6. arXiv:0909.3892. Bibcode:2009astro2010P...6B.
  8. "Online Science". Talks by Jim Gray. Microsoft Research. Retrieved 11 January 2015.
  9. "Jim Gray eScience Award". Microsoft Research.
  10. "Astroinformatics in Canada". http://www.casca.ca/lrp2010/Docs/LRPReports/astroinformatics_lrp.pdf
  11. "'Astroinformatics' helps Astronomers explore the sky". Phys.org. Heidelberg University. Retrieved 11 January 2015.
  12. Hey, Tony (October 2009). "The Fourth Paradigm: Data-Intensive Scientific Discovery". Microsoft Research.
  13. Borne, Kirk (2013). "Virtual Observatories, Data Mining, and Astroinformatics". Planets, Stars and Stellar Systems. pp. 403–443. doi:10.1007/978-94-007-5618-2_9. ISBN 978-94-007-5617-5.
  14. Ball, N.M.; Brunner, R.J. (2010). "Data Mining and Machine Learning in Astronomy". International Journal of Modern Physics D. 19 (7): 1049–1106. arXiv:0906.2173. Bibcode:2010IJMPD..19.1049B. doi:10.1142/S0218271810017160. S2CID 119277652.
  15. Borne, K.; Becla, J.; Davidson, I.; Szalay, A.; Tyson, J. A. (2008). "The LSST Data Mining Research Agenda". In Bailer-Jones, Coryn A.L. (ed.). AIP Conference Proceedings. pp. 347–351. arXiv:0811.0167. doi:10.1063/1.3059074. S2CID 118399971.
  16. Ivezić, Ž.; Axelrod, T.; Becker, A. C.; Becla, J.; Borne, K.; Burke, D. L.; Claver, C. F.; Cook, K. H.; Connolly, A.; Gilmore, D. K.; Jones, R. L.; Jurić, M.; Kahn, S. M.; Lim, K.-T.; Lupton, R. H.; Monet, D. G.; Pinto, P. A.; Sesar, B.; Stubbs, C. W.; Tyson, J. A. (2008). "Parametrization and Classification of 20 Billion LSST Objects: Lessons from SDSS". In Bailer-Jones, Coryn A.L. (ed.). AIP Conference Proceedings. AIP Conf. Proc. Vol. 1082. pp. 359–365. arXiv:0810.5155. doi:10.1063/1.3059076. S2CID 117914490.
  17. Borne, Kirk. "Collaborative Annotation for Scientific Data Discovery and Reuse". Bulletin of the ASIS&T. American Society for Information Science and Technology. Archived from the original on 5 March 2016: https://web.archive.org/web/20160305073440/http://www.asis.org/Bulletin/Apr-13/AprMay13_RDAP_Borne.html. Retrieved 11 January 2016.
  18. Borne, Kirk. "Astroinformatics in a Nutshell". asaip.psu.edu. The Astrostatistics and Astroinformatics Portal, Penn State University. Retrieved 11 January 2016.
  19. Feigelson, Eric. "Astrostatistics in a Nutshell". asaip.psu.edu. The Astrostatistics and Astroinformatics Portal, Penn State University. Retrieved 11 January 2016.
  20. Feigelson, E.; Ivezić, Ž.; Hilbe, J.; Borne, K. (2013). "New Organizations to Support Astroinformatics and Astrostatistics". Astronomical Data Analysis Software and Systems XXII. 475: 15. arXiv:1301.3069. Bibcode:2013ASPC..475...15F.
  21. Borne, Kirk (2009). "The Revolution in Astronomy Education: Data Science for the Masses". Astro2010: The Astronomy and Astrophysics Decadal Survey. 2010: P7. arXiv:0909.3895. Bibcode:2009astro2010P...7B.
  22. "Using Data in the Classroom". Science Education Resource Center at Carleton College. National Science Digital Library. Retrieved 11 January 2016.
  23. Borne, Kirk. Astroinformatics: Data-Oriented Astronomy. George Mason University, USA. January 21, 2015.