E-Science
E-Science, also known as eScience, is the practice of conducting computationally intensive scientific research in highly distributed network environments.
This form of science involves the use of substantial data sets that necessitate grid computing, a method of leveraging multiple computers to process large data sets efficiently. In some cases, the term encompasses technologies that facilitate distributed collaboration, such as the access grid. The term was coined by John Taylor, the Director General of the United Kingdom's Office of Science and Technology, in 1999 and was used to describe a large funding initiative starting in November 2000. Since then, the term "e-science" has been more broadly interpreted since then as "the application of computer technology to the undertaking of modern scientific investigation", including the preparation, experimentation, data collection, results dissemination, and long-term storage and accessibility of all materials generated through the scientific process. These may include data modeling and analysis, electronic/digitized laboratory notebooks, raw and fitted data sets, manuscript production and draft versions, pre-prints, and print and/or electronic publications." In 2014, condensed the definition to "eScience promotes innovation in collaborative, computationally- or data-intensive research across all disciplines, throughout the research lifecycle" in one of the working definitions used by the organizers. E-science encompasses "what is often referred to as big data has revolutionized science... the Large Hadron Collider at CERN... generates around 780 terabytes per year... highly data intensive modern fields of science...that generate large amounts of E-science data include: computational biology, bioinformatics, genomics" and the human digital footprint for the social sciences.
Turing Award winner Jim Gray imagined "data-intensive science" or "e-science" as a "fourth paradigm" of science and asserted that "everything about science is changing because of the impact of information technology" and the data deluge.
E-Science revolutionizes both fundamental legs of the scientific method: empirical research, especially through digital big data; and scientific theory, especially through computer simulation model building. These ideas were reflected by The White House's Office and Science Technology Policy in February 2013, which slated many of the aforementioned e-Science output products for preservation and access requirements under the memorandum's directive. E-sciences include particle physics, earth sciences and social simulations.
Characteristics and examples
Most of the research activities into e-Science have focused on the development of new computational tools and infrastructures to support scientific discovery. Due to the complexity of the software and the backend infrastructural requirements, e-Science projects usually involve large teams managed and developed by research laboratories, large universities or governments. Currently there is a large focus in e-Science in the United Kingdom, where the UK e-Science programme provides significant funding. In Europe the development of computing capabilities to support the CERN Large Hadron Collider has led to the development of e-Science and Grid infrastructures which are also used by other disciplines.Consortiums
Example e-Science infrastructures include the,
a federation with various partners including the
, the Open Science Grid and the
.
To support e-Science applications, Open Science Grid combines interfaces to more than 100 nationwide clusters, 50 interfaces to geographically distributed storage caches, and 8 campus grids. Areas of science benefiting from Open Science Grid include:
- astrophysics, gravitational physics, high-energy physics, neutrino physics, nuclear physics
- molecular dynamics, materials science, materials engineering, computer science, computer engineering, nanotechnology
- structural biology, computational biology, genomics, proteomics, medicine
UK programme
After his appointment as Director General of the Research Councils in 1999 John Taylor, with the support of the Science Minister David Sainsbury and the Chancellor of the Exchequer Gordon Brown, bid to HM Treasury to fund a programme of e-infrastructure development for science which would provide the foundation for UK science and industry to be a world leader in the knowledge economy which motivated the Lisbon Strategy for sustainable economic growth that the UK government committed to in March 2000.In November 2000 John Taylor announced £98 million for a national UK e-Science programme. An additional £20 million contribution was planned from UK industry in matching funds to projects that they participated in. From this budget of £120 million over three years, £75 million was to be spent on grid application pilots in all areas of science, administered by the Research Council responsible for each area, while £35 million was to be administered by the EPSRC as a Core Programme to develop "industrial strength" Grid middleware. Phase 2 of the programme for 2004-2006 was supported by a further £96 million for application projects, and £27 million for the EPSRC core programme. Phase 3 of the programme for 2007-2009 was supported by a further £14 million for the EPSRC core programme and a further sum for applications. Additional funding for UK e-Science activities was provided from European Union funding, from university funding council SRIF funding for hardware, and from Jisc for networking and other infrastructure.
The UK e-Science programme comprised a wide range of resources, centres and people including the National e-Science Centre which is managed by the Universities of Glasgow and Edinburgh, with facilities in both cities.
Tony Hey led the core programme from 2001 to 2005.
Within the UK regional e-Science centres support their local universities and projects, including:
There are also various centres of excellence and research centres.
In addition to centres, the grid application pilot projects were funded by the Research Council responsible for each area of UK science funding.
The EPSRC funded 11 pilot e-Science projects in three phases :
- First Phase were CombEchem, DAME, Discovery Net, GEODISE, myGrid and RealityGrid.
- Second phase were GOLD and Integrative biology
- Third phase were PMSEG, CARMEN and NanoCMOS
The remaining £23 million of phase one funding was divided between the application projects funded by BBSRC, MRC and NERC:
- BBSRC: Biomolecular Grid, Proteome Annotation Pipeline, High-Throughput Structural Biology, Global Biodiversity
- MRC: Biology of Ageing, Sequence and Structure Data, Molecular Genetics, Cancer Management, Clinical e-Science Framework, Neuroinformatics Modeling Tools
- NERC: Climateprediction.com, Oceanographic Grid, Molecular Environmental Grid, NERC DataGrid
United States
United States-based initiatives, where the term cyberinfrastructure is typically used to define e-Science projects, are primarily funded by the National Science Foundation office of cyberinfrastructure and Department of Energy. After the conclusion of TeraGrid in 2011, the was established and funded by the National Science Foundation to help researchers and educators, with or without supporting grants, to utilize the nation’s advanced computing systems and services.The Netherlands
Dutch eScience research is coordinated by the in Amsterdam, an initiative founded by and .Europe
Plan-Europe is a Platform of National e-Science/Data Research Centers in Europe, as established during the constituting meeting 29–30 October 2014 in Amsterdam, the Netherlands, and which is based on agreed Terms of Reference. PLAN-E has a kernel group of active members and convenes twice annually. More can be found on .Sweden
Two academic research projects have been carried out in Sweden by two different groups of universities, to help researches share and access scientific computing resources and knowledge:- Swedish e-Science Research Center : Kungliga Tekniska högskolan (KTH), Stockholm University (SU), Karolinska institutet (KI) and Linköping University (LiU)
- eSSENCE, The e-Science Collaboration : Uppsala University, Lund University and Umeå University