CDL - More Data, More Science and......Moore's Law? | NSF

About the series

Abstract

In the same way that the Internet has combined with web content and search engines to revolutionize every aspect of our lives, the scientific process is poised to undergo a radical transformation based on the ability to access, analyze, and merge large, complex data sets. Scientists will be able to combine their own data with that of other scientists to validate models, interpret experiments, re-use and re-analyze data, and make use of sophisticated mathematical analyses and simulations to drive the discovery of relationships across data sets. This “scientific web” will yield higher quality science, more insights per experiment, an increased democratization of science, and a higher impact from major investments in scientific instruments.

At the same time, the traditional growth in computing performance is slowing, starting with flattening of processor clock speeds, but eventually also in transistor density. These trends will limit our ability to field some of the largest systems, e.g., exascale computers, but the cost in hardware, infrastructure and energy will limit the growth in computing capacity per dollar at all scales. Fundamental research questions exist in computer science to extend the limits of current computing technology through new architectures, programming models and algorithms, but also to explore options for post-Moore computing. While the largest computing capabilities have traditionally been focused on modeling and simulation, some of the data analysis problems arising from scientific experiments will also require huge computational infrastructure. Thus, a sophisticated understanding of the workload across analytics and simulations is needed to understand how future computer systems should be designed and how technology and infrastructure from other markets can be leveraged.

In this talk I will describe some examples of how science disciplines such as biology, material science and cosmology are changing in the face of their own data explosion, and how mathematical analyses, programming models, and workflow tools can enable different types of scientific exploration. This will lead to a set of open questions for computer scientists due to the scale of the data sets, the data rates, inherent noise and complexity, and the need to “fuse” disparate data sets. Rather than being at odds with scientific simulation, many important scientific questions will only be answered by combining simulation and observational data, sometimes in a real-time setting. Along with scientific simulations, experimental analytics problems will drive the need for increased computing performance, although the types of computing systems and software configurations may be quite different.

Biography

Kathy Yelick’s research is in programming languages, compilers, and algorithms for parallel machines, including the UPC and Titanium languages and automatic performance turning libraries. She was Director of the National Energy Research Scientific Computing Center (NERSC) from 2008 to 2012 and currently leads the Computing Sciences directorate at LBNL, which includes NERSC, Energy Sciences Network (ESnet) and a research division of scientists and engineers in applied math, computer science and computational science. She earned her Ph.D in Electrical Engineering and Computer Science from MIT and has been a professor at UC Berkeley since 1991 with a joint research appointment at LBNL since 1996.

She is an ACM Fellow and recent recipient of the ACM-W Athena award. She is a member of the National Academies Computer Science and Telecommunications Board (CSTB), and previously served on the California council on Science and Technology and the LLNS/LANS Science and Technology Committee overseeing research at Los Alamos and Lawrence Livermore National Laboratories.

To Join the Webinar:

Please register at:

https://nsf.webex.com/nsf/j.php?RGID=r76ea57a6829a160d6f397263d507e528

by 11:59pm EST on Tuesday, May 19, 2015.

After your registration is accepted, you will receive an email with a URL to join the meeting. Please be sure to join a few minutes before the start of the webinar. This system does not establish a voice connection on your computer; instead, your acceptance message will have a toll-free phone number that you will be prompted to call after joining. Please note that this registration is a manual process; therefore, do not expect an immediate acceptance. In the event the number of requests exceeds the capacity, some requests may have to be denied.

Organization

Directorate for Computer and Information Science and Engineering (CISE)

CDL - More Data, More Science and......Moore's Law?

About the series

Past events in this series

CDL - More Data, More Science and......Moore's Law?

Organization