
Communications of the ACM

ACM Careers

Deep Learning Tackles Science's Big Data Problem



Scientists will use ORNL computing resources such as the Titan supercomputer to develop deep learning solutions for data analysis.

Credit: Jason Richards / ORNL

A team of researchers from Oak Ridge National Laboratory has been awarded nearly $2 million over three years from the U.S. Department of Energy to explore the potential of machine learning in revolutionizing scientific data analysis.

 

The Advances in Machine Learning to Improve Scientific Discovery at Exascale and Beyond (ASCEND) project aims to use deep learning to assist researchers in making sense of massive datasets produced at the world's most sophisticated scientific facilities. Deep learning is an area of machine learning that uses artificial neural networks to enable self-learning devices and platforms. The team, led by ORNL's Thomas Potok, includes Robert Patton, Chris Symons, Steven Young, and Catherine Schuman.

While deep learning has long been used to classify relatively simple data such as photographs, today's scientific data presents a much greater challenge because of its size and complexity. Deep learning offers the potential to truly change the way in which researchers use massive datasets to solve challenges spanning the scientific spectrum.

For example, neutron scattering data collected at ORNL's Spallation Neutron Source contain rich scientific information about the structure and dynamics of the materials under investigation, and deep learning could help researchers better understand the link between experimental data and materials properties. "This understanding can help scientists build and support new scientific theories, and help to design better materials," Potok says.

The team aims to revolutionize current analysis paradigms by using deep learning to identify patterns in scientific data that alert scientists to potential new discoveries. Their novel high-performance computing methods will leverage ORNL's Titan supercomputer, the most powerful supercomputer in the United States for open science.

Potok's team plans to construct a deep learning network capable of deciphering data from hundreds of thousands of inputs, such as sensors, and learning from the complex matrices of sensor readings developed over time. ORNL's rich history of machine learning research, broad range of analytic expertise, and world-class computing resources such as Titan create an ideal setting for such research.

The researchers outline their approach to deep learning in "A Study of Complex Deep Learning Networks on High Performance, Neuromorphic, and Quantum Computers," presented at the 2nd Workshop on Machine Learning in HPC Environments.

"We revealed new capabilities not feasible with conventional computing architectures," says Potok. "It potentially allows us to solve very complicated problems unsolvable with current computing technologies."

The project is supported by DOE's Office of Science. The Spallation Neutron Source and the Oak Ridge Leadership Computing Facility are DOE Office of Science User Facilities.  


 
