Large Data Set Seismology: Strategies in Managing, Processing and Sharing Large Geophysical Data Sets
Date: 4/25/2019
Time: 4:00 PM to 5:15 PM
Room: Elliott Bay
As seismology grows increasingly data rich, studies are being designed that use ever larger volumes of available data. The strategies for collecting, processing and sharing these data are evolving accordingly. In cases when the traditional research pattern of downloading, managing and processing data locally becomes untenably slow, new approaches are required. These strategies may include employing a compute cluster, either operated by a research group, an institutional HPC resource or a cloud computing provider. Researchers may use new technologies and frameworks to orchestrate more advanced processing workflows aimed at large scale computation, e.g. Hadoop. Furthermore, they may employ stream processing, where data are processed as it is collected from a center, thus mitigating the local storage issues. Ultimately, working with large data sets challenges researchers to be more informed and deliberate about computation, data transmission, compression and storage. This shift in data processing scale has a number of implications for both data providers and research processing pipelines and a variety of approaches are being used to address these changes. We invite researchers and data providers to describe their experiences in collecting, managing and processing large data sets.
Conveners
Chad Trabant, IRIS Data Services (chad@iris.washington.edu)
Jonathan K. MacCarthy, Los Alamos National Laboratory (jkmacc@lanl.gov)
Oral Presentations
Participant Role | Details | Start Time | Minutes | Action |
---|---|---|---|---|
Submission | Efficient Storage and Processing of Segmented Waveform Data for the Generation of a Signal Quality Machine Learning Classifier | 04:00 PM | 15 | View |
Submission | Ambient Noise Processing With Julia | 04:15 PM | 15 | View |
Submission | Managing Large Data Sets for British Columbia Earthquake Early Warning | 04:30 PM | 15 | View |
Submission | Putting the Commercial Cloud to Work for Seismology | 04:45 PM | 15 | View |
Submission | The Promise of the Cloud and the IRIS DMC | 05:00 PM | 15 | View |
Total: | 75 Minute(s) |
Large Data Set Seismology: Strategies in Managing, Processing and Sharing Large Geophysical Data Sets
Description