for Employees, Students

Using the GWDG Data Pools for Scientific Data Sharing

Data ManagementEmployeesStudents Online

Event content

The scientific community relies heavily on the sharing of standardized datasets, such as ImageNet or Sentinel-2 imagery. To host these popular datasets in a central store, the GWDG offers the Data Pools service. Compared to conventional cloud-based approaches, we achieve significantly higher performance with Data Pools when running on our HPC systems. Additionally, the GWDG provides a number of standard datasets and derived data products, such as machine learning models. This service is not only for users to consume data but also allows them to share and host their own versioned datasets within our HPC systems. Other users of our systems can then use your dataset or data products to conduct their own research. Data Pools are specifically designed for the scientific community, providing versioned datasets that are citable.

Within this course, we will teach you how to discover existing Data Pools and how to publish your own dataset as a Data Pool to share it with others.

Learning goal

  • Understand the concept of data pools
  • Learn how to discover existing data pools
  • Learn to publish your own dataset as data pools


Information about the event

Max. participants

50

Requirements

Basic Linux and HPC experience

Speakers
Trainer picture
Dr. Hendrik Nolte
Trainer picture
Hauke Kirchner
Trainer picture
Dr. Freja Nordsiek

Details

Number
1572
Format
Block Course
Language
English

Location

Online (BigBlueButton)


Contact

GWDG Academy
support@gwdg.de

Registration

Log in with your account to register for an event

Dates

This event includes following dates:

Date Location
1. 18.03.2025 15:00 - 16:30 Online (BigBlueButton)