About Me & My Work

I am a Data Sourcing Specialist experienced in locating, organizing, and automating access to large scientific datasets hosted on servers around the world.

Using Linux and terminal-based tools, I create structured folder hierarchies, store scripts in .sh and .txt formats, and implement automated workflows to ensure data integrity and reproducibility.
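
To illustrate the kind of integrity check this involves, here is a minimal Python sketch. It is an illustration under stated assumptions, not the scripts used in the projects below: the function names (`sha256_of`, `write_manifest`, `verify_manifest`) and the manifest file are hypothetical. It records a SHA-256 checksum for every file in a folder hierarchy and later reports any file that is missing or has changed.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the SHA-256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def write_manifest(data_dir: Path, manifest: Path) -> int:
    """Write '<digest>  <relative path>' for every file under data_dir.

    Returns the number of files recorded. The manifest itself is skipped
    if it happens to live inside data_dir.
    """
    files = sorted(
        p for p in data_dir.rglob("*")
        if p.is_file() and p.resolve() != manifest.resolve()
    )
    lines = [
        f"{sha256_of(p)}  {p.relative_to(data_dir).as_posix()}" for p in files
    ]
    manifest.write_text("\n".join(lines) + ("\n" if lines else ""))
    return len(lines)


def verify_manifest(data_dir: Path, manifest: Path) -> list[str]:
    """Return the relative paths that are missing or whose digest changed."""
    failures = []
    for line in manifest.read_text().splitlines():
        expected, rel_path = line.split("  ", 1)
        target = data_dir / rel_path
        if not target.is_file() or sha256_of(target) != expected:
            failures.append(rel_path)
    return failures
```

The manifest uses the same "digest, two spaces, path" layout that `sha256sum` emits, so it can sit alongside the .sh and .txt files in the same folder hierarchy.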

I am also proficient in using CDO (Climate Data Operators) for advanced dataset management, including merging, slicing, subsetting, and masking large climate and environmental datasets.
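
As a rough illustration of merging and subsetting with CDO, the Python sketch below only builds the command lines; it does not run them. The helper names and file names are hypothetical, while `mergetime`, `selyear`, and `sellonlatbox` are standard CDO operators. Running the commands assumes the `cdo` binary is installed.

```python
import shlex


def cdo_mergetime_command(infiles, outfile):
    """Build a CDO call that concatenates files along the time axis."""
    return ["cdo", "mergetime", *infiles, outfile]


def cdo_subset_command(infile, outfile, lon_min, lon_max, lat_min, lat_max,
                       first_year, last_year):
    """Build a chained CDO call: select a year range, then crop to a box.

    In a CDO chain the rightmost operator is applied first, so selyear
    runs before sellonlatbox here.
    """
    return [
        "cdo",
        f"-sellonlatbox,{lon_min},{lon_max},{lat_min},{lat_max}",
        f"-selyear,{first_year}/{last_year}",
        infile,
        outfile,
    ]


if __name__ == "__main__":
    # Hypothetical file names, used only for illustration.
    merge = cdo_mergetime_command(["tas_1981.nc", "tas_1982.nc"], "tas_merged.nc")
    subset = cdo_subset_command("tas_merged.nc", "tas_box.nc",
                                -20, 55, -40, 40, 1981, 2010)
    print(shlex.join(merge))
    print(shlex.join(subset))
```

Each returned list can be passed to `subprocess.run(command, check=True)` on a machine where CDO is available.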

Workflow Terminal

Interactive demonstration of my data sourcing workflow using command-line tools


Skills & Tools

Technologies and platforms I use for efficient data sourcing and management

Recent Projects

Data sourcing and management projects I've recently completed

CMIP6 Data Pipeline

Automation

Automated data sourcing and preprocessing pipeline for CMIP6 climate model outputs across multiple ESGF nodes.

Bash Scripting, CDO, ESGF, Globus

Regional Climate Analysis

CORDEX

Sourced and processed CORDEX-Africa data for regional climate impact studies and implemented quality control procedures.

Python, xarray, CORDEX, DataLad

Get In Touch

Let's discuss your data sourcing needs or potential collaborations

Contact Information

Connect with me