March 2025

Building Georeferenced Datasets from HDF5 Files with h5py, xarray, and Rasterio

Introduction HDF5 (.h5) files are widely used in scientific computing, particularly for storing large-scale datasets from remote sensing missions, climate models, and other geospatial applications. These files are self-describing, meaning they store both the data and metadata (e.g., spatial extent, resolution, units, etc.) in a hierarchical structure. However, working with HDF5 files can be challenging […]

Building Georeferenced Datasets from HDF5 Files with h5py, xarray, and Rasterio Read More »

Clustering London Accidents Data Using Fuzzy C-Means

Introduction Understanding patterns in accident data is crucial for urban planning, traffic management, and public safety. Clustering is a powerful technique that helps in identifying accident-prone areas by grouping locations with similar characteristics. In this tutorial, we will use Fuzzy C-Means (FCM) clustering to analyze London accident data over 36 months, identifying high-density clusters and

Clustering London Accidents Data Using Fuzzy C-Means Read More »