Releases: commondataio/dataportals-registry

v1.2.0 - Major Data Catalog Registry Update

22 Nov 08:24

Release v1.2.0 - 2025-11-21

Major Additions

  • 1,993 new data catalog records across multiple countries and regions
  • 1,515 ArcGIS Server instances - massive expansion of geoportal coverage
  • 293 World-level catalogs - international and global data repositories
  • 97 French data catalogs - significant expansion of French open data coverage

Geospatial Infrastructure Expansion

  • 83 GeoServer instances
  • 37 GeoNode installations
  • 33 GeoNetwork catalogs
  • 8 Lizmap instances
  • 3 MapProxy instances
  • 2 MapBender instances

Open Data Platforms

  • 47 OpenDataSoft instances
  • 42 CKAN portals
  • 5 DKAN installations

Scientific Data Repositories

  • 38 Figshare-based repositories
  • 6 DSpace installations
  • 6 NADA microdata catalogs
  • 9 THREDDS servers

Improvements

  • 363 records updated with improved metadata
  • Updated API endpoints for IPT-based data catalogs
  • Enhanced metadata completeness across multiple records
  • Better geographic and administrative region coverage

Statistics

Record Changes

  • New records: 1,993
  • Modified records: 363
  • Deleted records: 0

Software Types (Top 10)

  • ArcGIS Server: 1,515
  • Custom/Unknown: 89
  • GeoServer: 83
  • OpenDataSoft: 47
  • CKAN: 42
  • Figshare: 38
  • GeoNode: 37
  • GeoNetwork: 33
  • ArcGIS Hub: 26
  • THREDDS: 9

Catalog Types

  • Geoportal: 1,726 (86.6%)
  • Open data portal: 181 (9.1%)
  • Scientific data repository: 68 (3.4%)
  • Microdata catalog: 7
  • Indicators catalog: 6
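
The percentages above can be reproduced with a short calculation. This is a minimal sketch that assumes each share is taken relative to the 1,993 new records in this release; the release notes do not state the denominator, but it matches the published figures. The names `TOTAL_NEW_RECORDS`, `catalog_types`, and `share` are illustrative only.

```python
# Assumption: shares are computed against the 1,993 new records of v1.2.0.
TOTAL_NEW_RECORDS = 1993

# Counts copied from the "Catalog Types" list.
catalog_types = {
    "Geoportal": 1726,
    "Open data portal": 181,
    "Scientific data repository": 68,
    "Microdata catalog": 7,
    "Indicators catalog": 6,
}


def share(count: int, total: int) -> float:
    """Return count as a percentage of total, rounded to one decimal place."""
    return round(100 * count / total, 1)


shares = {name: share(count, TOTAL_NEW_RECORDS) for name, count in catalog_types.items()}

for name, pct in shares.items():
    print(f"{name}: {catalog_types[name]:,} ({pct}%)")
```

Under the same assumption, the two types listed without a percentage come out at well under 1% each.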

Geographic Coverage

  • United States: 1,472 records (top states: Minnesota 54, California 51, Wisconsin 43, Ohio 42, Texas 39)
  • World-level: 293 records
  • France: 97 records
  • Netherlands: 11 records
  • Plus 30+ additional countries

See CHANGELOG.md for complete details and full statistics.

v1.1.0: Data Quality Analysis Tools

15 Nov 15:41

Added

  • Comprehensive data quality analysis tool (devdocs/analyze_duplicates_and_errors.py)
    • Detects duplicate UIDs and IDs across all records
    • Identifies missing required fields
    • Finds filename mismatches (where the id field doesn't match the filename)
    • Reports empty files and YAML parsing errors
    • Generates detailed reports in JSON, Markdown, and text formats
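
The checks listed above can be illustrated with a small sketch. This is not the code in devdocs/analyze_duplicates_and_errors.py; it is a simplified illustration that works on records already parsed from YAML, so YAML parsing errors and report generation are left out. The function name `analyze`, the `REQUIRED_FIELDS` tuple (treating `id` as required is an assumption), and the sample paths and values are all hypothetical.

```python
from collections import defaultdict
from pathlib import PurePath

# Assumption: "uid" is required (per the release notes); "id" is assumed required too.
REQUIRED_FIELDS = ("uid", "id")


def analyze(records):
    """Check parsed records for common data quality problems.

    `records` maps a file path to its parsed content (a dict),
    or to None when the file was empty.
    """
    issues = {"missing_fields": {}, "filename_mismatch": [], "empty": []}
    seen = {"uid": defaultdict(list), "id": defaultdict(list)}

    for path, record in records.items():
        if not record:
            issues["empty"].append(path)
            continue

        missing = [field for field in REQUIRED_FIELDS if not record.get(field)]
        if missing:
            issues["missing_fields"][path] = missing

        # Remember where each uid/id value occurs so duplicates can be reported.
        for field in ("uid", "id"):
            value = record.get(field)
            if value:
                seen[field][value].append(path)

        # The file name (without extension) is expected to equal the id field.
        if record.get("id") and PurePath(path).stem != record["id"]:
            issues["filename_mismatch"].append(path)

    for field in ("uid", "id"):
        issues[f"duplicate_{field}"] = {
            value: paths for value, paths in seen[field].items() if len(paths) > 1
        }
    return issues


# Hypothetical sample data, not taken from the registry.
sample = {
    "entities/alpha.yaml": {"uid": "x001", "id": "alpha"},
    "software/alpha.yaml": {"uid": "x002", "id": "alpha"},
    "entities/beta.yaml": {"id": "gamma"},
    "entities/empty.yaml": None,
}
report = analyze(sample)
print(report)
```

In this sample the same id appears in two directories, one record lacks a uid and has an id that differs from its file name, and one file is empty, so each check reports exactly one finding.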

Changed

  • Updated README.md with data quality and validation section
  • Added documentation for analysis tools in devdocs/ directory

Fixed

  • Identified 7 duplicate IDs (same ID in both the entities and software directories)
  • Identified 204 records missing required uid field
  • Identified 63 files with filename mismatches
  • Identified 1 empty file requiring attention

See CHANGELOG.md for full details.

v1.0.0

13 Apr 17:51

The project is mature enough for its first stable release.