Closer alignment between international and UK data structures

MikeThacker · September 22, 2021, 2:18pm

UK and international (US-based) teams have been discussing closer alignment of data structures so we can better share learning, documentation, tools etc.

This follows:

extensions added in the UK following OpenCommunity Discovery work (with proposed Open Referral changes) and analysis by iStandUK
subsequent issue of international Human Services Data Specification (HSDS) version 2 adopting many of the UK enhancements

One proposed approach is to adopt all UK extensions and the recent international changes that switch from using enumerations to taxonomies. We would then use formally defined “application profiles” to represent: the Open Referral “classic” structure; the current Open Referral UK; and further application-specific profiles as needed over time.

Application profiles might also be used to make properties that are optional in the overall specification mandatory in particular situations.

We could use the exercise to consider other minor, backwards-compatible enhancements that have been proposed.

Please give any comments you have in support of this approach or identifying downsides.

MikeThacker · October 18, 2021, 3:34pm

@Dominic has investigated techniques whereby different application profiles can be defined.

He is suggesting using a tool such as Jolt for JSON to JSON transformation of the tabular data package definition for the full data structure.

One Jolt transformation would exist for each application profile. Transformations would remove optional parts of the data structure that don’t form part of a profile and may change optional properties to being required.
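
As a rough illustration of what one such per-profile transformation does, here is a minimal sketch in plain Python rather than Jolt. It is not the Jolt spec discussed in this thread. The `apply_profile` function, the profile format (`drop` / `require` lists keyed by table name) and the table and column names are all hypothetical; the only thing taken from Table Schema is the `constraints.required` property on a field.

```python
import copy


def apply_profile(package, profile):
    """Return a copy of a tabular data package descriptor narrowed to a profile.

    `profile` is a hypothetical format: table name -> {"drop": [...], "require": [...]}.
    Tables not listed in the profile are removed entirely.
    """
    result = copy.deepcopy(package)  # leave the full data structure untouched
    kept = []
    for resource in result["resources"]:
        rules = profile.get(resource["name"])
        if rules is None:
            continue  # this table is not part of the profile
        drop = set(rules.get("drop", []))
        require = set(rules.get("require", []))
        # Remove optional columns that the profile does not use.
        fields = [f for f in resource["schema"]["fields"] if f["name"] not in drop]
        # Tighten selected optional columns to required.
        for field in fields:
            if field["name"] in require:
                field.setdefault("constraints", {})["required"] = True
        resource["schema"]["fields"] = fields
        kept.append(resource)
    result["resources"] = kept
    return result


# Hypothetical two-table package, far smaller than the real specification.
package = {
    "resources": [
        {"name": "service", "schema": {"fields": [
            {"name": "id", "constraints": {"required": True}},
            {"name": "name"},
            {"name": "email"},
        ]}},
        {"name": "review", "schema": {"fields": [{"name": "id"}]}},
    ]
}

# Hypothetical profile: keep only "service", drop "email", require "name".
profile = {"service": {"drop": ["email"], "require": ["name"]}}

narrowed = apply_profile(package, profile)
```

With one such profile definition per application profile, the full data package definition stays the single source and each profile is derived from it.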

robredpath · November 16, 2021, 3:28pm

In general, there are three core things required for any sort of application profile / extension / customisation of a standard:

Some way of describing the changes that the profile makes to the base schema.
Some documentation of any constraints that can’t be encoded in schema, along with any guidance or useful information about how to use the profile.
Some way of bundling those together to make a coherent artefact that can be reasoned about, discussed, used, etc.

I hadn’t come across Jolt before - it looks interesting! It feels a bit heavy for this particular use, but that might just be my Java vs Python bias showing. Similar tools include json-merge-patch and python-json-patch. I’d prefer to use something that uses one of the actual standards for diffing and patching JSON (JSON Patch and JSON Merge Patch). Some of the properties of JSON Table Schema make it quite hard to use those standards effectively, particularly around how you identify which object you’re working on at any given time. Ideally, a patch format would include some human-readable contextual information - I’d rather be talking about “removing the language column from the meta_table_description table” than “deleting /resources/21/schema/fields/3”.
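
To make the addressing problem concrete, here is a small Python sketch under stated assumptions. `merge_patch` follows the algorithm given in RFC 7396 (JSON Merge Patch), written out by hand rather than taken from any of the libraries mentioned above. `field_pointer` is a hypothetical helper, and the two-table package is invented for illustration. Because Table Schema keeps columns in a `fields` array, a merge patch can only replace that whole array, and a JSON Patch path has to address a column by position.

```python
def merge_patch(target, patch):
    """Apply a JSON Merge Patch (RFC 7396 algorithm) and return the result."""
    if not isinstance(patch, dict):
        return patch  # non-objects, including arrays, replace the target wholesale
    if not isinstance(target, dict):
        target = {}
    result = dict(target)
    for key, value in patch.items():
        if value is None:
            result.pop(key, None)  # null means "remove this member"
        else:
            result[key] = merge_patch(result.get(key), value)
    return result


def field_pointer(package, table, column):
    """Hypothetical helper: turn (table, column) names into a JSON Pointer path."""
    for r_index, resource in enumerate(package["resources"]):
        if resource["name"] != table:
            continue
        for f_index, field in enumerate(resource["schema"]["fields"]):
            if field["name"] == column:
                return f"/resources/{r_index}/schema/fields/{f_index}"
    raise KeyError(f"{column!r} not found in table {table!r}")


# Invented package; the real one has many more tables, hence indexes like 21.
package = {
    "resources": [
        {"name": "service", "schema": {"fields": [{"name": "id"}, {"name": "name"}]}},
        {"name": "meta_table_description", "schema": {
            "primaryKey": "id",
            "fields": [{"name": "id"}, {"name": "table_name"}, {"name": "language"}],
        }},
    ]
}

# Human-readable intent resolved to the positional path JSON Patch would need.
pointer = field_pointer(package, "meta_table_description", "language")

# Merge Patch cannot remove one column: the patch must restate every column
# that should survive, because the array is replaced as a whole.
schema = package["resources"][1]["schema"]
patched = merge_patch(schema, {"fields": [{"name": "id"}, {"name": "table_name"}]})
```

A name-based layer like `field_pointer` would let a profile be written and discussed in terms of tables and columns while still compiling down to a standard patch format.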

I think it’s worth us considering rolling our own tooling, and seeing if we can collaborate with the folk at Frictionless Data who have already created a Tabular diff format - there’s maybe something to build on there.

For some inspiration, OCDS have a template extension for their extension mechanism which bundles together all of the stuff I mention above. It uses JSON Merge Patch (JSON Merge Patch works much better with “regular” JSON Schema than JSON Table Schema) alongside some pre-defined containers for metadata and other components.

Dominic · November 23, 2021, 5:33pm

Hi @robredpath, that’s a good comprehensive solution you have suggested.

In the meantime, I have created a basic solution to this problem using Jolt, which I have illustrated in GitHub.

Using this approach a “spec” would have to be created for each required view of the extended data package.

MikeThacker · February 10, 2022, 3:30pm

As requested by some, I’ve created a separate thread, “US and UK Alignment and version control”, for work just starting in this area.
