Select Page

Metadata Catalogue in High Value Nutrition (National Science Challenge)

Rob Carter, Dr Dharani Sontam, Yvette Wharton, Professor Mark Gahegan, Centre for eResearch; Dr Simmon Hofstetter, Operations Manager, Professor Richard Mithen, Liggins Institute; Joanne Todd, Challenge Director, High Value Nutrition, National Science Challenge.

  1. Home
  2.  • 
  3. Project
  4.  • Metadata Catalogue in High Value Nutrition (National Science Challenge)

Nutrition and research data

You’d think that research projects would share methodologies in common with Libraries when it comes to research collections. But in practice it is uncommon to build a Data Catalogue that itemises what research data was created.

The Centre for eResearch’s Research Data Management team provides consultancy on processes and structures that support healthy data workflows. Central to this work, we must be able to answer the question: “Where is the data?”

It’s important to keep standardised records about the research data that is collected and created. Metadata enables future researchers to discover previous knowledge, because metadata records are publicly searchable. Metadata is what makes services like Google search possible.
Apart from being good for exposure and internet search, a Metadata Catalogue should be flexible enough to make statements about this data in the context of Tikanga Māori. For example, where Māori assert kaitiakitanga over a particular species that is the subject of research.

 
 

The Data Catalogue sets out contact information for individuals and organisations who are involved in the guardianship of the data. In this way, it is possible to involve these people in future decisions around the data. We use an industry standard knowledge repository platform to publish these metadata records. Records are regularly syndicated to data.govt.nz (Figure 1) and included in their collection of datasets. Syndication helps to ensure that the data is more likely to continue to be available into the future.

Ko Ngā Kai Whai Painga, High Value Nutrition (HVN) National Science Challenge, is a multi-year, multi-study research programme. It asks questions about nutrition and diet from early childhood onwards. It’s aim: to grow the science excellence and knowledge Aotearoa New Zealand needs to create and deliver food to the world that people choose to stay healthy and well.

CeR provides two staff members, Robert Carter and Dharani Sontam, to develop the Metadata Catalogue from the ground up. The scale and number of studies being conducted, along with the added complexities of COVID, required a flexible, collaborative approach. With many data types spread across multiple organisations, the project has benefited from previous work with CeR on Data Management Plans and Standard Operating Procedures. The Metadata Catalogue links each study with it’s Ethics Registry approval records; providing detailed, searchable, clinical information.

Seeding Through Feeding (SUN): Nourishing the infant microbiome to support immune health

The SUN Study is a double-blind, randomised controlled trial designed to recruit 300 infants from urban and central Auckland, New Zealand. The SUN Study aims to determine the associations, and possible causality between prebiotic feeding, growth of immune health beneficial microbes in the infant gut, with reduced number of respiratory infections and improved vaccination responses in infants 6 to 12 months of age. We are working with the project team to define and include information relating to 19 individual data types resulting from the work.

He Rourou Whai Painga: An Aotearoa New Zealand diet for metabolic health and whanau wellbeing

A national Aotearoa New Zealand dietary intervention study to evaluate the effect of a 12-week whole-diet intervention incorporating nutritious domestically-produced food and beverage products and dietary change support, compared with habitual diet, on the MetS-Z score in individuals at risk of developing metabolic disease in a randomised controlled trial. In this case, CeR collected details of 11 different data result sets, for inclusion in the Metadata Record.

Future-proofing research data

With technology changing at such a rapid pace, the phrase ‘future proof’ might raise alarm bells. Institutional archives have been replaced by Digital Object Stores, filing cabinets with cloud storage, all in the space of one lifetime. How do you make provision for longevity in the digital age?

The tools of Archivists come into play when making calculated guesses about what the future will hold. Metadata must obey some kind of consistent format that is both human and machine readable. Short of inscribing the information on a brass plaque, we try to ensure that only the minimum of technology is required to read and make use of the Metadata Record. The record must carry with it a description of what the fields in the record mean.

To this end, three items of data are collected for each field in the record: the field name, a description of the data the field contains, and the data itself. The intension is to make the Metadata self-descriptive, rather than relying on some external pre-existing schema. Over time data formats change, and where possible, text is probably the most accessible format to use. On top of this, the project employs JSON as it’s baseline machine readable format.

 

Where to from here?

As the team continues to build the Metadata Catalogue, we measure what we have done in terms of the number of studies covered by the work, and in terms of the discoverability of the research. While it sometimes seems that digital data has begin to take on an ephemeral quality, the HVN Metadata Catalogue provides visibility of research data to a standard that supports researchers in the years to come.

 

Figure 1. HVN Metadata recods are regularly sundicated to data.govt.nz

 

See more case study projects

Seeding Through Feeding (SUN) project in High Value Nutrition – National Science Challenge

Seeding Through Feeding (SUN) project in High Value Nutrition – National Science Challenge

Building resilience in young people through sensing technology

Building resilience in young people through sensing technology

Our Voices: using innovative techniques to collect, analyse and amplify the lived experiences of young people in Aotearoa

Our Voices: using innovative techniques to collect, analyse and amplify the lived experiences of young people in Aotearoa

Southern Right Whale Tohora project

Southern Right Whale Tohora project

Asthma exacerbations in New Zealand 2010-2019: a national population-based study

Asthma exacerbations in New Zealand 2010-2019: a national population-based study

The impact of upzoning on housing construction in Auckland

The impact of upzoning on housing construction in Auckland

Extended reality is turning cancer research into a team sport

Extended reality is turning cancer research into a team sport

Analysis of incidents on New Zealand beaches

Analysis of incidents on New Zealand beaches

Painting the brain: multiplexed tissue labelling of human brain tissue to facilitate discoveries in neuroanatomy

Painting the brain: multiplexed tissue labelling of human brain tissue to facilitate discoveries in neuroanatomy

Decoding the work-from-home phenomenon: insights from location-based service data

Decoding the work-from-home phenomenon: insights from location-based service data

The use of digital footprints in the US mortgage market

The use of digital footprints in the US mortgage market

Detecting anomalous matches in professional sports: a novel approach using advanced anomaly detection techniques

Detecting anomalous matches in professional sports: a novel approach using advanced anomaly detection techniques

Benefits of linking routine medical records to the GUiNZ longitudinal birth cohort: Childhood injury predictors

Benefits of linking routine medical records to the GUiNZ longitudinal birth cohort: Childhood injury predictors

Using a virtual machine-based machine learning algorithm to obtain comprehensive behavioural information in an in vivo Alzheimer’s disease model

Using a virtual machine-based machine learning algorithm to obtain comprehensive behavioural information in an in vivo Alzheimer’s disease model

Mapping livability: the “15-minute city” concept for car-dependent districts in Auckland, New Zealand

Mapping livability: the “15-minute city” concept for car-dependent districts in Auckland, New Zealand

Quantifying gas narcosis in compressed gas diving

Quantifying gas narcosis in compressed gas diving

Estimating quality of life: a spatial microsimulation model of wellbeing in Aotearoa New Zealand

Estimating quality of life: a spatial microsimulation model of wellbeing in Aotearoa New Zealand

Video compression for REACH Lab’s study of  family resilience and wellbeing

Video compression for REACH Lab’s study of family resilience and wellbeing

Listening to equations: a tool for the audification of heteroclinic networks

Listening to equations: a tool for the audification of heteroclinic networks

The Effects of Short-Term Tourist Rentals on Local Residents

The Effects of Short-Term Tourist Rentals on Local Residents

Accounting for Errors in Data Improves Divergence Time Estimates in Single-cell Cancer Evolution

Accounting for Errors in Data Improves Divergence Time Estimates in Single-cell Cancer Evolution

VRhook: A Data Collection Tool for VR Motion Sickness Research

VRhook: A Data Collection Tool for VR Motion Sickness Research

Ahuahu Great Mercury Island Online Database

Ahuahu Great Mercury Island Online Database

Automating Data Collection and Generation for The Rongowai Mission

Automating Data Collection and Generation for The Rongowai Mission

Travelling Heads – Measuring Reproducibility and Repeatability of Magnetic Resonance Imaging in Dementia

Travelling Heads – Measuring Reproducibility and Repeatability of Magnetic Resonance Imaging in Dementia

Novel Subject-Specific Method of Visualising Group Differences from Multiple DTI Metrics without Averaging

Novel Subject-Specific Method of Visualising Group Differences from Multiple DTI Metrics without Averaging

Interpretation of Non-coding Mutations Driving Melanoma Risk and Its Comorbidities

Interpretation of Non-coding Mutations Driving Melanoma Risk and Its Comorbidities

Who Are The 1M and 1X? Police Engagement with Citizens in Mental Distress

Who Are The 1M and 1X? Police Engagement with Citizens in Mental Distress

Representation of Multimodel Data – A Challenging Task

Representation of Multimodel Data – A Challenging Task

Assessing Marine Ecosystems to Improve Management

Assessing Marine Ecosystems to Improve Management

Metadata Catalogue in High Value  Nutrition (National Science Challenge)

Metadata Catalogue in High Value Nutrition (National Science Challenge)

Improving In Vitro Fertilisation (IVF) with Machine and Deep Learning

Improving In Vitro Fertilisation (IVF) with Machine and Deep Learning

Pacific Rheumatic Fever Project

Pacific Rheumatic Fever Project

Developing a genomics-specific Data Management Plan (DMP) using the  Data Stewardship Wizard

Developing a genomics-specific Data Management Plan (DMP) using the Data Stewardship Wizard

Understanding the effects of Airbnb on land use, land value and regulation

Understanding the effects of Airbnb on land use, land value and regulation

Calibrating gravitational wave signal parameters of Extreme Mass Ratio Inspirals (EMRIs)

Calibrating gravitational wave signal parameters of Extreme Mass Ratio Inspirals (EMRIs)

Automated stone artefacts classification using machine learning

Automated stone artefacts classification using machine learning

Hands-on DNA: exploring the impact of virtual reality on teaching DNA structure and function

Hands-on DNA: exploring the impact of virtual reality on teaching DNA structure and function

Re-assess urban spaces under COVID-19 impact: sensing Auckland social ‘hotspots’ with mobile location data

Re-assess urban spaces under COVID-19 impact: sensing Auckland social ‘hotspots’ with mobile location data

Aotearoa New Zealand’s changing coastline – Resilience to Nature’s Challenges (National Science Challenge)

Aotearoa New Zealand’s changing coastline – Resilience to Nature’s Challenges (National Science Challenge)

Auckland housing and land use geo-data

Auckland housing and land use geo-data

Rapid monitoring of infrastructural health using remote sensing

Rapid monitoring of infrastructural health using remote sensing

Enhancing Spontaneous Recovery after Stroke Study (ESPRESSo)

Enhancing Spontaneous Recovery after Stroke Study (ESPRESSo)

Data analytics and visualisation for improving  public health and transport planning

Data analytics and visualisation for improving public health and transport planning

Data maturity project in High Value Nutrition (Phase 2) – National Science Challenge

Data maturity project in High Value Nutrition (Phase 2) – National Science Challenge

Supporting the airborne remote sensing mission – Rongowai

Supporting the airborne remote sensing mission – Rongowai

A collaborative extended reality tool to examine tumour evolution (Phase II)

A collaborative extended reality tool to examine tumour evolution (Phase II)

Data maturity project in High Value Nutrition, National Science Challenge

Data maturity project in High Value Nutrition, National Science Challenge

Haka on the move: sport circuits and cultural performance 

Haka on the move: sport circuits and cultural performance 

Proteins under a computational microscope: designing in-silico strategies to understand and develop molecular functionalities in Life Sciences and Engineering

Proteins under a computational microscope: designing in-silico strategies to understand and develop molecular functionalities in Life Sciences and Engineering

Remote temperature monitoring to reduce the spread of COVID-19

Remote temperature monitoring to reduce the spread of COVID-19

COVID-19 exponential growth visualisation

COVID-19 exponential growth visualisation

Developing virtual capabilities for the Science Payload Operations Centre

Developing virtual capabilities for the Science Payload Operations Centre

Hosting visualisation and analytics tools for COVID-19 studies

Hosting visualisation and analytics tools for COVID-19 studies

Exploring perceptions towards climate change over time on Twitter

Exploring perceptions towards climate change over time on Twitter

Coastal image classification and nalysis based on convolutional neural betworks and pattern recognition

Coastal image classification and nalysis based on convolutional neural betworks and pattern recognition

Calcium signalling in salivary gland acinar cells

Calcium signalling in salivary gland acinar cells

Anti-corruption regulations for promoting socially responsible practices

Anti-corruption regulations for promoting socially responsible practices

Determinants of translation efficiency in the evolutionarily-divergent protist Trichomonas vaginalis

Determinants of translation efficiency in the evolutionarily-divergent protist Trichomonas vaginalis

Analysing text data by time-series feature engineering

Analysing text data by time-series feature engineering

An investigation into Leap Motion device for “gesture-as-sign”

An investigation into Leap Motion device for “gesture-as-sign”

Antibiotic resistance and the “end of modern medicine ”

Antibiotic resistance and the “end of modern medicine ”

Develop short-term eruption warning systems for Whakaari and other volcanoes

Develop short-term eruption warning systems for Whakaari and other volcanoes

Evenly spaced observation fields from irregularly sampled data in the Southern Ocean

Evenly spaced observation fields from irregularly sampled data in the Southern Ocean

Measuring impact of entrepreneurship activities on students’ mindset, capabilities and entrepreneurial intentions

Measuring impact of entrepreneurship activities on students’ mindset, capabilities and entrepreneurial intentions

Using Zebra Finch data and deep learning classification to identify individual bird calls from audio recordings

Using Zebra Finch data and deep learning classification to identify individual bird calls from audio recordings

NETwork! analysis in cancer – managing genomics research data and building a repository workflow

NETwork! analysis in cancer – managing genomics research data and building a repository workflow

The Coronary Atlas – data processing workflow optimisation

The Coronary Atlas – data processing workflow optimisation

3D visualisation of indigenous burial site in Roonka

3D visualisation of indigenous burial site in Roonka

Automated measurement of intracranial cerebrospinal fluid volume and outcome after endovascular thrombectomy for ischemic stroke

Automated measurement of intracranial cerebrospinal fluid volume and outcome after endovascular thrombectomy for ischemic stroke

A new ‘stratigraphy’: interpreting object relationships with 3D point densities

A new ‘stratigraphy’: interpreting object relationships with 3D point densities

Towards the use of deep learning techniques for storm surge prediction

Towards the use of deep learning techniques for storm surge prediction

Using simple models to explore complex dynamics: A case study of macomona liliana (wedge-shell) and nutrient variations

Using simple models to explore complex dynamics: A case study of macomona liliana (wedge-shell) and nutrient variations

Development of Machine Learning methodology for genomic research

Development of Machine Learning methodology for genomic research

An Archaeological database for threatened North Island rock art in New Zealand

An Archaeological database for threatened North Island rock art in New Zealand

Presence: distributed mixed reality learning environment

Presence: distributed mixed reality learning environment

Digital video and the early learning lab

Digital video and the early learning lab

Publishing the Bay of Island Bottlenose dolphin catalogue

Publishing the Bay of Island Bottlenose dolphin catalogue

Modelling the diurnal cycle* of winds and clouds

Modelling the diurnal cycle* of winds and clouds

Presence: distributed mixed reality learning environment

Presence: distributed mixed reality learning environment

Using research virtual machines to analyse fMRI datasets

Using research virtual machines to analyse fMRI datasets

Genomic Virtual Lab (GVL) as a bioinformatics training platform

Genomic Virtual Lab (GVL) as a bioinformatics training platform

SwiftLaTeX- Exploring web-based true WYSIWYG editing for digital publishing

SwiftLaTeX- Exploring web-based true WYSIWYG editing for digital publishing

Climate change impacts on weather-related hazards

Climate change impacts on weather-related hazards

Understanding tumour evolution through augmented reality

Understanding tumour evolution through augmented reality

Myocardial motion tracking and strain calculation using Deep Learning networks

Myocardial motion tracking and strain calculation using Deep Learning networks

OnTask pilot at the Centre for Learning and Research in Higher Education

OnTask pilot at the Centre for Learning and Research in Higher Education

Visualising the University campus in 3D

Visualising the University campus in 3D

Visualising protein interaction

Visualising protein interaction

Biological heritage National Science Challenge eDNA virtual hub

Biological heritage National Science Challenge eDNA virtual hub

Interactive AR art – Project Gordon

Interactive AR art – Project Gordon

1-D numerical models of post-glacial river evolution

1-D numerical models of post-glacial river evolution

Mathematically modelling gastrointestinal electrical activity

Mathematically modelling gastrointestinal electrical activity

3D Cryo-EM reconstructions of macromolecular complexes

3D Cryo-EM reconstructions of macromolecular complexes

Engine knock in a spark-ignition engine with hydrogen supplementation

Engine knock in a spark-ignition engine with hydrogen supplementation

The complex unsteady flow within a fluid-filled annulus and its transition to turbulence

The complex unsteady flow within a fluid-filled annulus and its transition to turbulence

Using data mining for digital ink recognition

Using data mining for digital ink recognition

The landscape costs of brushtail possum dispersal

The landscape costs of brushtail possum dispersal

Accelerating the discovery of natural products made by orphan megasynthases

Accelerating the discovery of natural products made by orphan megasynthases

Improving the short term precipitation forecasts for New Zealand

Improving the short term precipitation forecasts for New Zealand

Finding genetic variants responsible  for human disease hiding in the universe of benign variants

Finding genetic variants responsible for human disease hiding in the universe of benign variants

Revealing key processes in enzyme efficiency through high performance computing

Revealing key processes in enzyme efficiency through high performance computing

3D Electromagnetic modeling and simulation using heterogeneous computing

3D Electromagnetic modeling and simulation using heterogeneous computing

Hemodynamics in the microcirculation

Hemodynamics in the microcirculation

Putting turbulence to work

Putting turbulence to work

Why are some molecules drugs?

Why are some molecules drugs?

Bayesian additive regression trees  vs logistic regression – estimation of propensity scores

Bayesian additive regression trees vs logistic regression – estimation of propensity scores

Fully coupled thermo-hydro-mechanical modelling of permeability enhancement by the finite element method

Fully coupled thermo-hydro-mechanical modelling of permeability enhancement by the finite element method

Modelling dispersal and ecological competition in a statistical phylogeographic framework

Modelling dispersal and ecological competition in a statistical phylogeographic framework

Studying the shape and the size of the universe

Studying the shape and the size of the universe

Planet hunting

Planet hunting

Simulating quantum mechanics on high performance computing cluster

Simulating quantum mechanics on high performance computing cluster

Multiscale modelling of saliva secretion

Multiscale modelling of saliva secretion

Modelling dual reflux pressure swing adsorption (DR-PSA) units for gas separation in natural gas processing

Modelling dual reflux pressure swing adsorption (DR-PSA) units for gas separation in natural gas processing

Improving the treatment of heart disease

Improving the treatment of heart disease

Estimating migration rates in the budding yeast Saccharomyces cerevisiae

Estimating migration rates in the budding yeast Saccharomyces cerevisiae

Number theoretic algorithms in cryptography

Number theoretic algorithms in cryptography

Molecular phylogenetics uses genetic data to reconstruct the evolutionary history of individuals, populations or species

Molecular phylogenetics uses genetic data to reconstruct the evolutionary history of individuals, populations or species

Phylogeny and phylogeography of the family kyphosidae (Perciformes: teleostei)

Phylogeny and phylogeography of the family kyphosidae (Perciformes: teleostei)

Testing what cosmic inflation really predicts

Testing what cosmic inflation really predicts

Multigene environmental DNA data analysis for New Zealand genomic observatory

Multigene environmental DNA data analysis for New Zealand genomic observatory

Finding genetic variants responsible for human disease hiding in universe of benign variants

Finding genetic variants responsible for human disease hiding in universe of benign variants

BEAST, Bayesian evolutionary analysis sampling trees

BEAST, Bayesian evolutionary analysis sampling trees

The formation of surface archaeological deposits in arid Australia

The formation of surface archaeological deposits in arid Australia

Statistical modelling of carryover effects after cessation of treatments

Statistical modelling of carryover effects after cessation of treatments

High-resolution cryo-electron microscopy of protein complexes and machines

High-resolution cryo-electron microscopy of protein complexes and machines

ARCI, archaeology eResearch collaboration initiative

ARCI, archaeology eResearch collaboration initiative

Optimisation of blades on large wind turbines with individual pitch control and trailing edge flaps

Optimisation of blades on large wind turbines with individual pitch control and trailing edge flaps

Quality of care and outcomes in children with cleft lip and/or palate

Quality of care and outcomes in children with cleft lip and/or palate

Geographic and temporal information retrieval on massive document collections

Geographic and temporal information retrieval on massive document collections

Homodynamics in the microcirculation

Homodynamics in the microcirculation

Processing structure-from-motion photogrammetry on the cluster

Processing structure-from-motion photogrammetry on the cluster

Computational investigation of catalysis mechanisms for polyurethane synthesis

Computational investigation of catalysis mechanisms for polyurethane synthesis

Virtual childhood obesity prevention laboratory

Virtual childhood obesity prevention laboratory

Giving Pacific research greater reach

Giving Pacific research greater reach

Development of novel waveguides  in the terahertz (THz) region

Development of novel waveguides in the terahertz (THz) region

Modelling of costs of diets  by INFORMAS

Modelling of costs of diets by INFORMAS

Foodback

Foodback

Finite element method code for  modelling biological cells

Finite element method code for modelling biological cells

The future of memory: Neuroimaging memory and imagination with functional MRI

The future of memory: Neuroimaging memory and imagination with functional MRI

Modelling and visualisation of calcium waves in parotid acinar cells

Modelling and visualisation of calcium waves in parotid acinar cells

Mapping donor contributions in the Pacific

Mapping donor contributions in the Pacific

Visualising humpback whale migration

Visualising humpback whale migration

Visualising the 2010 and 2011  Canterbury earthquakes

Visualising the 2010 and 2011 Canterbury earthquakes

Data management planning for MOA*

Data management planning for MOA*

Research data publishing  and preservation at COMPASS

Research data publishing and preservation at COMPASS

Centre for eResearch machine learning service

Centre for eResearch machine learning service

Building a discrete global  grid gazetteer service

Building a discrete global grid gazetteer service

The new Wanhal catalogue

The new Wanhal catalogue

Passive acoustic modelling

Passive acoustic modelling

Using GPUs to expand our understanding of the Solar System

Using GPUs to expand our understanding of the Solar System

Shedding new light on dark matter

Shedding new light on dark matter

Aerodynamics modelling paves the way for improved yacht designs

Aerodynamics modelling paves the way for improved yacht designs

Modernising models to help diagnose or treat disease and injury

Modernising models to help diagnose or treat disease and injury

Wandering around the molecular landscape: embracing virtual reality as a research showcasing outreach and teaching tool

Wandering around the molecular landscape: embracing virtual reality as a research showcasing outreach and teaching tool

ALTER: Between human and nonhuman – a VR art exhibition

ALTER: Between human and nonhuman – a VR art exhibition

Disposition of Microsoft HoloLenses for a Pop-Up Reality Shop to demonstrate the progress of a research project

Disposition of Microsoft HoloLenses for a Pop-Up Reality Shop to demonstrate the progress of a research project

Improving diagnosis for schistosomiasis by using the ‘metabolic footprint’ of urine samples from an animal model of Schistosoma infection to identify possible biomarkers

Improving diagnosis for schistosomiasis by using the ‘metabolic footprint’ of urine samples from an animal model of Schistosoma infection to identify possible biomarkers

Making stroke recovery prediction tools freely available

Making stroke recovery prediction tools freely available

MFT-ICR mass spectrometry data management and analysis workflow

MFT-ICR mass spectrometry data management and analysis workflow

Taking a ‘Big Data’ approach to find new clinical-omic associations in cancer

Taking a ‘Big Data’ approach to find new clinical-omic associations in cancer

Growing Up in New Zealand

Growing Up in New Zealand

Improving arrival time predictions for vehicles in a public transport network

Improving arrival time predictions for vehicles in a public transport network

Distributed and cloud-based control at field-level for systems interacting with soft bodies

Distributed and cloud-based control at field-level for systems interacting with soft bodies

Mobile Click Fraud Attack (MCFA)

Mobile Click Fraud Attack (MCFA)

Skin-omics: exploring the volatile organic compounds on human skin

Skin-omics: exploring the volatile organic compounds on human skin

New analytics tools for workload planning for the 2018 New Zealand Census

New analytics tools for workload planning for the 2018 New Zealand Census

Visualising the New Zealand Index of Multiple Deprivation

Visualising the New Zealand Index of Multiple Deprivation