Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

GRSciColl - collection descriptors - vocabulary for objectClassificationName #157

Open
ManonGros opened this issue Oct 1, 2024 · 2 comments
Labels
content Label for issue concerning vocabulary content GRSciColl Terms or vocabularies relevant for GRSciColl

Comments

@ManonGros
Copy link
Collaborator

I would like to have a controlled vocabulary for interpreting the Latimer Core field objectClassificationName: https://ltc.tdwg.org/quick-reference/#ObjectClassification.objectClassificationName.

The Latimer core term objectClassificationName is very convenient to describe subsets of collections that do not necessarily have other ways of being grouped. For example, this is helpful for groups of non-monophyletic taxa (for example Algae).

Currently we don't have any vocabulary but it would make sense to integrate the categories of the DISSCO discipline vocabulary which is described here: DOI 10.3897/rio.10.e118244

discipline categories
Anthropology Human Biology Archaeology Other
Botany Algae Bryophytes Fungi/Lichens (including Myxomycetes) Pteridophytes Seed plants
Extraterrestrial Collected on Earth Collected in space Other
Geology Mineralogy Petrology Loose sediment Other
Microorganisms Bacteria and Archaea Phages Plasmids ProtozoaVirus - animal / human Virus - plant Yeast and fungi Other
Palaeontology Botany & Mycology Invertebrates VertebratesTrace fossils MicrofossilsOther
Zoology invertebrates Arthropods - insects (Lepidoptera, Diptera, Hymenoptera, Coleoptera) Arthropods - other insects Arthropods - arachnids Arthropods - crustaceans & myriapods Porifera (sponges) Mollusca (bivalves, gastropods, cephalopods) Other
Zoology Vertebrates Fishes Amphibians Reptiles Birds Mammals Other
Other Geo/Biodiversity Other biological or geological objects which fit into none of the other defined categories

Note that there is some overlap with the GRSciColl discipline vocabulary for institution (https://registry.gbif.org/vocabulary/Discipline) and the GRSciColl collection content type vocabulary (https://registry.gbif.org/vocabulary/CollectionContentType/concepts). However, I think the DISSCO list of proposed values seems quite practical and reflects a lot of the sub-collection divisions I have encountered.

I am not necessarily suggesting that the DISSCO vocabulary be the final one used for the objectClassificationName but that it be integrated in the vocabulary used for interpretation of the field. Perhaps we could remove the "other" categories there?

@ManonGros ManonGros added content Label for issue concerning vocabulary content GRSciColl Terms or vocabularies relevant for GRSciColl labels Oct 1, 2024
@sharifX
Copy link

sharifX commented Oct 22, 2024

@ManonGros,

The SYNTHESYS+ report (https://doi.org/10.3897/rio.10.e118244) you mentioned was the foundation of the work we're doing around our data modelling. However, our schema has evolved a bit since then. I recommend looking at the schema page, particularly the Digital Specimen json schema.

We've changed the structure by adding three main categories:

topicOrigin
topicDomain
topicDiscipline

In the JSON structure, we've added "enum" (enumeration) -- to use as predefined list of acceptable values. Until we have a proper vocabulary server, this approach helps us maintain consistency in how data is categorised.

I think the "other" category is still needed to capture the rest. We are calling this Other Biodiversity and Other Geodiversity.

ods:topicOrigin": {
      "type": "string",
      "description": "Highest-level terms identifying the fundamentals of the activities, in which context the objects in the collection were collected",
      "enum": [
        "Natural",
        "Human-made",
        "Mixed origin",
        "Unclassified"
      ],
      "examples": [
        "Natural"
      ]
    },
    "ods:topicDomain": {
      "type": "string",
      "description": "High-level terms providing general domain information with which the objects are associated",
      "enum": [
        "Life",
        "Environment",
        "Earth System",
        "Extraterrestrial",
        "Cultural Artefacts",
        "Archive Material",
        "Unclassified"
      ],
      "examples": [
        "Life"
      ]
    },
    "ods:topicDiscipline": {
      "type": "string",
      "description": "Overarching classification of the scientific discipline to which the objects within the collection belong or are related",
      "enum": [
        "Anthropology",
        "Botany",
        "Astrogeology",
        "Geology",
        "Microbiology",
        "Palaeontology",
        "Zoology",
        "Ecology",
        "Other Biodiversity",
        "Other Geodiversity",
        "Unclassified"
      ],
      "examples": [
        "Botany"
      ]
    },

@ManonGros
Copy link
Collaborator Author

Thanks @sharifX for letting me know, I wasn't aware of that.

Does it mean that the categories like Algae, Bryophytes, Fungi/Lichens (including Myxomycetes), Pteridophytes and Seed plants are no longer part of any controlled vocabulary?

If it the type of classification I have seen before in several institutions and I think it would be great to make them searchable (things like Algae collections cannot be searched easily otherwise). Will you use some other controlled value to work with these cases?

(I am trying to make these traditional collections more easily discoverable and I am not sure how best to proceed).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
content Label for issue concerning vocabulary content GRSciColl Terms or vocabularies relevant for GRSciColl
Projects
None yet
Development

No branches or pull requests

2 participants