Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Minimal provenance in VOTable output #37

Open
gilleslandais opened this issue Apr 16, 2021 · 7 comments
Open

Minimal provenance in VOTable output #37

gilleslandais opened this issue Apr 16, 2021 · 7 comments

Comments

@gilleslandais
Copy link
Collaborator

For VizieR it will be really appreciated (to not say required) to have common way to provide a minimal origin information.

The mango VizieR prototype uses the dock "associatedData" to link a remote URL which contains a "complete" VOProvenance.
I would like to add an other concise provenance output in the VOTable (for "naive" client)

The minimal provenance for a VOTable are: author+year_of_publication, doi or bibcode of the reference article.
In DatasetDM, I didn't see a clear distinction between creator/author .. Markus do you have an example of this serialization in your output?

.. and I would like more - but is it possible in a concise serialization: to specify a short annotation to specify the origin of a measure - e.g. the filter configuration: with the curator + a URL.
Any idea ?

@msdemlei
Copy link
Contributor

msdemlei commented Apr 19, 2021 via email

@mcdittmar
Copy link
Collaborator

gilleslandais wrote:
In DatasetDM, I didn't see a clear distinction between creator/author .. Markus do you have an example of this serialization in your output?

msdemlei wrote:
No. But now that you mention it, what might actually be smart is sync VOResource's "implicit" (there's no VO-DML (yet?)) data model with Dataset dm and friends. I'm not 100% sure I'd like to see a lot of VOResource in instance documents (as usual: What should clients do with it?), and we'd have to think about whether there ought to be a single "resource" DM or whether some of the types could become DMs of their own. But whatever the result of these considerations: if DM deals with Registry content, we should make sure there are no unnecessary inconsistencies.

The DatasetDM does map its content to the Resource Metadata elements.. so is in a sense, actualizing the 'implicit' model, and is VO-DML compliant.

As for creator/author.. what distinction are you looking for?

  • dm:Creator == entity responsible for the creation of the Dataset (under DataID)
  • dm:Contributor == other entities involved in the creation of the Dataset (under DataID)
  • dm:Publisher == entity making the Dataset available (under Curation)

I think 'author' implies the Dataset is a paper or some sort, rather than a Photometric Filter or LightCurve.

I've also been curious to see how Provenance will get conveyed in the context of Datasets.
In the older models (Spectrum, Characterization, ObsCore) there is, if I recall, a Provenance placeholder node in the Observation/Dataset metadata area. I expect there is a distinction between identifying the Agents/Entities involved in THIS dataset (part of DatasetMetadata), vs identifying the HISTORY of the Dataset ( Activity which created it, progenitors, etc ).

I would like to add an other concise provenance output in the VOTable (for "naive" client)

Do you mean outside of the Annotation syntax? similar to a COOSYS/TIMESYS element directly in VOTable.

@gilleslandais
Copy link
Collaborator Author

Ok for the datasetDM - in VizieR context, CDS is the publisher (Curator) and the author is the Creator (including the biblio reference in the same DataID) - so it could be added in AssocDataDock.
But you are true , a including (refCode, author, year) is something that I prefer !

For measures it is may be more complicate -
In vizier the photometry filter characteristics is not a part of the original data (it could - but often it is added by CDS who assigned a filter or a similar filter for magnitude columns) - if VizieR provides the photometry characteristics, it is important to specify the origin. ProvDM is adapted for that, but the parsing is may be a little discouraging for clients..

So may be a simple way could be just a comment ?.. in that case is it better to put the (origin) comment on Mango:Parameter.comment or on Mango:stcextend.Photometry ?

@mcdittmar
Copy link
Collaborator

mcdittmar commented Apr 19, 2021 via email

@lmichel
Copy link
Collaborator

lmichel commented Apr 23, 2021

If the purpose of the embedded Prov is just to say whether a filter has been added by the CDS, we could consider doing things in a simpler way.

As PhotFilter still has to be wrapped into MANGO, we can add a field telling the filter origin. This is somehow similar to the reduction status Mango had at the beginning (raw, calibrated..). This value could be carried either by an enum or a vocabulary.

@msdemlei
Copy link
Contributor

msdemlei commented Apr 28, 2021 via email

@mcdittmar
Copy link
Collaborator

mcdittmar commented May 3, 2021 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants