upa_nodes.tsv: All rows should have 14 columns #204

hrshdhgd · 2024-08-04T05:41:44Z

There are some rows that have just 8 columns and the 6 blank entries to make up the 14 columns are missing. So duckdb does not read them properly.

id	category	name	provided_by	iri
RHEA:19032
RHEA:20871
RHEA:32678
RHEA:10139
UPA:UCR00014	biolink:MolecularActivity	pyruvate + thiamine diphosphate = 2-hydroxyethyl-ThPP + CO(2)	upa.json	http://purl.obolibrary.org/obo/UPa_UCR00014

The last line (and almost all like this) have no <tab>blank<tab> for the columns:

The text was updated successfully, but these errors were encountered:

hrshdhgd · 2024-08-04T06:00:36Z

Same goes for ec_nodes.tsv

hrshdhgd · 2024-08-04T22:25:55Z

ec_edges.tsv also needs double checking. The columns are scrambled

subject	predicate	object	relation	primary_knowledge_source
urn:uuid:55d247c3-6b8f-4224-882e-6e19f81613b0	RO:0002327	biolink:inverseOf	RO:0002333	inverseOf
urn:uuid:69e8fdb0-2c3c-4cfe-a2b4-bba96efa5e4d	EC:1	biolink:subclass_of	owl:Thing	rdfs:subClassOf

First column is supposed to be id

hrshdhgd assigned bsantan Aug 4, 2024

Provide feedback