Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Cannot find dataset when refreshing #257

Open
samfenske opened this issue Nov 9, 2022 · 7 comments
Open

Cannot find dataset when refreshing #257

samfenske opened this issue Nov 9, 2022 · 7 comments

Comments

@samfenske
Copy link

Hello, I have been able to upload a handful of datasets to a project, and often times I will have to clear the cache for the new dataset to appear. However, I noticed that when I clear the cache with a dataset open or try to open a specific link referencing the dataset (ex. the url for the cellbrowser object with a certain gene selected), I get the following error.

"Could not find a dataset at 01integrated_bal_v5/dataset.json?19d85d6751. If you are sure that the link is correct, please contact the administrator of this server, or [email protected] if this is running at UCSC."

Looking through the project's index.html file, I thought the md5 code may have something to do with this as the 10-digit error code (19d85d6751) did not match the 10-digit md5 code. I tried changing it and reloading the page, but received the same error. Am I on the right track here and just haven't synchronized the md5 code, or is there perhaps another way I can get cellbrowser to recognize the dataset? I should clarify that the objects work perfectly fine when I open them, I just get this error when I clear the cache or open a specific link for the object.

Thanks!

@maximilianh
Copy link
Owner

maximilianh commented Nov 14, 2022

Thank you for this observation, we've had someone else complain about this, and it's not intended to be like this, but a bug that has to do with how the MD5 checksums are calculated. Can you give me a few more details?

having an incorrect MD5 in the URL should not matter, should not trigger an error, as they're after the "?" character, so not part of the URL. The MD5 is there only to get around the caching.

I think your problem is best described like this, let me know if this is not correct:

  1. you add a dataset to a collection
  2. you go to the URL of the collection, but the dataset is not there
  3. you have to reload the page with shift or in another browser to see it.

Is this correct? If so, then somewhere where I calculate the MD5 for the collection, there must be a problem.

@samfenske
Copy link
Author

Maximilian, thank you for getting back to me! I can provide more specifics. When I go to the collection URL, the dataset is there, and I can refresh/clear cache, and the dataset will still be there. I can go ahead and open the dataset, but if I now try to refresh/clear cache I get the error. It seems the link is invalid- with a dataset opened and working, if I copy and paste the URL into another tab I get the error.

@maximilianh
Copy link
Owner

maximilianh commented Nov 14, 2022 via email

@samfenske
Copy link
Author

The collection is internal so you can't access it without certain VPN, but the URL is structured like this: ...northwestern.edu/long-covid/?ds=01integrated_BAL_v5. Some of our public datasets are available at https://nupulmonary.org/resources/, which are structured the same way.

@maximilianh
Copy link
Owner

maximilianh commented Nov 14, 2022 via email

@samfenske
Copy link
Author

I only get the md5 code in the error message, it never actually presents itself in the URL. I haven't seen this happen on nupulmonary. I have gotten this error before on our internal browser, but it was a brand new project so I just made a new collection and it worked fine. This collection was functioning fine until last week something must've happened when I was removing datasets. I would remove the data folder and remove the dataset from the dataset.json file, which I've done before and it hasn't caused any issues. The web server we use is apache httpd 2.4.6.

@maximilianh
Copy link
Owner

maximilianh commented Nov 15, 2022 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants