-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Document menu scraping #33
Comments
Hi, you can refer to this branch for a psql version. I didn't merge it in as the way we were scraping the pdfs were rather awkward because uhs filenames weren't standardized. You'll need to modify |
Thanks! I see, it might be better to get the URL from parsing https://uci.nus.edu.sg/ohs/current-residents/students/daily-menu-2/, I'll see if I can help you with that if I have the time haha |
Just an update on this, I think they actually changed the URLs (https://uci.nus.edu.sg/ohs/current-residents/students/daily-menu-2/). I'm going to work on the parsing but please let me know if you have already found another way around it! Thanks |
Hi @qinghao1 may I know if the issue has been resolved. I was in charge of the bot alone and was not very updated with the scraper done by my teammates. |
Hi there, I think it hasn't been fixed, but this PR should provide everything you need to fix it. That's on the scraper side though, so I don't think it has anything to do with the bot itself. |
Thanks. I just tried running the scraper on the CentOS container again and the lru seems to be breaking on it without meaningful error message. I replaced it with a normal dict and it still doesn't work. May I have your input on this? |
What's the error that you're seeing? It might be the case that the OHS website is blocked. Maybe try running it locally? |
Also you'd have to install lru in pipenv |
I did pipenv install with lru inside, but the installation seems to output some stacktrace without much meaningful error message. |
I guess you could just replace it with a normal dict, I don't think it will exceed memory usage with normal use. |
Hi there, I don't think the code to scrape the OHS menu is in here and the scraper is for mongo. Could you add the code you're using here or in a separate repo?
The text was updated successfully, but these errors were encountered: