-
Notifications
You must be signed in to change notification settings - Fork 2
Why Firefox
wget is an excellent & lightweight tool that does exactly what we need - collecting text from a page, but it doesn't follow hidden redirects embedded in Javascript.
We need a sneak attack that will let us scrape the page and can parse these "enhancements" in order to snag the text needed, without ending up with blank postings.
As of version 1.1.0 of Resgen, we're sending a headless browser after the posting, to simulate a user viewing the page.
Debian users: You don't have to rid Firefox ESR in place of 56+; simply install a new Firefox version and symlink it to your /usr/bin/firefox path so Selenium can see it. (ESR is /usr/bin/firefox-esr
so the two can coexist, but firefox
should point at your new install.)
Steps (Debian users / folks using ESR only; everyone else can ignore this section):
- Download Firefox
tar -jxvf firefox*tar.bz2
- I downloaded Nightly & renamed the directory, just to be extra sure of no conflicts
mv firefox firefox-nightly
- Move to /opt:
mv firefox-nightly /opt/firefox-nightly
- Symlink it:
ln -s /opt/firefox-nightly/firefox /usr/bin/firefox
-
ls -l | grep firefox
to confirm; good to go!
You don't actually have to use it for anything, other than letting Resgen use it as a dependency. This requirement is due to change as soon as ESR adds the headless feature.