I used to be able to download the available pages from books on Google Books with pysheng(3) and/or getxbook, but neither of them seems to work any more. In Firefox I can go into “More Information>Media Info” and download the page images one by one, but this is slow and cumbersome. Any recommendations for a replacement?
If you’re up on your bash coding skills: in the Firefox debugger, you can find the URL to the page images and see if there’s a usable pattern in the URLs. If there is, you could script it in bash and repeatedly call curl to download the images.

I’m afraid that’s beyond my current skill set, but thank you anyway! 🙂
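
For anyone who wants to try the curl approach suggested above, here is a rough sketch of what such a script might look like. It assumes the page image URLs differ only in a page number. The `{page}` token, the function names `page_url` and `download_pages`, the `.png` extension, and the example.com address are all made up for illustration; the real URL pattern (if there is a usable one) has to be read out of the browser's developer tools, and the images may well be in a different format.

```bash
#!/usr/bin/env bash
# Sketch only: the URL template is a placeholder you supply yourself,
# not a known Google Books endpoint.

# Print the URL for one page by replacing the first {page} token
# in the template with the page number.
page_url() {
  local template="$1" page="$2" token='{page}'
  case "$template" in
    *"$token"*) ;;
    *) echo "template has no {page} token" >&2; return 1 ;;
  esac
  # Text before the token + page number + text after the token.
  printf '%s%s%s\n' "${template%%"$token"*}" "$page" "${template#*"$token"}"
}

# Download pages FIRST..LAST into OUTDIR (default: ./pages).
download_pages() {
  local template="$1" first="$2" last="$3" outdir="${4:-pages}"
  local page url
  mkdir -p "$outdir"
  page="$first"
  while [ "$page" -le "$last" ]; do
    url=$(page_url "$template" "$page") || return 1
    # -f: treat HTTP errors as failures, -sS: quiet but report errors,
    # -L: follow redirects, -o: write to a zero-padded file name.
    # The .png extension is an assumption about the image format.
    curl -fsSL -o "$outdir/page-$(printf '%04d' "$page").png" "$url" \
      || echo "failed: $url" >&2
    sleep 1  # pause between requests
    page=$((page + 1))
  done
}
```

After loading the functions (for example with `source` on the saved file), a call might look like `download_pages 'https://example.com/images?pg={page}' 1 20 pages`, with the quoted template replaced by whatever pattern you actually find. Keeping the URL in single quotes stops the shell from interpreting characters such as `&` and `?`.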