Hi! I’m trying to archive papers as soon as they appear in a scientific journal, and I’ve attempted to search for PDF links on each page using some regular web scraping.

The problem is that most of these journals will add their fancy PDF readers, and downloading the file is not as straight-forward as it seems. However, the Zotero Connector works flawlessly when you trigger the extension. Therefore, I attempted to set up a selenium instance with this extension to download the papers given a link, but I struggle to actually get the extension to trigger. I tried sending a Shift + Ctrl + S command, but that doesn’t seem to get picked up. Similarly, I can’t figure out how to call the extension from the console.

Did anyone else attempt such a workflow before? Am I doing something completely unnecessary, as there are better options available? Help a fellow sailor out. Thanks a lot in advance for your help!

@ocean@lemmy.selfhostcat.com
link
fedilink
English
5
edit-2
9d

This sounds smart and helpful but may I ask how much work will be put into this workflow and how much time will it actually save you? If I can’t be bothered to search for the new paper typically it isn’t worth my time.

If you find the answer I’d also love to know! Zotero is awesome

Piracy: ꜱᴀɪʟ ᴛʜᴇ ʜɪɢʜ ꜱᴇᴀꜱ
!piracy@lemmy.dbzer0.com
Create a post
⚓ Dedicated to the discussion of digital piracy, including ethical problems and legal advancements.

Rules • Full Version

1. Posts must be related to the discussion of digital piracy

2. Don’t request invites, trade, sell, or self-promote

3. Don’t request or link to specific pirated titles, including DMs

4. Don’t submit low-quality posts, be entitled, or harass others



Loot, Pillage, & Plunder

📜 c/Piracy Wiki (Community Edition):


💰 Please help cover server costs.

Ko-Fi Liberapay
Ko-fi Liberapay

  • 1 user online
  • 163 users / day
  • 331 users / week
  • 895 users / month
  • 3.24K users / 6 months
  • 1 subscriber
  • 3.66K Posts
  • 86.4K Comments
  • Modlog