I have some good PDF ebooks I’m willing to share, but I suspect the seller embeds some tracking data in them to link them to my account, as every time I download them from the official website they have a different hash while being visually identical. The same when checking against the copies a friend bought from the same seller. Since I dont wanna get banned, can you recommend a way to remove that stuff?
Could look into using exiftool, qpdf or pdftk, if you are comfortable with the terminal ✨
There’s dangerzone by freedom of press
I would try reprinting the PDFs and comparing the hashes afterwards. That should remove any metadata in the headers as new headers are created.
That wouldn’t work for something like Pathfinder PDFs from the Paizo website. They add a text watermark with the name and email associated with your account on their site to each page of the document. It’s not metadata, it’s actual data
Why would the checksum differ between downloads if there was a watermark with user identifiable data
Just checked one of my Paizo pdfs and in addition to my account name and email address it also has the datetime that I downloaded the pdf written in the watermark. Presumably because they append the file creation time when the pdf is being signed
Wow… The amount of information already being shared here is outstanding! Keep on rowing/patching mates
Maybe print the book via print to pdf and check again.