Blogs are for ranting, right? Right then. I HATE PDFs. I know a lot of people who love them, but I do not understand it. And everybody uses them. All the scholarly journals use them. If you want an electronic copy of a journal paper, it’s almost certainly going to be a PDF, and I hate them, and I’ve now frustrated myself for about three hours just trying to DO WHAT I NEED TO DO with it.
Some PDFs are all right. I’ve never created one (cf. the fact that I HATE them), so I don’t know whether the differences are in the way they’re created, or what. All I need to do is freakin’ highlight something. That’s it. So I know upon rereading that that was a part I want to reference or quote or whatever. Some PDFs let me do this. Others don’t. Is this some sort of stupid rights management thing, because the document’s creator thought it would somehow impinge upon their rights if I HIGHLIGHTED SOMETHING? Honestly. Note that I can’t do this at all in Adobe Reader, because even though the help says I should be able to turn on the commenting bar and highlight and make notes, I cannot. It is not there. I’m using Foxit Reader now, which does let me annotate some PDFs (though without paying $40 for the premium edition I can’t get rid of the little evaluation stamp they put on the document, but that doesn’t matter to me). Except the document I just opened? Can’t select text, can’t do ANYTHING. And the weirdest thing is, it looks like the rights management settings are the same on both the document I can edit and the one I can’t. So I had a whole ‘nother paragraph blaming DRM, but maybe that’s not it. Now I’m thoroughly confused.
The only thing I can guess, looking at the two documents, is that one was created from a text file, the other from a scan or photo file. Is that possible? If that’s the case, I guess I might have to be a little less mad. But it just turns my question into: how can I take a PDF and run it through an OCR or something so that I can actually work with it as text, when I didn’t create the PDF? I can’t believe that there’s not some way to make PDFs like this USEFUL short of e-mailing the journal and asking them to reconvert everything using a text format as the basis. Any ideas? Anyone who uses PDFs regularly?












Add New Comment
Viewing 4 Comments
Thanks. Your comment is awaiting approval by a moderator.
Do you already have an account? Log in and claim this comment.
Do you already have an account? Log in and claim this comment.
And you are correct that some PDF's are actually pages that have been photocopied. OCR would be possible but might have restrictions and also might require more proof reading since OCR is not always foolproof. Thus is would be a bit more expensive to create it I suppose.
You might want to see if you can get a copy of Acrobat Writer at a student rate. Although I think that some documents are locked up so that even this will have only limited capabilities. You should go to a computer store or maybe your schools CS department and see if you can get definitive answers for these things. Good luck!
Do you already have an account? Log in and claim this comment.
Thanks for the point about photocopies and OCR. I guess it probably goes on a journal-by-journal basis whether they use text-based or photo-based PDFs. And I have a feeling my school wouldn't understand why I don't just print things out like everyone else here does.
Do you already have an account? Log in and claim this comment.
Do you already have an account? Log in and claim this comment.
Add New Comment
Trackbacks