Replies: 2 comments
-
Duplicate of #277 Biggest hurdle is that most pdf contain really crappy metadata. |
Beta Was this translation helpful? Give feedback.
0 replies
-
Sorry for the duplicate, I didn't go far enough in the list of issue. My understanding is that the crappy metadata come from random document (User manual, Spec). I think that Komga is intended to read book and generally, author of those book try to make those information more clean to keep a signature in it (at least for the Title, Author and ModDate). I think it can be good to parse those information, and if they are bad we can always edit them inside Komga. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Describe your suggested feature
PDF contains metadata information like Title, Author, CreationDate and tag.
It could be useful to parse them when analyzing the file and populate the book information with theses.
Other details
I see that the code is using org.apache.pdfbox.pdmodel.PDDocument to parse PDF document. There is a method getDocumentInformation() that can retrieve this information.
Acknowledgements
Beta Was this translation helpful? Give feedback.
All reactions