Apache PDFBox · Schema

TextExtractionResult

TextExtractionResult schema from Apache PDFBox

Document ProcessingJavaPDFText ExtractionApacheOpen Source

Properties

Name Type Description
documentId string
text string
pageCount integer
wordCount integer
View JSON Schema on GitHub