Class ReadDocumentsFromPdfChain
java.lang.Object
com.github.hakenadu.javalangchains.chains.data.reader.ReadDocumentsFromPdfChainBase<Path>
com.github.hakenadu.javalangchains.chains.data.reader.ReadDocumentsFromPdfChain
public class ReadDocumentsFromPdfChain extends ReadDocumentsFromPdfChainBase<Path>
Utilizes Apache PDFBox to read documents from a PDF or a folder of PDFs
Nested Class Summary
Nested classes/interfaces inherited from class com.github.hakenadu.javalangchains.chains.data.reader.ReadDocumentsFromPdfChainBase
ReadDocumentsFromPdfChainBase.PdDocumentWrapper, ReadDocumentsFromPdfChainBase.PdfReadMode
Constructor Summary
Constructors
Constructor    Description
ReadDocumentsFromPdfChain()
    creates a ReadDocumentsFromPdfChain which reads the whole pdf as a document
ReadDocumentsFromPdfChain(ReadDocumentsFromPdfChainBase.PdfReadMode readMode)
    creates a ReadDocumentsFromPdfChain
ReadDocumentsFromPdfChain(ReadDocumentsFromPdfChainBase.PdfReadMode readMode, boolean parallel)
    creates a ReadDocumentsFromPdfChain
Method Summary
Modifier and Type    Method    Description
protected Stream<ReadDocumentsFromPdfChainBase.PdDocumentWrapper>    loadPdDocuments(Path input)
    load a pdf from an input instance
Methods inherited from class com.github.hakenadu.javalangchains.chains.data.reader.ReadDocumentsFromPdfChainBase
run
Constructor Details
ReadDocumentsFromPdfChain
public ReadDocumentsFromPdfChain(ReadDocumentsFromPdfChainBase.PdfReadMode readMode, boolean parallel)
creates a ReadDocumentsFromPdfChain
Parameters:
readMode - ReadDocumentsFromPdfChainBase.readMode
parallel - ReadDocumentsFromPdfChainBase.parallel
ReadDocumentsFromPdfChain
public ReadDocumentsFromPdfChain(ReadDocumentsFromPdfChainBase.PdfReadMode readMode)
creates a ReadDocumentsFromPdfChain
Parameters:
readMode - ReadDocumentsFromPdfChainBase.readMode
ReadDocumentsFromPdfChain
public ReadDocumentsFromPdfChain()
creates a ReadDocumentsFromPdfChain which reads the whole pdf as a document
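The three constructors above differ only in how much they let the caller configure. The following is a minimal sketch of that idea, not the library's code: ReadModeSketch and its Mode enum are hypothetical stand-ins, the PAGES mode (one document per page) is an assumption about what a non-default read mode could do, and sequential processing as the default for the parallel flag is likewise assumed. A PDF is modelled here as a plain list of page texts so that the example runs without PDFBox.

```java
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

/** Hypothetical stand-in for illustration; not part of java-langchains. */
class ReadModeSketch {

    /** Assumed read modes; the real PdfReadMode constants may differ. */
    enum Mode { WHOLE, PAGES }

    private final Mode mode;
    private final boolean parallel;

    ReadModeSketch(Mode mode, boolean parallel) {
        this.mode = mode;
        this.parallel = parallel;
    }

    ReadModeSketch(Mode mode) {
        this(mode, false); // assumption: sequential unless requested otherwise
    }

    ReadModeSketch() {
        this(Mode.WHOLE); // mirrors "reads the whole pdf as a document"
    }

    /** Turns the page texts of one PDF into document contents. */
    List<String> toDocuments(List<String> pageTexts) {
        if (mode == Mode.WHOLE) {
            // One document holding every page, separated by newlines
            return List.of(String.join("\n", pageTexts));
        }
        // One document per page; the flag only selects the stream flavour
        Stream<String> pages = parallel ? pageTexts.parallelStream() : pageTexts.stream();
        return pages.map(String::trim).collect(Collectors.toList());
    }
}
```

In this sketch, new ReadModeSketch().toDocuments(pages) yields a single entry, while new ReadModeSketch(ReadModeSketch.Mode.PAGES, true).toDocuments(pages) yields one entry per page in the original page order.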
Method Details
loadPdDocuments
protected Stream<ReadDocumentsFromPdfChainBase.PdDocumentWrapper> loadPdDocuments(Path input) throws IOException
Description copied from class: ReadDocumentsFromPdfChainBase
load a pdf from an input instance
Specified by:
loadPdDocuments in class ReadDocumentsFromPdfChainBase<Path>
Parameters:
input - input instance
Returns:
PDDocument
Throws:
IOException - on error loading pdf
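Since the class reads "a PDF or a folder of PDFs" from a Path, the input first has to be resolved to the individual PDF files. The sketch below shows one way this could be done with java.nio only. It is an illustration under assumptions, not the library's implementation: PdfPathSketch and resolvePdfPaths are hypothetical names, the folder is scanned non-recursively, and files are matched by a case-insensitive ".pdf" extension.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;
import java.util.Locale;
import java.util.stream.Collectors;
import java.util.stream.Stream;

/** Hypothetical helper for illustration; not part of java-langchains. */
class PdfPathSketch {

    /**
     * Returns the PDF files denoted by input: the PDFs directly inside a
     * folder, or the input itself when it is not a folder.
     */
    static List<Path> resolvePdfPaths(Path input) throws IOException {
        if (Files.isDirectory(input)) {
            // Files.list holds an open directory handle, so close it when done
            try (Stream<Path> children = Files.list(input)) {
                return children
                        .filter(Files::isRegularFile)
                        .filter(PdfPathSketch::hasPdfExtension)
                        .sorted()
                        .collect(Collectors.toList());
            }
        }
        return List.of(input);
    }

    /** Assumption: a file counts as a PDF if its name ends with ".pdf" in any letter case. */
    private static boolean hasPdfExtension(Path path) {
        return path.getFileName().toString().toLowerCase(Locale.ROOT).endsWith(".pdf");
    }
}
```

Each resolved path could then be opened with PDFBox and wrapped for further processing; collecting into a list before returning keeps the directory handle from leaking out of the method.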