splitter
pdf_split_iter_fast(file_bytes, max_page_count)
Quickly splits a PDF into batches of at most max_page_count pages each.
Source code in docprompt/utils/splitter.py
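To illustrate what "batches of at most max_page_count pages" means, here is a minimal sketch of the page-range arithmetic only. It does not call docprompt or read a PDF; the helper name `batch_page_ranges` and the half-open `(start, stop)` convention are assumptions made for this example, not part of the library.

```python
from typing import Iterator, Tuple


def batch_page_ranges(page_count: int, max_page_count: int) -> Iterator[Tuple[int, int]]:
    """Yield half-open (start, stop) page ranges of at most max_page_count pages.

    Hypothetical helper for illustration; not part of docprompt.
    """
    if max_page_count < 1:
        raise ValueError("max_page_count must be at least 1")
    # Step through the document in strides of max_page_count pages.
    for start in range(0, page_count, max_page_count):
        # The last batch may be shorter than max_page_count.
        yield start, min(start + max_page_count, page_count)


# A 10-page document with max_page_count=4 gives two full batches and one short one.
print(list(batch_page_ranges(page_count=10, max_page_count=4)))
# → [(0, 4), (4, 8), (8, 10)]
```

Only the final batch can be smaller than max_page_count; every earlier batch is full.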
pdf_split_iter_with_max_bytes(file_bytes, max_page_count, max_bytes)
Splits a PDF into batches of at most max_page_count pages and at most max_bytes bytes each.
Source code in docprompt/utils/splitter.py
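One way to picture batching under both a page limit and a byte limit is a greedy pass over per-page sizes. The sketch below is an assumption-laden illustration, not the library's implementation: the helper `batch_by_pages_and_bytes` is hypothetical, and it treats each page as having a fixed byte size, whereas a real PDF batch shares fonts and other resources, so its serialized size is not simply the sum of its pages.

```python
from typing import Iterator, List, Sequence


def batch_by_pages_and_bytes(
    page_sizes: Sequence[int], max_page_count: int, max_bytes: int
) -> Iterator[List[int]]:
    """Greedily group page indices so each batch respects both limits.

    Hypothetical helper for illustration; not part of docprompt.
    A single page larger than max_bytes is emitted as its own batch.
    """
    if max_page_count < 1:
        raise ValueError("max_page_count must be at least 1")
    batch: List[int] = []
    batch_bytes = 0
    for index, size in enumerate(page_sizes):
        too_many_pages = len(batch) >= max_page_count
        too_many_bytes = batch_bytes + size > max_bytes
        # Close the current batch before adding a page that would break a limit.
        if batch and (too_many_pages or too_many_bytes):
            yield batch
            batch, batch_bytes = [], 0
        batch.append(index)
        batch_bytes += size
    if batch:
        yield batch


# Assumed per-page sizes in bytes, chosen only for the example.
sizes = [400, 300, 500, 100, 900]
print(list(batch_by_pages_and_bytes(sizes, max_page_count=3, max_bytes=1000)))
# → [[0, 1], [2, 3], [4]]
```

In this example the byte limit, not the page limit, closes every batch; with small pages the page limit would take over instead.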
split_pdf_to_bytes(file_bytes, *, start_page=None, stop_page=None)
Splits a PDF into a list of bytes.