1
0
Fork 0
No description
Find a file
2025-02-04 20:45:45 +00:00
.gitea/workflows [mod] docs 2025-02-04 16:09:34 +00:00
.github/workflows [mod] update github 2025-02-03 21:56:56 +00:00
.vscode [add] image save 2024-04-11 12:45:23 +01:00
docs [mod] gitignore xqdoc 2025-01-26 16:36:02 +00:00
jars [fix] ignore 2025-02-03 14:34:48 +00:00
samples.pdf [add] github workflow 2025-01-03 16:03:13 +00:00
scripts [mod] lost 2025-02-04 20:45:45 +00:00
src [mod] lost 2025-02-04 20:45:45 +00:00
.gitignore [mod] git ignore 2025-02-03 14:31:30 +00:00
.xqdoca [mod] update to pdfbox 3.0.3 2025-01-25 22:19:46 +00:00
LICENSE [add] notebook 2024-02-28 23:11:09 +00:00
package.json [mod] lost 2025-02-04 20:45:45 +00:00
package.xml [mod] lost 2025-02-04 20:45:45 +00:00
README.md [mod] lost 2025-02-04 20:45:45 +00:00

Pdfbox

A BaseX interface for Pdfbox version 3. It is packaged using the Expath format, and is tested against BaseX 10.7 and 11.7.

  • The Pdfbox 3 FAQ may be useful.

Features

  • read PDF page count.
  • read any PDF outline and return as map(s) or XML.
  • read pagelabels.
  • read page text.
  • save pdf page range to a new pdf.
  • save pdf page as an image.

Build

  • scripts/make-xar.xq packages the required jars and xqm files to a xar file in the dist folder.

Action support

The workflow ci-basex.yaml builds and tests the package. This can be used as an action on github, or on a local gitea installation.