Парсер html в pdf

Saved searches

Use saved searches to filter your results more quickly

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

HTML to PDF converter for PHP

License

dompdf/dompdf

This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?

Sign In Required

Please sign in to use Codespaces.

Launching GitHub Desktop

If nothing happens, download GitHub Desktop and try again.

Launching GitHub Desktop

If nothing happens, download GitHub Desktop and try again.

Читайте также:  Php ajax send html

Launching Xcode

If nothing happens, download Xcode and try again.

Launching Visual Studio Code

Your codespace will open once ready.

There was a problem preparing your codespace, please try again.

Latest commit

. since these should be represented in the text with the replacement character. Excludes non-printable characters.

Git stats

Files

Failed to load latest commit information.

README.md

Latest Release Total Downloads License

Dompdf is an HTML to PDF converter

At its heart, dompdf is (mostly) a CSS 2.1 compliant HTML layout and rendering engine written in PHP. It is a style-driven renderer: it will download and read external stylesheets, inline style tags, and the style attributes of individual HTML elements. It also supports most presentational HTML attributes.

This document applies to the latest stable code which may not reflect the current release. For released code please navigate to the appropriate tag.

Check out the demo and ask any question on StackOverflow or in Discussions.

  • Handles most CSS 2.1 and a few CSS3 properties, including @import, @media & @page rules
  • Supports most presentational HTML 4.0 attributes
  • Supports external stylesheets, either local or through http/ftp (via fopen-wrappers)
  • Supports complex tables, including row & column spans, separate & collapsed border models, individual cell styling
  • Image support (gif, png (8, 24 and 32 bit with alpha channel), bmp & jpeg)
  • No dependencies on external PDF libraries, thanks to the R&OS PDF class
  • Inline PHP support
  • Basic SVG support (see «Limitations» below)
  • PHP version 7.1 or higher
  • DOM extension
  • MBString extension
  • php-font-lib
  • php-svg-lib

Note that some required dependencies may have further dependencies (notably php-svg-lib requires sabberworm/php-css-parser).

  • OPcache (OPcache, XCache, APC, etc.): improves performance
  • GD (for image processing)
  • IMagick or GMagick extension: improves image processing performance

About Fonts & Character Encoding

PDF documents internally support the following fonts: Helvetica, Times-Roman, Courier, Zapf-Dingbats, & Symbol. These fonts only support Windows ANSI encoding. In order for a PDF to display characters that are not available in Windows ANSI, you must supply an external font. Dompdf will embed any referenced font in the PDF so long as it has been pre-loaded or is accessible to dompdf and reference in CSS @font-face rules. See the font overview for more information on how to use fonts.

The DejaVu TrueType fonts have been pre-installed to give dompdf decent Unicode character coverage by default. To use the DejaVu fonts reference the font in your stylesheet, e.g. body < font-family: DejaVu Sans; >(for DejaVu Sans). The following DejaVu 2.34 fonts are available: DejaVu Sans, DejaVu Serif, and DejaVu Sans Mono.

To install with Composer, simply require the latest version of this package.

composer require dompdf/dompdf

Make sure that the autoload file from Composer is loaded.

// somewhere early in your project's loading, require the Composer autoloader // see: http://getcomposer.org/doc/00-intro.md require 'vendor/autoload.php';

Download a packaged archive of dompdf and extract it into the directory where dompdf will reside

  • You can download stable copies of dompdf from https://github.com/dompdf/dompdf/releases
  • Or download a nightly (the latest, unreleased code) from http://eclecticgeek.com/dompdf

Use the packaged release autoloader to load dompdf, libraries, and helper functions in your PHP:

// include autoloader require_once 'dompdf/autoload.inc.php';

Note: packaged releases are named according using semantic versioning (dompdf_MAJOR-MINOR-PATCH.zip). So the 1.0.0 release would be dompdf_1-0-0.zip. This is the only download that includes the autoloader for Dompdf and all its dependencies.

From the command line, switch to the directory where dompdf will reside and run the following commands:

git clone https://github.com/dompdf/dompdf.git cd dompdf/lib git clone https://github.com/PhenX/php-font-lib.git php-font-lib cd php-font-lib git checkout 0.5.1 cd .. git clone https://github.com/PhenX/php-svg-lib.git php-svg-lib cd php-svg-lib git checkout v0.3.2 cd .. git clone https://github.com/sabberworm/PHP-CSS-Parser.git php-css-parser cd php-css-parser git checkout 8.1.0

Require dompdf and it’s dependencies in your PHP. For details see the autoloader in the utils project.

Just pass your HTML in to dompdf and stream the output:

// reference the Dompdf namespace use Dompdf\Dompdf; // instantiate and use the dompdf class $dompdf = new Dompdf(); $dompdf->loadHtml('hello world'); // (Optional) Setup the paper size and orientation $dompdf->setPaper('A4', 'landscape'); // Render the HTML as PDF $dompdf->render(); // Output the generated PDF to Browser $dompdf->stream();

Set options during dompdf instantiation:

use Dompdf\Dompdf; use Dompdf\Options; $options = new Options(); $options->set('defaultFont', 'Courier'); $dompdf = new Dompdf($options);
use Dompdf\Dompdf; $dompdf = new Dompdf(); $options = $dompdf->getOptions(); $options->setDefaultFont('Courier'); $dompdf->setOptions($options);

See Dompdf\Options for a list of available options.

Resource Reference Requirements

In order to protect potentially sensitive information Dompdf imposes restrictions on files referenced from the local file system or the web.

Files accessed through web-based protocols have the following requirements:

  • The Dompdf option «isRemoteEnabled» must be set to «true»
  • PHP must either have the curl extension enabled or the allow_url_fopen setting set to true

Files accessed through the local file system have the following requirement:

  • Table cells are not pageable, meaning a table row must fit on a single page.
  • Elements are rendered on the active page when they are parsed.
  • Embedding «raw» SVG’s ( ) isn’t working yet, you need to either link to an external SVG file, or use a DataURI like this:

If you find this project useful, please consider making a donation. Any funds donated will be used to help further development on this project.)

About

HTML to PDF converter for PHP

Источник

Parse HTML

The HTML parser is a free online application that allows to extract images and text from HTML documents. Why do you need that? For example, you have been sent a photo album as a HTML document, and you need to extract all the photos in their original format. The HTML parser application can help to resolve this task. Just open the application page, select the source document, and click the ‘Parse’ button. Your document will be sent to the server. In a while, you will receive an archive containing all the extracted data from your HTML document. This application can process files of the following formats: PDF, DOC (Word 2003), DOCX (Word 2007), RTF, PPT (PowerPoint 2003), PPTX (PowerPoint 2007), XLS (Excel 2003), XLSX (Excel 2007), EPUB, HTML, MHTML. Parsing files can be done online via any popular browser, in any popular operating system: Windows, macOS, Linux, Android, iOS, etc. No desktop software installation is required. It’s powerful, modern, fast, flexible, easy to use, and completely free. Important!We adhere to the privacy policy. Your documents are processed by the Aspose API. We do not use your documents or pass them on to third parties. Your documents are stored in our secure storage for 24 hours, and then automatically deleted (download links will stop working after this period). To delete your documents immediately, click the ‘Delete’ button in the window for the result downloading.

  • Simple way to parse HTML document and extract text and image data
  • Extract from multiple HTML files
  • Parse HTML documents
  • Instantly download or send output file as email attachment

How to parse HTML file

  • 1 Open your browser in PDF free application web site and go to Parser tool.
  • 2 Click inside the file drop area to upload or drag & drop a file.
  • 3 Click on «PARSE» button, file will be automatically uploaded to parse.
  • 4 Download link will be available instantly after parse.
  • 5 You can also send a link of parsed files to your email address.
  • 6 Note that file will be deleted from our servers after 24 hours and download links will stop working after this time period.

FAQ

❓ How can I parse HTML document?

First, you need to add a file for parsing: drag & drop or click inside the white area for choose a file. Then click the ‘PARSE’ button. When document parsing is completed, you can download your result files.

⏱️ How long does it take to parse HTML?

🛡️ Is it safe to parse HTML using this free app?

Of course! The download link of result files will be available instantly after parsing. We delete uploaded files after 24 hours and the download links will stop working after this time period. No one has access to your files. File parsing is absolutely safe.

💻 Can I parse HTML on Mac OS, Android or Linux?

🌐 What browser should I use to parse HTML?

Fast and Easy Parser

Upload your document and click on “PARSE” button. You will get the text file or zip-file with text and images as soon as the Parsing is performed.

Parse from Anywhere

It works from all platforms including Windows, Mac, Android and iOS. All files are processed on our servers. No plugin or software installation required for you.

Parse Quality

All files are processed using Aspose APIs, which are being used by many Fortune 100 companies across 114 countries.

Источник

Оцените статью