Table of Contents

Class PdfTextExtractionOptions

Namespace
BitMiracle.Docotic.Pdf
Assembly
BitMiracle.Docotic.Pdf.dll

The text extraction options.

public class PdfTextExtractionOptions
Inheritance
PdfTextExtractionOptions
Inherited Members

Remarks

The Extract text from PDF in C# and VB.NET article shows how to use these options while extracting text from PDF.

Check out the other articles and code samples on the PDF to text page.

Constructors

PdfTextExtractionOptions()

Initializes a new instance of the PdfTextExtractionOptions class.

Properties

NormalizePresentationForms

Gets or sets a value indicating whether to normalize codepoints from Alphabetic and Arabic Presentation Forms.

ReadingDirection

Gets or sets the main direction of text for the inverse Bidi algorithm in this PdfTextExtractionOptions.

Rectangle

Gets or sets the rectangle of interest.

SkipInvisibleText

Gets or sets a value indicating whether to extract hidden text.

UnmappedCharacterCodeHandler

Gets or sets the handler for character codes that cannot be mapped to Unicode using data from the corresponding PDF font.

UseBidi

Gets or sets a value indicating whether to use the inverse Bidi algorithm for reordering right-to-left and bidirectional text.

WithFormatting

Gets or sets a value indicating whether to extract text formatted as seen in a PDF viewer.