Class PdfTextExtractionOptions
- Namespace
- BitMiracle.Docotic.Pdf
- Assembly
- BitMiracle.Docotic.Pdf.dll
The text extraction options.
public class PdfTextExtractionOptions
- Inheritance
-
PdfTextExtractionOptions
- Inherited Members
Remarks
The Extract text from PDF in C# and VB.NET article shows how to use these options while extracting text from PDF.
Check out the other articles and code samples on the PDF to text page.
Constructors
- PdfTextExtractionOptions()
Initializes a new instance of the PdfTextExtractionOptions class.
Properties
- NormalizePresentationForms
Gets or sets a value indicating whether to normalize codepoints from Alphabetic and Arabic Presentation Forms.
- ReadingDirection
Gets or sets the main direction of text for the inverse Bidi algorithm in this PdfTextExtractionOptions.
- Rectangle
Gets or sets the rectangle of interest.
- SkipInvisibleText
Gets or sets a value indicating whether to extract hidden text.
- UnmappedCharacterCodeHandler
Gets or sets the handler for character codes that cannot be mapped to Unicode using data from the corresponding PDF font.
- UseBidi
Gets or sets a value indicating whether to use the inverse Bidi algorithm for reordering right-to-left and bidirectional text.
- WithFormatting
Gets or sets a value indicating whether to extract text formatted as seen in a PDF viewer.