Table of Contents

Method GetText

Namespace
BitMiracle.Docotic.Pdf
Assembly
BitMiracle.Docotic.Pdf.dll

GetText()

Retrieves all text drawn on all pages of the document in plain text format.

public string GetText()

Returns

string

All text drawn on all pages of the document in plain text format.

Remarks

Bidirectional and right-to-left text is returned according to the logical order.

Unicode code points from Arabic and Hebrew presentation forms are normalized to the Normalization Form KC.

Read the Convert PDF to plain text for an example of using this method.

GetText(PdfTextExtractionOptions)

Retrieves all text drawn on all pages of the document according to the specified options.

public string GetText(PdfTextExtractionOptions options)

Parameters

options PdfTextExtractionOptions

The text extraction options.

Returns

string

All text drawn on all pages of the document according to the specified options.

Remarks

Bidirectional and right-to-left text is returned according to the logical order.

Unicode code points from Arabic and Hebrew presentation forms are normalized to the Normalization Form KC.

Read the Extract text from PDF in C# and VB.NET article to know how to use options when extracting text.