Table of Contents

Method GetTextWithFormatting

Namespace
BitMiracle.Docotic.Pdf
Assembly
BitMiracle.Docotic.Pdf.dll

GetTextWithFormatting()

Retrieves all text drawn on the page formatted as seen in a PDF viewer.

public string GetTextWithFormatting()

Returns

string

All text drawn on the page formatted as seen in a PDF viewer.

Remarks

Bidirectional and right-to-left text is returned according to the logical order.

Unicode code points from Arabic and Hebrew presentation forms are normalized to the Normalization Form KC.

Read the Extract text from PDF in C# and VB.NET article and the Split PDF by condition section for more information about text extraction.