Method GetText
- Namespace: BitMiracle.Docotic.Pdf
- Assembly: BitMiracle.Docotic.Pdf.dll
GetText()
Retrieves all text drawn on all pages of the document in plain text format.
public string GetText()
Returns
- string
All text drawn on all pages of the document in plain text format.
Remarks
Bidirectional and right-to-left text is returned in logical order.
Unicode code points from the Arabic and Hebrew presentation forms are normalized to Normalization Form KC (NFKC).
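To illustrate what Normalization Form KC does to presentation forms, here is a minimal sketch that uses only the standard .NET `string.Normalize` API, not Docotic.Pdf itself. The class name `NfkcDemo` and its helper methods are hypothetical, and the two sample characters are chosen for illustration only. The library's internal processing may differ in detail.

```csharp
using System;
using System.Linq;
using System.Text;

// Hypothetical helper class; it is not part of Docotic.Pdf.
public static class NfkcDemo
{
    // Normalizes a string to Unicode Normalization Form KC.
    public static string ToNfkc(string s) => s.Normalize(NormalizationForm.FormKC);

    // Formats each UTF-16 code unit as U+XXXX.
    // This is sufficient here because all sample characters are in the BMP.
    public static string CodePoints(string s) =>
        string.Join(" ", s.Select(c => $"U+{(int)c:X4}"));

    public static void Main()
    {
        // U+FE8D ARABIC LETTER ALEF ISOLATED FORM maps to the base letter ALEF.
        Console.WriteLine(CodePoints(ToNfkc("\uFE8D"))); // U+0627

        // U+FB4F HEBREW LIGATURE ALEF LAMED is split into ALEF and LAMED.
        Console.WriteLine(CodePoints(ToNfkc("\uFB4F"))); // U+05D0 U+05DC
    }
}
```

In practice this means that the returned string contains base letters that can be searched and compared directly, rather than position-specific glyph code points.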
Read the Convert PDF to plain text article for an example of using this method.
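A minimal sketch of calling this overload might look as follows. It assumes that the method is invoked on a `PdfDocument` instance opened from a file and that `PdfDocument` is disposable. The file name `input.pdf` and the class name `ExtractPlainText` are placeholders, not names from the library documentation.

```csharp
using System;
using BitMiracle.Docotic.Pdf;

public static class ExtractPlainText
{
    public static void Main()
    {
        // "input.pdf" is a placeholder path; replace it with a real document.
        // Assumption: PdfDocument can be opened from a file name and disposed with "using".
        using (var pdf = new PdfDocument("input.pdf"))
        {
            // Text from all pages is returned as a single plain-text string.
            string text = pdf.GetText();
            Console.WriteLine(text);
        }
    }
}
```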
GetText(PdfTextExtractionOptions)
Retrieves all text drawn on all pages of the document according to the specified options.
public string GetText(PdfTextExtractionOptions options)
Parameters
options
PdfTextExtractionOptions
The text extraction options.
Returns
- string
All text drawn on all pages of the document according to the specified options.
Remarks
Bidirectional and right-to-left text is returned in logical order.
Unicode code points from the Arabic and Hebrew presentation forms are normalized to Normalization Form KC (NFKC).
Read the Extract text from PDF in C# and VB.NET article to learn how to use options when extracting text.
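The sketch below shows one way this overload might be called. The property names `WithFormatting` and `SkipInvisibleText` are assumptions recalled from the library's options class and are not listed on this page, so verify them against the `PdfTextExtractionOptions` reference for your version. The file name `input.pdf` and the class name `ExtractWithOptions` are placeholders.

```csharp
using System;
using BitMiracle.Docotic.Pdf;

public static class ExtractWithOptions
{
    public static void Main()
    {
        // Assumption: these property names exist on PdfTextExtractionOptions;
        // check the class reference before relying on them.
        var options = new PdfTextExtractionOptions
        {
            WithFormatting = false,   // assumed: plain text without layout preservation
            SkipInvisibleText = true  // assumed: ignore text that is not visible on the page
        };

        // "input.pdf" is a placeholder path; replace it with a real document.
        using (var pdf = new PdfDocument("input.pdf"))
        {
            string text = pdf.GetText(options);
            Console.WriteLine(text);
        }
    }
}
```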