Pdf To Text是一個PDF文檔轉換器,它可以在任何.NET程序中從PDF文檔中提取TEXT文本。而且它可以非常方便的整合到現有的.NET應用程序中,而無需安裝。
The ExpertPDF Pdf to Text Converter can be used in any type of .NET application to extract the text from a PDF document. The integration with existing .NET applications is extremely easy and no installation is necessary in order to run the converter.
通過下載可直接獲得一個.net2.0的程序集和一個可直接使用的控制臺示例程序。并且Samples文件夾中的示例程序均包含了完整的C#源碼。所提供的示例程序均可在Visual Studio 2005或Visual Studio 2008上進行調試。使用ExpertPDF執行轉換后,你可以將所提取的文本作為.NET字符串對象,以便進行搜索或將其存入本地文件中。
特征:
- .NET2.0開發庫文件和C#的示例程序
- 可從PDF文件或PDF流中提取text
- 提取的text可保持PDF原有頁面布局
- 可按PDF閱讀順序提取text
- 可只提取頁面中的指定部分
- 可將提取的text存儲為html格式并添加元標簽描述
- 以html元標簽形式添加從PDF獲得的標題,關鍵字,作者等關鍵字
- 可為提取后的text添加特殊字符標記分頁符
- 支持從密碼保護的PDF文檔中提取text
The downloadable archive contains the assembly for .NET 2.0 and a ready-to-use sample console application. The full C# source code for the sample application is available in the Samples folder. The sample application can be built with Visual Studio 2005 or Visual Studio 2008. The result of conversion is a .NET String object that you can use for example in search operations or save into a file on disk.
Features:
- .NET 2.0 development library and C# samples
- Extract text from PDF stream or a PDF file
- Extract text preserving the original PDF layout
- Extract text in PDF reading order
- Specify the range of pages to be extracted
- Save the extracted text in a HTML format and add description meta tags
- Add the title, keywords, author from PDF description in HTML meta tags
- Mark the page breaks in the extracted text with a special character
- Extract text from password protected PDF documents