Problem changing font properties with DotImage's RecoStar Engine [DotImage]
Posted 2019-05-10
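To create searchable documents, we are using the RecoStar OCR engine, which supports Turkish characters. However, when we call its Translate function, the output contains garbled characters in place of the Turkish ones. How can I change the font properties it uses?
The code is as follows: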
// Requires: using System; using System.IO; using System.Windows.Forms;

// Decode every page of the input PDF into an image collection at 200 DPI.
Atalasoft.Imaging.Codec.Pdf.PdfDecoder dec = new Atalasoft.Imaging.Codec.Pdf.PdfDecoder();
dec.Resolution = 200;
Atalasoft.Imaging.ImageCollection coll = new Atalasoft.Imaging.ImageCollection();
using (FileStream fs = new FileStream(inputFile, FileMode.Open))
{
    int frameCount = dec.GetFrameCount(fs);
    for (int i = 0; i < frameCount; i++)
    {
        Atalasoft.Imaging.AtalaImage aim = dec.Read(fs, i, null);
        coll.Add(aim);
    }
} // the stream is closed here even if decoding throws

// Set up the RecoStar engine; CurrentCulture is tr-TR on this machine.
Atalasoft.Ocr.RecoStarLoader loader = new Atalasoft.Ocr.RecoStarLoader();
Atalasoft.Ocr.RecoStar.RecoStarEngine eng = new Atalasoft.Ocr.RecoStar.RecoStarEngine(loader);
eng.RecognitionCulture = System.Globalization.CultureInfo.CurrentCulture;

// Translate the recognized text to a text-only PDF.
Atalasoft.Ocr.PdfTranslator trans = new Atalasoft.Ocr.PdfTranslator();
trans.OutputType = Atalasoft.Ocr.PdfTranslatorOutputType.TextOnly;
Atalasoft.Imaging.ImageCollectionImageSource source = new Atalasoft.Imaging.ImageCollectionImageSource(coll);
try
{
    eng.Initialize();
    eng.Translate(source, "application/pdf",
        Path.Combine(Application.StartupPath, "out_search.pdf"), trans);
}
catch (Exception ex)
{
    MessageBox.Show(ex.Message);
}
finally
{
    // Release the engine whether or not translation succeeded.
    eng.ShutDown();
    eng.Dispose();
}