這個外掛並未在最新的 3 個 WordPress 主要版本上進行測試。開發者可能不再對這個外掛進行維護或提供技術支援,並可能會與更新版本的 WordPress 產生使用上的相容性問題。

WP Tesseract

外掛說明

A plugin for extracting text from attached images using OCR via Tesseract.
This plugin adds a new post named for each image upload containing any recognized text characters within the file.
This text can then be edited for accuracy and used elsewhere on the site.

The OCR plugin requires a supported version of PHP with the GD extension and the following command line utility:
* Tesseract for the actual OCR
This utility must be manually installed on your server and executable by PHP.
This process, and consequently this plugin, is recommended only for advanced users.

螢幕擷圖

安裝方式

  1. Install Tesseract OCR on your server (Tesseract wiki)
  2. Search and add the plugin from WordPress, or upload a copy of the source to your /wp-content/plugins/ directory
  3. Activate the plugin through the Plugins menu in WordPress
  4. Configure the plugin through the Plugins > OCR link in the sidebar menu in WordPress

常見問題集

What is Tesseract OCR and where do I get it?

Tesseract OCR is an open source optical character recognition library that
the WordPress OCR plugin uses to extract text from images.
The library as well as installation instructions can be found at
https://github.com/tesseract-ocr/tesseract/wiki/.

How do I know if / where I have Tesseract installed on my server?

Linux:

  1. SSH into your server and type which tesseract.
  2. If Tesseract is installed and in your shell environment PATH the terminal should return a path similar to /opt/local/bin/tesseract.
  3. Place this path in the configuration of the OCR plugin through the Settings > Tesseract link in the sidebar menu in WordPress

Where is the detected text stored?

The text detected by the OCR plugin is added as a new post, named after the image file.

What is the ‘Resize percentage’ configuration option?

The OCR plugin is tailored to detect text in images with ~12pt text at 72dpi.
GD is used to upscale the temporary TIFF images fed to Tesseract as Tesseract is generally more accurate with
larger type, even if it’s been upscaled from a smaller source. If you wish to disable this option simply set this
configuration option to 100% and no resizing will occur.

What if I just want to use the plugin but not install anything?

Hosting options are available. See https://tattersoftware.com
for contact info.

How about that great banner photo?

The plugin’s banner photo is by Ekrulila from Pexels.

使用者評論

這個外掛目前沒有任何使用者評論。

參與者及開發者

以下人員參與了開源軟體〈WP Tesseract〉的開發相關工作。

參與者

將〈WP Tesseract〉外掛本地化為台灣繁體中文版

對開發相關資訊感興趣?

任何人均可瀏覽程式碼、查看 SVN 存放庫,或透過 RSS 訂閱開發記錄

變更記錄

0.1.0

Initial Release.

1.0.0

Complete rewrite: updates for PHP 7, ImageMagick replaced by GD, added language support

1.0.1

Actions for automated publication, License updates, Name bugfix

1.0.2

Clean up docs, remove legacy references

1.0.3

Add uploaded file as featured image, bump PHP requirement for EOL

1.0.4

Backend tweaks for WordPress 5.4