This plugin implements ModerateHatespeech.com’s machine learning API to detect and flag potentially toxic and hateful comments. A configurable threshold can be set and comments with scores above that threshold will be sent to the moderation queue for administrative approval.
A free API key from ModerateHatespeech.com is required.
We define “Toxic” as “any content a reasonable, PG-13 forum wouldn’t want for content reasons.”
This does not include:
- Spam or Advertisements
- NSFW/Explicit Content
- Forum-specific Etiquette
This does include:
- Hate Speech
- Verbal Insults
- Slurs Directed Towards a Group
More details on toxicity, as well as model accuracy and bias can be found on ModerateHatespeech.com
This plugin is part of the ModerateHatespeech project, an initiative to research and reduce online hate. Use of the API constitues acceptance of ModerateHatespeech’s Terms of Service and Privacy Policies.
- Upload the .zip file of this plugin to WP Admin
- Activate the plugin through the ‘Plugins’ menu in WordPress
以下人員參與了開源軟體〈Auto Comment Moderation〉的開發相關工作。參與者
- Initial plugin release