Each message gets categorized as containing various categories of behavior that’s abusive or potentially unwelcome in your community allowing for greater transparency and understanding than a simple binary configuration would. The detailed output also allows for greater configurability. You have the choice over which detected categories you deem to be of the highest priority for your community.
The API currently detects 6 main violence categories:
And 2 lexical categories:
Either of these categories may co-occur with any other depending on the content of the message.
The violence categories are divided based on:
The lexical categories are divided based on:
For a more detailed breakdown of the categories and their subdivisions please refer to the PDF document in the spotlights section. All of this information can be used for further configuration and allows you to act on a precise and fine-grained level to make sure that the intended moderation behavior matches your needs perfectly.
Integrate the Cyber Guardian into your suite of moderation tools on various platforms like Discord, Twitch or Youtube.
Moderation Functionalities
Using these functionalities you can empower your current moderation solutions with a versatile tool that can adjust itself to your and your communities needs. If your community has a 13+ audience it is likely wise to maintain all detections at the level of manual moderation at least and add some automated actions for the most egregious offenses. More adult-oriented communities might choose to completely turn off the detection of profanities, sexual remarks and some milder detection categories as they are likely to be seen as acceptable and these detections might not bring in valuable information in that context. Adjust your configurations freely at any time, customize moderation actions and automate to save on manual moderation tasks. Using these principles we have prepared a series of moderation presets for the API. You can review the presets in the Configuration Dashboard and select one that addresses your needs best.
Some of the features mentioned here are platform-dependent and the API currently only supports English. Other languages and new features continuously under development.