The moderation object
Represents whether a given input is potentially harmful.
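The object is returned by the moderations endpoint. As a minimal sketch, assuming the official openai Python SDK and an OPENAI_API_KEY in the environment, a request and the top-level fields described below look like this:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.moderations.create(
    model="omni-moderation-latest",
    input="...text to classify...",
)

print(response.id)            # unique identifier for the request, e.g. "modr-..."
print(response.model)         # the model that produced the results
result = response.results[0]  # one result per input
print(result.flagged)         # True if any category below was flagged
```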
id string
The unique identifier for the moderation request.
model string
The model used to generate the moderation results.
results array
A list of moderation objects.
flagged boolean
Whether any of the below categories are flagged.
categories object
A list of the categories, and whether they are flagged or not.
harassment boolean
Content that expresses, incites, or promotes harassing language towards any target.
harassment/threatening boolean
Harassment content that also includes violence or serious harm towards any target.
sexual boolean
Content meant to arouse sexual excitement, such as the description of sexual activity, or that promotes sexual services (excluding sex education and wellness).
hate boolean
Content that expresses, incites, or promotes hate based on race, gender, ethnicity, religion, nationality, sexual orientation, disability status, or caste. Hateful content aimed at non-protected groups (e.g., chess players) is harassment.
hate/threatening boolean
Hateful content that also includes violence or serious harm towards the targeted group based on race, gender, ethnicity, religion, nationality, sexual orientation, disability status, or caste.
illicit boolean or null
Content that includes instructions or advice that facilitate the planning or execution of wrongdoing, or that gives advice or instruction on how to commit illicit acts.
illicit/violent boolean or null
Content that includes instructions or advice that facilitate the planning or execution of wrongdoing that also includes violence, or that gives advice or instruction on the procurement of any weapon.
self-harm boolean
Content that promotes, encourages, or depicts acts of self-harm, such as suicide, cutting, and eating disorders.
self-harm/intent boolean
Content where the speaker expresses that they are engaging or intend to engage in acts of self-harm, such as suicide, cutting, and eating disorders.
self-harm/instructions boolean
Content that encourages performing acts of self-harm, such as suicide, cutting, and eating disorders, or that gives instructions or advice on how to commit such acts.
sexual/minors boolean
Sexual content that includes an individual who is under 18 years old.
violence boolean
Content that depicts death, violence, or physical injury.
violence/graphic boolean
Content that depicts death, violence, or physical injury in graphic detail.
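Working with one entry of results parsed as a plain dict (shaped like the example object at the end of this section), the flagged category names can be collected as below. flagged_categories is a hypothetical helper, not part of the API; note that illicit and illicit/violent may be null rather than false on models that do not score them.

```python
def flagged_categories(result: dict) -> list[str]:
    # Hypothetical helper: names of flagged categories in one parsed result.
    # "illicit" and "illicit/violent" may be null (None) on models that do not
    # score them, so compare against True explicitly rather than by truthiness.
    return [name for name, value in result["categories"].items() if value is True]
```

Applied to the example object below, this returns harassment, harassment/threatening, violence, and violence/graphic.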
category_scores object
A list of the categories along with their scores as predicted by the model.
harassment number
The score for the category harassment.
harassment/threatening number
The score for the category harassment/threatening.
sexual number
The score for the category sexual.
hate number
The score for the category hate.
hate/threatening number
The score for the category hate/threatening.
illicit number
The score for the category illicit.
illicit/violent number
The score for the category illicit/violent.
self-harm number
The score for the category self-harm.
self-harm/intent number
The score for the category self-harm/intent.
self-harm/instructions number
The score for the category self-harm/instructions.
sexual/minors number
The score for the category sexual/minors.
violence number
The score for the category violence.
violence/graphic number
The score for the category violence/graphic.
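If the boolean flags are too strict or too lenient for a given use case, one option is to threshold category_scores directly; higher scores indicate higher model confidence. A sketch under that assumption, where THRESHOLDS, DEFAULT_THRESHOLD, and over_threshold are hypothetical names and the cutoff values are purely illustrative:

```python
# Illustrative per-category cutoffs; tune these against your own data.
THRESHOLDS = {
    "violence": 0.5,
    "self-harm": 0.2,
}
DEFAULT_THRESHOLD = 0.8

def over_threshold(result: dict) -> list[str]:
    # Names of categories whose score meets or exceeds the chosen cutoff.
    return [
        name
        for name, score in result["category_scores"].items()
        if score >= THRESHOLDS.get(name, DEFAULT_THRESHOLD)
    ]
```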
category_applied_input_types object
A list of the categories along with the input type(s) that the score applies to.
harassment array
The applied input type(s) for the category harassment.
harassment/threatening array
The applied input type(s) for the category harassment/threatening.
sexual array
The applied input type(s) for the category sexual.
hate array
The applied input type(s) for the category hate.
hate/threatening array
The applied input type(s) for the category hate/threatening.
illicit array
The applied input type(s) for the category illicit.
illicit/violent array
The applied input type(s) for the category illicit/violent.
self-harm array
The applied input type(s) for the category self-harm.
self-harm/intent array
The applied input type(s) for the category self-harm/intent.
self-harm/instructions array
The applied input type(s) for the category self-harm/instructions.
sexual/minors array
The applied input type(s) for the category sexual/minors.
violence array
The applied input type(s) for the category violence.
violence/graphic array
The applied input type(s) for the category violence/graphic.
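Because omni moderation models accept both text and images, this map records which input modality contributed to each category. A sketch of narrowing flagged categories to those triggered at least in part by image input (image_flagged_categories is a hypothetical helper operating on the same parsed-dict shape as above):

```python
def image_flagged_categories(result: dict) -> list[str]:
    # Flagged categories where image input contributed to the decision.
    applied = result["category_applied_input_types"]
    return [
        name
        for name, value in result["categories"].items()
        if value is True and "image" in applied.get(name, [])
    ]
```

For reference, a complete example moderation object: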
```json
{
"id": "modr-0d9740456c391e43c445bf0f010940c7",
"model": "omni-moderation-latest",
"results": [
{
"flagged": true,
"categories": {
"harassment": true,
"harassment/threatening": true,
"sexual": false,
"hate": false,
"hate/threatening": false,
"illicit": false,
"illicit/violent": false,
"self-harm/intent": false,
"self-harm/instructions": false,
"self-harm": false,
"sexual/minors": false,
"violence": true,
"violence/graphic": true
},
"category_scores": {
"harassment": 0.8189693396524255,
"harassment/threatening": 0.804985420696006,
"sexual": 1.573112165348997e-6,
"hate": 0.007562942636942845,
"hate/threatening": 0.004208854591835476,
"illicit": 0.030535955153511665,
"illicit/violent": 0.008925306722380033,
"self-harm/intent": 0.00023023930975076432,
"self-harm/instructions": 0.0002293869201073356,
"self-harm": 0.012598046106750154,
"sexual/minors": 2.212566909570261e-8,
"violence": 0.9999992735124786,
"violence/graphic": 0.843064871157054
},
"category_applied_input_types": {
"harassment": [
"text"
],
"harassment/threatening": [
"text"
],
"sexual": [
"text",
"image"
],
"hate": [
"text"
],
"hate/threatening": [
"text"
],
"illicit": [
"text"
],
"illicit/violent": [
"text"
],
"self-harm/intent": [
"text",
"image"
],
"self-harm/instructions": [
"text",
"image"
],
"self-harm": [
"text",
"image"
],
"sexual/minors": [
"text"
],
"violence": [
"text",
"image"
],
"violence/graphic": [
"text",
"image"
]
}
}
]
}
```