📌 Note: Only applicable for Reputation+ and Engage+ plans. If you are on the Essential Plan, please refer to our Essential Moderation Categories Guide.
🚀 How BrandBastion Moderation Works
At BrandBastion, our AI moderation is purpose-built for the fast-changing, high-volume world of social media. We combine proprietary models, best-in-class third-party tools, and continuous human oversight to ensure accuracy, nuance, and scalability.
Proprietary Technology:
11+ years of real-world social data used to train our models to understand nuance, slang, emojis, sarcasm, and multilingual content.
Human + AI Collaboration:
Dedicated linguists and strategists constantly update models based on new formats, trends, and community behaviors, ensuring continuous learning and client-specific customization.
🧹 Core Moderation Categories
Discriminatory
Comments attacking people based on their race, gender, ethnicity, nationality, religion, disability, serious disease (including obesity), or sexual orientation or identity, as well as comments expressing unfair generalizations about groups of people or supporting their unfair treatment.
Disturbing/Violent
Comments/images that depict detailed violence, death, suicide, violent threats or other similar topics, and comments wishing or threatening detailed violence, death, or suicide to another user.
Extreme Profanity
Comments including extreme profanity that is inappropriate in normal social media language in the context of adults.
Any form (spelling, grouping) of the following swear words: Fuck, Cock, Wanker, Cunt, Dick, Pussy, Prick
Personally Identifiable Information (PII)
Comments containing Personally Identifiable Information (PII) such as bank accounts or PIN numbers, financial data, pictures or information from card IDs, passport data or social security numbers, contact info such as emails, phone numbers and addresses, other personally identifiable information, such as Skype IDs, Kik IDs, IP addresses, and other similar information.
Spam
Non-human communication such as random character sequences. Messages for the purposes of advertising, phishing, self promotion or gaining followers.
Repeated Non-Harmful
Comments from users who continue to post similar comments several times in a row on the same post.
After 5 repeated comments from the same user, the 6th comment and all future comments will be tagged as Repeated Non-Harmful.
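The counting rule above can be illustrated with a minimal sketch. This is not BrandBastion's implementation: the function names, the input shape, and the similarity check (case- and whitespace-insensitive equality) are assumptions made only for illustration.

```python
REPEAT_LIMIT = 5  # from the rule above: comments 1-5 are not tagged


def normalize(text):
    # Hypothetical stand-in for "similar": ignore case and extra whitespace.
    return " ".join(text.lower().split())


def tag_repeated(comments):
    """Return one boolean per comment: True if it would be tagged.

    comments: iterable of (user, post_id, text) tuples in posting order.
    """
    streaks = {}  # (user, post_id) -> (last normalized text, streak length)
    tags = []
    for user, post_id, text in comments:
        key = (user, post_id)
        norm = normalize(text)
        last, count = streaks.get(key, (None, 0))
        # Extend the streak on a similar comment, otherwise start over.
        count = count + 1 if norm == last else 1
        streaks[key] = (norm, count)
        tags.append(count > REPEAT_LIMIT)
    return tags
```

In this sketch, seven identical comments from one user on one post leave the first five untagged and tag the sixth and seventh. A different comment in between, or a different post, restarts the count.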
Brand Attack
Comments attacking the brand that are aggressive and intend to do harm to the company, brand, products, services, website, reviews, advertisement and any other company referent. These comments typically cannot be resolved even if the brand would respond to them.
Brand Critique
Comments that are "negative about the brand" and/or express criticism towards the company, brand, products, services, website, reviews, advertisements and any other company referent.
Competitor Mention
Comments that mention a brand name that is a competitor of the client or another brand that the client has defined in the guidelines. This includes positive comparisons towards the brand.
Competitor Promotion
Comments that promote a brand name that is a competitor of the client or another brand that the client has defined in the guidelines. This includes comments where users are encouraging users towards the competitor(s).
Brand Impersonation
Comments from users attempting to impersonate the brand as a representative or employee, either by using a username that is very similar to the brand name or commenting in a manner indicating that they would be a representative of the brand without using the official brand handle. These comments are the type of comments that cause or aim to cause confusion amongst people reviewing the post in terms of whether it is an official representative posting or not.
Intoxicants
Any mentions of intoxicating substances such as alcohol, drugs, narcotics, or pharmaceuticals, in an excessive, illegal, or unprescribed way.
Against Person Featured
Comments where users are attacking specific influencers or people featured in the ad saying mean things about them.
Personal Attack/Bullying
Comments where users are attacking specific users or groups of users/people other than the brand saying mean things about them.
Mild Profanity
Comments including mild profanity that would be inappropriate for a child but considered normal social media language in the context of adults. This includes abbreviations such as WTF, AF, FML, STFU etc.
Any form (spelling, grouping) of the following swear words: Crap, Damn, Hell, Shit, Bitch, Bollocks, Douche, Ass, 💩
Referral Code
Comments from users indicating, mentioning or requesting a referral code for any product or service of the brand or other brands.
Sexual/Toilet
Comments or emojis and GIFs about sexual activity, sex organs, secretion, or toilet activities, unrelated to posts or ads. Relevant emojis and GIFs include eggplant, teardrop, toilet bowl, and vomit.
Unsolicited Sales
Comments from unsolicited vendors advertising or reselling client products.
Socio-Political
Comments from users aiming to spread awareness of current societal or political issues or current events. Government involvement is typically a catalyst in these types of comments.
Repeated Socio-Political
Repeated similar or exact Socio-Political comments from the same user on the same or different posts.
After the first comment labeled as Socio-Political, the second and onwards will be tagged only as Repeated Socio-Political.
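As a rough sketch of the relabeling rule above, assume an upstream classifier has already assigned category names to each comment. The function name and input shape are hypothetical. The sketch also simplifies in two ways: it skips the similarity check and treats any later Socio-Political comment from the same user as a repeat, and it reads "tagged only as" to mean that the repeat tag replaces the Socio-Political tag.

```python
def relabel_socio_political(comments):
    """Swap Socio-Political for Repeated Socio-Political on repeat comments.

    comments: iterable of (user, labels) in posting order, where labels is
    a set of category names from a hypothetical upstream classifier.
    Posts are not tracked because the rule applies across posts.
    """
    seen_users = set()
    result = []
    for user, labels in comments:
        labels = set(labels)  # copy so the caller's data is not mutated
        if "Socio-Political" in labels:
            if user in seen_users:
                # Second and later comments carry only the repeat tag.
                labels.discard("Socio-Political")
                labels.add("Repeated Socio-Political")
            else:
                seen_users.add(user)
        result.append(labels)
    return result
```

Other tags on the same comment, such as Mild Profanity, are left untouched in this sketch.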
🏭 Industry-Specific Categories
Entertainment and Gaming
VPN
Comments including explanations for how to access regionally available online content by using a VPN.
Account Misuse
Comments including trading, selling or buying of accounts, and explanations for how to circumvent terms of service set by the provider.
Piracy
Comments including links that may lead to piracy websites or other copyright infringements, mentions of watching, using or accessing copyrighted content through illegal downloads and streaming.
Beauty, Retail & E-Commerce
Legitimacy
Comments from users asking questions, expressing concerns or confusion regarding the legitimacy or reliability of the company/products or the ownership/structure or business model of the company.
Inventory
Negative and neutral comments including questions, complaints and statements about the availability of products as well as users not being able to find specific items.
Usertag
Comments that include one or multiple user tags only. Ex: @user or @user@user
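A comment that consists of user tags only can be recognized with a simple pattern. The sketch below is an illustration under assumptions: the function name is hypothetical, and the set of characters allowed in a username (letters, digits, underscores, periods) is a guess that differs between platforms.

```python
import re

# One or more @handles, optionally separated by whitespace, and nothing else.
# The handle character set is an assumption, not a platform specification.
USERTAG_ONLY = re.compile(r"\s*(?:@[\w.]+\s*)+")


def is_usertag_only(text):
    """Return True if the comment contains user tags and no other content."""
    return USERTAG_ONLY.fullmatch(text) is not None
```

Under these assumptions, `is_usertag_only("@user")` and `is_usertag_only("@user@user")` both return True, while `is_usertag_only("@user nice post")` returns False.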
Price/Cost
Positive, neutral or negative comments discussing price, cost, salaries, billing, monthly payments, etc. This includes comments about money and affordability, and any mentions of the word "free" or "expensive".
Gaming
Hacks and Cheats
Comments including links or explanations of how to hack or cheat a game.
Health
Diseases
Comments from users mentioning diseases of any sort, such as Cancer, Diabetes, or Parkinson's, as well as the word "disease".
🚨 Alerts Categories
Potential Threat
Comments threatening the company, its representatives, locations, or online properties with violence, attacks or legal actions. Users pushing others to take these actions are also included. Specific details like addresses, dates or employee names are not mandatory.
Severe Event
Comments including reports of safety or health issues, harassment, or fear involving the client's products, services or representatives.
Post Issue
Comments highlighting issues with the post, such as audio/links not working in ad/post, spelling mistakes, grammar mistakes.
Allegations of IP Misuse
Comments where users are alleging that the company is infringing someone else's intellectual property rights. This includes claims that the brand is using images, photos and citations that it does not own, is not crediting the owner, and/or is selling products that it does not have permission to sell.
Self Harm
Comments from users in need of immediate help who mention or imply that they are planning or intending to commit suicide, or that they have already injured themselves.
Protest Specific
Comments including details of a real life protest against the client, or attempts to organize one. The comment will include specific details such as location, time, target, or other specific details.
🧩 Additional Industry Alerts
Beauty Industry
Shade Range
Comments from users who indicate that the shade ranges offered by the brand are too limited, leading to feelings of exclusion.
Entertainment Industry
Audio/Subtitle Issue
Comments where users are reporting issues with subtitles or dubs in-app. This includes comments mentioning that the audio and the subtitles are not synchronised, subtitles are different from what is being said, incorrect translations, and so on.
Language Discrepancy
Comments where users are expressing negative sentiment about the language used in the ad, specifically when the ad is in English and targets a country where English is not the majority language. Comments where users say that they do not speak the language the ad is in can also be considered.
Food Industry
Food Safety Issue
Comments about severe events related to food safety. These typically include key phrases such as:
Hospital
Got Sick
Food Poisoning
Listeria
E. Coli
FDA
CDC
References to Foreign Matter in the Ingredients
📈 Volume-Based Alerts (Threshold Varies Per Client)
Protest General
Comments using words or hashtags such as boycott, protest or asking people to protest against the client in an online capacity.
Spike in Negative Sentiment
Alert based on negative sentiment tag.
Post Not Resonating
Any comments expressing criticism or dissatisfaction about the post or the creative itself.
Complaints about Outages
Complaints about technical issues with the platform that reach the alert threshold within one region.
Animal Welfare
Any comment, statement or question from users regarding animal welfare practices. Comments defending the brand are not included.
Animal Welfare Defending Brand
Comments from users who defend the brand and explain their position on animal testing.
HR and Factory Practices
Comments about the company's HR or factory practices, including any comments regarding the well-being of factory workers, working conditions in factories, etc.
🌍 Languages Covered
Our moderation models, combined with powerful translation capabilities, cover 109 languages for full global support.
Languages supported include:
Afrikaans, Albanian, Amharic, Arabic, Armenian, Azerbaijani, Basque, Belarusian, Bengali, Bosnian, Bulgarian, Catalan, Cebuano, Chinese (Simplified), Chinese (Traditional), Corsican, Croatian, Czech, Danish, Dutch, English, Esperanto, Estonian, Filipino (Tagalog), Finnish, French, Frisian, Galician, Georgian, German, Greek, Gujarati, Haitian Creole, Hausa, Hawaiian, Hebrew, Hindi, Hmong, Hungarian, Icelandic, Igbo, Indonesian, Irish, Italian, Japanese, Javanese, Kannada, Kazakh, Khmer, Kinyarwanda, Korean, Kurdish, Kyrgyz, Lao, Latin, Latvian, Lithuanian, Luxembourgish, Macedonian, Malagasy, Malay, Malayalam, Maltese, Maori, Marathi, Mongolian, Myanmar (Burmese), Nepali, Norwegian, Nyanja (Chichewa), Odia (Oriya), Pashto, Persian, Polish, Portuguese (Portugal, Brazil), Punjabi, Romanian, Russian, Samoan, Scots Gaelic, Serbian, Sesotho, Shona, Sinhala (Sinhalese), Slovak, Slovenian, Somali, Spanish, Sundanese, Swahili, Swedish, Tajik, Tamil, Tatar, Telugu, Thai, Turkish, Turkmen, Ukrainian, Urdu, Uyghur, Uzbek, Vietnamese, Welsh, Xhosa, Yiddish, Yoruba, Zulu.