Incorporate bugbug ml labelling into moderation process #3574

Closed

ksy36 opened this issue May 19, 2021 · 6 comments

Comments

@ksy36
Contributor

ksy36 commented May 19, 2021

In #3571 I added bugbug classification: issues that are most likely invalid will receive a bugbug-probability-high label in the private repository. We need to incorporate this into the moderation process so that such issues are closed automatically.

There is a problem with posting issue content to the public repository without human moderation, as we may end up publishing abusive content and getting banned by GitHub again. As a solution, we could post placeholder text to the public issue instead of its content for these invalid issues: something like what the bot currently posts as a comment, but placed in the issue body instead.

issue is labelled invalid by the bot -> do not post the content to the public repository; post placeholder text instead and close the issue
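A rough sketch of what that rule could look like against the GitHub REST API (the function name, label check, and placeholder wording are hypothetical, not the actual webcompat.com code):

```python
# Hypothetical sketch of the auto-close step; names are illustrative only.
import requests

PLACEHOLDER_BODY = (
    "The content of this issue was classified as invalid by our "
    "machine-learning bot and has not been published."
)

def moderate_ml_flagged_issue(public_issue_url, labels, token):
    """If the bot flagged the issue, publish a placeholder body and close it."""
    if "bugbug-probability-high" not in labels:
        return  # leave the issue to human moderation
    # One PATCH call replaces the body and closes the issue.
    requests.patch(
        public_issue_url,
        json={"body": PLACEHOLDER_BODY, "state": "closed"},
        headers={"Authorization": f"token {token}"},
        timeout=10,
    )
```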

If we decide to go down this path, users won't be able to see the content of their report if it was labelled invalid (they will only see the placeholder text). It's not ideal (but acceptable?), and I wonder how important this is given the overwhelming number of issues we're getting.

Do you have any thoughts on this?
@softvision-oana-arbuzov @softvision-raul-bucata @karlcow

@ksy36
Contributor Author

ksy36 commented May 19, 2021

To add to this, @karlcow and I discussed another approach: we could train a second model to detect nsfw sites, label those, and decide whether to post the content publicly based on this label.

issue is labelled invalid and nsfw by the bot -> do not post the content to the public repository; post placeholder text instead and close the issue

issue is labelled invalid by the bot -> post the content to the public repository and close the issue
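Expressed as a decision table (a sketch; the label strings and return values are assumed from the rules above, not actual code):

```python
# Hypothetical decision table for the two-model approach.
def decide_public_action(labels):
    """Map bot labels on the private issue to an action on the public copy."""
    if "invalid" in labels and "nsfw" in labels:
        return ("placeholder", "closed")  # hide content, close immediately
    if "invalid" in labels:
        return ("publish", "closed")      # content is safe to show, close as invalid
    return ("publish", "open")            # regular moderation flow
```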

The problem with this approach is that we can't be 100% sure about the nsfw prediction. We currently have a good number of issues that are manually labelled nsfw; however, there will always be a new domain, an image, or something on a page that could be considered abusive content but will not be categorized by the model as nsfw. In addition, since we delete problematic (teen) issues in both repositories, there is no data for those domains either.

@softvision-oana-arbuzov
Member

@ksy36 I think using the placeholder text is a good approach. Most of those issues have no value (meaning no relevant info is given to work with), and I don't think the users who report these kinds of issues will come back to comment.

@ksy36
Contributor Author

ksy36 commented May 21, 2021

Sounds good, I'll go with this approach then, thanks for the feedback :)

@karlcow
Member

karlcow commented May 31, 2021

Just deployed on staging.

I created a dumb issue, which was identified as invalid and auto-closed, and the original content was updated with the placeholder.

Another bug was identified as valid, and after switching it to accepted, the original content was published.

This is awesome @ksy36
This will accelerate the work of @softvision-oana-arbuzov and @softvision-raul-bucata

@ksy36
Contributor Author

ksy36 commented May 31, 2021

Thanks for testing on staging @karlcow, I'll deploy this to production today.

@ksy36
Contributor Author

ksy36 commented May 31, 2021

I think @karlcow has deployed it already 😁
