
Paste Check: Prompt people pasting text to consider risk of copyright violation
Open, Needs Triage, Public

Description

This task covers the work of introducing an Edit Check that would prompt people to consider whether the text they're pasting into an article is at risk of being a copyright violation.

Stories

  1. As an experienced volunteer/moderator motivated to ensure that Wikipedia's text, "...may be freely redistributed, reused and built upon by anyone...",[i][ii] I want everyone contributing new text content in the main namespace to consider whether what they're contributing is copyrighted so that I can reduce the likelihood of needing to do so myself.
  2. As someone who is new and motivated to contribute what I consider to be valuable and missing knowledge to Wikipedia, I want to know that what I'm contributing adheres to relevant policies and conventions, so that I, and other people, can access this knowledge over time.

User experience

The Editing Team is exploring a range of "resolution pathways"

Requirements

Open questions

  1. Who (logged in, logged out, experience level, etc.) will this check be made available to by default?
  2. What conditions will determine whether the Copyvio Check is activated on paste?
    • E.g. amount of content being pasted, where the content is being pasted from (e.g. external/internal domain), whether the pasted content is wrapped in quotation marks, etc.
  3. How much content will someone have had to paste in order for this Check to become activated?
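
To make the open questions above concrete, here is a minimal sketch of how the activation conditions could be combined. Everything in it is an assumption for discussion, not a proposal from the Editing Team: the `PasteEvent` shape, the `MIN_CHARS` threshold, the `INTERNAL_DOMAIN_SUFFIXES` list, and the choice to skip quoted or internally sourced pastes are all hypothetical placeholders for answers this task has not yet settled.

```python
from dataclasses import dataclass
from typing import Optional
from urllib.parse import urlparse


@dataclass
class PasteEvent:
    """Hypothetical description of a single paste into the editor."""
    text: str
    source_url: Optional[str] = None  # None when the paste source is unknown


# Hypothetical values: the task leaves both of these as open questions.
MIN_CHARS = 200
INTERNAL_DOMAIN_SUFFIXES = ("wikipedia.org",)

# Straight and curly double-quote pairs.
QUOTE_PAIRS = (('"', '"'), ("\u201c", "\u201d"))


def is_quoted(text: str) -> bool:
    """True when the whole paste is wrapped in a pair of quotation marks."""
    stripped = text.strip()
    return len(stripped) >= 2 and any(
        stripped.startswith(opening) and stripped.endswith(closing)
        for opening, closing in QUOTE_PAIRS
    )


def is_internal(source_url: Optional[str]) -> bool:
    """True when the paste source is known and belongs to an internal domain."""
    if not source_url:
        return False
    host = urlparse(source_url).hostname or ""
    return any(
        host == suffix or host.endswith("." + suffix)
        for suffix in INTERNAL_DOMAIN_SUFFIXES
    )


def should_activate(event: PasteEvent) -> bool:
    """Activate only for long, unquoted pastes not known to be internal."""
    if len(event.text.strip()) < MIN_CHARS:
        return False
    if is_quoted(event.text):
        return False
    if is_internal(event.source_url):
        return False
    return True
```

Under these assumptions, a paste with an unknown source is treated like an external one, which errs toward showing the Check. Whether that is the right default is part of question 2.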

References


i. en:wp:copyvio
ii. https://www.wikidata.org/wiki/Q10990487


Thank you to @Pikne. The comment you made in T300942#9597055 was the prompt I needed to file this task.

Event Timeline

Having the editor automatically check whether text being added already appears elsewhere on the internet (i.e. like Earwig's detector does) and warn editors about copyright if so would be an excellent feature. We'd also want instances where the editor declines to change the edit after being warned to be logged for further review. Sometimes there may be no issues (e.g. the text is from a freely licensed source, or is being properly quoted), but often there will be.

Sometimes new editors copy articles from one title to another because they do not know how Special:MovePage works, and are in fact fully unaware of its existence. This would be a false positive for a copy-paste copyvio check.

Having the editor automatically check whether text being added already appears elsewhere on the internet (i.e. like Earwig's detector does) and warn editors about copyright if so would be an excellent feature.

Nice...it's helpful to know a feature of this sort resonates in concept, @Sdkb !

We'd also want instances where the editor declines to change the edit after being warned to be logged for further review. Sometimes there may be no issues (e.g. the text is from a freely licensed source, or is being properly quoted), but often there will be.

Mmm, great spot. Can you think of a case where the "logged for further review" process you're alluding to already happens? I ask this wondering if there is something that seems to be working well that we could learn from. [i]


i. Flagged Revisions came to mind. Though, I assumed you might be referring to something else, seeing as how [I don't think?] it is used widely at en.wiki.

Sometimes new editors copy articles from one title to another because they do not know how Special:MovePage works, and are in fact fully unaware of its existence. This would be a false positive for a copy-paste copyvio check.

Great spot, @Snaevar! I hadn't considered this case...

Any ideas for how we might mitigate the case you described? One thought: might we be able to check if the content someone is pasting exists elsewhere within the wiki?
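
As a rough illustration of the "exists elsewhere within the wiki" idea, the sketch below compares a paste against candidate page texts using overlapping word sequences (shingles). It is only a sketch under stated assumptions: the `pages` dictionary stands in for however existing page text would actually be retrieved, and the shingle size and `threshold` are hypothetical values, not tested ones.

```python
import re
from typing import Dict, Optional, Set, Tuple


def shingles(text: str, size: int = 5) -> Set[Tuple[str, ...]]:
    """Return the set of overlapping word sequences of the given size."""
    words = re.findall(r"\w+", text.lower())
    return {tuple(words[i:i + size]) for i in range(len(words) - size + 1)}


def containment(pasted: str, candidate: str, size: int = 5) -> float:
    """Fraction of the paste's shingles that also appear in the candidate."""
    pasted_shingles = shingles(pasted, size)
    if not pasted_shingles:
        return 0.0
    shared = pasted_shingles & shingles(candidate, size)
    return len(shared) / len(pasted_shingles)


def find_internal_source(
    pasted: str, pages: Dict[str, str], threshold: float = 0.8
) -> Optional[str]:
    """Return the title of the best-matching page, or None below the threshold."""
    best_title, best_score = None, 0.0
    for title, text in pages.items():
        score = containment(pasted, text)
        if score > best_score:
            best_title, best_score = title, score
    return best_title if best_score >= threshold else None
```

A match here would suggest the paste is an internal copy (for example, a copy-paste move or talk page archiving) rather than text from an external source, so the Check could respond differently. Pastes shorter than the shingle size never match in this sketch.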

Can you think of a case where the "logged for further review" process you're alluding to already happens?

What I'd envision is some sort of software tag, which community-developed tools could then use to create a feed to check. I'm not involved with copyright patrol enough to know what precedents exist currently, but WikiProject Copyright Cleanup would.

Sometimes new editors copy articles from one title to another because they do not know how Special:MovePage works, and are in fact fully unaware of its existence. This would be a false positive for a copy-paste copyvio check.

Worth considering that this is also how "archiving" works for talk pages. Sometimes people have a bot do it, and it's generally going to be in the wikitext editor, but...

ppelberg added a project: Editing-team.
ppelberg moved this task from Untriaged to Upcoming on the Editing-team board.
ppelberg added a subscriber: Dyolf77_WMF.

Adding the "Open questions" @Dyolf77_WMF raised offline.

ppelberg updated the task description.
ppelberg edited projects, added Editing-team (Kanban Board); removed Editing-team.
ppelberg moved this task from Incoming to Doing on the Editing-team (Kanban Board) board.

Next steps

Per what @nayoub and I discussed offline today, the main next step is converging on the resolution workflow. Said another way: defining what questions, and subsequent actions, Edit Check will prompt people to engage with before the system considers a Check "addressed."

More immediately, the Editing Team will share an initial proposal with volunteers and refine it with them on-wiki and through Community Conversations.

ppelberg updated the task description.
ppelberg renamed this task from Copyvio Check: Prompt people pasting text to consider risk of copyright violation to Paste Check: Prompt people pasting text to consider risk of copyright violation. Wed, Jul 3, 10:12 PM
ppelberg added a subscriber: Trizek-WMF.

Meta: we're renaming this Check from "CopyVio Check" to "Paste Check" per a suggestion @Trizek-WMF made offline.