Jump to content

Talk:Spam blacklist: Difference between revisions

From Meta, a Wikimedia project coordination wiki
Latest comment: 1 year ago by Billinghurst in topic Proposed additions
Content deleted Content added
Line 103: Line 103:
* {{LinkSummary|everybodywiki.com}}
* {{LinkSummary|everybodywiki.com}}
* {{LinkSummary|en.everybodywiki.com}}
* {{LinkSummary|en.everybodywiki.com}}

* {{LinkSummary|kibrit.com.tr}}
Please remove them, they don't seem to be any sort of spam. I don't know why one of these are in the blacklist.
Please remove them, they don't seem to be any sort of spam. I don't know why one of these are in the blacklist.


:{{Declined}}, thunkable is not blacklisted anywhere, everybodywiki only on fr and pl wikipedia. Nothing we can do here. [[User:Beetstra|Dirk Beetstra]] <sup>[[User_Talk:Beetstra|<span style="color:#0000FF;">T</span>]] [[Special:Contributions/Beetstra|<span style="color:#0000FF;">C</span>]]</sup> (en: [[:en:User:Beetstra|U]], [[:en:User talk:Beetstra|T]]) 17:49, 29 January 2023 (UTC)
:{{Declined}}, thunkable is not blacklisted anywhere, everybodywiki only on fr and pl wikipedia. Nothing we can do here. [[User:Beetstra|Dirk Beetstra]] <sup>[[User_Talk:Beetstra|<span style="color:#0000FF;">T</span>]] [[Special:Contributions/Beetstra|<span style="color:#0000FF;">C</span>]]</sup> (en: [[:en:User:Beetstra|U]], [[:en:User talk:Beetstra|T]]) 17:49, 29 January 2023 (UTC)

=== kibrit removal request ===
* {{LinkSummary|kibrit.com.tr}}
<!-- Template:Unsigned --><small class="autosigned">—&nbsp;The preceding [[Help:Signature|unsigned]] comment was added by [[User:78.179.74.70|78.179.74.70]] ([[User talk:78.179.74.70|{{int:Talkpagelinktext}}]]) </small>


== Troubleshooting and problems ==
== Troubleshooting and problems ==

Revision as of 10:09, 9 February 2023

Shortcut:
WM:SPAM
WM:SBL
The associated page is used by the MediaWiki Spam Blacklist extension, and lists regular expressions which cannot be used in URLs in any page in Wikimedia Foundation projects (as well as many external wikis). Any Meta administrator can edit the spam blacklist; either manually or with SBHandler. For more information on what the spam blacklist is for, and the processes used here, please see Spam blacklist/About.

Proposed additions
Please provide evidence of spamming on several wikis. Spam that only affects a single project should go to that project's local blacklist. Exceptions include malicious domains and URL redirector/shortener services. Please follow this format. Please check back after submitting your report, there could be questions regarding your request.
Proposed removals
Please check our list of requests which repeatedly get declined. Typically, we do not remove domains from the spam blacklist in response to site-owners' requests. Instead, we de-blacklist sites when trusted, high-volume editors request the use of blacklisted links because of their value in support of our projects. Please consider whether requesting whitelisting on a specific wiki for a specific use is more appropriate - that is very often the case.
Other discussion
Troubleshooting and problems - If there is an error in the blacklist (i.e. a regex error) which is causing problems, please raise the issue here.
Discussion - Meta-discussion concerning the operation of the blacklist and related pages, and communication among the spam blacklist team.
#wikimedia-external-linksconnect - Real-time IRC chat for co-ordination of activities related to maintenance of the blacklist.
Whitelists
There is no global whitelist, so if you are seeking a whitelisting of a url at a wiki then please address such matters via use of the respective Mediawiki talk:Spam-whitelist page at that wiki, and you should consider the use of the template {{edit protected}} or its local equivalent to get attention to your edit.

Please sign your posts with ~~~~ after your comment. This leaves a signature and timestamp so conversations are easier to follow.


Completed requests are marked as {{added}}/{{removed}} or {{declined}}, and are generally archived quickly. Additions and removals are logged · current log 2024/07.

SpBot archives all sections tagged with {{Section resolved|1=~~~~}} after 7 days and sections whose most recent comment is older than 15 days.

Proposed additions

This section is for proposing that a website be blacklisted; add new entries at the bottom of the section, using the basic URL so that there is no link (example.com, not http://www.example.com). Provide links demonstrating widespread spamming by multiple users on multiple wikis. Completed requests will be marked as {{added}} or {{declined}} and archived.

about.com



This site has now been taken over and is solely marketing and full of redirects to a whole range of sites and will never be a valid reliable site whilst this is occurring.  — billinghurst sDrewth 07:45, 7 January 2023 (UTC)Reply

@Billinghurst: Added Added to Spam blacklist. -- — billinghurst sDrewth 07:45, 7 January 2023 (UTC)Reply
@Billinghurst your addition seems to cause multiple issues at dewiki [1] & enwiki [2] and likely in other projects as well, judging by the large number of about.com-links in major content wikis [3]. Should all of these links be removed? Johannnes89 (talk) 16:00, 16 January 2023 (UTC)Reply
@Johannnes89: We should be having no new links to about.com, the domain is basically a full redirect service for which there is no overt control. That is what my action does, it does not impact existing links. As such there is no need for new legitimate users to be adding those links as there is no clear external source of that data, users should add the target actual domain if it is relevant. So the domain clearly meets our criteria for blacklisting with all the recent spam activity.

With regard to resolution of existing links these wikis can easily whitelist these domains to alleviate immediate issues, and then make their own programs to resolution with their local consensus. I made the recommendation on User talk:InternetArchiveBot to have their bot do it. I informed enWP of the issue directly and had a conversation with them with regard to moving to archived links, temporarily removed the blacklisting for them to act, then reimposed following the completion of the conversation. If they are going to run bots through to resolve we can again temporarily remove the listing.

What are you suggesting is a better way to act? Leave in a redirecting domain? One that is run by a marketing company that generates articles for the purpose of advertising around the outside and through it? Their target sites are available for diligent use, their redirecting services is now gone. Happy to hear a productive solution rather than hearing that the WPs have got themselves into a pickle.  — billinghurst sDrewth 21:51, 16 January 2023 (UTC)Reply

I think I've misinterpreted some of the spam log entries I saw. Given the scope of this redirect spam I understand the need for blacklisting. I will then suggest removing all links at dewiki (where community members complained to me about being blocked by the blacklist). Johannnes89 (talk) 22:37, 16 January 2023 (UTC)Reply
gudn tach!
one problem is: if there is a link to a about.com webpage which has recently changed, then it's not always possible for the users to replace the link with an archived version, because that could be interpreted by the SBL as adding a new link to the blacklisted website.
normally it's better to first replace all links and then add the domain to the spam-blacklist. in local wikipedias that's sometimes feasible and sometimes complicated. globally that's way more difficult and time-consuming. so basically i understand the blacklisting.
but blacklisting only is useful, if people try to add new links to that domain. if that's not the case (i.e. if there is no new spamming), the blacklisting won't help, but it would just bother some people (and bots) who want to help.
so: is there any new spamming? if not, i don't see the need for urgent blacklisting. -- seth (talk) 22:56, 16 January 2023 (UTC)Reply
i've read w:en:Wikipedia:Link_rot/URL_change_requests#about.com_usurped_and_wiki_blacklisted now. ok, so if there is current spamming, then i understand the blacklisting again. :-)
at least at dewiki we can try to unblacklist the domain temporarily until the links are replaced (or spamming starts at dewiki, too). -- seth (talk) 23:06, 16 January 2023 (UTC)Reply
@Lustiger seth: It was an oversight by me to not ping you as this impacted deWP. My apologies to you and your community.  — billinghurst sDrewth 11:46, 17 January 2023 (UTC)Reply

@Billinghurst, Lustiger seth, and Johannnes89:I have on en.wikipedia whitelisted 'archive\.org.*?http:\/\/.*?\babout.com\b', which now allows for about.com archive links but not for about.com links themselves. Maybe better would be to have a negative lookbefore in our regex here? --Dirk Beetstra T C (en: U, T) 05:47, 23 January 2023 (UTC)Reply

Makes sense.  — billinghurst sDrewth 10:49, 23 January 2023 (UTC)Reply
hi!
i added
web\.archive\.org/web/[0-9]+/https?://(?:[a-z0-9]+\.|)about\.com
at dewiki last week.[4]
a negative look-behind assertion in php is not able to be of variable length,[5] so we would have to use fixed length. still, it might be possible.
however, it's complicated, because the sbl builds the regexps like this:[6][7]
$regexp = '/(?:https?:)?\/\/+[a-z0-9_\-.]*(' . "$sbl_entry_0|" . "$sbl_entry_1|" . ... . "$sbl_entry_n" . ')/im';
now, if we want to blacklist about.com, but not archived about.com, we would need something such as
(?<!web\.archive\.org/web/[0-9]{14}/https://)(?<!web\.archive\.org/web/[0-9]{14}/http://)(?:[a-z0-9]+\.|)about\.com
as sbl entry. but that won't probably work, because of the [a-z0-9_\-.]* part that is added automatically by the sbl script.
so, one way to circumvent this should be the addition of an positive look-behind assertion, to ensure that the full host name is matched with our expression and not the automatically added part:
(?<!web\.archive\.org/web/[0-9]{14}/https://)(?<!web\.archive\.org/web/[0-9]{14}/http://)(?<=/)(?:[a-z0-9]+\.|)about\.com
i have not tested it, so there's a not so small chance that this regexp is incorrect. however, you might give it a try and test it. -- seth (talk) 23:42, 23 January 2023 (UTC)Reply
@Lustiger seth You could test this with some completely non-existing domains on a local wiki to minimize collateral damage, then port it here for about.com. That might also then be ported to those proper official domains where we need to link to an about/index.htm/php or similar. Dirk Beetstra T C (en: U, T) 06:01, 26 January 2023 (UTC)Reply

polskieogloszenia.pl



Being whacked by a lot of spambots with a variety of subdomains.  — billinghurst sDrewth 21:46, 30 January 2023 (UTC)Reply

@Billinghurst: Added Added to Spam blacklist. -- — billinghurst sDrewth 21:46, 30 January 2023 (UTC)Reply

judisbobet88.net



spambot activity for gambling site  — billinghurst sDrewth 10:24, 3 February 2023 (UTC)Reply

@Billinghurst: Added Added to Spam blacklist. -- — billinghurst sDrewth 10:24, 3 February 2023 (UTC)Reply

tiktok.com



To prevent users from promoting their TikTok account. Solaris5296 (talk) 21:25, 4 February 2023 (UTC)Reply

Tiktok ... To prevent users from promoting their any TikTok-related accounts. Solaris5296 (talk) 21:25, 4 February 2023 (UTC)Reply

@Solaris5296:  Declined Please reread the criteria above for what can be blacklisted.  — billinghurst sDrewth 21:31, 5 February 2023 (UTC)Reply

Proposed additions (Bot reported)

This section is for domains which have been added to multiple wikis as observed by a bot.

These are automated reports, please check the records and the link thoroughly, it may report good links! For some more info, see Spam blacklist/Help#COIBot_reports. Reports will automatically be archived by the bot when they get stale (less than 5 links reported, which have not been edited in the last 7 days, and where the last editor is COIBot).

Sysops
  • If the report contains links to less than 5 wikis, then only add it when it is really spam
  • Otherwise just revert the link-additions, and close the report; closed reports will be reopened when spamming continues
  • To close a report, change the LinkStatus template to closed ({{LinkStatus|closed}})
  • Please place any notes in the discussion section below the HTML comment

COIBot

The LinkWatchers report domains meeting the following criteria:

  • When a user mainly adds this link, and the link has not been used too much, and this user adds the link to more than 2 wikis
  • When a user mainly adds links on one server, and links on the server have not been used too much, and this user adds the links to more than 2 wikis
  • If ALL links are added by IPs, and the link is added to more than 1 wiki
  • If a small range of IPs have a preference for this link (but it may also have been added by other users), and the link is added to more than 1 wiki.
COIBot's currently open XWiki reports
List Last update By Site IP R Last user Last link addition User Link User - Link User - Link - Wikis Link - Wikis
vrsystems.ru 2023-06-27 15:51:16 COIBot 195.24.68.17 192.36.57.94
193.46.56.178
194.71.126.227
93.99.104.93
2070-01-01 05:00:00 4 4

Proposed removals

This section is for proposing that a website be unlisted; please add new entries at the bottom of the section. Use a suitable 3rd level heading and display the domain name as per this example {{LinkSummary|targetdomain.com}}.

Remember to provide the specific domain blacklisted, links to the articles they are used in or useful to, and arguments in favour of unlisting. Completed requests will be marked as {{removed}} or {{declined}} and archived.

See also recurring requests for repeatedly proposed (and refused) removals.

Notes:

  • The addition or removal of a domain from the blacklist is not a vote; please do not bold the first words in statements.
  • This page is for the removal of domains from the global blacklist, not for removal of domains from the blacklists of individual wikis. For those requests please take your discussion to the pertinent wiki, where such requests would be made at Mediawiki talk:Spam-blacklist at that wiki. Search spamlists — remember to enter any relevant language code

community.thunkable.com, x.thunkable.com, docs.thunkable.com, and en.everybodywiki.com













Please remove them, they don't seem to be any sort of spam. I don't know why one of these are in the blacklist.

 Declined, thunkable is not blacklisted anywhere, everybodywiki only on fr and pl wikipedia. Nothing we can do here. Dirk Beetstra T C (en: U, T) 17:49, 29 January 2023 (UTC)Reply

kibrit removal request



— The preceding unsigned comment was added by 78.179.74.70 (talk)

Troubleshooting and problems

This section is for comments related to problems with the blacklist (such as incorrect syntax or entries not being blocked), or problems saving a page because of a blacklisted link. This is not the section to request that an entry be unlisted (see Proposed removals above).

Discussion

This section is for discussion of Spam blacklist issues among other users.