wikisexguide.com
robots.txt

Robots Exclusion Standard data for wikisexguide.com

Resource Scan

Scan Details

Site Domain wikisexguide.com
Base Domain wikisexguide.com
Scan Status Ok
Last Scan 2024-09-20T02:03:52+00:00
Next Scan 2024-10-20T02:03:52+00:00

Last Scan

Scanned 2024-09-20T02:03:52+00:00
URL https://wikisexguide.com/robots.txt
Redirect https://www.wikisexguide.com/robots.txt
Redirect Domain www.wikisexguide.com
Redirect Base wikisexguide.com
Domain IPs 172.66.40.199, 172.66.43.57, 2606:4700:3108::ac42:28c7, 2606:4700:3108::ac42:2b39
Redirect IPs 172.66.40.199, 172.66.43.57, 2606:4700:3108::ac42:28c7, 2606:4700:3108::ac42:2b39
Response IP 172.66.43.57
Found Yes
Hash 965009046a80752c219ca20e4dd919a155e6120e8e9f75e2667f2075daee1137
SimHash 221819c9efc3

Groups

*

Rule Path
Allow /index.php/Special%3ABusinessSearch
Allow /wiki/Special%3ABusinessSearch
Allow /api.php?action=businesscarousel&format=xml
Allow /wiki/Special%3AStaticMap
Disallow /index.php?
Disallow /api.php
Disallow /*feed%3Drss
Disallow /*action%3Dedit
Disallow /*action%3Dhistory
Disallow /*action%3Ddelete
Disallow /*action%3Dwatch
Disallow /index.php/Help
Disallow /index.php/MediaWiki
Disallow /index.php/Special%3A
Disallow /index.php/User%3A
Disallow /index.php/Talk%3A
Disallow /index.php/Template%3A
Disallow /index.php/Form%3A
Disallow /wiki/Help
Disallow /wiki/MediaWiki
Disallow /wiki/Special%3A
Disallow /wiki/User%3A
Disallow /wiki/Talk%3A
Disallow /wiki/Template%3A
Disallow /wiki/Form%3A
Disallow /index.php/Especial%3A
Disallow /index.php/Usuario%3A
Disallow /index.php/Plantilla
Disallow /index.php/Discusi%C3%B3n%3A
Disallow /wiki/Especial%3A
Disallow /wiki/Usuario%3A
Disallow /wiki/Plantilla
Disallow /wiki/Discusi%C3%B3n%3A
Disallow /index.php/Spezial%3A
Disallow /index.php/Benutzer%3A
Disallow /index.php/Vorlage
Disallow /index.php/Benutzer_Diskussion%3A
Disallow /wiki/Spezial%3A
Disallow /wiki/Benutzer%3A
Disallow /wiki/Vorlage
Disallow /wiki/Benutzer_Diskussion%3A
Disallow /index.php/%D0%A1%D0%BB%D1%83%D0%B6%D0%B5%D0%B1%D0%BD%D0%B0%D1%8F%3A
Disallow /index.php/%D0%A3%D1%87%D0%B0%D1%81%D1%82%D0%BD%D0%B8%D0%BA%3A
Disallow /index.php/%D0%9E%D0%B1%D1%81%D1%83%D0%B6%D0%B4%D0%B5%D0%BD%D0%B8%D0%B5%3A
Disallow /index.php/%D0%A8%D0%B0%D0%B1%D0%BB%D0%BE%D0%BD%3A
Disallow /index.php/%D0%A4%D0%BE%D1%80%D0%BC%D0%B0%3A
Disallow /wiki/%D0%A1%D0%BB%D1%83%D0%B6%D0%B5%D0%B1%D0%BD%D0%B0%D1%8F%3A
Disallow /wiki/%D0%A3%D1%87%D0%B0%D1%81%D1%82%D0%BD%D0%B8%D0%BA%3A
Disallow /wiki/%D0%9E%D0%B1%D1%81%D1%83%D0%B6%D0%B4%D0%B5%D0%BD%D0%B8%D0%B5%3A
Disallow /wiki/%D0%A8%D0%B0%D0%B1%D0%BB%D0%BE%D0%BD%3A
Disallow /wiki/%D0%A4%D0%BE%D1%80%D0%BC%D0%B0%3A
Disallow /index.php/*%D8%AE%D8%A7%D8%B5%3A*
Disallow /index.php/*%3A%D9%85%D8%B3%D8%AA%D8%AE%D8%AF%D9%85*
Disallow /index.php/*%D9%86%D9%82%D8%A7%D8%B4%3A*
Disallow /index.php/*%D9%82%D8%A7%D9%84%D8%A8%3A*
Disallow /index.php/*%D8%A7%D8%B3%D8%AA%D9%85%D8%A7%D8%B1%D8%A9%3A*
Disallow /wiki/*%D8%AE%D8%A7%D8%B5%3A*
Disallow /wiki/*%3A%D9%85%D8%B3%D8%AA%D8%AE%D8%AF%D9%85*
Disallow /wiki/*%D9%86%D9%82%D8%A7%D8%B4%3A*
Disallow /wiki/*%D9%82%D8%A7%D9%84%D8%A8%3A*
Disallow /wiki/*%D8%A7%D8%B3%D8%AA%D9%85%D8%A7%D8%B1%D8%A9%3A*
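The `*` group mixes Allow and Disallow rules whose paths overlap (for example `/wiki/Special%3ABusinessSearch` against `/wiki/Special%3A`). A minimal sketch of how a crawler might resolve such overlaps follows, assuming the longest-match precedence described in RFC 9309, with Allow winning ties. The `RULES` list is a small subset copied from the table above; `pattern_to_regex` and `is_allowed` are hypothetical helper names, and the sketch compares paths literally without normalizing percent-encoding, which a real crawler would need to do.

```python
import re

# A small subset of the "*" group above, in (rule, path) form.
RULES = [
    ("allow", "/wiki/Special%3ABusinessSearch"),
    ("allow", "/api.php?action=businesscarousel&format=xml"),
    ("disallow", "/api.php"),
    ("disallow", "/wiki/Special%3A"),
    ("disallow", "/*action%3Dedit"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern ('*' wildcard, optional '$' end anchor)."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under longest-match precedence."""
    best = None  # (pattern length, is_allow); tuple order makes Allow win ties
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # A path matched by no rule is allowed by default.
    return True if best is None else best[1]


if __name__ == "__main__":
    for path in (
        "/wiki/Special%3ABusinessSearch",
        "/wiki/Special%3ARecentChanges",
        "/api.php?action=businesscarousel&format=xml",
        "/api.php?action=query",
    ):
        print(path, "->", "allowed" if is_allowed(path) else "disallowed")
```

Under these assumptions the longer Allow paths carve exceptions out of the broader `/wiki/Special%3A` and `/api.php` Disallow rules, which appears to be the intent of listing them in the same group.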

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

wget

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

hmse_robot

Rule Path
Disallow /

xovibot

Rule Path
Disallow /
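Each named group above blocks one user agent from the whole site, while every other crawler falls back to the `*` group. The sketch below illustrates that selection with Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is a reconstruction from this report and not the site's actual file, and `ExampleBot/1.0` and the `/wiki/Example` path are hypothetical. Only prefix rules are used because `urllib.robotparser` does not interpret `*` wildcards inside paths.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in this report (an assumption);
# only two of the blocked agents and one "*" rule are included.
ROBOTS_TXT = """\
User-agent: *
Disallow: /api.php

User-agent: wget
Disallow: /

User-agent: httrack
Disallow: /
"""


def can_fetch(user_agent, url, robots_txt=ROBOTS_TXT):
    """Parse the robots.txt text and ask whether `user_agent` may fetch `url`."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    base = "https://www.wikisexguide.com"
    # Agents with their own group are blocked everywhere.
    print(can_fetch("Wget/1.21", base + "/wiki/Example"))
    # An agent without its own group falls back to the "*" group.
    print(can_fetch("ExampleBot/1.0", base + "/wiki/Example"))
    print(can_fetch("ExampleBot/1.0", base + "/api.php"))
```

The parser matches group names case-insensitively against the part of the user agent string before the first `/`, so `Wget/1.21` is matched by the `wget` group.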

Other Records

Field Value
sitemap https://www.wikisexguide.com/pro-sitemaps-4099974.php

Comments

  • Crawlers that are kind enough to obey, but which we'd rather not have unless they're feeding search engines.
  • Some bots are known to be trouble, particularly those designed to copy entire sites. Please obey robots.txt.
  • Sorry, wget in its recursive mode is a frequent problem. Please read the man page and use it properly; there is a --wait option you can use to set the delay between hits, for instance.
  • The 'grub' distributed client has been *very* poorly behaved.
  • Doesn't follow robots.txt anyway, but...
  • Hits many times per second, not acceptable: http://www.nameprotect.com/botinfo.html
  • A capture bot, downloads gazillions of pages with no public benefit: http://www.webreaper.net/