Robots Exclusion Standard data for dcuwiki.net

Resource Scan

Scan Details

Site Domain dcuwiki.net
Base Domain dcuwiki.net
Scan Status Ok
Last Scan 2025-11-21T04:22:03+00:00
Next Scan 2025-12-21T04:22:03+00:00

Last Scan

Scanned 2025-11-21T04:22:03+00:00
URL https://dcuwiki.net/robots.txt
Domain IPs 198.46.91.127
Response IP 198.46.91.127
Found Yes
Hash 4dc579e762dcb144917dae8c11ece9f7e76894644cc48b22939b5e019e3ab5d8
SimHash 4a58d8a16611

Groups

*

Rule Path
Disallow /forums/
Disallow /who.php
Disallow /chronology.php
Disallow /w/Batman%3A_The_Detective_Title_Index
Disallow /w/Batman_Vol._3_Title_Index
Disallow /w/I_Am_Batman_Title_Index
Disallow /cgi-bin/
Disallow /wiki/
Disallow /wiki2/
Disallow /guide2wiki/
Disallow /forums/
Allow /sitemap/
Allow /sitemap.xml

Other Records

Field Value
crawl-delay 200
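
As a rough illustration of how the "*" group above might be evaluated, the sketch below feeds a reconstruction of it to Python's standard urllib.robotparser. The robots.txt text is rebuilt from the rule list in this report and is not the verbatim file; the percent-encoded /w/... title-index rules are left out to keep the sketch free of encoding edge cases, and the agent name "examplebot" is hypothetical. Note that urllib.robotparser applies the first matching rule, which may differ from the longest-match behaviour of other parsers.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "*" group in this report; NOT the verbatim file.
# The percent-encoded /w/... title-index rules are intentionally omitted.
WILDCARD_GROUP = """\
User-agent: *
Disallow: /forums/
Disallow: /who.php
Disallow: /chronology.php
Disallow: /cgi-bin/
Disallow: /wiki/
Disallow: /wiki2/
Disallow: /guide2wiki/
Allow: /sitemap/
Allow: /sitemap.xml
Crawl-delay: 200
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(WILDCARD_GROUP)
AGENT = "examplebot"  # hypothetical crawler name, matched only by "*"

for path in ("/forums/index.php", "/wiki/Batman", "/sitemap.xml", "/w/Superman"):
    allowed = parser.can_fetch(AGENT, "https://dcuwiki.net" + path)
    print(f"{path}: {'allowed' if allowed else 'disallowed'}")

# Crawl-delay is reported in seconds for the matching group.
print("crawl-delay:", parser.crawl_delay(AGENT))
```

Paths not covered by any rule, such as /w/Superman in this sketch, fall through to the default of being allowed.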

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

googlebot
googlebot-image
mediapartners-google
msnbot
msnbot-media
slurp
yahoo-blogs
yahoo-mmcrawler

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 100

ahrefsbot
alphaseobot
amazonbot
amazonbot/0.1
baiduspider
barkrowler
barkrowler
bytespider
dataforseobot/1.0
dotbot
friendlycrawler/1.0
gptbot
googleother
mj12bot
semrushbot
the knowledge ai
yandex
yandexbot
yandexbot/3.0

Rule Path
Disallow /

Other Records

Field Value
sitemap https://dcuguide.com/sitemap/sitemap.xml
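
The per-agent groups above can be sketched the same way. The example below is a minimal sketch, assuming an abbreviated reconstruction of the file: only a few agents from each group are included, the group order is a guess, and the text is not the verbatim robots.txt. It uses Python's standard urllib.robotparser (site_maps() needs Python 3.8 or later) to show which group a given crawler resolves to.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated reconstruction of the groups in this report; NOT the verbatim
# file. Group order and the subset of agents shown are assumptions.
GROUPED_RULES = """\
User-agent: *
Disallow: /forums/
Crawl-delay: 200

User-agent: bingbot
Crawl-delay: 10

User-agent: googlebot
User-agent: msnbot
Crawl-delay: 100

User-agent: gptbot
User-agent: ahrefsbot
Disallow: /

Sitemap: https://dcuguide.com/sitemap/sitemap.xml
"""

group_parser = RobotFileParser()
group_parser.parse(GROUPED_RULES.splitlines())

PAGE = "https://dcuwiki.net/forums/index.php"

# A named group replaces the "*" group entirely for that agent, so bingbot
# (no rules of its own) may fetch a path that "*" disallows.
for agent in ("bingbot", "googlebot", "gptbot", "examplebot"):
    print(
        agent,
        "can_fetch:", group_parser.can_fetch(agent, PAGE),
        "crawl-delay:", group_parser.crawl_delay(agent),
    )

# Sitemap records are global and not tied to any user-agent group.
print("sitemaps:", group_parser.site_maps())
```

Here "examplebot" is a hypothetical agent that matches no named group and therefore falls back to the "*" rules.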

Comments

  • but allow only important bots
  • Directories
  • Disallow certain spiders