thecw.fandom.com
robots.txt

Robots Exclusion Standard data for thecw.fandom.com

Resource Scan

Scan Details

Site Domain thecw.fandom.com
Base Domain fandom.com
Scan Status Ok
Last Scan 2025-10-04T05:44:36+00:00
Next Scan 2025-10-18T05:44:36+00:00

Last Scan

Scanned 2025-10-04T05:44:36+00:00
URL https://thecw.fandom.com/robots.txt
Domain IPs 199.232.208.194, 199.232.212.194
Response IP 199.232.212.194
Found Yes
Hash 5789b04b8b343e9d5b477c998c741a3b480d72b32717d7c4fa2458b8519095d3
SimHash 611c9ad14385

Groups

semrushbot

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

*

Rule Path
Allow /api.php?
Allow /api.php?action=
Allow /api.php?*&action=
Allow /wiki/Special%3ACreateNewWiki
Allow /wiki/Special%3AAllMaps
Disallow /wiki/Special%3A
Disallow /wiki/User_talk%3A
Disallow /wiki/Template%3A
Disallow /wiki/Template_talk%3A
Disallow /wiki/Help%3A
Disallow /wiki/User%3A
Disallow /wiki/UserProfile%3A

ias_crawler

No rules defined. All paths allowed.
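
The groups above can be checked against a given path with a small matcher. The following is a minimal sketch, not the scanner's own logic: it assumes the longest-match rule described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie), treats `*` as a wildcard and a trailing `$` as an end anchor, and compares paths exactly as written, without normalizing `:` against `%3A`. The names `STAR_GROUP`, `BLOCKED_GROUP`, and `is_allowed` are hypothetical, and the example paths are illustrative only.

```python
import re

# Rules of the "*" group, copied from the listing above.
STAR_GROUP = [
    ("allow", "/api.php?"),
    ("allow", "/api.php?action="),
    ("allow", "/api.php?*&action="),
    ("allow", "/wiki/Special%3ACreateNewWiki"),
    ("allow", "/wiki/Special%3AAllMaps"),
    ("disallow", "/wiki/Special%3A"),
    ("disallow", "/wiki/User_talk%3A"),
    ("disallow", "/wiki/Template%3A"),
    ("disallow", "/wiki/Template_talk%3A"),
    ("disallow", "/wiki/Help%3A"),
    ("disallow", "/wiki/User%3A"),
    ("disallow", "/wiki/UserProfile%3A"),
]

# Groups such as semrushbot, gptbot, or google-extended carry one rule.
BLOCKED_GROUP = [("disallow", "/")]


def _pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal parts; each '*' becomes "match anything".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules):
    """Return True if `path` may be fetched under `rules`.

    The longest matching pattern decides; Allow wins a tie.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        # re.match anchors at the start, so patterns act as prefixes.
        if _pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


if __name__ == "__main__":
    for path in ("/wiki/Arrow", "/wiki/Special%3ASearch", "/wiki/Special%3AAllMaps"):
        print(path, is_allowed(path, STAR_GROUP))
```

Under these assumptions, `/wiki/Special%3AAllMaps` is allowed because its Allow pattern is longer than the `/wiki/Special%3A` Disallow pattern, while any other `Special%3A` page is blocked. A group with no rules, such as ias_crawler, corresponds to an empty rule list, for which every path is allowed.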

Other Records

Field Value
crawl-delay 1
sitemap https://thecw.fandom.com/sitemap-newsitemapxml-index.xml

Warnings

  • `noindex` is not a known field.