guildsofwow.com
robots.txt

Robots Exclusion Standard data for guildsofwow.com

Resource Scan

Scan Details

Site Domain guildsofwow.com
Base Domain guildsofwow.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 5/23/2025, 7:39:14 AM
Next Scan 6/22/2025, 7:39:14 AM
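
The failure reason above is a client error while fetching robots.txt. As a rough illustration of how a crawler might interpret that outcome, the sketch below maps an HTTP status code to a crawl policy following the categories in RFC 9309 (4xx is "unavailable", 5xx is "unreachable"). The function name and the returned labels are hypothetical, and the scan does not record the exact status code that was returned.

```python
def policy_for_status(status: int) -> str:
    """Map the HTTP status of a /robots.txt fetch to a crawl policy.

    Hypothetical helper; the categories follow RFC 9309.
    """
    if 200 <= status < 300:
        return "parse"         # use the rules in the fetched file
    if 400 <= status < 500:
        return "allow_all"     # "unavailable": crawler may access any resource
    if 500 <= status < 600:
        return "disallow_all"  # "unreachable": assume complete disallow
    return "undefined"         # e.g. redirects, which must be followed first


# A client error such as 403 or 404 falls into the "unavailable" category.
print(policy_for_status(403))
```

Individual crawlers may deviate from this; some treat 401 and 403 more conservatively than other 4xx codes.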

Last Successful Scan

Scanned 4/17/2025, 7:38:25 AM
URL https://guildsofwow.com/robots.txt
Domain IPs 104.26.8.143, 104.26.9.143, 172.67.75.22, 2606:4700:20::681a:88f, 2606:4700:20::681a:98f, 2606:4700:20::ac43:4b16
Response IP 104.26.9.143
Found Yes
Hash c0f2085eca0272e7f779a25dc3f746341ba3da3764cad206f34707e1a5759d4b
SimHash 8369b1770b47
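
The recorded hash is 64 hexadecimal characters long, which is consistent with SHA-256, although the report does not name the algorithm. Assuming it is a SHA-256 digest of the raw response body, a change check could be sketched as follows. The function names are hypothetical.

```python
import hashlib

# Hash value copied from the last successful scan above.
RECORDED_HASH = "c0f2085eca0272e7f779a25dc3f746341ba3da3764cad206f34707e1a5759d4b"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """True if the body no longer matches the recorded digest.

    Assumption: the scanner hashes the exact bytes of the response body.
    """
    return body_digest(body) != recorded.lower()
```

If the scanner normalizes line endings or whitespace before hashing, the digests will not match even for an unchanged file.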

Groups

*

Rule Path
Disallow /manage/
Disallow /manage/*
Disallow /api/*
Disallow */api/*
Disallow /recruit-create-alert
Disallow /character/*
Disallow */character/*
Disallow /upcoming-event/*
Disallow */upcoming-event/*
Disallow */roster
Disallow */roster*
Disallow */roster/*
Disallow */roster-by-account
Disallow */roster-by-account*
Disallow */roster-by-account/*
Disallow */audit-roster
Disallow */audit-roster*
Disallow */audit-roster/*
Disallow */guild-report
Disallow */guild-report*
Disallow */guild-report/*
Disallow */keystones
Disallow */keystones*
Disallow */keystones/*
Disallow /my-guilds
Disallow /welcome
Disallow /welcome-back
Disallow */discord
Disallow */youtube
Disallow */facebook
Disallow */twitch
Disallow */twitter
Disallow */instagram
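
Many of the rules above rely on the `*` wildcard. A minimal sketch of how such patterns can be evaluated is shown below, assuming the common interpretation in which `*` matches any sequence of characters, a trailing `$` anchors the end of the path, and every rule is otherwise a prefix match. The helper names and the sample paths are hypothetical, and only a subset of the rules is included.

```python
import re


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal parts and let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path: str, rules: list[str]) -> bool:
    """True if any Disallow pattern matches the start of the path."""
    return any(rule_to_regex(rule).match(path) for rule in rules)


# A subset of the Disallow rules from the "*" group.
RULES = ["/manage/", "/api/*", "*/character/*", "*/roster", "/my-guilds", "*/discord"]

# Hypothetical paths, used only to exercise the patterns.
print(is_disallowed("/manage/settings", RULES))
print(is_disallowed("/eu/some-guild/roster", RULES))
print(is_disallowed("/eu/some-guild", RULES))
```

Under prefix matching, `*/roster` already covers everything that `*/roster*` and `*/roster/*` match, so the extra variants in the file appear to be redundant rather than harmful.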

twitterbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10
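
Assuming the `crawl-delay 10` record belongs to the `twitterbot` group (the report does not make the grouping explicit), the group can be reconstructed and queried with Python's standard `urllib.robotparser`. The robots.txt fragment below is a reconstruction from this report, not the original file.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed fragment (assumption): a twitterbot group with only a crawl delay.
ROBOTS_SNIPPET = """\
User-agent: twitterbot
Crawl-delay: 10
"""

parser = RobotFileParser()
parser.parse(ROBOTS_SNIPPET.splitlines())

# Delay in seconds requested for this agent, or None if no delay is set.
delay = parser.crawl_delay("Twitterbot")

# With no Allow/Disallow rules in its group, every path is allowed for the agent.
allowed = parser.can_fetch("Twitterbot", "https://guildsofwow.com/manage/")

print(delay, allowed)
```

Because a crawler uses the most specific group that names it, a `twitterbot` group without rules means the `*` group's Disallow rules do not apply to that agent.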

rogerbot
exabot
mj12bot
dotbot
gigabot
ahrefsbot
blackwidow
chinaclaw
custo
disco
download\ demon
ecatch
eirgrabber
emailsiphon
emailwolf
express\ webpictures
extractorpro
eyenetie
flashget
getright
getweb!
go!zilla
go-ahead-got-it
grabnet
grafula
hmview
httrack
image\ stripper
image\ sucker
indy\ library
interget
internet\ ninja
jetcar
joc\ web\ spider
larbin
leechftp
mass\ downloader
midown\ tool
mister\ pix
navroad
nearsite
netants
netspider
net\ vampire
netzip
octopus
offline\ explorer
offline\ navigator
pagegrabber
papa\ foto
pavuk
pcbrowser
realdownload
reget
semrushbot
sitesnagger
smartdownload
superbot
superhttp
surfbot
takeout
teleport\ pro
voideye
web\ image\ collector
web\ sucker
webauto
webcopier
webfetch
webgo\ is
webleacher
webreaper
websauger
website\ extractor
website\ quester
webstripper
webwhacker
webzip
wget
widow
wwwoffle
xaldon\ webspider
zeus

Rule Path
Disallow /
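
The effect of this group can be checked with Python's standard `urllib.robotparser`. The sketch below uses a shortened reconstruction of the file with three of the listed agents and one rule from the `*` group; the fragment and the `SomeOtherBot` agent name are assumptions made for illustration.

```python
from urllib.robotparser import RobotFileParser

# Shortened reconstruction (assumption): several agents share one "Disallow: /" rule.
ROBOTS_SNIPPET = """\
User-agent: *
Disallow: /manage/

User-agent: ahrefsbot
User-agent: semrushbot
User-agent: wget
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_SNIPPET.splitlines())

home = "https://guildsofwow.com/"

# Agents named in the blocked group may not fetch anything.
blocked = parser.can_fetch("AhrefsBot", home)

# Unlisted agents fall back to the "*" group.
fallback_home = parser.can_fetch("SomeOtherBot", home)
fallback_manage = parser.can_fetch("SomeOtherBot", home + "manage/")

print(blocked, fallback_home, fallback_manage)
```

Note that `urllib.robotparser` matches paths by plain prefix and does not interpret the `*` wildcard inside paths, so it is only suitable for literal rules such as the ones in this fragment.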

Other Records

Field Value
sitemap https://guildsofwow.com/sitemap.xml