schedule.fightmag.com
robots.txt

Robots Exclusion Standard data for schedule.fightmag.com

Resource Scan

Scan Details

Site Domain schedule.fightmag.com
Base Domain fightmag.com
Scan Status Ok
Last Scan 2024-09-14T22:10:53+00:00
Next Scan 2024-10-14T22:10:53+00:00

Last Scan

Scanned 2024-09-14T22:10:53+00:00
URL https://schedule.fightmag.com/robots.txt
Domain IPs 104.21.61.48, 172.67.206.6, 2606:4700:3035::6815:3d30, 2606:4700:3037::ac43:ce06
Response IP 104.21.61.48
Found Yes
Hash 36d4b669f527f21d9a611ec6400108cfa6f69e46379020be2ba34a3b134555fd
SimHash cfd7d528d491
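
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm. The sketch below assumes SHA-256 over the raw response body and shows how a locally fetched copy could be compared against the reported value. The function names body_digest and matches_report are hypothetical, and a mismatch may only mean that the file changed after the scan or that the scanner hashes something other than the raw bytes.

```python
import hashlib

# Hash value as shown in the scan report above.
REPORTED_HASH = "36d4b669f527f21d9a611ec6400108cfa6f69e46379020be2ba34a3b134555fd"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body.

    Assumption: the scanner hashes the raw bytes with SHA-256; the report
    itself does not say which algorithm or normalization it uses.
    """
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a locally computed digest with the reported hash."""
    return body_digest(body) == reported.lower()
```

To use it, fetch https://schedule.fightmag.com/robots.txt with any HTTP client and pass the unmodified response bytes to matches_report.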

Groups

*

Rule Path
Disallow /readme.html
Disallow /cgi-bin/
Disallow /wp-json/
Disallow /?s=*
Disallow /search/*

nuclei
wikido
riddler
petalbot
zoominfobot
go-http-client
node/simplecrawler
cazoodlebot
dotbot/1.0
gigabot
barkrowler
blexbot
magpie-crawler

Rule Path
Disallow /
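
The two groups above can be checked programmatically. The following is a minimal sketch that rebuilds a robots.txt from the listed groups and queries it with Python's standard urllib.robotparser. The rebuilt text is an assumption: the report shows only the parsed groups, not the original file's layout, ordering, or comments. The agent name "examplebot" is hypothetical and stands for any crawler not named in the second group.

```python
from urllib.robotparser import RobotFileParser

BASE = "https://schedule.fightmag.com"

# Rules for the "*" group, as listed in the report.
WILDCARD_RULES = ["/readme.html", "/cgi-bin/", "/wp-json/", "/?s=*", "/search/*"]

# User agents that share the second group (Disallow /).
BLOCKED_AGENTS = [
    "nuclei", "wikido", "riddler", "petalbot", "zoominfobot",
    "go-http-client", "node/simplecrawler", "cazoodlebot", "dotbot/1.0",
    "gigabot", "barkrowler", "blexbot", "magpie-crawler",
]


def build_robots_txt() -> str:
    """Rebuild an equivalent robots.txt (assumed layout, not the original)."""
    lines = ["User-agent: *"]
    lines += [f"Disallow: {path}" for path in WILDCARD_RULES]
    lines.append("")  # a blank line separates groups
    lines += [f"User-agent: {agent}" for agent in BLOCKED_AGENTS]
    lines.append("Disallow: /")
    return "\n".join(lines)


def make_parser() -> RobotFileParser:
    """Parse the rebuilt rules without any network access."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt().splitlines())
    return parser


if __name__ == "__main__":
    parser = make_parser()
    for agent, path in [
        ("petalbot", "/"),
        ("examplebot", "/wp-json/wp/v2/posts"),
        ("examplebot", "/events/"),
    ]:
        print(agent, path, parser.can_fetch(agent, BASE + path))
```

The checks use plain path prefixes only. As far as I know, the standard-library parser does not implement `*` wildcard matching inside paths, so rules such as /?s=* and /search/* may be evaluated differently than by crawlers that follow RFC 9309. It also compares only the part of a user-agent string before the first "/", so entries such as dotbot/1.0 may not match there as intended.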

Other Records

Field Value
sitemap https://schedule.fightmag.com/sitemap_index.xml