canderecruising.com
robots.txt

Robots Exclusion Standard data for canderecruising.com

Resource Scan

Scan Details

Site Domain canderecruising.com
Base Domain canderecruising.com
Scan Status Ok
Last Scan2024-09-23T07:34:51+00:00
Next Scan 2024-10-23T07:34:51+00:00

Last Scan

Scanned2024-09-23T07:34:51+00:00
URL https://canderecruising.com/robots.txt
Redirect https://www.canderecruising.com/robots.txt
Redirect Domain www.canderecruising.com
Redirect Base canderecruising.com
Domain IPs 192.0.66.239, 2a04:fa87:fffd::c000:4230
Redirect IPs 192.0.66.239, 2a04:fa87:fffd::c000:42ef
Response IP 192.0.66.239
Found Yes
Hash 97cf689dd2f14078d9e8cc885c6d02d84bd0c4366d0baa4209f75d66945fcbd1
SimHash 8c27ce72c489

Groups

*

Rule Path
Allow /edit/wp-includes/js/
Disallow /edit/

megaindex.ru/2.0
megaindex.ru
megaindex.ru
mauibot (crawler.feedback+wc@gmail.com)
seekport crawler
blexbot
baiduspider
barkrowler
gigabot
go-http-client
nuclei
riddler
seznambot
wikido
yandex
zoominfobot
magpie-crawler

Rule Path
Disallow /
Disallow /fhbr-console/
Disallow /cdn-cgi/

Other Records

Field Value
sitemap https://www.canderecruising.com/sitemap.xml