guillens.com
robots.txt

Robots Exclusion Standard data for guillens.com

Resource Scan

Scan Details

Site Domain guillens.com
Base Domain guillens.com
Scan Status Ok
Last Scan2025-12-29T23:48:42+00:00
Next Scan 2026-01-28T23:48:42+00:00

Last Scan

Scanned2025-12-29T23:48:42+00:00
URL https://www.guillens.com/robots.txt
Domain IPs 100.51.69.63, 98.85.182.23
Response IP 98.85.182.23
Found Yes
Hash 939441bd17f13f4e09d4b91d2caa061b68ce51f70d87170dccaeb7b7e658f4fc
SimHash 621bd55182c1

Groups

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

bingbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 20

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

swish-e

Rule Path
Disallow /

tagoobot

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

superpagesbot

Rule Path
Disallow /

superpagesurlverifybot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

zoomspider

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

*

Rule Path
Allow /

Other Records

Field Value
crawl-delay 15

*

Rule Path
Disallow /silver/*.jsp

*

Rule Path
Disallow /custom/*.jsp

*

Rule Path
Disallow /api/*.jsp

*

Rule Path
Disallow /theme/*/*.jsp

Other Records

Field Value
sitemap https://www.guillens.com/sitemap.xml

Warnings

  • 2 invalid lines.