spirol.com
robots.txt

Robots Exclusion Standard data for spirol.com

Resource Scan

Scan Details

Site Domain spirol.com
Base Domain spirol.com
Scan Status Ok
Last Scan2025-08-12T09:14:27+00:00
Next Scan 2025-09-11T09:14:27+00:00

Last Scan

Scanned2025-08-12T09:14:27+00:00
URL https://spirol.com/robots.txt
Redirect https://www.spirol.com/robots.txt
Redirect Domain www.spirol.com
Redirect Base spirol.com
Domain IPs 104.19.232.38, 104.19.233.38
Redirect IPs 104.19.232.38, 104.19.233.38, 2606:4700::6813:e826, 2606:4700::6813:e926
Response IP 104.19.232.38
Found Yes
Hash 65efcbdec53e18ffdbef4b9ea1bf2ad5f8bc8145dd4081db0d939fe756d80345
SimHash 51164186ea19

Groups

baiduspider

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

*

Rule Path
Allow /
Disallow /connectors/
Disallow /core/
Disallow /manager/
Disallow /search-results/
Disallow /pdf-viewer/
Disallow /build/*
Disallow /web/*
Disallow /wcc/
Disallow /hidden-pdfs/

Other Records

Field Value
sitemap https://www.spirol.com/sitemap.xml

Comments

  • Default modx exclusions
  • For sitemaps.xml autodiscovery. Uncomment if you have one:

Warnings

  • 2 invalid lines.