crmonce.com
robots.txt

Robots Exclusion Standard data for crmonce.com

Resource Scan

Scan Details

Site Domain crmonce.com
Base Domain crmonce.com
Scan Status Ok
Last Scan 2025-12-31T08:53:57+00:00
Next Scan 2026-01-30T08:53:57+00:00

Last Scan

Scanned 2025-12-31T08:53:57+00:00
URL https://crmonce.com/robots.txt
Domain IPs 157.173.220.90
Response IP 157.173.220.90
Found Yes
Hash a373f7e0c825c9567d7c077e486098c14b7da2530afce1934672345da2a8b74d
SimHash 1304da72f623

Groups

*

Rule Path
Allow /
Disallow /admin/
Disallow /private/
Disallow /api/
Allow /images/
Allow /assets/
Allow /css/
Allow /js/
Disallow /search?
Disallow /filter?
Disallow /sort?
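
The rules above mix a blanket "Allow /" with narrower Disallow entries, so the result for a given path depends on which rule takes precedence. The sketch below assumes longest-match precedence as described in RFC 9309 (the most specific matching path wins, with Allow preferred on a tie). The names RULES and is_allowed are hypothetical and not part of the scan output, and wildcard handling is left out because none of the listed paths use "*" or "$".

```python
# Rules from the "*" group, as (directive, path) pairs in the listed order.
RULES = [
    ("allow", "/"),
    ("disallow", "/admin/"),
    ("disallow", "/private/"),
    ("disallow", "/api/"),
    ("allow", "/images/"),
    ("allow", "/assets/"),
    ("allow", "/css/"),
    ("allow", "/js/"),
    ("disallow", "/search?"),
    ("disallow", "/filter?"),
    ("disallow", "/sort?"),
]


def is_allowed(url_path: str, rules=RULES) -> bool:
    """Return True if url_path may be crawled under longest-match precedence.

    Assumption: plain prefix matching only; "*" and "$" are not interpreted.
    """
    best_len = -1
    allowed = True  # a path matched by no rule is allowed by default
    for directive, prefix in rules:
        if not url_path.startswith(prefix):
            continue
        is_allow = directive == "allow"
        # A longer matching prefix wins; on equal length, prefer allow.
        if len(prefix) > best_len or (len(prefix) == best_len and is_allow):
            best_len = len(prefix)
            allowed = is_allow
    return allowed


if __name__ == "__main__":
    for path in ("/admin/users", "/images/logo.png", "/search?q=crm", "/search"):
        print(path, is_allowed(path))
```

Under this reading, "/admin/users" and "/search?q=crm" are blocked, while "/images/logo.png" and a bare "/search" (no query string) remain crawlable. Parsers that apply the first matching rule in file order instead would stop at "Allow /" and treat every path as allowed, so the outcome depends on the crawler.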

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://crmonce.com/sitemap.xml
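
As a rough illustration of how a crawler might read the crawl-delay and sitemap records, the sketch below feeds a reconstructed file to Python's standard urllib.robotparser. The ROBOTS_TXT text is an assumption: it is rebuilt from the tables in this report, and the original file's exact layout, ordering, and comment lines are not shown here. Nothing is fetched from the network.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the report's tables;
# the real file's ordering and comments may differ.
ROBOTS_TXT = """\
User-agent: *
Allow: /
Disallow: /admin/
Disallow: /private/
Disallow: /api/
Allow: /images/
Allow: /assets/
Allow: /css/
Allow: /js/
Disallow: /search?
Disallow: /filter?
Disallow: /sort?
Crawl-delay: 1

Sitemap: https://crmonce.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())  # parse in-memory lines, no HTTP request

delay = parser.crawl_delay("*")   # seconds to wait between requests, or None
sitemaps = parser.site_maps()     # list of sitemap URLs, or None (Python 3.8+)

if __name__ == "__main__":
    print("crawl-delay:", delay)
    print("sitemaps:", sitemaps)
```

A crawler honoring these records would pause about one second between requests and could start URL discovery from the listed sitemap. Crawl-delay is not part of RFC 9309, so not every crawler respects it.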

Comments

  • Sitemap location
  • Crawl-delay for respectful crawling
  • Disallow admin or private areas (if any)
  • Allow important directories
  • Block common bot traps