Robots Exclusion Standard data for iptoasn.com

Resource Scan

Scan Details

Site Domain iptoasn.com
Base Domain iptoasn.com
Scan Status Ok
Last Scan 2025-12-06T09:53:23+00:00
Next Scan 2025-12-20T09:53:23+00:00

Last Scan

Scanned 2025-12-06T09:53:23+00:00
URL https://iptoasn.com/robots.txt
Domain IPs 104.21.86.94, 172.67.217.105, 2606:4700:3030::ac43:d969, 2606:4700:3034::6815:565e
Response IP 104.21.86.94
Found Yes
Hash 17eafb8b800a3ba6c9cece8dcb61f93288591c373a2365a5c277bafdb1f9eb65
SimHash 42c4d8d0c657
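The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The scan does not state the algorithm or exactly which bytes were hashed, so the sketch below only assumes that it is the SHA-256 of the raw response body. The helper name body_matches_recorded_hash is hypothetical.

```python
import hashlib

# Hash value copied from the scan. ASSUMPTION: it is a SHA-256 hex digest
# of the raw robots.txt response body; the scan does not confirm this.
RECORDED_HASH = "17eafb8b800a3ba6c9cece8dcb61f93288591c373a2365a5c277bafdb1f9eb65"


def body_matches_recorded_hash(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Return True if the SHA-256 hex digest of `body` equals `recorded`."""
    return hashlib.sha256(body).hexdigest() == recorded.lower()
```

If a freshly fetched body no longer matches, the file may have changed since the scan, or the assumption about the hash algorithm may simply be wrong.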

Groups

*

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

ia_archiver

Rule Path
Allow /

*

Rule Path
Disallow /data/*.gz$

Other Records

Field Value
sitemap https://iptoasn.com/sitemap.xml

Comments

  • IPtoASN robots.txt
  • Sitemap location
  • Crawl-delay for polite crawling
  • Allow archiving
  • Disallow heavy downloads for bots (they should use direct links)
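
The groups listed above can be evaluated with a small matcher. This is a minimal sketch, not the scanner's or any crawler's actual implementation. It assumes RFC 9309 style matching: groups with the same user-agent are merged, "*" matches any run of characters, a trailing "$" anchors the end of the path, the longest matching pattern wins, and Allow wins ties. User-agent selection is simplified to an exact, case-insensitive lookup. The path /data/example.tsv.gz is a hypothetical example, not a file taken from the scan.

```python
import re

# Rules as listed in the scan. The two "*" groups are merged into one,
# which is an assumption based on RFC 9309 behaviour.
GROUPS = {
    "*": [("allow", "/"), ("disallow", "/data/*.gz$")],
    "ia_archiver": [("allow", "/")],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # "*" becomes ".*"; every other character is matched literally.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(user_agent, path):
    """Return True if `path` may be fetched by `user_agent`."""
    # A named group replaces the "*" group entirely; it does not inherit from it.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best = None  # (pattern length, is_allow); longer wins, Allow wins ties
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # With no matching rule, the path is allowed by default.
    return True if best is None else best[1]


print(is_allowed("*", "/"))                                # True
print(is_allowed("*", "/data/example.tsv.gz"))             # False
print(is_allowed("ia_archiver", "/data/example.tsv.gz"))   # True
```

Under these assumptions, generic crawlers are kept away from the .gz files under /data/, while ia_archiver, which has its own group containing only Allow /, is not. Python's standard urllib.robotparser does not interpret "*" or "$" inside paths, which is why the sketch uses its own matcher.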