grossarchive.com
robots.txt

Robots Exclusion Standard data for grossarchive.com

Resource Scan

Scan Details

Site Domain grossarchive.com
Base Domain grossarchive.com
Scan Status Ok
Last Scan2024-11-08T05:37:14+00:00
Next Scan 2024-11-15T05:37:14+00:00

Last Scan

Scanned2024-11-08T05:37:14+00:00
URL https://grossarchive.com/robots.txt
Domain IPs 104.21.36.67, 172.67.186.197, 2606:4700:3030::6815:2443, 2606:4700:3036::ac43:bac5
Response IP 172.67.186.197
Found Yes
Hash b4254b55b5d653bb5262594adc9456a45abc4791a5c4e90b46d538b63e38caa9
SimHash 632c5b45899a

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /akara/

Other Records

Field Value
sitemap https://www.grossarchive.com/sitemap.xml

Comments

  • Blocks robots from specific folders / directories