scanharga.com
robots.txt

Robots Exclusion Standard data for scanharga.com

Resource Scan

Scan Details

Site Domain scanharga.com
Base Domain scanharga.com
Scan Status Ok
Last Scan2024-09-27T18:47:27+00:00
Next Scan 2024-10-04T18:47:27+00:00

Last Scan

Scanned2024-09-27T18:47:27+00:00
URL https://scanharga.com/robots.txt
Redirect https://www.scanharga.com/robots.txt
Redirect Domain www.scanharga.com
Redirect Base scanharga.com
Domain IPs 216.239.32.21, 216.239.34.21, 216.239.36.21, 216.239.38.21
Redirect IPs 172.253.118.121, 2404:6800:4003:c01::79
Response IP 142.251.12.121
Found Yes
Hash 2f2496ea6b981b82335baf039e9e114adf812d57219dd43a657e2dc89d715f94
SimHash 480818400eb0

Groups

*

Rule Path
Disallow /p/pri.html
Disallow /p/term-of-service.html
Disallow /search?updated-min=
Disallow /search?updated-max=
Disallow /search/label/*?updated-min=
Disallow /search/label/*?updated-max=
Allow /

Other Records

Field Value
sitemap https://www.scanharga.com/atom.xml?redirect=false&start-index=1&max-results=500