allgazettes.com
robots.txt

Robots Exclusion Standard data for allgazettes.com

Resource Scan

Scan Details

Site Domain allgazettes.com
Base Domain allgazettes.com
Scan Status Ok
Last Scan2024-10-04T17:19:32+00:00
Next Scan 2024-10-11T17:19:32+00:00

Last Scan

Scanned2024-10-04T17:19:32+00:00
URL https://allgazettes.com/robots.txt
Redirect https://www.allgazettes.com/robots.txt
Redirect Domain www.allgazettes.com
Redirect Base allgazettes.com
Domain IPs 216.239.32.21, 216.239.34.21, 216.239.36.21, 216.239.38.21
Redirect IPs 2404:6800:4003:c1a::79, 64.233.170.121
Response IP 142.251.175.121
Found Yes
Hash df1e91b6b5f788ee1cf30eefa0cf782ba126f9df1e3b4ca7d1e5a477dbbe02d3
SimHash 49149d7047d3

Groups

*

Rule Path
Disallow /search
Allow /

Other Records

Field Value
sitemap https://www.allgazettes.com/atom.xml?redirect=false&start-index=1&max-results=1000