gethearth.com
robots.txt
Robots Exclusion Standard data for gethearth.com
Resource Scan
Scan Details
Site Domain | gethearth.com |
Base Domain | gethearth.com |
Scan Status | Ok |
Last Scan | 2024-04-22T11:47:36+00:00 |
Next Scan | 2024-05-22T11:47:36+00:00 |
Last Scan
Scanned | 2024-04-22T11:47:36+00:00 |
URL | https://www.gethearth.com/robots.txt |
Domain IPs | 13.226.228.20, 13.226.228.5, 13.226.228.6, 13.226.228.61, 2600:9000:24bc:1200:e:a412:1400:93a1, 2600:9000:24bc:1600:e:a412:1400:93a1, 2600:9000:24bc:2400:e:a412:1400:93a1, 2600:9000:24bc:6e00:e:a412:1400:93a1, 2600:9000:24bc:8a00:e:a412:1400:93a1, 2600:9000:24bc:9a00:e:a412:1400:93a1, 2600:9000:24bc:a000:e:a412:1400:93a1, 2600:9000:24bc:be00:e:a412:1400:93a1 |
Response IP | 18.165.171.21 |
Found | Yes |
Hash | 410e12dae9d4e493810d6790cd215fb301a5a1c2c2cf427f0d238bb35b27c1bd |
SimHash | 29250280ec5a |
Groups
*
Rule | Path |
---|---|
Disallow | /roi-pages/ |
Disallow | /mb/ |
Disallow | /past-roi/ |
Disallow | /partners/ |
Disallow | /fb/ |
Disallow | /v1/ |
Disallow | /web-email/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.gethearth.com/sitemap_index.xml |
Comments