gethearth.com
robots.txt

Robots Exclusion Standard data for gethearth.com

Resource Scan

Scan Details

Site Domain gethearth.com
Base Domain gethearth.com
Scan Status Ok
Last Scan2024-04-22T11:47:36+00:00
Next Scan 2024-05-22T11:47:36+00:00

Last Scan

Scanned2024-04-22T11:47:36+00:00
URL https://www.gethearth.com/robots.txt
Domain IPs 13.226.228.20, 13.226.228.5, 13.226.228.6, 13.226.228.61, 2600:9000:24bc:1200:e:a412:1400:93a1, 2600:9000:24bc:1600:e:a412:1400:93a1, 2600:9000:24bc:2400:e:a412:1400:93a1, 2600:9000:24bc:6e00:e:a412:1400:93a1, 2600:9000:24bc:8a00:e:a412:1400:93a1, 2600:9000:24bc:9a00:e:a412:1400:93a1, 2600:9000:24bc:a000:e:a412:1400:93a1, 2600:9000:24bc:be00:e:a412:1400:93a1
Response IP 18.165.171.21
Found Yes
Hash 410e12dae9d4e493810d6790cd215fb301a5a1c2c2cf427f0d238bb35b27c1bd
SimHash 29250280ec5a

Groups

*

Rule Path
Disallow /roi-pages/
Disallow /mb/
Disallow /past-roi/
Disallow /partners/
Disallow /fb/
Disallow /v1/
Disallow /web-email/

Other Records

Field Value
sitemap https://www.gethearth.com/sitemap_index.xml

Comments

  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK