goodmorningamerica.com
robots.txt

Robots Exclusion Standard data for goodmorningamerica.com

Resource Scan

Scan Details

Site Domain goodmorningamerica.com
Base Domain goodmorningamerica.com
Scan Status Ok
Last Scan2024-05-09T14:31:58+00:00
Next Scan 2024-05-16T14:31:58+00:00

Last Scan

Scanned2024-05-09T14:31:58+00:00
URL https://goodmorningamerica.com/robots.txt
Redirect https://www.goodmorningamerica.com/robots.txt
Redirect Domain www.goodmorningamerica.com
Redirect Base goodmorningamerica.com
Domain IPs 18.164.174.101, 18.164.174.13, 18.164.174.17, 18.164.174.42
Redirect IPs 2600:9000:2024:1c00:0:22c5:a180:93a1, 2600:9000:2024:5400:0:22c5:a180:93a1, 2600:9000:2024:6200:0:22c5:a180:93a1, 2600:9000:2024:6a00:0:22c5:a180:93a1, 2600:9000:2024:8000:0:22c5:a180:93a1, 2600:9000:2024:b600:0:22c5:a180:93a1, 2600:9000:2024:e200:0:22c5:a180:93a1, 2600:9000:2024:f600:0:22c5:a180:93a1, 65.9.112.21, 65.9.112.49, 65.9.112.76, 65.9.112.81
Response IP 3.160.246.20
Found Yes
Hash 4d53b47d782da4a639dc5ce5a111687cb195b916a000b9cca679829b9f557305
SimHash f14d0d4465d3

Groups

*

Rule Path
Disallow /alpha
Disallow /beta
Disallow /error
Disallow /groovity
Disallow /healthcheck
Disallow /staging
Disallow /test

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.goodmorningamerica.com/sitemap.xml
sitemap https://www.goodmorningamerica.com/lateststories.xml
sitemap https://www.goodmorningamerica.com/latestvideos.xml

Comments

  • robots.txt for https://www.goodmorningamerica.com/