goodmorningamerica.com
robots.txt

Robots Exclusion Standard data for goodmorningamerica.com

Resource Scan

Scan Details

Site Domain goodmorningamerica.com
Base Domain goodmorningamerica.com
Scan Status Ok
Last Scan2024-11-09T21:35:49+00:00
Next Scan 2024-11-16T21:35:49+00:00

Last Scan

Scanned2024-11-09T21:35:49+00:00
URL https://goodmorningamerica.com/robots.txt
Redirect https://www.goodmorningamerica.com/robots.txt
Redirect Domain www.goodmorningamerica.com
Redirect Base goodmorningamerica.com
Domain IPs 18.66.192.100, 18.66.192.109, 18.66.192.114, 18.66.192.35
Redirect IPs 18.66.196.109, 18.66.196.14, 18.66.196.22, 18.66.196.54, 2600:9000:2024:3200:0:22c5:a180:93a1, 2600:9000:2024:5400:0:22c5:a180:93a1, 2600:9000:2024:800:0:22c5:a180:93a1, 2600:9000:2024:8a00:0:22c5:a180:93a1, 2600:9000:2024:ac00:0:22c5:a180:93a1, 2600:9000:2024:b200:0:22c5:a180:93a1, 2600:9000:2024:f000:0:22c5:a180:93a1, 2600:9000:2024:fc00:0:22c5:a180:93a1
Response IP 65.9.112.49
Found Yes
Hash 4d53b47d782da4a639dc5ce5a111687cb195b916a000b9cca679829b9f557305
SimHash f14d0d4465d3

Groups

*

Rule Path
Disallow /alpha
Disallow /beta
Disallow /error
Disallow /groovity
Disallow /healthcheck
Disallow /staging
Disallow /test

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.goodmorningamerica.com/sitemap.xml
sitemap https://www.goodmorningamerica.com/lateststories.xml
sitemap https://www.goodmorningamerica.com/latestvideos.xml

Comments

  • robots.txt for https://www.goodmorningamerica.com/