goodmorningamerica.com
robots.txt
Robots Exclusion Standard data for goodmorningamerica.com
Resource Scan
Scan Details
Site Domain | goodmorningamerica.com |
Base Domain | goodmorningamerica.com |
Scan Status | Ok |
Last Scan | 2024-11-09T21:35:49+00:00 |
Next Scan | 2024-11-16T21:35:49+00:00 |
Last Scan
Scanned | 2024-11-09T21:35:49+00:00 |
URL | https://goodmorningamerica.com/robots.txt |
Redirect | https://www.goodmorningamerica.com/robots.txt |
Redirect Domain | www.goodmorningamerica.com |
Redirect Base | goodmorningamerica.com |
Domain IPs | 18.66.192.100, 18.66.192.109, 18.66.192.114, 18.66.192.35 |
Redirect IPs | 18.66.196.109, 18.66.196.14, 18.66.196.22, 18.66.196.54, 2600:9000:2024:3200:0:22c5:a180:93a1, 2600:9000:2024:5400:0:22c5:a180:93a1, 2600:9000:2024:800:0:22c5:a180:93a1, 2600:9000:2024:8a00:0:22c5:a180:93a1, 2600:9000:2024:ac00:0:22c5:a180:93a1, 2600:9000:2024:b200:0:22c5:a180:93a1, 2600:9000:2024:f000:0:22c5:a180:93a1, 2600:9000:2024:fc00:0:22c5:a180:93a1 |
Response IP | 65.9.112.49 |
Found | Yes |
Hash | 4d53b47d782da4a639dc5ce5a111687cb195b916a000b9cca679829b9f557305 |
SimHash | f14d0d4465d3 |
Groups
*
Rule | Path |
---|---|
Disallow | /alpha |
Disallow | /beta |
Disallow | /error |
Disallow | /groovity |
Disallow | /healthcheck |
Disallow | /staging |
Disallow | /test |
Other Records
Field | Value |
---|---|
sitemap | https://www.goodmorningamerica.com/sitemap.xml |
sitemap | https://www.goodmorningamerica.com/lateststories.xml |
sitemap | https://www.goodmorningamerica.com/latestvideos.xml |
Comments