goodmorningamerica.com
robots.txt
Robots Exclusion Standard data for goodmorningamerica.com
Resource Scan
Scan Details
Site Domain | goodmorningamerica.com |
Base Domain | goodmorningamerica.com |
Scan Status | Ok |
Last Scan | 2024-05-09T14:31:58+00:00 |
Next Scan | 2024-05-16T14:31:58+00:00 |
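The two timestamps above are exactly seven days apart, which suggests a weekly rescan schedule. The sketch below checks that arithmetic. The weekly interval is inferred from these two values only, and `predict_next_scan` is a hypothetical helper, not part of the scanner.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2024-05-09T14:31:58+00:00")
next_scan = datetime.fromisoformat("2024-05-16T14:31:58+00:00")

# Difference between the two scans.
interval = next_scan - last_scan


def predict_next_scan(last: datetime, every: timedelta = timedelta(days=7)) -> datetime:
    """Hypothetical helper: assumes a fixed rescan interval (7 days by default)."""
    return last + every


if __name__ == "__main__":
    print(interval)
    print(predict_next_scan(last_scan).isoformat())
```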
Last Scan
Scanned | 2024-05-09T14:31:58+00:00 |
URL | https://goodmorningamerica.com/robots.txt |
Redirect | https://www.goodmorningamerica.com/robots.txt |
Redirect Domain | www.goodmorningamerica.com |
Redirect Base | goodmorningamerica.com |
Domain IPs | 18.164.174.101, 18.164.174.13, 18.164.174.17, 18.164.174.42 |
Redirect IPs | 2600:9000:2024:1c00:0:22c5:a180:93a1, 2600:9000:2024:5400:0:22c5:a180:93a1, 2600:9000:2024:6200:0:22c5:a180:93a1, 2600:9000:2024:6a00:0:22c5:a180:93a1, 2600:9000:2024:8000:0:22c5:a180:93a1, 2600:9000:2024:b600:0:22c5:a180:93a1, 2600:9000:2024:e200:0:22c5:a180:93a1, 2600:9000:2024:f600:0:22c5:a180:93a1, 65.9.112.21, 65.9.112.49, 65.9.112.76, 65.9.112.81 |
Response IP | 3.160.246.20 |
Found | Yes |
Hash | 4d53b47d782da4a639dc5ce5a111687cb195b916a000b9cca679829b9f557305 |
SimHash | f14d0d4465d3 |
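The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so the sketch below is an assumption: it hashes the raw response body with SHA-256 and compares the result to the reported value. The function names are hypothetical.

```python
import hashlib

# Value copied from the Hash row of the report.
REPORTED_HASH = "4d53b47d782da4a639dc5ce5a111687cb195b916a000b9cca679829b9f557305"


def content_fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of the given bytes (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes) -> bool:
    """True if the body hashes to the value shown in the report."""
    return content_fingerprint(body) == REPORTED_HASH
```

A mismatch would not necessarily mean the file changed. The scanner may normalize whitespace or line endings before hashing, or use a different algorithm.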
Groups
*
Rule | Path |
---|---|
Disallow | /alpha |
Disallow | /beta |
Disallow | /error |
Disallow | /groovity |
Disallow | /healthcheck |
Disallow | /staging |
Disallow | /test |
Other Records
Field | Value |
---|---|
sitemap | https://www.goodmorningamerica.com/sitemap.xml |
sitemap | https://www.goodmorningamerica.com/lateststories.xml |
sitemap | https://www.goodmorningamerica.com/latestvideos.xml |
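The group and sitemap records above can be turned back into a robots.txt file and queried with Python's standard `urllib.robotparser`. The text in `ROBOTS_TXT` is a reconstruction from the tables. The original file's ordering, capitalization, and comments are not shown in the report, and `build_parser` and `is_allowed` are hypothetical helper names.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups and Other Records tables (assumed layout).
ROBOTS_TXT = """\
User-agent: *
Disallow: /alpha
Disallow: /beta
Disallow: /error
Disallow: /groovity
Disallow: /healthcheck
Disallow: /staging
Disallow: /test

Sitemap: https://www.goodmorningamerica.com/sitemap.xml
Sitemap: https://www.goodmorningamerica.com/lateststories.xml
Sitemap: https://www.goodmorningamerica.com/latestvideos.xml
"""

BASE_URL = "https://www.goodmorningamerica.com"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, path: str, agent: str = "*") -> bool:
    """Check whether the given agent may fetch a path on the site."""
    return parser.can_fetch(agent, BASE_URL + path)


if __name__ == "__main__":
    parser = build_parser(ROBOTS_TXT)
    for path in ("/", "/news", "/staging/page", "/testing"):
        print(path, is_allowed(parser, path))
    print(parser.site_maps())
```

Disallow rules match by path prefix, so `/test` also blocks paths such as `/testing`. The `site_maps()` method requires Python 3.8 or later.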