fcc.gov
robots.txt
Robots Exclusion Standard data for fcc.gov
Resource Scan
Scan Details
Site Domain | fcc.gov |
Base Domain | fcc.gov |
Scan Status | Ok |
Last Scan | 2024-06-13T05:19:22+00:00 |
Next Scan | 2024-07-13T05:19:22+00:00 |
Last Scan
Scanned | 2024-06-13T05:19:22+00:00 |
URL | https://fcc.gov/robots.txt |
Redirect | https://transition.fcc.gov/robots.txt |
Redirect Domain | transition.fcc.gov |
Redirect Base | fcc.gov |
Domain IPs | 104.110.74.61, 2600:1413:1:593::132d, 2600:1413:1:59d::132d |
Redirect IPs | 23.44.4.170, 23.44.4.186, 2600:1417:3f::b81c:eb5a, 2600:1417:3f::b81c:eb68 |
Response IP | 23.44.4.186 |
Found | Yes |
Hash | a3faaba5e28b1bad0e506e165cd925597a807a3542dc8aa90d9bb95e45425cda |
SimHash | 140175482e92 |
Groups
*
Rule | Path |
---|---|
Disallow | /fcc-bin/ |
Disallow | /images/ |
Disallow | /oet/ITU_tsk_grp/ |
Disallow | /oet/info/TG-18/ |
Disallow | /statelocal/protected/ |