50states.com
robots.txt
Robots Exclusion Standard data for 50states.com
Resource Scan
Scan Details
Site Domain | 50states.com |
Base Domain | 50states.com |
Scan Status | Ok |
Last Scan | 2024-06-01T00:39:15+00:00 |
Next Scan | 2024-06-08T00:39:15+00:00 |
Last Scan
Scanned | 2024-06-01T00:39:15+00:00 |
URL | https://50states.com/robots.txt |
Redirect | https://www.50states.com/robots.txt |
Redirect Domain | www.50states.com |
Redirect Base | 50states.com |
Domain IPs | 104.26.10.223, 104.26.11.223, 172.67.73.45, 2606:4700:20::681a:adf, 2606:4700:20::681a:bdf, 2606:4700:20::ac43:492d |
Redirect IPs | 104.26.10.223, 104.26.11.223, 172.67.73.45, 2606:4700:20::681a:adf, 2606:4700:20::681a:bdf, 2606:4700:20::ac43:492d |
Response IP | 104.26.10.223 |
Found | Yes |
Hash | 22f3d7e88acb31a8fc57e555746cb887f8d7097ca05d46333949c9f7ded1d8fe |
SimHash | 8d515cd0cb8a |
Groups
*
Rule | Path |
---|---|
Disallow | /adbuys/ |
Disallow | /affsearch/ |
Disallow | /redir/ |
Disallow | /toolbar/ |
Disallow | /wbsearch.htm |
Disallow | /branding/ |
Disallow | /contact/ |
Disallow | /college-search-results/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.50states.com/sitemap.xml |
sitemap | https://www.50states.com/education/sitemap_index.xml |
Warnings
- 2 invalid lines.
Comments