money.cnn.com
robots.txt
Robots Exclusion Standard data for money.cnn.com
Resource Scan
Scan Details
Site Domain | money.cnn.com |
Base Domain | cnn.com |
Scan Status | Ok |
Last Scan | 2024-05-05T23:49:13+00:00 |
Next Scan | 2024-05-19T23:49:13+00:00 |
Last Scan
Scanned | 2024-05-05T23:49:13+00:00 |
URL | https://money.cnn.com/robots.txt |
Domain IPs | 151.101.131.5, 151.101.195.5, 151.101.3.5, 151.101.67.5, 2a04:4e42:200::773, 2a04:4e42:400::773, 2a04:4e42:600::773, 2a04:4e42::773 |
Response IP | 199.232.47.5 |
Found | Yes |
Hash | e6af7b2774dd7d3d2e8cd192345f937eea319f0a0450205069df48ba1f0e780c |
SimHash | dc1110d96174 |
Groups
*
Rule | Path |
---|---|
Disallow | /SEARCH |
Disallow | /WEB-INF |
Disallow | /cgi-bin |
Disallow | /images |
Disallow | /java |
Disallow | /portfolio |
Disallow | /pr |
Disallow | /profile |
Disallow | /quotes |
Disallow | /tq |
Disallow | /virtual |
Disallow | /.element |
Disallow | /stcejorp |
Disallow | /cnn_adspaces |
Disallow | /fn_adspaces |
Disallow | /ssi |
Disallow | /emmy |
Disallow | /test |
Other Records
Field | Value |
---|---|
sitemap | https://money.cnn.com/registry/sitemaps/index.xml |