sgs-caspian.com
robots.txt
Robots Exclusion Standard data for sgs-caspian.com
Resource Scan
Scan Details
Site Domain | sgs-caspian.com |
Base Domain | sgs-caspian.com |
Scan Status | Ok |
Last Scan | 2024-10-26T01:35:16+00:00 |
Next Scan | 2024-11-25T01:35:16+00:00 |
Last Scan
Scanned | 2024-10-26T01:35:16+00:00 |
URL | https://sgs-caspian.com/robots.txt |
Redirect | https://www.sgs-caspian.com/robots.txt |
Redirect Domain | www.sgs-caspian.com |
Redirect Base | sgs-caspian.com |
Domain IPs | 52.232.96.213 |
Redirect IPs | 23.215.7.22, 23.215.7.9, 2600:1413:b000:1b::17d7:709, 2600:1413:b000:1b::17d7:716 |
Response IP | 96.17.180.32 |
Found | Yes |
Hash | f8ed9fd40861d8e19bbb9db9c8041dc09886cd4bfd32715b2799ca231809375e |
SimHash | 39044648f511 |
Groups
*
Rule | Path |
---|---|
Disallow | *.aspx |
Disallow | *?date= |
Disallow | *?id= |
Disallow | *?p= |
Disallow | *?query= |
Disallow | *?s= |
Disallow | *?ServiceID= |
Disallow | *?topic= |
Disallow | *?type= |
Disallow | *sitecore/ |
Disallow | /*page-not-found |
Disallow | /*searchresults |
Disallow | /api/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.sgs-caspian.com/sitemap.xml.gz |