compromat.site
robots.txt
Robots Exclusion Standard data for compromat.site
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | compromat.site |
Base Domain | compromat.site |
Scan Status | Failed |
Failure Reason | Scan timed out. |
Last Scan | 2025-05-24T01:37:22+00:00 |
Next Scan | 2025-07-23T01:37:22+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2025-03-03T00:32:11+00:00 |
URL | https://compromat.site/robots.txt |
Domain IPs | 104.21.24.28, 172.67.216.126, 2606:4700:3033::6815:181c, 2606:4700:3037::ac43:d87e |
Response IP | 104.21.24.28 |
Found | Yes |
Hash | ab8fb94441d3c74c976e060529c476370abb70cc295ab0af0ceac5b7216d3be7 |
SimHash | a21d1d188bf4 |
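
The Hash value is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. The scanner does not state which algorithm it uses, so the sketch below only assumes SHA-256 over the raw response body. The helper name `content_hash` and the sample bytes are hypothetical, and the output is not expected to reproduce the value recorded above.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the actual file served by the site.
sample_body = b"User-agent: *\nDisallow: /tmp/\n"
digest = content_hash(sample_body)
print(len(digest), digest)
```

Comparing digests from two scans is a cheap way to tell whether the file changed at all between them.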
Groups
*
Rule | Path |
---|---|
Disallow | /administrator/ |
Disallow | /bin/ |
Disallow | /cache/ |
Disallow | /cli/ |
Disallow | /components/ |
Disallow | /includes/ |
Disallow | /installation/ |
Disallow | /language/ |
Disallow | /layouts/ |
Disallow | /libraries/ |
Disallow | /logs/ |
Disallow | /modules/ |
Disallow | /plugins/ |
Disallow | /tmp/ |
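
To see how a crawler would apply the rules in the `*` group, the sketch below rebuilds a robots.txt from the table above and checks a few paths with Python's standard `urllib.robotparser`. The reconstructed file text and the example paths (`/index.php`, `/administrator/index.php`) are illustrative assumptions; the original file may differ in ordering, comments, or whitespace.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups table; not a verbatim copy of the served file.
DISALLOWED = [
    "/administrator/", "/bin/", "/cache/", "/cli/", "/components/",
    "/includes/", "/installation/", "/language/", "/layouts/",
    "/libraries/", "/logs/", "/modules/", "/plugins/", "/tmp/",
]
ROBOTS_TXT = "User-agent: *\n" + "".join(f"Disallow: {p}\n" for p in DISALLOWED)

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path: str, agent: str = "*") -> bool:
    """Return True if `agent` may fetch `path` on compromat.site."""
    return parser.can_fetch(agent, "https://compromat.site" + path)


# Hypothetical paths chosen only to exercise the rules.
for path in ("/index.php", "/administrator/index.php", "/tmp/"):
    print(path, is_allowed(path))
```

Each Disallow entry is a path prefix, so anything under `/administrator/` is blocked while paths outside the listed directories remain fetchable.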
Other Records
Field | Value |
---|---|
sitemap | https://the-gardian.com/sitemap/sitemap-index-gz.xml |